KR102389358B1 - A novel nicotine biosynthetic gene and use thereof - Google Patents

A novel nicotine biosynthetic gene and use thereof

Info

Publication number
KR102389358B1
Authority
KR
South Korea
Prior art keywords
gene
seq
nucleotide sequence
pmt1b
sequence
Prior art date
Application number
KR1020190175480A
Other languages
Korean (ko)
Other versions
KR20210023646A (en)
Inventor
서효석
이영기
이정헌
김광철
나웅현
오경환
전은영
Original Assignee
주식회사 케이티앤지
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 케이티앤지
Publication of KR20210023646A
Application granted
Publication of KR102389358B1

Classifications

    • A HUMAN NECESSITIES
    • A24 TOBACCO; CIGARS; CIGARETTES; SIMULATED SMOKING DEVICES; SMOKERS' REQUISITES
    • A24B MANUFACTURE OR PREPARATION OF TOBACCO FOR SMOKING OR CHEWING; TOBACCO; SNUFF
    • A24B15/00 Chemical features or treatment of tobacco; Tobacco substitutes, e.g. in liquid form
    • A24B15/18 Treatment of tobacco products or tobacco substitutes
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79 Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82 Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241 Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243 Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00 Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10 Transferases (2.)
    • C12N9/1003 Transferases (2.) transferring one-carbon groups (2.1)
    • C12N9/1007 Methyltransferases (general) (2.1.1.)
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Y ENZYMES
    • C12Y201/00 Transferases transferring one-carbon groups (2.1)
    • C12Y201/01 Methyltransferases (2.1.1)
    • C12Y201/01053 Putrescine N-methyltransferase (2.1.1.53)

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Medicinal Chemistry (AREA)
  • Nutrition Science (AREA)
  • Cell Biology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Plant Pathology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)

Abstract

The present invention relates to a novel PMT1b (putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1, and to uses thereof. When expression of this gene is suppressed together with that of the PMT1a, PMT2, PMT3 and PMT4 genes, nicotine biosynthesis in the plant can be inhibited, making it possible to produce plant cells that contain little or no nicotine.

Description

A NOVEL NICOTINE BIOSYNTHETIC GENE AND USE THEREOF

The present disclosure relates to a novel nicotine biosynthetic gene and uses thereof.

In Korea, the male smoking rate has declined from 68.9% in 1992 to 44.1% in 2006 and 40.7% in 2016, but it remains well above the 23.9% average of major advanced countries and OECD members. The government is pursuing active tobacco control policies such as raising the tobacco tax and expanding non-smoking areas, and in 2017 the U.S. Food and Drug Administration (FDA) issued an advance notice of proposed rulemaking centered on "lowering the nicotine content of cigarettes to reduce or eliminate their addictiveness." Against this background, and reflecting the management measures of the WHO Framework Convention on Tobacco Control as a means of protecting and promoting health toward the UN Sustainable Development Goals for 2030, the development of low-nicotine tobacco varieties has been needed.

Accordingly, methods that lower nicotine content by using gene editing technology to suppress the function of genes involved in nicotine biosynthesis in tobacco plants have gained importance. However, although the nucleotide sequences of the PMT1a, PMT2, PMT3 and PMT4 genes, which are among the known nicotine biosynthetic PMT genes, have been determined, the nucleotide sequence of the PMT1b gene had never been fully elucidated. As a result, the function of the PMT genes could not be suppressed 100%, and trace amounts of nicotine were still detected.

To ensure that not even a trace amount of nicotine is detected, the nucleotide sequence of the PMT1b gene must be determined in addition to those of the PMT1a, PMT2, PMT3 and PMT4 genes among the nicotine biosynthetic genes; on this basis the present invention was completed.

Korean Patent Application No. 10-2013-7031875

One aspect provides a PMT1b (putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1.

Another aspect provides a plant cell genetically engineered so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared with the parent cell.

Still another aspect provides a plant comprising the plant cell.

Still another aspect provides a method of inhibiting nicotine biosynthesis in a plant, comprising genetically engineering the plant so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared with the parent cell.

Still another aspect provides a method of producing a plant cell in which nicotine biosynthesis is inhibited, comprising genetically engineering the cell so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared with the parent cell.

One aspect relates to a PMT1b (putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1.

Nicotine, the representative alkaloid of tobacco, is biosynthesized in the root in the same manner as the tropane alkaloids: putrescine, biosynthesized from ornithine and arginine, is converted to N-methylputrescine, which is oxidized by diamine oxidase and then passes through 4-methylaminobutanol to become the 1-methyl-Δ1-pyrrolinium cation, and nicotine is finally biosynthesized by way of nicotinic acid (see FIG. 1). Among the enzymes involved, putrescine N-methyltransferase (hereinafter "PMT") is a very important enzyme in nicotine biosynthesis and can be regarded as the rate-limiting enzyme of the pathway. In one embodiment, the gene may be a nicotine biosynthetic gene.

The PMT1b gene, together with PMT1a, PMT2, PMT3 and PMT4, which were previously identified as PMT genes, is a gene that expresses part of the PMT protein. Its specific sequence is disclosed for the first time in this specification; specifically, it may be a PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1.

Another aspect relates to a plant cell genetically engineered so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared with the parent cell.

The "parent cell" is a cell that has not been artificially manipulated to reduce the expression or activity of the PMT1b gene according to one aspect or of the protein expressed by the PMT1b gene, and refers to a cell freshly isolated from the human body and to a cell obtained by culturing it.

In addition, as used herein, the term "genetic engineering" or "genetically engineered" refers to the act of introducing one or more genetic modifications into a cell, or to a cell produced by that act.

The genetic manipulation may be induced by a physical method as a modification within the nucleic acid sequence of the PMT1b gene. The physical method may be, for example, X-ray irradiation or gamma-ray irradiation.

The genetic manipulation may also be induced by a chemical method as a modification within the nucleic acid sequence of the PMT1b gene or as a change in expression of the gene. The chemical method may be, for example, treatment with ethyl methanesulfonate or with dimethyl sulfate.

The genetic manipulation may also be induced by a gene editing system as a modification within the nucleic acid sequence of the PMT1b gene. The gene editing system may be, for example, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, or a CRISPR/Cas9 system.

The genetic manipulation may also be induced by an RNA interference (RNAi) system, which binds to the mRNA transcribed from the PMT1b gene and thereby changes gene expression.

The genetic manipulation may be performed by at least one selected from the group consisting of an RNA interference (RNAi) system, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, a CRISPR/Cas9 system, X-ray irradiation, gamma-ray irradiation, ethyl methanesulfonate treatment and dimethyl sulfate treatment. Specifically, it may be performed by the CRISPR/Cas9 system, and it may be induced as a modification within the nucleic acid sequence of the PMT1b gene.

Accordingly, in a plant cell according to one aspect, the CRISPR/Cas9 system is delivered to the plant cell to inactivate the PMT1b gene involved in nicotine biosynthesis, so that a modified PMT1b protein is expressed and nicotine biosynthesis can be inhibited. The biosynthesis of nornicotine and anatabine can also be inhibited.

The terms "reduced expression or activity" of the PMT1b gene or of the protein expressed by the PMT1b gene, and "inactivation" of the PMT1b gene, as applied to a genetically engineered plant cell, mean that the PMT1b gene or the protein it expresses shows a lower level of expression or activity than that measured in a comparable plant cell of the same species or in its parent cell, or shows no expression or activity at all. That is, the expression or activity of the PMT1b gene or of the protein it expresses in the plant cell may be reduced by about 20% or more, about 30% or more, about 40% or more, about 50% or more, about 55% or more, about 60% or more, about 70% or more, about 75% or more, about 80% or more, about 85% or more, about 90% or more, about 95% or more, or about 100% relative to the expression or activity in the original, non-engineered plant cell.
A genetically engineered plant cell with reduced expression or activity of the PMT1b gene or of the protein it expresses can be identified using any method known in the art. The term "inactivation" may mean that the gene is not expressed at all, or that a protein having no activity is produced even if the gene is expressed. The term "reduction (depression)" may mean that the PMT1b gene is expressed at a lower level than in a non-engineered plant cell, or that the protein expressed by the PMT1b gene has low activity even when it is expressed.
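The percentage thresholds above can be made concrete with a small calculation. The following is a minimal sketch, assuming that expression or activity has already been measured as a single non-negative number for the parent cell and for the engineered cell; the function names and the three labels are hypothetical and are not taken from the patent.

```python
def percent_reduction(parent_level: float, engineered_level: float) -> float:
    """Reduction of expression or activity relative to the parent cell, in percent."""
    if parent_level <= 0:
        raise ValueError("parent_level must be positive")
    return (parent_level - engineered_level) / parent_level * 100.0


def classify(parent_level: float, engineered_level: float) -> str:
    """Label a measurement using the two terms defined in the text (labels are illustrative)."""
    if engineered_level == 0:
        # No detectable expression or activity at all.
        return "inactivated"
    if percent_reduction(parent_level, engineered_level) > 0:
        return "reduced"
    return "not reduced"
```

For example, a parent level of 200 and an engineered level of 50 (arbitrary units) corresponds to a 75% reduction, which falls within the "about 75% or more" tier listed above.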

The reduction in expression or activity of the PMT1b gene or PMT1b protein may result from mutation, substitution or deletion of part or all of the gene encoding PMT1b, or from insertion of one or more bases into the gene, and may be achieved by means of PMT1b gene editing.

In one embodiment, the genes may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1 together with at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5. More specifically, they may be all five genes: PMT1b (SEQ ID NO: 1), PMT1a (SEQ ID NO: 2), PMT2 (SEQ ID NO: 3), PMT3 (SEQ ID NO: 4) and PMT4 (SEQ ID NO: 5).

When the expression or activity of all of the PMT1a, PMT2, PMT3 and PMT4 genes is reduced in addition to that of the PMT1b gene, nicotine biosynthesis can be maximally inhibited, so that nicotine may be absent from the plant cell. The biosynthesis of nornicotine and anatabine can likewise be maximally inhibited.

The CRISPR/Cas9 system may include a single guide RNA (sgRNA) comprising a crRNA (CRISPR RNA) and a tracrRNA (trans-activating crRNA).

The CRISPR/Cas9 system may also include a Cas9 (CRISPR-associated protein 9) protein or a gene encoding the Cas9 protein, and an NLS (nuclear localization signal) protein or a gene encoding the NLS protein.

In general, the "CRISPR (clustered regularly interspaced short palindromic repeats) system", a widely known means of gene editing, refers collectively to transcripts and other elements involved in the expression of, or directing the activity of, CRISPR-associated (hereinafter "Cas") genes, including sequences encoding a Cas gene, a tracr (trans-activating CRISPR) sequence (for example, tracrRNA or an active partial tracrRNA), a tracr-mate sequence (encompassing a "direct repeat" and a tracrRNA-processed partial direct repeat in the context of an endogenous CRISPR system), a guide sequence (also referred to as a "spacer" in the context of an endogenous CRISPR system), a guide RNA, or other sequences and transcripts from a CRISPR locus. In some embodiments, one or more elements of the CRISPR system are derived from a type I, type II or type III CRISPR system.
In some embodiments, one or more elements of the CRISPR system are derived from a particular organism comprising an endogenous CRISPR system, for example Streptococcus pyogenes. In general, a CRISPR system is characterized by elements that promote the formation of a CRISPR complex at the site of a target sequence (also referred to as a protospacer in the context of an endogenous CRISPR system). In the context of CRISPR complex formation, a "target sequence" or "target gene" refers to a sequence to which a guide sequence is designed to be complementary, where hybridization between the target sequence and the guide sequence promotes the formation of the CRISPR complex. Full complementarity is not necessarily required, provided there is sufficient complementarity to cause hybridization and promote formation of the CRISPR complex. The target sequence may comprise any polynucleotide, for example a DNA or RNA polynucleotide. In some embodiments, the target sequence is located in the nucleus or cytoplasm of a cell. In some embodiments, the target sequence may be present within an organelle of a eukaryotic cell, for example a mitochondrion or chloroplast.
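The idea that a guide sequence needs sufficient, but not necessarily complete, complementarity to its target can be illustrated numerically. The sketch below is an assumption-laden toy: both sequences are written 5' to 3' in the DNA alphabet (a real guide RNA would carry U instead of T), the helper names are hypothetical, and the patent does not specify any particular complementarity score or cutoff.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def fraction_complementary(guide: str, target_strand: str) -> float:
    """Fraction of guide positions that base-pair with the target strand.

    Both arguments are 5'->3' DNA strings of equal length; pairing is
    antiparallel, so the guide is compared with the reverse complement
    of the target strand.
    """
    if len(guide) != len(target_strand):
        raise ValueError("guide and target_strand must have the same length")
    pairing_partner = reverse_complement(target_strand)
    matches = sum(g == p for g, p in zip(guide.upper(), pairing_partner))
    return matches / len(guide)
```

A fully matching pair gives 1.0, and each mismatched position lowers the fraction by one over the guide length.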

The Cas protein forms an active endonuclease or nickase when it forms a complex with two RNAs called CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA). Non-limiting examples of the Cas protein include Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, and modified versions thereof. These enzymes are known; for example, the amino acid sequence of the Streptococcus pyogenes Cas9 protein can be obtained from the SwissProt database under accession number Q99ZW2. In some embodiments, an unmodified CRISPR enzyme, for example Cas9, has DNA cleavage activity.

In some embodiments, the CRISPR enzyme is a Cas9 protein, and the Cas9 protein may be at least one selected from the group consisting of a Cas9 protein derived from Streptococcus pyogenes, a Cas9 protein derived from Campylobacter jejuni, a Cas9 protein derived from Streptococcus thermophilus, a Cas9 protein derived from Streptococcus aureus, and a Cas9 protein derived from Neisseria meningitidis; specifically, it may be a Cas9 protein derived from Streptococcus pyogenes. In some embodiments, the Cas9 protein is codon-optimized for expression in eukaryotic cells. When the Cas9 protein derived from Streptococcus pyogenes is used, the expression or activity of the PMT1b gene or of the protein expressed by the PMT1b gene can be maximally reduced.

In some embodiments, the Cas9 protein may include a nuclear localization sequence or signal (NLS) at its 5' end, its 3' end, or both ends so that it localizes to the nucleus of a eukaryotic cell, and one or more NLSs may be present.

As used herein, the term "nuclear localization sequence or signal (NLS)" refers to an amino acid sequence that serves to transport a specific substance (for example, a protein) into the cell nucleus, generally through the nuclear pore. The nuclear localization sequence is not required for CRISPR complex activity in eukaryotes, but including such a sequence is believed to enhance the activity of the system, particularly for targeting nucleic acid molecules in the nucleus.

In addition, the RNA-guided CRISPR (clustered regularly interspaced short palindromic repeats)-associated nuclease Cas9 provides a groundbreaking technology for target-gene knockout, transcriptional activation, and repression using a single guide RNA (sgRNA; that is, a crRNA-tracrRNA fusion transcript), and this technology is known to target numerous gene loci.

The Cas9 (or Cpf1) protein is an essential protein element of the CRISPR/Cas9 system. Information on the Cas9 (or Cpf1) gene and protein can be obtained from GenBank of the National Center for Biotechnology Information (NCBI), but is not limited thereto. More than about 40 different Cas (or Cpf1) protein families encoded by CRISPR-associated genes are known to exist, and eight CRISPR subtypes (Ecoli, Ypest, Nmeni, Dvulg, Tneap, Hmari, Apern and Mtube) can be defined according to the particular combination of cas genes and repeat structure. Each of these CRISPR subtypes can therefore form a repeating unit and give rise to a polyribonucleotide-protein complex.

In a plant cell according to one aspect, a specific target site of the PMT1b gene is cleaved so that the gene is knocked out. As a result of the knockout, putrescine biosynthesized from ornithine and arginine is not converted to N-methylputrescine in the nicotine biosynthetic pathway of the plant cell, and nicotine biosynthesis can therefore be inhibited. To produce the PMT1b knockout plant cell according to one aspect, a genome editing technique using a rare-cutting endonuclease, which cleaves rare sequences present at very low frequency in the genome, may be employed.

The gene knockout may refer to regulation of gene activity, for example inactivation, through deletion or substitution of all or part of the gene (for example, one or more nucleotides) and/or insertion of one or more nucleotides. Gene inactivation means that expression of the gene is suppressed or downregulated, or that the gene is modified to encode a protein that has lost its original function. Gene regulation may also refer to a change in gene function resulting from, for example, structural modification of the protein caused by deletion of an exon when the intron regions flanking one or more exons of the target gene are targeted simultaneously, expression of a dominant-negative form of the protein, or expression of a competitive inhibitor secreted in soluble form.
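One common way to reason about whether a small insertion or deletion is likely to yield a non-functional protein is to check whether it shifts the reading frame. This criterion is general molecular biology and is an assumption added here for illustration; the patent itself does not state it, and the function name is hypothetical.

```python
def classify_indel(net_length_change: int) -> str:
    """Classify an edit inside a coding sequence by its net length change in nucleotides.

    Positive values are insertions and negative values are deletions. A change
    that is not a multiple of three shifts the reading frame downstream of the edit.
    """
    if net_length_change == 0:
        return "no net length change"
    kind = "insertion" if net_length_change > 0 else "deletion"
    frame = "in-frame" if net_length_change % 3 == 0 else "frameshift"
    return f"{frame} {kind}"
```

A single-base insertion is reported as a frameshift insertion, while a three-base deletion is reported as in-frame; an in-frame edit can still inactivate a protein, so this check is only a first screen.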

The genetic manipulation artificially performed to reduce the expression or activity of the PMT1b gene or of the protein expressed by the PMT1b gene may be induced by a Cas9 protein or a Cpf1 protein. The manipulation may be induced using at least one Cas9 protein selected from the group consisting of a Cas9 protein derived from Streptococcus pyogenes, a Cas9 protein derived from Campylobacter jejuni, a Cas9 protein derived from Streptococcus thermophilus, a Cas9 protein derived from Streptococcus aureus, and a Cas9 protein derived from Neisseria meningitidis, and more specifically using a Cas9 protein derived from Streptococcus pyogenes.

When Cas9 is encoded as DNA and delivered to an organism or cell, the DNA may generally (but not necessarily) include a regulatory element (e.g., a promoter) operable in the target cell. The promoter for Cas9 expression may be, for example, a CMV, EF-1α, EFS, MSCV, PGK, or CAG promoter. The promoter for gRNA expression may be, for example, an H1, EF-1α, tRNA, or U6 promoter. The sequence of the gene encoding Cas9 may include a nuclear localization signal (NLS) (e.g., the SV40 NLS). In one example, the promoter may be tissue-specific or cell-specific.

Genetic manipulation artificially performed to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may target a contiguous region of 1 bp to 50 bp located within a PAM (protospacer-adjacent motif) sequence in the nucleic acid sequence constituting the PMT1b gene, or adjacent to the 5' end or 3' end of the PAM. The manipulation may be deletion of all of the gene or of one or more nucleotides of that contiguous region, substitution with nucleotides different from those of the wild-type gene, insertion of 1 to 23 nucleotides each independently selected from A, T, C and G, or a combination of these modifications.

In one embodiment, the guide RNA may bind to the PMT1b gene of at least one allele in the plant cell, and specifically to the PMT1b gene of all alleles. When the guide RNA binds to the PMT1b gene of every allele and knocks it out, nicotine biosynthesis may be inhibited in plant cells of subsequent generations as well as the same generation.

In one embodiment, the target sequence used to knock out the PMT1b gene in the plant cell may be, for example, at least one site within the region consisting of exons 1 to 3 of the PMT1b gene.

Genetic manipulation artificially performed to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may prevent the protein encoded by the PMT1b gene from being expressed in a form that retains its original function. The manipulation may be induced by one or more of the following:

1) deletion of all or part of the PMT1b gene, for example deletion of one or more nucleotides of the PMT1b gene, such as 1 to 30, 1 to 27, 1 to 25, 1 to 23, 1 to 20, 1 to 15, 1 to 10, 1 to 5, 1 to 3, or 1 nucleotide;

2) substitution of one or more nucleotides of the PMT1b gene, such as 1 to 30, 1 to 27, 1 to 25, 1 to 23, 1 to 20, 1 to 15, 1 to 10, 1 to 5, 1 to 3, or 1 nucleotide, with nucleotides different from the original (wild-type) ones;

3) insertion of one or more nucleotides, such as 1 to 30, 1 to 27, 1 to 25, 1 to 23, 1 to 20, 1 to 15, 1 to 10, 1 to 5, 1 to 3, or 1 nucleotide (each independently selected from A, T, C and G), at any position in the target gene; and

4) a combination of two or more selected from 1) to 3) above.
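The three edit types listed above, and their combination, can be illustrated on a plain sequence string. The following is a minimal sketch only: the function names and the 15-nt example sequence are hypothetical and are not taken from the PMT1b sequence of SEQ ID NO: 1.

```python
# Illustrative sketch: deletion, substitution and insertion on a DNA string.
# All names and the example sequence are hypothetical.

def delete(seq: str, start: int, length: int) -> str:
    """Remove `length` nucleotides beginning at 0-based index `start`."""
    return seq[:start] + seq[start + length:]

def substitute(seq: str, pos: int, new_base: str) -> str:
    """Replace the nucleotide at `pos` with a different base."""
    if seq[pos] == new_base:
        raise ValueError("substitution must differ from the wild-type base")
    return seq[:pos] + new_base + seq[pos + 1:]

def insert(seq: str, pos: int, bases: str) -> str:
    """Insert `bases` (each one of A, T, C, G) before index `pos`."""
    if set(bases) - set("ATCG"):
        raise ValueError("inserted bases must be A, T, C or G")
    return seq[:pos] + bases + seq[pos:]

wild_type = "ATGGAAGTCATATCT"            # hypothetical 15-nt stretch
deleted = delete(wild_type, 3, 2)         # 2-nt deletion
substituted = substitute(wild_type, 4, "C")
inserted = insert(wild_type, 6, "T")      # 1-nt insertion
combined = insert(delete(wild_type, 3, 2), 3, "C")  # item 4): combination
```

Each helper returns a new string, so edits can be chained to model the combined case in item 4).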

The modified part of the PMT1b gene (the 'target site') may be a contiguous nucleotide sequence region of the gene of 1 bp or more, 3 bp or more, 5 bp or more, 7 bp or more, 10 bp or more, 12 bp or more, 15 bp or more, 17 bp or more, or 20 bp or more, for example 1 bp to 30 bp, 3 bp to 30 bp, 5 bp to 30 bp, 7 bp to 30 bp, 10 bp to 30 bp, 12 bp to 30 bp, 15 bp to 30 bp, 17 bp to 30 bp, 20 bp to 30 bp, 1 bp to 27 bp, 3 bp to 27 bp, 5 bp to 27 bp, 7 bp to 27 bp, 10 bp to 27 bp, 12 bp to 27 bp, 15 bp to 27 bp, 17 bp to 27 bp, 20 bp to 27 bp, 1 bp to 25 bp, 3 bp to 25 bp, 5 bp to 25 bp, 7 bp to 25 bp, 10 bp to 25 bp, 12 bp to 25 bp, 15 bp to 25 bp, 17 bp to 25 bp, 20 bp to 25 bp, 1 bp to 23 bp, 3 bp to 23 bp, 5 bp to 23 bp, 7 bp to 23 bp, 10 bp to 23 bp, 12 bp to 23 bp, 15 bp to 23 bp, 17 bp to 23 bp, 20 bp to 23 bp, 1 bp to 20 bp, 3 bp to 20 bp, 5 bp to 20 bp, 7 bp to 20 bp, 10 bp to 20 bp, 12 bp to 20 bp, 15 bp to 20 bp, 17 bp to 20 bp, 21 bp to 25 bp, 18 bp to 22 bp, or 21 bp to 23 bp.

In one example, the gene knockout may reduce expression of the targeted PMT1b gene by catalyzing single-strand or double-strand cleavage at a specific site within the gene using a genome editing system that includes a rare-cutting endonuclease. Nucleic acid strand breaks catalyzed by the rare-cutting endonuclease can be repaired through mechanisms such as homologous recombination or non-homologous end joining (NHEJ). When repair proceeds by NHEJ, the DNA sequence at the cleavage site is altered, and the gene can thereby be inactivated. Repair by NHEJ produces substitutions, insertions or deletions of short gene fragments and can be used to induce knockout of the gene concerned. The modification may be a substitution, deletion, and/or insertion of one or more nucleotides, for example 1 to 30 bp, 1 to 27 bp, 1 to 25 bp, 1 to 23 bp, 1 to 20 bp, 1 to 15 bp, 1 to 10 bp, 1 to 5 bp, 1 to 3 bp, or 1 bp.
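One common reason a short NHEJ insertion or deletion inactivates a gene is that a net length change that is not a multiple of three shifts the reading frame downstream of the cut. The sketch below classifies an edit by length alone, assuming both strings are coding sequence in the same initial frame; the function name and example sequences are hypothetical, and the check says nothing about substitutions or about in-frame changes that still disrupt the protein.

```python
# Illustrative sketch: classify an indel by its net effect on the reading frame.
# Assumes both inputs are coding sequences that start in the same frame.

def classify_indel(wild_type: str, edited: str) -> str:
    """Return 'frameshift', 'in-frame' or 'no length change'."""
    net = len(edited) - len(wild_type)
    if net == 0:
        return "no length change"
    # Python's % returns a non-negative remainder for negative values too,
    # so deletions and insertions are handled by the same test.
    return "in-frame" if net % 3 == 0 else "frameshift"

single_base_insertion = classify_indel("ATGGAAGTC", "ATGGAAAGTC")
three_base_deletion = classify_indel("ATGGAAGTC", "ATGGTC")
```

A single-nucleotide insertion, such as the one reported for a mutant in the figure descriptions of this document, falls in the frameshift class under this check.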

The rare-cutting endonuclease may be one or more selected from the group consisting of a meganuclease, a zinc finger nuclease, CRISPR/Cas9 (Cas9 protein), CRISPR-Cpf1 (Cpf1 protein) and a TALE nuclease. In one specific embodiment, the rare-cutting endonuclease may be a Cas9 protein or a Cpf1 protein.

The terms "chimeric RNA", "chimeric guide RNA", "guide RNA", "single guide RNA (sgRNA)" and "synthetic guide RNA" are used interchangeably and refer to a polynucleotide sequence comprising a guide sequence, a tracr sequence and/or a tracr mate sequence. The term "guide sequence" refers to a sequence of about 20 bp within the guide RNA that specifies the target site, and may be used interchangeably with the terms "guide" or "spacer". The term "tracr mate sequence" may be used interchangeably with the term "direct repeat(s)". The guide RNA may consist of two RNAs, namely a CRISPR RNA (crRNA) and a trans-activating crRNA (tracrRNA), or may be a single-chain RNA (sgRNA) that comprises portions of the crRNA and tracrRNA and hybridizes with the target DNA.

In general, a guide sequence is any polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of the CRISPR complex to the target sequence. Any nucleotide sequence that can be used in genetic manipulation to reduce the expression or activity of the PMT1b gene, or of the protein expressed by the PMT1b gene, may be used as a guide RNA without limitation; for example, the sequence may be one capable of hybridizing with the PMT1b gene. Part of the guide RNA sequence may also be modified in order to alter or enhance the function of the guide RNA.

In some embodiments, the degree of complementarity between a guide sequence and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99% or more. Optimal alignment may be determined using any algorithm suitable for aligning sequences, non-limiting examples of which include the Smith-Waterman algorithm, the Needleman-Wunsch algorithm, algorithms based on the Burrows-Wheeler Transform (e.g., the Burrows Wheeler Aligner), ClustalW, Clustal X, BLAT, Novoalign (Novocraft Technologies), ELAND (Illumina, San Diego, California, USA), SOAP (available at soap.genomics.org.cn) and Maq (available at maq.sourceforge.net).

In some embodiments, the guide sequence is about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75 or more nucleotides in length. In some embodiments, the guide sequence is about 75, 50, 45, 40, 35, 30, 25, 20, 15, 12 or fewer nucleotides in length.

The ability of a guide sequence to direct sequence-specific binding of a CRISPR complex to a target sequence may be assessed by any suitable assay. For example, the components of a CRISPR system sufficient to form a CRISPR complex, including the guide sequence to be tested, may be provided to a host cell having the corresponding target sequence, for instance by transfection with vectors encoding the components of the CRISPR sequence, followed by an assessment of preferential cleavage within the target sequence, for instance by the Surveyor assay as described herein. Similarly, cleavage of a target polynucleotide sequence may be evaluated in a test tube by providing the target sequence and the components of a CRISPR complex, including the guide sequence to be tested and a control guide sequence different from the test guide sequence, and comparing the binding or the rate of cleavage at the target sequence between the test and control guide sequence reactions. Other assays are possible and will be readily available to those skilled in the art.
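The complementarity percentages mentioned above can be illustrated with a deliberately simplified calculation. The sketch below is not one of the named alignment algorithms: it assumes the guide and target strand have equal length and are compared without gaps, and it counts only Watson-Crick pairs. The function name and the 4-nt sequences are hypothetical.

```python
# Illustrative sketch: ungapped percent complementarity between a guide RNA
# and the DNA strand it hybridizes to. Not a substitute for Smith-Waterman
# or Needleman-Wunsch alignment; equal lengths are assumed.

# RNA base -> DNA base it pairs with (Watson-Crick only).
PAIR = {"A": "T", "U": "A", "G": "C", "C": "G"}

def percent_complementarity(guide_rna: str, target_strand_dna: str) -> float:
    """Both sequences are written 5'->3'; strands pair antiparallel."""
    if len(guide_rna) != len(target_strand_dna):
        raise ValueError("this sketch requires equal-length sequences")
    # Reverse the target so position i of the guide faces its pairing partner.
    facing = target_strand_dna.upper()[::-1]
    matches = sum(PAIR[g] == t for g, t in zip(guide_rna.upper(), facing))
    return 100.0 * matches / len(guide_rna)

perfect = percent_complementarity("GACU", "AGTC")
one_mismatch = percent_complementarity("GACU", "AGTT")
```

For real guide design, a gapped aligner from the list above would replace the position-by-position comparison.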

A guide sequence may be selected to target any target sequence. In some embodiments, the target sequence is a sequence within the genome of a cell. Exemplary target sequences include those that are unique in the target genome.

For example, for Cas9 derived from Streptococcus pyogenes, a unique target sequence in the genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXGG, where NNNNNNNNNNNNXGG (N is A, G, T or C; X can be any base) occurs only once in the genome. A unique target sequence in the genome may also include a Streptococcus pyogenes Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXGG, where NNNNNNNNNNNXGG (N is A, G, T or C; X can be any base) occurs only once in the genome.

For Streptococcus thermophilus CRISPR1 Cas9, a unique target sequence in the genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXXAGAAW, where NNNNNNNNNNNNXXAGAAW (N is A, G, T or C; X can be any base; W is A or T) occurs only once in the genome. A unique target sequence in the genome may also include a Streptococcus thermophilus CRISPR1 Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXXAGAAW, where NNNNNNNNNNNXXAGAAW (N is A, G, T or C; X can be any base; W is A or T) occurs only once in the genome.

For Streptococcus pyogenes Cas9, a unique target sequence in the genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXGGXG, where NNNNNNNNNNNNXGGXG (N is A, G, T or C; X can be any base) occurs only once in the genome. A unique target sequence in the genome may also include a Streptococcus pyogenes Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXGGXG, where NNNNNNNNNNNXGGXG (N is A, G, T or C; X can be any base) occurs only once in the genome.

In each of these sequences, "M" may be A, G, T or C.
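The first Streptococcus pyogenes pattern above (eight M positions, twelve N positions, then XGG) amounts to a 20-nt stretch followed by a three-base motif ending in GG, with the last twelve bases plus the motif required to be unique. The sketch below searches one strand of a sequence for such sites and checks that uniqueness condition. The function names and the toy sequence are hypothetical, and a practical search would also scan the reverse complement and a whole genome.

```python
# Illustrative sketch: find 20-nt + XGG sites on one strand and test whether
# the 12-nt region next to the motif (the N positions) plus XGG is unique.
import re

def find_spcas9_sites(seq: str):
    """Return (start, 20-nt target, 3-nt motif) for each site on this strand."""
    seq = seq.upper()
    # A lookahead is used so that overlapping sites are all reported.
    pattern = re.compile(r"(?=([ACGT]{20})([ACGT]GG))")
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]

def n_region_is_unique(seq: str, target: str) -> bool:
    """True if the last 12 nt of `target` followed by XGG occurs exactly once."""
    n_region = target[-12:]
    pattern = re.compile(r"(?=" + re.escape(n_region) + r"[ACGT]GG)")
    return sum(1 for _ in pattern.finditer(seq.upper())) == 1

# Hypothetical toy sequence: 4 nt of padding, a 20-nt target, AGG, padding.
toy = "TTTT" + "ACGTACGTACGTACGTACGT" + "AGG" + "TTTT"
sites = find_spcas9_sites(toy)
unique = n_region_is_unique(toy, sites[0][1])
```

The other patterns in the paragraph differ only in the split between M and N positions and in the motif, so they could be handled by changing the two regular expressions.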

In one embodiment, the gene encoding the guide RNA that binds to PMT1b may have a polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 joined to its 5' end and five consecutive thymines joined to its 3' end. The guide RNA that binds to PMT1b may have the polynucleotide transcribed from the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 joined to its 5' end and five consecutive uracils joined to its 3' end.

When the gene encoding the guide RNA that binds to PMT1b has the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 (P_U6) joined to its 5' end and five consecutive thymines joined to its 3' end, or when the guide RNA that binds to PMT1b has the polynucleotide transcribed from the polynucleotide of SEQ ID NO: 19 joined to its 5' end and five consecutive uracils joined to its 3' end, an otherwise unstable guide RNA can be stabilized. The polynucleotide transcribed from the polynucleotide consisting of the nucleotide sequence of SEQ ID NO: 19 corresponds to the U6 promoter (P_U6).
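The arrangement described above, with a promoter sequence at the 5' end of the guide-encoding DNA and five thymines at its 3' end, can be sketched as simple string assembly. SEQ ID NO: 19 is not reproduced in this section, so the promoter below is a placeholder of N characters, and the 20-nt guide is likewise hypothetical.

```python
# Illustrative sketch: assemble a guide-encoding DNA with a 5' promoter and a
# 3' run of five thymines, then show the corresponding RNA 3' end.
# The promoter is a placeholder; SEQ ID NO: 19 is not given in this section.

T_TRACT = "TTTTT"

def build_guide_gene(promoter_dna: str, guide_dna: str) -> str:
    """Concatenate promoter, guide-encoding DNA and the five-thymine tract."""
    return promoter_dna.upper() + guide_dna.upper() + T_TRACT

def transcribe(dna: str) -> str:
    """Write a DNA coding-strand sequence as RNA (T becomes U)."""
    return dna.upper().replace("T", "U")

PLACEHOLDER_PROMOTER = "NNNNNNNNNN"       # stands in for SEQ ID NO: 19
EXAMPLE_GUIDE = "GATTACAGATTACAGATTAC"     # hypothetical 20-nt guide

gene = build_guide_gene(PLACEHOLDER_PROMOTER, EXAMPLE_GUIDE)
guide_with_u_tract = transcribe(EXAMPLE_GUIDE + T_TRACT)
```

The five thymines in the DNA appear as the five consecutive uracils at the 3' end of the RNA.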

In one embodiment, the manipulation may be carried out by transformation with a vector comprising the gene encoding the guide RNA that binds to PMT1b, the gene encoding the Cas9 protein, and the gene encoding the NLS.

The term "vector" refers to a means for expressing a gene of interest in a host cell. Examples include plasmid vectors, cosmid vectors, and viral vectors such as bacteriophage vectors, adenoviral vectors, retroviral vectors and adeno-associated viral vectors. A vector usable as the recombinant vector may be constructed by manipulating a plasmid often used in the art (e.g., V1k_GE, pSC101, pGV1106, pACYC177, ColE1, pKT230, pME290, pBR322, pUC8/9, pUC6, pBD9, pHC79, pIJ61, pLAFR1, pHV14, the pGEX series, the pET series, pUC19, etc.), a phage, or a virus (e.g., SV40).

In the vector, the gene encoding the guide RNA that binds to PMT1b, the gene encoding the Cas9 protein, and the gene encoding the NLS may be operably linked to a promoter. The term "operatively linked" means a functional linkage between a nucleotide expression control sequence (e.g., a promoter sequence) and another nucleotide sequence. By being operatively linked, the control sequence can regulate the transcription and/or translation of the other nucleotide sequence.

The vector can typically be constructed as a cloning vector or as an expression vector. The expression vector may be any conventional vector used in the art to express a foreign protein in plants, animals or microorganisms. The vector may be constructed by various methods known in the art.

The vector may be constructed for a prokaryotic or a eukaryotic host cell. For example, when the vector is an expression vector and the host is a prokaryotic cell, it generally includes a strong promoter capable of driving transcription (e.g., a CMV promoter, trp promoter, lac promoter, tac promoter, T7 promoter, etc.), a ribosome binding site for initiation of translation, and transcription/translation termination sequences. When the host is a eukaryotic cell, origins of replication included in the vector that operate in eukaryotic cells include, but are not limited to, the f1 origin, SV40 origin, pMB1 origin, adeno origin, AAV origin and BBV origin. In addition, a promoter derived from the genome of a mammalian cell (e.g., the metallothionein promoter) or from a mammalian virus (e.g., the adenovirus late promoter, vaccinia virus 7.5K promoter, SV40 promoter, cytomegalovirus promoter, and the tk promoter of HSV) may be used, and the vector generally has a polyadenylation sequence as the transcription termination sequence.

The transformation is not particularly limited as long as it uses a transformation method well known in the art that is capable of transforming a rice gene. Specific examples include a method selected from the group consisting of Agrobacterium-mediated transformation, polyethylene glycol (PEG)-mediated protoplast transformation, the gene gun method, electrode-based transformation, vacuum infiltration transformation and silicon carbide fiber-mediated transformation. Considering the characteristics of rice and the transformation efficiency, Agrobacterium-mediated transformation may be used.

In one embodiment, the plant may be cultivated tobacco (Nicotiana tabacum), specifically a flue-cured (yellow), Burley, native, dark (black) or Oriental type, and more specifically a Burley or flue-cured type.

Another aspect relates to a plant comprising the plant cell.

The plant may be one in which nicotine biosynthesis is suppressed in subsequent generations as well as in the same generation, so that plants with reduced nicotine production can be obtained continuously.

The plant cell may be a plant cell within the scope described above.

Another aspect relates to a method for inhibiting nicotine biosynthesis in a plant, comprising genetically engineering the plant so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to the parent cell.

In one embodiment, the genes manipulated may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1 together with at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5. More specifically, they may be all five: the PMT1b gene (SEQ ID NO: 1), the PMT1a gene (SEQ ID NO: 2), the PMT2 gene (SEQ ID NO: 3), the PMT3 gene (SEQ ID NO: 4) and the PMT4 gene (SEQ ID NO: 5).

When the expression or activity of all of the PMT1a, PMT2, PMT3 and PMT4 genes is reduced in addition to that of the PMT1b gene, nicotine biosynthesis can be inhibited to the greatest extent, so that nicotine may be absent from the plant cell. The biosynthesis of nornicotine and anatabine can likewise be inhibited to the greatest extent.

The terms "parent cell", "nicotine", "PMT1b", "genetically engineered" and the like may be as described above.

Another aspect relates to a method for producing a plant cell in which nicotine biosynthesis is inhibited, comprising genetically engineering the cell so that the expression or activity of the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of the protein expressed by the PMT1b gene, is reduced compared to the parent cell.

In one embodiment, the genes manipulated may be the PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1 together with at least one gene selected from the group consisting of the PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, the PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, the PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and the PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5. More specifically, they may be all five: the PMT1b gene (SEQ ID NO: 1), the PMT1a gene (SEQ ID NO: 2), the PMT2 gene (SEQ ID NO: 3), the PMT3 gene (SEQ ID NO: 4) and the PMT4 gene (SEQ ID NO: 5).

When the expression or activity of all of the PMT1a, PMT2, PMT3 and PMT4 genes is reduced in addition to that of the PMT1b gene, nicotine biosynthesis can be inhibited to the greatest extent, so that plant cells containing no nicotine can be produced. Plant cells in which the biosynthesis of nornicotine and anatabine is also inhibited to the greatest extent can likewise be produced.

The terms "parent cell", "nicotine", "PMT1b", "genetically engineered" and the like may be as described above.

Overlapping content is omitted in view of the complexity of this specification, and terms not otherwise defined herein have the meanings commonly used in the technical field to which the present invention pertains.

By identifying the specific nucleotide sequence of the PMT1b gene, a previously unknown member of the PMT genes involved in nicotine biosynthesis, the present invention makes it possible to completely control nicotine in plants, where trace amounts previously remained even after gene editing.

FIG. 1 shows the nicotine biosynthesis pathway in tobacco plants.
FIG. 2 compares the nucleotide sequences of the (A) sg1, (B) sg2 and (C) sg3 sites selected as cleavage positions within the CDS regions of the NtPMT1a, NtPMT2, NtPMT3 and NtPMT4 genes.
FIGS. 3 to 5 show the carrier and expression blocks used to construct the gene carrier: FIG. 3(A) shows GE_block, FIG. 3(B) shows the vector V1k_GE, FIG. 4(A) shows Cas9_block, FIG. 4(B) shows the sgRNA_PMT block, and FIG. 5 shows the final gene scissors expression block.
FIG. 6 shows photographs of the plant tissue culture at each stage: (A) leaf tissue cut and co-cultured with Agrobacterium for transformation, (B) induction of callus and shoot differentiation, (C) induction of root differentiation, and (D) a plantlet after differentiation is complete.
FIG. 7 is an electrophoresis photograph of PMT genes amplified from the gDNA of four tissue-cultured individuals to check for the presence and pattern of mutations. Lane M1: 100 bp DNA marker; Lanes 1 to 4: NtPMT1a region; Lanes 5 to 8: NtPMT2 region; Lanes 9 to 12: NtPMT3 region; Lanes 13 to 16: NtPMT4 region; Lane M2: 1 kb DNA marker.
FIG. 8 shows the PMT1a gene sequencing profiles of the wild type (A) and mutant No. 21 (B); the position where a single base was inserted is marked with a red rectangle.
FIGS. 9 to 12 compare the PMT amino acid sequences of wild-type tobacco and the finally selected F1 plant (108_PMTm_F1), from which the gene block containing the CRISPR/Cas9 system was removed after mutations were confirmed in the tissue culture: FIG. 9 compares PMT1a, FIG. 10 PMT2, FIG. 11 PMT3, and FIG. 12 PMT4.

Hereinafter, the present invention is described in more detail through examples. These examples are provided for illustration only, and the scope of the present invention is not limited to them.

Examples

1. Production of low-nicotine tobacco using the CRISPR/Cas9 system targeting the PMT1a, PMT2, PMT3, and PMT4 genes, excluding the PMT1b gene

(1) Construction of the gene delivery vector

1) PMT (putrescine N-methyltransferase) gene

PMT is known from the nicotine biosynthesis pathway of cultivated tobacco (Nicotiana tabacum) to be one of the genes involved in nicotine biosynthesis. It was therefore concluded that knocking out the PMT genes would yield cultivated tobacco with reduced nicotine content (FIG. 1).

2) Confirmation of the nucleotide sequences of the nicotine biosynthesis genes (NtPMT)

PCR (polymerase chain reaction) was performed on the Burley cultivar KB108 using the primers listed in Table 1 below, and the products were submitted to a sequencing service.

Table 1

| Primer | Primer sequence | Amplicon size (bp) | Annealing temp. (℃) | Elongation time (sec) | Cycles | SEQ ID NO |
|---|---|---|---|---|---|---|
| F_PMT1a | GCATAGATGTTTATTGGGAGTGTACAGCAA | 724 | 74~68 (step down) | 30 | 40 | 6 |
| R_PMT1a | AGTTGCTGACTGTACGTGCACA | 724 | 74~68 (step down) | 30 | 40 | 7 |
| F_PMT2 | CATATGTGTCTATTGGGAGTGTACATCAA | 685 | 68 (2 step) | 30 | 40 | 8 |
| R_PMT2 | AAGTTGCTGACTGTACGTACACAT | 685 | 68 (2 step) | 30 | 40 | 9 |
| F_PMT3 | CATAGATATGTTTATTGGGAGTGTACATCAA | 783 | 62 (3 step) | 30 | 40 | 10 |
| R_PMT3 | CCACTTTAAATATAATCATACATGAGCTTGAC | 783 | 62 (3 step) | 30 | 40 | 11 |
| F_PMT4 | GCGAGCTTTCATAAAGTACAAATCA | 812 | 62 (3 step) | 30 | 40 | 12 |
| R_PMT4 | AGTATTTATTCATATTAAAGCATGTGTATTAGA | 812 | 62 (3 step) | 30 | 40 | 13 |

The amplicon size and amplification conditions apply to each forward/reverse primer pair.
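As an illustrative aid that is not part of the original disclosure, the following Python sketch shows one way to check that a primer pair from Table 1 matches its template: the forward primer is searched for directly, and the reverse primer is searched for as its reverse complement. The two template excerpts are short stretches copied from the SEQ ID NO: 2 (PMT1a) sequence listing; the helper names `reverse_complement` and `primer_site` are hypothetical and introduced here only for illustration.

```python
# Illustrative sketch only: exact-match lookup of primer binding sites.
# Helper names are hypothetical; excerpts are copied from the SEQ ID NO: 2 listing.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primer_site(template: str, primer: str, reverse: bool = False) -> int:
    """Return the 0-based start of an exact primer match, or -1 if absent.

    A reverse primer anneals to the opposite strand, so it is looked up
    as its reverse complement on the listed strand.
    """
    query = reverse_complement(primer) if reverse else primer
    return template.upper().find(query.upper())


# Short excerpts of the PMT1a template (the full sequence is much longer).
F_EXCERPT = "atttaaatgcatagatgtttattgggagtgtacagcaagctttcggaaaa"
R_EXCERPT = "agtttattgcatgtgcacgtacagtcagcaacttatgggaaggttctgac"

F_PMT1A = "GCATAGATGTTTATTGGGAGTGTACAGCAA"  # SEQ ID NO: 6
R_PMT1A = "AGTTGCTGACTGTACGTGCACA"          # SEQ ID NO: 7

print(primer_site(F_EXCERPT, F_PMT1A))                # start of forward site
print(primer_site(R_EXCERPT, R_PMT1A, reverse=True))  # start of reverse site
```

A return value of -1 would mean no exact match in the excerpt. Mismatches and melting temperature, which matter in real primer design, are outside the scope of this sketch.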

The gDNA region spanning exon 1 to exon 3 of cultivar KB108 was amplified and sequenced. The sequences were 100% identical to the PMT gene sequences of cultivated tobacco (Nicotiana tabacum) published in the NCBI (National Center for Biotechnology Information) database (Table 2).

Table 2

| Gene | GenBank Accession No. | SEQ ID NO |
|---|---|---|
| NtPMT1a | AF126810 (gDNA, CDS) | 2 |
| NtPMT2 | AF126809 (gDNA, CDS) | 3 |
| NtPMT3 | AF126811 (gDNA, CDS) | 4 |
| NtPMT4 | AF126812 (gDNA, CDS) | 5 |

3) Design of the gene-scissors block and vector recombination

Cleavage sites were selected within exon 1 to exon 3 so that translation of the polypeptide from the PMT gene would be interrupted at an early stage. Sites common to the PMT1a, PMT2, PMT3, and PMT4 genes were chosen so that all four genes could be cleaved (Table 3 and FIG. 2).

Table 3

| sgRNA | Nucleotide sequence | Binding site in PMT1a | Binding site in PMT2 | Binding site in PMT3 | Binding site in PMT4 | SEQ ID NO |
|---|---|---|---|---|---|---|
| sg1 | AAGTCATATCTACCAACACAAA TGG | Exon 1 | Exon 1 | Exon 1 | Exon 1 | 14 |
| sg2 | TTTTCAGAGTTTAGCGCATTA TGG | Exon 2 | Exon 1 | Exon 1 | Exon 1 | 15 |
| sg3 | CTCAAAGAGCATGACATCT TGG | Exon 3 | Exon 2 | Exon 2 | Exon 2 | 16 |
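The target strings in Table 3 each end in a separately printed TGG. Assuming, as is common for CRISPR/Cas9 but not stated explicitly in the source, that this trailing triplet is an NGG-type PAM, the following Python sketch splits each site into its two parts and checks the PAM pattern. The names `split_site` and `has_ngg_pam` are hypothetical helpers.

```python
import re

# Target strings as printed in Table 3 (spaces removed).
# Assumption: the last three bases are the PAM and the rest is the protospacer.
SGRNA_SITES = {
    "sg1": "AAGTCATATCTACCAACACAAATGG",
    "sg2": "TTTTCAGAGTTTAGCGCATTATGG",
    "sg3": "CTCAAAGAGCATGACATCTTGG",
}


def split_site(site: str) -> tuple[str, str]:
    """Split a target string into (protospacer, 3-nt PAM)."""
    return site[:-3], site[-3:]


def has_ngg_pam(site: str) -> bool:
    """Return True if the last three bases match the pattern NGG."""
    return re.fullmatch(r"[ACGT]GG", site[-3:]) is not None


for name, site in SGRNA_SITES.items():
    spacer, pam = split_site(site)
    print(name, spacer, pam, len(spacer), has_ngg_pam(site))
```

The protospacer lengths differ between the three sites, so the sketch reports them rather than enforcing a fixed length.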

The gene-scissors block was then cloned into the plant expression vector (V1k_GE, SEQ ID NO: 17) to complete the final vector (V1k_GE_PMT, SEQ ID NO: 18) (FIGS. 3 to 5).

pBI121 is a binary vector that replicates in both E. coli and Agrobacterium and is widely used for plant transformation. pBI121 was digested with HindIII and EcoRI so that the GE_block required for the CRISPR/Cas9 system could be cloned into it (FIG. 3A). GE_block consists, in order, of the CaMV 35S promoter with dual enhancer, the TEV leader sequence, a multiple cloning site (MCS) for cloning the Cas9 block, the CaMV 35S terminator, a linker sequence, and an MCS for cloning the sgRNA block, with HindIII and EcoRI recognition sequences added at the two ends. The individual blocks of GE_block were prepared by DNA synthesis and assembled by sequential cloning. V1k_GE was constructed by ligating GE_block into HindIII/EcoRI-digested pBI121 (FIG. 3B).

Cas9_block consists of an N-terminal nuclear localization sequence (NLS), the Cas9 coding sequence (CDS), and a C-terminal NLS, with BamHI and SacI recognition sequences added at the two ends (FIG. 4A).

The block from which the sgRNAs are expressed consists of the U6 promoter (P_U6, SEQ ID NO: 19), the sgRNA, and a poly-T stretch. Three sgRNA blocks that bind specifically to the PMT genes were joined into one continuous DNA by overlap extension PCR to complete sgRNA_PMT. SalI and SpeI recognition sequences are present at the two ends of sgRNA_PMT (FIG. 4B).

V1k_GE and Cas9_block, each digested with BamHI and SacI, were joined by ligation. The product was then digested with SalI and SpeI, and sgRNA_PMT was inserted to produce V1k_GE_PMT (FIG. 5).

4) Introduction of the recombinant vector into the gene-transfer microorganism (Agrobacterium)

The plant vector was transformed into Agrobacterium strain LBA4404 by the freeze-thaw method.

Specifically, Agrobacterium was inoculated into YEP liquid medium (yeast extract 10 g, bacto peptone 10 g, NaCl 5 g) and cultured with shaking at 28℃ and 250 rpm for 16 hours. The culture was centrifuged at 3,000 g and 4℃ for 20 minutes to collect the cells, which were resuspended in 20 mM CaCl2 to prepare competent cells. After 5 μL of plasmid DNA (the plant vector) was added to 100 μL of competent cells, the mixture was held in liquid nitrogen for 5 minutes and then at 37℃ for 5 minutes. One mL of YEP liquid medium was added, and the cells were cultured with shaking at 28℃ and 250 rpm for 2 hours. A 100 μL aliquot of the culture was spread on YEP solid medium containing 100 mg/L kanamycin and incubated at 28℃ for 3 days. Single colonies were each subcultured, and transformation with the plasmid DNA was confirmed by PCR.

(2) Plant tissue culture

1) Plant transformation

Agrobacterium was cultured in YEP liquid medium (containing 70 mg/L kanamycin and 70 mg/L streptomycin) at 28℃ for 24 hours.

Leaves of plants one month after germination were sterilized with 70% ethanol and bleach and cut into 3 mm × 3 mm sections. The sections were placed in a Petri dish containing 5 mL of MS liquid medium, and 1 mL of the Agrobacterium culture was spread evenly over them. The leaf sections were then incubated at 25℃ in the dark for 48 hours.

2) Plant tissue culture

Washing and placement on selection medium: the leaf sections were washed four times in sterile distilled water (containing 200 µg/mL cefotaxime), placed on shooting medium (MS medium containing 2 mg/L BA, 0.1 mg/L NAA, 200 mg/L cefotaxime, and 100 mg/L kanamycin), cultured at 25℃ under a 16 h/8 h photoperiod, and subcultured onto fresh medium every 2 weeks.

Placement on rooting medium: shoots that differentiated from the leaf sections were excised, placed on rooting medium (MS medium containing 200 mg/L cefotaxime), and cultured at 25℃ under a 16 h/8 h photoperiod.

After tobacco leaf tissue was transformed by Agrobacterium-mediated transformation, callus, leaf, and root differentiation were confirmed to proceed successfully in that order, and 50 tissue-cultured plantlets with leaves, stems, and roots were obtained (FIG. 6).

(3) Selection of mutants

1) Confirmation of the occurrence and pattern of mutations in the target genes

A 100 mg sample of healthy leaf tissue was ground uniformly, and genomic DNA was extracted and purified using a commercial silica-column kit (e.g., Nucleospin 96 plant II, Macherey Nagel, Germany).

After gDNA was extracted and purified from the leaf tissue, the target gene regions were amplified by PCR and sequenced.

The PMT1a, PMT2, PMT3, and PMT4 genes of the tissue-cultured plants were each amplified and sequenced, which showed that mutations were frequently induced at the sg3 sgRNA binding site (FIG. 7).

Among the three sgRNAs (sg1, sg2, sg3), the mutation rate at the sg3 binding site reached 50.25%, whereas the rate at the sg1 binding site was 2.75% (Table 4).

Table 4

| sgRNA | Number of mutations | Number of target genes | Mutation rate (%) |
|---|---|---|---|
| sg1 | 11 | 400 | 2.75 |
| sg2 | 0 | 400 | 0 |
| sg3 | 201 | 400 | 50.25 |

In addition, dual mutations were observed only at the sg3 sgRNA binding site, at a rate of 31% (Table 5).

Table 5

| sgRNA | Dual mutation (M + M): count | Dual mutation: ratio (%) | Half mutation (M + W): count | Half mutation: ratio (%) | Wild type (W): count | Wild type: ratio (%) | Total: count | Total: ratio (%) |
|---|---|---|---|---|---|---|---|---|
| sg1 | 0 | 0 | 11 | 5.5 | 189 | 94.5 | 200 | 100 |
| sg2 | 0 | 0 | 0 | 0 | 200 | 100 | 200 | 100 |
| sg3 | 62 | 31 | 77 | 38.5 | 61 | 30.5 | 200 | 100 |
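The figures in Table 4 appear to be consistent with the genotype counts in Table 5 if each of the 200 loci examined per sgRNA is taken to carry two alleles, a dual mutation contributing two mutated alleles and a half mutation one. This reading is an inference from the numbers rather than a statement in the source, and the function name `allele_mutation_rate` is hypothetical. A minimal Python sketch of the arithmetic:

```python
# Genotype counts per sgRNA site, taken from Table 5: (dual, half, wild type).
GENOTYPES = {
    "sg1": (0, 11, 189),
    "sg2": (0, 0, 200),
    "sg3": (62, 77, 61),
}


def allele_mutation_rate(dual: int, half: int, wild: int) -> tuple[int, int, float]:
    """Return (mutated alleles, total alleles, rate in %).

    Assumption: every locus has two alleles, so M + M counts twice
    and M + W counts once.
    """
    mutated = 2 * dual + half
    total = 2 * (dual + half + wild)
    return mutated, total, 100 * mutated / total


for name, counts in GENOTYPES.items():
    print(name, allele_mutation_rate(*counts))
```

Under this assumption the sg3 row gives 2 × 62 + 77 = 201 mutated alleles out of 400, which matches the 50.25% reported in Table 4, and the sg1 row gives 11 out of 400, or 2.75%.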

Twelve tissue-cultured plants carrying mutations in all alleles of the four genes PMT1a, PMT2, PMT3, and PMT4 were then selected (Table 6 and FIG. 8).

Table 6

| No. | Plant No. | PMT1a | PMT2 | PMT3 | PMT4 |
|---|---|---|---|---|---|
| 1 | 1 | Dual | Dual | Dual | Dual |
| 2 | 2 | Dual | Dual (T ins, G ins) | Dual | Dual |
| 3 | 3 | Dual (G ins, A ins) | Dual | Homo (T ins) | Dual |
| 4 | 4 | Dual (G ins, A ins) | Homo (T del) | Homo (A ins) | Homo (T del) |
| 5 | 5 | Dual | Homo (T ins) | Homo (T ins) | Homo (35 bp del) |
| 6 | 8 | Dual | Homo (23 bp del) | Dual | Homo (TGACTACCAAGATGTC del) |
| 7 | 10 | Dual | Homo (TG del) | Homo (A ins) | Homo (TG del) |
| 8 | 11 | Dual | Dual | Homo (A ins) | Homo (T del) |
| 9 | 13 | Homo (GA del) | Dual | Dual | Homo (T del) |
| 10 | 18 | Dual | Homo (T del) | Dual | Homo (T ins) |
| 11 | 21 | Dual (G ins, A ins) | Homo (T del) | Homo (T del) | Homo (T del) |
| 12 | 24 | Dual (G ins, A ins) | Homo (T del) | Homo (T del) | Homo (T del) |

※ Dual indicates that the two alleles carry different types of mutation (for example, in the PMT1a gene of plant No. 24, allele A carries a G insertion and allele B carries an A insertion).

※※ Homo indicates that the two alleles carry the same type of mutation (for example, in the PMT2 gene of plant No. 24, the same A deletion occurred in both allele A and allele B).

2) Acclimatization of plants

The twelve tissue-cultured plants with confirmed mutations were transplanted into pots filled with potting soil and grown in a greenhouse.

3) Securing F1-generation seeds and confirming removal of the introduced gene

To remove the gene block introduced for CRISPR/Cas9 expression, F1-generation seeds were obtained by self-fertilization. The F1 seeds were sown in 128-cell trays and grown for 30 days. Leaves were then collected and ground uniformly, and genomic DNA was extracted and purified using a commercial silica-column kit (e.g., Nucleospin 96 plant II, Macherey Nagel, Germany). PCR was performed with primers that specifically amplify the 35S promoter and the Cas9 DNA block (Table 7 below), and plants in which neither the 35S promoter nor the Cas9 DNA block was detected were selected.

Table 7

| Name | Sequence | PCR length | Note | SEQ ID NO |
|---|---|---|---|---|
| F_C9 | GACCATCCTGGACTTCCTGAAGAGC | 420 bp | Cas9 detection | 20 |
| R_C9 | TGCAGGTAGTACAGGTACAGCTTCTCG | 420 bp | Cas9 detection | 21 |
| F_35S | GCTCCTACAAATGCCATCA | 195 bp | 35S promoter detection | 22 |
| R_35S | GATAGTGGGATTGTGCGTCA | 195 bp | 35S promoter detection | 23 |

4) PMT genotype of the finally selected plant

The finally selected F1 plant (108_PMTm_F1) contained no introduced gene and carried a homozygous mutation in each of the PMT genes. When the DNA sequences of the four PMT genes of 108_PMTm_F1 were translated into amino acid sequences and compared with the PMT amino acid sequences of wild-type tobacco, stop codons were found to arise abnormally early (Table 8 and FIGS. 9 to 12).

Table 8

| Gene | PMT1 | PMT2 | PMT3 | PMT4 |
|---|---|---|---|---|
| Mutation pattern (108_PMTm_F1) | A ins | T del | T del | T del |
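To illustrate why a single-base insertion or deletion of the kind listed in Table 8 can produce a premature stop codon, the following Python sketch translates a short reading frame before and after deleting one base. The sequence `WILD` is a made-up toy example, not taken from any PMT gene, and the helper names `translate` and `delete_base` are hypothetical; only the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate from the first base, stopping after the first stop codon (*)."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


def delete_base(seq: str, index: int) -> str:
    """Return seq with the base at the 0-based index removed."""
    return seq[:index] + seq[index + 1:]


# Hypothetical toy reading frame (NOT a PMT sequence).
WILD = "ATGGCTCATGACTGGTTAGCTAAAGGTTAA"
MUTANT = delete_base(WILD, 8)  # single T deletion shifts the frame

print(translate(WILD))    # full-length toy peptide
print(translate(MUTANT))  # frameshift runs into an early stop codon
```

In this toy case the deletion shifts the reading frame so that an in-frame TAG appears after five codons, truncating the peptide. The actual positions of the premature stop codons in the PMT genes are those shown in FIGS. 9 to 12.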

(4) Analysis of nicotine content in the F1 generation

1) Preparation of samples for nicotine analysis

The flower stalks of plants grown for 60 days in a greenhouse were cut, and all leaves were harvested 2 weeks later. The harvested leaves were dried in a dry oven at 65℃ for 48 hours, placed in a container with glass beads, and ground using a gyro-shaker.

2) Analysis of nicotine content

Nicotine content was analyzed by GC/MS.

Quantitative GC/MS analysis of KB108 (wild type) and the mutant showed that the nicotine content across all KB108 leaves averaged 7.1 mg/g of dry leaf weight, whereas no nicotine was detected in the mutant samples.

An attempt was also made to compare the nornicotine and anatabine contents of the mutant with those of the control, but neither compound was detected in the mutant (Table 9).

Table 9

| Group | Number of plants | Nicotine (mg/g) 1) | Nornicotine (ratio) | Anatabine (ratio) |
|---|---|---|---|---|
| KB108 (control) | 10 | 7.1 | 0.024 | 0.022 |
| 108_PMTm | 12 | ND 2) | ND | ND |

1) Nicotine content (mg) per dry weight (g) of leaf
2) ND (Not Detected): below the detection limit (0.0002 mg/g)

In summary, a CRISPR/Cas9 system was designed to induce mutations specifically in the PMT genes, which play an important role in nicotine biosynthesis.

Three sgRNA binding sites that bind specifically to the four PMT genes present in cultivated tobacco (Nicotiana tabacum) were selected, and in the experiments the cleavage efficiency of sg3 was markedly higher than that of the other two.

Plants were selected in which all of the PMT1a, PMT2, PMT3, and PMT4 genes were mutated such that the PMT enzyme was not expected to be produced normally.

When the average contents of nicotine, nornicotine, and anatabine were measured in leaf samples of the mutant tobacco plants (108_PMTm) and the control (KB108), no nicotine was detected in the mutant, as shown in Table 9 above.

2. Production of low-nicotine tobacco using the CRISPR/Cas9 system targeting the PMT1a, PMT1b, PMT2, PMT3, and PMT4 genes, including the novel PMT1b gene

(1) Importance of the NtPMT1b gene

It was found that zero nicotine can be achieved only when the NtPMT1b gene (SEQ ID NO: 1) is inactivated by PMT gene editing in addition to the NtPMT1a, NtPMT2, NtPMT3, and NtPMT4 genes, and the gDNA sequence containing the complete CDS of the NtPMT1b gene was determined for the first time.

To date, no nucleotide sequence named NtPMT1b has been registered in GenBank, and no subsequent PMT-related study has examined the PMT1b gene. The sequence determined in this experiment is a gDNA sequence containing the entire CDS.

(2) Comparison of nicotine content according to whether the NtPMT1b gene is inactivated

CRISPR/Cas9 was used to induce mutations in the exon 2 region of the NtPMT genes, and two plants were selected: 7_23_1S, in which NtPMT1b was mutated along with NtPMT1a, NtPMT2, NtPMT3, and NtPMT4, and 7_23_4S, in which those four genes were mutated but NtPMT1b was not. No nicotine was detected in the leaves of 7_23_1S, whereas 0.5 mg/g of nicotine was detected in 7_23_4S (Table 10).

Table 10

| Plant | 7_23_1S | 7_23_4S |
|---|---|---|
| NtPMT1a | T del | AGAT del |
| NtPMT1b | T del | Wild type |
| NtPMT2 | T del | T del |
| NtPMT3 | T del | T del |
| NtPMT4 | T ins | T ins |
| Nicotine content in dried leaves | Not detected (detection limit: 0.0002 mg/g) | 0.5 mg/g |

These results indicate that, although the sgRNAs used for 108_PMTm in Example 1 were designed to target only PMT1a, PMT2, PMT3, and PMT4, the sg3 sgRNA as designed also induced mutation of PMT1b. They further show that nicotine can be completely controlled only when PMT1b is also inactivated.
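One way to see how an sgRNA designed against the other PMT genes could also act on PMT1b is to search the PMT1b sequence for the same target site on both strands. The Python sketch below does this for the sg3 site from Table 3 against a 30-nt excerpt copied from the SEQ ID NO: 1 (PMT1b) listing. It considers exact matches only, the helper names are hypothetical, and it is offered as an illustration rather than as the analysis performed in the source.

```python
# Illustrative sketch only: exact-match search for a target site on both strands.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_on_both_strands(template: str, site: str) -> dict[str, int]:
    """Return 0-based match positions on the listed (+) strand.

    "+" is the position of the site as written; "-" is the position of its
    reverse complement, i.e. a site lying on the opposite strand.
    A value of -1 means no exact match.
    """
    template, site = template.upper(), site.upper()
    return {
        "+": template.find(site),
        "-": template.find(reverse_complement(site)),
    }


SG3_SITE = "CTCAAAGAGCATGACATCTTGG"               # sg3 target from Table 3
PMT1B_EXCERPT = "gattaccaagatgtcatgctctttgaggta"  # from the SEQ ID NO: 1 listing

print(find_on_both_strands(PMT1B_EXCERPT, SG3_SITE))
```

In this excerpt the site is found only as its reverse complement, which suggests that sg3 matches the strand opposite to the one listed for PMT1b.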

(3) PMT (putrescine N-methyltransferase) gene

The product of the PMT gene catalyzes the S-adenosylmethionine-dependent N-methylation of putrescine, which leads to the production of nicotine and tropane alkaloids.

Cultivated tobacco (Nicotiana tabacum) has five genes with PMT function (NtPMT1a, NtPMT1b, NtPMT2, NtPMT3, NtPMT4).

Of these, the NtPMT2, NtPMT3, and NtPMT4 genes are presumed to be derived from the NsPMT1, NsPMT2, and NsPMT3 genes of N. sylvestris, respectively, based on a comparison of the number of repeat sequences in exon 1 and of the intron sequences.

NtPMT1 is divided into NtPMT1a and NtPMT1b; in general, NtPMT1 refers to NtPMT1a.

NtPMT1a is presumed to be derived from N. tomentosiformis, and NtPMT1b is presumed to be derived from N. tomentosiformis and N. otophora.

<110> KT&G CORPORATION <120> A NOVEL NICOTINE BIOSYNTHETIC GENE AND USE THEREOF <130> P20190961KR(PN130282KR) <150> KR 10-2019-0103310 <151> 2019-08-22 <160> 23 <170> KoPatentIn 3.0 <210> 1 <211> 2211 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(2211) <223> PMT1b <400> 1 atggaagtca tatctaccaa cacaaatggc tctaccatct tcaagaatgg taccattccc 60 atgaacggcc accaaaatgg ctcttccgaa cacctcaacg gctaccagaa tggcatttcc 120 aaacaccaaa acgggcacca gaatggcact tccgaacatc ggaacggcca ccagaatggg 180 acatccgaac aacagaacgg gacaatcagc catgacaatg gcaacgagct actgggaagc 240 tccaactcta ttaagcctgg ttggttttca gagtttagcg cattatggcc aggttagtat 300 taagaaagaa actcaaatgc atcgtactct tgtattttgg tttgtgtata atttataata 360 tggataaatt atgacgaagt aaactcgcat acattaattt gaaataatct aattttgcag 420 gtgaagcatt ctcacttaag gtcgagaagt tactattcca ggggaaatct gattaccaag 480 atgtcatgct ctttgaggta attaatattc tgatacacat gctttaatct aaagtgatac 540 ttttaattta cttttagttt attgcatgtg cacgtacagt cagcaactta tgggaaggtt 600 ctgactttgg atggagcaat tcaacataca gagaatggtg gatttccata cactgaaatg 660 attgttcatc taccacttgg ttccatccca aacccaaaaa aggttttgat catcggcgga 720 ggaattggtt ttacattatt cgaaatgctt cgttatcctt caatcgaaaa aattgacatt 780 gttgagatcg atgacgtggt agttgatgta agtcaaactt cttttaccca cataaagaaa 840 atgatttaga ttgcaattct ttttattttt ctaaaagaat aaatatattc tctctttttt 900 ttttttaaaa caaaattctc tttcttacag gtatccagaa aatttttccc ttatctggca 960 gctaatttta acgatcctcg tgtaacccta gttctcggag atggtgcgta tatgatagtc 1020 tcgttttata ttttatttca cttgattttt accttttttt gtggttaatt aatcatctac 1080 cattggttct ctttaccttc aggagctgca tttgtaaagg ctgcacaagc gggatattat 1140 gatgctatta tagtggactc ttctgatccc attggtacgc tattactatt taataccaag 1200 actattctta ttatataagc tactaagaaa ctaattgaat aattaataaa cgtaactgta 1260 attgatttct aaaataatat atataatttc aggtccagca aaagatttgt ttgagaggcc 1320 attctttgag gcagtagcca aagcccttag gccaggagga gttgtatgca cacaggctga 1380 aagcatttgg cttcatatgc atattattaa gcaaatcatt gctaactgtc gtcaagtctt 1440 taagggttct gtcaactatg 
cttggacaac cgttccaaca tatcccacgt attctttttc 1500 tctctctctt cctgtctttt tcgatgcaat gtaaatttat aaaattggaa gtccgtttta 1560 cttttctata gacgtagatc ctaaaattgt caagaaatgg agaattgact tacaagaaaa 1620 atcaacttct tttcatttac tattcttttt ggtgacaaac tttacttatt atttcgttct 1680 aaaatgaaaa tttattttta tattttaaaa taatttagct ttaaactttt aattttactt 1740 gttatatttt taataaaaaa gatttattgt caaataaatg ttgtggccat acaataagtt 1800 tcaaattatg tcacaaaaat tgaaacagag tgagcaaatt tttatttcaa gtaaactttc 1860 aaggaattgt gtttaagttt tctcaactga tactgatcaa tgatgtctta aatattttga 1920 tttcagcggt gtgattggtt atatgctctg ctctactgaa gggccagaag ttaacttcaa 1980 gaatccagta aatccaattg acaaagagac aactcaagtc aagtccaaat taggacctct 2040 caagttctac aactctgatg taacttcata tctcacaatt tctttttccg ttttactgta 2100 tgttcttcat caaattttat aactaactct tttcatattg tctttttttt tcagattcac 2160 aaagcagcat tcattttgcc atctttcgcc cgaagtatga tcgagtctta a 2211 <210> 2 <211> 3129 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3129) <223> PMT1a <400> 2 gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 aacgggcacc 
agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 tcccaaaccc aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340 attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400 tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460 taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520 atagtcaaat aaatgttgtg 
accatataaa aacctccgca tttttaagat cataagtttc 2580 agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640 aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700 ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760 tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820 caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880 aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940 actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000 ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060 cattggacca agatcgagtc ttaatcaagt gaataaataa gtgaaatgcg acgtattgta 3120 ggagaattc 3129 <210> 3 <211> 3091 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3091) <223> PMT2 <400> 3 ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 aatctataac caaagcagct ggttgagtga ctgtgcgagt tgcagaaaca attgaagggt 120 catttgtgga atttggggcc atttcaaagg aaaaagaaaa gatgacttag cattaataaa 180 tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 
ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140 cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200 cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260 cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320 ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380 aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440 ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500 taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560 tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620 tacacatgct tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680 gtacagtcag caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740 aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800 ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860 tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920 caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980 gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040 tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100 ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160 tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220 ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280 tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340 tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400 ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460 gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520 tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580 ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640 gaaaaaccaa cttcttttct tttactcttc aaggtattgt gtttaatttt ttttcaactg 2700 atatgatcaa 
ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760 gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820 agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880 cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940 gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000 tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060 aataagtgaa agacgtattg tatgagaatt c 3091 <210> 4 <211> 3269 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3269) <223> PMT3 <400> 4 gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 gaatactgat ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 ggttaccaga atggcacttc caaacaccaa 
aacggccacc agaatggcac ttccgaacat 1260 cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 atcgaatttt gcaggtgaag cattctcact taaggttgag aagttactat tccaggggaa 1740 gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 atgtatgatt atatttaaag tggtattttt cgtttatttt taatttattg cacgtgtacg 1860 tacagtcagc aacatatggg aaggttctga ctttggatgg agcaattcaa cacacagaga 1920 atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 taatttcagg tccagcaaaa gacttgtttg agaggccatt ctttgaggca gtagccaaag 2580 ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 tcagcggtgt gattggttat atgctctgtt ctactgaagg 
accagaagtt gacttcaaga 2940 atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 gtgaaatgcg acgtattgta taagaattc 3269 <210> 5 <211> 3001 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3001) <223> PMT4 <400> 5 gtcgacctct gattccacaa gtcatgcacc cattcaatta tttaatggaa accaatttta 60 ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 gcacacacac gtactaataa aattagggta ctactttact aataattgga cagagactaa 240 attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540 ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 720 cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 780 gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc caggttagta 900 ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 
ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 aaatcgatga cgtggtagtt gatgtaagtc aaatttcttt tactcacata aaaaaatgat 1500 ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgtcatt 1740 ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 1860 atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 tccagcaaaa gatttgtttg agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980 aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280 aaactttatt tattatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460 taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700 actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760 ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 aagatcgagt 
cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 c 3001 <210> 6 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> F_PMT1a <400> 6 gcatagatgt ttattgggag tgtacagcaa 30 <210> 7 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> R_PMT1a <400> 7 agttgctgac tgtacgtgca ca 22 <210> 8 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> F_PMT2 <400> 8 catatgtgtc tattgggagt gtacatcaa 29 <210> 9 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> R_PMT2 <400> 9 aagttgctga ctgtacgtac acat 24 <210> 10 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> F_PMT3 <400> 10 catagatatg tttattggga gtgtacatca a 31 <210> 11 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> R_PMT3 <400> 11 ccactttaaa tataatcata catgagcttg ac 32 <210> 12 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> F_PMT4 <400> 12 gcgagctttc ataaagtaca aatca 25 <210> 13 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> R_PMT4 <400> 13 agtatttatt catattaaag catgtgtatt aga 33 <210> 14 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg1 <400> 14 aagtcatatc taccaacaca aatgg 25 <210> 15 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg2 <400> 15 ttttcagagt ttagcgcatt atgg 24 <210> 16 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg3 <400> 16 ctcaaagagc atgacatctt gg 22 <210> 17 <211> 17283 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE <400> 17 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc 
ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc 
gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa 
cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat 
cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg 
ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 
8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg accccgggac tagtctcgag 10500 cacgtggaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 10560 ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 
10620 ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgcccgct cctttcgctt 10680 tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 10740 tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 10800 gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 10860 agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 10920 cgggctattc ttttgattta taagggattt tgccgatttc ggaaccacca tcaaacagga 10980 ttttcgcctg ctggggcaaa ccagcgtgga ccgcttgctg caactctctc agggccaggc 11040 ggtgaagggc aatcagctgt tgcccgtctc actggtgaaa agaaaaacca ccccagtaca 11100 ttaaaaacgt ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat 11160 atatcctgcc accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac 11220 tcgatacagg cagcccatca gtccgggacg gcgtcagcgg gagagccgtt gtaaggcggc 11280 agactttgct catgttaccg atgctattcg gaagaacggc aactaagctg ccgggtttga 11340 aacacggatg atctcgcgga gggtagcatg ttgattgtaa cgatgacaga gcgttgctgc 11400 ctgtgatcaa atatcatctc cctcgcagag atccgaatta tcagccttct tattcatttc 11460 tcgcttaacc gtgacaggct gtcgatcttg agaactatgc cgacataata ggaaatcgct 11520 ggataaagcc gctgaggaag ctgagtggcg ctatttcttt agaagtgaac gttgacgata 11580 tcaactcccc tatccattgc tcaccgaatg gtacaggtcg gggacccgaa gttccgactg 11640 tcggcctgat gcatccccgg ctgatcgacc ccagatctgg ggctgagaaa gcccagtaag 11700 gaaacaactg taggttcgag tcgcgagatc ccccggaacc aaaggaagta ggttaaaccc 11760 gctccgatca ggccgagcca cgccaggccg agaacattgg ttcctgtagg catcgggatt 11820 ggcggatcaa acactaaagc tactggaacg agcagaagtc ctccggccgc cagttgccag 11880 gcggtaaagg tgagcagagg cacgggaggt tgccacttgc gggtcagcac ggttccgaac 11940 gccatggaaa ccgcccccgc caggcccgct gcgacgccga caggatctag cgctgcgttt 12000 ggtgtcaaca ccaacagcgc cacgcccgca gttccgcaaa tagcccccag gaccgccatc 12060 aatcgtatcg ggctacctag cagagcggca gagatgaaca cgaccatcag cggctgcaca 12120 gcgcctaccg tcgccgcgac cccgcccggc aggcggtaga ccgaaataaa caacaagctc 12180 cagaatagcg aaatattaag tgcgccgagg atgaagatgc gcatccacca gattcccgtt 12240 ggaatctgtc ggacgatcat cacgagcaat aaacccgccg 
gcaacgcccg cagcagcata 12300 ccggcgaccc ctcggcctcg ctgttcgggc tccacgaaaa cgccggacag atgcgccttg 12360 tgagcgtcct tggggccgtc ctcctgtttg aagaccgaca gcccaatgat ctcgccgtcg 12420 atgtaggcgc cgaatgccac ggcatctcgc aaccgttcag cgaacgcctc catgggcttt 12480 ttctcctcgt gctcgtaaac ggacccgaac atctctggag ctttcttcag ggccgacaat 12540 cggatctcgc ggaaatcctg cacgtcggcc gctccaagcc gtcgaatctg agccttaatc 12600 acaattgtca attttaatcc tctgtttatc ggcagttcgt agagcgcgcc gtgcgtcccg 12660 agcgatactg agcgaagcaa gtgcgtcgag cagtgcccgc ttgttcctga aatgccagta 12720 aagcgctggc tgctgaaccc ccagccggaa ctgaccccac aaggccctag cgtttgcaat 12780 gcaccaggtc atcattgacc caggcgtgtt ccaccaggcc gctgcctcgc aactcttcgc 12840 aggcttcgcc gacctgctcg cgccacttct tcacgcgggt ggaatccgat ccgcacatga 12900 ggcggaaggt ttccagcttg agcgggtacg gctcccggtg cgagctgaaa tagtcgaaca 12960 tccgtcgggc cgtcggcgac agcttgcggt acttctccca tatgaatttc gtgtagtggt 13020 cgccagcaaa cagcacgacg atttcctcgt cgatcaggac ctggcaacgg gacgttttct 13080 tgccacggtc caggacgcgg aagcggtgca gcagcgacac cgattccagg tgcccaacgc 13140 ggtcggacgt gaagcccatc gccgtcgcct gtaggcgcga caggcattcc tcggccttcg 13200 tgtaataccg gccattgatc gaccagccca ggtcctggca aagctcgtag aacgtgaagg 13260 tgatcggctc gccgataggg gtgcgcttcg cgtactccaa cacctgctgc cacaccagtt 13320 cgtcatcgtc ggcccgcagc tcgacgccgg tgtaggtgat cttcacgtcc ttgttgacgt 13380 ggaaaatgac cttgttttgc agcgcctcgc gcgggatttt cttgttgcgc gtggtgaaca 13440 gggcagagcg ggccgtgtcg tttggcatcg ctcgcatcgt gtccggccac ggcgcaatat 13500 cgaacaagga aagctgcatt tccttgatct gctgcttcgt gtgtttcagc aacgcggcct 13560 gcttggcctc gctgacctgt tttgccaggt cctcgccggc ggtttttcgc ttcttggtcg 13620 tcatagttcc tcgcgtgtcg atggtcatcg acttcgccaa acctgccgcc tcctgttcga 13680 gacgacgcga acgctccacg gcggccgatg gcgcgggcag ggcaggggga gccagttgca 13740 cgctgtcgcg ctcgatcttg gccgtagctt gctggaccat cgagccgacg gactggaagg 13800 tttcgcgggg cgcacgcatg acggtgcggc ttgcgatggt ttcggcatcc tcggcggaaa 13860 accccgcgtc gatcagttct tgcctgtatg ccttccggtc aaacgtccga ttcattcacc 13920 ctccttgcgg gattgccccg 
actcacgccg gggcaatgtg cccttattcc tgatttgacc 13980 cgcctggtgc cttggtgtcc agataatcca ccttatcggc aatgaagtcg gtcccgtaga 14040 ccgtctggcc gtccttctcg tacttggtat tccgaatctt gccctgcacg aataccagcg 14100 accccttgcc caaatacttg ccgtgggcct cggcctgaga gccaaaacac ttgatgcgga 14160 agaagtcggt gcgctcctgc ttgtcgccgg catcgttgcg ccacatctag gtactaaaac 14220 aattcatcca gtaaaatata atattttatt ttctcccaat caggcttgat ccccagtaag 14280 tcaaaaaata gctcgacata ctgttcttcc ccgatatcct ccctgatcga ccggacgcag 14340 aaggcaatgt cataccactt gtccgccctg ccgcttctcc caagatcaat aaagccactt 14400 actttgccat ctttcacaaa gatgttgctg tctcccaggt cgccgtggga aaagacaagt 14460 tcctcttcgg gcttttccgt ctttaaaaaa tcatacagct cgcgcggatc tttaaatgga 14520 gtgtcttctt cccagttttc gcaatccaca tcggccagat cgttattcag taagtaatcc 14580 aattcggcta agcggctgtc taagctattc gtatagggac aatccgatat gtcgatggag 14640 tgaaagagcc tgatgcactc cgcatacagc tcgataatct tttcagggct ttgttcatct 14700 tcatactctt ccgagcaaag gacgccatcg gcctcactca tgagcagatt gctccagcca 14760 tcatgccgtt caaagtgcag gacctttgga acaggcagct ttccttccag ccatagcatc 14820 atgtcctttt cccgttccac atcataggtg gtccctttat accggctgtc cgtcattttt 14880 aaatataggt tttcattttc tcccaccagc ttatatacct tagcaggaga cattccttcc 14940 gtatctttta cgcagcggta tttttcgatc agttttttca attccggtga tattctcatt 15000 ttagccattt attatttcct tcctcttttc tacagtattt aaagataccc caagaagcta 15060 attataacaa gacgaactcc aattcactgt tccttgcatt ctaaaacctt aaataccaga 15120 aaacagcttt ttcaaagttg ttttcaaagt tggcgtataa catagtatcg acggagccga 15180 ttttgaaacc acaattatgg gtgatgctgc caacttactg atttagtgta tgatggtgtt 15240 tttgaggtgc tccagtggct tctgtgtcta tcagctgtcc ctcctgttca gctactgacg 15300 gggtggtgcg taacggcaaa agcaccgccg gacatcagcg ctatctctgc tctcactgcc 15360 gtaaaacatg gcaactgcag ttcacttaca ccgcttctca acccggtacg caccagaaaa 15420 tcattgatat ggccatgaat ggcgttggat gccgggcaac agcccgcatt atgggcgttg 15480 gcctcaacac gattttacgt cacttaaaaa actcaggccg cagtcggtaa cctcgcgcat 15540 acagccgggc agtgacgtca tcgtctgcgc ggaaatggac gaacagtggg gctatgtcgg 15600 
ggctaaatcg cgccagcgct ggctgtttta cgcgtatgac agtctccgga agacggttgt 15660 tgcgcacgta ttcggtgaac gcactatggc gacgctgggg cgtcttatga gcctgctgtc 15720 accctttgac gtggtgatat ggatgacgga tggctggccg ctgtatgaat cccgcctgaa 15780 gggaaagctg cacgtaatca gcaagcgata tacgcagcga attgagcggc ataacctgaa 15840 tctgaggcag cacctggcac ggctgggacg gaagtcgctg tcgttctcaa aatcggtgga 15900 gctgcatgac aaagtcatcg ggcattatct gaacataaaa cactatcaat aagttggagt 15960 cattacccaa ttatgataga atttacaagc tataaggtta ttgtcctggg tttcaagcat 16020 tagtccatgc aagtttttat gctttgccca ttctatagat atattgataa gcgcgctgcc 16080 tatgccttgc cccctgaaat ccttacatac ggcgatatct tctatataaa agatatatta 16140 tcttatcagt attgtcaata tattcaaggc aatctgcctc ctcatcctct tcatcctctt 16200 cgtcttggta gctttttaaa tatggcgctt catagagtaa ttctgtaaag gtccaattct 16260 cgttttcata cctcggtata atcttaccta tcacctcaaa tggttcgctg ggtttatcgc 16320 acccccgaac acgagcacgg cacccgcgac cactatgcca agaatgccca aggtaaaaat 16380 tgccggcccc gccatgaagt ccgtgaatgc cccgacggcc gaagtgaagg gcaggccgcc 16440 acccaggccg ccgccctcac tgcccggcac ctggtcgctg aatgtcgatg ccagcacctg 16500 cggcacgtca atgcttccgg gcgtcgcgct cgggctgatc gcccatcccg ttactgcccc 16560 gatcccggca atggcaagga ctgccagcgc tgccattttt ggggtgaggc cgttcgcggc 16620 cgaggggcgc agcccctggg gggatgggag gcccgcgtta gcgggccggg agggttcgag 16680 aagggggggc accccccttc ggcgtgcgcg gtcacgcgca cagggcgcag ccctggttaa 16740 aaacaaggtt tataaatatt ggtttaaaag caggttaaaa gacaggttag cggtggccga 16800 aaaacgggcg gaaacccttg caaatgctgg attttctgcc tgtggacagc ccctcaaatg 16860 tcaataggtg cgcccctcat ctgtcagcac tctgcccctc aagtgtcaag gatcgcgccc 16920 ctcatctgtc agtagtcgcg cccctcaagt gtcaataccg cagggcactt atccccaggc 16980 ttgtccacat catctgtggg aaactcgcgt aaaatcaggc gttttcgccg atttgcgagg 17040 ctggccagct ccacgtcgcc ggccgaaatc gagcctgccc ctcatctgtc aacgccgcgc 17100 cgggtgagtc ggcccctcaa gtgtcaacgt ccgcccctca tctgtcagtg agggccaagt 17160 tttccgcgag gtatccacaa cgccggcggc cgcggtgtct cgcacacggc ttcgacggcg 17220 tttctggcgc gtttgcaggg ccatagacgg ccgccagccc agcggcgagg 
gcaaccagcc 17280 cgg 17283 <210> 18 <211> 18700 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE_PMT <400> 18 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg 
ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 
3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 
cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg 
aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta 
cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag 
aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg acagaaatct caaaattccg 10500 gcagaacaat tttgaatctc gatccgtaga aacgagacgg tcattgtttt agttccacca 10560 cgattatatt tgaaatttac gtgagtgtga gtgagacttg cataagaaaa taaaatcttt 10620 agttgggaaa aaattcaata atataaatgg gcttgagaag gaagcgaggg ataggccttt 10680 ttctaaaata ggcccattta agctattaac aatcttcaaa agtaccacag cgcttaggta 10740 aagaaagcag ctgagtttat atatggttag agacgaagta gtgattgaag tcatatctac 10800 caacacaaag ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 10860 tgaaaaagtg gcaccgagtc ggtgcttttt tcaaatgaca attaaccacc gtgtattcgt 10920 tataacatca ggcagtttaa gtcgggacaa taggagccgc aatacacagt ttaccgcatc 10980 ttgacctaac tagaaatctc aaaattccgg cagaacaatt ttgaatctcg atccgtagaa 11040 acgagacggt cattgtttta gttccaccac gattatattt gaaatttacg tgagtgtgag 11100 tgagacttgc ataagaaaat aaaatcttta gttgggaaaa aattcaataa tataaatggg 11160 cttgagaagg aagcgaggga taggcctttt tctaaaatag gcccatttaa gctattaaca 11220 atcttcaaaa gtaccacagc gcttaggtaa agaaagcagc tgagtttata tatggttaga 11280 gacgaagtag tgattgtttt cagagtttag cgcattagtt ttagagctag aaatagcaag 11340 ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 11400 ctgcgcttcg atcttagatc aaggttgcct ggagtgtctc ttgaactgta cagagttgag 11460 ctagtcgcga agcagatcag caacacattt ctgcttgcga gaaatctcaa aattccggca 11520 gaacaatttt gaatctcgat ccgtagaaac gagacggtca ttgttttagt tccaccacga 11580 ttatatttga aatttacgtg agtgtgagtg agacttgcat aagaaaataa aatctttagt 11640 tgggaaaaaa ttcaataata taaatgggct tgagaaggaa gcgagggata ggcctttttc 11700 taaaataggc 
ccatttaagc tattaacaat cttcaaaagt accacagcgc ttaggtaaag 11760 aaagcagctg agtttatata tggttagaga cgaagtagtg attgctcaaa gagcatgaca 11820 tctgttttag agctagaaat agcaagttaa aataaggcta gtccgttatc aacttgaaaa 11880 agtggcaccg agtcggtgct tttttactag tctcgagcac gtggaattca ctggccgtcg 11940 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 12000 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 12060 agttgcgcag cctgaatggc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 12120 ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 12180 gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtgggcca 12240 tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 12300 ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa 12360 gggattttgc cgatttcgga accaccatca aacaggattt tcgcctgctg gggcaaacca 12420 gcgtggaccg cttgctgcaa ctctctcagg gccaggcggt gaagggcaat cagctgttgc 12480 ccgtctcact ggtgaaaaga aaaaccaccc cagtacatta aaaacgtccg caatgtgtta 12540 ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 12600 cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc 12660 cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg 12720 ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg 12780 tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct 12840 cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc 12900 gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg 12960 agtggcgcta tttctttaga agtgaacgtt gacgatatca actcccctat ccattgctca 13020 ccgaatggta caggtcgggg acccgaagtt ccgactgtcg gcctgatgca tccccggctg 13080 atcgacccca gatctggggc tgagaaagcc cagtaaggaa acaactgtag gttcgagtcg 13140 cgagatcccc cggaaccaaa ggaagtaggt taaacccgct ccgatcaggc cgagccacgc 13200 caggccgaga acattggttc ctgtaggcat cgggattggc ggatcaaaca ctaaagctac 13260 tggaacgagc agaagtcctc cggccgccag ttgccaggcg gtaaaggtga gcagaggcac 13320 gggaggttgc cacttgcggg tcagcacggt tccgaacgcc atggaaaccg cccccgccag 
13380 gcccgctgcg acgccgacag gatctagcgc tgcgtttggt gtcaacacca acagcgccac 13440 gcccgcagtt ccgcaaatag cccccaggac cgccatcaat cgtatcgggc tacctagcag 13500 agcggcagag atgaacacga ccatcagcgg ctgcacagcg cctaccgtcg ccgcgacccc 13560 gcccggcagg cggtagaccg aaataaacaa caagctccag aatagcgaaa tattaagtgc 13620 gccgaggatg aagatgcgca tccaccagat tcccgttgga atctgtcgga cgatcatcac 13680 gagcaataaa cccgccggca acgcccgcag cagcataccg gcgacccctc ggcctcgctg 13740 ttcgggctcc acgaaaacgc cggacagatg cgccttgtga gcgtccttgg ggccgtcctc 13800 ctgtttgaag accgacagcc caatgatctc gccgtcgatg taggcgccga atgccacggc 13860 atctcgcaac cgttcagcga acgcctccat gggctttttc tcctcgtgct cgtaaacgga 13920 cccgaacatc tctggagctt tcttcagggc cgacaatcgg atctcgcgga aatcctgcac 13980 gtcggccgct ccaagccgtc gaatctgagc cttaatcaca attgtcaatt ttaatcctct 14040 gtttatcggc agttcgtaga gcgcgccgtg cgtcccgagc gatactgagc gaagcaagtg 14100 cgtcgagcag tgcccgcttg ttcctgaaat gccagtaaag cgctggctgc tgaaccccca 14160 gccggaactg accccacaag gccctagcgt ttgcaatgca ccaggtcatc attgacccag 14220 gcgtgttcca ccaggccgct gcctcgcaac tcttcgcagg cttcgccgac ctgctcgcgc 14280 cacttcttca cgcgggtgga atccgatccg cacatgaggc ggaaggtttc cagcttgagc 14340 gggtacggct cccggtgcga gctgaaatag tcgaacatcc gtcgggccgt cggcgacagc 14400 ttgcggtact tctcccatat gaatttcgtg tagtggtcgc cagcaaacag cacgacgatt 14460 tcctcgtcga tcaggacctg gcaacgggac gttttcttgc cacggtccag gacgcggaag 14520 cggtgcagca gcgacaccga ttccaggtgc ccaacgcggt cggacgtgaa gcccatcgcc 14580 gtcgcctgta ggcgcgacag gcattcctcg gccttcgtgt aataccggcc attgatcgac 14640 cagcccaggt cctggcaaag ctcgtagaac gtgaaggtga tcggctcgcc gataggggtg 14700 cgcttcgcgt actccaacac ctgctgccac accagttcgt catcgtcggc ccgcagctcg 14760 acgccggtgt aggtgatctt cacgtccttg ttgacgtgga aaatgacctt gttttgcagc 14820 gcctcgcgcg ggattttctt gttgcgcgtg gtgaacaggg cagagcgggc cgtgtcgttt 14880 ggcatcgctc gcatcgtgtc cggccacggc gcaatatcga acaaggaaag ctgcatttcc 14940 ttgatctgct gcttcgtgtg tttcagcaac gcggcctgct tggcctcgct gacctgtttt 15000 gccaggtcct cgccggcggt ttttcgcttc ttggtcgtca 
tagttcctcg cgtgtcgatg 15060 gtcatcgact tcgccaaacc tgccgcctcc tgttcgagac gacgcgaacg ctccacggcg 15120 gccgatggcg cgggcagggc agggggagcc agttgcacgc tgtcgcgctc gatcttggcc 15180 gtagcttgct ggaccatcga gccgacggac tggaaggttt cgcggggcgc acgcatgacg 15240 gtgcggcttg cgatggtttc ggcatcctcg gcggaaaacc ccgcgtcgat cagttcttgc 15300 ctgtatgcct tccggtcaaa cgtccgattc attcaccctc cttgcgggat tgccccgact 15360 cacgccgggg caatgtgccc ttattcctga tttgacccgc ctggtgcctt ggtgtccaga 15420 taatccacct tatcggcaat gaagtcggtc ccgtagaccg tctggccgtc cttctcgtac 15480 ttggtattcc gaatcttgcc ctgcacgaat accagcgacc ccttgcccaa atacttgccg 15540 tgggcctcgg cctgagagcc aaaacacttg atgcggaaga agtcggtgcg ctcctgcttg 15600 tcgccggcat cgttgcgcca catctaggta ctaaaacaat tcatccagta aaatataata 15660 ttttattttc tcccaatcag gcttgatccc cagtaagtca aaaaatagct cgacatactg 15720 ttcttccccg atatcctccc tgatcgaccg gacgcagaag gcaatgtcat accacttgtc 15780 cgccctgccg cttctcccaa gatcaataaa gccacttact ttgccatctt tcacaaagat 15840 gttgctgtct cccaggtcgc cgtgggaaaa gacaagttcc tcttcgggct tttccgtctt 15900 taaaaaatca tacagctcgc gcggatcttt aaatggagtg tcttcttccc agttttcgca 15960 atccacatcg gccagatcgt tattcagtaa gtaatccaat tcggctaagc ggctgtctaa 16020 gctattcgta tagggacaat ccgatatgtc gatggagtga aagagcctga tgcactccgc 16080 atacagctcg ataatctttt cagggctttg ttcatcttca tactcttccg agcaaaggac 16140 gccatcggcc tcactcatga gcagattgct ccagccatca tgccgttcaa agtgcaggac 16200 ctttggaaca ggcagctttc cttccagcca tagcatcatg tccttttccc gttccacatc 16260 ataggtggtc cctttatacc ggctgtccgt catttttaaa tataggtttt cattttctcc 16320 caccagctta tataccttag caggagacat tccttccgta tcttttacgc agcggtattt 16380 ttcgatcagt tttttcaatt ccggtgatat tctcatttta gccatttatt atttccttcc 16440 tcttttctac agtatttaaa gataccccaa gaagctaatt ataacaagac gaactccaat 16500 tcactgttcc ttgcattcta aaaccttaaa taccagaaaa cagctttttc aaagttgttt 16560 tcaaagttgg cgtataacat agtatcgacg gagccgattt tgaaaccaca attatgggtg 16620 atgctgccaa cttactgatt tagtgtatga tggtgttttt gaggtgctcc agtggcttct 16680 gtgtctatca gctgtccctc 
ctgttcagct actgacgggg tggtgcgtaa cggcaaaagc 16740 accgccggac atcagcgcta tctctgctct cactgccgta aaacatggca actgcagttc 16800 acttacaccg cttctcaacc cggtacgcac cagaaaatca ttgatatggc catgaatggc 16860 gttggatgcc gggcaacagc ccgcattatg ggcgttggcc tcaacacgat tttacgtcac 16920 ttaaaaaact caggccgcag tcggtaacct cgcgcataca gccgggcagt gacgtcatcg 16980 tctgcgcgga aatggacgaa cagtggggct atgtcggggc taaatcgcgc cagcgctggc 17040 tgttttacgc gtatgacagt ctccggaaga cggttgttgc gcacgtattc ggtgaacgca 17100 ctatggcgac gctggggcgt cttatgagcc tgctgtcacc ctttgacgtg gtgatatgga 17160 tgacggatgg ctggccgctg tatgaatccc gcctgaaggg aaagctgcac gtaatcagca 17220 agcgatatac gcagcgaatt gagcggcata acctgaatct gaggcagcac ctggcacggc 17280 tgggacggaa gtcgctgtcg ttctcaaaat cggtggagct gcatgacaaa gtcatcgggc 17340 attatctgaa cataaaacac tatcaataag ttggagtcat tacccaatta tgatagaatt 17400 tacaagctat aaggttattg tcctgggttt caagcattag tccatgcaag tttttatgct 17460 ttgcccattc tatagatata ttgataagcg cgctgcctat gccttgcccc ctgaaatcct 17520 tacatacggc gatatcttct atataaaaga tatattatct tatcagtatt gtcaatatat 17580 tcaaggcaat ctgcctcctc atcctcttca tcctcttcgt cttggtagct ttttaaatat 17640 ggcgcttcat agagtaattc tgtaaaggtc caattctcgt tttcatacct cggtataatc 17700 ttacctatca cctcaaatgg ttcgctgggt ttatcgcacc cccgaacacg agcacggcac 17760 ccgcgaccac tatgccaaga atgcccaagg taaaaattgc cggccccgcc atgaagtccg 17820 tgaatgcccc gacggccgaa gtgaagggca ggccgccacc caggccgccg ccctcactgc 17880 ccggcacctg gtcgctgaat gtcgatgcca gcacctgcgg cacgtcaatg cttccgggcg 17940 tcgcgctcgg gctgatcgcc catcccgtta ctgccccgat cccggcaatg gcaaggactg 18000 ccagcgctgc catttttggg gtgaggccgt tcgcggccga ggggcgcagc ccctgggggg 18060 atgggaggcc cgcgttagcg ggccgggagg gttcgagaag ggggggcacc ccccttcggc 18120 gtgcgcggtc acgcgcacag ggcgcagccc tggttaaaaa caaggtttat aaatattggt 18180 ttaaaagcag gttaaaagac aggttagcgg tggccgaaaa acgggcggaa acccttgcaa 18240 atgctggatt ttctgcctgt ggacagcccc tcaaatgtca ataggtgcgc ccctcatctg 18300 tcagcactct gcccctcaag tgtcaaggat cgcgcccctc atctgtcagt agtcgcgccc 18360 
ctcaagtgtc aataccgcag ggcacttatc cccaggcttg tccacatcat ctgtgggaaa 18420 ctcgcgtaaa atcaggcgtt ttcgccgatt tgcgaggctg gccagctcca cgtcgccggc 18480 cgaaatcgag cctgcccctc atctgtcaac gccgcgccgg gtgagtcggc ccctcaagtg 18540 tcaacgtccg cccctcatct gtcagtgagg gccaagtttt ccgcgaggta tccacaacgc 18600 cggcggccgc ggtgtctcgc acacggcttc gacggcgttt ctggcgcgtt tgcagggcca 18660 tagacggccg ccagcccagc ggcgagggca accagcccgg 18700 <210> 19 <211> 304 <212> DNA <213> Artificial Sequence <220> <223> P_U6 <400> 19 agaaatctca aaattccggc agaacaattt tgaatctcga tccgtagaaa cgagacggtc 60 attgttttag ttccaccacg attatatttg aaatttacgt gagtgtgagt gagacttgca 120 taagaaaata aaatctttag ttgggaaaaa attcaataat ataaatgggc ttgagaagga 180 agcgagggat aggccttttt ctaaaatagg cccatttaag ctattaacaa tcttcaaaag 240 taccacagcg cttaggtaaa gaaagcagct gagtttatat atggttagag acgaagtagt 300 gatt 304 <210> 20 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> F_C9 <400> 20 gaccatcctg gacttcctga agagc 25 <210> 21 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> R_C9 <400> 21 tgcaggtagt acaggtacag cttctcg 27 <210> 22 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> F_35S <400> 22 gctcctacaa atgccatca 19 <210> 23 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> R_35S <400> 23 gatagtggga ttgtgcgtca 20 <110> KT&G CORPORATION <120> A NOVEL NICOTINE BIOSYNTHETIC GENE AND USE THEREOF <130> P20190961KR(PN130282KR) <150> KR 10-2019-0103310 <151> 2019-08-22 <160> 23 <170> KoPatentIn 3.0 <210> 1 <211> 2211 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(2211) <223> PMT1b <400> 1 atggaagtca tatctaccaa cacaaatggc tctaccatct tcaagaatgg taccattccc 60 atgaaccgcc accaaaatgg ctcttccgaa cacctcaacg gctaccagaa tggcatttcc 120 aaacaccaaa acgggcacca gaatggcact tccgaacatc ggaacggcca ccagaatggg 180 acatccgaac aacagaacgg gacaatcagc catgacaatg gcaacgagct actgggaagc 240 tccaactcta ttaagcctgg ttggttttca gagtttagcg cattatggcc aggttagtat 300 taagaaagaa actcaaatgc atcgtactct tgtattttgg tttgtgtata atttataata 360 
tggataaatt atgacgaagt aaactcgcat acattaattt gaaataatct aattttgcag 420 gtgaagcatt ctcacttaag gtcgagaagt tactattcca ggggaaatct gattaccaag 480 atgtcatgct ctttgaggta attaatattc tgatacacat gctttaatct aaagtgatac 540 ttttaattta cttttagttt attgcatgtg cacgtacagt cagcaactta tgggaaggtt 600 ctgactttgg atggagcaat tcaacataca gagaatggtg gatttccata cactgaaatg 660 attgttcatc taccacttgg ttccatccca aacccaaaaa aggttttgat catcggcgga 720 ggaattggtt ttacattatt cgaaatgctt cgttatcctt caatcgaaaa aattgacatt 780 gttgagatcg atgacgtggt agttgatgta agtcaaactt cttttaccca cataaagaaa 840 atgatttaga ttgcaattct ttttattttt ctaaaagaat aaatatattc tctctttttt 900 ttttttaaaa caaaattctc tttcttacag gtatccagaa aatttttccc ttatctggca 960 gctaatttta acgatcctcg tgtaacccta gttctcggag atggtgcgta tatgatagtc 1020 tcgttttata ttttatttca cttgattttt accttttttt gtggttaatt aatcatctac 1080 cattggttct ctttaccttc aggagctgca tttgtaaagg ctgcacaagc gggatattat 1140 gatgctatta tagtggactc ttctgatccc attggtacgc tattactatt taataccaag 1200 actattctta ttatataagc tactaagaaa ctaattgaat aattaataaa cgtaactgta 1260 attgatttct aaaataatat atataatttc aggtccagca aaagatttgt ttgagaggcc 1320 attctttgag gcagtagcca aagcccttag gccaggagga gttgtatgca cacaggctga 1380 aagcatttgg cttcatatgc atattattaa gcaaatcatt gctaactgtc gtcaagtctt 1440 taagggttct gtcaactatg cttggacaac cgttccaaca tatcccacgt attctttttc 1500 tctctctctt cctgtctttt tcgatgcaat gtaaatttat aaaattggaa gtccgtttta 1560 cttttctata gacgtagatc ctaaaattgt caagaaatgg agaattgact tacaagaaaa 1620 atcaacttct tttcatttac tattcttttt ggtgacaaac tttacttatt atttcgttct 1680 aaaatgaaaa tttattttta tattttaaaa taatttagct ttaaactttt aattttactt 1740 gttatatttt taataaaaaa gatttattgt caaataaatg ttgtggccat acaataagtt 1800 tcaaattatg tcacaaaaat tgaaacagag tgagcaaatt tttatttcaa gtaaactttc 1860 aaggaattgt gtttaagttt tctcaactga tactgatcaa tgatgtctta aatattttga 1920 tttcagcggt gtgattggtt atatgctctg ctctactgaa gggccagaag ttaacttcaa 1980 gaatccagta aatccaattg acaaagagac aactcaagtc aagtccaaat taggacctct 2040 caagttctac 
aactctgatg taacttcata tctcacaatt tctttttccg ttttactgta 2100 tgttcttcat caaattttat aactaactct tttcatattg tctttttttt tcagattcac 2160 aaagcagcat tcattttgcc atctttcgcc cgaagtatga tcgagtctta a 2211 <210> 2 <211> 3129 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3129) <223> PMT1a <400> 2 gaattcaatg gagaaggaaa atatttccag tgtaaacaca agtgaatgaa gagaagccaa 60 aataatctct atcattcaag ccttaggtgg agattaaaaa aattatttac tttcttatca 120 aagtaatagg tgatcaacag ctttcgtaaa acgtcattag gagaatatta taatctcttt 180 tatgctgaag aacccacata aggaagatca taaaatacat gactttcaga tgacttcttg 240 gagctttatt tttaaagagt ggctagctgg tcagcaaaga ggtgctcgtc agatatcata 300 aaattttact attatttgtt ttaagaggga gatggggcac acatgcttgt gacaaaagta 360 agaggaagaa aggagacaga agaggaaata gatttggggg gggggggggg ggtttcacaa 420 tcaaagaaaa tttttaaaat ggagagagaa atgagcacac acatatacta acaaaatttt 480 actaataatt gcaccgagac aaacttatat tttagttcca aaatgtcagt ctaaccctgc 540 acgttgtaat gaatttttaa ctattatatt atatcgagtt gcgccctcca ctcctcggtg 600 tccaaattgt atttaaatgc atagatgttt attgggagtg tacagcaagc tttcggaaaa 660 tacaaaccat aatactttct cttcttcaat ttgtttagtt taattttgaa aatggaagtc 720 atatctacca acacaaatgg ctctaccatc ttcaagaatg gtgccattcc catgaacggc 780 caccaaaatg gcacttctga acacctcaac ggctaccaga atggcacttc caaacaccaa 840 aacgggcacc agaatggcac tttcgaacat cggaacggcc accagaatgg gacatccgaa 900 caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaag ctccgactct 960 attaagcctg gctggttttc agagtttagc gcattatggc caggttagta ctaagaaagc 1020 aactcaaatg catcggcctc ttgttgctac taaatataga gagctatcat acttttaggg 1080 actaactaaa aaggaaagat tatcacaggg acgaagtgag cagttaactt cgcatattat 1140 cagacgcatt aatttgaaat aatcgaattt tgcaggtgaa gcattctcac ttaaggttga 1200 gaagttacta ttccagggga agtctgatta ccaagatgtc atgctctttg aggtaattaa 1260 tattctaata cacatgcttt aatttaaagt gatactttta atttactttt agtttattgc 1320 atgtgcacgt acagtcagca acttatggga aggttctgac tttggatgga gcaattcaac 1380 atacagagaa tggtggattt ccatacactg aaatgattgt tcatctacca cttggttcca 1440 tcccaaaccc 
aaaaaaggtt ttgatcatcg gcggaggaat tggttttaca ttattcgaaa 1500 tgcttcgtta tccttcaatc gaaaaaattg acattgttga gatcgatgac gtggtagttg 1560 atgtaagtca aacttctttt acccacataa agaaaatgat ttagattgca attcttttta 1620 tttttctaaa agaataaata tattctcttt ttttttttta aaacaaaatt ctctttctta 1680 caggtatcca gaaaattttt cccttatctg gcagctaatt ttaacgatcc tcgtgtaacc 1740 ctagttctcg gagatggtgc gtatatgata gtctcgtttt atattttatt tcacttgatt 1800 tttacctttt tttgtggtta attaatcatc taccattggt tctctttacc ttcaggagct 1860 gcatttgtaa aggctgcaca agcgggatat tatgatgcta ttatagtgga ctcttctgat 1920 cccattggta cgctattact atttaatacc aagactattc ttattaaata agctactaag 1980 aaactaattg aataattaat aaacgtaact gtaattgatt tctaaaataa tatatataat 2040 ttcaggtcca gcaaaagatt tgtttgagag gccattcttt gaggcagtag ccaaagccct 2100 taggccagga ggagttgtat gcacacaggc tgaaagcatt tggcttcata tgcatattat 2160 taagcaaatc attgctaact gtcgtcaagt ctttaagggt tctgtcaact atgcttggac 2220 aaccgttcca acatatccca cgtattcttt ttctctctct ctcttcctgt ctttttcgat 2280 gcaatgtaaa tttataaaat tggaagtccg ttttactttt ctatagacgt agatcctaaa 2340 attgtcaaga aatggagaat tgacttacaa gaaaaatcaa cttcttttca tttactattc 2400 tttttggtga caaactttac ttattatttc gttctaaaat gaaaatttat ttttatattt 2460 taaaataatt tagctttaaa cttttaattt tacttgttat atttttaata aaaaagattt 2520 atagtcaaat aaatgttgtg accatataaa aacctccgca tttttaagat cataagtttc 2580 agagtcaaac gagttaattt atttttagta tgccggtgcg gagtcaaatt atgtcataaa 2640 aattgaaacg gagtgagaac atttttattt cgagtaaact ttcaaggtat tgtgtttaat 2700 ttcaagtgat actgatcaat gatgtcttaa atattttgat ttcagcggtg tgatcggtta 2760 tatgctctgc tctactgaag ggccagaagt tgacttcaag aatccagtaa atccaattga 2820 caaagagaca actcaagtca agtccaaatt aggacctctc aagttctaca actctgatgt 2880 aacttcatat ctcacaattt ctttttccgt tttactgtat gttcttcgtc aaattttata 2940 actaactctt ttcatattgt cttttttttc agattcacaa agcagcattc attttaccat 3000 ctttcgccag aagtatgatc gagtcttaat caagtgaata atgaacactg gtagtacaat 3060 cattggacca agatcgagtc ttaatcaagt gaataaaataa gtgaaatgcg acgtattgta 3211. 
.(3091) <223> PMT2 <400> 3 ctgagttgac aagaacaatt cctggtgaat cagatggatg aagataatag aggtgggtgg 60 aatctataac caaagcagct ggttgagtga ctgtgcgagt att tgcagaaaca attgaaggga att tatagga tt caagaaaca attgaaggga att tatagga act 180 tcaaattaaa ataaggctta gcgttaaaat caaaggaaat ggcaagcctg gctcctggag 240 caatgcttct gaggacagta gtaaaaacaa tatcagacaa aaagtaaagt tgtattattt 300 agcttgagga taaagtatgt cattagtttt gtgagagatt tggtgtcctc tacaatgatt 360 gttgaagtcc ctatttatag ctatacacag gaaacaaaat cctaggatca agcccctctt 420 aaatgacaat aatggggtta atgatgaata tgtagcggca tgacatgaat gccaaaattc 480 tccgcaacga ctatttattt aatattgagg aatatttttt attaaatact atctggtgac 540 aagcattcgt ttgcttccgt tgattacgtt gattttggga tctactctat accaaccgaa 600 gccgttgtcc ttgatcttcg ctttcattta attcatcttc cgtctgcctc cgatttcaca 660 agtcatgcac ccattcaatt atttaatgga aaccaatttt accctataca aatggtacat 720 cattcgtcaa atactttact tggatataaa caattttgcc cgaggagtaa acagatgcga 780 agaaagaaag cagacgatta aagaaatttt taaaaaagga gagagaaatg aacacacaca 840 tgtactaata aaattagggt actactttac taataattgg acagagacta aattcatatt 900 ttagttccaa aatgtctcgg gcagtccaac catgcacgtt gtaatgattt tttaactcta 960 ttatatcgag ttgcgccctc cactcctcgg tgtccaaatt gtatataaat gcatatgtgt 1020 ctattgggag tgtacatcaa gctttcataa agtacaaatc gtaatacttg ttgaaacata 1080 atactttctc ttctccaatt tgtttagttt aattttgaaa atggaagtca tatctaccaa 1140 cacaaatggc tctaccatct tcaagagtgg tgccattccc atgaatggcc accataatgg 1200 cacttccaaa caccaaaacg gccacaagaa tgggacttcc gaacaacaga acgggacaat 1260 cagccttgat aatggcaacg agctactggg aaactccaat tgtattaagc ctggttggtt 1320 ttcagagttt agcgcattat ggccaggtta gtactgagaa agaaactcaa atgcatattt 1380 aaagttaaaa ttgttaggct aatataagga gttgatattc ttttagtgat taattaaaaa 1440 ggaaaaagta tcaaataaat tcaaaaaatg gatagtaact tcgcatatta ctctacacat 1500 taatttgaaa taaatcgaat tttgcaggtg aagcattctc acttaaggtt gagaagttac 1560 tgttccaggg gaagtctgac taccaagatg tcatgctctt tgaggtaaat aatattttaa 1620 tacacatgct tccatttaaa ttgatacttt taatttactt ttactttatt gcatgtgtac 1680 gtacagtcag 
caacttatgg gaaggttctg actttggatg gagcaattca acacacagag 1740 aatggtggat ttccatacac tgaaatgatt gttcatcttc cacttggttc catcccaaac 1800 ccaaaaaagg ttttgatcat cggcggagga attggtttta cattattcga aatgcttcgt 1860 tatcctacaa tcgaaaaaat tgacattgtt gagatcgatg acgtggtagt tgatgtaagt 1920 caaacttctt ttactcacat aaaaaaatgg tttagattgc ttcttgttat ttttctaaaa 1980 gaatactatt tttttaaaac aaaattttct tttttacagg tatctagaaa atttttccct 2040 tatctcgctg ctaattttaa cgatcctcgt gtaaccctag tccttggaga tggtgcgtat 2100 ttgataatct cgcttttgtt ttatctttta tttttattgc atttaatttt taccttttgg 2160 tgtgtggtta attcacctgc cattggttct ctttcatttc aggggctgca tttgtaaagg 2220 ctgcacaagc agaatattat gatgctatta tagtggactc ttctgatccc attggtactc 2280 tattacttct taataccaag actaatctta ttgaataagc tactaataaa cggtaattga 2340 tttctaaaac aatataattt caggtccagc aaaagatttg tttgagaggc cattctttga 2400 ggcagtagct aaagccctaa ggccaggagg agttgtatgc acacaggctg aaagcatttg 2460 gcttcatatg catattatta agcaaatcat tgctaactgt cgtcaagtct ttaagggctc 2520 tgtcaactat gcttggacta ctgttccaac atatccaacg tatttttctc tctctctctc 2580 ttcctataaa attggaagtt ttgattctat aattgtcaag aaatggagaa tcagttccaa 2640 gaaaaaccaa cttcttttct tttactcttc aaggtattgt gtttaatttt ttttcaactg 2700 atatgatcaa ttattttgat ttcagcggtg tgattggtta tatgctctgc tctactgaag 2760 gaccagaaat tgacttcaag aatccagtaa atccaattga caaagagaca gctcaagtca 2820 agtccaaatt agcacctctc aagttctaca actctgatgt aacttcatat ctcacaattt 2880 cttttttcct attgtacttt atgttcttcg tcaaatttta taattaactc ttttcaaatt 2940 gtcttttttt ttttcagatt cacaaagcag cattcatttt gccatctttc gccagaagta 3000 tgatcgagtc ttaatcaact gattaatgaa tactggtggt acaatcattg gaccaagatc 3060 aataagtgaa agacgtattg tatgagaatt c 3091 <210> 4 <211> 3269 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3269) <223> PMT3 <400> 4 gctgtacaaa aggatgtctc aaatcatttg gaatattaat tctgcaatca acaagaaata 60 ccccactatt aagacccatt atcactggca caaaaattat gagatcatta aacatcttaa 120 acctgtccct atttggaaga gtgtggtatg ggagatgcct cccagggagt acctaaagct 180 gaatactgat 
ggaagtttta acaaacaaat tgggaaagca gggattggag ggattctcag 240 agatgaagag ggaggctttg tcatggcttt ttcgatgcct ataatctata ataacatcag 300 tgaagcagaa ttgaaagcca tcaagtatgg gtgtgaatgg tgcaaataca aaggaatatc 360 aaacttcatt gtggaaactg actcgaggat gatctatgac atactacaga ccaaaaatct 420 aagcaacaac aagttgaaac aagagaccga gaaattaatg gagattctgg acacctgcag 480 gacacctgtt acccattgcc ttcgcgaagc aaatcaagtg gcagactggt ttgctaaaga 540 ggccaccaga gctaacgaag gtatcactca tacagatttt agacaggtat caaaagcggc 600 caagggccct ttcttcatgg atatgtggca ggtcccttat tttagaatta gatatgaaaa 660 atctaatttt tttttgtaag ttaattctgt gtatagtgag aggaaatcgt ctaatatgta 720 tttttgccca tagactcttc ctctccttag gtaaaaaggt agctccgagg taaggtttat 780 gttcccctca gtgtaacctt tttttgttta tataatagac atggtatggg tccagctaaa 840 cccccaacac cacaggggat agatacctgg gtgattggtt tattttttaa aaaaaaaaac 900 tttactaata attgcacgga gacaaaactt atattttagt tccaaaatga cagtccaacc 960 atgcacgttg taatgatttt ttaactctat tatatcgagt tccgccctcc actcctcggt 1020 gtccaaattg tatttaaatg catagatatg tttattggga gtgtacatca agctttcaga 1080 aaatacaaac cataatactt tctcttctcc aatttgctta gtttaatttg gaaaatggaa 1140 gtcatatcta ccaacacaaa tggctctact atcttcaaga atggtgccat tcccatgaac 1200 ggttaccaga atggcacttc caaacaccaa aacggccacc agaatggcac ttccgaacat 1260 cggaacggcc accagaatgg gatttccgaa caccaaaacg gccaccagaa tggcacttcc 1320 gagcatcaga acggccatca gaatgggaca atcagccatg acaacggcaa cgagctacag 1380 ctactgggaa gctccaactc tattaagcct ggttggtttt cagagtttag cgcattatgg 1440 ccaggttagt actaagaaag aaactcaaat gcatcgtact cttgtattct gctttgcgta 1500 taatttagat gatggtgttt gactaagcac tgagtttaaa aataaaaagt ttaaagttaa 1560 attgttacta tagagagcta tatctttagg aactaactaa aaaggaaaaa ttatcacata 1620 aaattgggat gaagtaagca gttaacttcg catattattc gacacattaa tttgaaataa 1680 atcgaatttt gcaggtgaag cattctc act taaggttgag aagttactat tccaggggaa 1740 gtctgattac caagatgtca tgctctttga ggtaattaat taatactaat agtcaagctc 1800 atgtatgatt atatttaaag tggtattttt cgtttatttt taatttattg cacgtgtacg 1860 tacagtcagc aacatatggg aaggttctga 
ctttggatgg agcaattcaa cacacagaga 1920 atggtggatt tccatacact gaaatgattg ttcatcttcc acttggttcc atcccaaacc 1980 ctaaaaaggt tttgatcatc ggcggaggaa ttggttttac attattcgaa atgcttcgtt 2040 atcctacaat cgaaaaaatt gacattgttg agatcgatga cgtggtagtt gatgtaagtc 2100 aaacttcttt tactcacata aaaaaatgat ttagattctt atttttctaa aagaattaaa 2160 acaaaatttt ccgttttaca ggtatctaga aaatttttcc cttatcttgc tgctaatttt 2220 agcgatcctc gtgtaaccct agtccttgga gatggtgcgt atttgataat ctcgttttta 2280 ttttatcttt tacttttatt ttatttaatt tttacctttt tgtgtgtggt taattcacct 2340 gccattggtt ctttttattt caggggctgc atttgtaaag gccgcacaag caggatatta 2400 tgatgctatt atagtggact cttctgatcc cattggtact ctattactac ttaataccaa 2460 gactattctt attaaataag ctactaataa acgtaactct gatagttttc taaaataata 2520 taatttcagg tccagcaaaa gacttgtttg ag aggccatt ctttgaggca gtagccaaag 2580 ccctaaggcc aggaggagtt gtatgcacac aggctgaaag catttggctt catatgcata 2640 ttattaagca aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactatgctt 2700 ggactactgt tccaacatat ccaacgtatt tttctctctc tcttcctata aaattggaag 2760 ttttgattct ataattgtca agaaatggag aatcagttcc aagaaaaacc aaattctttt 2820 cttttactct tcaaggtgtg tttaagtttt ttaaactgat actgatcaat tattttgatt 2880 tcagcggtgt gattggttat atgctctgtt ctactgaagg accagaagtt gacttcaaga 2940 atccagtaaa tccaattgac aaagagacaa ctcaagtcaa gtccaaatta gcacctctca 3000 agttctacaa ctctgatgta acttcatatc tcaatttctt ttttcttatt gtactttatg 3060 ttcttagtca aattttataa ttaactcttt tcaaattgtc tttttttttc agattcacaa 3120 agcagcattc attttgccat ctttcgccag aagtatgatc gagtcttaat caagtgacta 3180 atgaatactg gcggtacaat cattggacca agatcgagtc ttaatcaagt gaataaataa 3240 gtgaaatgcg acgtattgta taagaattc 3269 <210> 5 <211> 3001 <212> DNA <213> Nicotiana tabacum <220> <221> gene <222> (1)..(3001) <223> PMT4 <400> 5 gtcgacctct gattccacaa gtcatgcacc cattca atta tttaatggaa accaatttta 60 ccctgtacaa atggtacaaa tactttcctt ggataaaaac aattttgcct aaggagtaaa 120 cagatgcgaa gtaagaaagc agacgactaa agaaaatttt aaaaaaggag agagaaatga 180 gcacacacac gtactaataa aattagggta ctactttact 
aataattgga cagagactaa 240 attcatattt tagttccaaa atgtctcggg cagtccaacc atgcacgttg taatgagttt 300 ttaactctat tatctcgagt tgcgccctcc actcctctgt gtccaagttg tatataaatg 360 catatatgtc tattgggagt gtacagcgag ctttcataaa gtacaaatca taatacttgt 420 tgaaacataa tactttctct tctccaattt gtttagttta attttgaaaa tggaagtcat 480 atctaccaac acaaatggct cgaccatctt caagaatggt gccattccca tgaatggcca 540 ccagagtggc acttccaaac acctcaacgg ctaccagaac ggcacttcca aacaccaaaa 600 cggccaccat aatggcactt ccgaacatcg gaacggccac cagaatggga tttccgaaca 660 ccaaaacggc caccagaatg ggacttccga acatcggaac ggccaccaga atgggatttc 720 cgaacaccaa aacggccacc agaatgggac ttccgaacac caaaacggcc accagaatgg 780 gacttccgaa caacagaacg ggacaatcag ccatgacaat ggcaacgagc tactgggaaa 840 ctccaactct attaagcttg gttggttttc agagtttagc gcattatggc cagt tagta 900 ctgagaaaga aactcaaatt catatttaaa gttaaaattg ttaggctaat ataagaagtt 960 gattttcttt tagtgattaa ttaaaaaagg aaagagtatc aaataaattc caaaaaatga 1020 ccagtaactt cgcatattat tctacacatt aatttgaaat aaatcgaatt ttgcaggtga 1080 agcattctcc cttaaggttg agaagttact atttcagggg aagtctgact accaagatgt 1140 catgctcttt gaggtaaata atattctaat acacatgctt taatatgaat aaatactttt 1200 aatttacttt tagtttattg cacgtgtacg tacagtcagc aacatatggg aaggttttga 1260 ctttggatgg agcaattcaa cacacagaga atggtggatt tccatacact gaaatgattg 1320 ttcatcttcc acttggttcc atcccaaacc caaaaaaggt tttgatcatc ggcggaggaa 1380 ttggttttac attattcgaa atgcttcgtt atcctacaat cgaaaaaatt gacattgttg 1440 aaattaattga agtaagta ttactgatga cgta a ttagattgct tctttttatt tttctaaaag aataaatata ttctctctta gttttaaaca 1560 aaattctctt tcttacaggt atctagaaaa tctttccctt atctcgcagc taattttaat 1620 gatcctcgtg taaccctcgt tctcggagat ggtgcgtatt tataatctcg tttttgtttt 1680 atcttttatt tttatttcat ttaatttacc tttttgtgtg tggttaattt acccgtcatt 1740 ggttctcttt catttcaggg gctgcatttg taaaggctgc acaagcagga tattatgatg 1800 ctattatagt ggactcttct gatcccattg gtactctatt actacttaat accaagacta 1860 atcttattga ataagctact aataaactgt aattgatttc taaaataata taatttcagg 1920 tccagcaaaa gatttgtttg 
agaggccatt ctttgaggca gtagccaaag ccctaaggcc 1980 aggaggagtt gtatgcacac aggccgaaag catttggctt catatgcata ttattaagca 2040 aatcattgct aactgtcgtc aagtctttaa gggctctgtc aactacgctt ggactactgt 2100 tccaacatat cccacgtatt ttctctctct ctctcttcat ctttgaaaat tgaaaatcct 2160 gactactttc cttcctttga ttcctcggtt aaaggggcgt agatcataag attttcaaga 2220 aatagataat gacgtccaag aaaaactaac ttcttttcat ttactattct ttttggtgac 2280 aaactttatt tattatttcg ttctaaagag aaaatttatt tttatatttt aaaataattt 2340 tgttttaaac ttttattttt acttattata tctttaataa aaaaattata gtcaaataaa 2400 tattatggcc acactaaaca tccaagtttt tgaaaccata agttttagag ccaaatgagt 2460 taatttgttt ttggtatgcg ggtgcggagt caaattatgt cacaaaaatt gtaatggagt 2520 gagcaaattt ttatttcgag taaactttca aggtattgtg ttaaagtttt ttcaactgat 2580 actaatcaat tatgtctcaa ccattttgat ttcagtggtg taattgggta tatgctctgc 2640 tctactgaag ggccagaagt tgacttcaag aatccaataa atccaattga caaagagaca 2700 actcaagtca agtccaaatt agcacctctc aagttttaca attctgatgt aacttcatat 2760 ctaacaattt ctttttctgt tttactgtat cttcattgtc aaaattttat aattaactct 2820 tctcaaatta tctttttttt tagattcaca aagcagcatt cattttgcca tctttcgcca 2880 gaagtatgat cgagtcttaa tcaagtgaat aatgaacact ggtggtgcaa tcattggacc 2940 aagatcgagt cttaatcaag tgaataaata agtgaaatgc cgacgtattg tatgagaatt 3000 c 3001 <210> 6 <211> 30 <212> DNA <213> Artificial Sequence <220> <223> F_PMT1a <400> 6 gcatagatgt ttattgggag tgtacagcaa 30 <210> 7 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> R_PMT1a <400> 7 agttgctgac tgtacgtgca ca 22 <210> 8 <211> 29 <212> DNA <213> Artificial Sequence <220> <223> F_PMT2 <400> 8 catatgtgtc tattgggagt gtacatcaa 29 <210> 9 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> R_PMT2 <400> 9 aagttgctga ctgtacgtac acat 24 <210> 10 <211> 31 <212> DNA <213> Artificial Sequence <220> <223> F_PMT3 <400> 10 catagatatg tttattggga gtgtacatca a 31 <210> 11 <211> 32 <212> DNA <213> Artificial Sequence <220> <223> R_PMT3 <400> 11 ccactttaaa tataatcata catgagcttg ac 32 <210> 12 <211> 25 <212> DNA <213>
Artificial Sequence <220> <223> F_PMT4 <400> 12 gcgagctttc ataaagtaca aatca 25 <210> 13 <211> 33 <212> DNA <213> Artificial Sequence <220> <223> R_PMT4 <400> 13 agtatttatt catattaaag catgtgtatt aga 33 <210> 14 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg1 <400> 14 aagtcatatc taccaacaca aatgg 25 <210> 15 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg2 <400> 15 ttttcagagt ttagcgcatt atgg 24 <210> 16 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> sgRNA_sg3 <400> 16 ctcaaagagc atgacatctt gg 22 <210> 17 <211> 17283 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE <400> 17 tgagcgtcgc aaagtgcgctc tggctcttt caccagctc ggtgtcttgcatgtg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgattttg tagccctggc cgacggccag caggtaggcc gacaggctca tgccggccgc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200
aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaa agggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctgatcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataa acgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag 
gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat g gaagccggt cttgtcgatc 3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggatctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataa tta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc 
taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt taatgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg ga gcacgaca 4980 cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggccat cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttct ga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg 
gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acctgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcct gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctgg 7320 aacttcgagg aggtggtgga caagggcgcc agcgcccaga gcttcatcga gcgcatgacc 7380 aacttcgaca agaacctgcc caacgagaag gtgctgccca agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc
tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgcgagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc gccccctgat cgagaccaac ggcgagaccg gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcag accgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga 
gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggtgct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg accccgggac tagtctcgag 10500 cacgtggaat tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 10560 ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 10620 ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgcccgct cctttcgctt 10680 tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc 10740 tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg 10800 gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg 10860 agtccacgtt ct ttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct 10920 cgggctattc ttttgattta taagggattt tgccgatttc ggaaccacca tcaaacagga 10980 ttttcgcctg ctggggcaaa ccagcgtgga ccgcttgctg caactctctc agggccaggc 11040 ggtgaagggc aatcagctgt tgcccgtctc actggtgaaa agaaaaacca ccccagtaca 11100 ttaaaaacgt ccgcaatgtg ttattaagtt gtctaagcgt caatttgttt acaccacaat 11160 atatcctgcc accagccagc caacagctcc ccgaccggca gctcggcaca aaatcaccac 11220 tcgatacagg cagcccatca gtccgggacg gcgtcagcgg gagagccgtt gtaaggcggc 11280 agactttgct 
catgttaccg atgctattcg gaagaacggc aactaagctg ccgggtttga 11340 aacacggatg atctcgcgga gggtagcatg ttgattgtaa cgatgacaga gcgttgctgc 11400 ctgtgatcaa atatcatctc cctcgcagag atccgaatta tcagccttct tattcatttc 11460 tcgcttaacc gtgacaggct gtcgatcttg agaactatgc cgacataata ggaaatcgct 11520 ggataaagcc gctgaggaag ctgagtggcg ctatttcttt agaagtgaac gttgacgata 11580 tcaactcccc tatccattgc tcaccgaatg gtacaggtcg gggacccgaa gttccgactg 11640 tcggcctgat gcatccccgg ctgatcgacc ccagatctgg ggctgagaaa gcccagtaag 11700 gaaacaactg taggttcgag tcgcgagatc ccccggaacc aaaggaagta ggttaaaccc 11760 gctccgatca ggccgagcca cgccaggccg agaacattgg ttcctgtagg catcgggatt 11820 ggcggatcaa acactaaagc tactggaacg agcagaagtc ctccggccgc cagttgccag 11880 gcggtaaagg tgagcagagg cacgggaggt tgccacttgc gggtcagcac ggttccgaa c 11940 gccatggaaa ccgcccccgc caggcccgct gcgacgccga caggatctag cgctgcgttt 12000 ggtgtcaaca ccaacagcgc cacgcccgca gttccgcaaa tagcccccag gaccgccatc 12060 aatcgtatcg ggctacctag cagagcggca gagatgaaca cgaccatcag cggctgcaca 12120 gcgcctaccg tcgccgcgac cccgcccggc aggcggtaga ccgaaataaa caacaagctc 12180 cagaatagcg aaatattaag tgcgccgagg atgaagatgc gcatccacca gattcccgtt 12240 ggaatctgtc ggacgatcat cacgagcaat aaacccgccg gcaacgcccg cagcagcata 12300 ccggcgaccc ctcggcctcg ctgttcgggc tccacgaaaa cgccggacag atgcgccttg 12360 tgagcgtcct tggggccgtc ctcctgtttg aagaccgaca gcccaatgat ctcgccgtcg 12420 atgtaggcgc cgaatgccac ggcatctcgc aaccgttcag cgaacgcctc catgggcttt 12480 ttctcctcgt gctcgtaaac ggacccgaac atctctggag ctttcttcag ggccgacaat 12540 cggatctcgc ggaaatcctg cacgtcggcc gctccaagcc gtcgaatctg agccttaatc 12600 acaattgtca attttaatcc tctgtttatc ggcagttcgt agagcgcgcc gtgcgtcccg 12660 agcgatactg agcgaagcaa gtgcgtcgag cagtgcccgc ttgttcctga aatgccagta 12720 aagcgctggc tgctgaaccc ccagccggaa ctgaccccac aaggccctag c gtttgcaat 12780 gcaccaggtc atcattgacc caggcgtgtt ccaccaggcc gctgcctcgc aactcttcgc 12840 aggcttcgcc gacctgctcg cgccacttct tcacgcgggt ggaatccgat ccgcacatga 12900 ggcggaaggt ttccagcttg agcgggtacg gctcccggtg cgagctgaaa 
tagtcgaaca 12960 tccgtcgggc cgtcggcgac agcttgcggt acttctccca tatgaatttc gtgtagtggt 13020 cgccagcaaa cagcacgacg atttcctcgt cgatcaggac ctggcaacgg gacgttttct 13080 tgccacggtc caggacgcgg aagcggtgca gcagcgacac cgattccagg tgcccaacgc 13140 ggtcggacgt gaagcccatc gccgtcgcct gtaggcgcga caggcattcc tcggccttcg 13200 tgtaataccg gccattgatc gaccagccca ggtcctggca aagctcgtag aacgtgaagg 13260 tgatcggctc gccgataggg gtgcgcttcg cgtactccaa cacctgctgc cacaccagtt 13320 cgtcatcgtc ggcccgcagc tcgacgccgg tgtaggtgat cttcacgtcc ttgttgacgt 13380 ggaaaatgac cttgttttgc agcgcctcgc gcgggatttt cttgttgcgc gtggtgaaca 13440 gggcagagcg ggccgtgtcg tttggcatcg ctcgcatcgt gtccggccac ggcgcaatat 13500 cgaacaagga aagctgcatt tccttgatct gctgcttcgt gtgtttcagc aacgcggcct 13560 gcttggcctc gctgacctgt tttgccaggt cctcgccggc ggtt tttcgc ttcttggtcg 13620 tcatagttcc tcgcgtgtcg atggtcatcg acttcgccaa acctgccgcc tcctgttcga 13680 gacgacgcga acgctccacg gcggccgatg gcgcgggcag ggcaggggga gccagttgca 13740 cgctgtcgcg ctcgatcttg gccgtagctt gctggaccat cgagccgacg gactggaagg 13800 tttcgcgggg cgcacgcatg acggtgcggc ttgcgatggt ttcggcatcc tcggcggaaa 13860 accccgcgtc gatcagttct tgcctgtatg ccttccggtc aaacgtccga ttcattcacc 13920 ctccttgcgg gattgccccg actcacgccg gggcaatgtg cccttattcc tgatttgacc 13980 cgcctggtgc cttggtgtcc agataatcca ccttatcggc aatgaagtcg gtcccgtaga 14040 ccgtctggcc gtccttctcg tacttggtat tccgaatctt gccctgcacg aataccagcg 14100 accccttgcc caaatacttg ccgtgggcct cggcctgaga gccaaaacac ttgatgcgga 14160 agaagtcggt gcgctcctgc ttgtcgccgg catcgttgcg ccacatctag gtactaaaac 14220 aattcatcca gtaaaatata atattttatt ttctcccaat caggcttgat ccccagtaag 14280 tcaaaaaata gctcgacata ctgttcttcc ccgatatcct ccctgatcga ccggacgcag 14340 aaggcaatgt cataccactt gtccgccctg ccgcttctcc caagatcaat aaagccactt 14400 actttgccat ctttcacaaa gatgttgctg tctccca ggt cgccgtggga aaagacaagt 14460 tcctcttcgg gcttttccgt ctttaaaaaa tcatacagct cgcgcggatc tttaaatgga 14520 gtgtcttctt cccagttttc gcaatccaca tcggccagat cgttattcag taagtaatcc 14580 aattcggcta agcggctgtc taagctattc 
gtatagggac aatccgatat gtcgatggag 14640 tgaaagagcc tgatgcactc cgcatacagc tcgataatct tttcagggct ttgttcatct 14700 tcatactctt ccgagcaaag gacgccatcg gcctcactca tgagcagatt gctccagcca 14760 tcatgccgtt caaagtgcag gacctttgga acaggcagct ttccttccag ccatagcatc 14820 atgtcctttt cccgttccac atcataggtg gtccctttat accggctgtc cgtcattttt 14880 aaatataggt tttcattttc tcccaccagc ttatatacct tagcaggaga cattccttcc 14940 gtatctttta cgcagcggta tttttcgatc agttttttca attccggtga tattctcatt 15000 ttagccattt attatttcct tcctcttttc tacagtattt aaagataccc caagaagcta 15060 attataacaa gacgaactcc aattcactgt tccttgcatt ctaaaacctt aaataccaga 15120 aaacagcttt ttcaaagttg ttttcaaagt tggcgtataa catagtatcg acggagccga 15180 ttttgaaacc acaattatgg gtgatgctgc caacttactg atttagtgta tgatggtgtt 15240 tttgaggtgc tccagtggct tctgtgtcta tcagctgtcc ctcctgttca gctactgacg 15300 gggtggtgcg taacggcaaa agcaccgccg gacatcagcg ctatctctgc tctcactgcc 15360 gtaaaacatg gcaactgcag ttcacttaca ccgcttctca acccggtacg caccagaaaa 15420 tcattgatat ggccatgaat ggcgttggat gccgggcaac agcccgcatt atgggcgttg 15480 gcctcaacac gattttacgt cacttaaaaa actcaggccg cagtcggtaa cctcgcgcat 15540 acagccgggc agtgacgtca tcgtctgcgc ggaaatggac gaacagtggg gctatgtcgg 15600 ggctaaatcg cgccagcgct ggctgtttta cgcgtatgac agtctccgga agacggttgt 15660 tgcgcacgta ttcggtgaac gcactatggc gacgctgggg cgtcttatga gcctgctgtc 15720 accctttgac gtggtgatat ggatgacgga tggctggccg ctgtatgaat cccgcctgaa 15780 gggaaagctg cacgtaatca gcaagcgata tacgcagcga attgagcggc ataacctgaa 15840 tctgaggcag cacctggcac ggctgggacg gaagtcgctg tcgttctcaa aatcggtgga 15900 gctgcatgac aaagtcatcg ggcattatct gaacataaaa cactatcaat aagttggagt 15960 cattacccaa ttatgataga atttacaagc tataaggtta ttgtcctggg tttcaagcat 16020 tagtccatgc aagtttttat gctttgccca ttctatagat atattgataa gcgcgctgcc 16080 tatgccttgc cccctgaaat cc ttacatac ggcgatatct tctatataaa agatatatta 16140 tcttatcagt attgtcaata tattcaaggc aatctgcctc ctcatcctct tcatcctctt 16200 cgtcttggta gctttttaaa tatggcgctt catagagtaa ttctgtaaag gtccaattct 16260 cgttttcata 
cctcggtata atcttaccta tcacctcaaa tggttcgctg ggtttatcgc 16320 acccccgaac acgagcacgg cacccgcgac cactatgcca agaatgccca aggtaaaaat 16380 tgccggcccc gccatgaagt ccgtgaatgc cccgacggcc gaagtgaagg gcaggccgcc 16440 acccaggccg ccgccctcac tgcccggcac ctggtcgctg aatgtcgatg ccagcacctg 16500 cggcacgtca atgcttccgg gcgtcgcgct cgggctgatc gcccatcccg ttactgcccc 16560 gatcccggca atggcaagga ctgccagcgc tgccattttt ggggtgaggc cgttcgcggc 16620 cgaggggcgc agcccctggg gggatgggag gcccgcgtta gcgggccggg agggttcgag 16680 aagggggggc accccccttc ggcgtgcgcg gtcacgcgca cagggcgcag ccctggttaa 16740 aaacaaggtt tataaatatt ggtttaaaag caggttaaaa gacaggttag cggtggccga 16800 aaaacgggcg gaaacccttg caaatgctgg attttctgcc tgtggacagc ccctcaaatg 16860 tcaataggtg cgcccctcat ctgtcagcac tctgcccctc aagtgtcaag gatcgcgccc 16920 ctcatctgtc agtag tcgcg cccctcaagt gtcaataccg cagggcactt atccccaggc 16980 ttgtccacat catctgtggg aaactcgcgt aaaatcaggc gttttcgccg atttgcgagg 17040 ctggccagct ccacgtcgcc ggccgaaatc gagcctgccc ctcatctgtc aacgccgcgc 17100 cgggtgagtc ggcccctcaa gtgtcaacgt ccgcccctca tctgtcagtg agggccaagt 17160 tttccgcgag gtatccacaa cgccggcggc cgcggtgtct cgcacacggc ttcgacggcg 17220 tttctggcgc gtttgcaggg ccatagacgg ccgccagccc agcggcgagg gcaaccagcc 17280 cgg 17283 <210> 18 <211> 18700 <212> DNA <213> Artificial Sequence <220> <223> V1k_GE_PMT <400> 18 tgagcgtcgc aaaggcgctc ggtcttgcct tgctcgtcgg tgatgtactt caccagctcc 60 gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct tggcaatcac gcgcaccccc 120 cggccgtttt agcggctaaa aaagtcatgg ctctgccctc gggcggacca cgcccatcat 180 gaccttgcca agctcgtcct gcttctcttc gatcttcgcc agcagggcga ggatcgtggc 240 atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag agtttcagca ggccgcccag 300 gcggcccagg tcgccattga tgcgggccag ctcgcggacg tgctcatagt ccacgacgcc 360 cgtgatttttgacaggcctcag tgccggcc 420 cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg cagtacacct tgataggtgg 480 gctgcccttc ctggttggct tggtttcatc agccatccgc ttgccctcat ctgttacgcc 540 ggcggtagcc ggccagcctc gcagagcagg attcccgttg agcaccgcca ggtgcgaata 600 agggacagtg aagaaggaac 
acccgctcgc gggtgggcct acttcaccta tcctgcccgg 660 ctgacgccgt tggatacacc aaggaaagtc tacacgaacc ctttggcaaa atcctgtata 720 tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata atgaccccga agcagggtta 780 tgcagcggaa aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 840 gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 900 atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 960 gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 1020 gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta 1080 ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt 1140 cagtgagcga ggaagcggaa gagcgccaga aggccgccag agaggccgag cgcggccgtg 1200 aggcttggac gctagggcag ggcatgaaaa agcccgtagc gggctgctac gggcgtctga 1260 cgcggtggaaagggggaggg gatgttgtct acatggctct gctgtagtga gtgggttgcg 1320 ctccggcagc ggtcctgatc aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac 1380 gagcctcctt ttcgccaatc catcgacaat caccgcgagt ccctgctcga acgctgcgtc 1440 cggaccggct tcgtcgaagg cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg 1500 ttcaacggtg ccgccgcgct cgccggcatc gctgtcgccg gcctgctcct caagcacggc 1560 cccaacagtg aagtagctga ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc 1620 cgcctcgcag aggaagcgaa gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg 1680 cgtgccggca tggatgcgcg cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg 1740 ggcattcccg atcagaaatg agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg 1800 attctccgcc agcatggctt cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg 1860 ccagtaaagc gccggctgct gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc 1920 gtctacgccg acctcgttca acaggtccag ggcggcacgg atcactgtat tcggctgcaa 1980 ctttgtcatg cttgacactt tatcactgat aaacataata tgtccaccaa cttatcagtg 2040 ataaagaatc cgcgcgttca atcggaccag cggaggctgg tccggaggcc agacgtgaaa 2100 cccaacatac ccctga tcgt aattctgagc actgtcgcgc tcgacgctgt cggcatcggc 2160 ctgattatgc cggtgctgcc gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc 2220 gcccactatg gcattctgct ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg 2280 ctgggcgcgc tgtcggatcg tttcgggcgg 
cggccaatct tgctcgtctc gctggccggc 2340 gccagatctg gggaaccctg tggttggcat gcacatacaa atggacgaac ggataaacct 2400 tttcacgccc ttttaaatat ccgattattc taataaacgc tcttttctct taggtttacc 2460 cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac gacaatctga 2520 tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc gggacaagcc 2580 gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc cgcgggtttc 2640 tggagtttaa tgagctaagc acatacgtca gaaaccatta ttgcgcgttc aaaagtcgcc 2700 taaggtcact atcagctagc aaatatttct tgtcaaaaat gctccactga cgttccataa 2760 attcccctcg gtatccaatt agagtctcat attcactctc aatccaaata atctgcaccg 2820 gatctggatc gtttcgcatg attgaacaag atggattgca cgcaggttct ccggccgctt 2880 gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc tctgatgccg 2940 ccgtgttccg gctgtcagcg c aggggcgcc cggttctttt tgtcaagacc gacctgtccg 3000 gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc acgacgggcg 3060 ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg ctgctattgg 3120 gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag aaagtatcca 3180 tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc ccattcgacc 3240 accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt cttgtcgatc 3300 aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc gccaggctca 3360 aggcgcgcat gcccgacggc gatgatctcg tcgtgaccca tggcgatgcc tgcttgccga 3420 atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg ctgggtgtgg 3480 cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag cttggcggcg 3540 aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg cagcgcatcg 3600 ccttctatcg ccttcttgac gagttcttct gagcgggact ctggggttcg aaatgaccga 3660 ccaagcgacg cccaacctgc catcacgaga tttcgattcc accgccgcct tctatgaaag 3720 gttgggcttc ggaatcgttt tccgggacgc cggctggatg atcctccagc gcggggatct 3780 catgctggag ttcttcgccc acgggat ctc tgcggaacag gcggtcgaag gtgccgatat 3840 cattacgaca gcaacggccg acaagcacaa cgccacgatc ctgagcgaca atatgatcgg 3900 gcccggcgtc cacatcaacg gcgtcggcgg cgactgccca ggcaagaccg agatgcaccg 3960 cgatatcttg ctgcgttcgg atattttcgt 
ggagttcccg ccacagaccc ggatgatccc 4020 cgatcgttca aacatttggc aataaagttt cttaagattg aatcctgttg ccggtcttgc 4080 gatgattatc atataatttc tgttgaatta cgttaagcat gtaataatta acatgtaatg 4140 catgacgtta tttatgagat gggtttttat gattagagtc ccgcaattat acatttaata 4200 cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc 4260 tatgttacta gatcgggcct cctgtcaatg ctggcggcgg ctctggtggt ggttctggtg 4320 gcggctctga gggtggtggc tctgagggtg gcggttctga gggtggcggc tctgagggag 4380 gcggttccgg tggtggctct ggttccggtg attttgatta tgaaaagatg gcaaacgcta 4440 ataagggggc tatgaccgaa aatgccgatg aaaacgcgct acagtctgac gctaaaggca 4500 aacttgattc tgtcgctact gattacggtg ctgctatcga tggtttcatt ggtgacgttt 4560 ccggccttgc taatggtaat ggtgctactg gtgattttgc tggctctaat tcccaaatgg 4620 ctcaagtcgg tgacggtgat aattcacctt ta atgaataa tttccgtcaa tatttacctt 4680 ccctccctca atcggttgaa tgtcgccctt ttgtctttgg cccaatacgc aaaccgcctc 4740 tccccgcgcg ttggccgatt cattaatgca gctggcacga caggtttccc gactggaaag 4800 cgggcagtga gcgcaacgca attaatgtga gttagctcac tcattaggca ccccaggctt 4860 tacactttat gcttccggct cgtatgttgt gtggaattgt gagcggataa caatttcaca 4920 caggaaacag ctatgaccat gattacgcca agcttcacgt gaacatggtg gagcacgaca 4980 cacttgtcta ctccaaaaat atcaaagata cagtctcaga agaccaaagg gcaattgaga 5040 cttttcaaca aagggtaata tccggaaacc tcctcggatt ccattgccca gctatctgtc 5100 actttattgt gaagatagtg gaaaaggaag gtggctccta caaatgccat cattgcgata 5160 aaggaaaggc catcgttgaa gatgcctctg ccgacagtgg tcccaaagat ggacccccac 5220 ccacgaggag catcgtggaa aaagaagacg ttccaaccac gtcttcaaag caagtggatt 5280 gatgtgataa catggtggag cacgacacac ttgtctactc caaaaatatc aaagatacag 5340 tctcagaaga ccaaagggca attgagactt ttcaacaaag ggtaatatcc ggaaacctcc 5400 tcggattcca ttgcccagct atctgtcact ttattgtgaa gatagtggaa aaggaaggtg 5460 gctcctacaa atgccatcat tgcgataaag gaaaggcc at cgttgaagat gcctctgccg 5520 acagtggtcc caaagatgga cccccaccca cgaggagcat cgtggaaaaa gaagacgttc 5580 caaccacgtc ttcaaagcaa gtggattgat gtgatatctc cactgacgta agggatgacg 5640 cacaatccca ctatccttcg caagaccctt 
cctctatata aggaagttca tttcatttgg 5700 agaggacctc gaaattctca acacaacata tacaaaacaa acgaatctca agcaatcaag 5760 cattctactt ctattgcagc aatttaaatc atttctttta aagcaaaagc aattttctga 5820 aaattttcac catttacgaa cgatagggat ccgagaccca agcttgccac catgccgaag 5880 aagaagcgca aggtggacaa gaagtacagc atcggcctgg acatcggtac caacagcgtg 5940 ggctgggccg tgatcaccga cgagtacaag gtgcccagca agaagttcaa ggtgctgggc 6000 aacaccgacc gccacagcat caagaagaac ctgatcggcg ccctgctgtt cgacagcggc 6060 gagaccgccg aggccacccg cctgaagcgc accgcccgcc gccgctacac ccgccgcaag 6120 aaccgcatct gctacctgca ggagatcttc agcaacgaga tggccaaggt ggacgacagc 6180 ttcttccacc gcctggagga gagcttcctg gtggaggagg acaagaagca cgagcgccac 6240 cccatcttcg gcaacatcgt ggacgaggtg gcctaccacg agaagtaccc caccatctac 6300 cacctgcgca agaagctggt ggacagcacc gacaaggccg acc tgcgcct gatctacctg 6360 gccctggccc acatgatcaa gttccgcggc cacttcctga tcgagggcga cctgaacccc 6420 gacaacagcg acgtggacaa gctgttcatc cagctggtgc agacctacaa ccagctgttc 6480 gaggagaacc ccatcaacgc cagcggcgtg gacgccaagg ccatcctgag cgcccgcctg 6540 agcaagagcc gccgcctgga gaacctgatc gcccagctgc ccggcgagaa gaagaacggc 6600 ctgttcggca acctgatcgc cctgagcctg ggcctgaccc ccaacttcaa gagcaacttc 6660 gacctggccg aggacgccaa gctgcagctg agcaaggaca cctacgacga cgacctggac 6720 aacctgctgg cccagatcgg cgaccagtac gccgacctgt tcctggccgc caagaacctg 6780 agcgacgcca tcctgctgag cgacatcctg cgcgtgaaca ccgagatcac caaggccccc 6840 ctgagcgcca gcatgatcaa gcgctacgac gagcaccacc aggacctgac cctgctgaag 6900 gccctggtgc gccagcagct gcccgagaag tacaaggaga tcttcttcga ccagagcaag 6960 aacggctacg ccggctacat cgacggcggc gccagccagg aggagttcta caagttcatc 7020 aagcccatcc tggagaagat ggacggcacc gaggagctgc tggtgaagct gaaccgcgag 7080 gacctgctgc gcaagcagcg caccttcgac aacggcagca tcccccacca gatccacctg 7140 ggcgagctgc acgccatcct gcgccgccag gaggacttct accccttcc t gaaggacaac 7200 cgcgagaaga tcgagaagat cctgaccttc cgcatcccct actacgtggg ccccctggcc 7260 cgcggcaaca gccgcttcgc ctggatgacc cgcaagagcg aggagaccat caccccctaggcat aacttcgaca agaacctgcc caacgagaag gtgctgccca 
agcacagcct gctgtacgag 7440 tacttcaccg tgtacaacga gctgaccaag gtgaagtacg tgaccgaggg catgcgcaag 7500 cccgccttcc tgagcggcga gcagaagaag gccatcgtgg acctgctgtt caagaccaac 7560 cgcaaggtga ccgtgaagca gctgaaggag gactacttca agaagatcga gtgcttcgac 7620 agcgtggaga tcagcggcgt ggaggaccgc ttcaacgcca gcctgggcac ctaccacgac 7680 ctgctgaaga tcatcaagga caaggacttc ctggacaacg aggagaacga ggacatcctg 7740 gaggacatcg tgctgaccct gaccctgttc gaggaccgcg agatgatcga ggagcgcctg 7800 aagacctacg cccacctgtt cgacgacaag gtgatgaagc agctgaagcg ccgccgctac 7860 accggctggg gccgcctgag ccgcaagctt atcaacggca tccgcgacaa gcagagcggc 7920 aagaccatcc tggacttcct gaagagcgac ggcttcgcca accgcaactt catgcagctg 7980 atccacgacg acagcctgac cttcaaggag gacatccaga aggcccaggt gagcggccag 8040 ggcgacagcc tgcacgagca catcgccaac ctggccggca gccccgccat caagaagggc 8100 atcctgcaga ccgtgaaggt ggtggacgag ctggtgaagg tgatgggccg ccacaagccc 8160 gagaacatcg tgatcgagat ggcccgcgag aaccagacca cccagaaggg ccagaagaac 8220 agccgc gagc gcatgaagcg catcgaggag ggcatcaagg agctgggcag ccagatcctg 8280 aaggagcacc ccgtggagaa cacccagctg cagaacgaga agctgtacct gtactacctg 8340 cagaacggcc gcgacatgta cgtggaccag gagctggaca tcaaccgcct gagcgactac 8400 gacgtggacc acatcgtgcc ccagagcttc ctgaaggacg acagcatcga caacaaggtg 8460 ctgacccgca gcgacaagaa ccgcggcaag agcgacaacg tgcccagcga ggaggtggtg 8520 aagaagatga agaactactg gcgccagctg ctgaacgcca agctgatcac ccagcgcaag 8580 ttcgacaacc tgaccaaggc cgagcgcggc ggcctgagcg agctggacaa ggccggcttc 8640 atcaagcgcc agctggtgga gacccgccag atcaccaagc acgtggccca gatcctggac 8700 agccgcatga acaccaagta cgacgagaac gacaagctga tccgcgaggt gaaggtgatc 8760 accctgaaga gcaagctggt gagcgacttc cgcaaggact tccagttcta caaggtgcgc 8820 gagatcaaca actaccacca cgcccacgac gcctacctga acgccgtggt gggcaccgcc 8880 ctgatcaaga agtaccccaa gctggagagc gagttcgtgt acggcgacta caaggtgtac 8940 gacgtgcgca agatgatcgc caagagcgag caggagatcg gcaaggccac cgccaagtac 9000 ttcttctaca gcaacatcat gaacttcttc aagaccgaga tcaccctggc caacggcgag 9060 atccgcaagc g ccccctgat cgagaccaac ggcgagaccg 
gcgagatcgt gtgggacaag 9120 ggccgcgact tcgccaccgt gcgcaaggtg ctgagcatgc cccaggtgaa catcgtgaag 9180 aagaccgagg tgcagaccgg cggcttcagc aaggagagca tcctgcccaa gcgcaacagc 9240 gacaagctga tcgcccgcaa gaaggactgg gaccccaaga agtacggcgg cttcgacagc 9300 cccaccgtgg cctacagcgt gctggtggtg gccaaggtgg agaagggcaa gagcaagaag 9360 ctgaagagcg tgaaggagct gctgggcatc accatcatgg agcgcagcag cttcgagaag 9420 aaccccatcg acttcctgga ggccaagggc tacaaggagg tgaagaagga cctgatcatc 9480 aagctgccca agtacagcct gttcgagctg gagaacggcc gcaagcgcat gctggccagc 9540 gccggcgagc tgcagaaggg caacgagctg gccctgccca gcaagtacgt gaacttcctg 9600 tacctggcca gccactacga gaagctgaag ggcagccccg aggacaacga gcagaagcag 9660 ctgttcgtgg agcagcacaa gcactacctg gacgagatca tcgagcagat cagcgagttc 9720 agcaagcgcg tgatcctggc cgacgccaac ctggacaagg tgctgagcgc ctacaacaag 9780 caccgcgaca agcccatccg cgagcaggcc gagaacatca tccacctgtt caccctgacc 9840 aacctgggcg cccccgccgc cttcaagtac ttcgacacca ccatcgaccg caagcgctac 9900 accagcacca aggaggt gct ggacgccacc ctgatccacc agagcatcac cggtctgtac 9960 gagacccgca tcgacctgag ccagctgggc ggcgacggcg gctccggaaa gagaccagct 10020 gctaccaaga aggctggaca ggctaagaag aagaagtaag agctcgtccg caaaaatcac 10080 cagtctctct ctacaaatct atctctctct atttttctcc agaataatgt gtgagtagtt 10140 cccagataag ggaattaggg ttcttatagg gtttcgctca tgtgttgagc atataagaaa 10200 cccttagtat gtatttgtat ttgtaaaata cttctatcaa taaaatttct aattcctaaa 10260 accaaaatcc agtgactata attagccttc ctcataagtt gcactacttc agcgtcccaa 10320 ttgcaccctt accacgaaga caggtttgtc caatcccata ttgcgacctt ggcagggggt 10380 tcgcaagtcc cacccgaaac gttgctgaag gctcaggttt ctgagcgaca aatgctttaa 10440 acgcgagttc ccgctcataa cctggaccga atgcgggtcg acagaaatct caaaattccg 10500 gcagaacaat tttgaatctc gatccgtaga aacgagacgg tcattgtttt agttccacca 10560 cgattatatt tgaaatttac gtgagtgtga gtgagacttg cataagaaaa taaaatcttt 10620 agttgggaaa aaattcaata atataaatgg gcttgagaag gaagcgaggg ataggccttt 10680 ttctaaaata ggcccattta agctattaac aatcttcaaa agtaccacag cgcttaggta 10740 aagaaagcag ctgagtttat atatggttag 
agacgaagta gtgattgaag tcatatctac 10800 caacacaaag ttttagagct agaaatagca agttaaaata aggctagtcc gttatcaact 10860 tgaaaaagtg gcaccgagtc ggtgcttttt tcaaatgaca attaaccacc gtgtattcgt 10920 tataacatca ggcagtttaa gtcgggacaa taggagccgc aatacacagt ttaccgcatc 10980 ttgacctaac tagaaatctc aaaattccgg cagaacaatt ttgaatctcg atccgtagaa 11040 acgagacggt cattgtttta gttccaccac gattatattt gaaatttacg tgagtgtgag 11100 tgagacttgc ataagaaaat aaaatcttta gttgggaaaa aattcaataa tataaatggg 11160 cttgagaagg aagcgaggga taggcctttt tctaaaatag gcccatttaa gctattaaca 11220 atcttcaaaa gtaccacagc gcttaggtaa agaaagcagc tgagtttata tatggttaga 11280 gacgaagtag tgattgtttt cagagtttag cgcattagtt ttagagctag aaatagcaag 11340 ttaaaataag gctagtccgt tatcaacttg aaaaagtggc accgagtcgg tgcttttttc 11400 ctgcgcttcg atcttagatc aaggttgcct ggagtgtctc ttgaactgta cagagttgag 11460 ctagtcgcga agcagatcag caacacattt ctgcttgcga gaaatctcaa aattccggca 11520 gaacaatttt gaatctcgat ccgtagaaac gagacggtca ttgttttagt tccaccacga 11580 tta tatttga aatttacgtg agtgtgagtg agacttgcat aagaaaataa aatctttagt 11640 tgggaaaaaa ttcaataata taaatgggct tgagaaggaa gcgagggata ggcctttttc 11700 taaaataggc ccatttaagc tattaacaat cttcaaaagt accacagcgc ttaggtaaag 11760 aaagcagctg agtttatata tggttagaga cgaagtagtg attgctcaaa gagcatgaca 11820 tctgttttag agctagaaat agcaagttaa aataaggcta gtccgttatc aacttgaaaa 11880 agtggcaccg agtcggtgct tttttactag tctcgagcac gtggaattca ctggccgtcg 11940 ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca acttaatcgc cttgcagcac 12000 atcccccttt cgccagctgg cgtaatagcg aagaggcccg caccgatcgc ccttcccaac 12060 agttgcgcag cctgaatggc gcccgctcct ttcgctttct tcccttcctt tctcgccacg 12120 ttcgccggct ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt 12180 gctttacggc acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtgggcca 12240 tcgccctgat agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga 12300 ctcttgttcc aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa 12360 gggattttgc cgatttcgga accaccatca aacaggattt tcgcctgctg gggcaaacca 1 2420 gcgtggaccg 
cttgctgcaa ctctctcagg gccaggcggt gaagggcaat cagctgttgc 12480 ccgtctcact ggtgaaaaga aaaaccaccc cagtacatta aaaacgtccg caatgtgtta 12540 ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 12600 cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc 12660 cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg 12720 ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg 12780 tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct 12840 cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc 12900 gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg 12960 agtggcgcta tttctttaga agtgaacgtt gacgatatca actcccctat ccattgctca 13020 ccgaatggta caggtcgggg acccgaagtt ccgactgtcg gcctgatgca tccccggctg 13080 atcgacccca gatctggggc tgagaaagcc cagtaaggaa acaactgtag gttcgagtcg 13140 cgagatcccc cggaaccaaa ggaagtaggt taaacccgct ccgatcaggc cgagccacgc 13200 caggccgaga acattggttc ctgtaggcat cgggattggc ggatcaaaca ctaa agctac 13260 tggaacgagc agaagtcctc cggccgccag ttgccaggcg gtaaaggtga gcagaggcac 13320 gggaggttgc cacttgcggg tcagcacggt tccgaacgcc atggaaaccg cccccgccag 13380 gcccgctgcg acgccgacag gatctagcgc tgcgtttggt gtcaacacca acagcgccac 13440 gcccgcagtt ccgcaaatag cccccaggac cgccatcaat cgtatcgggc tacctagcag 13500 agcggcagag atgaacacga ccatcagcgg ctgcacagcg cctaccgtcg ccgcgacccc 13560 gcccggcagg cggtagaccg aaataaacaa caagctccag aatagcgaaa tattaagtgc 13620 gccgaggatg aagatgcgca tccaccagat tcccgttgga atctgtcgga cgatcatcac 13680 gagcaataaa cccgccggca acgcccgcag cagcataccg gcgacccctc ggcctcgctg 13740 ttcgggctcc acgaaaacgc cggacagatg cgccttgtga gcgtccttgg ggccgtcctc 13800 ctgtttgaag accgacagcc caatgatctc gccgtcgatg taggcgccga atgccacggc 13860 atctcgcaac cgttcagcga acgcctccat gggctttttc tcctcgtgct cgtaaacgga 13920 cccgaacatc tctggagctt tcttcagggc cgacaatcgg atctcgcgga aatcctgcac 13980 gtcggccgct ccaagccgtc gaatctgagc cttaatcaca attgtcaatt ttaatcctct 14040 gtttatcggc agttcgtaga gcgcgccgtg cgtcccgagc gatactg agc 
gaagcaagtg 14100 cgtcgagcag tgcccgcttg ttcctgaaat gccagtaaag cgctggctgc tgaaccccca 14160 gccggaactg accccacaag gccctagcgt ttgcaatgca ccaggtcatc attgacccag 14220 gcgtgttcca ccaggccgct gcctcgcaac tcttcgcagg cttcgccgac ctgctcgcgc 14280 cacttcttca cgcgggtgga atccgatccg cacatgaggc ggaaggtttc cagcttgagc 14340 gggtacggct cccggtgcga gctgaaatag tcgaacatcc gtcgggccgt cggcgacagc 14400 ttgcggtact tctcccatat gaatttcgtg tagtggtcgc cagcaaacag cacgacgatt 14460 tcctcgtcga tcaggacctg gcaacgggac gttttcttgc cacggtccag gacgcggaag 14520 cggtgcagca gcgacaccga ttccaggtgc ccaacgcggt cggacgtgaa gcccatcgcc 14580 gtcgcctgta ggcgcgacag gcattcctcg gccttcgtgt aataccggcc attgatcgac 14640 cagcccaggt cctggcaaag ctcgtagaac gtgaaggtga tcggctcgcc gataggggtg 14700 cgcttcgcgt actccaacac ctgctgccac accagttcgt catcgtcggc ccgcagctcg 14760 acgccggtgt aggtgatctt cacgtccttg ttgacgtgga aaatgacctt gttttgcagc 14820 gcctcgcgcg ggattttctt gttgcgcgtg gtgaacaggg cagagcgggc cgtgtcgttt 14880 ggcatcgctc gcatcgtgtc cggccacggc gcaatatcga acaaggaaag ctgcatttcc 14940 ttgatctgct gcttcgtgtg tttcagcaac gcggcctgct tggcctcgct gacctgtttt 15000 gccaggtcct cgccggcggt ttttcgcttc ttggtcgtca tagttcctcg cgtgtcgatg 15060 gtcatcgact tcgccaaacc tgccgcctcc tgttcgagac gacgcgaacg ctccacggcg 15120 gccgatggcg cgggcagggc agggggagcc agttgcacgc tgtcgcgctc gatcttggcc 15180 gtagcttgct ggaccatcga gccgacggac tggaaggttt cgcggggcgc acgcatgacg 15240 gtgcggcttg cgatggtttc ggcatcctcg gcggaaaacc ccgcgtcgat cagttcttgc 15300 ctgtatgcct tccggtcaaa cgtccgattc attcaccctc cttgcgggat tgccccgact 15360 cacgccgggg caatgtgccc ttattcctga tttgacccgc ctggtgcctt ggtgtccaga 15420 taatccacct tatcggcaat gaagtcggtc ccgtagaccg tctggccgtc cttctcgtac 15480 ttggtattcc gaatcttgcc ctgcacgaat accagcgacc ccttgcccaa atacttgccg 15540 tgggcctcgg cctgagagcc aaaacacttg atgcggaaga agtcggtgcg ctcctgcttg 15600 tcgccggcat cgttgcgcca catctaggta ctaaaacaat tcatccagta aaatataata 15660 ttttattttc tcccaatcag gcttgatccc cagtaagtca aaaaatagct cgacatactg 15720 ttcttccccg atatcctccc tgatcgaccg ga 
cgcagaag gcaatgtcat accacttgtc 15780 cgccctgccg cttctcccaa gatcaataaa gccacttact ttgccatctt tcacaaagat 15840 gttgctgtct cccaggtcgc cgtgggaaaa gacaagttcc tcttcgggct tttccgtctt 15900 taaaaaatca tacagctcgc gcggatcttt aaatggagtg tcttcttccc agttttcgca 15960 atccacatcg gccagatcgt tattcagtaa gtaatccaat tcggctaagc ggctgtctaa 16020 gctattcgta tagggacaat ccgatatgtc gatggagtga aagagcctga tgcactccgc 16080 atacagctcg ataatctttt cagggctttg ttcatcttca tactcttccg agcaaaggac 16140 gccatcggcc tcactcatga gcagattgct ccagccatca tgccgttcaa agtgcaggac 16200 ctttggaaca ggcagctttc cttccagcca tagcatcatg tccttttccc gttccacatc 16260 ataggtggtc cctttatacc ggctgtccgt catttttaaa tataggtttt cattttctcc 16320 caccagctta tataccttag caggagacat tccttccgta tcttttacgc agcggtattt 16380 ttcgatcagt tttttcaatt ccggtgatat tctcatttta gccatttatt atttccttcc 16440 tcttttctac agtatttaaa gataccccaa gaagctaatt ataacaagac gaactccaat 16500 tcactgttcc ttgcattcta aaaccttaaa taccagaaaa cagctttttc aaagttgttt 16560 tcaaagttgg cgtataacat agtat cgacg gagccgattt tgaaaccaca attatgggtg 16620 atgctgccaa cttactgatt tagtgtatga tggtgttttt gaggtgctcc agtggcttct 16680 gtgtctatca gctgtccctc ctgttcagct actgacgggg tggtgcgtaa cggcaaaagc 16740 accgccggac atcagcgcta tctctgctct cactgccgta aaacatggca actgcagttc 16800 acttacaccg cttctcaacc cggtacgcac cagaaaatca ttgatatggc catgaatggc 16860 gttggatgcc gggcaacagc ccgcattatg ggcgttggcc tcaacacgat tttacgtcac 16920 ttaaaaaact caggccgcag tcggtaacct cgcgcataca gccgggcagt gacgtcatcg 16980 tctgcgcgga aatggacgaa cagtggggct atgtcggggc taaatcgcgc cagcgctggc 17040 tgttttacgc gtatgacagt ctccggaaga cggttgttgc gcacgtattc ggtgaacgca 17100 ctatggcgac gctggggcgt cttatgagcc tgctgtcacc ctttgacgtg gtgatatgga 17160 tgacggatgg ctggccgctg tatgaatccc gcctgaaggg aaagctgcac gtaatcagca 17220 agcgatatac gcagcgaatt gagcggcata acctgaatct gaggcagcac ctggcacggc 17280 tgggacggaa gtcgctgtcg ttctcaaaat cggtggagct gcatgacaaa gtcatcgggc 17340 attatctgaa cataaaacac tatcaataag ttggagtcat tacccaatta tgatagaatt 17400 tacaagctat 
aaggttat tg tcctgggttt caagcattag tccatgcaag tttttatgct 17460 ttgcccattc tatagatata ttgataagcg cgctgcctat gccttgcccc ctgaaatcct 17520 tacatacggc gatatcttct atataaaaga tatattatct tatcagtatt gtcaatatat 17580 tcaaggcaat ctgcctcctc atcctcttca tcctcttcgt cttggtagct ttttaaatat 17640 ggcgcttcat agagtaattc tgtaaaggtc caattctcgt tttcatacct cggtataatc 17700 ttacctatca cctcaaatgg ttcgctgggt ttatcgcacc cccgaacacg agcacggcac 17760 ccgcgaccac tatgccaaga atgcccaagg taaaaattgc cggccccgcc atgaagtccg 17820 tgaatgcccc gacggccgaa gtgaagggca ggccgccacc caggccgccg ccctcactgc 17880 ccggcacctg gtcgctgaat gtcgatgcca gcacctgcgg cacgtcaatg cttccgggcg 17940 tcgcgctcgg gctgatcgcc catcccgtta ctgccccgat cccggcaatg gcaaggactg 18000 ccagcgctgc catttttggg gtgaggccgt tcgcggccga ggggcgcagc ccctgggggg 18060 atgggaggcc cgcgttagcg ggccgggagg gttcgagaag ggggggcacc ccccttcggc 18120 gtgcgcggtc acgcgcacag ggcgcagccc tggttaaaaa caaggtttat aaatattggt 18180 ttaaaagcag gttaaaagac aggttagcgg tggccgaaaa acgggcggaa acccttgcaa 18240 atgctggatt ttctgcctgt ggacagcccc tcaaatgtca ataggtgcgc ccctcatctg 18300 tcagcactct gcccctcaag tgtcaaggat cgcgcccctc atctgtcagt agtcgcgccc 18360 ctcaagtgtc aataccgcag ggcacttatc cccaggcttg tccacatcat ctgtgggaaa 18420 ctcgcgtaaa atcaggcgtt ttcgccgatt tgcgaggctg gccagctcca cgtcgccggc 18480 cgaaatcgag cctgcccctc atctgtcaac gccgcgccgg gtgagtcggc ccctcaagtg 18540 tcaacgtccg cccctcatct gtcagtgagg gccaagtttt ccgcgaggta tccacaacgc 18600 cggcggccgc ggtgtctcgc acacggcttc gacggcgttt ctggcgcgtt tgcagggcca 18660 tagacggccg ccagcccagc ggcgagggca accagcccgg 18700 <210> 19 <211> 304 <212> DNA <213> Artificial Sequence <220> <223> P_U6 <400> 19 agaaatctca aaattccggc agaacaattt tgaatctcga tccgtagaaa cgagacggtc 60 attgttttag ttccaccacg attatatttg aaatttacgt gagtgtgagt gagacttgca 120 taagaaaata aaatctttag ttgggaaaaa attcaataat ataaatgggc ttgagaagga 180 agcgagggat aggccttttt ctaaaatagg cccatttaag ctattaacaa tcttcaaaag 240 taccacagcg cttaggtaaa gaaagcagct gagtttatat atggttagag acgaagtagt 300 gatt 304 25 <210> 
20 <211> 25 <212> DNA <213> Artificial Sequence <220> <223> F_C9 <400> 20 gaccatcctg gacttcctga agagc 25
<210> 21 <211> 27 <212> DNA <213> Artificial Sequence <220> <223> R_C9 <400> 21 tgcaggtagt acaggtacag cttctcg 27
<210> 22 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> F_35S <400> 22 gctcctacaa atgccatca 19
<210> 23 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> R_35S <400> 23 gatagtggga ttgtgcgtca 20
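The four primer entries above (SEQ ID NOs: 20 to 23) can be sanity-checked with a short script. The following is only a minimal sketch: the helper names (`reverse_complement`, `gc_content`, `length_mismatches`) are hypothetical and not part of the patent, the sequences are copied from the listing with the spacing removed, and the declared lengths are taken from the `<211>` fields (25 for SEQ ID NO: 20 is assumed from the residue count printed after that sequence).

```python
# Primer sequences copied from SEQ ID NOs: 20-23 of the sequence listing.
# The second tuple element is the length declared in the <211> field.
PRIMERS = {
    "F_C9": ("gaccatcctggacttcctgaagagc", 25),
    "R_C9": ("tgcaggtagtacaggtacagcttctcg", 27),
    "F_35S": ("gctcctacaaatgccatca", 19),
    "R_35S": ("gatagtgggattgtgcgtca", 20),
}

# Translation table for complementing lowercase DNA bases.
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (hypothetical helper)."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = seq.lower()
    return (seq.count("g") + seq.count("c")) / len(seq)


def length_mismatches(primers: dict) -> list:
    """List primer names whose actual length differs from the declared one."""
    return [
        name
        for name, (seq, declared) in primers.items()
        if len(seq) != declared
    ]


if __name__ == "__main__":
    for name, (seq, declared) in PRIMERS.items():
        print(
            f"{name}: {len(seq)} nt (declared {declared}), "
            f"GC {gc_content(seq):.2f}, revcomp {reverse_complement(seq)}"
        )
    print("length mismatches:", length_mismatches(PRIMERS))
```

An empty mismatch list indicates that each sequence as transcribed here agrees with its declared length; it says nothing about primer specificity or amplification behavior.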

Claims (11)

1. (Deleted)
2. (Deleted)
3. A plant cell genetically engineered such that the expression or activity of a PMT1b (putrescine N-methyltransferase 1b) gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of a protein expressed by the PMT1b gene, is reduced compared to a parent cell,
wherein the genetic engineering is performed by at least one selected from the group consisting of an RNA interference (RNAi) system, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, a CRISPR/Cas9 system, X-ray irradiation, gamma-ray irradiation, ethyl methanesulfonate treatment, and dimethyl sulfate treatment.
4. The plant cell of claim 3, wherein the plant cell is additionally genetically engineered such that the expression or activity of
a PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, a PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, a PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and a PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5; or
proteins expressed by the genes;
is reduced.
5. (Deleted)
6. The plant cell of claim 3, wherein the plant is cultivated tobacco (Nicotiana tabacum).
7. A plant comprising the plant cell of any one of claims 3, 4, and 6.
8. A method for inhibiting nicotine biosynthesis in a plant, the method comprising genetically engineering a plant cell such that the expression or activity of a PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of a protein expressed by the PMT1b gene, is reduced compared to a parent cell,
wherein the genetic engineering is performed by at least one selected from the group consisting of an RNA interference (RNAi) system, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, a CRISPR/Cas9 system, X-ray irradiation, gamma-ray irradiation, ethyl methanesulfonate treatment, and dimethyl sulfate treatment.
9. The method of claim 8, wherein the plant cell is additionally genetically engineered such that the expression or activity of
a PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, a PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, a PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and a PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5; or
proteins expressed by the genes;
is reduced.
10. A method for producing a plant cell in which nicotine biosynthesis is inhibited, the method comprising genetically engineering a plant cell such that the expression or activity of a PMT1b gene consisting of the nucleotide sequence of SEQ ID NO: 1, or of a protein expressed by the PMT1b gene, is reduced compared to a parent cell,
wherein the genetic engineering is performed by at least one selected from the group consisting of an RNA interference (RNAi) system, a meganuclease system, a zinc finger nuclease system, a TALEN (transcription activator-like effector nuclease) system, a CRISPR/Cas9 system, X-ray irradiation, gamma-ray irradiation, ethyl methanesulfonate treatment, and dimethyl sulfate treatment.
11. The method of claim 10, wherein the plant cell is additionally genetically engineered such that the expression or activity of
a PMT1a gene consisting of the nucleotide sequence of SEQ ID NO: 2, a PMT2 gene consisting of the nucleotide sequence of SEQ ID NO: 3, a PMT3 gene consisting of the nucleotide sequence of SEQ ID NO: 4, and a PMT4 gene consisting of the nucleotide sequence of SEQ ID NO: 5; or
proteins expressed by the genes;
is reduced.
KR1020190175480A 2019-08-22 2019-12-26 A novel nicotine biosynthetic gene and use thereof KR102389358B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR20190103310 2019-08-22
KR1020190103310 2019-08-22

Publications (2)

Publication Number Publication Date
KR20210023646A (en) 2021-03-04
KR102389358B1 (en) 2022-04-22

Family

ID=75174660

Family Applications (2)

Application Number Title Priority Date Filing Date
KR1020190175480A KR102389358B1 (en) 2019-08-22 2019-12-26 A novel nicotine biosynthetic gene and use thereof
KR1020190175479A KR102395185B1 (en) 2019-08-22 2019-12-26 Genetically engineered plant cell and using method of the same

Country Status (1)

Country Link
KR (2) KR102389358B1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2023534348A (en) * 2021-06-21 2023-08-09 ケーティー アンド ジー コーポレイション Plant cell with engineered QPT gene and method of using the same

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100899993B1 (en) 2007-03-07 2009-05-28 한국생명공학연구원 A Low Nicotine Transgenic Tabacco and A method of producing thereof
WO2018222667A1 (en) * 2017-05-31 2018-12-06 22Nd Century Limited, Llc Genome editing methods for producing low-nicotine tobacco products
US20200029522A1 (en) 2018-07-26 2020-01-30 Altria Client Services Llc Compositions and Methods Based on PMT Engineering for Producing Tobacco Plants and Products Having Altered Alkaloid Levels

Also Published As

Publication number Publication date
KR20210023645A (en) 2021-03-04
KR102395185B1 (en) 2022-05-10
KR20210023646A (en) 2021-03-04

Similar Documents

Publication Publication Date Title
AU2013254857B2 (en) Targeted genome engineering in plants
WO2018103686A1 (en) Chloroplast genome editing method
WO2019153902A1 (en) Plant genome site-directed substitution method
US20040205839A1 (en) Methods for obtaining plant varieties
JPS61502166A (en) Improved methods and vectors for transformation of plant cells
KR19980702139A (en) Method for producing recombinant adenovirus genome
WO2018115202A1 (en) Conferring resistance to geminiviruses in plants in alternative manner to gene drive, using crispr/cas systems
JPH0231683A (en) Transfer activating element of t-dna 780 gene
CN116157144A (en) Uracil stabilizing proteins and active fragments and variants thereof and methods of use
KR102389358B1 (en) A novel nicotine biosynthetic gene and use thereof
CN112852827A (en) Application of arabidopsis transcription factor AT3G46090 gene in cultivation of disease-resistant transgenic plants
WO1999061034A1 (en) Improved vectors
JP7489112B2 (en) Target sequence modification technology using the CRISPR type I-D system
CN112779282A (en) Method for creating multiple fruit color materials from tomato background materials by gene editing technology
KR102617424B1 (en) Composition and kit for reducing methylation of target DNA and induction of expression of target gene in animal cell, and method using the same
KR102516522B1 (en) pPLAⅡη gene inducing haploid plant and uses thereof
CN110791525A (en) Method for knocking out rice tillering number regulation gene OsFWL4 to increase rice tillering number and yield
KR20180128864A (en) Gene editing composition comprising sgRNAs with matched 5&#39; nucleotide and gene editing method using the same
KR20210062588A (en) Antibody produced by using afucosylated n.benthamiana and uses thereof
EP4130262A1 (en) Plant cells having engineered qpt gene and method for using same
JP2021052763A (en) Means for generating adenoviral vectors for cloning large nucleic acids
KR102522125B1 (en) Plant with reduced heavy metals and methods for preparation thereof
KR102550308B1 (en) Method for producing genome-edited tomato plant with increased salt tolerance by SlHKT1;2 gene editing and genome-edited tomato plant with increased salt tolerance produced by the same method
KR102675540B1 (en) Method for producing tomato plant having controlled disease-resistance using gene editing and tomato plant produced by the same method
JP4159620B2 (en) Method for producing recombinant adenovirus

Legal Events

Date Code Title Description
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
X091 Application refused [patent]
AMND Amendment
X701 Decision to grant (after re-examination)
GRNT Written decision to grant