CN114657156A - 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用 - Google Patents

吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用 Download PDF

Info

Publication number
CN114657156A
CN114657156A CN202210243139.3A CN202210243139A CN114657156A CN 114657156 A CN114657156 A CN 114657156A CN 202210243139 A CN202210243139 A CN 202210243139A CN 114657156 A CN114657156 A CN 114657156A
Authority
CN
China
Prior art keywords
seq
plasmid
leu
ala
lys
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210243139.3A
Other languages
English (en)
Inventor
王南溪
李欢欢
唐槊
程剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Chinese Medicine
Original Assignee
Nanjing University of Chinese Medicine
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Chinese Medicine filed Critical Nanjing University of Chinese Medicine
Priority to CN202210243139.3A priority Critical patent/CN114657156A/zh
Publication of CN114657156A publication Critical patent/CN114657156A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/93Ligases (6)
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/43504Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
    • C07K14/43595Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from coelenteratae, e.g. medusae
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/65Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/70Vectors or expression systems specially adapted for E. coli
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • C12N15/90Stable introduction of foreign DNA into chromosome
    • C12N15/902Stable introduction of foreign DNA into chromosome using homologous recombination
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P13/00Preparation of nitrogen-containing organic compounds
    • C12P13/04Alpha- or beta- amino acids
    • C12P13/08Lysine; Diaminopimelic acid; Threonine; Valine
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/02Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving viable microorganisms
    • C12Q1/04Determining presence or kind of microorganism; Use of selective media for testing antibiotics or bacteriocides; Compositions containing a chemical indicator therefor
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y601/00Ligases forming carbon-oxygen bonds (6.1)
    • C12Y601/01Ligases forming aminoacyl-tRNA and related compounds (6.1.1)
    • C12Y601/01026Pyrrolysine-tRNAPyl ligase (6.1.1.26)
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/195Assays involving biological materials from specific organisms or of a specific nature from bacteria
    • G01N2333/24Assays involving biological materials from specific organisms or of a specific nature from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • G01N2333/245Escherichia (G)

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Toxicology (AREA)
  • Medicinal Chemistry (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Mycology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

为解决现有技术中无法在特定位点特异性引入带有乳酰化基团的蛋白的问题,本发明提供一种吡咯赖氨酸氨酰‑tRNA合成酶突变体和一种筛选前述吡咯赖氨酸氨酰‑tRNA合成酶突变体的系统和方法。基于基因密码子扩展技术,通过建立一种正交的双报告基因筛选系统,筛选到吡咯赖氨酸氨酰‑tRNA合成酶突变体,可用于将乳酰化赖氨酸定点引入大肠杆菌、哺乳细胞等目标宿主中,实现含乳酰化赖氨酸修饰的组蛋白以及非组蛋白的表达及应用(翻译后修饰及生理、病理过程),扩展了该正交对的使用范围。

Description

吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用
技术领域
本发明涉及生物技术领域,具体地说,本发明涉及吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用。
背景技术
组蛋白存在于真核细胞中,是将DNA包装到核小体中的蛋白质。组蛋白翻译后修饰(HPTM)是一类调控基因表达的翻译后修饰,参与许多重要的生命过程,如DNA的复制和修复、RNA的转录等。不同类型的HPTM可以单独或组合起来以调节与染色质相关的各种活性。HPTM识别事件的失调会导致许多疾病的发生,例如癌症。近些年来,质谱学的方法技术在蛋白质组学中的应用日渐深入,学者们亦相继鉴定出了多种新型组蛋白赖氨酸酰化反应,例如丙酰化,丁酰化,丙二酰化,戊二酰化,β-羟基丁酰化,2-羟基异丁酰化,琥珀酰化,巴豆酰化,以及乳酰化等。研究表明,这些修饰参与调节染色质重塑,基因表达,细胞周期和细胞代谢等多种生命活动。最近的研究表明,HPTM与癌症,代谢性疾病,神经精神疾病,不育症,肾脏疾病及后天免疫机能丧失综合症(AIDS)等的发病机理密切相关。
乳酸是机体在缺氧条件下细胞呼吸的重要副产物,细胞在缺氧条件下糖酵解产生的丙酮酸会进一步被分解生成乳酸以及NADH(还原型辅酶Ⅰ),进而为细胞的生命活动供能。但是就肿瘤细胞而言,即便是在氧气很充足的条件下,其仍然偏向于利用糖酵解的方式来进行呼吸作用,从而会产生大量乳酸,这种现象被称为Warburg效应(瓦博格效应)。先前乳酸一直被认为仅仅是糖酵解产生的副产物,2019年有学者首次鉴定到了组蛋白上的乳酰化,并证实乳酰化修饰水平受到内源乳酸水平的调控,此外还发现了在M1巨噬细胞极化过程中,组蛋白乳酰化修饰可直接调控特定基因的转录,这一现象表明乳酰化修饰具有重要的生物学功能。
蛋白质乳酰化是于2019年新发现的一种组蛋白翻译后修饰。目前,蛋白质的乳酰化修饰依赖于(1)内源性乳酸修饰(体内);(2)化学修饰(体外)。这两种方法都具有随机性、非特异性和非均一性。目前,尚无报道证明可在原核及真核系统中实现位点特异性的蛋白质乳酰化修饰,尚无一种技术手段可将乳酰化赖氨酸基因编码至目标蛋白,无论是组蛋白和非组蛋白。针对以上问题,迫切需要在特定位点特异性引入带有乳酰化基团的蛋白的方法和技术,以更好的研究各种生命过程的发生、发展机制。
发明内容
为解决现有技术中存在的上述技术问题,本发明基于基因密码子扩展技术,通过建立一种正交的双报告基因筛选系统,筛选到吡咯赖氨酸氨酰-tRNA合成酶突变体,可用于将乳酰化赖氨酸定点引入大肠杆菌、哺乳细胞等目标宿主中,实现含乳酰化赖氨酸修饰的组蛋白以及非组蛋白的表达及应用(翻译后修饰及生理、病理过程),扩展了该正交对的使用范围。
本发明的第一个目的是提供一种吡咯赖氨酸氨酰-tRNA合成酶突变体。
本发明的第二个目的是提供一种筛选前述吡咯赖氨酸氨酰-tRNA合成酶突变体的方法。
本发明的第三个目的是提供包括所述吡咯赖氨酸氨酰-tRNA合成酶突变体的重组载体。
本发明的第四个目的是提供前述吡咯赖氨酸氨酰-tRNA合成酶突变体或前述的筛选系统在蛋白质乳酰化修饰中的应用。
本发明的第五个目的是提供前述吡咯赖氨酸氨酰-tRNA合成酶突变体或前述的筛选系统在将乳酰化赖氨酸定点引入目标宿主中的应用。
一种吡咯赖氨酸氨酰-tRNA合成酶突变体,所述吡咯赖氨酸氨酰-tRNA合成酶突变体为将GenBank Sequence ID:WP_011033391.1所示吡咯赖氨酸氨酰-tRNA合成酶的氨基酸序列中301位、306位、309位、348位的氨基酸残基突变得到,所述突变为:
301位替换为缬氨酸(V),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.7);
或,301位替换为蛋氨酸(M),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.8);
或,301位替换为异亮氨酸(I),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.9)。
一种筛选前述吡咯赖氨酸氨酰-tRNA合成酶突变体的方法,所述方法包括以下步骤:
S1:构建包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒:所述吡咯赖氨酸氨酰tRNA 合成酶饱和突变库为将如GenBank Sequence ID:WP_011033391.1所示野生型吡咯赖氨酸氨酰-tRNA合成酶的氨基酸序列突变得到,突变位点包括301位、306位、309位、以及348位的氨基酸残基;将编码吡咯赖氨酸氨酰tRNA合成酶饱和突变库的核苷酸序列连入基础质粒,获得所述包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒;
优选的,所述重组载体包含GlnRS启动子、编码吡咯赖氨酸氨酰tRNA合成酶饱和突变库的核苷酸序列、GlnRS终止子、ColE1复制原点、硫酸卡那霉素抗性基因;
在某一个特殊的实施例中,先将如SEQ ID NO.10所示编码野生型吡咯赖氨酸氨酰-tRNA合成酶的核苷酸序列按照GlnRS启动子-SEQ ID NO.10所示编码野生型吡咯赖氨酸氨酰-tRNA合成酶的核苷酸序列-GlnRS终止子-ColE1复制原点-硫酸卡那霉素抗性基因连接成重组质粒后,对所述编码野生型吡咯赖氨酸氨酰-tRNA合成酶的核苷酸序列进行突变,获得包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒;
S2:构建筛选质粒:将SEQ ID NO.2所示包括氯霉素抗性报告基因的核苷酸片段、SEQ ID NO.3所示包括绿色荧光蛋白报告基因的核苷酸片段、SEQ ID NO.4所示包括T7RNA聚合酶基因的核苷酸片段通过同源重组连接起来,得到重组质粒,所述重组质粒核苷酸序列如SEQ ID NO.11所示;再经过双点突变,将SEQ ID NO.11所示重组质粒的9868bp~9873bp的ctagtc突变位atccta,使重组质粒中的氯霉素抗性基因编码的氨基酸序列的112号位点突变为112D,111位点突变成为琥珀终止密码子TAG,得到所述筛选质粒;
S3:将S2得到的筛选质粒转入大肠杆菌DH10B感受态细胞,得到含有筛选质粒的感受态细胞;将S1构建的包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒转化至含有筛选质粒的感受态细胞,分别接种至添加非天然氨基酸的液体培养基和不添加非天然氨基酸的液体培养基中进行筛选,如果突变样品同时满足:
(1)在添加非天然氨基酸的液体培养基上具有氯霉素抗性、能够观察到荧光;
(2)在不添加非天然氨基酸的液体培养基上不具有氯霉素抗性、不能够观察到荧光;
则筛选得到前述吡咯赖氨酸氨酰-tRNA合成酶突变体。
在某个特殊的实施例中,所述非天然氨基酸为乳酰化赖氨酸(Kla)。
本方法S1和S2的顺序无特殊限定,操作顺序可以调换或同时进行。
进一步的,S2所述筛选质粒核苷酸序列如SEQ ID NO.1所示。
进一步的,SEQ ID NO.3所示包括绿色荧光蛋白报告基因的核苷酸片段命名为Frag-pET-28a-sfGFP核苷酸序列,为采用pSel-sfGFP引物对,从SEQ ID NO.5所述质粒pET-28a-sfGFP中扩增得到;SEQ ID NO.4所示包括T7RNA聚合酶基因的核苷酸片段命名为Frag-pBad-T7RNAP核苷酸序列,为采用pSel-RNAP引物对,从SEQ ID NO.6所述质粒pBad-T7RNAP-TAG中扩增得到;
所述pSel-sfGFP引物对为:
pSel-sfGFP-F:5'-ctctCAGCAAAAAACCCCT CAAGACC-3'(SEQ ID NO.12),pSel-sfGFP-R:5'-ataagaaatgaggccgctcaCGATCCCGCGAAATTAATACG-3'
(SEQ ID NO.13);
所述pSel-RNAP引物对为:
pSel-RNAP-F:5'-atgactgagagtcaacgccaGCTGTGACCGTCTCCGGG-3'(SEQ IDNO.14),
pSel-RNAP-R:5'-gaggggttttttgctgAGAGTTTGTAGAAACGCAAAAAGGC-3'(SEQ IDNO.15);
SEQ ID NO.2所示包括氯霉素抗性报告基因的核苷酸片段为线性化pACYC184-Cm112TAG-PylT载体得到,线性化pACYC184-Cm112TAG-PylT载体的引物为:pSel-vec-F-new:5'-cgtgagcggcctcatttcttattctgagttacaac-3'(SEQ ID NO.16)、
pSel-vec-R-new:5'-tggcgttgactctcagtcatagtatcgtggtatc-3'(SEQ IDNO.17);
然后与Frag-pET-28a-sfGFP、Frag-pBad-T7RNAP做3片段同源重组连接。
进一步的,S2中双点突变为以SEQ ID NO.11所示重组质粒核苷酸序列为模板,双点突变引物为:
SDM-Cm112TAG-F:5'-TACCACtagGATTTCCGGCAGTTTCTACACAT-3'(SEQ ID NO.18),
SDM-Cm112 TAG-R:
5'-CGGAAATCctaGTGGTATTCACTCCAGAGCGATG-3'(SEQ ID NO.19);
一种用于筛选前述吡咯赖氨酸氨酰-tRNA合成酶突变体的筛选质粒,所述筛选质粒为将SEQ ID NO.2所示包括氯霉素抗性报告基因的核苷酸片段、SEQ ID NO.3所示包括绿色荧光蛋白报告基因的核苷酸片段、SEQ ID NO.4所示包括T7RNA聚合酶基因的核苷酸片段通过同源重组连接起来,得到重组质粒,所述重组质粒核苷酸序列如SEQ ID NO.11所示;再经过双点突变,将SEQ ID NO.11所示重组质粒的9868bp~9873bp的ctagtc突变位atccta,使重组质粒中的氯霉素抗性基因编码的氨基酸序列的112号位点突变为112D,111位点突变成为琥珀终止密码子TAG,PCR同源重组得到所述筛选质粒,所述筛选质粒核苷酸序列如SEQID NO.1所示。
一种重组载体,所述重组载体包括编码前述吡咯赖氨酸氨酰-tRNA合成酶突变体的核苷酸序列。
将编码吡咯赖氨酸氨酰-tRNA合成酶突变体的核苷酸序列连入基础质粒,获得所述包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒;
在某个特殊的实施例中,所述重组载体包含GlnRS启动子、编码吡咯赖氨酸氨酰-tRNA合成酶突变体的核苷酸序列、GlnRS终止子、ColE1复制原点、硫酸卡那霉素抗性基因;所述吡咯赖氨酸氨酰-tRNA合成酶突变体为将如GENBANK SEQUENCE ID:WP_011033391.1所示野生型吡咯赖氨酸氨酰-tRNA合成酶的氨基酸序列突变得到,突变位点包括301位、306位、309位、以及348位的氨基酸残基;所述突变为:
301位替换为缬氨酸(V),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.7);
或,301位替换为蛋氨酸(M),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.8);
或,301位替换为异亮氨酸(I),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.9)。
在某一个特殊的实施例中,先将如SEQ ID NO.10所示编码野生型吡咯赖氨酸氨酰-tRNA合成酶的核苷酸序列按照GlnRS启动子-SEQ ID NO.10所示编码野生型吡咯赖氨酸氨酰-tRNA合成酶的核苷酸序列-GlnRS终止子-ColE1复制原点-硫酸卡那霉素抗性基因连接成重组质粒后,对所述编码野生型吡咯赖氨酸氨酰-tRNA合成酶的核苷酸序列进行上述突变,获得所述重组载体。
前述吡咯赖氨酸氨酰-tRNA合成酶突变体,或前述的筛选质粒,或前述重组载体在蛋白质乳酰化修饰中的应用。
前述吡咯赖氨酸氨酰-tRNA合成酶突变体,或前所述的筛选质粒,或前述重组载体在将乳酰化赖氨酸定点引入目标蛋白中的应用。
本发明技术方案相对于现有技术具有以下有益效果:
本发明基于基因密码子扩展技术,通过建立一种正交的双报告基因筛选系统,改造了吡咯赖氨酸氨酰-tRNA合成酶,可用于将乳酰化赖氨酸定点引入大肠杆菌、哺乳细胞等目标蛋白中,实现含乳酰化赖氨酸修饰的组蛋白以及非组蛋白的表达及应用(翻译后修饰及生理、病理过程),扩展了该正交对的使用范围。本发明所构建的筛选系统为全新的发明,筛选指示阳性结果准确且全面,结果直观性良好。通过对吡咯赖氨酸氨酰-tRNA合成酶(PylRS)的定向进化文库的筛选,发现了三种可定向引入乳酰化赖氨酸的吡咯赖氨酸氨酰-tRNA合成酶突变体。由本筛选系统筛选得到的吡咯赖氨酸氨酰-tRNA合成酶突变体与传统化学方法相比,能够特异性地在目标蛋白序列中引入乳酰化赖氨酸,生物相容性良好,且过程可控。
所述的乳酰化赖氨酸的化学结构为:
Figure BDA0003543551000000061
附图说明
图1.筛选系统由筛选质粒pSel(图1a)和编码氨酰tRNA合成酶的质粒pLH(图1b)组成。
图2.筛选系统的功能验证,其中,图2a为含有pLH-MmPylRS和pSel质粒的大肠杆菌DH10B细胞,在不添加(左)或添加(右)非天然氨基酸BocK培养,0.2%阿拉伯糖诱导后的菌体,在365nm激发波长下不产生或产生荧光的示例,图2b为含有pLH-MmPylRS和pSel质粒的大肠杆菌DH10B细胞,在不添加(左)或添加(右)非天然氨基酸BocK培养,0.2%阿拉伯糖诱导后的菌体,600nm激发波长下测定OD值,485nm激发波长,检测535nm发射波长下的荧光值,计算得相对荧光强度RFU。
图3.筛选获得定点引入乳酰化赖氨酸(Kla)的吡咯赖氨酸氨酰-tRNA合成酶突变体。将此三种突变体分别与pEvol-EGFP-4*共转至大肠杆菌DH10B细胞,在不添加(左)或添加(右)非天然氨基酸Kla培养,0.2%阿拉伯糖诱导后的菌体,600nm激发波长下测定OD值,485nm激发波长,检测535nm发射波长下的荧光值,计算得相对荧光强度RFU。
图4.在大肠杆菌中引入乳酰化赖氨酸(Kla),其中,图4a为L-Kla的化学结构,图4b为含有包含pEvol-KlaRS质粒和pBad-EGFP-4*质粒的大肠杆菌DH10B细胞,在不添加(左)或添加(右)非天然氨基酸Kla培养,0.2%阿拉伯糖诱导后的菌体,600nm激发波长下测定OD值,485nm激发波长,检测535nm发射波长下的荧光值,计算得相对荧光强度RFU,图4c为pEvol-KlaRS与pBad-EGFP-4*质粒共转至DH10B细胞,加入0.2%阿拉伯糖诱导,并加入1mMKla,可获得4号位被Kla修饰的均一的纯品EGFP蛋白,蛋白免疫印迹结果如图所示。
图5.在哺乳细胞中引入乳酰化赖氨酸(Kla)。
图6.通过筛选过的吡咯赖氨酸氨酰-tRNA合成酶突变体将Kla定点引入目标蛋白的方法流程简图。
图7.荧光筛选过程示例如图7所示,其中图7a为含有Kana 50、Tet 12.5、Cm50三重抗性以及0.2%阿拉伯糖、含有1mM Kla的LB平板上在365nm波长紫外下观察菌体生长情况;图7b为含有Kana 50、Tet 12.5、Cm50三重抗性以及0.2%阿拉伯糖、不含有1mM Kla的LB平板上在365nm波长紫外下观察菌体生长情况。在图7a中显示强烈荧光,在图7b中无菌落生长的克隆被认为是阳性克隆;对其进行测序和进一步验证。
具体实施方式
下面结合具体实施例对本发明作更进一步的描述,但要求保护的范围并不局限于所述。此处所描述的具体实施仅仅用以解释本发明,并不用于限定本发明。
实施例1筛选系统报告基因质粒的构建(pSel)
本发明中提到的筛选系统报告基因质粒构建时各部分组件及构建过程如下:最终质粒pSel是由pACYC184-Cm112TAG-PylT(SEQ ID NO.2)、Frag-pET-28a-sfGFP(SEQ IDNO.3)、Frag-pBad-T7RNAP(SEQ ID NO.4)三个片段通过同源重组的方法连接在一起,再经双点突变构成。
具体操作过程为:线性化pACYC184-Cm112TAG-PylT,pACYC184-Cm112TAG-PylT中包括氯霉素抗性的报告基因,所述氯霉素抗性的报告基因位于SEQ ID NO.2的3985-4425bp,1-219bp;所述氯霉素抗性的报告基因编码的氨基酸序列由SEQ ID NO.47所示突变前氯霉素抗性基因编码的氨基酸序列1和SEQ ID NO.48所示突变前氯霉素抗性基因编码的氨基酸序列2组成(终止密码子在序列中用*表示):
MEKKITGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFRMAMKDGELVIWDSVHPCYTVFHEQTETFSSLWSEYHD*FRQFLHIYSQDVACYGENLAYFPKGFIENMFFVSANPWVSFTSFDLNVANMDNFFAPVFTMGKYYTQGDKVLMPLAIQVHHAVCDGFHVGRMLNELQQYCDEWQGGA。
将Frag-pET-28a-sfGFP(SEQ ID NO.3)、Frag-pBad-T7RNAP(SEQ ID NO.4)两个片段与线性化的pACYC184-Cm112TAG-PylT(SEQ ID NO.2)连接在一起,重组成为目标质粒——pSel。
Frag-pET-28a-sfGFP(SEQ ID NO.3)来源于质粒pET-28a-sfGFP(其序列示于SEQID NO.5),Frag-pET-28a-sfGFP中包括绿色荧光蛋白的报告基因,所述绿色荧光蛋白的报告基因位于SEQ ID NO.3的157~879bp;所述绿色荧光蛋白的报告基因编码的氨基酸序列如GenBank Sequence ID:APA32056.1所示。
Frag-pBad-T7RNAP来源质粒pBad-T7RNAP-TAG(其序列示于SEQ ID NO.6),Frag-pBad-T7RNAP中包括T7RNA聚合酶基因,所述T7RNA聚合酶基因位于SEQ ID NO.4的1439~4087bp;T7RNA聚合酶基因编码的氨基酸序列由SEQ ID NO.49所示的T7RNA聚合酶编码的氨基酸序列1和SEQ ID NO.50所示的T7RNA聚合酶编码的氨基酸序列2组成(终止密码子在序列中用*表示):*NTINIAKNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEARFRKMFERQLKAGEVADNAAAKPLITTLLPKMIARINDWFEEVKAKRGKRPTAFQFL*EIKPEAVAYITIKTTLACLTSADNTTVQAVASAIGRAIEDEARFGRIRDLEAKHFKKNVEEQLNKRVGHVYKKAFMQVVEADMLSKGLLGGEAWSSWHKEDSIHVGVRCIEMLIESTGMVSLHRQNAGVVGQDSETIELAPEYAEAIATRAGALAGISPMFQPCVVPPKPWTGITGGGYWANGRRPLALVRTHSKKALMRYEDVYMPEVYKAINIAQNTAWKINKKVLAVANVITKWKHCPVEDIPAIEREELPMKPEDIDMNPEALTAWKRAAAAVYRKDKARKSRRISLEFMLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMTKGLLTLAKGKPIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENTWWAEQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAMLRDEVGGRAVNLLPSETVQDIYGIVAKKVNEILQADAINGTDNEVVTVTDENTGEISEKVKLGTKALAGQWLAYGVTRSVTKRSVMTLAYGSKEFGFRQQVLEDTIQPAIDSGKGLMFTQPNQAAGYMAKLIWESVSVTVVAAVEAMNWLKSAAKLLAAEVKDKKTGEILRKRCAVHWVTPDGFPVWQEYKKPIQTRLNLMFLGQFRLQPTINTNKDSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHEKYGIESFALIHDSFGTIPADAANLFKAVRETMVDTYESCDVLADFYDQFADQLHESQLDKMPALPAKGNLNLRDILESDFAFA(SEQ ID NO.50)
(野生型T7RNA聚合酶氨基酸序列如GenBank Sequence ID:WP_001092355.1所示。)
质粒Frag-pBad-T7RNAP的某一种构建方法为:
使用NcoI和HindIII对pBad/HisA(Invitrogen,编号V430-01)载体进行双酶切,切胶回后收获得线性化载体pBad-NcoI-HindIII。
使用引物BL21-T7RNAP-F:
5’-GCTAACAGGAGGAATTAACCATGAACACGATTAACATCGCTAAGA-3’
(SEQ ID NO.40);和BL21-T7RNAP-R:
5’-CCGCCAAAACAGCCAAGCTTTTACGCGAACGCGAAGTCC-3’(SEQ ID NO.41);从大肠杆菌BL21(DE3)基因组里扩增T7RNAP基因,测序成功后,使用引物T7RNAP-TAG1-F:
5’-accatgattaccgtgcactagAACACGATTAACATCGCTAAGAACG-3’(SEQ ID NO.42);和T7RNAP-TAG1-R:
5’-gtgcacggtaatcatggtcatGGTTAATTCCTCCTGTTAGCCCA-3’(SEQ ID NO.43),将野生型T7RNA聚合酶基因的1号位的M突变为终止密码子(atg突变为tag)获得插入片段T7RNAP-TAG1。
再以插入片段T7RNAP-TAG1作为模板使用引物
T7RNAP-TAG107-F-5’-CCAGTTCCTGtagGAAATCAAGCCGGAAGCCG-3’
(SEQ ID NO.44);和T7RNAP-TAG107-R:
5’-TTTCctaCAGGAACTGGAAGGCTGTCGGGCGC-3’(SEQ ID NO.45),将107位的Q突变为终止密码子(caa突变为tag)获得插入片段T7RNAP-TAG,此时已在野生型T7RNAP基因内引入了两个终止密码子。将插入片段T7RNAP-TAG与线性化载体pBad-NcoI-HindIII进行同源重组获得pBad-T7RNAP-TAG质粒。
设计的克隆引物(PCR同源重组)详情如下:
线性化pACYC184-Cm112TAG-PylT载体的引物为:
pSel-vec-F-new:5'-cgtgagcggcctcatttcttattctgagttacaac-3'(SEQ IDNO.16)、
pSel-vec-R-new:5'-tggcgttgactctcagtcatagtatcgtggtatc-3'(SEQ IDNO.17);
用于从质粒pBad-T7RNAP-TAG中得到Frag-pBad-T7RNAP片段的引物pSel-RNAP为:
pSel-RNAP-F:5'-atgactgagagtcaacgccaGCTGTGACCGTCTCCGGG-3'(SEQ IDNO.14)、
pSel-RNAP-R:5'-gaggggttttttgctgAGAGTTTGTAGAAACGCAAAAAGGC-3'(SEQ IDNO.15),
用于从质粒pET-28a-sfGFP中得到Frag-pET-28a-sfGFP片段的引物pSel-sfGFP为:
pSel-sfGFP-F:5'-ctctCAGCAAAAAACCCCT CAAGACC-3'(SEQ ID NO.12)、
pSel-sfGFP-R:5'-ataagaaatgaggccgctcaCGATCCCGCGAAATTAATACG-3'(SEQ IDNO.13)。
之后采用诺唯赞非连接酶依赖型的多片段一步克隆技术试剂盒,货号C113-01,按照试剂盒说明书所述条件,将线性化的pACYC184-Cm112TAG-PylT(SEQ ID NO.2)、Frag-pET-28a-sfGFP(SEQ ID NO.3)、Frag-pBad-T7RNAP(SEQ ID NO.4)三片段同源重组连接成核苷酸序列如SEQ ID NO.11所示重组质粒。
最后将重组质粒进行双点突变的引物为:
SDM-Cm112TAG-F:5'-TACCACtagGATTTCCGGCAGTTTCTACACAT-3'(SEQ ID NO.18)、
SDM-Cm112 TAG-R:
5'-CGGAAATCctaGTGGTATTCACTCCAGAGCGATG-3'(SEQ ID NO.19)。突变结束后,氯霉素抗性的报告基因编码的氨基酸序列由SEQ ID NO.51所示突变后氯霉素抗性基因编码的氨基酸序列1和SEQ ID NO.52所示突变后氯霉素抗性基因编码的氨基酸序列2组成(终止密码子在序列中用*表示)::MEKKITGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFRMAMKDGELVIWDSVHPCYTVFHEQTETFSSLWSEYH*DFRQFLHIYSQDVACYGENLAYFPKGFIENMFFVSANPWVSFTSFDLNVANMDNFFAPVFTMGKYYTQGDKVLMPLAIQVHHAVCDGFHVGRMLNELQQYCDEWQGGA
即112号位点突变为112D,111位点突变成为琥珀终止密码子TAG。得到SEQ IDNO.1所示筛选质粒pSel(图1a)。
实施例2构建包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒pLH
构建包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒pLH,质粒pLH为自主构建的适配于筛选报告基因质粒功能的一种全新的质粒,该质粒主要由GlnRS启动子、301位、306位、309位、以及348位饱和突变的吡咯赖氨酸氨酰-tRNA合成酶、GlnRS终止子、ColE1复制起始位点、硫酸卡那霉素抗性基因组成。
(1)构建含有野生型吡咯赖氨酸氨酰tRNA合成酶pLH-MmPylRS的质粒:
构建包含GlnRS启动子、野生型吡咯赖氨酸氨酰tRNA合成酶pLH-MmPylRS、GlnRS终止子、ColE1复制原点、硫酸卡那霉素抗性基因的重组质粒pLH。
其中GlnRS启动子和GlnRS终止子基因为委托公司:北京擎科生物科技有限公司合成得到;GlnRS启动子核苷酸序列位于GenBank Sequence ID:V01575.1的241~306bp处,GlnRS终止子核苷酸序列位于GenBank Sequence ID:
V01575.1的2038~2211bp处。
pLH-GlnRS-P-F:5'-tccccgaaaagtgccacctgCTCGGGAGTTGTCAGCCTGTC-3'(SEQ IDNO.20)和pLH-GlnRS-P-R:5'-ccatATGGGATTCCTCAAAGCGTAAA-3'(SEQ ID NO.21)用于克隆GlnRS启动子;
pLH-GlnRS-T-F:5'-ctactaacctgtaaCTGCAGTTTCAAACGCTAAATTG-3'(SEQ IDNO.22)和pLH-GlnRS-T-R:
5'-ctggccttttgctcacatgtTGCGAAGCGGAATTAATTCG-3'(SEQ ID NO.23)用于克隆GlnRS终止子;
编码野生型吡咯赖氨酸氨酰-tRNA合成酶pLH-MmPylRS的核苷酸序列如SEQ IDNO.10所示,可以直接合成,也可以质粒pEvol-MmPylRS为模板,采用PCR方式克隆,克隆引物为:pLH-MmPylRS-F:
5'-ctttgaggaatcccatATGGATAAAAAGCCTCTGAACACTC-3'(SEQ ID NO.24)和pLH-MmPylRS-R:5'-ctgcagTTACAGGTTAGTAGAAATACCATTGTAATAGG-3'(SEQ ID NO.25);ColE1复制原点和硫酸卡那霉素抗性来源于pET-28a载体,克隆引物为:pLH-pET28a-F:5'-ACATGTGAGCAAAAGGCCAGC-3'(SEQ ID NO.26)和pLH-pET28a-R:5'-CAGGTGGCACTTTTCGGGG-3'(SEQ ID NO.27)。
其中pLH-GlnRS-P-F和pLH-pET28a-R有同源臂;pLH-GlnRS-P-R和pLH-MmPylRS-F有同源臂;pLH-MmPylRS-R和pLH-GlnRS-T-F有同源臂;
pLH-GlnRS-T-R和pLH-pET28a-F有同源臂。
综上,通过聚合酶链式反应、同源重组等技术手段构建出编码野生型吡咯赖氨酸氨酰-tRNA合成酶的质粒。
(2)通过定点突变,构建吡咯赖氨酸氨酰-tRNA合成酶饱和突变库,突变位点包括301、306、309、以及348残基,此饱和突变库命名为pLH-Lib,包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒命名为pLH。定点突变方法如下:
首先构建包括301、306位点的饱和突变文库,突变引物如下:
pLH-PylRS-F306NNK-F:
5'-GAGAGTAGGTCTCACCGAACCTGNNKAACTATgcGCGTAAACTGGA-3'
(SEQ ID NO.28)、
pLH-PylRS-301MNN-R:
5'-GAGGAAAGGTCTCATCGGGGCMNNCAtCGGACGCAGACAGAAG-3'(SEQ ID NO.29),
其中引物pLH-PylRS-F306NNK-F包含306位点的饱和突变,引物pLH-PylRS-301MNN-R包含301位点的饱和突变,通过聚合酶链式反应扩增出包含301、306位点饱和突变文库的片段,再经限制性外切酶BsaI-HFv2酶切后,采用T4连接的方法连接成包括301、306位点的饱和突变文库的环形质粒。
接着构建包括309、348位点的饱和突变文库,构建采用分步构建的方法,先构建包括309位点的饱和突变文库:以所述包括301、306位点的饱和突变文库的环形质粒为模板,构建包括309位点的饱和突变文库,经过聚合酶链式反应、酶切、连接得到包括301、306、309位点的饱和突变文库的质粒,接着以包括301、306、309位点的饱和突变文库的质粒为模板,再经上述步骤,最终构建成包括301、306、309、348位点的饱和突变文库质粒pLH,用于筛选。
所用聚合酶链式反应突变引物涉及如下:
pLH-PylRS-309-F:
5'-GAGAGTAGGTCTCACCCTGCCGGACCCGATCAAAATTTTCGAG-3'(SEQ ID NO.30),
pLH-PylRS-309MNN-R:
5'-GAGGAAAGGTCTCAAGGGCACGGTCCAGTTTACGMNNATAGTT-3'(SEQ ID NO.31),
其中引物pLH-PylRS-309MNN-R包含309位点饱和突变;
pLH-PylRS-348NNK-F:
5'-GAGAGTAGGTCTCACCATGCTGAACTTCNNKCAGATGGGTAGCG-3'(SEQ ID NO.32),pLH-PylRS-348-R:
5'-GAGGAAAGGTCTCAATGGTGAATTCTTCCAGGTGCTCTTTACC-3'(SEQ ID NO.33),其中引物pLH-PylRS-348NNK-F包含348位点饱和突变。
实施例3筛选系统功能的验证
以商业化非天然氨基酸N(ε)-Boc-L-lysine(Nε-Boc-L-赖氨酸,简称BocK),以及能够特异性引入N(ε)-Boc-L-lysine[N(ε)-Boc-L-赖氨酸]的氨酰tRNA合成酶/tRNA对为模型,探究该筛选系统的功能。
实施例2(1)构建的含有野生型吡咯赖氨酸氨酰tRNA合成酶pLH-MmPylRS的质粒是表达野生吡咯赖氨酸氨酰-tRNA合成酶的质粒,野生吡咯赖氨酸氨酰-tRNA合成酶可以催化N(ε)-Boc-L-lysine[N(ε)-Boc-L-赖氨酸]与tRNA的连接反应,生成可特异性识别N(ε)-Boc-L-lysine[N(ε)-Boc-L-赖氨酸]以及TAG琥珀终止密码子的N(ε)-Boc-L-赖氨酰-tRNA,将N(ε)-Boc-L-lysine[N(ε)-Boc-L-赖氨酸]特异性地引入到目标蛋白中。因此在诱导蛋白表达后,含有野生型吡咯赖氨酸氨酰tRNA合成酶pLH-MmPylRS的质粒表达的野生吡咯赖氨酸氨酰-tRNA合成酶所产生的tRNA顺利通读实施例1构建的pSel质粒上的终止密码子,将N(ε)-Boc-L-lysine[N(ε)-Boc-L-赖氨酸]引入到pSel质粒中的T7RNA聚合酶以及氯霉素抗性蛋白中,从而发挥双报告基因的功能。
将实施例1构建的筛选质粒pSel和实施例2(1)构建的含有野生型吡咯赖氨酸氨酰tRNA合成酶pLH-MmPylRS的质粒质粒共转至大肠杆菌化学感受态细胞DH10B中,37℃,220rpm复苏1h后将菌液涂布至含有Kana 50(硫酸卡那霉素,50μg/mL)和Tet 12.5(四环素,12.5μg/mL)双重抗性的LB平板上,于37℃培养箱倒置培养12h,得到转化子。
挑转化子于5mL LB/Kana 50、Tet 12.5中,于37℃,220rpm摇4.5h,OD达到0.531。
1:10稀释后分装至3mL LB/Kana 50、Tet 12.5、Ara(阿拉伯糖)0.2%中,
其中a管加Cm 50;b管加Cm 50、BocK1 mM,37℃,220rpm摇床过夜。测OD值,取1mL菌液,1300rpm离心10min,弃上清,在365nm激发波长下观察菌体的荧光。将菌体重悬在1mL无菌1x PBS缓冲液中,测定菌液的相对荧光单位RFU。结果显示:添加BocK的菌体在365nm激发波长下产生荧光,不添加BocK的菌体在365nm激发波长下未产生荧光,说明筛选系统可以准确直观地指示阳性结果(图2a);不加入BocK的重悬菌液RFU为0.84×108,加入BocK的重悬菌液RFU为5.19×108,说明筛选系统指示的阳性结果准确可靠,筛选结果可被定量检测(图2b)。综上所述,该筛选系统能够通过定性或定量的荧光信号直观准确地指示阳性结果,可用于其他氨酰tRNA合成酶的筛选。
实施例4定点引入乳酰化赖氨酸的氨酰tRNA合成酶的筛选
实施例1构建的筛选质粒pSel和实施例2(2)构建的pLH进行筛选。筛选步骤如下:
(1)构建含有筛选质粒pSel的大肠杆菌DH10B感受态细胞。具体的:将筛选质粒pSel转化至DH10B化学感受态细胞,涂于Tet12.5抗性平板上,于37℃培养箱培养过夜,次日挑单克隆于5mL LB/Tet 12.5中,于37℃,220rpm摇过夜作为种子液,测OD值为1.8,1:50扩大培养于200mL LB/Tet 12.5中,37℃,220r/min振荡培养2-3h至OD600达到0.5左右(OD600范围0.4-0.6),将菌液分装至预冷的50ml聚丙烯离心管中,4000×g,4℃,离心20min,倒尽吸尽残夜,回收细胞。含有筛选质粒pSel的大肠杆菌DH10B化转感受态的制备:每50ml菌体用4ml预冷的CCMB80 Buffer(配方:10mM醋酸钾pH 7.0;80mM二水合氯化钙;20mM四水合氯化锰;10mM六水合氯化镁;10%甘油,稀盐酸调pH至6.4,过除菌滤膜,存于4℃)重悬,4000×g,4℃,离心20min,倒尽吸尽残夜,回收细胞,再次用4ml预冷的CCMB80 Buffer重悬,将细胞分装成小份,100μL/份,液氮速冻后放于-80℃冻存。全程低温操作。含有筛选质粒pSel的大肠杆菌DH10B电转感受态的制备:每50ml菌体用50ml预冷的10%甘油-水缓冲液重悬,4000×g,4℃,离心20min,倒尽吸尽残夜,回收细胞。重复1次以上操作,将细胞分装成小份,100μL/份,液氮速冻后放于-80℃冻存。全程低温操作。
将实施例2(2)构建的pLH筛选质粒通过化学转化或电转导入本实施例(1)构建的含有筛选质粒pSel的大肠杆菌DH10B感受态细胞,涂于含有卡那霉素Kana 50、四环素Tet12.5、氯霉素Cm50三重抗性以及0.2%阿拉伯糖、1mM Kla的LB平板上,其中卡那霉素和四环素分别为pSel和pLH自身携带的抗性标签,其作用是保持pSel和pLH质粒在大肠杆菌中的复制,氯霉素用于目标酶的筛选。于37℃培养箱倒置培养36-72h,365nm紫外光下观察,挑取所有能发出荧光的单克隆分别于50μL LB培养基中,各孔取2μL分别点滴于含有Kana 50、Tet12.5、Cm50三重抗性以及0.2%阿拉伯糖、含或者不含1mM Kla的LB平板上,在37℃培养36-72h后观察荧光,结果如图7所示,图7a显示在含有Kana 50、Tet 12.5、Cm50三重抗性以及0.2%阿拉伯糖、含有1mM Kla的LB平板上在365nm波长紫外下观察阳性菌落能生长且可以发出荧光,图7b为含有Kana 50、Tet 12.5、Cm50三重抗性以及0.2%阿拉伯糖,但不含有1mMKla的LB平板上在365nm波长紫外下观察菌体不能生长且不发出荧光,初步筛选出可定点引入乳酰化赖氨酸的氨酰tRNA合成酶。取在含1mM Kla的LB平板上能生长且发光,在不含1mMKla的LB平板上不能生长且不发光的阳性菌团于5mL LB/Kana30、Tet 12.5中过夜,提质粒,送测序,测序引物序列:pLH-PylRS-seq-F:5’-AACGGCTTTTTCAAAAATATG-3’(SEQ IDNO.34);pLH-PylRS-seq-R:5’-gtatctttatagtcctgtcgggt-3’(SEQ ID NO.35)。归纳测序结果即获得三种可定点引入乳酰化赖氨酸的氨酰tRNA合成酶,分别为Enz1、Enz2、Enz3,其蛋白序列分别示于SEQ ID NO.7、SEQ ID NO.8、SEQ ID NO.9。
(3)以EGFP为待修饰的目标蛋白,用于验证Kla等非天然氨基酸的引入,乳酰化赖氨酸修饰的目标位点为4号位:
本实施例中采用pEvol-EGFP-4*表达EGFP,pEvol-EGFP-4*构建使用SpeI和SalI对pEvol-PylRS(爱迪基因,编号137976)载体进行双酶切,切胶回后收获得线性化载体pEvol-SpeI-SalI。使用引物pEvol-EGFP-SpeI-F:5’-ctaacaggaggaattactagtatggtgagctagggcgaagaa-3’(SEQ ID NO.36)和引物pEvol-EGFP-SalI-R:5’-atgatgatgatgatggtcgacttaatgatgatgatgatgatgtttgtaca-3’(SEQ ID NO.37)扩增EGFP-4TAG基因(化学合成),获得插入片段EGFP-4TAG。通过同源重组(诺唯赞C112试剂盒)连接线性化载体pEvol-SpeI-SalI和插入片段EGFP-4TAG,获得目标质粒pEvol-EGFP-4*,质粒核酸序列信息如SEQ ID NO.46所示。
pEvol-EGFP-4*表达的EGFP氨基酸序列由SEQ ID NO.53所示pEvol-EGFP-4*中EGFP部分的氨基酸1和SEQ ID NO.54所示pEvol-EGFP-4*中EGFP部分的氨基酸2组成(序列中用*表示TAG终止密码子):MVS*GEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNH
YLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH。将筛选得到的三种酶与pEvol-EGFP-4*,共转至大肠杆菌DH10B细胞,得到转化子,在液态培养中扩增培养后,加入0.2%阿拉伯糖诱导,并加入1mM乳酰化赖氨酸Kla,结果显示:加入和不加入非天然氨基酸Kla时大肠杆菌内EGFP的相对荧光值分别为Enz1:0.31×106和1.13×106、Enz2:0.25×106和0.86×106、Enz3:0.25×106和0.90×106。(图3)
选用Enz2,克隆至pEvol(大肠杆菌表达载体)和pNEU(哺乳细胞表达载体),获得pEvol-KlaRS和pNEU-KlaRS质粒。
以EGFP为待修饰的目标蛋白,用于验证Kla的引入,乳酰化赖氨酸修饰的目标位点为4号位:
本实施例中采用pBad-EGFP-4*表达EGFP,pBad-EGFP-4*的构建使用NcoI和HindIII对pBad/HisA(Invitrogen,编号V430-01)载体进行双酶切,切胶回后收获得线性化载体pBad-NcoI-HindIII。使用引物pBad-EGFP-NcoI-F:
5’-aacaggaggaattaaccatggTGAGCTAGGGCGAAGAACTGTT-3’(SEQ ID NO.38)和引物pBad-EGFP-HindIII-R:
5’-tccgccaaaacagccaagcttTTAATGATGATGATGATGATGTTTGTACA-3’(SEQ IDNO.39)扩增EGFP-4TAG基因(化学合成),获得插入片段EGFP-4TAG。通过同源重组(诺唯赞C112试剂盒)连接线性化载体pBad-NcoI-HindIII和插入片段EGFP-4TAG,获得目标质粒pBad-EGFP-4*。
EGFP氨基酸序列如下(序列中用*表示TAG终止密码子):MVS*GEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH
将pEvol-KlaRS与pBad-EGFP-4*质粒,共转至大肠杆菌DH10B化学感受态细胞中,得到转化子。在液态培养中扩增培养后,加入0.2%阿拉伯糖诱导,并加入1mM乳酰化赖氨酸Kla,结果显示加入和不加入非天然氨基酸Kla时大肠杆菌内EGFP的RFU分别为6.15×107,0.68×107,免疫印迹法表征加入和不加入Kla时大肠杆菌内EGFP的表达情况分别为加入Kla时EGFP表达蛋白全长,Anti-His(Proteintech,货号HRP-66005)和Anti-Kla(PTMBiolabs,货号PTM-1401RM)抗体孵育后显示有目的条带;不加入Kla时EGFP不能表达蛋白全长,Anti-His抗体孵育后显示有目的条带,Anti-Kla抗体孵育后无目的条带(图4)。说明本发明筛选到的吡咯赖氨酸氨酰-tRNA合成酶突变体,转入大肠杆菌后,可获得4号位被乳酰化赖氨酸修饰的均一的纯品增强绿色荧光蛋白EGFP蛋白。
以EGFP为待修饰的目标蛋白,用于验证Kla的引入,乳酰化赖氨酸修饰的目标位点为4号位:
本实施例中采用pCDNA-EGFP-4*表达EGFP,pCDNA-EGFP-4*的构建为使用BamHI和EcoRI对pCDNA3.1(+)(GeneBank编号MN996867.1)载体进行双酶切,切胶回后收获得线性化载体pCDNA-BamHI-EcoRI。使用引物pCDNA-EGFP-BamHI-F
(5’-cttggtaccgagctcggatccATGGTGAGCTAGGGCGAAGAA-3’)和引物pCDNA-EGFP-EcoRI-R
(5’-tgctggatatctgcagaattcTTAATGATGATGATGATGATGTTTGTACA-3’)扩增EGFP-4TAG基因(化学合成),获得插入片段EGFP-4TAG。通过同源重组(诺唯赞C112试剂盒)连接线性化载体pCDNA-BamHI-EcoRI和插入片段EGFP-4TAG,获得目标质粒pCDNA-EGFP-4*。
EGFP氨基酸序列如下(序列中用*表示TAG终止密码子):
MVS*GEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKHHHHHH
将pNEU-KlaRS与pcDNA-EGFP-4*质粒,瞬转至HEK293T等哺乳细胞中,将1mM Kla加入培养基。经过24小时及48小时培养,结果显示不添加1mM Kla的细胞无荧光,加入1mM Kla的细胞可观察到绿色荧光,且相比于24h,培养48h后荧光有所增强(图5),说明本发明筛选到的吡咯赖氨酸氨酰-tRNA合成酶突变体,转入哺乳细胞后,可获得4号位被乳酰化赖氨酸修饰的均一的纯品增强绿色荧光蛋白EGFP蛋白。
该方法适用于所有需要Kla修饰的蛋白。
对所公开的实施例的上述说明,使本领域专业技术人员能够实现或使用本发明。对这些实施例的多种修改对本领域的专业技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本发明的精神或范围的情况下,在其它实施例中实现。因此,本发明将不会被限制于本文所示的这些实施例,而是要符合与本文所公开的原理和新颖特点相一致的最宽的范围。
序列表
<110> 南京中医药大学
<120> 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用
<160> 54
<170> SIPOSequenceListing 1.0
<210> 1
<211> 9984
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60
gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120
ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180
tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240
aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300
ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360
ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420
ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480
gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540
acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600
ggcttactat gttggcggta cccccatcaa aaaaatattc tcaacataaa aaactttgtg 660
taatacttgt aacgctgaat tcggaaacct gatcatgtag atcgaatgga ctctaaatcc 720
gttcagccgg gttagattcc cggggtttcc gccactgcag atccttagcg aaagctaagg 780
atttttttta ctcgagactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 840
aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc 900
actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc 960
ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa 1020
agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc 1080
agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc 1140
tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1200
gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac 1260
tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1320
gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt 1380
agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg 1440
tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt 1500
cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc 1560
aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca 1620
atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc 1680
atgtttgaca gcttatcatc gataagcttt aatgcggtag tttatcacag ttaaattgct 1740
aacgcagtca ggcaccgtgt atgaaatcta acaatgcgct catcgtcatc ctcggcaccg 1800
tcaccctgga tgctgtaggc ataggcttgg ttatgccggt actgccgggc ctcttgcggg 1860
atatcgtcca ttccgacagc atcgccagtc actatggcgt gctgctagcg ctatatgcgt 1920
tgatgcaatt tctatgcgca cccgttctcg gagcactgtc cgaccgcttt ggccgccgcc 1980
cagtcctgct cgcttcgcta cttggagcca ctatcgacta cgcgatcatg gcgaccacac 2040
ccgtcctgtg gatcctctac gccggacgca tcgtggccgg catcaccggc gccacaggtg 2100
cggttgctgg cgcctatatc gccgacatca ccgatgggga agatcgggct cgccacttcg 2160
ggctcatgag cgcttgtttc ggcgtgggta tggtggcagg ccccgtggcc gggggactgt 2220
tgggcgccat ctccttgcat gcaccattcc ttgcggcggc ggtgctcaac ggcctcaacc 2280
tactactggg ctgcttccta atgcaggagt cgcataaggg agagcgtcga ccgatgccct 2340
tgagagcctt caacccagtc agctccttcc ggtgggcgcg gggcatgact atcgtcgccg 2400
cacttatgac tgtcttcttt atcatgcaac tcgtaggaca ggtgccggca gcgctctggg 2460
tcattttcgg cgaggaccgc tttcgctgga gcgcgacgat gatcggcctg tcgcttgcgg 2520
tattcggaat cttgcacgcc ctcgctcaag ccttcgtcac tggtcccgcc accaaacgtt 2580
tcggcgagaa gcaggccatt atcgccggca tggcggccga cgcgctgggc tacgtcttgc 2640
tggcgttcgc gacgcgaggc tggatggcct tccccattat gattcttctc gcttccggcg 2700
gcatcgggat gcccgcgttg caggccatgc tgtccaggca ggtagatgac gaccatcagg 2760
gacagcttca aggatcgctc gcggctctta ccagcctaac ttcgatcatt ggaccgctga 2820
tcgtcacggc gatttatgcc gcctcggcga gcacatggaa cgggttggca tggattgtag 2880
gcgccgccct ataccttgtc tgcctccccg cgttgcgtcg cggtgcatgg agccgggcca 2940
cctcgacctg aatggaagcc ggcggcacct cgctaacgga ttcaccactc caagaattgg 3000
agccaatcaa ttcttgcgga gaactgtgaa tgcgcaaacc aacccttggc agaacatatc 3060
catcgcgtcc gccatctcca gcagccgcac gcggcgcatc tcgggcagcg ttgggtcctg 3120
gccacgggtg cgcatgatcg tgctcctgtc gttgaggacc cggctaggct ggcggggttg 3180
ccttactggt tagcagaatg aatcaccgat acgcgagcga acgtgaagcg actgctgctg 3240
caaaacgtct gcgacctgag caacaacatg aatggtcttc ggtttccgtg tttcgtaaag 3300
tctggaaacg cggaagtccc ctacgtgctg ctgaagttgc ccgcaacaga gagtggaacc 3360
aaccggtgat accacgatac tatgactgag agtcaacgcc agctgtgacc gtctccggga 3420
gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag cagatcaatt 3480
cgcgcgcgaa ggcgaagcgg catgcataat gtgcctgtca aatggacgaa gcagggattc 3540
tgcaaaccct atgctactcc gtcaagccgt caattgtctg attcgttacc aattatgaca 3600
acttgacggc tacatcattc actttttctt cacaaccggc acggaactcg ctcgggctgg 3660
ccccggtgca ttttttaaat acccgcgaga aatagagttg atcgtcaaaa ccaacattgc 3720
gaccgacggt ggcgataggc atccgggtgg tgctcaaaag cagcttcgcc tggctgatac 3780
gttggtcctc gcgccagctt aagacgctaa tccctaactg ctggcggaaa agatgtgaca 3840
gacgcgacgg cgacaagcaa acatgctgtg cgacgctggc gatatcaaaa ttgctgtctg 3900
ccaggtgatc gctgatgtac tgacaagcct cgcgtacccg attatccatc ggtggatgga 3960
gcgactcgtt aatcgcttcc atgcgccgca gtaacaattg ctcaagcaga tttatcgcca 4020
gcagctccga atagcgccct tccccttgcc cggcgttaat gatttgccca aacaggtcgc 4080
tgaaatgcgg ctggtgcgct tcatccgggc gaaagaaccc cgtattggca aatattgacg 4140
gccagttaag ccattcatgc cagtaggcgc gcggacgaaa gtaaacccac tggtgatacc 4200
attcgcgagc ctccggatga cgaccgtagt gatgaatctc tcctggcggg aacagcaaaa 4260
tatcacccgg tcggcaaaca aattctcgtc cctgattttt caccaccccc tgaccgcgaa 4320
tggtgagatt gagaatataa cctttcattc ccagcggtcg gtcgataaaa aaatcgagat 4380
aaccgttggc ctcaatcggc gttaaacccg ccaccagatg ggcattaaac gagtatcccg 4440
gcagcagggg atcattttgc gcttcagcca tacttttcat actcccgcca ttcagagaag 4500
aaaccaattg tccatattgc atcagacatt gccgtcactg cgtcttttac tggctcttct 4560
cgctaaccaa accggtaacc ccgcttatta aaagcattct gtaacaaagc gggaccaaag 4620
ccatgacaaa aacgcgtaac aaaagtgtct ataatcacgg cagaaaagtc cacattgatt 4680
atttgcacgg cgtcacactt tgctatgcca tagcattttt atccataaga ttagcggatc 4740
ctacctgacg ctttttatcg caactctcta ctgtttctcc atacccgttt tttgggctaa 4800
caggaggaat taaccatgac catgattacc gtgcactaga acacgattaa catcgctaag 4860
aacgacttct ctgacatcga actggctgct atcccgttca acactctggc tgaccattac 4920
ggtgagcgtt tagctcgcga acagttggcc cttgagcatg agtcttacga gatgggtgaa 4980
gcacgcttcc gcaagatgtt tgagcgtcaa cttaaagctg gtgaggttgc ggataacgct 5040
gccgccaagc ctctcatcac taccctactc cctaagatga ttgcacgcat caacgactgg 5100
tttgaggaag tgaaagctaa gcgcggcaag cgcccgacag ccttccagtt cctgtaggaa 5160
atcaagccgg aagccgtagc gtacatcacc attaagacca ctctggcttg cctaaccagt 5220
gctgacaata caaccgttca ggctgtagca agcgcaatcg gtcgggccat tgaggacgag 5280
gctcgcttcg gtcgtatccg tgaccttgaa gctaagcact tcaagaaaaa cgttgaggaa 5340
caactcaaca agcgcgtagg gcacgtctac aagaaagcat ttatgcaagt tgtcgaggct 5400
gacatgctct ctaagggtct actcggtggc gaggcgtggt cttcgtggca taaggaagac 5460
tctattcatg taggagtacg ctgcatcgag atgctcattg agtcaaccgg aatggttagc 5520
ttacaccgcc aaaatgctgg cgtagtaggt caagactctg agactatcga actcgcacct 5580
gaatacgctg aggctatcgc aacccgtgca ggtgcgctgg ctggcatctc tccgatgttc 5640
caaccttgcg tagttcctcc taagccgtgg actggcatta ctggtggtgg ctattgggct 5700
aacggtcgtc gtcctctggc gctggtgcgt actcacagta agaaagcact gatgcgctac 5760
gaagacgttt acatgcctga ggtgtacaaa gcgattaaca ttgcgcaaaa caccgcatgg 5820
aaaatcaaca agaaagtcct agcggtcgcc aacgtaatca ccaagtggaa gcattgtccg 5880
gtcgaggaca tccctgcgat tgagcgtgaa gaactcccga tgaaaccgga agacatcgac 5940
atgaatcctg aggctctcac cgcgtggaaa cgtgctgccg ctgctgtgta ccgcaaggac 6000
aaggctcgca agtctcgccg tatcagcctt gagttcatgc ttgagcaagc caataagttt 6060
gctaaccata aggccatctg gttcccttac aacatggact ggcgcggtcg tgtttacgct 6120
gtgtcaatgt tcaacccgca aggtaacgat atgaccaaag gactgcttac gctggcgaaa 6180
ggtaaaccaa tcggtaagga aggttactac tggctgaaaa tccacggtgc aaactgtgcg 6240
ggtgtcgata aggttccgtt ccctgagcgc atcaagttca ttgaggaaaa ccacgagaac 6300
atcatggctt gcgctaagtc tccactggag aacacttggt gggctgagca agattctccg 6360
ttctgcttcc ttgcgttctg ctttgagtac gctggggtac agcaccacgg cctgagctat 6420
aactgctccc ttccgctggc gtttgacggg tcttgctctg gcatccagca cttctccgcg 6480
atgctccgag atgaggtagg tggtcgcgcg gttaacttgc ttcctagtga aaccgttcag 6540
gacatctacg ggattgttgc taagaaagtc aacgagattc tacaagcaga cgcaatcaat 6600
gggaccgata acgaagtagt taccgtgacc gatgagaaca ctggtgaaat ctctgagaaa 6660
gtcaagctgg gcactaaggc actggctggt caatggctgg cttacggtgt tactcgcagt 6720
gtgactaagc gttcagtcat gacgctggct tacgggtcca aagagttcgg cttccgtcaa 6780
caagtgctgg aagataccat tcagccagct attgattccg gcaagggtct gatgttcact 6840
cagccgaatc aggctgctgg atacatggct aagctgattt gggaatctgt gagcgtgacg 6900
gtggtagctg cggttgaagc aatgaactgg cttaagtctg ctgctaagct gctggctgct 6960
gaggtcaaag ataagaagac tggagagatt cttcgcaagc gttgcgctgt gcattgggta 7020
actcctgatg gtttccctgt gtggcaggaa tacaagaagc ctattcagac gcgcttgaac 7080
ctgatgttcc tcggtcagtt ccgcttacag cctaccatta acaccaacaa agatagcgag 7140
attgatgcac acaaacagga gtctggtatc gctcctaact ttgtacacag ccaagacggt 7200
agccaccttc gtaagactgt agtgtgggca cacgagaagt acggaatcga atcttttgca 7260
ctgattcacg actccttcgg taccattccg gctgacgctg cgaacctgtt caaagcagtg 7320
cgcgaaacta tggttgacac atatgagtct tgtgatgtac tggctgattt ctacgaccag 7380
ttcgctgacc agttgcacga gtctcaattg gacaaaatgc cagcacttcc ggctaaaggt 7440
aacttgaacc tccgtgacat cttagagtcg gacttcgcgt tcgcgtaaaa gcttggctgt 7500
tttggcggat gagagaagat tttcagcctg atacagatta aatcagaacg cagaagcggt 7560
ctgataaaac agaatttgcc tggcggcagt agcgcggtgg tcccacctga ccccatgccg 7620
aactcagaag tgaaacgccg taacgccgat ggtagtgtgg ggtctcccca tgcgagagta 7680
gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcgttt 7740
tatctgttgt ttgtcggtga acgctctcct gagtaggaca aatccgccgg gagcggattt 7800
gaacgttgcg aagcaacggc ccggagggtg gcgggcagga cgcccgccat aaactgccag 7860
gcatcaaatt aagcagaagg ccatcctgac ggatggcctt tttgcgtttc tacaaactct 7920
cagcaaaaaa cccctcaaga cccgtttaga ggccccaagg ggttatgcta gttattgctc 7980
agcggtggca gcagccaact cagcttcctt tcgggctttg ttagcagccg gatctcagtg 8040
gtggtggtgg tggtgctcga gtgcggccgc aagcttttag gagcctttgt acagttcgtc 8100
catgccgtgg gtaatgccgg cggcggtcac aaattccagc agcaccatgt ggtcccgttt 8160
ttcgttgggg tctttgctca gcacgctctg ggtggacagg tagtggttgt caggcagcag 8220
cacggggccg tcgccgatgg gggtgttctg ctggtagtgg tcggccagct gcacgctgcc 8280
gtcctccacg ttgtgtctga ttttgaagtt ggccttgatg ccgttcttct gcttgtcggc 8340
ggtgatatac acgttatggc tgttgaagtt atactccagc ttgtgtccca ggatgttgcc 8400
gtcctccttg aagtcgatgc ccttcagctc aatccggttc accagggtgt cgccctcgaa 8460
cttcacctcg gcccgggtct tataggtgcc gtcgtccttg aagctgatgg ttctctcctg 8520
cacgtagcct tcgggcatgg cgctcttgaa aaagtcgtgt ctcttcatgt ggtcggggta 8580
tctgctgaag cactgcactc cgtaggtcag ggttgtcacc agtgttggcc atggcacggg 8640
cagcttcccg gttgtgcaaa tgaacttcag tgtcagcttg ccgttggtgg cgtctccctc 8700
tccctctcct ctcacgctga acttgtggcc gttcacgtcg ccgtccagct ccaccaggat 8760
gggcaccacc ccggtgaaca gctcctcccc cttgctcata tggctgccgc gcggcaccag 8820
gccgctgctg tgatgatgat gatgatggct gctgcccatg gtatatctcc ttcttaaagt 8880
taaacaaaat tatttctaga ggggaattgt tatccgctca caattcccct atagtgagtc 8940
gtattaattt cgcgggatcg tgagcggcct catttcttat tctgagttac aacagtccgc 9000
accgctgtcc ggtagctcct tccggtgggc gcggggcatg actatcgtcg ccgcacttat 9060
gactgtcttc tttatcatgc aactcgtagg acaggtgccg gcagcgccca acagtccccc 9120
ggccacgggg cctgccacca tacccacgcc gaaacaagcg ccctgcacca ttatgttccg 9180
gatctgcatc gcaggatgct gctggctacc ctgtggaaca cctacatctg tattaacgaa 9240
gcgctaaccg tttttatcag gctctgggag gcagaataaa tgatcatatc gtcaattatt 9300
acctccacgg ggagagcctg agcaaactgg cctcaggcat ttgagaagca cacggtcaca 9360
ctgcttccgg tagtcaataa accggtaaac cagcaataga cataagcggc tatttaacga 9420
ccctgccctg aaccgacgac cgggtcgaat ttgctttcga atttctgcca ttcatccgct 9480
tattatcact tattcaggcg tagcaccagg cgtttaaggg caccaataac tgccttaaaa 9540
aaattacgcc ccgccctgcc actcatcgca gtactgttgt aattcattaa gcattctgcc 9600
gacatggaag ccatcacaga cggcatgatg aacctgaatc gccagcggca tcagcacctt 9660
gtcgccttgc gtataatatt tgcccatggt gaaaacgggg gcgaagaagt tgtccatatt 9720
ggccacgttt aaatcaaaac tggtgaaact cacccaggga ttggctgaga cgaaaaacat 9780
attctcaata aaccctttag ggaaataggc caggttttca ccgtaacacg ccacatcttg 9840
cgaatatatg tgtagaaact gccggaaatc ctagtggtat tcactccaga gcgatgaaaa 9900
cgtttcagtt tgctcatgga aaacggtgta acaagggtga acactatccc atatcaccag 9960
ctcaccgtct ttcattgcca tacg 9984
<210> 2
<211> 4425
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60
gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120
ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180
tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240
aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300
ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360
ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420
ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480
gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540
acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600
ggcttactat gttggcggta cccccatcaa aaaaatattc tcaacataaa aaactttgtg 660
taatacttgt aacgctgaat tcggaaacct gatcatgtag atcgaatgga ctctaaatcc 720
gttcagccgg gttagattcc cggggtttcc gccactgcag atccttagcg aaagctaagg 780
atttttttta ctcgagactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 840
aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc 900
actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc 960
ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa 1020
agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc 1080
agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc 1140
tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1200
gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac 1260
tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1320
gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt 1380
agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg 1440
tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt 1500
cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc 1560
aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca 1620
atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc 1680
atgtttgaca gcttatcatc gataagcttt aatgcggtag tttatcacag ttaaattgct 1740
aacgcagtca ggcaccgtgt atgaaatcta acaatgcgct catcgtcatc ctcggcaccg 1800
tcaccctgga tgctgtaggc ataggcttgg ttatgccggt actgccgggc ctcttgcggg 1860
atatcgtcca ttccgacagc atcgccagtc actatggcgt gctgctagcg ctatatgcgt 1920
tgatgcaatt tctatgcgca cccgttctcg gagcactgtc cgaccgcttt ggccgccgcc 1980
cagtcctgct cgcttcgcta cttggagcca ctatcgacta cgcgatcatg gcgaccacac 2040
ccgtcctgtg gatcctctac gccggacgca tcgtggccgg catcaccggc gccacaggtg 2100
cggttgctgg cgcctatatc gccgacatca ccgatgggga agatcgggct cgccacttcg 2160
ggctcatgag cgcttgtttc ggcgtgggta tggtggcagg ccccgtggcc gggggactgt 2220
tgggcgccat ctccttgcat gcaccattcc ttgcggcggc ggtgctcaac ggcctcaacc 2280
tactactggg ctgcttccta atgcaggagt cgcataaggg agagcgtcga ccgatgccct 2340
tgagagcctt caacccagtc agctccttcc ggtgggcgcg gggcatgact atcgtcgccg 2400
cacttatgac tgtcttcttt atcatgcaac tcgtaggaca ggtgccggca gcgctctggg 2460
tcattttcgg cgaggaccgc tttcgctgga gcgcgacgat gatcggcctg tcgcttgcgg 2520
tattcggaat cttgcacgcc ctcgctcaag ccttcgtcac tggtcccgcc accaaacgtt 2580
tcggcgagaa gcaggccatt atcgccggca tggcggccga cgcgctgggc tacgtcttgc 2640
tggcgttcgc gacgcgaggc tggatggcct tccccattat gattcttctc gcttccggcg 2700
gcatcgggat gcccgcgttg caggccatgc tgtccaggca ggtagatgac gaccatcagg 2760
gacagcttca aggatcgctc gcggctctta ccagcctaac ttcgatcatt ggaccgctga 2820
tcgtcacggc gatttatgcc gcctcggcga gcacatggaa cgggttggca tggattgtag 2880
gcgccgccct ataccttgtc tgcctccccg cgttgcgtcg cggtgcatgg agccgggcca 2940
cctcgacctg aatggaagcc ggcggcacct cgctaacgga ttcaccactc caagaattgg 3000
agccaatcaa ttcttgcgga gaactgtgaa tgcgcaaacc aacccttggc agaacatatc 3060
catcgcgtcc gccatctcca gcagccgcac gcggcgcatc tcgggcagcg ttgggtcctg 3120
gccacgggtg cgcatgatcg tgctcctgtc gttgaggacc cggctaggct ggcggggttg 3180
ccttactggt tagcagaatg aatcaccgat acgcgagcga acgtgaagcg actgctgctg 3240
caaaacgtct gcgacctgag caacaacatg aatggtcttc ggtttccgtg tttcgtaaag 3300
tctggaaacg cggaagtccc ctacgtgctg ctgaagttgc ccgcaacaga gagtggaacc 3360
aaccggtgat accacgatac tatgactgag agtcaacgcc atgagcggcc tcatttctta 3420
ttctgagtta caacagtccg caccgctgtc cggtagctcc ttccggtggg cgcggggcat 3480
gactatcgtc gccgcactta tgactgtctt ctttatcatg caactcgtag gacaggtgcc 3540
ggcagcgccc aacagtcccc cggccacggg gcctgccacc atacccacgc cgaaacaagc 3600
gccctgcacc attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac 3660
acctacatct gtattaacga agcgctaacc gtttttatca ggctctggga ggcagaataa 3720
atgatcatat cgtcaattat tacctccacg gggagagcct gagcaaactg gcctcaggca 3780
tttgagaagc acacggtcac actgcttccg gtagtcaata aaccggtaaa ccagcaatag 3840
acataagcgg ctatttaacg accctgccct gaaccgacga ccgggtcgaa tttgctttcg 3900
aatttctgcc attcatccgc ttattatcac ttattcaggc gtagcaccag gcgtttaagg 3960
gcaccaataa ctgccttaaa aaaattacgc cccgccctgc cactcatcgc agtactgttg 4020
taattcatta agcattctgc cgacatggaa gccatcacag acggcatgat gaacctgaat 4080
cgccagcggc atcagcacct tgtcgccttg cgtataatat ttgcccatgg tgaaaacggg 4140
ggcgaagaag ttgtccatat tggccacgtt taaatcaaaa ctggtgaaac tcacccaggg 4200
attggctgag acgaaaaaca tattctcaat aaacccttta gggaaatagg ccaggttttc 4260
accgtaacac gccacatctt gcgaatatat gtgtagaaac tgccggaact agtcgtggta 4320
ttcactccag agcgatgaaa acgtttcagt ttgctcatgg aaaacggtgt aacaagggtg 4380
aacactatcc catatcacca gctcaccgtc tttcattgcc atacg 4425
<210> 3
<211> 1040
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
cagcaaaaaa cccctcaaga cccgtttaga ggccccaagg ggttatgcta gttattgctc 60
agcggtggca gcagccaact cagcttcctt tcgggctttg ttagcagccg gatctcagtg 120
gtggtggtgg tggtgctcga gtgcggccgc aagcttttag gagcctttgt acagttcgtc 180
catgccgtgg gtaatgccgg cggcggtcac aaattccagc agcaccatgt ggtcccgttt 240
ttcgttgggg tctttgctca gcacgctctg ggtggacagg tagtggttgt caggcagcag 300
cacggggccg tcgccgatgg gggtgttctg ctggtagtgg tcggccagct gcacgctgcc 360
gtcctccacg ttgtgtctga ttttgaagtt ggccttgatg ccgttcttct gcttgtcggc 420
ggtgatatac acgttatggc tgttgaagtt atactccagc ttgtgtccca ggatgttgcc 480
gtcctccttg aagtcgatgc ccttcagctc aatccggttc accagggtgt cgccctcgaa 540
cttcacctcg gcccgggtct tataggtgcc gtcgtccttg aagctgatgg ttctctcctg 600
cacgtagcct tcgggcatgg cgctcttgaa aaagtcgtgt ctcttcatgt ggtcggggta 660
tctgctgaag cactgcactc cgtaggtcag ggttgtcacc agtgttggcc atggcacggg 720
cagcttcccg gttgtgcaaa tgaacttcag tgtcagcttg ccgttggtgg cgtctccctc 780
tccctctcct ctcacgctga acttgtggcc gttcacgtcg ccgtccagct ccaccaggat 840
gggcaccacc ccggtgaaca gctcctcccc cttgctcata tggctgccgc gcggcaccag 900
gccgctgctg tgatgatgat gatgatggct gctgcccatg gtatatctcc ttcttaaagt 960
taaacaaaat tatttctaga ggggaattgt tatccgctca caattcccct atagtgagtc 1020
gtattaattt cgcgggatcg 1040
<210> 4
<211> 4519
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc 60
gcgaggcagc agatcaattc gcgcgcgaag gcgaagcggc atgcataatg tgcctgtcaa 120
atggacgaag cagggattct gcaaacccta tgctactccg tcaagccgtc aattgtctga 180
ttcgttacca attatgacaa cttgacggct acatcattca ctttttcttc acaaccggca 240
cggaactcgc tcgggctggc cccggtgcat tttttaaata cccgcgagaa atagagttga 300
tcgtcaaaac caacattgcg accgacggtg gcgataggca tccgggtggt gctcaaaagc 360
agcttcgcct ggctgatacg ttggtcctcg cgccagctta agacgctaat ccctaactgc 420
tggcggaaaa gatgtgacag acgcgacggc gacaagcaaa catgctgtgc gacgctggcg 480
atatcaaaat tgctgtctgc caggtgatcg ctgatgtact gacaagcctc gcgtacccga 540
ttatccatcg gtggatggag cgactcgtta atcgcttcca tgcgccgcag taacaattgc 600
tcaagcagat ttatcgccag cagctccgaa tagcgccctt ccccttgccc ggcgttaatg 660
atttgcccaa acaggtcgct gaaatgcggc tggtgcgctt catccgggcg aaagaacccc 720
gtattggcaa atattgacgg ccagttaagc cattcatgcc agtaggcgcg cggacgaaag 780
taaacccact ggtgatacca ttcgcgagcc tccggatgac gaccgtagtg atgaatctct 840
cctggcggga acagcaaaat atcacccggt cggcaaacaa attctcgtcc ctgatttttc 900
accaccccct gaccgcgaat ggtgagattg agaatataac ctttcattcc cagcggtcgg 960
tcgataaaaa aatcgagata accgttggcc tcaatcggcg ttaaacccgc caccagatgg 1020
gcattaaacg agtatcccgg cagcagggga tcattttgcg cttcagccat acttttcata 1080
ctcccgccat tcagagaaga aaccaattgt ccatattgca tcagacattg ccgtcactgc 1140
gtcttttact ggctcttctc gctaaccaaa ccggtaaccc cgcttattaa aagcattctg 1200
taacaaagcg ggaccaaagc catgacaaaa acgcgtaaca aaagtgtcta taatcacggc 1260
agaaaagtcc acattgatta tttgcacggc gtcacacttt gctatgccat agcattttta 1320
tccataagat tagcggatcc tacctgacgc tttttatcgc aactctctac tgtttctcca 1380
tacccgtttt ttgggctaac aggaggaatt aaccatgacc atgattaccg tgcactagaa 1440
cacgattaac atcgctaaga acgacttctc tgacatcgaa ctggctgcta tcccgttcaa 1500
cactctggct gaccattacg gtgagcgttt agctcgcgaa cagttggccc ttgagcatga 1560
gtcttacgag atgggtgaag cacgcttccg caagatgttt gagcgtcaac ttaaagctgg 1620
tgaggttgcg gataacgctg ccgccaagcc tctcatcact accctactcc ctaagatgat 1680
tgcacgcatc aacgactggt ttgaggaagt gaaagctaag cgcggcaagc gcccgacagc 1740
cttccagttc ctgtaggaaa tcaagccgga agccgtagcg tacatcacca ttaagaccac 1800
tctggcttgc ctaaccagtg ctgacaatac aaccgttcag gctgtagcaa gcgcaatcgg 1860
tcgggccatt gaggacgagg ctcgcttcgg tcgtatccgt gaccttgaag ctaagcactt 1920
caagaaaaac gttgaggaac aactcaacaa gcgcgtaggg cacgtctaca agaaagcatt 1980
tatgcaagtt gtcgaggctg acatgctctc taagggtcta ctcggtggcg aggcgtggtc 2040
ttcgtggcat aaggaagact ctattcatgt aggagtacgc tgcatcgaga tgctcattga 2100
gtcaaccgga atggttagct tacaccgcca aaatgctggc gtagtaggtc aagactctga 2160
gactatcgaa ctcgcacctg aatacgctga ggctatcgca acccgtgcag gtgcgctggc 2220
tggcatctct ccgatgttcc aaccttgcgt agttcctcct aagccgtgga ctggcattac 2280
tggtggtggc tattgggcta acggtcgtcg tcctctggcg ctggtgcgta ctcacagtaa 2340
gaaagcactg atgcgctacg aagacgttta catgcctgag gtgtacaaag cgattaacat 2400
tgcgcaaaac accgcatgga aaatcaacaa gaaagtccta gcggtcgcca acgtaatcac 2460
caagtggaag cattgtccgg tcgaggacat ccctgcgatt gagcgtgaag aactcccgat 2520
gaaaccggaa gacatcgaca tgaatcctga ggctctcacc gcgtggaaac gtgctgccgc 2580
tgctgtgtac cgcaaggaca aggctcgcaa gtctcgccgt atcagccttg agttcatgct 2640
tgagcaagcc aataagtttg ctaaccataa ggccatctgg ttcccttaca acatggactg 2700
gcgcggtcgt gtttacgctg tgtcaatgtt caacccgcaa ggtaacgata tgaccaaagg 2760
actgcttacg ctggcgaaag gtaaaccaat cggtaaggaa ggttactact ggctgaaaat 2820
ccacggtgca aactgtgcgg gtgtcgataa ggttccgttc cctgagcgca tcaagttcat 2880
tgaggaaaac cacgagaaca tcatggcttg cgctaagtct ccactggaga acacttggtg 2940
ggctgagcaa gattctccgt tctgcttcct tgcgttctgc tttgagtacg ctggggtaca 3000
gcaccacggc ctgagctata actgctccct tccgctggcg tttgacgggt cttgctctgg 3060
catccagcac ttctccgcga tgctccgaga tgaggtaggt ggtcgcgcgg ttaacttgct 3120
tcctagtgaa accgttcagg acatctacgg gattgttgct aagaaagtca acgagattct 3180
acaagcagac gcaatcaatg ggaccgataa cgaagtagtt accgtgaccg atgagaacac 3240
tggtgaaatc tctgagaaag tcaagctggg cactaaggca ctggctggtc aatggctggc 3300
ttacggtgtt actcgcagtg tgactaagcg ttcagtcatg acgctggctt acgggtccaa 3360
agagttcggc ttccgtcaac aagtgctgga agataccatt cagccagcta ttgattccgg 3420
caagggtctg atgttcactc agccgaatca ggctgctgga tacatggcta agctgatttg 3480
ggaatctgtg agcgtgacgg tggtagctgc ggttgaagca atgaactggc ttaagtctgc 3540
tgctaagctg ctggctgctg aggtcaaaga taagaagact ggagagattc ttcgcaagcg 3600
ttgcgctgtg cattgggtaa ctcctgatgg tttccctgtg tggcaggaat acaagaagcc 3660
tattcagacg cgcttgaacc tgatgttcct cggtcagttc cgcttacagc ctaccattaa 3720
caccaacaaa gatagcgaga ttgatgcaca caaacaggag tctggtatcg ctcctaactt 3780
tgtacacagc caagacggta gccaccttcg taagactgta gtgtgggcac acgagaagta 3840
cggaatcgaa tcttttgcac tgattcacga ctccttcggt accattccgg ctgacgctgc 3900
gaacctgttc aaagcagtgc gcgaaactat ggttgacaca tatgagtctt gtgatgtact 3960
ggctgatttc tacgaccagt tcgctgacca gttgcacgag tctcaattgg acaaaatgcc 4020
agcacttccg gctaaaggta acttgaacct ccgtgacatc ttagagtcgg acttcgcgtt 4080
cgcgtaaaag cttggctgtt ttggcggatg agagaagatt ttcagcctga tacagattaa 4140
atcagaacgc agaagcggtc tgataaaaca gaatttgcct ggcggcagta gcgcggtggt 4200
cccacctgac cccatgccga actcagaagt gaaacgccgt aacgccgatg gtagtgtggg 4260
gtctccccat gcgagagtag ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga 4320
aagactgggc ctttcgtttt atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa 4380
atccgccggg agcggatttg aacgttgcga agcaacggcc cggagggtgg cgggcaggac 4440
gcccgccata aactgccagg catcaaatta agcagaaggc catcctgacg gatggccttt 4500
ttgcgtttct acaaactct 4519
<210> 5
<211> 6031
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60
ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120
tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagctttt 180
aggagccttt gtacagttcg tccatgccgt gggtaatgcc ggcggcggtc acaaattcca 240
gcagcaccat gtggtcccgt ttttcgttgg ggtctttgct cagcacgctc tgggtggaca 300
ggtagtggtt gtcaggcagc agcacggggc cgtcgccgat gggggtgttc tgctggtagt 360
ggtcggccag ctgcacgctg ccgtcctcca cgttgtgtct gattttgaag ttggccttga 420
tgccgttctt ctgcttgtcg gcggtgatat acacgttatg gctgttgaag ttatactcca 480
gcttgtgtcc caggatgttg ccgtcctcct tgaagtcgat gcccttcagc tcaatccggt 540
tcaccagggt gtcgccctcg aacttcacct cggcccgggt cttataggtg ccgtcgtcct 600
tgaagctgat ggttctctcc tgcacgtagc cttcgggcat ggcgctcttg aaaaagtcgt 660
gtctcttcat gtggtcgggg tatctgctga agcactgcac tccgtaggtc agggttgtca 720
ccagtgttgg ccatggcacg ggcagcttcc cggttgtgca aatgaacttc agtgtcagct 780
tgccgttggt ggcgtctccc tctccctctc ctctcacgct gaacttgtgg ccgttcacgt 840
cgccgtccag ctccaccagg atgggcacca ccccggtgaa cagctcctcc cccttgctca 900
tatggctgcc gcgcggcacc aggccgctgc tgtgatgatg atgatgatgg ctgctgccca 960
tggtatatct ccttcttaaa gttaaacaaa attatttcta gaggggaatt gttatccgct 1020
cacaattccc ctatagtgag tcgtattaat ttcgcgggat cgagatctcg atcctctacg 1080
ccggacgcat cgtggccggc atcaccggcg ccacaggtgc ggttgctggc gcctatatcg 1140
ccgacatcac cgatggggaa gatcgggctc gccacttcgg gctcatgagc gcttgtttcg 1200
gcgtgggtat ggtggcaggc cccgtggccg ggggactgtt gggcgccatc tccttgcatg 1260
caccattcct tgcggcggcg gtgctcaacg gcctcaacct actactgggc tgcttcctaa 1320
tgcaggagtc gcataaggga gagcgtcgag atcccggaca ccatcgaatg gcgcaaaacc 1380
tttcgcggta tggcatgata gcgcccggaa gagagtcaat tcagggtggt gaatgtgaaa 1440
ccagtaacgt tatacgatgt cgcagagtat gccggtgtct cttatcagac cgtttcccgc 1500
gtggtgaacc aggccagcca cgtttctgcg aaaacgcggg aaaaagtgga agcggcgatg 1560
gcggagctga attacattcc caaccgcgtg gcacaacaac tggcgggcaa acagtcgttg 1620
ctgattggcg ttgccacctc cagtctggcc ctgcacgcgc cgtcgcaaat tgtcgcggcg 1680
attaaatctc gcgccgatca actgggtgcc agcgtggtgg tgtcgatggt agaacgaagc 1740
ggcgtcgaag cctgtaaagc ggcggtgcac aatcttctcg cgcaacgcgt cagtgggctg 1800
atcattaact atccgctgga tgaccaggat gccattgctg tggaagctgc ctgcactaat 1860
gttccggcgt tatttcttga tgtctctgac cagacaccca tcaacagtat tattttctcc 1920
catgaagacg gtacgcgact gggcgtggag catctggtcg cattgggtca ccagcaaatc 1980
gcgctgttag cgggcccatt aagttctgtc tcggcgcgtc tgcgtctggc tggctggcat 2040
aaatatctca ctcgcaatca aattcagccg atagcggaac gggaaggcga ctggagtgcc 2100
atgtccggtt ttcaacaaac catgcaaatg ctgaatgagg gcatcgttcc cactgcgatg 2160
ctggttgcca acgatcagat ggcgctgggc gcaatgcgcg ccattaccga gtccgggctg 2220
cgcgttggtg cggatatctc ggtagtggga tacgacgata ccgaagacag ctcatgttat 2280
atcccgccgt taaccaccat caaacaggat tttcgcctgc tggggcaaac cagcgtggac 2340
cgcttgctgc aactctctca gggccaggcg gtgaagggca atcagctgtt gcccgtctca 2400
ctggtgaaaa gaaaaaccac cctggcgccc aatacgcaaa ccgcctctcc ccgcgcgttg 2460
gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg 2520
caacgcaatt aatgtaagtt agctcactca ttaggcaccg ggatctcgac cgatgccctt 2580
gagagccttc aacccagtca gctccttccg gtgggcgcgg ggcatgacta tcgtcgccgc 2640
acttatgact gtcttcttta tcatgcaact cgtaggacag gtgccggcag cgctctgggt 2700
cattttcggc gaggaccgct ttcgctggag cgcgacgatg atcggcctgt cgcttgcggt 2760
attcggaatc ttgcacgccc tcgctcaagc cttcgtcact ggtcccgcca ccaaacgttt 2820
cggcgagaag caggccatta tcgccggcat ggcggcccca cgggtgcgca tgatcgtgct 2880
cctgtcgttg aggacccggc taggctggcg gggttgcctt actggttagc agaatgaatc 2940
accgatacgc gagcgaacgt gaagcgactg ctgctgcaaa acgtctgcga cctgagcaac 3000
aacatgaatg gtcttcggtt tccgtgtttc gtaaagtctg gaaacgcgga agtcagcgcc 3060
ctgcaccatt atgttccgga tctgcatcgc aggatgctgc tggctaccct gtggaacacc 3120
tacatctgta ttaacgaagc gctggcattg accctgagtg atttttctct ggtcccgccg 3180
catccatacc gccagttgtt taccctcaca acgttccagt aaccgggcat gttcatcatc 3240
agtaacccgt atcgtgagca tcctctctcg tttcatcggt atcattaccc ccatgaacag 3300
aaatccccct tacacggagg catcagtgac caaacaggaa aaaaccgccc ttaacatggc 3360
ccgctttatc agaagccaga cattaacgct tctggagaaa ctcaacgagc tggacgcgga 3420
tgaacaggca gacatctgtg aatcgcttca cgaccacgct gatgagcttt accgcagctg 3480
cctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc cggagacggt 3540
cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg cgtcagcggg 3600
tgttggcggg tgtcggggcg cagccatgac ccagtcacgt agcgatagcg gagtgtatac 3660
tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat atgcggtgtg 3720
aaataccgca cagatgcgta aggagaaaat accgcatcag gcgctcttcc gcttcctcgc 3780
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 3840
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 3900
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 3960
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 4020
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 4080
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgctttctc 4140
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag ctgggctgtg 4200
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat cgtcttgagt 4260
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac aggattagca 4320
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac tacggctaca 4380
ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc ggaaaaagag 4440
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt tttgtttgca 4500
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc ttttctacgg 4560
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg aacaataaaa 4620
ctgtctgctt acataaacag taatacaagg ggtgttatga gccatattca acgggaaacg 4680
tcttgctcta ggccgcgatt aaattccaac atggatgctg atttatatgg gtataaatgg 4740
gctcgcgata atgtcgggca atcaggtgcg acaatctatc gattgtatgg gaagcccgat 4800
gcgccagagt tgtttctgaa acatggcaaa ggtagcgttg ccaatgatgt tacagatgag 4860
atggtcagac taaactggct gacggaattt atgcctcttc cgaccatcaa gcattttatc 4920
cgtactcctg atgatgcatg gttactcacc actgcgatcc ccgggaaaac agcattccag 4980
gtattagaag aatatcctga ttcaggtgaa aatattgttg atgcgctggc agtgttcctg 5040
cgccggttgc attcgattcc tgtttgtaat tgtcctttta acagcgatcg cgtatttcgt 5100
ctcgctcagg cgcaatcacg aatgaataac ggtttggttg atgcgagtga ttttgatgac 5160
gagcgtaatg gctggcctgt tgaacaagtc tggaaagaaa tgcataaact tttgccattc 5220
tcaccggatt cagtcgtcac tcatggtgat ttctcacttg ataaccttat ttttgacgag 5280
gggaaattaa taggttgtat tgatgttgga cgagtcggaa tcgcagaccg ataccaggat 5340
cttgccatcc tatggaactg cctcggtgag ttttctcctt cattacagaa acggcttttt 5400
caaaaatatg gtattgataa tcctgatatg aataaattgc agtttcattt gatgctcgat 5460
gagtttttct aagaattaat tcatgagcgg atacatattt gaatgtattt agaaaaataa 5520
acaaataggg gttccgcgca catttccccg aaaagtgcca cctgaaattg taaacgttaa 5580
tattttgtta aaattcgcgt taaatttttg ttaaatcagc tcatttttta accaataggc 5640
cgaaatcggc aaaatccctt ataaatcaaa agaatagacc gagatagggt tgagtgttgt 5700
tccagtttgg aacaagagtc cactattaaa gaacgtggac tccaacgtca aagggcgaaa 5760
aaccgtctat cagggcgatg gcccactacg tgaaccatca ccctaatcaa gttttttggg 5820
gtcgaggtgc cgtaaagcac taaatcggaa ccctaaaggg agcccccgat ttagagcttg 5880
acggggaaag ccggcgaacg tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc 5940
tagggcgctg gcaagtgtag cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa 6000
tgcgccgcta cagggcgcgt cccattcgcc a 6031
<210> 6
<211> 6629
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
aagaaaccaa ttgtccatat tgcatcagac attgccgtca ctgcgtcttt tactggctct 60
tctcgctaac caaaccggta accccgctta ttaaaagcat tctgtaacaa agcgggacca 120
aagccatgac aaaaacgcgt aacaaaagtg tctataatca cggcagaaaa gtccacattg 180
attatttgca cggcgtcaca ctttgctatg ccatagcatt tttatccata agattagcgg 240
atcctacctg acgcttttta tcgcaactct ctactgtttc tccatacccg ttttttgggc 300
taacaggagg aattaaccat gaccatgatt accgtgcact agaacacgat taacatcgct 360
aagaacgact tctctgacat cgaactggct gctatcccgt tcaacactct ggctgaccat 420
tacggtgagc gtttagctcg cgaacagttg gcccttgagc atgagtctta cgagatgggt 480
gaagcacgct tccgcaagat gtttgagcgt caacttaaag ctggtgaggt tgcggataac 540
gctgccgcca agcctctcat cactacccta ctccctaaga tgattgcacg catcaacgac 600
tggtttgagg aagtgaaagc taagcgcggc aagcgcccga cagccttcca gttcctgtag 660
gaaatcaagc cggaagccgt agcgtacatc accattaaga ccactctggc ttgcctaacc 720
agtgctgaca atacaaccgt tcaggctgta gcaagcgcaa tcggtcgggc cattgaggac 780
gaggctcgct tcggtcgtat ccgtgacctt gaagctaagc acttcaagaa aaacgttgag 840
gaacaactca acaagcgcgt agggcacgtc tacaagaaag catttatgca agttgtcgag 900
gctgacatgc tctctaaggg tctactcggt ggcgaggcgt ggtcttcgtg gcataaggaa 960
gactctattc atgtaggagt acgctgcatc gagatgctca ttgagtcaac cggaatggtt 1020
agcttacacc gccaaaatgc tggcgtagta ggtcaagact ctgagactat cgaactcgca 1080
cctgaatacg ctgaggctat cgcaacccgt gcaggtgcgc tggctggcat ctctccgatg 1140
ttccaacctt gcgtagttcc tcctaagccg tggactggca ttactggtgg tggctattgg 1200
gctaacggtc gtcgtcctct ggcgctggtg cgtactcaca gtaagaaagc actgatgcgc 1260
tacgaagacg tttacatgcc tgaggtgtac aaagcgatta acattgcgca aaacaccgca 1320
tggaaaatca acaagaaagt cctagcggtc gccaacgtaa tcaccaagtg gaagcattgt 1380
ccggtcgagg acatccctgc gattgagcgt gaagaactcc cgatgaaacc ggaagacatc 1440
gacatgaatc ctgaggctct caccgcgtgg aaacgtgctg ccgctgctgt gtaccgcaag 1500
gacaaggctc gcaagtctcg ccgtatcagc cttgagttca tgcttgagca agccaataag 1560
tttgctaacc ataaggccat ctggttccct tacaacatgg actggcgcgg tcgtgtttac 1620
gctgtgtcaa tgttcaaccc gcaaggtaac gatatgacca aaggactgct tacgctggcg 1680
aaaggtaaac caatcggtaa ggaaggttac tactggctga aaatccacgg tgcaaactgt 1740
gcgggtgtcg ataaggttcc gttccctgag cgcatcaagt tcattgagga aaaccacgag 1800
aacatcatgg cttgcgctaa gtctccactg gagaacactt ggtgggctga gcaagattct 1860
ccgttctgct tccttgcgtt ctgctttgag tacgctgggg tacagcacca cggcctgagc 1920
tataactgct cccttccgct ggcgtttgac gggtcttgct ctggcatcca gcacttctcc 1980
gcgatgctcc gagatgaggt aggtggtcgc gcggttaact tgcttcctag tgaaaccgtt 2040
caggacatct acgggattgt tgctaagaaa gtcaacgaga ttctacaagc agacgcaatc 2100
aatgggaccg ataacgaagt agttaccgtg accgatgaga acactggtga aatctctgag 2160
aaagtcaagc tgggcactaa ggcactggct ggtcaatggc tggcttacgg tgttactcgc 2220
agtgtgacta agcgttcagt catgacgctg gcttacgggt ccaaagagtt cggcttccgt 2280
caacaagtgc tggaagatac cattcagcca gctattgatt ccggcaaggg tctgatgttc 2340
actcagccga atcaggctgc tggatacatg gctaagctga tttgggaatc tgtgagcgtg 2400
acggtggtag ctgcggttga agcaatgaac tggcttaagt ctgctgctaa gctgctggct 2460
gctgaggtca aagataagaa gactggagag attcttcgca agcgttgcgc tgtgcattgg 2520
gtaactcctg atggtttccc tgtgtggcag gaatacaaga agcctattca gacgcgcttg 2580
aacctgatgt tcctcggtca gttccgctta cagcctacca ttaacaccaa caaagatagc 2640
gagattgatg cacacaaaca ggagtctggt atcgctccta actttgtaca cagccaagac 2700
ggtagccacc ttcgtaagac tgtagtgtgg gcacacgaga agtacggaat cgaatctttt 2760
gcactgattc acgactcctt cggtaccatt ccggctgacg ctgcgaacct gttcaaagca 2820
gtgcgcgaaa ctatggttga cacatatgag tcttgtgatg tactggctga tttctacgac 2880
cagttcgctg accagttgca cgagtctcaa ttggacaaaa tgccagcact tccggctaaa 2940
ggtaacttga acctccgtga catcttagag tcggacttcg cgttcgcgta aaagcttggc 3000
tgttttggcg gatgagagaa gattttcagc ctgatacaga ttaaatcaga acgcagaagc 3060
ggtctgataa aacagaattt gcctggcggc agtagcgcgg tggtcccacc tgaccccatg 3120
ccgaactcag aagtgaaacg ccgtaacgcc gatggtagtg tggggtctcc ccatgcgaga 3180
gtagggaact gccaggcatc aaataaaacg aaaggctcag tcgaaagact gggcctttcg 3240
ttttatctgt tgtttgtcgg tgaacgctct cctgagtagg acaaatccgc cgggagcgga 3300
tttgaacgtt gcgaagcaac ggcccggagg gtggcgggca ggacgcccgc cataaactgc 3360
caggcatcaa attaagcaga aggccatcct gacggatggc ctttttgcgt ttctacaaac 3420
tcttttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct 3480
gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg 3540
cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg 3600
tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc 3660
tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca 3720
cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac 3780
tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa 3840
agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg 3900
ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt 3960
ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg 4020
aagccatacc aaacgacgag cgtgacacca cgatgcctgt agcaatggca acaacgttgc 4080
gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga 4140
tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta 4200
ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc 4260
cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg 4320
atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt 4380
cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa 4440
ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 4500
cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 4560
ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 4620
tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 4680
taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 4740
caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 4800
agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 4860
gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4920
gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4980
ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 5040
acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 5100
tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 5160
ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 5220
ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 5280
ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc 5340
ttacgcatct gtgcggtatt tcacaccgca tatggtgcac tctcagtaca atctgctctg 5400
atgccgcata gttaagccag tatacactcc gctatcgcta cgtgactggg tcatggctgc 5460
gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc 5520
cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc 5580
atcaccgaaa cgcgcgaggc agcagatcaa ttcgcgcgcg aaggcgaagc ggcatgcata 5640
atgtgcctgt caaatggacg aagcagggat tctgcaaacc ctatgctact ccgtcaagcc 5700
gtcaattgtc tgattcgtta ccaattatga caacttgacg gctacatcat tcactttttc 5760
ttcacaaccg gcacggaact cgctcgggct ggccccggtg cattttttaa atacccgcga 5820
gaaatagagt tgatcgtcaa aaccaacatt gcgaccgacg gtggcgatag gcatccgggt 5880
ggtgctcaaa agcagcttcg cctggctgat acgttggtcc tcgcgccagc ttaagacgct 5940
aatccctaac tgctggcgga aaagatgtga cagacgcgac ggcgacaagc aaacatgctg 6000
tgcgacgctg gcgatatcaa aattgctgtc tgccaggtga tcgctgatgt actgacaagc 6060
ctcgcgtacc cgattatcca tcggtggatg gagcgactcg ttaatcgctt ccatgcgccg 6120
cagtaacaat tgctcaagca gatttatcgc cagcagctcc gaatagcgcc cttccccttg 6180
cccggcgtta atgatttgcc caaacaggtc gctgaaatgc ggctggtgcg cttcatccgg 6240
gcgaaagaac cccgtattgg caaatattga cggccagtta agccattcat gccagtaggc 6300
gcgcggacga aagtaaaccc actggtgata ccattcgcga gcctccggat gacgaccgta 6360
gtgatgaatc tctcctggcg ggaacagcaa aatatcaccc ggtcggcaaa caaattctcg 6420
tccctgattt ttcaccaccc cctgaccgcg aatggtgaga ttgagaatat aacctttcat 6480
tcccagcggt cggtcgataa aaaaatcgag ataaccgttg gcctcaatcg gcgttaaacc 6540
cgccaccaga tgggcattaa acgagtatcc cggcagcagg ggatcatttt gcgcttcagc 6600
catacttttc atactcccgc cattcagag 6629
<210> 7
<211> 454
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 7
Met Asp Lys Lys Pro Leu Asn Thr Leu Ile Ser Ala Thr Gly Leu Trp
1 5 10 15
Met Ser Arg Thr Gly Thr Ile His Lys Ile Lys His His Glu Val Ser
20 25 30
Arg Ser Lys Ile Tyr Ile Glu Met Ala Cys Gly Asp His Leu Val Val
35 40 45
Asn Asn Ser Arg Ser Ser Arg Thr Ala Arg Ala Leu Arg His His Lys
50 55 60
Tyr Arg Lys Thr Cys Lys Arg Cys Arg Val Ser Asp Glu Asp Leu Asn
65 70 75 80
Lys Phe Leu Thr Lys Ala Asn Glu Asp Gln Thr Ser Val Lys Val Lys
85 90 95
Val Val Ser Ala Pro Thr Arg Thr Lys Lys Ala Met Pro Lys Ser Val
100 105 110
Ala Arg Ala Pro Lys Pro Leu Glu Asn Thr Glu Ala Ala Gln Ala Gln
115 120 125
Pro Ser Gly Ser Lys Phe Ser Pro Ala Ile Pro Val Ser Thr Gln Glu
130 135 140
Ser Val Ser Val Pro Ala Ser Val Ser Thr Ser Ile Ser Ser Ile Ser
145 150 155 160
Thr Gly Ala Thr Ala Ser Ala Leu Val Lys Gly Asn Thr Asn Pro Ile
165 170 175
Thr Ser Met Ser Ala Pro Val Gln Ala Ser Ala Pro Ala Leu Thr Lys
180 185 190
Ser Gln Thr Asp Arg Leu Glu Val Leu Leu Asn Pro Lys Asp Glu Ile
195 200 205
Ser Leu Asn Ser Gly Lys Pro Phe Arg Glu Leu Glu Ser Glu Leu Leu
210 215 220
Ser Arg Arg Lys Lys Asp Leu Gln Gln Ile Tyr Ala Glu Glu Arg Glu
225 230 235 240
Asn Tyr Leu Gly Lys Leu Glu Arg Glu Ile Thr Arg Phe Phe Val Asp
245 250 255
Arg Gly Phe Leu Glu Ile Lys Ser Pro Ile Leu Ile Pro Leu Glu Tyr
260 265 270
Ile Glu Arg Met Gly Ile Asp Asn Asp Thr Glu Leu Ser Lys Gln Ile
275 280 285
Phe Arg Val Asp Lys Asn Phe Cys Leu Arg Pro Met Val Ala Pro Asn
290 295 300
Leu Leu Asn Tyr Ala Arg Lys Leu Asp Arg Ala Leu Pro Asp Pro Ile
305 310 315 320
Lys Ile Phe Glu Ile Gly Pro Cys Tyr Arg Lys Glu Ser Asp Gly Lys
325 330 335
Glu His Leu Glu Glu Phe Thr Met Leu Asn Phe Phe Gln Met Gly Ser
340 345 350
Gly Cys Thr Arg Glu Asn Leu Glu Ser Ile Ile Thr Asp Phe Leu Asn
355 360 365
His Leu Gly Ile Asp Phe Lys Ile Val Gly Asp Ser Cys Met Val Tyr
370 375 380
Gly Asp Thr Leu Asp Val Met His Gly Asp Leu Glu Leu Ser Ser Ala
385 390 395 400
Val Val Gly Pro Ile Pro Leu Asp Arg Glu Trp Gly Ile Asp Lys Pro
405 410 415
Trp Ile Gly Ala Gly Phe Gly Leu Glu Arg Leu Leu Lys Val Lys His
420 425 430
Asp Phe Lys Asn Ile Lys Arg Ala Ala Arg Ser Glu Ser Tyr Tyr Asn
435 440 445
Gly Ile Ser Thr Asn Leu
450
<210> 8
<211> 454
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 8
Met Asp Lys Lys Pro Leu Asn Thr Leu Ile Ser Ala Thr Gly Leu Trp
1 5 10 15
Met Ser Arg Thr Gly Thr Ile His Lys Ile Lys His His Glu Val Ser
20 25 30
Arg Ser Lys Ile Tyr Ile Glu Met Ala Cys Gly Asp His Leu Val Val
35 40 45
Asn Asn Ser Arg Ser Ser Arg Thr Ala Arg Ala Leu Arg His His Lys
50 55 60
Tyr Arg Lys Thr Cys Lys Arg Cys Arg Val Ser Asp Glu Asp Leu Asn
65 70 75 80
Lys Phe Leu Thr Lys Ala Asn Glu Asp Gln Thr Ser Val Lys Val Lys
85 90 95
Val Val Ser Ala Pro Thr Arg Thr Lys Lys Ala Met Pro Lys Ser Val
100 105 110
Ala Arg Ala Pro Lys Pro Leu Glu Asn Thr Glu Ala Ala Gln Ala Gln
115 120 125
Pro Ser Gly Ser Lys Phe Ser Pro Ala Ile Pro Val Ser Thr Gln Glu
130 135 140
Ser Val Ser Val Pro Ala Ser Val Ser Thr Ser Ile Ser Ser Ile Ser
145 150 155 160
Thr Gly Ala Thr Ala Ser Ala Leu Val Lys Gly Asn Thr Asn Pro Ile
165 170 175
Thr Ser Met Ser Ala Pro Val Gln Ala Ser Ala Pro Ala Leu Thr Lys
180 185 190
Ser Gln Thr Asp Arg Leu Glu Val Leu Leu Asn Pro Lys Asp Glu Ile
195 200 205
Ser Leu Asn Ser Gly Lys Pro Phe Arg Glu Leu Glu Ser Glu Leu Leu
210 215 220
Ser Arg Arg Lys Lys Asp Leu Gln Gln Ile Tyr Ala Glu Glu Arg Glu
225 230 235 240
Asn Tyr Leu Gly Lys Leu Glu Arg Glu Ile Thr Arg Phe Phe Val Asp
245 250 255
Arg Gly Phe Leu Glu Ile Lys Ser Pro Ile Leu Ile Pro Leu Glu Tyr
260 265 270
Ile Glu Arg Met Gly Ile Asp Asn Asp Thr Glu Leu Ser Lys Gln Ile
275 280 285
Phe Arg Val Asp Lys Asn Phe Cys Leu Arg Pro Met Met Ala Pro Asn
290 295 300
Leu Leu Asn Tyr Ala Arg Lys Leu Asp Arg Ala Leu Pro Asp Pro Ile
305 310 315 320
Lys Ile Phe Glu Ile Gly Pro Cys Tyr Arg Lys Glu Ser Asp Gly Lys
325 330 335
Glu His Leu Glu Glu Phe Thr Met Leu Asn Phe Phe Gln Met Gly Ser
340 345 350
Gly Cys Thr Arg Glu Asn Leu Glu Ser Ile Ile Thr Asp Phe Leu Asn
355 360 365
His Leu Gly Ile Asp Phe Lys Ile Val Gly Asp Ser Cys Met Val Tyr
370 375 380
Gly Asp Thr Leu Asp Val Met His Gly Asp Leu Glu Leu Ser Ser Ala
385 390 395 400
Val Val Gly Pro Ile Pro Leu Asp Arg Glu Trp Gly Ile Asp Lys Pro
405 410 415
Trp Ile Gly Ala Gly Phe Gly Leu Glu Arg Leu Leu Lys Val Lys His
420 425 430
Asp Phe Lys Asn Ile Lys Arg Ala Ala Arg Ser Glu Ser Tyr Tyr Asn
435 440 445
Gly Ile Ser Thr Asn Leu
450
<210> 9
<211> 454
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 9
Met Asp Lys Lys Pro Leu Asn Thr Leu Ile Ser Ala Thr Gly Leu Trp
1 5 10 15
Met Ser Arg Thr Gly Thr Ile His Lys Ile Lys His His Glu Val Ser
20 25 30
Arg Ser Lys Ile Tyr Ile Glu Met Ala Cys Gly Asp His Leu Val Val
35 40 45
Asn Asn Ser Arg Ser Ser Arg Thr Ala Arg Ala Leu Arg His His Lys
50 55 60
Tyr Arg Lys Thr Cys Lys Arg Cys Arg Val Ser Asp Glu Asp Leu Asn
65 70 75 80
Lys Phe Leu Thr Lys Ala Asn Glu Asp Gln Thr Ser Val Lys Val Lys
85 90 95
Val Val Ser Ala Pro Thr Arg Thr Lys Lys Ala Met Pro Lys Ser Val
100 105 110
Ala Arg Ala Pro Lys Pro Leu Glu Asn Thr Glu Ala Ala Gln Ala Gln
115 120 125
Pro Ser Gly Ser Lys Phe Ser Pro Ala Ile Pro Val Ser Thr Gln Glu
130 135 140
Ser Val Ser Val Pro Ala Ser Val Ser Thr Ser Ile Ser Ser Ile Ser
145 150 155 160
Thr Gly Ala Thr Ala Ser Ala Leu Val Lys Gly Asn Thr Asn Pro Ile
165 170 175
Thr Ser Met Ser Ala Pro Val Gln Ala Ser Ala Pro Ala Leu Thr Lys
180 185 190
Ser Gln Thr Asp Arg Leu Glu Val Leu Leu Asn Pro Lys Asp Glu Ile
195 200 205
Ser Leu Asn Ser Gly Lys Pro Phe Arg Glu Leu Glu Ser Glu Leu Leu
210 215 220
Ser Arg Arg Lys Lys Asp Leu Gln Gln Ile Tyr Ala Glu Glu Arg Glu
225 230 235 240
Asn Tyr Leu Gly Lys Leu Glu Arg Glu Ile Thr Arg Phe Phe Val Asp
245 250 255
Arg Gly Phe Leu Glu Ile Lys Ser Pro Ile Leu Ile Pro Leu Glu Tyr
260 265 270
Ile Glu Arg Met Gly Ile Asp Asn Asp Thr Glu Leu Ser Lys Gln Ile
275 280 285
Phe Arg Val Asp Lys Asn Phe Cys Leu Arg Pro Met Ile Ala Pro Asn
290 295 300
Leu Leu Asn Tyr Ala Arg Lys Leu Asp Arg Ala Leu Pro Asp Pro Ile
305 310 315 320
Lys Ile Phe Glu Ile Gly Pro Cys Tyr Arg Lys Glu Ser Asp Gly Lys
325 330 335
Glu His Leu Glu Glu Phe Thr Met Leu Asn Phe Phe Gln Met Gly Ser
340 345 350
Gly Cys Thr Arg Glu Asn Leu Glu Ser Ile Ile Thr Asp Phe Leu Asn
355 360 365
His Leu Gly Ile Asp Phe Lys Ile Val Gly Asp Ser Cys Met Val Tyr
370 375 380
Gly Asp Thr Leu Asp Val Met His Gly Asp Leu Glu Leu Ser Ser Ala
385 390 395 400
Val Val Gly Pro Ile Pro Leu Asp Arg Glu Trp Gly Ile Asp Lys Pro
405 410 415
Trp Ile Gly Ala Gly Phe Gly Leu Glu Arg Leu Leu Lys Val Lys His
420 425 430
Asp Phe Lys Asn Ile Lys Arg Ala Ala Arg Ser Glu Ser Tyr Tyr Asn
435 440 445
Gly Ile Ser Thr Asn Leu
450
<210> 11
<211> 1365
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
atggataaaa agcctctgaa cactctgatt tctgcgaccg gtctgtggat gtcccgcacc 60
ggcaccatcc acaaaatcaa acaccatgaa gttagccgtt ccaaaatcta cattgaaatg 120
gcttgcggcg atcacctggt tgtcaacaac tcccgttctt ctcgtaccgc tcgcgcactg 180
cgccaccaca aatatcgcaa aacctgcaaa cgttgccgtg ttagcgatga ggacctgaac 240
aaattcctga ccaaagctaa cgaggatcag acctccgtaa aagtgaaggt agtaagcgct 300
ccgacccgta ctaaaaaggc tatgccaaaa agcgtggccc gtgccccgaa acctctggaa 360
aacaccgagg cggctcaggc tcaaccatcc ggttctaaat tttctccggc gatcccagtg 420
tccacccaag aatctgtttc cgtaccagca agcgtgtcta ccagcattag cagcatttct 480
accggtgcta ccgcttctgc gctggtaaaa ggtaacacta acccgattac tagcatgtct 540
gcaccggtac aggcaagcgc cccagctctg actaaatccc agacggaccg tctggaggtg 600
ctgctgaacc caaaggatga aatctctctg aacagcggca agcctttccg tgagctggaa 660
agcgagctgc tgtctcgtcg taaaaaggat ctgcaacaga tctacgctga ggaacgcgag 720
aactatctgg gtaagctgga gcgcgaaatt actcgcttct tcgtggatcg cggtttcctg 780
gagatcaaat ctccgattct gattccgctg gaatacattg aacgtatggg catcgataat 840
gataccgaac tgtctaaaca gatcttccgt gtggataaaa acttctgtct gcgtccgatg 900
ctggccccga acctgtacaa ctatctgcgt aaactggacc gtgccctgcc ggacccgatc 960
aaaattttcg agatcggtcc ttgctaccgt aaagagtccg acggtaaaga gcacctggaa 1020
gaattcacca tgctgaactt ctgccagatg ggtagcggtt gcacgcgtga aaacctggaa 1080
tccattatca ccgacttcct gaatcacctg ggtatcgatt tcaaaattgt tggtgacagc 1140
tgtatggtgt atggcgatac gctggatgtt atgcacggcg atctggagct gtcttccgca 1200
gtagtgggcc caatcccgct ggatcgtgag tggggtatcg acaaaccttg gatcggtgcg 1260
ggttttggtc tggagcgtct gctgaaagta aaacacgact tcaagaacat caaacgtgct 1320
gcacgttccg agtcctatta caatggtatt tctactaacc tgtaa 1365
<210> 12
<211> 9984
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60
gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120
ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat gccattggga 180
tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct tagctcctga 240
aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300
ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc agggcttccc 360
ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg tcacaggtat 420
ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480
gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt tcagctactg 540
acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600
ggcttactat gttggcggta cccccatcaa aaaaatattc tcaacataaa aaactttgtg 660
taatacttgt aacgctgaat tcggaaacct gatcatgtag atcgaatgga ctctaaatcc 720
gttcagccgg gttagattcc cggggtttcc gccactgcag atccttagcg aaagctaagg 780
atttttttta ctcgagactg atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 840
aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc ttcctcgctc 900
actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta cgaacggggc 960
ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg gccgcggcaa 1020
agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga cgctcaaatc 1080
agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggcggctccc 1140
tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1200
gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc caagctggac 1260
tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1320
gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg taattgattt 1380
agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga caagttttgg 1440
tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc agagaacctt 1500
cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta cgcgcagacc 1560
aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga tttcagtgca 1620
atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt tgtaattctc 1680
atgtttgaca gcttatcatc gataagcttt aatgcggtag tttatcacag ttaaattgct 1740
aacgcagtca ggcaccgtgt atgaaatcta acaatgcgct catcgtcatc ctcggcaccg 1800
tcaccctgga tgctgtaggc ataggcttgg ttatgccggt actgccgggc ctcttgcggg 1860
atatcgtcca ttccgacagc atcgccagtc actatggcgt gctgctagcg ctatatgcgt 1920
tgatgcaatt tctatgcgca cccgttctcg gagcactgtc cgaccgcttt ggccgccgcc 1980
cagtcctgct cgcttcgcta cttggagcca ctatcgacta cgcgatcatg gcgaccacac 2040
ccgtcctgtg gatcctctac gccggacgca tcgtggccgg catcaccggc gccacaggtg 2100
cggttgctgg cgcctatatc gccgacatca ccgatgggga agatcgggct cgccacttcg 2160
ggctcatgag cgcttgtttc ggcgtgggta tggtggcagg ccccgtggcc gggggactgt 2220
tgggcgccat ctccttgcat gcaccattcc ttgcggcggc ggtgctcaac ggcctcaacc 2280
tactactggg ctgcttccta atgcaggagt cgcataaggg agagcgtcga ccgatgccct 2340
tgagagcctt caacccagtc agctccttcc ggtgggcgcg gggcatgact atcgtcgccg 2400
cacttatgac tgtcttcttt atcatgcaac tcgtaggaca ggtgccggca gcgctctggg 2460
tcattttcgg cgaggaccgc tttcgctgga gcgcgacgat gatcggcctg tcgcttgcgg 2520
tattcggaat cttgcacgcc ctcgctcaag ccttcgtcac tggtcccgcc accaaacgtt 2580
tcggcgagaa gcaggccatt atcgccggca tggcggccga cgcgctgggc tacgtcttgc 2640
tggcgttcgc gacgcgaggc tggatggcct tccccattat gattcttctc gcttccggcg 2700
gcatcgggat gcccgcgttg caggccatgc tgtccaggca ggtagatgac gaccatcagg 2760
gacagcttca aggatcgctc gcggctctta ccagcctaac ttcgatcatt ggaccgctga 2820
tcgtcacggc gatttatgcc gcctcggcga gcacatggaa cgggttggca tggattgtag 2880
gcgccgccct ataccttgtc tgcctccccg cgttgcgtcg cggtgcatgg agccgggcca 2940
cctcgacctg aatggaagcc ggcggcacct cgctaacgga ttcaccactc caagaattgg 3000
agccaatcaa ttcttgcgga gaactgtgaa tgcgcaaacc aacccttggc agaacatatc 3060
catcgcgtcc gccatctcca gcagccgcac gcggcgcatc tcgggcagcg ttgggtcctg 3120
gccacgggtg cgcatgatcg tgctcctgtc gttgaggacc cggctaggct ggcggggttg 3180
ccttactggt tagcagaatg aatcaccgat acgcgagcga acgtgaagcg actgctgctg 3240
caaaacgtct gcgacctgag caacaacatg aatggtcttc ggtttccgtg tttcgtaaag 3300
tctggaaacg cggaagtccc ctacgtgctg ctgaagttgc ccgcaacaga gagtggaacc 3360
aaccggtgat accacgatac tatgactgag agtcaacgcc agctgtgacc gtctccggga 3420
gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag cagatcaatt 3480
cgcgcgcgaa ggcgaagcgg catgcataat gtgcctgtca aatggacgaa gcagggattc 3540
tgcaaaccct atgctactcc gtcaagccgt caattgtctg attcgttacc aattatgaca 3600
acttgacggc tacatcattc actttttctt cacaaccggc acggaactcg ctcgggctgg 3660
ccccggtgca ttttttaaat acccgcgaga aatagagttg atcgtcaaaa ccaacattgc 3720
gaccgacggt ggcgataggc atccgggtgg tgctcaaaag cagcttcgcc tggctgatac 3780
gttggtcctc gcgccagctt aagacgctaa tccctaactg ctggcggaaa agatgtgaca 3840
gacgcgacgg cgacaagcaa acatgctgtg cgacgctggc gatatcaaaa ttgctgtctg 3900
ccaggtgatc gctgatgtac tgacaagcct cgcgtacccg attatccatc ggtggatgga 3960
gcgactcgtt aatcgcttcc atgcgccgca gtaacaattg ctcaagcaga tttatcgcca 4020
gcagctccga atagcgccct tccccttgcc cggcgttaat gatttgccca aacaggtcgc 4080
tgaaatgcgg ctggtgcgct tcatccgggc gaaagaaccc cgtattggca aatattgacg 4140
gccagttaag ccattcatgc cagtaggcgc gcggacgaaa gtaaacccac tggtgatacc 4200
attcgcgagc ctccggatga cgaccgtagt gatgaatctc tcctggcggg aacagcaaaa 4260
tatcacccgg tcggcaaaca aattctcgtc cctgattttt caccaccccc tgaccgcgaa 4320
tggtgagatt gagaatataa cctttcattc ccagcggtcg gtcgataaaa aaatcgagat 4380
aaccgttggc ctcaatcggc gttaaacccg ccaccagatg ggcattaaac gagtatcccg 4440
gcagcagggg atcattttgc gcttcagcca tacttttcat actcccgcca ttcagagaag 4500
aaaccaattg tccatattgc atcagacatt gccgtcactg cgtcttttac tggctcttct 4560
cgctaaccaa accggtaacc ccgcttatta aaagcattct gtaacaaagc gggaccaaag 4620
ccatgacaaa aacgcgtaac aaaagtgtct ataatcacgg cagaaaagtc cacattgatt 4680
atttgcacgg cgtcacactt tgctatgcca tagcattttt atccataaga ttagcggatc 4740
ctacctgacg ctttttatcg caactctcta ctgtttctcc atacccgttt tttgggctaa 4800
caggaggaat taaccatgac catgattacc gtgcactaga acacgattaa catcgctaag 4860
aacgacttct ctgacatcga actggctgct atcccgttca acactctggc tgaccattac 4920
ggtgagcgtt tagctcgcga acagttggcc cttgagcatg agtcttacga gatgggtgaa 4980
gcacgcttcc gcaagatgtt tgagcgtcaa cttaaagctg gtgaggttgc ggataacgct 5040
gccgccaagc ctctcatcac taccctactc cctaagatga ttgcacgcat caacgactgg 5100
tttgaggaag tgaaagctaa gcgcggcaag cgcccgacag ccttccagtt cctgtaggaa 5160
atcaagccgg aagccgtagc gtacatcacc attaagacca ctctggcttg cctaaccagt 5220
gctgacaata caaccgttca ggctgtagca agcgcaatcg gtcgggccat tgaggacgag 5280
gctcgcttcg gtcgtatccg tgaccttgaa gctaagcact tcaagaaaaa cgttgaggaa 5340
caactcaaca agcgcgtagg gcacgtctac aagaaagcat ttatgcaagt tgtcgaggct 5400
gacatgctct ctaagggtct actcggtggc gaggcgtggt cttcgtggca taaggaagac 5460
tctattcatg taggagtacg ctgcatcgag atgctcattg agtcaaccgg aatggttagc 5520
ttacaccgcc aaaatgctgg cgtagtaggt caagactctg agactatcga actcgcacct 5580
gaatacgctg aggctatcgc aacccgtgca ggtgcgctgg ctggcatctc tccgatgttc 5640
caaccttgcg tagttcctcc taagccgtgg actggcatta ctggtggtgg ctattgggct 5700
aacggtcgtc gtcctctggc gctggtgcgt actcacagta agaaagcact gatgcgctac 5760
gaagacgttt acatgcctga ggtgtacaaa gcgattaaca ttgcgcaaaa caccgcatgg 5820
aaaatcaaca agaaagtcct agcggtcgcc aacgtaatca ccaagtggaa gcattgtccg 5880
gtcgaggaca tccctgcgat tgagcgtgaa gaactcccga tgaaaccgga agacatcgac 5940
atgaatcctg aggctctcac cgcgtggaaa cgtgctgccg ctgctgtgta ccgcaaggac 6000
aaggctcgca agtctcgccg tatcagcctt gagttcatgc ttgagcaagc caataagttt 6060
gctaaccata aggccatctg gttcccttac aacatggact ggcgcggtcg tgtttacgct 6120
gtgtcaatgt tcaacccgca aggtaacgat atgaccaaag gactgcttac gctggcgaaa 6180
ggtaaaccaa tcggtaagga aggttactac tggctgaaaa tccacggtgc aaactgtgcg 6240
ggtgtcgata aggttccgtt ccctgagcgc atcaagttca ttgaggaaaa ccacgagaac 6300
atcatggctt gcgctaagtc tccactggag aacacttggt gggctgagca agattctccg 6360
ttctgcttcc ttgcgttctg ctttgagtac gctggggtac agcaccacgg cctgagctat 6420
aactgctccc ttccgctggc gtttgacggg tcttgctctg gcatccagca cttctccgcg 6480
atgctccgag atgaggtagg tggtcgcgcg gttaacttgc ttcctagtga aaccgttcag 6540
gacatctacg ggattgttgc taagaaagtc aacgagattc tacaagcaga cgcaatcaat 6600
gggaccgata acgaagtagt taccgtgacc gatgagaaca ctggtgaaat ctctgagaaa 6660
gtcaagctgg gcactaaggc actggctggt caatggctgg cttacggtgt tactcgcagt 6720
gtgactaagc gttcagtcat gacgctggct tacgggtcca aagagttcgg cttccgtcaa 6780
caagtgctgg aagataccat tcagccagct attgattccg gcaagggtct gatgttcact 6840
cagccgaatc aggctgctgg atacatggct aagctgattt gggaatctgt gagcgtgacg 6900
gtggtagctg cggttgaagc aatgaactgg cttaagtctg ctgctaagct gctggctgct 6960
gaggtcaaag ataagaagac tggagagatt cttcgcaagc gttgcgctgt gcattgggta 7020
actcctgatg gtttccctgt gtggcaggaa tacaagaagc ctattcagac gcgcttgaac 7080
ctgatgttcc tcggtcagtt ccgcttacag cctaccatta acaccaacaa agatagcgag 7140
attgatgcac acaaacagga gtctggtatc gctcctaact ttgtacacag ccaagacggt 7200
agccaccttc gtaagactgt agtgtgggca cacgagaagt acggaatcga atcttttgca 7260
ctgattcacg actccttcgg taccattccg gctgacgctg cgaacctgtt caaagcagtg 7320
cgcgaaacta tggttgacac atatgagtct tgtgatgtac tggctgattt ctacgaccag 7380
ttcgctgacc agttgcacga gtctcaattg gacaaaatgc cagcacttcc ggctaaaggt 7440
aacttgaacc tccgtgacat cttagagtcg gacttcgcgt tcgcgtaaaa gcttggctgt 7500
tttggcggat gagagaagat tttcagcctg atacagatta aatcagaacg cagaagcggt 7560
ctgataaaac agaatttgcc tggcggcagt agcgcggtgg tcccacctga ccccatgccg 7620
aactcagaag tgaaacgccg taacgccgat ggtagtgtgg ggtctcccca tgcgagagta 7680
gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcgttt 7740
tatctgttgt ttgtcggtga acgctctcct gagtaggaca aatccgccgg gagcggattt 7800
gaacgttgcg aagcaacggc ccggagggtg gcgggcagga cgcccgccat aaactgccag 7860
gcatcaaatt aagcagaagg ccatcctgac ggatggcctt tttgcgtttc tacaaactct 7920
cagcaaaaaa cccctcaaga cccgtttaga ggccccaagg ggttatgcta gttattgctc 7980
agcggtggca gcagccaact cagcttcctt tcgggctttg ttagcagccg gatctcagtg 8040
gtggtggtgg tggtgctcga gtgcggccgc aagcttttag gagcctttgt acagttcgtc 8100
catgccgtgg gtaatgccgg cggcggtcac aaattccagc agcaccatgt ggtcccgttt 8160
ttcgttgggg tctttgctca gcacgctctg ggtggacagg tagtggttgt caggcagcag 8220
cacggggccg tcgccgatgg gggtgttctg ctggtagtgg tcggccagct gcacgctgcc 8280
gtcctccacg ttgtgtctga ttttgaagtt ggccttgatg ccgttcttct gcttgtcggc 8340
ggtgatatac acgttatggc tgttgaagtt atactccagc ttgtgtccca ggatgttgcc 8400
gtcctccttg aagtcgatgc ccttcagctc aatccggttc accagggtgt cgccctcgaa 8460
cttcacctcg gcccgggtct tataggtgcc gtcgtccttg aagctgatgg ttctctcctg 8520
cacgtagcct tcgggcatgg cgctcttgaa aaagtcgtgt ctcttcatgt ggtcggggta 8580
tctgctgaag cactgcactc cgtaggtcag ggttgtcacc agtgttggcc atggcacggg 8640
cagcttcccg gttgtgcaaa tgaacttcag tgtcagcttg ccgttggtgg cgtctccctc 8700
tccctctcct ctcacgctga acttgtggcc gttcacgtcg ccgtccagct ccaccaggat 8760
gggcaccacc ccggtgaaca gctcctcccc cttgctcata tggctgccgc gcggcaccag 8820
gccgctgctg tgatgatgat gatgatggct gctgcccatg gtatatctcc ttcttaaagt 8880
taaacaaaat tatttctaga ggggaattgt tatccgctca caattcccct atagtgagtc 8940
gtattaattt cgcgggatcg tgagcggcct catttcttat tctgagttac aacagtccgc 9000
accgctgtcc ggtagctcct tccggtgggc gcggggcatg actatcgtcg ccgcacttat 9060
gactgtcttc tttatcatgc aactcgtagg acaggtgccg gcagcgccca acagtccccc 9120
ggccacgggg cctgccacca tacccacgcc gaaacaagcg ccctgcacca ttatgttccg 9180
gatctgcatc gcaggatgct gctggctacc ctgtggaaca cctacatctg tattaacgaa 9240
gcgctaaccg tttttatcag gctctgggag gcagaataaa tgatcatatc gtcaattatt 9300
acctccacgg ggagagcctg agcaaactgg cctcaggcat ttgagaagca cacggtcaca 9360
ctgcttccgg tagtcaataa accggtaaac cagcaataga cataagcggc tatttaacga 9420
ccctgccctg aaccgacgac cgggtcgaat ttgctttcga atttctgcca ttcatccgct 9480
tattatcact tattcaggcg tagcaccagg cgtttaaggg caccaataac tgccttaaaa 9540
aaattacgcc ccgccctgcc actcatcgca gtactgttgt aattcattaa gcattctgcc 9600
gacatggaag ccatcacaga cggcatgatg aacctgaatc gccagcggca tcagcacctt 9660
gtcgccttgc gtataatatt tgcccatggt gaaaacgggg gcgaagaagt tgtccatatt 9720
ggccacgttt aaatcaaaac tggtgaaact cacccaggga ttggctgaga cgaaaaacat 9780
attctcaata aaccctttag ggaaataggc caggttttca ccgtaacacg ccacatcttg 9840
cgaatatatg tgtagaaact gccggaacta gtcgtggtat tcactccaga gcgatgaaaa 9900
cgtttcagtt tgctcatgga aaacggtgta acaagggtga acactatccc atatcaccag 9960
ctcaccgtct ttcattgcca tacg 9984
<210> 13
<211> 26
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
ctctcagcaa aaaacccctc aagacc 26
<210> 13
<211> 41
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
ataagaaatg aggccgctca cgatcccgcg aaattaatac g 41
<210> 14
<211> 38
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 14
atgactgaga gtcaacgcca gctgtgaccg tctccggg 38
<210> 15
<211> 41
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 15
gaggggtttt ttgctgagag tttgtagaaa cgcaaaaagg c 41
<210> 16
<211> 35
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 16
cgtgagcggc ctcatttctt attctgagtt acaac 35
<210> 17
<211> 34
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
tggcgttgac tctcagtcat agtatcgtgg tatc 34
<210> 18
<211> 32
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
taccactagg atttccggca gtttctacac at 32
<210> 19
<211> 34
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
cggaaatcct agtggtattc actccagagc gatg 34
<210> 20
<211> 41
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
tccccgaaaa gtgccacctg ctcgggagtt gtcagcctgt c 41
<210> 21
<211> 26
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
ccatatggga ttcctcaaag cgtaaa 26
<210> 22
<211> 37
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
ctactaacct gtaactgcag tttcaaacgc taaattg 37
<210> 23
<211> 40
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
ctggcctttt gctcacatgt tgcgaagcgg aattaattcg 40
<210> 24
<211> 41
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 24
ctttgaggaa tcccatatgg ataaaaagcc tctgaacact c 41
<210> 25
<211> 38
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
ctgcagttac aggttagtag aaataccatt gtaatagg 38
<210> 26
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 26
acatgtgagc aaaaggccag c 21
<210> 27
<211> 19
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
caggtggcac ttttcgggg 19
<210> 28
<211> 46
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 28
gagagtaggt ctcaccgaac ctgnnkaact atgcgcgtaa actgga 46
<210> 29
<211> 43
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
gaggaaaggt ctcatcgggg cmnncatcgg acgcagacag aag 43
<210> 30
<211> 43
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 30
gagagtaggt ctcaccctgc cggacccgat caaaattttc gag 43
<210> 31
<211> 43
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
gaggaaaggt ctcaagggca cggtccagtt tacgmnnata gtt 43
<210> 32
<211> 44
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 32
gagagtaggt ctcaccatgc tgaacttcnn kcagatgggt agcg 44
<210> 33
<211> 43
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 33
gaggaaaggt ctcaatggtg aattcttcca ggtgctcttt acc 43
<210> 34
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 34
aacggctttt tcaaaaatat g 21
<210> 35
<211> 23
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 35
gtatctttat agtcctgtcg ggt 23
<210> 36
<211> 42
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 36
ctaacaggag gaattactag tatggtgagc tagggcgaag aa 42
<210> 37
<211> 50
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 37
atgatgatga tgatggtcga cttaatgatg atgatgatga tgtttgtaca 50
<210> 38
<211> 43
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 38
aacaggagga attaaccatg gtgagctagg gcgaagaact gtt 43
<210> 39
<211> 50
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 39
tccgccaaaa cagccaagct tttaatgatg atgatgatga tgtttgtaca 50
<210> 40
<211> 45
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 40
gctaacagga ggaattaacc atgaacacga ttaacatcgc taaga 45
<210> 41
<211> 39
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 41
ccgccaaaac agccaagctt ttacgcgaac gcgaagtcc 39
<210> 42
<211> 46
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 42
accatgatta ccgtgcacta gaacacgatt aacatcgcta agaacg 46
<210> 43
<211> 44
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 43
gtgcacggta atcatggtca tggttaattc ctcctgttag ccca 44
<210> 44
<211> 32
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 44
ccagttcctg taggaaatca agccggaagc cg 32
<210> 45
<211> 32
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 45
tttcctacag gaactggaag gctgtcgggc gc 32
<210> 46
<211> 5025
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 46
ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac ggaactcgct 60
cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat cgtcaaaacc 120
aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca gcttcgcctg 180
gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct ggcggaaaag 240
atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga tatcaaaatt 300
gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat tatccatcgg 360
tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct caagcagatt 420
tatcgccagc agctccgaat agcgcccttc cccttgcccg gcgttaatga tttgcccaaa 480
caggtcgctg aaatgcggct ggtgcgcttc atccgggcga aagaaccccg tattggcaaa 540
tattgacggc cagttaagcc attcatgcca gtaggcgcgc ggacgaaagt aaacccactg 600
gtgataccat tcgcgagcct ccggatgacg accgtagtga tgaatctctc ctggcgggaa 660
cagcaaaata tcactcggtc ggcaaacaaa ttctcgtccc tgatttttca ccaccccctg 720
accgcgaatg gtgagattga gaatataacc tttcattccc agcggtcggt cgataaaaaa 780
atcgagataa ccgttggcct caatcggcgt taaacccgcc accagatggg cattaaacga 840
gtatcccggc agcaggggat cattttgcgc ttcagccata cttttcatac tcccgccatt 900
cagagaagaa accaattgtc catattgcat cagacattgc cgtcactgcg tcttttactg 960
gctcttctcg ctaaccaaac cggtaacccc gcttattaaa agcattctgt aacaaagcgg 1020
gaccaaagcc atgacaaaaa cgcgtaacaa aagtgtctat aatcacggca gaaaagtcca 1080
cattgattat ttgcacggcg tcacactttg ctatgccata gcatttttat ccataagatt 1140
agcggatcct acctgacgct ttttatcgca actctctact gtttctccat acccgttttt 1200
ttgggctaac aggaggaatt actagtatgg tgagctaggg cgaagaactg ttcaccggtg 1260
tggtgccgat tctggttgaa ctggacggcg atgtgaacgg ccacaaattc agcgtgagtg 1320
gcgaaggtga aggcgatgcg acctacggca aactgacgct gaagttcatc tgcaccacgg 1380
gcaaactgcc ggttccatgg ccgacgctgg tgaccacgct gacctatggc gtgcaatgct 1440
tcagtcgcta cccggaccac atgaagcagc acgacttctt caaaagcgcg atgccagaag 1500
gctacgtgca agaacgcacc atcttcttca aggacgacgg caactacaaa acccgcgccg 1560
aggtgaaatt cgagggtgat acgctggtga accgcatcga gctcaagggc atcgacttca 1620
aagaagacgg caacattctg ggccacaagc tggagtataa ctacaacagc cacaacgtgt 1680
acatcatggc ggacaagcag aaaaacggca tcaaagttaa cttcaagatc cgccacaaca 1740
tcgaggatgg cagcgtgcag ctggccgatc attaccagca gaatacgccg attggcgacg 1800
gtccggttct gctgccagat aaccactatc tgagcacgca gagcgcgctg agcaaagatc 1860
cgaacgagaa gcgcgatcac atggttctgc tggaattcgt gaccgccgcg ggcatcaccc 1920
tcggtatgga tgaactgtac aaacatcatc atcatcatca ttaagtcgac catcatcatc 1980
atcatcattg agtttaaacg gtctccagct tggctgtttt ggcggatgag agaagatttt 2040
cagcctgata cagattaaat cagaacgcag aagcggtctg ataaaacaga atttgcctgg 2100
cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag 2160
cgccgatggt agtgtggggt ctccccatgc gagagtaggg aactgccagg catcaaataa 2220
aacgaaaggc tcagtcgaaa gactgggcct tgtttgtgag ctcccggtca tcaatcatcc 2280
ccataatcct tgttagatta tcaattttaa aaaactaaca gttgtcagcc tgtcccgctt 2340
taatatcata cgccgttata cgttgtttac gctttgagga atcccatatg atacgcggcc 2400
gctttcaaac gctaaattgc ctgatgcgct acgcttatca ggcctacatg atctctgcaa 2460
tatattgagt ttgcgtgctt ttgtaggccg gataaggcgt tcacgccgca tccggcaaga 2520
aacagcaaac aatccaaaac gccgcgttca gcggcgtttt ttctgctttt cttcgcgaat 2580
taattccgct tcgcaacatg tgagcaccgg tttattgact accggaagca gtgtgaccgt 2640
gtgcttctca aatgcctgag gccagtttgc tcaggctctc cccgtggagg taataattga 2700
cgatatgatc agtgcacggc taactaagcg gcctgctgac tttctcgccg atcaaaaggc 2760
attttgctat taagggattg acgagggcgt atctgcgcag taagatgcgc cccgcattgg 2820
aaacctgatc atgtagatcg aatggactct aaatccgttc agccgggtta gattcccggg 2880
gtttccgcca aattcgaaaa gcctgctcaa cgagcaggct tttttgcatg ctcgagcagc 2940
tcagggtcga atttgctttc gaatttctgc cattcatccg cttattatca cttattcagg 3000
cgtagcaacc aggcgtttaa gggcaccaat aactgcctta aaaaaattac gccccgccct 3060
gccactcatc gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac 3120
aaacggcatg atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat 3180
atttgcccat ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa 3240
aactggtgaa actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt 3300
tagggaaata ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa 3360
actgccggaa atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat 3420
ggaaaacggt gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg 3480
ccatacggaa ttccggatga gcattcatca ggcgggcaag aatgtgaata aaggccggat 3540
aaaacttgtg cttatttttc tttacggtct ttaaaaaggc cgtaatatcc agctgaacgg 3600
tctggttata ggtacattga gcaactgact gaaatgcctc aaaatgttct ttacgatgcc 3660
attgggatat atcaacggtg gtatatccag tgattttttt ctccatttta gcttccttag 3720
ctcctgaaaa tctcgataac tcaaaaaata cgcccggtag tgatcttatt tcattatggt 3780
gaaagttgga acctcttacg tgccgatcaa cgtctcattt tcgccaaaag ttggcccagg 3840
gcttcccggt atcaacaggg acaccaggat ttatttattc tgcgaagtga tcttccgtca 3900
caggtattta ttcggcgcaa agtgcgtcgg gtgatgctgc caacttactg atttagtgta 3960
tgatggtgtt tttgaggtgc tccagtggct tctgtttcta tcagctgtcc ctcctgttca 4020
gctactgacg gggtggtgcg taacggcaaa agcaccgccg gacatcagcg ctagcggagt 4080
gtatactggc ttactatgtt ggcactgatg agggtgtcag tgaagtgctt catgtggcag 4140
gagaaaaaag gctgcaccgg tgcgtcagca gaatatgtga tacaggatat attccgcttc 4200
ctcgctcact gactcgctac gctcggtcgt tcgactgcgg cgagcggaaa tggcttacga 4260
acggggcgga gatttcctgg aagatgccag gaagatactt aacagggaag tgagagggcc 4320
gcggcaaagc cgtttttcca taggctccgc ccccctgaca agcatcacga aatctgacgc 4380
tcaaatcagt ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctggc 4440
ggctccctcg tgcgctctcc tgttcctgcc tttcggttta ccggtgtcat tccgctgtta 4500
tggccgcgtt tgtctcattc cacgcctgac actcagttcc gggtaggcag ttcgctccaa 4560
gctggactgt atgcacgaac cccccgttca gtccgaccgc tgcgccttat ccggtaacta 4620
tcgtcttgag tccaacccgg aaagacatgc aaaagcacca ctggcagcag ccactggtaa 4680
ttgatttaga ggagttagtc ttgaagtcat gcgccggtta aggctaaact gaaaggacaa 4740
gttttggtga ctgcgctcct ccaagccagt tacctcggtt caaagagttg gtagctcaga 4800
gaaccttcga aaaaccgccc tgcaaggcgg ttttttcgtt ttcagagcaa gagattacgc 4860
gcagaccaaa acgatctcaa gaagatcatc ttattaatca gataaaatat ttctagattt 4920
cagtgcaatt tatctcttca aatgtagcac ctgaagtcag ccccatacga tataagttgt 4980
aattctcatg tttgacagct tatcatcgat aagcttggta cccaa 5025
<210> 47
<211> 111
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 47
Met Glu Lys Lys Ile Thr Gly Tyr Thr Thr Val Asp Ile Ser Gln Trp
1 5 10 15
His Arg Lys Glu His Phe Glu Ala Phe Gln Ser Val Ala Gln Cys Thr
20 25 30
Tyr Asn Gln Thr Val Gln Leu Asp Ile Thr Ala Phe Leu Lys Thr Val
35 40 45
Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe Ile His Ile Leu Ala
50 55 60
Arg Leu Met Asn Ala His Pro Glu Phe Arg Met Ala Met Lys Asp Gly
65 70 75 80
Glu Leu Val Ile Trp Asp Ser Val His Pro Cys Tyr Thr Val Phe His
85 90 95
Glu Gln Thr Glu Thr Phe Ser Ser Leu Trp Ser Glu Tyr His Asp
100 105 110
<210> 48
<211> 107
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 48
Phe Arg Gln Phe Leu His Ile Tyr Ser Gln Asp Val Ala Cys Tyr Gly
1 5 10 15
Glu Asn Leu Ala Tyr Phe Pro Lys Gly Phe Ile Glu Asn Met Phe Phe
20 25 30
Val Ser Ala Asn Pro Trp Val Ser Phe Thr Ser Phe Asp Leu Asn Val
35 40 45
Ala Asn Met Asp Asn Phe Phe Ala Pro Val Phe Thr Met Gly Lys Tyr
50 55 60
Tyr Thr Gln Gly Asp Lys Val Leu Met Pro Leu Ala Ile Gln Val His
65 70 75 80
His Ala Val Cys Asp Gly Phe His Val Gly Arg Met Leu Asn Glu Leu
85 90 95
Gln Gln Tyr Cys Asp Glu Trp Gln Gly Gly Ala
100 105
<210> 49
<211> 105
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 49
Asn Thr Ile Asn Ile Ala Lys Asn Asp Phe Ser Asp Ile Glu Leu Ala
1 5 10 15
Ala Ile Pro Phe Asn Thr Leu Ala Asp His Tyr Gly Glu Arg Leu Ala
20 25 30
Arg Glu Gln Leu Ala Leu Glu His Glu Ser Tyr Glu Met Gly Glu Ala
35 40 45
Arg Phe Arg Lys Met Phe Glu Arg Gln Leu Lys Ala Gly Glu Val Ala
50 55 60
Asp Asn Ala Ala Ala Lys Pro Leu Ile Thr Thr Leu Leu Pro Lys Met
65 70 75 80
Ile Ala Arg Ile Asn Asp Trp Phe Glu Glu Val Lys Ala Lys Arg Gly
85 90 95
Lys Arg Pro Thr Ala Phe Gln Phe Leu
100 105
<210> 50
<211> 776
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 50
Glu Ile Lys Pro Glu Ala Val Ala Tyr Ile Thr Ile Lys Thr Thr Leu
1 5 10 15
Ala Cys Leu Thr Ser Ala Asp Asn Thr Thr Val Gln Ala Val Ala Ser
20 25 30
Ala Ile Gly Arg Ala Ile Glu Asp Glu Ala Arg Phe Gly Arg Ile Arg
35 40 45
Asp Leu Glu Ala Lys His Phe Lys Lys Asn Val Glu Glu Gln Leu Asn
50 55 60
Lys Arg Val Gly His Val Tyr Lys Lys Ala Phe Met Gln Val Val Glu
65 70 75 80
Ala Asp Met Leu Ser Lys Gly Leu Leu Gly Gly Glu Ala Trp Ser Ser
85 90 95
Trp His Lys Glu Asp Ser Ile His Val Gly Val Arg Cys Ile Glu Met
100 105 110
Leu Ile Glu Ser Thr Gly Met Val Ser Leu His Arg Gln Asn Ala Gly
115 120 125
Val Val Gly Gln Asp Ser Glu Thr Ile Glu Leu Ala Pro Glu Tyr Ala
130 135 140
Glu Ala Ile Ala Thr Arg Ala Gly Ala Leu Ala Gly Ile Ser Pro Met
145 150 155 160
Phe Gln Pro Cys Val Val Pro Pro Lys Pro Trp Thr Gly Ile Thr Gly
165 170 175
Gly Gly Tyr Trp Ala Asn Gly Arg Arg Pro Leu Ala Leu Val Arg Thr
180 185 190
His Ser Lys Lys Ala Leu Met Arg Tyr Glu Asp Val Tyr Met Pro Glu
195 200 205
Val Tyr Lys Ala Ile Asn Ile Ala Gln Asn Thr Ala Trp Lys Ile Asn
210 215 220
Lys Lys Val Leu Ala Val Ala Asn Val Ile Thr Lys Trp Lys His Cys
225 230 235 240
Pro Val Glu Asp Ile Pro Ala Ile Glu Arg Glu Glu Leu Pro Met Lys
245 250 255
Pro Glu Asp Ile Asp Met Asn Pro Glu Ala Leu Thr Ala Trp Lys Arg
260 265 270
Ala Ala Ala Ala Val Tyr Arg Lys Asp Lys Ala Arg Lys Ser Arg Arg
275 280 285
Ile Ser Leu Glu Phe Met Leu Glu Gln Ala Asn Lys Phe Ala Asn His
290 295 300
Lys Ala Ile Trp Phe Pro Tyr Asn Met Asp Trp Arg Gly Arg Val Tyr
305 310 315 320
Ala Val Ser Met Phe Asn Pro Gln Gly Asn Asp Met Thr Lys Gly Leu
325 330 335
Leu Thr Leu Ala Lys Gly Lys Pro Ile Gly Lys Glu Gly Tyr Tyr Trp
340 345 350
Leu Lys Ile His Gly Ala Asn Cys Ala Gly Val Asp Lys Val Pro Phe
355 360 365
Pro Glu Arg Ile Lys Phe Ile Glu Glu Asn His Glu Asn Ile Met Ala
370 375 380
Cys Ala Lys Ser Pro Leu Glu Asn Thr Trp Trp Ala Glu Gln Asp Ser
385 390 395 400
Pro Phe Cys Phe Leu Ala Phe Cys Phe Glu Tyr Ala Gly Val Gln His
405 410 415
His Gly Leu Ser Tyr Asn Cys Ser Leu Pro Leu Ala Phe Asp Gly Ser
420 425 430
Cys Ser Gly Ile Gln His Phe Ser Ala Met Leu Arg Asp Glu Val Gly
435 440 445
Gly Arg Ala Val Asn Leu Leu Pro Ser Glu Thr Val Gln Asp Ile Tyr
450 455 460
Gly Ile Val Ala Lys Lys Val Asn Glu Ile Leu Gln Ala Asp Ala Ile
465 470 475 480
Asn Gly Thr Asp Asn Glu Val Val Thr Val Thr Asp Glu Asn Thr Gly
485 490 495
Glu Ile Ser Glu Lys Val Lys Leu Gly Thr Lys Ala Leu Ala Gly Gln
500 505 510
Trp Leu Ala Tyr Gly Val Thr Arg Ser Val Thr Lys Arg Ser Val Met
515 520 525
Thr Leu Ala Tyr Gly Ser Lys Glu Phe Gly Phe Arg Gln Gln Val Leu
530 535 540
Glu Asp Thr Ile Gln Pro Ala Ile Asp Ser Gly Lys Gly Leu Met Phe
545 550 555 560
Thr Gln Pro Asn Gln Ala Ala Gly Tyr Met Ala Lys Leu Ile Trp Glu
565 570 575
Ser Val Ser Val Thr Val Val Ala Ala Val Glu Ala Met Asn Trp Leu
580 585 590
Lys Ser Ala Ala Lys Leu Leu Ala Ala Glu Val Lys Asp Lys Lys Thr
595 600 605
Gly Glu Ile Leu Arg Lys Arg Cys Ala Val His Trp Val Thr Pro Asp
610 615 620
Gly Phe Pro Val Trp Gln Glu Tyr Lys Lys Pro Ile Gln Thr Arg Leu
625 630 635 640
Asn Leu Met Phe Leu Gly Gln Phe Arg Leu Gln Pro Thr Ile Asn Thr
645 650 655
Asn Lys Asp Ser Glu Ile Asp Ala His Lys Gln Glu Ser Gly Ile Ala
660 665 670
Pro Asn Phe Val His Ser Gln Asp Gly Ser His Leu Arg Lys Thr Val
675 680 685
Val Trp Ala His Glu Lys Tyr Gly Ile Glu Ser Phe Ala Leu Ile His
690 695 700
Asp Ser Phe Gly Thr Ile Pro Ala Asp Ala Ala Asn Leu Phe Lys Ala
705 710 715 720
Val Arg Glu Thr Met Val Asp Thr Tyr Glu Ser Cys Asp Val Leu Ala
725 730 735
Asp Phe Tyr Asp Gln Phe Ala Asp Gln Leu His Glu Ser Gln Leu Asp
740 745 750
Lys Met Pro Ala Leu Pro Ala Lys Gly Asn Leu Asn Leu Arg Asp Ile
755 760 765
Leu Glu Ser Asp Phe Ala Phe Ala
770 775
<210> 51
<211> 110
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 51
Met Glu Lys Lys Ile Thr Gly Tyr Thr Thr Val Asp Ile Ser Gln Trp
1 5 10 15
His Arg Lys Glu His Phe Glu Ala Phe Gln Ser Val Ala Gln Cys Thr
20 25 30
Tyr Asn Gln Thr Val Gln Leu Asp Ile Thr Ala Phe Leu Lys Thr Val
35 40 45
Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe Ile His Ile Leu Ala
50 55 60
Arg Leu Met Asn Ala His Pro Glu Phe Arg Met Ala Met Lys Asp Gly
65 70 75 80
Glu Leu Val Ile Trp Asp Ser Val His Pro Cys Tyr Thr Val Phe His
85 90 95
Glu Gln Thr Glu Thr Phe Ser Ser Leu Trp Ser Glu Tyr His
100 105 110
<210> 52
<211> 108
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 52
Asp Phe Arg Gln Phe Leu His Ile Tyr Ser Gln Asp Val Ala Cys Tyr
1 5 10 15
Gly Glu Asn Leu Ala Tyr Phe Pro Lys Gly Phe Ile Glu Asn Met Phe
20 25 30
Phe Val Ser Ala Asn Pro Trp Val Ser Phe Thr Ser Phe Asp Leu Asn
35 40 45
Val Ala Asn Met Asp Asn Phe Phe Ala Pro Val Phe Thr Met Gly Lys
50 55 60
Tyr Tyr Thr Gln Gly Asp Lys Val Leu Met Pro Leu Ala Ile Gln Val
65 70 75 80
His His Ala Val Cys Asp Gly Phe His Val Gly Arg Met Leu Asn Glu
85 90 95
Leu Gln Gln Tyr Cys Asp Glu Trp Gln Gly Gly Ala
100 105
<210> 53
<211> 3
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 53
Met Val Ser
1
<210> 54
<211> 241
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 54
Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp
1 5 10 15
Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly
20 25 30
Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly
35 40 45
Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly
50 55 60
Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe
65 70 75 80
Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe
85 90 95
Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu
100 105 110
Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys
115 120 125
Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser
130 135 140
His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val
145 150 155 160
Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala
165 170 175
Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu
180 185 190
Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro
195 200 205
Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala
210 215 220
Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys His His His His His
225 230 235 240
His

Claims (10)

1.一种吡咯赖氨酸氨酰-tRNA合成酶突变体,其特征在于,所述吡咯赖氨酸氨酰-tRNA合成酶突变体为SEQ ID NO.10所示吡咯赖氨酸氨酰-tRNA合成酶的氨基酸序列中301位、306位、309位、348位的氨基酸残基突变得到,所述突变为:
301位替换为缬氨酸(V),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.7);
或,301位替换为蛋氨酸(M),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.8);
或,301位替换为异亮氨酸(I),306位替换为亮氨酸(L),309位替换为丙氨酸(A),348位替换为苯丙氨酸(F)(SEQ ID NO.9)。
2.一种筛选权利要求1所述吡咯赖氨酸氨酰-tRNA合成酶突变体的方法,其特征在于,所述方法包括以下步骤:
S1:构建包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒:所述吡咯赖氨酸氨酰tRNA合成酶饱和突变库为将如GenBank Sequence ID:WP_011033391.1所示野生型吡咯赖氨酸氨酰-tRNA合成酶的氨基酸序列突变得到,突变位点包括301位、306位、309位、以及348位的氨基酸残基;将编码吡咯赖氨酸氨酰tRNA合成酶饱和突变库的核苷酸序列连入基础质粒,获得所述包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒;优选的,所述重组载体包含GlnRS启动子、编码吡咯赖氨酸氨酰tRNA合成酶饱和突变库的核苷酸序列、GlnRS终止子、ColE1复制原点、硫酸卡那霉素抗性基因;
S2:构建筛选质粒:将SEQ ID NO.2所示包括氯霉素抗性报告基因的核苷酸片段、SEQID NO.3所示包括绿色荧光蛋白报告基因的核苷酸片段、SEQ ID NO.4所示包括T7RNA聚合酶基因的核苷酸片段通过同源重组连接起来,得到重组质粒,所述重组质粒核苷酸序列如SEQ ID NO.11所示;SEQ ID NO.11所示重组质粒的1-219bp和9544-9984bp的底部链编码氯霉素抗性基因,经过双点突变,将SEQ ID NO.11所示重组质粒的9868bp~9873bp的ctagtc突变位atccta,使重组质粒中的氯霉素抗性基因编码的氨基酸序列的112号位点突变为112D,111位点突变成为琥珀终止密码子TAG,得到所述筛选质粒;
S3:将S2得到的筛选质粒转入大肠杆菌感受态细胞,得到含有筛选质粒的感受态细胞;将S1构建的包括吡咯赖氨酸氨酰-tRNA合成酶饱和突变库的质粒转化至含有筛选质粒的感受态细胞,分别接种至添加非天然氨基酸的液体培养基和不添加非天然氨基酸的液体培养基中进行筛选,如果突变样品同时满足:
(1)在添加非天然氨基酸的液体培养基上具有氯霉素抗性、能够观察到荧光;
(2)在不添加非天然氨基酸的液体培养基上不具有氯霉素抗性、不能够观察到荧光;则筛选得到权利要求1所述吡咯赖氨酸氨酰-tRNA合成酶突变体。
3.根据权利要求2所述的方法,其特征在于,S2所述筛选质粒核苷酸序列如SEQ IDNO.1所示。
4.根据权利要求3所述的方法,其特征在于,SEQ ID NO.3所示包括绿色荧光蛋白报告基因的核苷酸片段,为采用pSel-sfGFP引物对从SEQ ID NO.5所述质粒pET-28a-sfGFP中扩增得到,SEQ ID NO.4所示包括T7RNA聚合酶基因的核苷酸片段,为采用pSel-RNAP引物对从SEQ ID NO.6所述质粒pBad-T7RNAP-TAG中扩增得到。
5.根据权利要求4所述的方法,其特征在于,所述pSel-sfGFP引物对为:
pSel-sfGFP-F:5'-ctctCAGCAAAAAACCCCT CAAGACC-3'(SEQ ID NO.12),
pSel-sfGFP-R:5'-ataagaaatgaggccgctcaCGATCCCGCGAAATTAATACG-3'(SEQ IDNO.13);所述pSel-RNAP引物对为:
pSel-RNAP-F:5'-atgactgagagtcaacgccaGCTGTGACCGTCTCCGGG-3'(SEQ ID NO.14),pSel-RNAP-R:5'-gaggggttttttgctgAGAGTTTGTAGAAACGCAAAAAGGC-3'(SEQ ID NO.15)。
6.根据权利要求2所述的方法,其特征在于,S2中SEQ ID NO.2所示包括氯霉素抗性报告基因的核苷酸片段为线性化pACYC184-Cm112TAG-PylT载体得到,线性化
pACYC184-Cm112TAG-PylT载体的引物为:
pSel-vec-F-new:5'-cgtgagcggcctcatttcttattctgagttacaac-3'(SEQ ID NO.16)、
pSel-vec-R-new:5'-tggcgttgactctcagtcatagtatcgtggtatc-3'(SEQ ID NO.17);
所述双点突变为以SEQ ID NO.12所示重组质粒核苷酸序列为模板,双点突变引物为SDM-Cm112TAG-F:5'-TACCACtagGATTTCCGGCAGTTTCTACACAT-3'(SEQ ID NO.18),SDM-Cm112TAG-R:5'-CGGAAATCctaGTGGTATTCACTCCAGAGCGATG-3'(SEQ ID NO.19)。
7.一种用于筛选权利要求1所述吡咯赖氨酸氨酰-tRNA合成酶突变体的筛选质粒,其特征在于,所述筛选质粒为将SEQ ID NO.2所示包括氯霉素抗性报告基因的核苷酸片段、SEQID NO.3所示包括绿色荧光蛋白报告基因的核苷酸片段、SEQ ID NO.4所示包括T7RNA聚合酶基因的核苷酸片段通过同源重组连接起来,得到重组质粒,所述重组质粒核苷酸序列如SEQ ID NO.11所示;再经过双点突变,将SEQ ID NO.11所示重组质粒的9868bp~9873bp的ctagtc突变位atccta,使重组质粒中的氯霉素抗性基因编码的氨基酸序列的112号位点突变为112D,111位点突变成为琥珀终止密码子TAG,得到所述筛选质粒,所述筛选质粒核苷酸序列如SEQ ID NO.1所示。
8.一种重组载体,其特征在于,所述重组载体包括编码权利要求1所述吡咯赖氨酸氨酰-tRNA合成酶突变体的核苷酸序列。
9.权利要求1所述吡咯赖氨酸氨酰-tRNA合成酶突变体,或权利要求7所述的筛选质粒,或权利要求8所述的重组载体在蛋白质乳酰化修饰中的应用。
10.权利要求1所述吡咯赖氨酸氨酰-tRNA合成酶突变体,或权利要求7所述的筛选质粒,或权利要求8所述的重组载体在将乳酰化赖氨酸定点引入目标蛋白中的应用。
CN202210243139.3A 2022-03-11 2022-03-11 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用 Pending CN114657156A (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210243139.3A CN114657156A (zh) 2022-03-11 2022-03-11 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210243139.3A CN114657156A (zh) 2022-03-11 2022-03-11 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用

Publications (1)

Publication Number Publication Date
CN114657156A true CN114657156A (zh) 2022-06-24

Family

ID=82029931

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210243139.3A Pending CN114657156A (zh) 2022-03-11 2022-03-11 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用

Country Status (1)

Country Link
CN (1) CN114657156A (zh)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101952428A (zh) * 2007-09-20 2011-01-19 独立行政法人理化学研究所 突变体吡咯赖氨酰-tRNA合成酶以及使用该酶的非天然氨基酸整合蛋白质的制造方法
CN104450746A (zh) * 2014-08-30 2015-03-25 武汉佰福泰制药有限公司 一种向蛋白质定点引入非天然氨基酸的方法
CN110172467A (zh) * 2019-05-24 2019-08-27 浙江大学 一种利用嵌合设计方法构建正交的氨酰-tRNA合成酶/tRNA体系
CN111850020A (zh) * 2019-04-25 2020-10-30 苏州鲲鹏生物技术有限公司 利用质粒系统在蛋白中引入非天然氨基酸
CN111849929A (zh) * 2019-04-30 2020-10-30 苏州鲲鹏生物技术有限公司 高效引入赖氨酸衍生物的氨酰基—tRNA合成酶
CN112739823A (zh) * 2018-08-31 2021-04-30 国立研究开发法人理化学研究所 吡咯赖氨酰-tRNA合成酶
CN113481228A (zh) * 2021-06-13 2021-10-08 中国药科大学 微生物非天然氨基酸引入体系及构建方法

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101952428A (zh) * 2007-09-20 2011-01-19 独立行政法人理化学研究所 突变体吡咯赖氨酰-tRNA合成酶以及使用该酶的非天然氨基酸整合蛋白质的制造方法
CN104450746A (zh) * 2014-08-30 2015-03-25 武汉佰福泰制药有限公司 一种向蛋白质定点引入非天然氨基酸的方法
CN112739823A (zh) * 2018-08-31 2021-04-30 国立研究开发法人理化学研究所 吡咯赖氨酰-tRNA合成酶
CN111850020A (zh) * 2019-04-25 2020-10-30 苏州鲲鹏生物技术有限公司 利用质粒系统在蛋白中引入非天然氨基酸
CN111849929A (zh) * 2019-04-30 2020-10-30 苏州鲲鹏生物技术有限公司 高效引入赖氨酸衍生物的氨酰基—tRNA合成酶
CN110172467A (zh) * 2019-05-24 2019-08-27 浙江大学 一种利用嵌合设计方法构建正交的氨酰-tRNA合成酶/tRNA体系
CN113481228A (zh) * 2021-06-13 2021-10-08 中国药科大学 微生物非天然氨基酸引入体系及构建方法

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
UMEHARA ET AL.: "N-Acetyl lysyl-tRNA synthetases evolved by a CcdB-based selection possess N-acetyl lysine specificity in vitro and in vivo", 《FEBS LETTERS》, vol. 586, pages 3 - 3 *
VOLKWEIN ET AL.: "A Versatile Toolbox for the Control of Protein Levels using N#-acetyl-L-lysine Dependent Amber Suppression", 《ACS SYNTH. BIOL.》, pages 1 - 34 *
WEI WAN ET AL.: "Pyrrolysyl-tRNA Synthetase: an ordinary enzyme but an outstanding genetic code expansion tool", 《BIOCHIM BIOPHYS ACTA》, vol. 1844, no. 6, pages 1059, XP028638306, DOI: 10.1016/j.bbapap.2014.03.002 *

Similar Documents

Publication Publication Date Title
CN107586779A (zh) 使用crispr‑cas系统对间充质干细胞进行casp3基因敲除的方法
CN107619829A (zh) 使用crispr‑cas系统对间充质干细胞进行gins2基因敲除的方法
CN109943581B (zh) 一种质粒以及噬菌体辅助的连续定向进化系统和定向进化方法
CN111154707B (zh) 基因工程化大肠杆菌及褪黑素的生产方法
CN111235080A (zh) 基因重组大肠杆菌及5-羟色胺的生产方法
CN106867952A (zh) 一株大肠杆菌基因工程菌及利用其生产l‑苏氨酸的方法
CN107988250B (zh) 一种通用型衣藻外源基因表达载体构建方法
CN110564775A (zh) 一种提高基因组定点修饰效率的方法
CN101027403B (zh) 生产伯醇的方法
CN110499336B (zh) 一种利用小分子化合物提高基因组定点修饰效率的方法
CN110066813A (zh) 一种调控杨树木材形成的油菜素内酯合成限速基因及其应用
KR101443052B1 (ko) 변형된 콘드로이틴 합성 효소 폴리펩타이드 및 이의 결정
CN114657156A (zh) 吡咯赖氨酸氨酰-tRNA合成酶突变体及其筛选系统和应用
CN110964725A (zh) 特异性识别猪KIT基因的sgRNA及其编码DNA、试剂盒和应用
CN114836446B (zh) 一种抗草甘膦的植物及其制法
CN111909914B (zh) 核酸内切酶SpCas9的高PAM兼容性截短型变异体txCas9及其应用
CN101098967B (zh) 用全细胞催化剂制备光学活性醇
CN110272881B (zh) 核酸内切酶SpCas9高特异性截短变异体TSpCas9-V1/V2及其应用
KR101226644B1 (ko) 지방산 생합성 경로의 과발현용 형질전환 대장균 및 그의 제조방법
CN112662697B (zh) 一种莱茵衣藻tctn1表达质粒及其构建方法和应用
KR100918121B1 (ko) 아세틸-CoA 소비를 증가시키는 대장균 균주 및 상기균주 및 흡착제 수지를 이용한 바닐린 생산 방법
CN106636023B (zh) 一种增强zwf基因启动子表达强度的方法
CN111254105B (zh) 基因工程大肠杆菌及制备方法和吲哚-3-乙酸的生产方法
CN112553237A (zh) 一种新型mariner转座子系统、应用和构建枯草芽孢杆菌插入突变株文库
CN106520818B (zh) 一种快速回补鸭疫里默氏杆菌缺失基因的方法

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20220624

WD01 Invention patent application deemed withdrawn after publication