具体实施方案
以下叙述根据本发明实施方案的实施例。应该说明的是,本发明的实施例对于本发明只有说明作用,而没有限制作用。
实施例1、对鳃金龟地下害虫有效的Bt菌株185的筛选与鉴定
土壤采自河北省保定市顺平县苹果园。取0.1-0.2g土样放入装有10ml灭菌水和玻璃珠的试管中,在旋涡振荡器上振荡3分钟,将土粒打碎,然后放于200rpm摇床上振荡10分钟,75℃水浴锅中水浴17分钟,充分将非芽孢菌杀死,待稍静置后,选取10-2、10-3、10-4三个稀释度,分别吸取100ul菌悬液于BP平板上,涂布均匀,于37℃培养三天,挑取似Bt菌落涂片镜检。发现一株含有球形晶体的Bt菌株(见附图1)。
该菌株为芽孢杆菌属(Bacillus)、苏云金芽孢杆菌种。将该菌株在中国微生物菌种保藏管理委员会普通微生物中心提交保藏,保藏日期为2004年11月5日,保藏编号为CGMCCNO.1242。经鉴定,关于该菌株的信息包括:能形成芽孢,同时能形成球形伴孢晶体,SDS-PAGE电泳表明其杀虫晶体蛋白约为130kDa(见附图2),其晶体蛋白在14小时开始表达,而生长曲线表明其14小时已进入停滞期(见附图3),表明该晶体蛋白启动子可能为依赖芽孢形成的;生物学测定表明,该菌株对金龟科地下害虫暗黑鳃金龟具有明显杀虫作用,7天校正死亡率达90%,对椰心叶甲也具有一定的杀虫活性。
实施例2、菌株185中cry基因鉴定
根据cry8类基因保守区设计了一对通用引物
S5un8:5-’CGGCAAACTTAGTAGAATGC-3’
S3un8:5-’CTGACTGATTTCCACCATCACG-3。
用下列PCR反应体系(50μL)鉴定了Bt菌株185:
10×PCR buffer |
5μL |
MgCl2(20mM) |
6μL |
dNTP(10mM) |
1μL |
引物对(10mM) |
1μL/个 |
模板 |
1μL |
Taq聚合酶(5U/μL) |
0.5μL |
超纯水补至50μL,混匀离心,加石蜡油30μL。
扩增循环:94℃变性1分钟,54℃退火1分钟,72℃延伸4分钟,25个循环,最后72℃延伸10分钟。
结果(见附图4)显示与已知cry8类基因的图谱不同,表明菌株185中可能含有新的cry8杀虫基因。
实施例3、菌株185中cry8E基因的克隆
用PstI和KpnI酶切Bt菌株总DNA,用载体pBluescript SK(+)分别建立了两个DNA片段库,然后用引物S5un8/S3un8和PCR方法检测两个DNA库,得到两个阳性克隆pSS3612和pSS162,分别插入11kb的pstI片段和2.0kb的KpnI片段(见附图5)。酶切分析pSS3612,7kb的PstI和KpnI双酶切片段含有cry8Ea1的全长基因,用pBluescript SK(+)亚克隆该片段,得到pSS3612-7,同时亚克隆了4kb的KpnI片段中,得到pSS3612-4(如附图6)。将pSS162和pSS3612-7的插入序列测序,得到序列SEQ ID NO 1。该序列为pSS3612中PstI和KpnI双酶切片段,序列全长7276bps,分析表明其含有两个较大的开放阅读框,ORF1的位置是3658-7152,ORF2的位置是2799-3377。
ORF1的位置是3658-7152,GC含量为38.03%,编码1164个氨基酸组成的蛋白,经测定,其氨基酸序列为SEQ ID NO 2所示。同源分析表明该蛋白与Cry8类蛋白具有较高同源性(见表5)。由于与已知的Cry8类蛋白氨基酸同源性均低于78%,最高只有58.2%(Cry8Bb1),被Bt杀虫晶体蛋白命名委员会命名为Cry8Ea1。
本发明进一步分析了Cry8Ea1蛋白的氨基酸组成(见表6和附图7),得知其分子量为131.56kDa,等电点为pH4.735(见附图8)。
实施例4、cry8Fa1基因的克隆
对pSS162质粒中插入序列进行了分析,片段长2.3kb,具有完整的3’端序列,与cry8Ea1序列完全同源,但5’端差异较大,并且缺少完整的读码框,根据该序列的特异区段设计了1对引物(5-185-KpnI:TTGGTATGGCGTTTCGTTG;和3-185-Kpnl:TATTGCAGGTCCAGGATTCAC),用于克隆该全长基因,将185质粒DNA用XbaI酶切,与载体pBluescript SK(+)连接,筛选得到插入约9Kb外源片段的阳性克隆pSS266(见附图9),进一步酶切分析表明ClaI酶切产生的3.0kb片段含有5’端读码框(见附图10),用pBluescript SK(+)亚克隆得到该片段,阳性克隆命名为pSS266-3,对该片段进行了测序,把得到的序列与pSS162中的插入序列进行拼接得到3.9kb片段。对核酸片段进行测序,得到如SEQ ID NO 3所示的核苷酸序列。对该序列进行分析表明:该序列含有1个较大的开放阅读框357-3878;GC含量为36.88%;编码1174个氨基酸组成的蛋白(该蛋白质的氨基酸序列如SEQ ID NO 4所示)。进一步的同源分析表明,该蛋白质与Cry8类蛋白有较高的同源性(见表5)。由于与已知的Cry8类蛋白氨基酸同源性均低于78%,最高只有64.8%(Cry8Eal),该蛋白被Bt杀虫晶体蛋白命名委员会命名为Cry8Fa1。
本发明用生物分析软件Bioedit进一步分析了Cry8Fa1蛋白的氨基酸组成(见表7和附图11)。结果表明,该蛋白质的分子量为133.08kDa,等电点为pH4.565(见附图12)。
实施例5、cry8E和cry8F基因的表达
根据克隆的cry8Ea1和cry8Fa1基因的全长序列,设计了用于表达两种基因的引物,序列如下:
8E1:CGC
GGATCC(Bam HI)GATGAGTCCAAATAATCAAAATG
8E2:ACGC
GTCGAC(Sal I)CTCTACGTCAACAATCAATCAATTC
8F1:CGC
GGATCC(Bam HI)GATGAGTCCAAATAATCAAAATG
8F2:CATTAACTCTGCCCACGGATC
T(C)TCTGGTGCAAAGAAGTCCAG
8F3:CTTGACTTCTTTGCACCAGA
A(G)GATCCGTGGGCAGTTAATG
8F4:CCG
CTCGAG(Xho I)CTCTACGTCAACAATCAATCAATTC
引物8E1和8E2分别引入BamHI和SalI位点,以含全长cry8Ea1的pSS3612质粒DNA为模板,扩增得到全长基因,插入Bt表达载体pSXY422b中,转化大肠杆菌SCS110,提取质粒,电击转化Bt无晶体突变株HD-73-中,得到工程菌BioT8E。
由于cry8Fa1全长基因内存在1个BamHI切点,用重叠引物PCR的方法对这一位点进行了突变,重叠引物8F2和8F3中引入了点突变(划线部分)、引物8F1和8F4分别引入BamHI和XhoI。以含全长cry8Fa1基因的pSS266质粒DNA为模板,分别用8F1和8F2、8F3和8F4扩增得到0.3kb和3.1kb产物,再分别以它们为模板,分别利用引物8F1和8F4扩增得到3.4kb的全长基因,插入Bt表达载体pSXY422b中,转化大肠杆菌SCS110,提取质粒,电击转化Bt无晶体突变株HD-73-中,得到工程菌BioT8F。
分别将上述两株工程菌30℃于牛肉膏培养基中培养30小时,取500μL菌液至Eppendorf管中,超声波破碎30秒钟(B.Braun U Labsonic,230V,T间隔=0.5秒);取100μL加入25μL新配0.5NNaOH,25℃作用5分钟;加入65μL3×样品缓冲液(925μL上样缓冲液+75μLβ-巯基乙醇),100℃煮沸5分钟。离心除去沉淀。上样10uL进行SDS-PAGE电泳分析结果。结果(见附图13)表明,工程菌Biot8E和Biot8F中的cry8Ea1和cry8Fa1基因均获得了表达,表达物的分子量为130kDa左右。
实施例6、Cry8E和Cry8F蛋白的活性测定
将Bt工程菌株接种在普通细菌琼脂克氏瓶培养基上培养3天。将受体菌株HD-73-接种在普通细菌琼脂克氏瓶培养基上培养4天。将培养物洗下,2倍梯度浓度稀释,将40ml菌悬液加入到200g有均匀粗细土豆丝的细土(紫外线灭菌)中,混匀,使土壤含水量保持在18%-20%。接入暗黑鳃金龟15天龄幼虫20头,以加入清水的处理为空白对照,28℃感染饲养,14天检查死虫数,计算LC50。结果(见表8)表明工程菌株对暗黑鳃金龟具有极高的毒杀活性。其表达的Cry8E和Cry8F蛋白均具有杀暗黑鳃金龟幼虫的活性。
附:本发明所涉及的DNA序列和蛋白质序列
SEQ ID NO 1(cry8Ea1基因的核苷酸序列):
ctgcagaata gacacggata cgatcgcctt cacataaatg ctgaaatctt cttctagaca 60
PstI
ttcttgtgtc acctcatttt ttgtttttaa actacagtat gttatatgca aaagaagggg 120
tagaggattg ggccttttac tacaaaaata caaaaacata cttatgattg catatggaga 180
tgtcaaagtg catgcattaa aaatggatta gaaatgattt caaataggca aaagcctatt 240
ccaatgaaga aagattgaca taggctattg tatatagaag aaggtaacga ggaacatact 300
gttagggtac acctaacata gaagtatgct tgtttgaagc atgtacatct tgaaatacca 360
gtagaaatat gggggaacat gttattttaa taattggtaa aatcttttgt tagaaggtga 420
aggcgtatga gacaacaaag agtatgtgag tgtaacaatt gtaggaaaaa gggtgagaag 480
aatcatcttt gtcaatatat aaaacgtggg gattgtatat gggtcagttc ctttggaagc 540
aagttacaaa agagtggtat tttcctgtta ataaaagatt cttttttatt atggtttgat 600
gaaaaacatc aattaaatca aaccagtcta caggggatcc atattgaaaa aagacaataa 660
atgaagaagg agtctcattg ttaacgaagt gtgtacatct atcatgtaca catcgtaagt 720
cgtatgttct acctgtatct ggtaggaaag aattgtcgca tgtgcaaggc gtatatacac 780
aaacatgttt tgttatattt ttgaataatt tgaaaataaa tatgttataa ttaatatact 840
ttcgtgtgtt ttttttgcga aatccctaga aagtatcgta aaaagtccct aacaattttg 900
tgaactgaac ccaaaaaatt agacaaatat attaagcagc tactaaggat tgaactctgt 960
attgcacggg ggacaatcct tttagttttg ttttaattct tttgtgattg tagtaatgaa 1020
tataagtttc tagttcttgc ttaaattgtt ccatactttc aaactcttta agataaagta 1080
attcagactt taataagcca aagaaatttt ccatgactgc attatctaag caatttccct 1140
tacgggacat actttggata acgttatgtt ttttaagcga ctgatgatat tgtcgcattt 1200
gataatgcca accttgatcc gagtgtaaaa taggagtttc cttatcattc aaacattgaa 1260
acgccttatt taacatttta gaaacaaggg aataggcagg tctatgttct atattgtaag 1320
ctataatttc tccgttatat aagtcttaaa tgggtgatag atatagtttt ttaccatgta 1380
agtggaactc cgtcacatct gttacccatt tctcgtttgg ttttgatgcg tgaaaattac 1440
gttttaaaat attaggagcg aatttcccga cagtcccttt atatgaacga tattttttta 1500
atcgaacaag acattttaat cccaggatat tcattaaacg tcgaacggtt ttatgattta 1560
atgcatggcc tcgattacgt aattccaatg taatacgacg ataaccatac ctcccaaaat 1620
tctcactaaa aatctcttta attaattctt taactttctt atatttatct ggacgtttcg 1680
cttgtttcat ccagtaataa tacgtactac gagcgatatt agcgactttc acaaggtcaa 1740
cgaccttata tttatgcctt aattcataaa tcaattgcgc tttgtcttgg tctgtgatgt 1800
tttcttcttt tgaactaagg cattcaactt ttttaaatag tcattttcca tacgcagacg 1860
ttcattctct gcttgtagcg cttctataga accttcaaga aatacttcgt tttgttttaa 1920
atgttgtagc ttagcttttt ctttggccat ggttagacgc ccctttttct ttgattttag 1980
ggcatctaat ccttctgttt cataagctac tttccatttt cggagtgttt cgcaagaagg 2040
aatattaaaa aaagcagctg tttctctcag agatgtccca ttttcattca tataatgaat 2100
tacatctagt ttatactcga gagggtaagt tgtatagcgt ttttcaaacg ccttttcccc 2160
tgaaaattca aaccgtttaa tccattgata aagttctcta ggatgaaccc ctatagaatt 2220
agcaatggtt tttccgcctt ccgtaccttc tagatatcgt tttactgctt gtattttatc 2280
ttttgaagaa aatttagcca taaaaaatgc acctccaatt gttaattatg tgtctaacaa 2340
ttggggtgca cttcattgtt ggggactttt gtatgttcct ttttagacat tcttttaagt 2400
tctttatata gaaagactga agaaagcaga ataagaagtc catcccctga ttcatgagaa 2460
ccgaaaaatt catgatgcac tggatgccaa atatttagat acatttccta ttgatattct 2520
acgagattga attgatgtaa tgttgttccc ttttggtcaa actgaccaat ggcgatagct 2580
ctccttgcat gagtacttct caaacttcca ttacacattg tattccccat cttttttatg 2640
tatatctttt ggggaaaatc gtaatttctg cttatgatga caagatttta ctaaaataag 2700
aagagtggaa tattttactc tatgtcaaac aaaaaagcaa tatatgttta aacgcgaaaa 2760
taatcatcat atcaacaatg cccggtacat aaagatag
gggggattt ttcgaaatga 2820
ORF2→
ttcgaaaagg ctccattgat tcgataggag gtgcacagaa aaaaatggaa gaacaatatg 2880
catcgcaaga tcagtcagat gtagaaggtt tcaagcggaa gaaaaaacat accattccct 2940
ttcaatgtat ggtttctatt ccaacagggt ttcaaattca aaaaccgaat acaccaaaac 3000
ttgtttatga tgtaagccat ttatctatgg taaaagagat gtgtaaacga gtgattgacg 3060
tagaggattg tgggcaagtc gaaatcgatt tacatgtctt aaaaatcaaa ggtgtcttac 3120
cctttattgt gaacgtttcc attgagccgc ttagtatgga acatgtgtat accacaagtg 3180
gtagagacac atccctattt ttaagttgtc aagaaaccgt atatgtggat catattttaa 3240
aatatagtgt cgatcatgtc ccgtattatg tgattgatgg tcatcatatt ctagtacgtg 3300
atgtcgtgat aaagttgttg gaagaaaacc cgcaaacggc tcaaatatca ggtgtttttt 3360
attttgatta tgca
ttt caatagaaac aaaaacgttc tcttatacgg cattcccaaa 3420
End codon
agcatcgcca ccttttttat catacaatag ttcgttctaa gaagagccgt aatatttttc 3480
tatctaacag gaattttatc atctacagaa gaatattctt atcatggtaa tgaggagagg 3540
gattgaaagt caaaagatta cctgatttgt catgtaagaa aaaggaatcg atcgtacagg 3600
aaagtcaaaa gaaagtgtaa aaattttata tcttgtgtat gtata
aaaatag
3660
RBS ORF1→
agtccaaata atcaaaatga atatgaaatt atagatatgg caccttctac atctgtatcc 3720
aatgattcta acagataccc ttttgcgagt gatccaacaa atgcattaca aaatatgaat 3780
tataaagagt atttaagaat gtctgaggga tatgatagtg aatattctgg ctcacctgaa 3840
gtgcttatta gtgagcgaga tgcggttaag acagcaatca gtttggtagg tactatatta 3900
ggaaaattag gagttccatt ggtaggaccg attgtgagcc tatatagtac acttattgat 3960
gttttgtggc caggtggaaa gagtcaatgg gaaattttta tggaacaagt agaagcactt 4020
attaatcaaa aaatagcaga atacgcaagg gctaaggcac ttgcagaatt agaagggtta 4080
ggaaataact atcaattata tttaacagca cttgaagaat ggcaggaaaa tccaagcagt 4140
acaagagtct tacgtgatgt tcggaatcga tttgaaatcc ttgatagctt atttacacaa 4200
tatatgcctt cttttcgggt aacaggttat gaagtaccat tactttcagt atatgcgcaa 4260
gcagctaacc ttcatttatt gttattaaag gacgcttcta tttttggaga agaatggggg 4320
ttctctacaa ccgctattaa taactattat aatcgtcaaa tgagtcttat cgcgcaatat 4380
tctgatcatt gtgtacaatg gtatagaact gggttagatc gattaaaagg atcgaatgct 4440
aaacaatggg ttgaatataa ccgcttccga agagaaatga cattatcggt gttagatatt 4500
atgacattat ttccaatgta tgacatgcgc acgtacccaa tggaaacaaa agcacaacta 4560
acaagggaag tatatacaga tccaattggt gccataggag cgcaaggttc ttggtatgac 4620
tcagcacctt ctttcaatac tctggaaagt acttttataa gaggaaagca tctatttgat 4680
tttataacta gactctctat atatacaggg cgaagctcat tcagtgctag taattactta 4740
aaaaaatgga tagggcatca aatatcctct caacctatag gcggcagtat acaaactcaa 4800
acctatggca ctacgagtgg cagttctgtt attgctacgc agcaaattgg ctttacaggt 4860
tttgacgttt ataagacttt atcaacagcg ggggttctgt ttgcttatac ttcgaaatat 4920
tatggcgtat ctaaagttgt ttttgatgcg atatatcctg acaacaagta taaaacaaca 4980
tttacctata atcctggatc tgaaggtatt ggagcgcaag aaaaggattc agaagttgaa 5040
ttgccaccag aaacattaga tcaacccaat tatgaggcgt atagccatag attgaattat 5100
gttacattta ttagaaatcc agatgtacca gtattttctt ggacacatcg gagtgcggat 5160
cgtacgaata cagtttattc agataaaatc actcaaatac cagttgtaaa ggccagtgac 5220
ggccctaaac cttccgctaa cgaagttgga cactatcttg gtggagatcc aatatcattt 5280
aactcttctg gtagcactgg agtgataagg ttaaatataa attcaccatt atcccaaaaa 5340
taccgtgtga gaattcgcta ttgctcttca gttgattttg acttagatgt agttcgtgga 5400
ggcactactg taaataatgg tagatttaac aaaagcgcgc ctaacgtcgg atggcaaagt 5460
ttgaagtatg aaaattttaa atttgcaagc ttttctacac cttttacatt taatcaagct 5520
caagatacat taaaaataag tgtaaggaat tttagttcaa tcgtaggagg cagcgtagtt 5580
tatatagacc gaatcgagct catcccagta aatgcaacat atgaggcaga acaagattta 5640
gattcggcaa agaaagcagt gaataccttg tttacgaata caaaagatgg tttacgacca 5700
ggggtaacgg attatgaagt gaatcaagcg gcaaacttag tggaatgcct atcggatgat 5760
ttgtatccaa atgaaaaacg cttgttattt gatgcagtga aagaggcaaa acgactcagc 5820
gaggcacgta acttactaca agatccagat ttccaagaga taaatggaga aaatggatgg 5880
accgcaagta caggaattga ggttgtagaa ggagatgctc tatttaaagg gcgttatcta 5940
cgcctaccag gtgcgagaga aatggataca gaaacgtatc caacgtatct gtatcaaaaa 6000
gtagaggaag gtgtattaaa accatacaca agatatagat tgagagggtt tgtcggaagc 6060
agtcaaggct tggaaatttc cacaattcgt catcagacga accgaattgt aaaaaatgtt 6120
ccagatgatt tattaccaga tgtacctcct gtaaactctg atggtagaat caatcgatgc 6180
agcgaacaaa agtatgtgaa tagccgttta gaaggagaaa gaggattacc aaatgggaat 6240
cgttctgctg aagcgcatga attctctctc cctattgata taggagagct ggattacaat 6300
gaaaatgcag gaatatgggt tggatttaag attacggacc cagagggata tgcaacactc 6360
ggtaaccttg aattggtaga agagggacca ttgtcaggag acgcactaga acgcctgcaa 6420
agagaagaac aacagtggaa gcttcaaatg acaaaaagac gtgaagagac ggatagaaaa 6480
tatacggcag caaaacaagc ggtagatcgt ttatatgcag attaccaaga tcaacaattg 6540
aatccaaacg tagaaattac ggatattact gcggcccaaa acctgataca gtccattcct 6600
tatgtatata atgaaatgtt cccagaaata caagggatga actatacgaa gtacacagag 6660
ttaacaaatc gactccaaca agcgtggggt ttgtatgatc aacgaaacgc cataccaaat 6720
ggtgatttcc gaaatgaatt aagtaattgg aatacaacat ctggtgtaaa tgtacaacaa 6780
atcaacaata cgtctgtctt agtcatgcca aactgggatg ggcaagtttc gcaacagttt 6840
acagttcaac cgaatcaaag atatgtatta cgagttactg caagaaaaga aggggtaggg 6900
aatgggtatg tgagtatccg tgatggtgga aatcaaacag aaacgcttac gtttagtgca 6960
agcgattata acacagatag tgtgtataat acgcaagtgt cgaatacaaa tggtttgtac 7020
aatgagcaaa caggatatac cacaaaaaca gtgacattca tcccatatac agatcaagtg 7080
tggattgaga tgagcgagac cgaaggtatg ttctatatag aaagtgtcga attgattgtt 7140
gacgtagag
tggtagta cccctccaga tacaggtttc atctggaggg gtttttttct 7200
End codon
gaaaaagggc ctttttgtag agaagaatcc gattatttta ttacgattat atattttgtg 7260
gatagatcat
ggtacc 7276
KpnI
SEQ ID NO 2(Cry8Ea1蛋白的氨基酸序列):
MSPNNQNEYE IIDMAPSTSV SNDSNRYPFA SDPTNALQNM NYKEYLRMSE GYDSEYSGSP 60
EVLISERDAV KTAISLVGTI LGKLGVPLVG PIVSLYSTLI DVLWPGGKSQ WEIFMEQVEA 120
LINQKIAEYA RAKALAELEG LGNNYQLYLT ALEEWQENPS STRVLRDVRN RFEILDSLFT 180
QYMPSFRVTG YEVPLLSVYA QAANLHLLLL KDASIFGEEW GFSTTAINNY YNRQMSLIAQ 240
YSDHCVQWYR TGLDRLKGSN AKQWVEYNRF RREMTLSVLD IMTLFPMYDM RTYPMETKAQ 300
LTREVYTDPI GAIGAQGSWY DSAPSFNTLE STFIRGKHLF DFITRLSIYT GRSSFSASNY 360
LKKWIGHQIS SQPIGGSIQT QTYGTTSGSS VIATQQIGFT GFDVYKTLST AGVLFAYTSK 420
YYGVSKVVFD AIYPDNKYKT TFTYNPGSEG IGAQEKDSEV ELPPETLDQP NYEAYSHRLN 480
YVTFIRNPDV PVFSWTHRSA DRTNTVYSDK ITQIPVVKAS DGPKPSANEV GHYLGGDPIS 540
FNSSGSTGVI RLNINSPLSQ KYRVRIRYCS SVDFDLDVVR GGTTVNNGRF NKSAPNVGWQ 600
SLKYENFKFA SFSTPFTFNQ AQDTLKISVR NFSSIVGGSV VYIDRIELIP VNATYEAEQD 660
LDSAKKAVNT LFTNTKDGLR PGVTDYEVNQ AANLVECLSD DLYPNEKRLL FDAVKEAKRL 720
SEARNLLQDP DFQEINGENG WTASTGIEVV EGDALFKGRY LRLPGAREMD TETYPTYLYQ 780
KVEEGVLKPY TRYRLRGFVG SSQGLEISTI RHQTNRIVKN VPDDLLPDVP PVNSDGRINR 840
CSEQKYVNSR LEGERGLPNG NRSAEAHEFS LPIDIGELDY NENAGIWVGF KITDPEGYAT 900
LGNLELVEEG PLSGDALERL QREEQQWKLQ MTKRREETDR KYTAAKQAVD RLYADYQDQQ 960
LNPNVEITDI TAAQNLIQSI PYVYNEMFPE IQGMNYTKYT ELTNRLQQAW GLYDQRNAIP 1020
NGDFRNELSN WNTTSGVNVQ QINNTSVLVM PNWDGQVSQQ FTVQPNQRYV LRVTARKEGV 1080
GNGYVSIRDG GNQTETLTFS ASDYNTDSVY NTQVSNTNGL YNEQTGYTTK TVTFIPYTDQ 1140
VWIEMSETEG MFYIESVELI VDVE 1164
SEQ ID NO 3(cry8Fa1基因的核苷酸序列):
atcgataaag ggaatggaag acaactcgca aatggctcaa atatcgggtg ttttttattt 60
tgattatgca taattacaat gaaaacaaaa agaattcatt tgtatagtat tcccagaaat 120
atcgtgacat cgtttatcat acaataattc gttctaagaa gagccggatt atttttcaat 180
ctaacaggaa ttttattgtc tacagaagaa tattcttatc acggtaatga ggagagggag 240
tgaaaatcaa aagagtacct gatttgtcat gtaagaacaa aagaaatcga tcgtacagga 300
aagtcaaaag aaagtgtaaa aaattttata tcttttgtat gtata
aaaatag
360
RBS ORF→
agtccaaata atcaaaatga atatgaaatt atagatatgg caccttctac atctgtaacc 420
aatgattcta acagataccc ttttgcgaat gagcccacaa atgcattaca aaatatgaat 480
tataaggatt atttaagaat gtctgaggga tattctcctg aatatttaac aagcctaagt 540
ccttacagcc agtttggcac agttgataag atcatcagta ttattagtct attgaatagt 600
gctgcaggta ttcctggtct tgattttttt actggattgc tgcaatttat tcttgacttc 660
tttgcaccag aggatccgtg ggcagagtta atggaactag tggaacaact catagatcaa 720
aaaataacag ttgctacaag agaaaaggcg ctcgcagaat taagaggact gataaatgga 780
taccttgtat atcagcaatc attagaaagt tggctggaaa atccaaatgc tacaagagct 840
agtatagttc gagaacaata tgtcgcttta gaacttgatt ttgttacttc gatttcatct 900
tttgcgatag ctggacagga agtaccgtta ttagccgtgt acgcacaagc tgctaattta 960
catttgttat tattgagaga tgtgtcaata tttggagaag aatggggatt aacagtaaat 1020
gaggttaata ccttctatat tcgtcaaatg acttatacaa ctgagtatag tgattattgt 1080
gtaagaattt ataatactgg cttaaataaa ttaaaaggat ctagtgcatc tagttgggtt 1140
gattataatc gctttcgtag agaaatgaac ttactagtac tagatattat tgcgttattt 1200
ccaaactatg atgttcgtag gtatccaatg gaaacaacaa cggaattaac aagagtagtt 1260
tacactgatc caattgtgtt tgacgaaagg aagggggtgg cgtcgactca tagttggacg 1320
gcgattgcac catctttctc aagtatagaa tctctaactc gacgaccagg attatttaca 1380
tggttagatc aactaactat tttttcgaaa cgcatatcgc aacctagtgt atttataaat 1440
agttgggcgg ggcataagat tagcaccttt agaacacaaa aaacagatat actcataaat 1500
accacccatg gagatactaa taatcctata aaagaatttg tagtagatac caaaaaagta 1560
gaagatattt atcaaacgat agcataccca catgcagtag caaatgaagt attctattta 1620
ttcggtgtcc caaaagttga ttttaatatg gtacctgcag gtggctctgc aaactctgca 1680
cacaccctca ttttttctga tagtacggga gggagactgg aaagtattac gaagaactca 1740
gaagcagaat tacctccaac agagtcatta tcagatacac ctcaaccaaa ccaagtaact 1800
tattctcaca gattagatta tgctacaata attaaagcaa ataaaagtta tggaagtggg 1860
tatattccat tattaggttg gacccatcgg agtgtagatc gtaataatac aatttatccg 1920
aataaaatca ctcaaatacc agcagtaaaa gctttctcat atactgaatc atttaatgta 1980
aatgttattg caggtccagg attcacagga ggagatttaa taagtttagg tcatttagag 2040
aatatttata tgaaattaaa cgttccaaat cctcaaaaat tccgtgttcg tattcgttat 2100
gctgctagta caacttcgta tttgcaaata actgggctat ctaatttagc tcagtctgat 2160
cgtttcgaac agacgtattc taatgaaaat gaaaacaatt tgatgtttga aaattttcaa 2220
tatgtagaac ttagaaatat tttttcggta gatgctccat tagaaaatca tcaagtaagt 2280
atacaaaatt atcaaggtaa tggttttgtt attatagacc gaatcgaatt catcccagta 2340
aatgcaacat atgaggcaga acaagattta gattcggcaa agaaagcagt gaataccttg 2400
tttacgaata caaaagatgg tttacgacca ggggtaacgg attatgaagt gaatcaagcg 2460
gcaaacttag tggaatgcct atcggatgat ttgtatccaa atgaaaaacg cttgttattt 2520
gatgcagtga aagaggcaaa acgactcagc gaggcacgta acttactaca agatccagat 2580
ttccaagaga taaatggaga aaatggatgg accgcaagta caggaattga ggttgtagaa 2640
ggagatgctc tatttaaagg gcgttatcta cgcctaccag gtgcgagaga aatggataca 2700
gaaacgtatc caacgtatct gtatcaaaaa gtagaggaag gtgtattaaa accatacaca 2760
agatatagat tgagagggtt tgtcggaagc agtcaaggct tggaaatttc cacaattcgt 2820
catcagacga accgaattgt aaaaaatgtt ccagatgatt tattaccaga tgtacctcct 2880
gtaaactctg atggtagaat caatcgatgc agcgaacaaa agtatgtgaa tagccgttta 2940
gaaggagaaa gaggattacc aaatgggaat cgttctgctg aagcgcatga attctctctc 3000
cctattgata taggagagct ggattacaat gaaaatgcag gaatatgggt tggatttaag 3060
attacggacc cagagggata tgcaacactc ggtaaccttg aattggtaga agagggacca 3120
ttgtcaggag acgcactaga acgcctgcaa agagaagaac aacagtggaa gcttcaaatg 3180
acaaaaagac gtgaagagac ggatagaaaa tatacggcag caaaacaagc ggtagatcgt 3240
ttatatgcag attaccaaga tcaacaattg aatccaaacg tagaaattac ggatattact 3300
gcggcccaaa acctgataca gtccattcct tatgtatata atgaaatgtt cccagaaata 3360
caagggatga actatacgaa gtacacagag ttaacaaatc gactccaaca agcgtggggt 3420
ttgtatgatc aacgaaacgc cataccaaat ggtgatttcc gaaatgaatt aagtaattgg 3480
aatacaacat ctggtgtaaa tgtacaacaa atcaacaata cgtctgtctt agtcatgcca 3540
aactgggatg ggcaagtttc gcaacagttt acagttcaac cgaatcaaag atatgtatta 3600
cgagttactg caagaaaaga aggggtaggg aatgggtatg tgagtatccg tgatggtgga 3660
aatcaaacag aaacgcttac gtttagtgca agcgattata acacagatag tgtgtataat 3720
acgcaagtgt cgaatacaaa tggtttgtac aatgagcaaa caggatatac cacaaaaaca 3780
gtgacattca tcccatatac agatcaagtg tggattgaga tgagcgagac cgaaggtatg 3840
ttctatatag aaagtgtcga attgattgtt gacg
agt aatggtagta cccctccaga 3900
End codon
tacaggtttc atctggaggg gtttttttct gaaaaagggc ctttttgtag agaagaatcc 3960
gattatttta ttacgattat atattttgtg gatagatcat ggtacc 4006
SEQ ID NO 4(Cry8Fa1蛋白的氨基酸序列):
MSPNNQNEYE IIDMAPSTSV TNDSNRYPFA NEPTNALQNM NYKDYLRMSE GYSPEYLTSL 60
SPYSQFGTVD KIISIISLLN SAAGIPGLDF FTGLLQFILD FFAPEDPWAE LMELVEQLID 120
QKITVATREK ALAELRGLIN GYLVYQQSLE SWLENPNATR ASIVREQYVA LELDFVTSIS 180
SFAIAGQEVP LLAVYAQAAN LHLLLLRDVS IFGEEWGLTV NEVNTFYIRQ MTYTTEYSDY 240
CVRIYNTGLN KLKGSSASSW VDYNRFRREM NLLVLDIIAL FPNYDVRRYP METTTELTRV 300
VYTDPIVFDE RKGVASTHSW TAIAPSFSSI ESLTRRPGLF TWLDQLTIFS KRISQPSVFI 360
NSWAGHKIST FRTQKTDILI NTTHGDTNNP IKEFVVDTKK VEDIYQTIAY PHAVANEVFY 420
LFGVPKVDFN MVPAGGSANS AHTLIFSDST GGRLESITKN SEAELPPTES LSDTPQPNQV 480
TYSHRLDYAT IIKANKSYGS GYIPLLGWTH RSVDRNNTIY PNKITQIPAV KAFSYTESFN 540
VNVIAGPGFT GGDLISLGHL ENIYMKLNVP NPQKFRVRIR YAASTTSYLQ ITGLSNLAQS 600
DRFEQTYSNE NENNLMFENF QYVELRNIFS VDAPLENHQV SIQNYQGNGF VIIDRIEFIP 660
VNATYEAEQD LDSAKKAVNT LFTNTKDGLR PGVTDYEVNQ AANLVECLSD DLYPNEKRLL 720
FDAVKEAKRL SEARNLLQDP DFQEINGENG WTASTGIEVV EGDALFKGRY LRLPGAREMD 780
TETYPTYLYQ KVEEGVLKPY TRYRLRGFVG SSQGLEISTI RHQTNRIVKN VPDDLLPDVP 840
PVNSDGRINR CSEQKYVNSR LEGERGLPNG NRSAEAHEFS LPIDIGELDY NENAGIWVGF 900
KITDPEGYAT LGNLELVEEG PLSGDALERL QREEQQWKLQ MTKRREETDR KYTAAKQAVD 960
RLYADYQDQQ LNPNVEITDI TAAQNLIQSI PYVYNEMFPE IQGMNYTKYT ELTNRLQQAW 1020
GLYDQRNAIP NGDFRNELSN WNTTSGVNVQ QINNTSVLVM PNWDGQVSQQ FTVQPNQRYV 1080
LRVTARKEGV GNGYVSIRDG GNQTETLTFS ASDYNTDSVY NTQVSNTNGL YNEQTGYTTK 1140
TVTFIPYTDQ VWIEMSETEG MFYIESVELI VDVE 1174