WO2014026345A1 - Method for extracting full-length gene of misp of araneus ventricosus and expressing same - Google Patents

Method for extracting full-length gene of misp of araneus ventricosus and expressing same Download PDF

Info

Publication number
WO2014026345A1
WO2014026345A1 PCT/CN2012/080204 CN2012080204W WO2014026345A1 WO 2014026345 A1 WO2014026345 A1 WO 2014026345A1 CN 2012080204 W CN2012080204 W CN 2012080204W WO 2014026345 A1 WO2014026345 A1 WO 2014026345A1
Authority
WO
WIPO (PCT)
Prior art keywords
misp
length
full
gene
spider
Prior art date
Application number
PCT/CN2012/080204
Other languages
French (fr)
Chinese (zh)
Inventor
孟清
陈格飞
Original Assignee
东华大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 东华大学 filed Critical 东华大学
Priority to PCT/CN2012/080204 priority Critical patent/WO2014026345A1/en
Publication of WO2014026345A1 publication Critical patent/WO2014026345A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/43504Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
    • C07K14/43513Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from arachnidae
    • C07K14/43518Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from arachnidae from spiders

Definitions

  • the present invention relates to the field of genetic engineering technology, and in particular to a method for extracting a full-length gene of MiSp (minor ampul late spidroin) and a full-length MiSp gene expression.
  • each egg contains 500 to 1000 eggs
  • the host mainly harms rice, wheat, cotton, fruit trees and other crops.
  • the abdomen can secrete six kinds of silk protein fibers and one mucus protein.
  • the minor ampul late si lk is a high-quality protein-polymerized fiber with high strength, high elasticity, radiation resistance and heat stability. And biodegradable characteristics, no super-shrinkage in water environment, stable mechanical properties (see Table 1): Table 1 Comparison of performance parameters of spider webs
  • MiSp full-length gene sequence studies have shown that spider silk protein has a highly repetitive primary structure, and its GC content is high, so the full-length gene coding sequence cannot be obtained by PCR alone. Therefore, the cloning of spider silk protein MiSp has been studied so far. The progress is slow.
  • the cDNA library is constructed by cDNA library and hybridization technology to obtain related sequences in the world, but it is only a partial short cDNA sequence of MiSp (see Table 2). So far, there is no MiSp complete genome sequence. Report; Table 2 Partially shorter cDNA sequences of MiSp
  • the existing expression systems including the host: Escherichia coli, Pichia, plants (tobacco, potatoes and Arabidopsis, etc.), mammalian cells, transgenic mice and goats, transgenic silkworms, etc., will be part of the natural gene Expression and artificial synthesis of repetitive module tandem expression, although the expression system is economical, easy to operate, and has the shortest cycle, but due to the problem of codon preference, the protein yield is low or the expression is terminated prematurely and the large molecular weight protein cannot be expressed, and The obtained recombinant incomplete spider silk protein or the artificial splicing tandem body combination protein is far from the natural spider silk protein. In order to make the spider silk protein and fiber large-scale application, it is urgent to realize a complete spider. Silk protein expression technology.
  • the object of the present invention is to overcome the deficiencies of the prior art and to provide a method for extracting the full-length MiSp gene of the large abalone, and realizing the expression of the full MiSp full-length gene.
  • a method for extracting the full-length MiSp gene of the big belly plant was designed. According to this method, the full-length MiSp gene of the large abdomen was obtained, which is characterized by the nucleotide sequence of the full-length MiSp gene of the big belly plant. SEQ ID NO. 1 is shown.
  • the gene is 10929 bp in length and contains two exons of 1428 bp and 3870 bp in length and an intron of 5628 bp in length.
  • a method for extracting a full-length MiSp gene from a spider characterized in that the method comprises the following steps:
  • He 1J was isolated from high Molecular Weight Genomic DNA (HMW-gDNA) by modified CTAB (cetyltriethylammnonium bromide) method;
  • the abdomen fosmid genomic library produced a total of 3. 9 X 10 5 clones, and transferred one by one to about 1026 384-well cell culture plates, and stored at 8% concentration of glycerol at -80 ° C.
  • the 1026 384-well cell culture plates were divided into approximately 26 batches of transfer screening. Each batch was transferred to 40 384-well cell culture plates, and the LB medium was transferred to 40 sterile 384 wells using an 8 X 100 L pipette. a cell culture plate, 60 ⁇ L per well, the LB medium containing 12. 5 g / mL chloramphenicol; B.
  • Construction of the primary mixing tank a) Mix all fosmid genomic library clones in the first batch of 384-well cell culture plates in the first batch with an 8X10 ⁇ 1 pipette, 5 L of bacteria per well, and add to 5 In mL LB medium, the mixture was labeled as No. 1 (384 library clones), and two tubes were aliquoted at 37 ° C and 180 rpm for 5 h. The remaining 39 384-well cell culture plates were cloned in the same manner. Mix, labeled as No. 2 Mix, No. 3 Mix... No. 40 Mix; b) Arrange 40 parts of the mixture obtained in the above procedure in 5 rows and 8 columns, and use 5X 100 ⁇ L pipette to set 5 rows and 8 columns.
  • FD4-16 is a secondary pool (Secondary pools); D. Build a super pool. Mix the first mixture of the first-stage mixing tank with 10 L and add it to 5 mL of LB medium, which is a super pool at 37 °C. Incubate at 180 rpm for 10 h.
  • the CTAB method is a modified CTAB method comprising the following steps: a) grinding: taking the treated spider chest muscles with 89% (v/v) pre-cooled CTAB extraction buffer [2% (g/) V) CTAB, 1. 4 mol/L NaCl, 100 ⁇ ol/L Tris-HCl (pH 8. 0), 20 mmol/L EDTA (pH 8. 0), 2% (g/V) PVP (polyvinylpyrrole) Anthrone)], 1% (V/V) SDS-LDS Master Mix [10%
  • NT+RP and the cleavage protein intron Inl N-terminal domain coding gene with high splicing activity was obtained, namely NT+RP+InlN; the second part of RP 5' The end is fused to the C-terminal domain of Inl, and the RP 3' end is fused to the N-terminal domain of In2, ie, InlC+RP + In2N ; the third part of RP+CT is fused to the C-terminal domain of In2, ie In2 C + RP+CT; (3) The fusion expression product was purified by Ni-NTA.
  • the purified product After mixing the purified NT+RP+InlN with InlC+RP + In2N, InlN and InlC spontaneously undergo trans-splicing reaction, and then self-cleave from the fusion protein, and then carry the MiSp fragment through peptide bond, Ni -NTA reverse purification (removal of Inl), the purified product continues to mix with the third part of In2 C+RP+CT, In2N and In2C are spatially close to each other, spontaneous trans-splicing reaction, and itself is cut from the MiSp fragment, and The MiSp fragments at both ends were linked by peptide bonds, and the recombinant full-length MiSp full-length protein was obtained by a three-step directional splicing reaction.
  • the present invention displays the full MiSp genome sequence of the first large-bellied spider, including the full-length gene of the spider protein MiSp in the large abdomen, and the MiSp full-length gene sequence of the large abalone is more related to other species.
  • This silk protein gene is predicted to be small, but further provides for the expression of the full-length protein.
  • the full-length coding sequence provides gene support for studying the phylogeny of spider silk protein and obtaining full-length recombinant MiSp protein and then bionic high-performance artificial spider silk; in the expression of the big belly plant spider MiSp gene, the present invention adopts protein intron The splicing and Rosetta2 (DE3) expression strains replace the traditional E.
  • the MiSp recombinant protein is segmentally expressed by the trans-splicing reaction of the cleavage protein intron with high splicing activity, through the protein intron Mediated trans-splicing reaction directional splicing of fusion protein in vitro, the recombinant full-length spider silk MiSp protein was obtained, and the natural full-length spider silk protein MiSp gene in Escherichia coli-derived bacteria was completed for the first time at home and abroad.
  • the recombinant expression in the middle provides a material basis for the biomimetic study of spider silk.
  • FIG. 1 is a schematic flow chart of a method for constructing and screening a fosmid library of a large abdomen in the present invention
  • FIG. 2 is a schematic flow chart of a full-length expression method of a large belly plant MiSp according to the present invention
  • 3 is a full-length protein sequence of the Great Spider Mi Mi obtained in the present invention
  • the technical scheme of the present invention mainly extracts the full-length MiSp gene of the spider, and fully expresses the full-length MiSp gene into the full-length protein.
  • the method used to extract the full-length MiSp gene of the big belly spider including the following steps:
  • step d) repeat step d) until the white protein layer is not visible at the interface between the aqueous phase and the organic phase, and the supernatant is taken;
  • the fosmid genomic library of the abdomen garden spiders produced a total of 3. 9 X 10 5 clones, and transferred one by one to about 1026 384-well cell culture plates, and stored at 8% concentration of glycerol at -80 ° C.
  • the 1026 384-well cell culture plates were divided into approximately 26 batches of transfer screens. Each batch was transferred to 40 384-well cell culture plates, and the LB medium was transferred to 40 sterile 384 cells using an 8 X 100 ⁇ L pipette. a cell culture plate, 60 ⁇ L per well, the LB medium containing 12. 5 g / mL chloramphenicol;
  • B. Construction of the primary mixing tank a) Mix all fosmid genomic library clones in the first batch of 384-well cell culture plates in the first batch with an 8 X 10 ⁇ 1 pipette, 5 L of bacteria per well, and add Into 5 mL of LB medium, labeled as No. 1 (384 library clones), and two tubes were incubated at 37 ° C and 180 rpm for 5 h. The remaining 39 384-well cell culture plates were separately prepared in the same manner. Clone and mix, labeled as No. 2 Mix, No. 3 Mix... No.
  • the expression method mainly uses the trans-protein intron-mediated trans-splicing technique to complete the segmental expression of the large abdomen spider MiSp.
  • the protein intron intein
  • the protein intron can spontaneously undergo a splicing reaction, and two The exons on the side are joined by peptide bonds.
  • the two separate polypeptide fragments can be joined to form a complete protein by trans-splicing of the broken protein intron, as follows:
  • NT+RP and the cleavage protein intron Inl N-terminal domain encoding gene with high splicing activity is expressed, ie NT+RP+InlN; the second part of RP 5 ' The end is fused to the C-terminal domain of Inl, and the RP 3 'end is fused to the N-terminal domain of In2, ie, InlC+RP + In2N ; the third part of RP+CT is fused to the C-terminal domain of In2, ie, In2 C + RP+CT;
  • the fusion expression product was purified by Ni-NTA. After mixing the purified NT+RP+InlN with InlC + RP + In2N, InlN and InlC spontaneously undergo trans-splicing reaction, and then self-cleave from the fusion protein, and carry the MiSp fragment through peptide bond, Ni -NTA reverse purification (removal of Inl), the purified product continues to mix with the third part of In2 C+RP+CT, In2N and In2C are spatially close to each other, spontaneous trans-splicing reaction, and itself is cut from the MiSp fragment, and The MiSp fragments at both ends were linked by peptide bonds, and the recombinant full-length MiSp was obtained by a three-step directional splicing reaction.
  • the MiSp full-length protein sequence of the large abalone can be obtained as shown in Figure 3, containing a 5301 bp MiSp full-length transcript, 1766 encoding amino acids, non-repetitive N-terminal and C-terminal modules, and glycine and C.
  • the repeating module consisting of the amino acid consists of the GX, GGX, GGGX and oligo_A repeats motifs and two identical Spacer sequences.

Abstract

Disclosed is a method for extracting the full-length gene of MiSp of Araneus ventricosus and expressing same. The nucleotide sequence of the obtained full-length gene of MiSp of Araneus ventricosus with a full length of 10929bp is as shown by SEQ ID NO.1, which contains two exons with a length of 1428bp and 3870bp, respectively, and an intron with a length of 5628bp. The expression method of the full-length gene of MiSp is by means of directional splicing via a trans-splicing reaction which cleaves the introns of a protein, and the recombinant expression of the native full-length spider silk protein MiSp gene of Araneus ventricosus in bacteria derived from E. coli is completed and the full-length protein of MiSp of Araneus ventricosus is obtained.

Description

一种提取大腹园蛛 MiSp全长基因及全长基因表达的方法  Method for extracting full-length gene and full-length gene expression of MiSp
[技术领域] 本发明涉及基因工程技术领域, 具体的说是一种提取大腹园蛛蛛丝蛋白 MiSp (minor ampul late spidroin ) 全长基因及 MiSp全长基因表达的方法。 [Technical Field] The present invention relates to the field of genetic engineering technology, and in particular to a method for extracting a full-length gene of MiSp (minor ampul late spidroin) and a full-length MiSp gene expression.
[背景技术] 大腹园蛛 ( ΑΓ3删 s ventricosus ) 属蜘蛛目(Araneae), 园蛛科, 主要分 布于湖北、 湖南、 四川、 江苏、 浙江、 安徽、 贵州、 云南、 河南、 河北青海、 新疆等, 该物种体型大, 呈灰褐色, 习惯在屋檐、 庭园、 树丛间结大型车轮状 垂直圆网, 夜间居网的中心, 白天在网旁的缝隙或树叶丛中隐蔽, 卵袋产于墙 或树皮裂缝等处, 每卵袋中含卵 500〜1000个, 寄主主要危害水稻、 小麦、 棉 花、 果树等作物。 大腹园蛛可分泌六种丝蛋白纤维和一种粘液蛋白, 其中的临时捕获丝 (minor ampul late si lk ) 是一种优质的蛋白聚合纤维, 具有强度高、 弹性大、 抗辐射、 热稳定和生物可降解等特性, 在水环境中不产生超收縮现象, 机械性 能稳定 (见表 1 ) : 表 1 大腹园蛛蜘丝参数性能对比 [Background Art] Large abdomen spider (ΑΓ3 deleted s ventricosus) belongs to the genus Araneae, arachnid, mainly distributed in Hubei, Hunan, Sichuan, Jiangsu, Zhejiang, Anhui, Guizhou, Yunnan, Henan, Hebei Qinghai, Xinjiang. Etc., the species is large in size and grayish brown. It is used to form a large wheel-shaped vertical circular net between eaves, gardens and trees. The center of the night net is concealed in the gaps or leaves in the net during the day. The egg pouch is produced. Walls or bark cracks, etc., each egg contains 500 to 1000 eggs, the host mainly harms rice, wheat, cotton, fruit trees and other crops. The abdomen can secrete six kinds of silk protein fibers and one mucus protein. The minor ampul late si lk is a high-quality protein-polymerized fiber with high strength, high elasticity, radiation resistance and heat stability. And biodegradable characteristics, no super-shrinkage in water environment, stable mechanical properties (see Table 1): Table 1 Comparison of performance parameters of spider webs
Material Strength (N m-2) Elongation (%) Energy to break (J kg-1) Material Strength (N m-2) Elongation (%) Energy to break (J kg-1)
Dragline silk 4x l09 35 4> < 105 Dragline silk 4x l0 9 35 4>< 10 5
Minor ampullate silk l x lO9 5 3> < 104 Minor ampullate silk lx lO 9 5 3>< 10 4
Kevlar 4x l09 5 3> < 104 Kevlar 4x l0 9 5 3>< 10 4
Tendon l x lO6 5 5> < 103 Tendon lx lO 6 5 5>< 10 3
Some data from Gosline, J. M.; Dennv, M. W.; DeMont, M. E. Nature 1984, 309, 551. 因此该种丝蛋白纤维在军工、 航空航天以及生 医学工程等领域有着巨大 的潜在应用价值, 但由于蜘蛛具有种内残食性, 天然产丝量远低于家蚕, 通过 驯养集取蛛丝效率低下, 因而利用基因工程技术合成蛛丝蛋白进而纺制成蛛丝 成为仿生蛛丝开发的必经之路。 利用基因工程技术合成临时捕获丝的首要步骤便是获取大腹园蛛蛛丝蛋白Some data from Gosline, JM; Dennv, MW; DeMont, ME Nature 1984, 309, 551. Therefore, this silk protein fiber has a huge field in military, aerospace and biomedical engineering. The potential application value, but because the spider has intra-species residual nature, the natural silk production is much lower than that of the silkworm. The domestication of the spider silk is inefficient, so the genetic engineering technology is used to synthesize the spider silk protein and then spun into a spider silk to become a bionic spider. The only way to develop silk. The first step in the synthesis of temporary capture silk using genetic engineering techniques is to obtain the spider protein of the abdomen
MiSp全长基因序列, 研究表明, 蛛丝蛋白具有高度重复的一级结构, 其 GC含量 高, 因此单纯 PCR技术无法获取其全长基因编码序列, 因此到目前为止, 蛛丝 蛋白 MiSp的克隆研究进展缓慢, 在世界范围内均是通过构建 cDNA文库及结合 杂交技术筛选 cDNA文库来获取相关序列, 但也仅仅是 MiSp的部分较短 cDNA序 列 (见表 2), 迄今还没有 MiSp完整基因组序列的报道; 表 2 MiSp的部分较短 cDNA序列 MiSp full-length gene sequence, studies have shown that spider silk protein has a highly repetitive primary structure, and its GC content is high, so the full-length gene coding sequence cannot be obtained by PCR alone. Therefore, the cloning of spider silk protein MiSp has been studied so far. The progress is slow. The cDNA library is constructed by cDNA library and hybridization technology to obtain related sequences in the world, but it is only a partial short cDNA sequence of MiSp (see Table 2). So far, there is no MiSp complete genome sequence. Report; Table 2 Partially shorter cDNA sequences of MiSp
Species cDNA/DNA Length (bp) Accession NO. YearSpecies cDNA/DNA Length (bp) Accession NO. Year
Nephila clavipes cDNA 3060 AF027735 1998Nephila clavipes cDNA 3060 AF027735 1998
Nephila clavipes cDNA 855 AF027736 1998Nephila clavipes cDNA 855 AF027736 1998
Nephila clavipes cDNA 472 AF027737 1998Nephila clavipes cDNA 472 AF027737 1998
Nephila ntipodiana cDNA 1189 DQ338462 2006Nephila ntipodiana cDNA 1189 DQ338462 2006
Latrodectus esperus cDNA 1585 EU394445 2008Latrodectus esperus cDNA 1585 EU394445 2008
Latrodectus esperus cDNA 1043 HM752571 2010Latrodectus esperus cDNA 1043 HM752571 2010
Latrodectus esperus cDNA 973 HM752570 2010Latrodectus esperus cDNA 973 HM752570 2010
Argiope argentata cDNA 540 JQ713003 2012Argiope argentata cDNA 540 JQ713003 2012
Argiope argentata cDNA 1195 JQ713004 2012Argiope argentata cDNA 1195 JQ713004 2012
Metepeira randiosa cDNA 1566 HM752575 2010Metepeira randiosa cDNA 1566 HM752575 2010
Metepeira andiosa cDNA 1160 HM752569 2010Metepeira andiosa cDNA 1160 HM752569 2010
Uloborus diversus cDNA 1550 HM752574 2010Uloborus diversus cDNA 1550 HM752574 2010
Uloborus diversus cDNA 2646 DQ399332 2006Uloborus diversus cDNA 2646 DQ399332 2006
Deinopis spinosa cDNA 1604 DQ399324 2006Deinopis spinosa cDNA 1604 DQ399324 2006
Parawixia bistriata cDNA 1709 GQ275358 2011Parawixia bistriata cDNA 1709 GQ275358 2011
Parawixia bistriata cDNA 1754 GQ275359 2012Parawixia bistriata cDNA 1754 GQ275359 2012
Parawixia bistriata cDNA 1434 GQ275360 2012 大腹园蛛蛛丝蛋白的重组是基于获取的大腹园蛛蛛丝蛋白 MiSp全长基因序 列, 但由于蛛丝蛋白编码基因大、 重度度高以及编码基因 GC含量高等使得该基 因在表达宿主中遗传不稳定、 容易形成稳定的 mRNA二级结构等, 从而导致基 因转录的提前终止、 缺失、 基因重排以及翻译的提前终止等许多问题, 使得现 有的表达系统无法完成该种超高分子量蛋白的全长表达。 现有的表达系统, 采 用的宿主主要包括: 大肠杆菌、 毕赤酵母、 植物 (烟草、 土豆和拟南芥等)、 哺乳动物细胞、 转基因小鼠及山羊、 转基因家蚕等, 将部分天然基因的表达和 人工合成重复模块串联体的表达, 虽然该表达系统经济、 操作容易、 周期最短, 但是由于存在密码子偏爱性的问题导致蛋白产量低或者表达提前终止以及不能 表达大分子量蛋白的问题, 而且得到的均是重组的不完整蛛丝蛋白或者是人工 拼接串联体的组合蛋白, 与天然的蛛丝蛋白相差甚远, 为了使得蛛丝蛋白及纤 维的大规模应用, 急需一种能够实现完整蛛丝蛋白表达的技术。 Parawixia bistriata cDNA 1434 GQ275360 2012 The recombination of the spider protein of the Abdominal Garden is based on the obtained full-length gene sequence of the spider silk protein MiSp of the large abdomen, but due to the large and severe gene encoding the spider silk protein and the high GC content of the coding gene, The gene is genetically unstable in the expression host, and it is easy to form a stable mRNA secondary structure, etc., resulting in a base. Due to many problems such as early termination of transcription, deletion, gene rearrangement, and early termination of translation, the existing expression system is unable to complete the full-length expression of this ultra-high molecular weight protein. The existing expression systems, including the host: Escherichia coli, Pichia, plants (tobacco, potatoes and Arabidopsis, etc.), mammalian cells, transgenic mice and goats, transgenic silkworms, etc., will be part of the natural gene Expression and artificial synthesis of repetitive module tandem expression, although the expression system is economical, easy to operate, and has the shortest cycle, but due to the problem of codon preference, the protein yield is low or the expression is terminated prematurely and the large molecular weight protein cannot be expressed, and The obtained recombinant incomplete spider silk protein or the artificial splicing tandem body combination protein is far from the natural spider silk protein. In order to make the spider silk protein and fiber large-scale application, it is urgent to realize a complete spider. Silk protein expression technology.
[发明内容] 本发明的目的在于克服现有技术的不足, 提供一种提取大腹园蛛 MiSp全长 基因, 并实现完整 MiSp全长基因表达的方法。 为实现上述目的, 设计一种提取大腹园蛛 MiSp全长基因的方法, 依据此方 法获得大腹园蛛 MiSp全长基因, 其特征在于大腹园蛛 MiSp全长基因的核苷酸 序列如 SEQ ID NO. 1所示。 该基因全长 10929 bp, 含有两个长度分别为 1428 bp和 3870 bp的外显子、 一个长度为 5628 bp的内含子。 一种大腹园蛛 MiSp全长基因的提取方法,其特征在于该方法包括以下步骤: SUMMARY OF THE INVENTION The object of the present invention is to overcome the deficiencies of the prior art and to provide a method for extracting the full-length MiSp gene of the large abalone, and realizing the expression of the full MiSp full-length gene. In order to achieve the above objectives, a method for extracting the full-length MiSp gene of the big belly plant was designed. According to this method, the full-length MiSp gene of the large abdomen was obtained, which is characterized by the nucleotide sequence of the full-length MiSp gene of the big belly plant. SEQ ID NO. 1 is shown. The gene is 10929 bp in length and contains two exons of 1428 bp and 3870 bp in length and an intron of 5628 bp in length. A method for extracting a full-length MiSp gene from a spider, characterized in that the method comprises the following steps:
( 1 ) 简并 PCR获取 MiSp C端基因序列 A. 分析比对公知的大腹园蛛蛛丝中牵引丝蛋白 MaSp及临时捕获丝蛋白 MiSp的 C端氨基酸序列, 获得保守的氨基酸序列, 分别为 SRISSA和 NIGQVD; (1) Degenerate PCR to obtain MiSp C-terminal gene sequence A. Analyze the C-terminal amino acid sequence of the traced silk protein MaSp and the temporary capture silk protein MiSp in the well-known spider web, and obtain the conserved amino acid sequences, namely SRISSA and NIGQVD;
B. 基于所述的 SRISSA和 NIGQVD两段保守的氨基酸序列, 设计简并 PCR的 上下游引物; B. Designing the upstream and downstream primers of degenerate PCR based on the two conserved amino acid sequences of SRISSA and NIGQVD;
C. 以大腹园蛛蛛丝高分子量基因组 DNA为模版, 通过简并 PCR、 测序分析 得到大腹园蛛 MiSp C端长度约 200 bp的基因片段序列; C. Using the high molecular weight genomic DNA of spider web of Daweiyuan as a template, the gene fragment sequence of the MiSp C-terminal length of about 200 bp was obtained by degenerate PCR and sequencing analysis.
D. 以所述的大腹园蛛 MiSp C端长度约 200 bp的基因片段序列为基础, 设 计特异性引物 (GMiSp-CF: 5' - TTACTCAGGTGTCCTTGG - 3,, GMiSp-CR: 5' - ATTGGCTTACTGCATTCT - 3' )用于文库筛选; D. Based on the sequence of the gene fragment of the MiSp C-terminal length of about 200 bp, design specific primers (GMiSp-CF: 5' - TTACTCAGGTGTCCTTGG-3, GMiSp-CR: 5' - ATTGGCTTACTGCATTCT - 3') for library screening;
(2 ) 分离高分子量 DNA构建 fosmid基因组文库 (2) Separating high molecular weight DNA to construct fosmid genomic library
A.禾 1J用改良 CTAB (cetyltriethylammnonium bromide ) 法分离大腹园蛛高 分子量基因组 DNA (High Molecular Weight Genomic DNA, HMW-gDNA); A. He 1J was isolated from high Molecular Weight Genomic DNA (HMW-gDNA) by modified CTAB (cetyltriethylammnonium bromide) method;
B. 利用文库构建试剂盒构建大腹园蛛 fosmid基因组文库; B. Constructing a fosmid genomic library of the large abalone plant using a library construction kit;
(3) 筛选 fosmid基因组文库 (3) Screening fosmid genomic library
A.所述大腹园蛛 fosmid基因组文库共产生 3. 9 X 105个克隆, 并逐一转接于 约 1026块 384孔细胞培养板, 以 8%浓度的甘油于 -80°C保存, 所述 1026块 384 孔细胞培养板分为约 26批转接筛选, 每批转接 40块 384孔细胞培养板, 将 LB 培养基用 8 X 100 L移液器转接到 40块无菌 384孔细胞培养板, 每孔 60 μ L, 所述的 LB培养基含 12. 5 g/mL氯霉素; B.构建一级混合池 a)用 8X10 μ 1移液器将第一批次中的 1号 384孔细胞培养板中所有 fosmid 基因组文库克隆进行混合, 每孔 5 L菌液, 并加入到 5 mL LB培养基中, 标记 为 1号混合 (384个文库克隆), 分装两管于 37 °C、 180 rpm培养 5 h, 按同 样的方法分别将剩余的 39块 384孔细胞培养板进行克隆混合, 分别标记为 2号 混合、 3号混合… 40号混合; b)将按上述步骤获得的 40份混合液按 5行 X8列进行排列, 用 8X 100 μ L移液器将 5行 Χ8列排列中的每一列中五个孔进行混合, 将混合后的 8组菌液 每孔抽取 5 41^加入到511^ 1^培养基中, 分别编号 FD1-1、 FD1-2—FD1-8, 于 37 °C、 180 rpm培养 10 h, 按同样方法将 5行 X 8列排列中的每行进行混合, 并分别编号为 FD2-1、 FD2- 2—FD2- 5、 FD1- 1、 FD1- 2—FD1- 8和 FD2- 1、 FD2- 2··· FD2-5即获得初级混合池 (Primary pools); A. The abdomen fosmid genomic library produced a total of 3. 9 X 10 5 clones, and transferred one by one to about 1026 384-well cell culture plates, and stored at 8% concentration of glycerol at -80 ° C. The 1026 384-well cell culture plates were divided into approximately 26 batches of transfer screening. Each batch was transferred to 40 384-well cell culture plates, and the LB medium was transferred to 40 sterile 384 wells using an 8 X 100 L pipette. a cell culture plate, 60 μL per well, the LB medium containing 12. 5 g / mL chloramphenicol; B. Construction of the primary mixing tank a) Mix all fosmid genomic library clones in the first batch of 384-well cell culture plates in the first batch with an 8X10 μ1 pipette, 5 L of bacteria per well, and add to 5 In mL LB medium, the mixture was labeled as No. 1 (384 library clones), and two tubes were aliquoted at 37 ° C and 180 rpm for 5 h. The remaining 39 384-well cell culture plates were cloned in the same manner. Mix, labeled as No. 2 Mix, No. 3 Mix... No. 40 Mix; b) Arrange 40 parts of the mixture obtained in the above procedure in 5 rows and 8 columns, and use 5X 100 μL pipette to set 5 rows and 8 columns. Five wells in each column of the array were mixed, and the mixed 8 groups of bacteria were extracted from each well by 5 41^ into 511^1^ medium, numbered FD1-1, FD1-2-FD1-8, respectively. Incubate at 37 °C, 180 rpm for 10 h, and mix each row of 5 rows and 8 columns in the same way, and number them as FD2-1, FD2- 2-FD2- 5, FD1- 1, FD1- 2-FD1- 8 and FD2- 1, FD2- 2··· FD2-5 obtain the primary pools;
C.构建二级混合池 a) 用 8X 10 μ 1移液器吸取第一批次的 40块 384孔细胞培养板第 1列的 菌液各 5 L, 加入到 5 mL LB培养基中, 标号 FD3-1, 于 37 °C, 180 rpm培 养 10 h,按同样方法分别将第 2至 24列进行混合,标号为 FD3-2、FD3_3 " 03-24, 共 24个混合池; b) 按同样方法将 40块 384孔细胞培养板的 A-P行进行混合, 分别编号 为 FD4-1、 FD4-2--FD4-16, 共 16个混合池, FD3_1、 FD3-2—FD3-24和 FD4_1、 FD4-2—FD4-16即为二级混合池 (Secondary pools); D.构建超级池 将构建一级混合池的第二份混合液各取 10 L混合在一起, 并加入到 5 mL LB培养基中, 即为超级混合池 (Super pool ), 于 37 °C、 180 rpm培养 10 h。 C. Construct a secondary mixing tank a) Pipette 5 L of each of the first batch of 40 384-well cell culture plates in the first batch of 8 10 10 μ 1 pipette into 5 mL LB medium, label FD3-1, incubated at 37 °C, 180 rpm for 10 h, mix the 2nd to 24th columns in the same way, labeled FD3-2, FD3_3 "03-24, a total of 24 mixing tanks; b) press the same Methods The AP lines of 40 384-well cell culture plates were mixed, numbered FD4-1, FD4-2--FD4-16, a total of 16 mixing cells, FD3_1, FD3-2-FD3-24 and FD4_1, FD4. -2—FD4-16 is a secondary pool (Secondary pools); D. Build a super pool. Mix the first mixture of the first-stage mixing tank with 10 L and add it to 5 mL of LB medium, which is a super pool at 37 °C. Incubate at 180 rpm for 10 h.
(4) 阳性克隆测序 对包含全长基因的阳性克隆进行 Shotgun测序, 测序覆盖率设定约为 7,最 后通过 PCR、 酶切以及测序验证, 即获得大腹园蛛 MiSp全长基因序列。 所述的 CTAB法为改良 CTAB法, 该方法包括以下步骤: a) 研磨: 取已处理好的蜘蛛胸部肌肉于装有 89% (V/V) 预冷 CTAB 提取缓 冲液 [2% (g/V) CTAB, 1. 4 mol/L NaCl, 100 匪 ol/L Tris- HCl (pH8. 0), 20 mmol/L EDTA (pH 8. 0) , 2% (g/V) PVP (聚乙烯吡咯垸酮)]、 1% (V/V) SDS-LDS预混液 [10%(4) Sequencing of positive clones The positive clones containing the full-length gene were subjected to Shotgun sequencing, and the sequencing coverage was set to about 7, and finally verified by PCR, restriction enzyme digestion and sequencing, and the MiSp full-length gene sequence of the spider was obtained. The CTAB method is a modified CTAB method comprising the following steps: a) grinding: taking the treated spider chest muscles with 89% (v/v) pre-cooled CTAB extraction buffer [2% (g/) V) CTAB, 1. 4 mol/L NaCl, 100 匪ol/L Tris-HCl (pH 8. 0), 20 mmol/L EDTA (pH 8. 0), 2% (g/V) PVP (polyvinylpyrrole) Anthrone)], 1% (V/V) SDS-LDS Master Mix [10%
(g/V) SDS, 10% (g/V) LDS]和 1% (V/V) β _巯基乙醇的 1. 5 mL离心管中,充 分研磨破碎; b) 水浴: 65 °C水浴约 2 h, 其间不时上下轻柔颠倒, 直至混合液消化得很 清亮; c) 提取: 加入等体积的的酚-氯仿-异戊醇 (25: 24: 1 ), 轻柔颠倒混匀, 在台式高速离心机上以转速 12000 rpm离心 5 min, 取上清液; d) 氯仿抽提: 在上清液中加入等体积氯仿, 轻柔的颠倒混匀, 在台式高速 离心机上以转速 12000 rpm离心 5 min, 取上清液; e) 重复步骤 d), 直至水相与有机相的界面处看不到白色的蛋白质层为止, 取上清液; f ) 沉淀 DNA: 在上清液中加入 1/10体积 5 mol/L NaCl和 2. 5倍无水乙醇 (预冷), 轻柔混匀, -20 °C放置 30 min沉淀 DNA, 在台式高速离心机上 13000 rpm离心 15 min, 弃上清液; g) 洗涤 DNA: 在留有沉淀的离心管中加入 1 mL 70%乙醇洗涤 DNA, 在台式 高速离心机上以转速 13000 111离心5 1^ 11, 弃上清液, 重复 1次; h)保存:室温干燥沉淀 5 10 min,加入适量 TE缓冲液 [10 ol/L的 Tris_HCl (pH8. 0 ) 和 1 mmol/L的 EDTA (pH8. 0) ]溶解 DNA, 存于- 20 °。或4 °C备用。 一种大腹园蛛 MiSp全长基因的表达方法, 其特征在于该方法借助断裂蛋白 质内含子介导的反式剪接技术完成分段表达大腹园蛛 Mi Sp, 利用蛋白质内含子 介导的反式剪接技术体外定向拼接, 实现完整表达大腹园蛛全长 MiSp, 具体步 骤如下: (g/V) SDS, 10% (g/V) LDS] and 1% (V/V) β-mercaptoethanol in a 1.5 mL centrifuge tube, fully ground and crushed; b) Water bath: 65 °C water bath approx. 2 h, from time to time, gently upside down until the mixture is digested very clearly; c) Extraction: Add an equal volume of phenol-chloroform-isoamyl alcohol (25: 24: 1), gently mix by inversion, centrifuge at high speed on a table Centrifuge at 12000 rpm for 5 min on the machine, take the supernatant; d) Extract with chloroform: Add equal volume of chloroform to the supernatant, gently mix by inversion, centrifuge at 12000 rpm for 5 min on a benchtop high-speed centrifuge. Supernatant e) Repeat step d) until the white protein layer is not visible at the interface between the aqueous phase and the organic phase, and take the supernatant; f) Precipitate the DNA: Add 1/10 volume of 5 mol/L NaCl to the supernatant. And 2.5 times absolute ethanol (pre-cooling), gently mix, place the pellet at -20 °C for 30 min, centrifuge at 13,000 rpm for 15 min on a benchtop high-speed centrifuge, discard the supernatant; g) wash the DNA: The precipitated centrifuge tube was washed with 1 mL of 70% ethanol, centrifuged at a speed of 13000 111 on a benchtop high-speed centrifuge, and the supernatant was discarded once. h) Preservation: Dry the precipitate at room temperature for 5 min, add The appropriate amount of TE buffer [10 ol / L of Tris_HCl (pH 8.0) and 1 mmol / L of EDTA (pH 8.0)] dissolved DNA, stored at - 20 °. Or spare at 4 °C. A method for expressing the MiSp full-length gene of the large abdomen, characterized in that the method is to perform segmental expression of the spider, Mi Sp, by using a protein intron-mediated trans-splicing technique, which is mediated by protein introns. The trans-splicing technique is used for directional splicing in vitro to achieve complete expression of the full-length MiSp of the large abdomen. The specific steps are as follows:
( 1 )将大腹园蛛 MiSp全长基因分为在 Rosetta2 (DE3 )表达菌株中能高效 表达的三个部分: NT+RP RP以及 RP+CT; (1) Divided the full-length MiSp gene into three parts of the Rosetta2 (DE3) expression strain: NT+RP RP and RP+CT;
(2 ) 利用 DNA重组技术, 将 NT+RP与筛选获得的具有高剪接活性的断裂蛋 白质内含子 Inl N端结构域编码基因融合表达, 即 NT+RP+InlN; 第二部分的 RP 5' 端与 Inl的 C端结构域融合表达, RP 3' 端与 In2的 N端结构域融合表达, 即 InlC+RP + In2N;第三部分的 RP+CT与 In2的 C端结构域融合,即 In2 C + RP+CT; (3) 融合表达产物经 Ni-NTA纯化。 将纯化后的 NT+RP+InlN与 InlC+RP + In2N混合后, InlN与 InlC能自发的发生反式剪接反应, 自身从融合蛋白上 剪接下来, 并将携带的 MiSp片段通过肽键连接、 Ni-NTA反纯化 (去除 Inl ), 纯化产物继续与第三部分的 In2 C+RP+CT混合, In2N与 In2C在空间上相互靠 近, 自发的反式剪接反应, 自身从 MiSp片段上剪接下来, 并把两端的 MiSp片 段以肽键连接, 利用三步定向拼接反应得到重组的大腹园蛛 MiSp全长蛋白。 本发明与现有技术相比,展示了国内外第一条大腹园蛛 MiSp基因组全序列, 包括大腹园蛛蛛丝蛋白 MiSp全长基因, 大腹园蛛 MiSp全长基因序列较相关其 他物种该种丝蛋白基因预测的小, 但为全长蛋白的表达进一步提供了可能。 该 全长编码序列为研究蛛丝蛋白的系统发育以及获取全长重组 MiSp蛋白进而仿生 高性能的人工蛛丝提供了基因支持; 在表达大腹园蛛 MiSp基因时, 本发明采用蛋白质内含子剪接和 Rosetta2 (DE3)表达菌株替代了传统的大肠杆菌作为宿主菌表达技术, 借助具有高剪接 活性的断裂蛋白质内含子的反式剪接反应, 完成分段表达 MiSp重组蛋白, 通过 蛋白质内含子介导的反式剪接反应体外定向拼接融合蛋白, 获得重组的大腹园 蛛全长蜘蛛丝 MiSp蛋白,在国内外首次完成了大腹园蛛天然全长蛛丝蛋白 MiSp 基因在大肠杆菌衍生菌中的重组表达, 为蜘蛛丝的仿生研究提供了物质基础。 (2) Using DNA recombination technology, the expression of NT+RP and the cleavage protein intron Inl N-terminal domain coding gene with high splicing activity was obtained, namely NT+RP+InlN; the second part of RP 5' The end is fused to the C-terminal domain of Inl, and the RP 3' end is fused to the N-terminal domain of In2, ie, InlC+RP + In2N ; the third part of RP+CT is fused to the C-terminal domain of In2, ie In2 C + RP+CT; (3) The fusion expression product was purified by Ni-NTA. After mixing the purified NT+RP+InlN with InlC+RP + In2N, InlN and InlC spontaneously undergo trans-splicing reaction, and then self-cleave from the fusion protein, and then carry the MiSp fragment through peptide bond, Ni -NTA reverse purification (removal of Inl), the purified product continues to mix with the third part of In2 C+RP+CT, In2N and In2C are spatially close to each other, spontaneous trans-splicing reaction, and itself is cut from the MiSp fragment, and The MiSp fragments at both ends were linked by peptide bonds, and the recombinant full-length MiSp full-length protein was obtained by a three-step directional splicing reaction. Compared with the prior art, the present invention displays the full MiSp genome sequence of the first large-bellied spider, including the full-length gene of the spider protein MiSp in the large abdomen, and the MiSp full-length gene sequence of the large abalone is more related to other species. This silk protein gene is predicted to be small, but further provides for the expression of the full-length protein. The full-length coding sequence provides gene support for studying the phylogeny of spider silk protein and obtaining full-length recombinant MiSp protein and then bionic high-performance artificial spider silk; in the expression of the big belly plant spider MiSp gene, the present invention adopts protein intron The splicing and Rosetta2 (DE3) expression strains replace the traditional E. coli as the host cell expression technology, and the MiSp recombinant protein is segmentally expressed by the trans-splicing reaction of the cleavage protein intron with high splicing activity, through the protein intron Mediated trans-splicing reaction directional splicing of fusion protein in vitro, the recombinant full-length spider silk MiSp protein was obtained, and the natural full-length spider silk protein MiSp gene in Escherichia coli-derived bacteria was completed for the first time at home and abroad. The recombinant expression in the middle provides a material basis for the biomimetic study of spider silk.
[附图说明] 图 1 为本发明大腹园蛛 fosmid文库构建及筛选方法流程示意图; 图 2为本发明大腹园蛛 MiSp全长表达方法流程示意图; 图 3为本发明获得的大腹园蛛 Mi Sp全长蛋白序列; [具体实施方式] 结合附图对本发明涉及的方法作进一步阐述, 其方法步骤对本领域技术人 员来说是可以实现的。 本发明的技术方案主要为提取大腹园蛛 MiSp全长基因,并将大腹园蛛 MiSp 全长基因进行完整表达成全长蛋白。 提取大腹园蛛 MiSp全长基因时采用的方法, 包括以下步骤: BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a schematic flow chart of a method for constructing and screening a fosmid library of a large abdomen in the present invention; FIG. 2 is a schematic flow chart of a full-length expression method of a large belly plant MiSp according to the present invention; 3 is a full-length protein sequence of the Great Spider Mi Mi obtained in the present invention; [Detailed Description of the Invention] The method of the present invention will be further described with reference to the accompanying drawings, and the method steps thereof can be realized by those skilled in the art. The technical scheme of the present invention mainly extracts the full-length MiSp gene of the spider, and fully expresses the full-length MiSp gene into the full-length protein. The method used to extract the full-length MiSp gene of the big belly spider, including the following steps:
( 1 ) 简并 PCR获取 MiSp C端基因序列 (1) Degenerate PCR to obtain MiSp C-terminal gene sequence
A. 分析比对公知的大腹园蛛蛛丝中牵引丝蛋白 MaSp及临时捕获丝蛋白 MiSp的 C端氨基酸序列, 获得保守的氨基酸序列, 分别为 SRISSA和 NIGQVD; A. Analyze the C-terminal amino acid sequence of the traced silk protein MaSp and the temporary capture silk protein MiSp in the well-known spider web, and obtain the conserved amino acid sequences, namely SRISSA and NIGQVD;
B. 基于所述的 SRISSA和 NIGQVD两段保守的氨基酸序列, 设计简并 PCR的 上下游引物; B. Designing the upstream and downstream primers of degenerate PCR based on the two conserved amino acid sequences of SRISSA and NIGQVD;
C. 以大腹园蛛蛛丝高分子量基因组 DNA为模版, 通过简并 PCR、 测序分析 得到大腹园蛛 MiSp C端长度约 200 bp的基因片段序列; C. Using the high molecular weight genomic DNA of spider web of Daweiyuan as a template, the gene fragment sequence of the MiSp C-terminal length of about 200 bp was obtained by degenerate PCR and sequencing analysis.
D. 以所述的大腹园蛛 MiSp C端长度约 200 bp的基因片段序列为基础, 设 计特异性引物 ( GMiSp-CF : 5' - TTACTCAGGTGTCCTTGG - 3,, GMiSp-CR : 5' - ATTGGCTTACTGCATTCT - 3' )用于文库筛选; D. Based on the gene fragment sequence of the MiSp C-terminal length of about 200 bp, design specific primers (GMiSp-CF: 5' - TTACTCAGGTGTCCTTGG-3, GMiSp-CR: 5' - ATTGGCTTACTGCATTCT - 3') for library screening;
( 2 ) 分离高分子量 DNA构建 fosmid基因组文库, 如图 1所示: A.禾 lj用改良 CTAB (cetyltriethylammnonium bromide) 法分离大腹园蛛高 分子量基因组 DNA (High Molecular Weight Genomic DNA, HMW-gDNA); 改良 CTAB法的步骤: a) 研磨。 取已处理好的蜘蛛胸部肌肉于装有 89% (V/V)预冷 CTAB 提取缓冲 液 [2% (g/V)CTAB, 1.4 mol/L NaCl, 100 mmol/L Tris-HCl (pH8.0), 20 mmol/L EDTA(pH 8.0), 2% (g/V) PVP (聚乙烯吡咯垸酮 )]、 1% (V/V) SDS-LDS 预混液 [10% (g/V) SDS, 10% (g/V) LDS]和 1% (V/V) β-巯基乙醇的 1.5 mL离心管中, 充分研磨破碎; (2) Separating high molecular weight DNA to construct a fosmid genomic library, as shown in Figure 1: A. Hejl uses the modified CTAB (cetyltriethylammnonium bromide) method to isolate High Molecular Weight Genomic DNA (HMW-gDNA); the steps of the modified CTAB method: a) Grinding. The treated spider chest muscles were loaded with 89% (v/v) pre-cooled CTAB extraction buffer [2% (g/V) CTAB, 1.4 mol/L NaCl, 100 mmol/L Tris-HCl (pH 8. 0), 20 mmol/L EDTA (pH 8.0), 2% (g/V) PVP (polyvinylpyrrolidone)], 1% (V/V) SDS-LDS master mix [10% (g/V) SDS, 10% (g/V) LDS] and 1% (V/V) β-mercaptoethanol in a 1.5 mL centrifuge tube, fully ground and crushed;
b) 水浴。 65 °C水浴约 2h, 其间不时上下轻柔颠倒, 直至混合液消化得很清亮; c) 提取。 加入等体积的的酚-氯仿-异戊醇 (25 : 24: 1) , 轻柔颠倒混匀, 在台 式高速离心机上以转速 12000 rpm离心 5 min, 取上清液; b) Water bath. The water bath at 65 °C is about 2 hours, during which time it is gently inverted up and down until the mixture is digested very clearly; c) extracted. Add an equal volume of phenol-chloroform-isoamyl alcohol (25: 24: 1), gently mix by inversion, centrifuge at a speed of 12000 rpm for 5 min on a bench-top high speed centrifuge, and take the supernatant;
d) 氯仿抽提。 在上清液中加入等体积氯仿, 轻柔的颠倒混匀, 在台式高速离心 机上 12000 rpm离心 5 min, 取上清液; d) Extraction by chloroform. Add equal volume of chloroform to the supernatant, gently mix by inversion, centrifuge at 12000 rpm for 5 min on a benchtop high-speed centrifuge, and take the supernatant;
e) 重复步骤 d) , 直至水相与有机相的界面处看不到白色的蛋白质层为止, 取 上清液; e) repeat step d) until the white protein layer is not visible at the interface between the aqueous phase and the organic phase, and the supernatant is taken;
f) 沉淀 DNA。 在上清液中加入 1/10体积 5 mol/L NaCl和 2.5倍无水乙醇 (预 冷) , 轻柔混匀, -20 °C放置 30min沉淀 DNA, 在台式高速离心机上 13000 rpm离心 15 min, 弃上清液; f) Precipitating DNA. Add 1/10 volume of 5 mol/L NaCl and 2.5 times absolute ethanol (pre-cool) to the supernatant, mix gently, place the pellet at -20 °C for 30 min, and centrifuge at 13000 rpm for 15 min on a benchtop high-speed centrifuge. Discard the supernatant;
g) 洗涤 DNA。在留有沉淀的离心管中加入 l mL 70%乙醇洗涤 DNA,在台式高 速离心机上 13000 rpm离心 5 min, 弃上清液, 重复 1次; g) Wash DNA. The DNA was washed by adding 1 mL of 70% ethanol to the centrifuge tube with the precipitate, and centrifuged at 13,000 rpm for 5 min in a benchtop high-speed centrifuge, and the supernatant was discarded and repeated once;
h) 保存。 室温干燥沉淀 5~10 min, 加入适量 TE缓冲液 [10 mmol/L的 Tris-HClh) Save. Dry the precipitate at room temperature for 5~10 min, add appropriate amount of TE buffer [10 mmol/L Tris-HCl
(pH8.0)和 1 mmol/L的 EDTA (pH8.0) ]溶解 DNA,存于 -20 或 4 °C备用。 B.利用文库构建试剂盒构建大腹园蛛 fosmid基因组文库; (pH 8.0) and 1 mmol/L EDTA (pH 8.0)] Dissolve the DNA and store at -20 or 4 °C until use. B. Constructing a fosmid genomic library of the large abalone plant using a library construction kit;
( 3 ) 筛选 fosmid基因组文库, 如图 1所示: (3) Screening the fosmid genomic library, as shown in Figure 1:
A. 所述大腹园蛛 fosmid基因组文库共产生 3. 9 X 105个克隆, 并逐一转接 于约 1026块 384孔细胞培养板, 以 8%浓度的甘油于 -80 °C保存, 所述 1026块 384孔细胞培养板分为约 26批转接筛选, 每批转接 40块 384孔细胞培养板, 将 LB培养基用 8 X 100 μ L移液器转接到 40块无菌 384孔细胞培养板, 每孔 60 μ L, 所述的 LB培养基含 12. 5 g/mL氯霉素; A. The fosmid genomic library of the abdomen garden spiders produced a total of 3. 9 X 10 5 clones, and transferred one by one to about 1026 384-well cell culture plates, and stored at 8% concentration of glycerol at -80 ° C. The 1026 384-well cell culture plates were divided into approximately 26 batches of transfer screens. Each batch was transferred to 40 384-well cell culture plates, and the LB medium was transferred to 40 sterile 384 cells using an 8 X 100 μL pipette. a cell culture plate, 60 μL per well, the LB medium containing 12. 5 g / mL chloramphenicol;
B.构建一级混合池 a)用 8 X 10 μ 1移液器将第一批次中的 1号 384孔细胞培养板中所有 fosmid 基因组文库克隆进行混合, 每孔 5 L菌液, 并加入到 5 mL LB培养基中, 标 记为 1号混合 (384个文库克隆), 分装两管于 37 °C、 180 rpm培养 5 h, 按 同样的方法分别将剩余的 39块 384孔细胞培养板进行克隆混合, 分别标记为 2 号混合、 3号混合… 40号混合; b ) 将按上述步骤获得的 40份混合液按 5行 X 8列进行排列, 用 8 X 100 μ L 移液器将 5行 Χ 8列排列中的每一列中五个孔进行混合,将混合后的 8组菌液每 孔抽取 5 加入到 5 mL LB培养基中, 分别编号 FD1_1、 FD1-2—FD1-8 , 于 37 °C、 180 rpm培养 10 h, 按同样方法将 5行 X 8列排列中的每行进行混合, 并分别编号为 FD2-1、 FD2-2--FD2-5. FD1- 1、 FD1-2'"FD1_8和 FD2_1、 FD2- 2··· FD2-5即获得初级混合池 (Primary pools ); B. Construction of the primary mixing tank a) Mix all fosmid genomic library clones in the first batch of 384-well cell culture plates in the first batch with an 8 X 10 μ 1 pipette, 5 L of bacteria per well, and add Into 5 mL of LB medium, labeled as No. 1 (384 library clones), and two tubes were incubated at 37 ° C and 180 rpm for 5 h. The remaining 39 384-well cell culture plates were separately prepared in the same manner. Clone and mix, labeled as No. 2 Mix, No. 3 Mix... No. 40 Mix; b) Place 40 parts of the mixture obtained in the above procedure in 5 rows x 8 columns, using 8 x 100 μL pipette 5 rows 五个 Five wells in each column of the 8 column arrangement were mixed, and the mixed 8 groups of bacteria were extracted 5 times into each well and added to 5 mL of LB medium, numbered FD1_1, FD1-2-FD1-8, respectively. Incubate at 37 °C, 180 rpm for 10 h, mix each row of 5 rows and 8 columns in the same way, and number them as FD2-1, FD2-2--FD2-5. FD1- 1, FD1 -2'"FD1_8 and FD2_1, FD2- 2··· FD2-5 obtain the primary pools (Primary pools);
C.构建二级混合池 a) 用 8 X 10 μ 1移液器吸取第一批次的 40块 384孔细胞培养板第 1列的菌 液各 5 μ L, 加入到 5 mL LB培养基中, 标号 FD3-1 , 于 37 °C, 180 rpm培养 10 h, 按同样方法分别将第 2至 24列进行混合,标号为 FD3- 2、 FD3- 3—FD3- 24, 共 24个混合池; b) 按同样方法将 40块 384孔细胞培养板的 A-P行进行混合, 分别编号为 FD4- 1、 FD4-2 · -FD4- 16,共 16个混合池, FD3- 1、 FD3- 2 · -FD3-24和 FD4- 1、 FD4- 2 · · · FD4-16即为二级混合池 (Secondary pools ); C. Building a secondary mixing pool a) Pipette 5 μL of each of the first batch of 40 384-well cell culture plates in column 1 with an 8 X 10 μ 1 pipette and add to 5 mL of LB medium, label FD3-1, Incubate at 37 °C, 180 rpm for 10 h, mix the 2nd to 24th columns in the same way, labeled FD3- 2, FD3- 3-FD3- 24, a total of 24 mixing tanks; b) 40 in the same way The AP line of the 384-well cell culture plate was mixed and numbered as FD4- 1 , FD4-2 · -FD4-16, a total of 16 mixing cells, FD3- 1, FD3- 2 · -FD3-24 and FD4- 1 FD4- 2 · · · FD4-16 is a secondary pool (Secondary pools);
D.构建超级池 将构建一级混合池的第二份混合液各取 10 μ L混合在一起,并加入到 5 mL LB 培养基中, 即为超级混合池 (Super pool ), 于 37 °C、 180 rpm培养 10 h。 D. Build the super pool. Mix the first mixture of the first-stage mixing tank with 10 μL and add it to 5 mL of LB medium, which is the super pool at 37 °C. Incubate at 180 rpm for 10 h.
(4) 阳性克隆测序 对包含全长基因的阳性克隆进行 Shotgun测序, 测序覆盖率设定约为 7, 最 后通过 PCR、 酶切以及测序验证, 即获得大腹园蛛 MiSp全长基因序列。 通过上述方法获得的大腹园蛛 MiSp全长基因的核苷酸序列如 SEQ ID NO. 1 所示, 该基因全长 10929 bp, 含有两个长度分别为 1428 bp和 3870 bp的外显 子、 一个长度为 5628 bp的内含子。 为获得大腹园蛛 MiSp全长蛋白需要对获得的大腹园蛛 MiSp全长基因进行 表达, 表达方法主要借助断裂蛋白质内含子介导的反式剪接技术完成分段表达 大腹园蛛 MiSp, 利用蛋白质内含子介导的反式剪接技术体外定向拼接, 实现完 整表达大腹园蛛 MiSp全长蛋白, 所述的蛋白质内含子(intein)是存在于前体蛋 白质(precursor protein)中的一段插入序列, 其两侧的氨基酸序列被称为蛋白质 外显子 (extein), 在形成成熟蛋白质的过程中, 蛋白质内含子能自发的发生剪接 反应, 并将两侧的外显子通过肽键连接起来。 利用断裂蛋白内含子的反式剪接 可以将两个分开的多肽片段连接形成一个完整的蛋白, 具体步骤如下: (4) Sequencing of positive clones The positive clones containing the full-length gene were subjected to Shotgun sequencing, and the sequencing coverage was set to about 7. Finally, the full-length MiSp gene sequence of the large abalone was obtained by PCR, restriction enzyme digestion and sequencing. The nucleotide sequence of the MiSp full-length gene obtained by the above method is shown in SEQ ID NO. 1, which is 10929 bp in length and contains two exons of lengths of 1428 bp and 3870 bp, respectively. An intron of 5628 bp in length. In order to obtain the MiSp full-length protein of the large abalone, it is necessary to express the full-length MiSp gene of the large abalone, and the expression method mainly uses the trans-protein intron-mediated trans-splicing technique to complete the segmental expression of the large abdomen spider MiSp. In vitro directional splicing using protein intron-mediated trans-splicing technology to achieve full expression of the full-length protein of the MiSp spider, the protein intron (intein) is present in the precursor egg An insertion sequence in a precursor protein, the amino acid sequence on both sides of which is called an extein. In the process of forming a mature protein, the protein intron can spontaneously undergo a splicing reaction, and two The exons on the side are joined by peptide bonds. The two separate polypeptide fragments can be joined to form a complete protein by trans-splicing of the broken protein intron, as follows:
( 1 ) 将大腹园蛛 MiSp全长基因分为在 Rosetta2 (DE3 ) 中能高效表达的 三个部分: NT+RP、 RP以及 RP+CT; (1) Divided the full-length MiSp gene into three parts of Rosetta2 (DE3): NT+RP, RP and RP+CT;
(2 ) 利用 DNA重组技术, 将 NT+RP与筛选获得的具有高剪接活性的断裂蛋 白质内含子 Inl N端结构域编码基因融合表达, 即 NT+RP+InlN; 第二部分的 RP 5 ' 端与 Inl的 C端结构域融合表达, RP 3 ' 端与 In2的 N端结构域融合表达, 即 InlC+RP + In2N;第三部分的 RP+CT与 In2的 C端结构域融合,即 In2 C + RP+CT; (2) Using DNA recombination technology, the expression of NT+RP and the cleavage protein intron Inl N-terminal domain encoding gene with high splicing activity is expressed, ie NT+RP+InlN; the second part of RP 5 ' The end is fused to the C-terminal domain of Inl, and the RP 3 'end is fused to the N-terminal domain of In2, ie, InlC+RP + In2N ; the third part of RP+CT is fused to the C-terminal domain of In2, ie, In2 C + RP+CT;
( 3 ) 融合表达产物经 Ni-NTA纯化。 将纯化后的 NT+RP+InlN与 InlC + RP + In2N混合后, InlN与 InlC能自发的发生反式剪接反应, 自身从融合蛋白上 剪接下来, 并将携带的 MiSp片段通过肽键连接、 Ni-NTA反纯化 (去除 Inl ),纯 化产物继续与第三部分的 In2 C+RP+CT混合, In2N与 In2C在空间上相互靠近, 自发的反式剪接反应, 自身从 MiSp片段上剪接下来, 并把两端的 MiSp片段以 肽键连接, 利用三步定向拼接反应得到重组的大腹园蛛全长 MiSp。 依上述步骤可获得大腹园蛛 MiSp全长蛋白序列如图 3所示, 含有长度为 5301 bp MiSp全长转录子、 1766个编码氨基酸、 非重复的 N端和 C端模块和由 甘氨酸和丙氨酸组成的重复模块, 重复模块由 GX, GGX, GGGX 和 oligo_A repeats四禾中 motif及两个完全一样的 Spacer序列组成。 (3) The fusion expression product was purified by Ni-NTA. After mixing the purified NT+RP+InlN with InlC + RP + In2N, InlN and InlC spontaneously undergo trans-splicing reaction, and then self-cleave from the fusion protein, and carry the MiSp fragment through peptide bond, Ni -NTA reverse purification (removal of Inl), the purified product continues to mix with the third part of In2 C+RP+CT, In2N and In2C are spatially close to each other, spontaneous trans-splicing reaction, and itself is cut from the MiSp fragment, and The MiSp fragments at both ends were linked by peptide bonds, and the recombinant full-length MiSp was obtained by a three-step directional splicing reaction. According to the above steps, the MiSp full-length protein sequence of the large abalone can be obtained as shown in Figure 3, containing a 5301 bp MiSp full-length transcript, 1766 encoding amino acids, non-repetitive N-terminal and C-terminal modules, and glycine and C. The repeating module consisting of the amino acid consists of the GX, GGX, GGGX and oligo_A repeats motifs and two identical Spacer sequences.

Claims

1.一种大腹园蛛 MiSp全长基因, 其特征在于大腹园蛛 MiSp全长基因的核苷酸 序列如 SEQ ID NO. 1所示。 A full length MiSp full-length gene, characterized in that the nucleotide sequence of the MiSp full-length gene of the large abalone is as shown in SEQ ID NO.
2.如权利要求 1所述的一种大腹园蛛 MiSp全长基因, 其特征在于该基因全长 10929 bp, 含有两个长度分别为 1428 bp和 3870 bp的外显子、一个长度为 5628 bp的内含子。 The full-length MiSp gene of the large-bellied spider, according to claim 1, characterized in that the gene is 10929 bp in length and comprises two exons of length 1428 bp and 3870 bp, respectively, and a length of 5628. Intron of bp.
3.一种如权利要求 1或 2所述的大腹园蛛 MiSp全长基因的提取方法, 其特征在 于该方法包括以下步骤: A method for extracting a MiSp full-length gene of the large abdomen spider according to claim 1 or 2, characterized in that the method comprises the following steps:
( 1 ) 简并 PCR获取 MiSp C端基因序列 (1) Degenerate PCR to obtain MiSp C-terminal gene sequence
A. 分析比对公知的大腹园蛛蛛丝中牵引丝蛋白 MaSp及临时捕获丝蛋白 MiSp 的 C端氨基酸序列, 获得保守的氨基酸序列, 分别为 SRISSA和 NIGQVD; A. Analyze the C-terminal amino acid sequence of the traced silk protein MaSp and the temporary capture silk protein MiSp in the well-known spider web, and obtain the conserved amino acid sequences, namely SRISSA and NIGQVD;
B. 基于所述的 SRISSA和 NIGQVD两段保守的氨基酸序列, 设计简并 PCR的上 下游引物; B. Designing upstream and downstream primers for degenerate PCR based on the two conserved amino acid sequences of SRISSA and NIGQVD;
C. 以大腹园蛛蛛丝高分子量基因组 DNA为模版, 通过简并 PCR、 测序分析得 到大腹园蛛 MiSp C端长度约 200 bp的基因片段序列; C. Using the high molecular weight genomic DNA of spider web of Daweiyuan as a template, the gene fragment sequence of the MiSp C-terminal length of about 200 bp was obtained by degenerate PCR and sequencing analysis.
D. 以所述的大腹园蛛 MiSp C端长度约 200 bp的基因片段序列为基础, 设计 特异性引物 (GMiSp-CF : 5' - TTACTCAGGTGTCCTTGG - 3,, GMiSp- CR : 5' - ATTGGCTTACTGCATTCT - 3' )用于文库筛选; D. Based on the gene fragment sequence of the MiSp C-terminal length of about 200 bp, design specific primers (GMiSp-CF: 5' - TTACTCAGGTGTCCTTGG-3, GMiSp-CR: 5' - ATTGGCTTACTGCATTCT - 3') for library screening;
(2 ) 分离高分子量 DNA构建 fosmid基因组文库 A.利用 CTAB (cetyltriethylammnonium bromide) 法分离大腹园蛛高分子量 基因组 DNA (High Molecular Weight Genomic DNA, HMW-gDNA); (2) Separating high molecular weight DNA to construct fosmid genomic library A. Using the CTAB (cetyltriethylammnonium bromide) method to isolate high Molecular Weight Genomic DNA (HMW-gDNA);
B. 利用文库构建试剂盒构建大腹园蛛 fosmid基因组文库; (3) 筛选 fosmid基因组文库 B. Using the library construction kit to construct the fosmid genomic library; (3) screening the fosmid genomic library
A.所述大腹园蛛 fosmid基因组文库共产生 3. 9 X 105个克隆,并逐一转接于约A. The large abdomen spider fosmid genomic library produced a total of 3. 9 X 10 5 clones, and transferred one by one
1026块 384孔细胞培养板, 以 8%浓度的甘油于 -80°C保存, 所述 1026块 384孔细胞培养板分为约 26批转接筛选, 每批转接 40块 384孔细胞培养 板,将 LB培养基用 8 X 100 μ L移液器转接到 40块无菌 384孔细胞培养板, 每孔 60 L, 所述的 LB培养基含 12. 5 μ g/mL氯霉素; 1026 384-well cell culture plates were stored at -80 ° C with 8% concentration of glycerol. The 1026 384-well cell culture plates were divided into about 26 batches of transfer screening, and 40 384-well cell culture plates were transferred per batch. 5 μg/毫升氯霉素; The LB medium is transferred to a 40 sterilized 384-well cell culture plate, 40 L per well, and the LB medium contains 12. 5 μg/mL chloramphenicol;
B.构建一级混合池 a)用 8 X 10 μ 1移液器将第一批次中的 1号 384孔细胞培养板中所有 fosmid 基因组文库克隆进行混合, 每孔 5 L菌液, 并加入到 5 mL LB培养基中, 标记为 1号混合 (384个文库克隆), 分装两管于 37 °C、 180 rpm培养 5 h, 按同样的方法分别将剩余的 39块 384孔细胞培养板进行克隆混合, 分 别标记为 2号混合、 3号混合… 40号混合; b ) 将按上述步骤获得的 40份混合液按 5行 X 8列进行排列, 用 8 X 100 μ L 移液器将 5行 X 8列排列中的每一列中五个孔进行混合, 将混合后的 8组 菌液每孔抽取 5 4 加入到5 1^ 1^培养基中, 分别编号 FD1-1、 FD1-2— FD1— 8, 于 37 °C、 180 rpm培养 10 h, 按同样方法将 5行 X 8列排列中 的每行进行混合, 并分别编号为 FD2-1、 FD2-2-FD2-5, FD1- 1、 FD1- 2··· FD1-8和 FD2-1、 FD2-2'"FD2_5即获得初级混合池 (Primary pools ); B. Construction of the primary mixing tank a) Mix all fosmid genomic library clones in the first batch of 384-well cell culture plates in the first batch with an 8 X 10 μ 1 pipette, 5 L of bacteria per well, and add Into 5 mL of LB medium, labeled as No. 1 (384 library clones), and two tubes were incubated at 37 ° C and 180 rpm for 5 h. The remaining 39 384-well cell culture plates were separately prepared in the same manner. Clone and mix, labeled as No. 2 Mix, No. 3 Mix... No. 40 Mix; b) Arrange 40 parts of the mixture obtained in the above procedure in 5 rows and 8 columns, using 8 x 100 μL pipette Five wells in each of the 5 rows and 8 columns are mixed, and the mixed 8 groups of bacteria are extracted 5 4 per well, and added to the 5 1 ^ 1 ^ medium, numbered FD1-1, FD1-2, respectively. — FD1—8, culture at 37 °C, 180 rpm for 10 h, in the same way, arrange 5 rows of X 8 columns. Each row is mixed and numbered FD2-1, FD2-2-FD2-5, FD1- 1, FD1- 2··· FD1-8 and FD2-1, FD2-2'"FD2_5 to obtain the primary mixing Pool (Primary pools);
C.构建二级混合池 a) 用 8 X 10μι移液器吸取第一批次的 40块 384孔细胞培养板第 1列的菌液 各 5 L, 加入到 5 mL LB培养基中, 标号 FD3-1 , 于 37 °C, 180 rpm 培养 10 h, 按同样方法分别将第 2至 24列进行混合, 标号为 FD3_2、 FD3-3--FD3-24, 共 24个混合池; b) 按同样方法将 40块 384孔细胞培养板的 A-P行进行混合, 分别编号为 FD4- 1、 FD4-2 · · · FD4- 16,共 16个混合池, FD3- 1、 FD3-2 · -FD3-24和 FD4- 1、 FD4-2'"FD4-16即为二级混合池 (Secondary pools ); C. Construction of two mixing tank a) with 8 X 10 μ ι pipetted broth 384 40-well cell culture plate in a first batch of each of the first column of 5 L, were added to 5 mL LB medium, Label FD3-1, cultured at 37 °C, 180 rpm for 10 h, mixing the 2nd to 24th columns in the same way, labeled FD3_2, FD3-3--FD3-24, a total of 24 mixing cells; b) The AP lines of 40 384-well cell culture plates were mixed in the same manner, numbered as FD4- 1 , FD4-2 · · · FD4- 16, a total of 16 mixing cells, FD3- 1 , FD3-2 · -FD3 -24 and FD4- 1, FD4-2'" FD4-16 is a secondary pool (Secondary pools);
D.构建超级池 将构建一级混合池的第二份混合液各取 10 μ L混合在一起,并加入到 5 mL LB培养基中, 即为超级混合池 (Super pool ), 于 37 °C、 180 rpm培养 10 h。 D. Build the super pool. Mix the first mixture of the first-stage mixing tank with 10 μL and add it to 5 mL of LB medium, which is the super pool at 37 °C. Incubate at 180 rpm for 10 h.
(4) 阳性克隆测序 对包含全长基因的阳性克隆进行 Shotgun测序, 测序覆盖率设定约为 7,最 后通过 PCR、 酶切以及测序验证, 即获得大腹园蛛 MiSp全长基因序列。 (4) Sequencing of positive clones The positive clones containing the full-length gene were subjected to Shotgun sequencing, and the sequencing coverage was set to about 7, and finally verified by PCR, restriction enzyme digestion and sequencing, and the MiSp full-length gene sequence of the spider was obtained.
4.如权利要求 3所述的一种大腹园蛛 MiSp全长基因的提取方法, 其特征在于所 述的 CTAB法为改良 CTAB法, 该方法包括以下步骤: a) 研磨: 取已处理好的蜘蛛胸部肌肉于装有 89% (V/V) 预冷 CTAB 提取缓冲液 [2% (g/V) CTAB, 1. 4 mol/L NaCl , 100 mmol/L Tris-HCl (pH8. 0), 20 mmol/L EDTA (pH 8. 0), 2% (g/V) PVP (聚乙烯吡咯垸酮)]、 1% (V/V) SDS-LDS预混液 [10%The method for extracting the MiSp full-length gene of the large-bellied spider, according to claim 3, wherein the CTAB method is a modified CTAB method, and the method comprises the following steps: a) Grinding: Take the treated spider chest muscles with 89% (V/V) pre-cooled CTAB extraction buffer [2% (g/V) CTAB, 1. 4 mol/L NaCl, 100 mmol/L Tris-HCl (pH 8. 0), 20 mmol/L EDTA (pH 8. 0), 2% (g/V) PVP (polyvinylpyrrolidone)], 1% (V/V) SDS-LDS Master Mix [10%
(g/V) SDS, 10% (g/V) LDS]和 1% (V/V) β _巯基乙醇的 1. 5 mL离心管中,充 分研磨破碎; b) 水浴: 65 °C水浴约 2 h, 其间不时上下轻柔颠倒, 直至混合液消化得很清亮; c) 提取: 加入等体积的的酚-氯仿-异戊醇 (25: 24: 1 ), 轻柔颠倒混匀, 在台 式高速离心机上以转速 12000 rpm离心 5 min, 取上清液; d) 氯仿抽提: 在上清液中加入等体积氯仿, 轻柔的颠倒混匀, 在台式高速离心 机上以转速 12000 rpm离心 5 min, 取上清液; e) 重复步骤 d), 直至水相与有机相的界面处看不到白色的蛋白质层为止, 取上 清液; f) 沉淀 DNA: 在上清液中加入 1/10体积 5 mol/L NaCl和 2. 5倍无水乙醇 (预 冷), 轻柔混匀, -20 °C放置 30 min沉淀 DNA, 在台式高速离心机上 13000 rpm 离心 15 min, 弃上清液; g) 洗涤 DNA: 在留有沉淀的离心管中加入 1 mL 70%乙醇洗涤 DNA, 在台式高速 离心机上以转速 13000 rpm离心 5 min, 弃上清液, 重复 1次; h) 保存: 室温干燥沉淀 5〜10 min, 加入适量 TE缓冲液 [10 mmol/L的 Tris_HCl (pH8. 0 ) 和 1 mmol/L的 EDTA (pH8. 0) ]溶解 DNA, 存于- 20 °。或4 °C备用。 (g/V) SDS, 10% (g/V) LDS] and 1% (V/V) β-mercaptoethanol in a 1.5 mL centrifuge tube, fully ground and crushed; b) Water bath: 65 °C water bath approx. 2 h, from time to time, gently upside down until the mixture is digested very clearly; c) Extraction: Add an equal volume of phenol-chloroform-isoamyl alcohol (25: 24: 1), gently mix by inversion, centrifuge at high speed on a table Centrifuge at 12000 rpm for 5 min on the machine, take the supernatant; d) Extract with chloroform: Add equal volume of chloroform to the supernatant, gently mix by inversion, centrifuge at 12000 rpm for 5 min on a benchtop high-speed centrifuge. Supernatant; e) Repeat step d) until the white protein layer is not visible at the interface between the aqueous phase and the organic phase, and take the supernatant; f) Precipitate DNA: Add 1/10 volume to the supernatant. Mol/L NaCl and 2.5 times absolute ethanol (pre-cooled), gently mix, precipitate the DNA at -20 °C for 30 min, centrifuge at 13,000 rpm for 15 min on a benchtop high-speed centrifuge, discard the supernatant; g) Wash DNA: Wash the DNA by adding 1 mL of 70% ethanol to the centrifuge tube with the precipitate, and centrifuge at a speed of 13000 rpm for 5 min on a benchtop high-speed centrifuge. Clear solution, repeat 1 time; h) Storage: Dry the precipitate at room temperature for 5~10 min, add appropriate amount of TE buffer [10 mmol/L Tris_HCl (pH 8.0) and 1 mmol/L EDTA (pH 8.0)] DNA, stored at - 20 °. Or spare at 4 °C.
5.一种如权利要求 1或 2所述的大腹园蛛 MiSp全长基因的表达方法, 其特征在 于该方法借助断裂蛋白质内含子介导的反式剪接技术完成分段表达大腹园蛛 MiSp, 利用蛋白质内含子介导的反式剪接技术体外定向拼接, 实现完整表达大 腹园蛛全长 MiSp, 具体步骤如下: A method for expressing a MiSp full-length gene of the large abdomen spider according to claim 1 or 2, wherein the method comprises the step of expressing the large abdomen by means of a broken protein intron-mediated trans-splicing technique. Spider MiSp, using protein intron-mediated trans-splicing technology to direct splicing in vitro, to achieve full expression of the full-length MiSp of the spider, the specific steps are as follows:
( 1 )将大腹园蛛 MiSp全长基因分为在 Rosetta2 (DE3)表达菌株中能高效表达 的三个部分: NT+RP、 RP以及 RP+CT; (1) Divided the full-length MiSp gene into three parts of the Rosetta2 (DE3) expression strain: NT+RP, RP and RP+CT;
(2 ) 利用 DNA重组技术, 将 NT+RP与筛选获得的具有高剪接活性的断裂蛋白质 内含子 Inl N端结构域编码基因融合表达, 即 NT+RP+InlN; 第二部分的 RP 5' 端与 Inl的 C端结构域融合表达, RP 3'端与 In2的 N端结构域融合表达,即 InlC +RP+ In2N;第三部分的 RP+CT与 In2的 C端结构域融合,即 In2 C + RP+CT; (2) Using DNA recombination technology, the expression of NT+RP and the cleavage protein intron Inl N-terminal domain coding gene with high splicing activity was obtained, namely NT+RP+InlN; the second part of RP 5' The end is fused to the C-terminal domain of Inl, and the 3' end of RP is expressed in fusion with the N-terminal domain of In2, ie, InlC + RP + In2N ; the third part of RP+CT is fused with the C-terminal domain of In2, ie In2 C + RP+CT;
(3)融合表达产物经 Ni-NTA纯化。将纯化后的 NT+RP+InlN与 InlC+RP+ In2N 混合后, InlN与 InlC能自发的发生反式剪接反应,自身从融合蛋白上剪接下来, 并将携带的 MiSp片段通过肽键连接、 Ni-NTA反纯化(去除 Inl ), 纯化产物继续 与第三部分的 In2 C+RP+CT混合, In2N与 In2C在空间上相互靠近, 自发的反 式剪接反应, 自身从 MiSp片段上剪接下来, 并把两端的 MiSp片段以肽键连接, 利用三步定向拼接反应得到重组的大腹园蛛 MiSp全长蛋白。 (3) The fusion expression product was purified by Ni-NTA. After mixing the purified NT+RP+InlN with InlC+RP+In2N, InlN and InlC spontaneously undergo trans-splicing reaction, and then cut from the fusion protein, and then carry the MiSp fragment through peptide bond, Ni- NTA reverse purification (removal of Inl), the purified product continues to mix with the third part of In2 C+RP+CT, In2N and In2C are spatially close to each other, spontaneous trans-splicing reaction, itself cut from the MiSp fragment, and The MiSp fragments at both ends were linked by peptide bonds, and the recombinant full-length MiSp full-length protein was obtained by a three-step directional splicing reaction.
PCT/CN2012/080204 2012-08-16 2012-08-16 Method for extracting full-length gene of misp of araneus ventricosus and expressing same WO2014026345A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/080204 WO2014026345A1 (en) 2012-08-16 2012-08-16 Method for extracting full-length gene of misp of araneus ventricosus and expressing same

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/080204 WO2014026345A1 (en) 2012-08-16 2012-08-16 Method for extracting full-length gene of misp of araneus ventricosus and expressing same

Publications (1)

Publication Number Publication Date
WO2014026345A1 true WO2014026345A1 (en) 2014-02-20

Family

ID=50101201

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/080204 WO2014026345A1 (en) 2012-08-16 2012-08-16 Method for extracting full-length gene of misp of araneus ventricosus and expressing same

Country Status (1)

Country Link
WO (1) WO2014026345A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003020916A2 (en) * 2001-08-29 2003-03-13 University Of Wyoming Spider silk protein encoding nucleic acids, polypeptides, antibodies and method of use thereof
CN1552854A (en) * 2003-05-26 2004-12-08 清华大学 Gene of coding cobweb protein and use thereof
CN101679975A (en) * 2007-03-16 2010-03-24 巴西农牧研究企业 proteins from the webs of nephilengys cruentata, avicularia juruensis and parawixia bistriata spiders

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003020916A2 (en) * 2001-08-29 2003-03-13 University Of Wyoming Spider silk protein encoding nucleic acids, polypeptides, antibodies and method of use thereof
CN1552854A (en) * 2003-05-26 2004-12-08 清华大学 Gene of coding cobweb protein and use thereof
CN101679975A (en) * 2007-03-16 2010-03-24 巴西农牧研究企业 proteins from the webs of nephilengys cruentata, avicularia juruensis and parawixia bistriata spiders

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHEN GEFEI: "Cloning and characterization of araneus ventricosus Masp gene based on Fosmid genomic library and STS/3D-PCR", CHINA EXCELLENCE MASTER THESIS FULL TEXT DATABASE, 10 June 2010 (2010-06-10) *
JOHN GATESY ET AL.: "Extreme Diversity, Conservation, and Convergence of Spider Silk Fibroin Sequences", SCIENCE, vol. 291, no. 5513, 30 March 2001 (2001-03-30), pages 2603 - 2605, XP001191057, DOI: doi:10.1126/science.1057561 *

Similar Documents

Publication Publication Date Title
ES2887366T3 (en) Methods and compositions for synthesizing improved silk fibers
CN107893080A (en) A kind of sgRNA for targetting rat Inhba genes and its application
CN108218972B (en) Proteins from the web of the Nephilengys Cruentata spider
CN116286900B (en) Acetic acid permease A gene RkAcpa and application thereof
CN107586795B (en) Method for producing phloretin by fermentation of saccharomyces cerevisiae
JP2013538554A5 (en)
WO2008019183A2 (en) Biopolymer and protein production using type iii secretion systems of gram negative bacteria
CN107759675A (en) A kind of signal peptide and its application that secernment efficiency can be improved from bacillus subtilis
CN112852650A (en) Saccharomyces cerevisiae engineering bacterium for high yield of santalene and santalol and construction method and application thereof
CN114989272A (en) Phytophthora camphora effector protein SCR97226 and application thereof
CN101240290B (en) Plant transgene expression vector containing green fluorescence protein gene and its construction method and application
CN110551190B (en) Method for producing spider silk by using silkworm
CN108101973B (en) Chlorella ellipsoidea NF-YC gene and application thereof
WO2014026345A1 (en) Method for extracting full-length gene of misp of araneus ventricosus and expressing same
CN110669787A (en) Chlorella vulgaris chloroplast homologous recombination empty vector and application thereof
CN102399786A (en) Dragline protein
CN106939314A (en) Salt sward anti-drought gene Hg.S4 and its application
CN105237631B (en) One kind is from sheep&#39;s hay albumen relevant to cold-resistant and its encoding gene and application
CN113881615B (en) Xenorhabdus nematophila with high yield of Xcn1 and application thereof
RU2451023C1 (en) Method of producing recombinant spider-web protein, fused protein, recombinant dna, expression vector, host cell and producer strain
CN110747224B (en) Lipid-rich nannochloropsis oculata chloroplast transgene system and application thereof
CN106892970A (en) A kind of Maize Kernel Development GAP-associated protein GAP, encoding gene and application
CN101892250A (en) Method for constructing cryptococcus albidus engineering bacterial strain for producing oleic acid in high yield
CN115948450B (en) Chlamydomonas reinhardtii chloroplast-saccharomyces cerevisiae-escherichia coli shuttle vector and construction method and application thereof
CN102618566B (en) Method for constructing Platymonas subcordiformis chloroplast expression system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12883002

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12883002

Country of ref document: EP

Kind code of ref document: A1