WO2022022197A1 - N-terminus coding sequence-based method for modifying regulatory protein expression - Google Patents

N-terminus coding sequence-based method for modifying regulatory protein expression Download PDF

Info

Publication number
WO2022022197A1
WO2022022197A1 PCT/CN2021/102986 CN2021102986W WO2022022197A1 WO 2022022197 A1 WO2022022197 A1 WO 2022022197A1 CN 2021102986 W CN2021102986 W CN 2021102986W WO 2022022197 A1 WO2022022197 A1 WO 2022022197A1
Authority
WO
WIPO (PCT)
Prior art keywords
protein
pullulanase
value
coding region
sfgfp
Prior art date
Application number
PCT/CN2021/102986
Other languages
French (fr)
Chinese (zh)
Inventor
刘松
徐奎栋
李江华
陈坚
周景文
Original Assignee
江南大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 江南大学 filed Critical 江南大学
Publication of WO2022022197A1 publication Critical patent/WO2022022197A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6809Methods for determination or identification of nucleic acids involving differential detection
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/65Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression using markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67General methods for enhancing the expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/74Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
    • C12N15/75Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Bacillus
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/14Hydrolases (3)
    • C12N9/24Hydrolases (3) acting on glycosyl compounds (3.2)
    • C12N9/2402Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
    • C12N9/2405Glucanases
    • C12N9/2451Glucanases acting on alpha-1,6-glucosidic bonds
    • C12N9/2457Pullulanase (3.2.1.41)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12YENZYMES
    • C12Y302/00Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
    • C12Y302/01Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
    • C12Y302/01041Pullulanase (3.2.1.41)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/22Vectors comprising a coding region that has been codon optimised for expression in a respective host

Definitions

  • the invention relates to a method for regulating protein expression based on the modification of an N-terminal coding sequence, belonging to the technical field of genetic engineering and enzyme engineering
  • Mutation of genes is of great significance for changing the properties of proteins. Usually, mutation sequences with better properties can be found through mutation, thereby improving the application value of proteins. Synonymous mutation of genes is a commonly used mutation method, and the expression levels of synonymous mutations of genes can vary greatly.
  • the current commonly used method is to construct a synonymous mutation library and combine it with a high-throughput screening strategy to find the best mutant.
  • this method is time-consuming, labor-intensive, and specific, and cannot be used to guide the design of other genes.
  • synthesizing a series of short peptides is beneficial to widely improve gene expression, this method will have an impact on enzyme activity, because these expression-promoting short peptides occupy the position of the signal peptide, so that the Suitable for extracellular proteins that require the addition of a signal peptide.
  • NCS N-terminal coding region
  • the method of the present invention is established based on the bioinformatics analysis of representative samples, and by this method, the nucleotide sequence of the first 30 bases of the N-terminal of any gene can be de novo designed, and synonymous mutation can be performed on it.
  • the NCS nucleotide sequence of any gene is changed to the target nucleotide sequence by mutating the primers.
  • the present invention can be used to guide the design of any gene without adding additional amino acid sequence, and the properties of the protein are minimized. It can greatly improve the expression level of the target gene.
  • the present invention provides a method for screening nucleotide sequences encoding proteins with different expression levels, measuring the values of GC3 and ⁇ G, and then calculating the relative expression level of the protein by using the following equation, that is, the PsfGFP value, and screening according to the PsfGFP value The corresponding nucleotide sequence is obtained; the P sfGFP value is positively correlated with the actual expression of the protein:
  • PsfGFP 274497.657-108717.401 ⁇ GC3+4886.529 ⁇ G.
  • the ⁇ G is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region.
  • the ⁇ G is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 96 bp region of the N-terminal coding region.
  • the protein is any protein that can be expressed in Bacillus subtilis.
  • the protein includes, but is not limited to, pullulanase.
  • amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
  • the PsfGFP value is positively correlated with the actual expression level of the protein.
  • the corresponding nucleotide sequences are screened according to the PsfGFP value.
  • the invention provides a method for regulating the protein expression of genetically engineered bacteria.
  • GC3 and ⁇ G parameters of the gene calculate the relative expression level of each nucleotide sequence according to the equation, select the nucleotide sequence with the required expression level, mutate the N-terminal coding region of the target protein accordingly, and transform it into in host cells;
  • the ⁇ G is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region.
  • the ⁇ G is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 96 bp region of the N-terminal coding region.
  • the nucleotide sequences whose P sfGFP value in the mutation library is in the top 10% are selected; when the protein expression needs to be down-regulated, the P sfGFP value in the mutation library is selected. sfGFP values are in the bottom 10% of nucleotide sequences.
  • the genetically engineered bacteria use Bacillus subtilis as a host.
  • the protein is any protein that can be expressed in Bacillus subtilis.
  • the protein includes, but is not limited to, pullulanase.
  • amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
  • the invention provides a method for regulating the expression level of pullulanase.
  • the ⁇ G is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region.
  • the recombinant plasmid is introduced into Bacillus subtilis, and the Bacillus subtilis is used to produce the protein.
  • amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
  • the present invention also protects the application of the method for screening nucleotide sequences encoding high-expression proteins, or the method for regulating the protein expression of genetically engineered bacteria in regulating the expression of a target protein.
  • the present invention also protects the application of the method for regulating the expression of pullulanase in regulating pullulanase.
  • the calculated PsfGFP value is positively correlated with the actual expression level of the protein.
  • to calculate the PsfGFP value is to select the corresponding synonymous mutation sequence as needed. It was applied to transform the N-terminus of pullulanase fused to the nucleotide sequence of the Bgls signal peptide, and the selected synonymous mutation sequence could increase the extracellular enzyme activity by 2.67 times and decrease it by 48%.
  • Figure 1 is the map of the sfGFP expression plasmid P43-NMK-sfGFP.
  • Figure 2 is a graph showing the relative fluorescence intensity of the NCS library of sfGFP.
  • Figure 3 shows the nucleotide sequence indices and fluorescence values of the 1st to 60th samples among the 172 samples.
  • Figure 4 shows the nucleotide sequence indices and fluorescence values of the 61st to 120th samples among the 172 samples.
  • Figure 5 shows the nucleotide sequence indexes and fluorescence values of the 121st to 172nd samples in the 172 samples.
  • Figure 6 is the distribution of relative fluorescence values before and after transformation.
  • Figure 7 is the map of pullulanase expression plasmid P43-NMK-Bgls fused with BglS signal peptide.
  • Figure 8 is a protein gel image of the 5 NCS variants of the BglS signal peptide.
  • Figure 9 is a graph showing the correlation between the predicted expression value of pullulanase with the addition of five Bgls signal peptide sequences and the measured value of enzyme activity.
  • Seed medium (g/L): peptone 10, yeast extract 5, sodium chloride 5;
  • Fermentation medium (g/L): The following components were dissolved in 0.9L water: peptone 12g, yeast extract 24g, glycerol 4mL.
  • Seed culture Pick a single colony of engineering bacteria and insert it into the seed medium, the culture temperature is 37°C, the shaking speed is 200r/min, and the culture is 24h;
  • Fermentation culture The seed culture liquid is inserted into the fermentation medium according to the inoculum amount of 4%, the culture temperature is 37 °C, and the fermentation is carried out for 24 hours.
  • One-step cloning kit was purchased from Nanjing Novizan Biotechnology Co., Ltd.
  • Glue concentration of 10% SDS-PAGE gel was used to analyze the protein expression level.
  • MES or MOPS buffer was used as the running buffer, and the loading volume was 10 ⁇ L.
  • the electrophoresis voltage was 150V.
  • Specific sample preparation and electrophoresis operations were performed according to the kit instructions.
  • the molecular weights (kDa) of the standard protein were: 188, 98, 62, 49, 38, 28, 17, 14, 6 and 3; and when electrophoresed in MOPS buffer, the molecular weight of the standard protein was Molecular weights (kDa) are: 191, 97, 64, 51, 39, 28, 19, 14
  • the PLytr promoter (nucleotide sequence shown in SEQ ID NO.1) was used with primers Lytr-F/Lytr-R (nucleotide sequence shown in SEQ ID NO.2 and 3) and Lytr-F-plasmid/ Lytr-R-plasmid (nucleotide sequence shown in SEQ ID NO. 4 and 5) was connected to the P43NMK plasmid by a one-step cloning kit to construct the plasmid P43NMK-Lytr;
  • the sfGFP fluorescent protein reporter gene (nucleotide sequence shown in SEQ ID NO.6) was used primers sfGFP-F/sfGFP-R (nucleotide sequence shown in SEQ ID NO.7 and 8) and sfGFP-F-plasmid/sfGFP-R-plasmid (nucleotide sequences shown in SEQ ID NO. 9 and 10) were fused to the downstream of PLytr by a one-step cloning kit to obtain the construction of P43NMK-Lytr_sfGFP, as shown in Figure 1. Show;
  • the recombinant plasmids with synonymous mutations constructed in Example 1 were transformed into the expression host Bacillus subtilis WB600, respectively, and the transformed single clones were inoculated into 96 shallow-well plates containing 200 ⁇ L of LB seed medium, and cultured for 8 hours;
  • a total of 8598 monoclonal host cells were characterized in Example 2, and the fluorescence value/OD was defined as the relative fluorescence intensity RFI. According to the level of the RFI value, the monoclonal cells were sorted from high to low, and every 50 cells were selected for sequencing identification (that is, the first One of the 1 to 50 strains was selected, one of the 51 to 100 strains was selected, and so on), and a total of 172 single clones were identified by sequencing.
  • GC3 The third base of the synonymous codon is the content of GC;
  • T3s, C3s, A3s, G3s After a synonymous mutation occurs at the first 30 bases of the N-terminal of the gene, the third synonymous codon is the frequency of T, C, A, and G, respectively;
  • CAI codon preference
  • Fop frequency of optimal codons (both above calculated ranges are 30 nucleotide sequences for NCS mutations).
  • ⁇ G the minimum free energy
  • the calculated range includes the region from the transcription start site to the downstream of NCS, in this example, 25 bases upstream of ATG (the transcription start site of the PLytr promoter) to 96 bases downstream of ATG were selected base;
  • TIR translation initiation rate
  • the range is the same as the calculation ⁇ G.
  • Example 3 Substitute the sequence of 172 samples in Example 3 into the regression prediction equation, calculate the predicted value, and compare it with the actual fluorescence value measured in Example 3, and perform correlation analysis. As shown in Figure 6, the sequence of The Pearson coefficient between the predicted value and the measured fluorescence value can reach 0.675, and the correlation is very strong, indicating that the regression prediction equation can be used to predict the protein fluorescence value.
  • Example 5 Using a prediction equation to guide NCS engineering of the signal peptide BglS gene
  • the BglS signal peptide (nucleotide sequence shown in SEQ ID NO. 13) was fused to the N-terminal of the pullulanase encoding gene (nucleotide sequence shown in SEQ ID NO. 14) to achieve pullulanase of extracellular expression.
  • the specific method is to clone the BglS signal peptide into the downstream of PLytr in P43NMK-Lytr by using the same one-step cloning method in the example to construct P43NMK-Lytr-BglS, as shown in FIG. 7 .
  • NCS region of BglS close to ATG was optimized: all the synonymous mutation combinations of the first ten amino acids of BglS were exhausted, and there were 131,072 possibilities; according to the examples 4 equations were used to calculate the GC3 and ⁇ G of each of the 131072 sequences and the theoretical value of PsfGFP, and according to the predicted value, 5 Bgls variants including wild type were selected: NCS+, NCS+', NCS-wt, NCS -', NCS-.
  • NCS+ represents the P sfGFP maximum variant
  • NCS+' represents the intermediate variant between the maximum and wild type of P sfGFP
  • NCS-wt represents the wild type
  • NCS- represents the P sfGFP minimum variant
  • NCS-' represents the intermediate variant Between the minimum of PsfGFP and the intermediate value variant of wild type, it has continuously decreasing predicted expression intensity.
  • step (1) signal peptide Bgls variants NCS+ (nucleotide sequence shown in SEQ ID NO. 15), NCS+' (nucleotide sequence shown in SEQ ID NO. 16), NCS- ' (the nucleotide sequence is shown in SEQ ID NO. 17), NCS- (the nucleotide sequence is shown in SEQ ID NO.

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Plant Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medicinal Chemistry (AREA)
  • Immunology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

The present invention relates to the field of genetic engineering. Disclosed is an N-terminus coding sequence-based method for modifying a regulatory protein expression. For the present invention, Bacillus subtilis serves as an expression host, by means of a predictive model, the nucleotide sequence most favorable for promoting gene expression is assessed in an N-terminus coding region synonymous mutation. With the combination of synonymous mutation libraries of the top ten amino acids of the NCS of a superfolder green fluorescent protein (sfGFP), the fluorescent intensity of proteins in the libraries is measured, 172 representative samples are selected, sequenced, and identified, and a statistical method is used to establish a predictive model. Pullulanase of a BlgS signal peptide is optimally integrated via the model, the extracellular enzyme activity of pullulanase can be increased to 2.67 times of that prior to transformation and be reduced by 48%, thus providing a direction for a rational transformation to design an N-terminus gene from scratch, and favoring an easily regulated expression of the gene.

Description

一种基于N端编码序列改造调控蛋白质表达的方法A method for regulating protein expression based on N-terminal coding sequence modification 技术领域technical field
本发明涉及一种基于N端编码序列改造调控蛋白质表达的方法,属于基因工程及酶工程技术领域The invention relates to a method for regulating protein expression based on the modification of an N-terminal coding sequence, belonging to the technical field of genetic engineering and enzyme engineering
背景技术Background technique
基因的突变对于改变蛋白的性质具有非常重要的意义,通常通过突变,可以从中找到性质更好的突变序列,从而提高蛋白的应用价值。基因的同义突变就是常用的一种突变手段,基因的同义突变可实现表达量相差巨大。Mutation of genes is of great significance for changing the properties of proteins. Usually, mutation sequences with better properties can be found through mutation, thereby improving the application value of proteins. Synonymous mutation of genes is a commonly used mutation method, and the expression levels of synonymous mutations of genes can vary greatly.
目前常用的方法:是通过构建同义突变文库,并结合高通量筛选策略,以期找到最佳突变体。然而这种方法耗时耗力,并且专一性强,无法用于指导其他基因的设计。尽管有的研究通过发现,合成一系列的短肽,有利于广泛提高基因的表达,然而这种方法会对酶活产生影响,由于这些促表达的短肽,占据了信号肽的位置,从而不适合于需要添加信号肽的胞外蛋白。The current commonly used method is to construct a synonymous mutation library and combine it with a high-throughput screening strategy to find the best mutant. However, this method is time-consuming, labor-intensive, and specific, and cannot be used to guide the design of other genes. Although some studies have found that synthesizing a series of short peptides is beneficial to widely improve gene expression, this method will have an impact on enzyme activity, because these expression-promoting short peptides occupy the position of the signal peptide, so that the Suitable for extracellular proteins that require the addition of a signal peptide.
现有用于改善基因表达量的方法,往往都是通过非翻译区(5’UTR)的优化,然而当5’UTR模块已经足够强时,难以继续优化并显著提高表达量。而关于N端编码区(NCS)的研究较少。因此,建立一种适用于广泛基因设计的NCS改造策略非常重要。Existing methods for improving gene expression are often through the optimization of the untranslated region (5'UTR). However, when the 5'UTR module is strong enough, it is difficult to continue to optimize and significantly increase the expression. However, there are few studies on the N-terminal coding region (NCS). Therefore, it is very important to establish an NCS engineering strategy applicable to a wide range of gene designs.
发明内容SUMMARY OF THE INVENTION
本发明的方法是基于对代表性样本的生物信息学分析而建立的,通过此方法,可从头设计任意基因的N端前30位碱基的核苷酸序列,对其进行同义突变。本模型的实施方案中,是通过突变引物,改变任意基因的NCS核苷酸序列为目的核苷酸序列所完成。本发明通过优化NCS的核苷酸序列,可用于指导任意基因的设计,并不需要添加额外的氨基酸序列,对蛋白质的性质降到最低。可极大的提高目的基因的表达水平。The method of the present invention is established based on the bioinformatics analysis of representative samples, and by this method, the nucleotide sequence of the first 30 bases of the N-terminal of any gene can be de novo designed, and synonymous mutation can be performed on it. In the embodiment of this model, the NCS nucleotide sequence of any gene is changed to the target nucleotide sequence by mutating the primers. By optimizing the nucleotide sequence of NCS, the present invention can be used to guide the design of any gene without adding additional amino acid sequence, and the properties of the protein are minimized. It can greatly improve the expression level of the target gene.
本发明提供了一种筛选编码不同表达量的蛋白的核苷酸序列的方法,测定GC3和ΔG的值,再应用下述方程式计算蛋白的相对表达量,即P sfGFP值,根据P sfGFP值筛选出对应的核苷酸序列;P sfGFP值与蛋白的实际表达量成正相关: The present invention provides a method for screening nucleotide sequences encoding proteins with different expression levels, measuring the values of GC3 and ΔG, and then calculating the relative expression level of the protein by using the following equation, that is, the PsfGFP value, and screening according to the PsfGFP value The corresponding nucleotide sequence is obtained; the P sfGFP value is positively correlated with the actual expression of the protein:
P sfGFP=274497.657-108717.401×GC3+4886.529×ΔG。 PsfGFP =274497.657-108717.401×GC3+4886.529×ΔG.
在一种实施方式中,所述GC3为编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸的同义密码子第三位碱基是GC的含量,n=3;所述ΔG为编码目的蛋白的基因的任意启动子转录起始位点至N端编码区的第90~99bp区域间的mRNA二级结构的最小自由能。In one embodiment, the GC3 is the content of GC in the first 9n-10n nucleotides of the synonymous codon of the gene encoding the target protein near the N-terminal coding region of ATG, n=3; The ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region.
在一种实施方式中,所述ΔG为编码目的蛋白的基因的任意启动子转录起始位点至N端编码区的第96bp区域间的mRNA二级结构的最小自由能。In one embodiment, the ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 96 bp region of the N-terminal coding region.
在一种实施方式中,所述蛋白为能够在枯草芽孢杆菌中表达的任意蛋白。In one embodiment, the protein is any protein that can be expressed in Bacillus subtilis.
在一种实施方式中,所述蛋白包括但不限于普鲁兰酶。In one embodiment, the protein includes, but is not limited to, pullulanase.
在一种实施方式中,所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。In one embodiment, the amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
在一种实施方式中,P sfGFP值与蛋白的实际表达量呈正相关。 In one embodiment, the PsfGFP value is positively correlated with the actual expression level of the protein.
在一种实施方式中,根据P sfGFP值筛选相应的核苷酸序列。 In one embodiment, the corresponding nucleotide sequences are screened according to the PsfGFP value.
在一种实施方式中,P sfGFP值越高,对应的蛋白表达量越高。 In one embodiment, the higher the PsfGFP value, the higher the corresponding protein expression level.
本发明提供了一种调控基因工程菌蛋白表达量的方法,选取目的蛋白N端编码区的长度为9n~10n个核苷酸,n=3,建立同义突变库;计算同义突变库中的基因的GC3和ΔG参数,根据方程计算每个核苷酸序列的相对表达量,选择具有所需表达量的核苷酸序列,将目的蛋白N端编码区进行相应突变,并将其转化到宿主细胞中;The invention provides a method for regulating the protein expression of genetically engineered bacteria. The length of the N-terminal coding region of the target protein is selected to be 9n-10n nucleotides, and n=3, and a synonymous mutation library is established; GC3 and ΔG parameters of the gene, calculate the relative expression level of each nucleotide sequence according to the equation, select the nucleotide sequence with the required expression level, mutate the N-terminal coding region of the target protein accordingly, and transform it into in host cells;
所述方程为:P sfGFP=274497.657-108717.401×GC3+4886.529×ΔG。 The equation is: P sfGFP =274497.657−108717.401×GC3+4886.529×ΔG.
在一种实施方式中,所述GC3为编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸的同义密码子第三位碱基是GC的含量,n=3。In one embodiment, the GC3 is the content of GC in the first 9n-10n nucleotides of the synonymous codon of the gene encoding the target protein near the N-terminal coding region of ATG, n=3.
在一种实施方式中,所述ΔG为编码目的蛋白的基因的任意启动子转录起始位点至N端编码区的第90~99bp区域间的mRNA二级结构的最小自由能。In one embodiment, the ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region.
在一种实施方式中,所述ΔG为编码目的蛋白的基因的任意启动子转录起始位点至N端编码区的第96bp区域间的mRNA二级结构的最小自由能。In one embodiment, the ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 96 bp region of the N-terminal coding region.
在本发明的一种实施方式中,蛋白表达量需要上调的时候,选择突变库中的P sfGFP值处于前10%的核苷酸序列;蛋白表达量需要下调的时候,选择突变库中的P sfGFP值处于后10%的核苷酸序列。 In an embodiment of the present invention, when the protein expression needs to be up-regulated, the nucleotide sequences whose P sfGFP value in the mutation library is in the top 10% are selected; when the protein expression needs to be down-regulated, the P sfGFP value in the mutation library is selected. sfGFP values are in the bottom 10% of nucleotide sequences.
在一种实施方式中,将编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸突变为相应的核苷酸序列,n=3。In one embodiment, the first 9n-10n nucleotides of the gene encoding the protein of interest near the N-terminal coding region of ATG are mutated to the corresponding nucleotide sequence, n=3.
在一种实施方式中,所述基因工程菌以枯草芽孢杆菌为宿主。In one embodiment, the genetically engineered bacteria use Bacillus subtilis as a host.
在一种实施方式中,所述蛋白为能够在枯草芽孢杆菌中表达的任意蛋白。In one embodiment, the protein is any protein that can be expressed in Bacillus subtilis.
在一种实施方式中,所述蛋白包括但不限于普鲁兰酶。In one embodiment, the protein includes, but is not limited to, pullulanase.
在一种实施方式中,所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。In one embodiment, the amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
本发明提供了一种调控普鲁兰酶表达量的方法,将选取普鲁兰酶N端编码区前9n~10n个核苷酸,n=3,进行同义突变,构建突变体库,并计算P sfGFP值,根据P sfGFP值选择相应的 同义突变序列;将目的蛋白的N端编码区进行相应突变,连接至表达载体,构建重组质粒; The invention provides a method for regulating the expression level of pullulanase. The first 9n to 10n nucleotides in the N-terminal coding region of pullulanase are selected, n=3, and synonymous mutation is performed to construct a mutant library, and Calculate the P sfGFP value, select the corresponding synonymous mutation sequence according to the P sfGFP value; mutate the N-terminal coding region of the target protein correspondingly, connect it to the expression vector, and construct a recombinant plasmid;
所述P sfGFP值按照下述方式计算:P sfGFP=274497.657-108717.401×GC3+4886.529×ΔG; The P sfGFP value is calculated as follows: P sfGFP =274497.657-108717.401×GC3+4886.529×ΔG;
所述GC3为编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸的同义密码子第三位碱基是GC的含量,n=3;The GC3 is the first 9n~10n nucleotides of the gene encoding the target protein near the N-terminal coding region of the ATG synonymous codon, and the third base is the content of GC, n=3;
所述ΔG为编码目的蛋白的基因的任意启动子转录起始位点至N端编码区的第90~99bp区域间的mRNA二级结构的最小自由能。The ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region.
在一种实施方式中,普鲁兰酶表达量需要上调的时候,选择突变库中的P sfGFP值处于前10%的核苷酸序列,将编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸突变为相应的核苷酸序列,n=3。 In one embodiment, when the expression level of pullulanase needs to be up-regulated, the nucleotide sequence with the PsfGFP value in the top 10% of the mutation library is selected, and the gene encoding the target protein is placed before the N-terminal coding region of ATG. 9n-10n nucleotides were mutated to the corresponding nucleotide sequence, n=3.
在一种实施方式中,普鲁兰酶表达量需要下调的时候,选择突变库中的P sfGFP值处于后10%的核苷酸序列,将编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸突变为相应的核苷酸序列,n=3。 In one embodiment, when the expression level of pullulanase needs to be down-regulated, the nucleotide sequence whose PsfGFP value in the mutation library is in the lower 10% is selected, and the gene encoding the target protein is placed before the N-terminal coding region of ATG. 9n-10n nucleotides were mutated to the corresponding nucleotide sequence, n=3.
在一种实施方式中,将重组质粒导入枯草芽孢杆菌,利用枯草芽孢杆菌生产蛋白。In one embodiment, the recombinant plasmid is introduced into Bacillus subtilis, and the Bacillus subtilis is used to produce the protein.
在一种实施方式中,所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。In one embodiment, the amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
本发明还保护所述筛选编码高表达量蛋白的核苷酸序列的方法,或调控基因工程菌蛋白表达量的方法在调节目的蛋白表达量中的应用。The present invention also protects the application of the method for screening nucleotide sequences encoding high-expression proteins, or the method for regulating the protein expression of genetically engineered bacteria in regulating the expression of a target protein.
本发明还保护所述调控普鲁兰酶表达量的方法在调节普鲁兰酶中的应用。The present invention also protects the application of the method for regulating the expression of pullulanase in regulating pullulanase.
本发明的有益效果:Beneficial effects of the present invention:
本发明通过结合sfGFP、并对目的基因的N端编码区进行改造(同义突变),探究出了一条用于指导蛋白做出定向改造、从而提高或降低目的蛋白表达的公式PsfGFP=274497.657-108717.401×GC3+4886.529×ΔG。所算得的PsfGFP值与蛋白的实际表达量成正相关,根据此公式,计算出PsfGFP值即根据需要选择相应的同义突变序列。将其应用于改造融合了Bgls信号肽的核苷酸序列的普鲁兰酶N端,选择的同义突变序列可使胞外酶活上调2.67倍、以及下调48%。By combining sfGFP and modifying the N-terminal coding region of the target gene (synonymous mutation), the present invention explores a formula for instructing the protein to make directional modification, thereby increasing or reducing the expression of the target protein PsfGFP=274497.657-108717.401 ×GC3+4886.529×ΔG. The calculated PsfGFP value is positively correlated with the actual expression level of the protein. According to this formula, to calculate the PsfGFP value is to select the corresponding synonymous mutation sequence as needed. It was applied to transform the N-terminus of pullulanase fused to the nucleotide sequence of the Bgls signal peptide, and the selected synonymous mutation sequence could increase the extracellular enzyme activity by 2.67 times and decrease it by 48%.
附图说明Description of drawings
图1为sfGFP表达质粒P43-NMK-sfGFP图谱。Figure 1 is the map of the sfGFP expression plasmid P43-NMK-sfGFP.
图2为sfGFP的NCS文库相对荧光强度情况图。Figure 2 is a graph showing the relative fluorescence intensity of the NCS library of sfGFP.
图3为172个样本中第1~60个样本的核苷酸序列指标和荧光值。Figure 3 shows the nucleotide sequence indices and fluorescence values of the 1st to 60th samples among the 172 samples.
图4为172个样本中第61~120个样本的核苷酸序列指标和荧光值。Figure 4 shows the nucleotide sequence indices and fluorescence values of the 61st to 120th samples among the 172 samples.
图5为172个样本中第121~172个样本的核苷酸序列指标和荧光值。Figure 5 shows the nucleotide sequence indexes and fluorescence values of the 121st to 172nd samples in the 172 samples.
图6为改造前后相对荧光值分布图。Figure 6 is the distribution of relative fluorescence values before and after transformation.
图7为融合BglS信号肽的普鲁兰酶表达质粒P43-NMK-Bgls图谱。Figure 7 is the map of pullulanase expression plasmid P43-NMK-Bgls fused with BglS signal peptide.
图8为BglS信号肽的5种NCS变体蛋白胶图。Figure 8 is a protein gel image of the 5 NCS variants of the BglS signal peptide.
图9为添加5种Bgls信号肽序列的普鲁兰酶的表达预测值与酶活测量值相关性图。Figure 9 is a graph showing the correlation between the predicted expression value of pullulanase with the addition of five Bgls signal peptide sequences and the measured value of enzyme activity.
具体实施方式detailed description
1、培养基组成:1. Culture medium composition:
种子培养基(g/L):蛋白胨10,酵母提取物5,氯化钠5;Seed medium (g/L): peptone 10, yeast extract 5, sodium chloride 5;
发酵培养基(g/L):将下列组分溶解在0.9L水中:蛋白胨12g,酵母提取物24g,甘油4mL。Fermentation medium (g/L): The following components were dissolved in 0.9L water: peptone 12g, yeast extract 24g, glycerol 4mL.
各组分溶解后高压灭菌;冷却到60℃,再加100mL灭菌的0.17mol/L的KH2PO4、0.72mol/L的K2HPO4溶液(2.31g的KH2PO4和12.54g的K2HPO4溶在足量的水中,使终体积为100mL;0.22μm的滤膜过滤除菌);After each component is dissolved, autoclave; cool to 60°C, add 100mL of sterilized 0.17mol/L KH2PO4, 0.72mol/L K2HPO4 solution (2.31g KH2PO4 and 12.54g K2HPO4 are dissolved in enough water , make the final volume 100mL; filter sterilization with 0.22μm filter);
2、培养方法:2. Cultivation method:
种子培养:挑取工程菌单菌落接入种子培养基中,培养温度37℃,摇床转速200r/min,培养24h;Seed culture: Pick a single colony of engineering bacteria and insert it into the seed medium, the culture temperature is 37°C, the shaking speed is 200r/min, and the culture is 24h;
发酵培养:种子培养液按4%的接种量接入发酵培养基中,培养温度37℃,发酵24hFermentation culture: The seed culture liquid is inserted into the fermentation medium according to the inoculum amount of 4%, the culture temperature is 37 °C, and the fermentation is carried out for 24 hours.
3、绿色荧光蛋白表达量及生物量测定3. Green fluorescent protein expression and biomass determination
在96孔板中加入用PBS缓冲液(100mM和pH 7.2)稀释成合适浓度的发酵液,使用Cytation3细胞成像微孔板检测仪(美国伯腾仪器有限公司),绿色荧光激发波长:480nm,绿色荧光发射波长:520nm,细胞生长OD吸收波长:600nm。Add the fermentation broth diluted with PBS buffer (100mM and pH 7.2) to the appropriate concentration in the 96-well plate, use Cytation3 cell imaging microplate detector (Berton Instrument Co., Ltd., USA), green fluorescence excitation wavelength: 480nm, green Fluorescence emission wavelength: 520nm, cell growth OD absorption wavelength: 600nm.
一步克隆试剂盒购自南京诺唯赞生物科技有限公司。One-step cloning kit was purchased from Nanjing Novizan Biotechnology Co., Ltd.
4、SDS-PAGE电泳检测4. SDS-PAGE electrophoresis detection
胶浓度为10%的
Figure PCTCN2021102986-appb-000001
SDS-PAGE胶被用于分析蛋白的表达水平,以MES或MOPS缓冲液为电泳缓冲液,上样量为10μL。电泳电压为150V。具体样品制备及电泳操作依照试剂盒说明书进行。以MES缓冲液进行电泳时,标准蛋白的分子量(kDa)分别为:188,98,62,49,38,28,17,14,6和3;而以MOPS缓冲液进行电泳时,标准蛋白的分子量(kDa)分别为:191,97,64,51,39,28,19,14
Glue concentration of 10%
Figure PCTCN2021102986-appb-000001
SDS-PAGE gel was used to analyze the protein expression level. MES or MOPS buffer was used as the running buffer, and the loading volume was 10 μL. The electrophoresis voltage was 150V. Specific sample preparation and electrophoresis operations were performed according to the kit instructions. When electrophoresed in MES buffer, the molecular weights (kDa) of the standard protein were: 188, 98, 62, 49, 38, 28, 17, 14, 6 and 3; and when electrophoresed in MOPS buffer, the molecular weight of the standard protein was Molecular weights (kDa) are: 191, 97, 64, 51, 39, 28, 19, 14
5、普鲁兰酶酶活测定方式5. Assay method of pullulanase activity
将1mL 1g/100mL普鲁兰多糖底物和0.9mL 100mM pH 4.5乙酸-乙酸钠缓冲液混合均匀,置于60℃水浴锅内预热10min,加入普鲁兰酶液0.1mL,反应10min后,加入3mL DNS显色液,然后于沸水浴中煮7min,置于冰水中终止显色反应,再加10mL去离子水, 混匀,在540nm下测定吸光值。单位时间内生成1μmol还原糖的酶量定义为一个酶活力单位。Mix 1mL 1g/100mL pullulan polysaccharide substrate and 0.9mL 100mM pH 4.5 acetic acid-sodium acetate buffer evenly, place it in a 60°C water bath to preheat for 10min, add 0.1mL pullulan enzyme solution, react for 10min, Add 3 mL of DNS color developing solution, then boil in a boiling water bath for 7 min, place in ice water to stop the color developing reaction, add 10 mL of deionized water, mix well, and measure the absorbance at 540 nm. The amount of enzyme that generates 1 μmol of reducing sugar per unit time is defined as one unit of enzyme activity.
实施例1:构建NCS同义突变文库Example 1: Construction of NCS synonymous mutation library
将PLytr启动子(核苷酸序列如SEQ ID NO.1所示)使用引物Lytr-F/Lytr-R(核苷酸序列如SEQ ID NO.2和3所示)和Lytr-F-plasmid/Lytr-R-plasmid(核苷酸序列如SEQ ID NO.4和5所示)通过一步克隆试剂盒连接至P43NMK质粒,构建得到质粒P43NMK-Lytr;The PLytr promoter (nucleotide sequence shown in SEQ ID NO.1) was used with primers Lytr-F/Lytr-R (nucleotide sequence shown in SEQ ID NO.2 and 3) and Lytr-F-plasmid/ Lytr-R-plasmid (nucleotide sequence shown in SEQ ID NO. 4 and 5) was connected to the P43NMK plasmid by a one-step cloning kit to construct the plasmid P43NMK-Lytr;
采用相同的手段,将sfGFP荧光蛋白报告基因(核苷酸序列如SEQ ID NO.6所示)使用引物sfGFP-F/sfGFP-R(核苷酸序列如SEQ ID NO.7和8所示)和sfGFP-F-plasmid/sfGFP-R-plasmid(核苷酸序列如SEQ ID NO.9和10所示),通过一步克隆试剂盒融合至PLytr的下游,得到构建P43NMK-Lytr_sfGFP,如图1所示;Using the same method, the sfGFP fluorescent protein reporter gene (nucleotide sequence shown in SEQ ID NO.6) was used primers sfGFP-F/sfGFP-R (nucleotide sequence shown in SEQ ID NO.7 and 8) and sfGFP-F-plasmid/sfGFP-R-plasmid (nucleotide sequences shown in SEQ ID NO. 9 and 10) were fused to the downstream of PLytr by a one-step cloning kit to obtain the construction of P43NMK-Lytr_sfGFP, as shown in Figure 1. Show;
以P43NMK-Lytr_sfGFP为模板,使用简并引物sfGFP-F-NCS/sfGFP-R-NCS(核苷酸序列如SEQ ID NO.11和12所示),获得sfGFP的N端前30位碱基发生同义突变的重组质粒,这些重组质粒构成了同义突变文库,使得sfGFP前30个碱基发生改变,但其编码的氨基酸序列保持不变。Using P43NMK-Lytr_sfGFP as a template and using degenerate primers sfGFP-F-NCS/sfGFP-R-NCS (nucleotide sequences shown in SEQ ID NO. 11 and 12), the first 30 bases of the N-terminal of sfGFP were obtained. Synonymous mutant recombinant plasmids, these recombinant plasmids constitute a synonymous mutant library, which changes the first 30 bases of sfGFP, but the encoded amino acid sequence remains unchanged.
实施例2:NCS同义突变文库的表征Example 2: Characterization of NCS Synonymous Mutation Libraries
将实施例1中构建得到的发生同义突变的重组质粒分别转化至表达宿主枯草芽孢杆菌WB600中,将转化后的单克隆接种至含有200μL LB种子培养基的96浅孔板,培养8小时;The recombinant plasmids with synonymous mutations constructed in Example 1 were transformed into the expression host Bacillus subtilis WB600, respectively, and the transformed single clones were inoculated into 96 shallow-well plates containing 200 μL of LB seed medium, and cultured for 8 hours;
接着,按照4mL/100mL的接种量接种至含有800μL TB培养基的96深孔板,培养24小时得到发酵液;Then, according to the inoculation amount of 4mL/100mL, it was inoculated into a 96 deep-well plate containing 800μL of TB medium, and cultured for 24 hours to obtain a fermentation broth;
然后将发酵液迅速置于冰上冷冻,离心后,去除上清,用PBS缓冲液(100mM、pH 7.2)稀释至合适倍数后,通过Cytation3细胞成像微孔板检测仪(美国伯腾仪器有限公司)测定荧光值(激发光480,吸收光520)以及OD 600。共表征了8598个单菌落,如图2。 Then the fermentation broth was quickly frozen on ice, and after centrifugation, the supernatant was removed, diluted to an appropriate multiple with PBS buffer (100 mM, pH 7.2), and passed through a Cytation3 cell imaging microplate detector (Borton Instrument Co., Ltd., USA). ) to measure the fluorescence value (excitation light 480, absorption light 520) and OD 600 . A total of 8598 single colonies were characterized as shown in Figure 2.
实施例3:代表性样本的序列鉴定和发酵Example 3: Sequence identification and fermentation of representative samples
实施例2中共表征8598个单克隆宿主细胞,定义荧光值/OD为相对荧光强度RFI,根据RFI值的高低,将单克隆细胞由高到低排序,每50个选择1个测序鉴定(即第1~50个菌株中选择一个,第51~100个菌株中选择一个,依此类推),共测序鉴定了172个单克隆。A total of 8598 monoclonal host cells were characterized in Example 2, and the fluorescence value/OD was defined as the relative fluorescence intensity RFI. According to the level of the RFI value, the monoclonal cells were sorted from high to low, and every 50 cells were selected for sequencing identification (that is, the first One of the 1 to 50 strains was selected, one of the 51 to 100 strains was selected, and so on), and a total of 172 single clones were identified by sequencing.
将172个经测序鉴定后的单克隆,接种至含有20mL种子培养基的250mL摇瓶中,37℃、220rpm发酵8小时后至OD 600大于4,按照4mL/100mL的比例接种到含有25mL发酵培养基的250mL摇瓶中,发酵24小时后,测定sfGFP的荧光值和OD 600。每组实验设置3个平行。其结果如下图3~5。 172 single clones identified by sequencing were inoculated into 250 mL shake flasks containing 20 mL of seed medium, fermented at 37°C and 220 rpm for 8 hours until OD 600 was greater than 4, and inoculated into 25 mL of fermentation culture at a ratio of 4 mL/100 mL. The fluorescence value and OD 600 of sfGFP were measured after 24 hours of fermentation in a 250 mL shake flask based on the sfGFP. Each group of experiments was set up in 3 parallels. The results are shown in Figures 3 to 5 below.
实施例4:使用生物信息学工具对样本的核苷酸进行序列分析Example 4: Sequence analysis of nucleotides of samples using bioinformatics tools
使用CodonW、Nupack、RBS calculator创建11个不同的核苷酸序列指标以进行序列分析。11 different nucleotide sequence metrics were created for sequence analysis using CodonW, Nupack, RBS calculator.
(1)使用CodonW计算GC、GC3、T3s、C3s、A3s、G3s、CAI、CBI、Fop(1) Use CodonW to calculate GC, GC3, T3s, C3s, A3s, G3s, CAI, CBI, Fop
GC:目的基因的G+C含量;GC: G+C content of the target gene;
GC3:同义密码子第三位碱基是GC的含量;GC3: The third base of the synonymous codon is the content of GC;
T3s、C3s、A3s、G3s:基因的N端前30位碱基发生同义突变后,第三个同义位置密码子分别是T、C、A、G的频率;T3s, C3s, A3s, G3s: After a synonymous mutation occurs at the first 30 bases of the N-terminal of the gene, the third synonymous codon is the frequency of T, C, A, and G, respectively;
CAI:密码子偏好性;CAI: codon preference;
CBI:密码子偏爱指数;CBI: codon preference index;
Fop:最佳密码子的频率(上述计算范围均是NCS突变的30个核苷酸序列)。Fop: frequency of optimal codons (both above calculated ranges are 30 nucleotide sequences for NCS mutations).
(2)使用Nupack计算ΔG(2) Calculate ΔG using Nupack
ΔG:最小自由能,其计算的范围包含转录起始位点至NCS下游的区域,在本实施例中选取ATG上游25个碱基处(PLytr启动子的转录起始位点)至ATG下游96碱基处;ΔG: the minimum free energy, the calculated range includes the region from the transcription start site to the downstream of NCS, in this example, 25 bases upstream of ATG (the transcription start site of the PLytr promoter) to 96 bases downstream of ATG were selected base;
(3)使用RBS calculator计算TIR(3) Calculate TIR using RBS calculator
TIR:翻译起始率,范围同计算ΔG。TIR: translation initiation rate, the range is the same as the calculation ΔG.
通过对172个样本中,以RFI作为因变量,11个核苷酸序列指标作为因变量进行分析,通过SPSS进行多元回归分析,方法采用逐步回归。By analyzing the 172 samples with RFI as the dependent variable and 11 nucleotide sequence indicators as the dependent variable, the multiple regression analysis was performed by SPSS, and the method was stepwise regression.
最终获得一条回归预测方程PsfGFP=274497.657-108717.401×GC3+4886.529×ΔG,见表1。并用以指导基因的NCS改造,在对NCS进行改造时,通过计算相应参数带入公式,即能根据算出的值,选择蛋白表达量高的同义突变序列。Finally, a regression prediction equation PsfGFP=274497.657-108717.401×GC3+4886.529×ΔG was obtained, see Table 1. And it is used to guide the NCS transformation of the gene. When the NCS is transformed, the corresponding parameters are calculated and brought into the formula, that is, the synonymous mutation sequence with high protein expression can be selected according to the calculated value.
表1 多元回归分析Table 1 Multiple regression analysis
Figure PCTCN2021102986-appb-000002
Figure PCTCN2021102986-appb-000002
将实施例3中的172个样本的序列代入所述回归预测方程,计算出预测值,并与实施例3中测定的实际荧光值进行比较,进行相关性分析,如图6所示,序列的预测值和测量荧光值之间的皮尔逊系数可达0.675,相关非常强,说明所述的回归预测方程可以用来预测蛋白荧光值。Substitute the sequence of 172 samples in Example 3 into the regression prediction equation, calculate the predicted value, and compare it with the actual fluorescence value measured in Example 3, and perform correlation analysis. As shown in Figure 6, the sequence of The Pearson coefficient between the predicted value and the measured fluorescence value can reach 0.675, and the correlation is very strong, indicating that the regression prediction equation can be used to predict the protein fluorescence value.
实施例5:使用预测方程指导信号肽BglS基因的NCS改造Example 5: Using a prediction equation to guide NCS engineering of the signal peptide BglS gene
(1)P43NMK-Lytr_BglS野生型的构建(1) Construction of P43NMK-Lytr_BglS wild type
将BglS信号肽(核苷酸序列如SEQ ID NO.13所示)融合在普鲁兰酶编码基因(核苷酸序列如SEQ ID NO.14所示)的N端,实现了普鲁兰酶的胞外表达。具体方式为利用实施例相同的一步克隆法,将BglS信号肽克隆至P43NMK-Lytr中的PLytr的下游,构建得到P43NMK-Lytr-BglS,如图7。The BglS signal peptide (nucleotide sequence shown in SEQ ID NO. 13) was fused to the N-terminal of the pullulanase encoding gene (nucleotide sequence shown in SEQ ID NO. 14) to achieve pullulanase of extracellular expression. The specific method is to clone the BglS signal peptide into the downstream of PLytr in P43NMK-Lytr by using the same one-step cloning method in the example to construct P43NMK-Lytr-BglS, as shown in FIG. 7 .
(2)P43NMK-Lytr_BglS同义突变质粒的构建(2) Construction of P43NMK-Lytr_BglS synonymous mutant plasmid
为了进一步的提高普鲁兰酶的胞外酶活,优化了靠近ATG的BglS的NCS区:将BglS的前十个氨基酸所有的同义突变组合方式穷举出来,共有131072种可能;按照实施例4的方程进行计算,计算131072条序列每一条序列的GC3和ΔG以及理论值PsfGFP,并根据预测值,选择包括野生型在内的5种Bgls变体:NCS+,NCS+’,NCS-wt,NCS-’,NCS-。In order to further improve the extracellular activity of pullulanase, the NCS region of BglS close to ATG was optimized: all the synonymous mutation combinations of the first ten amino acids of BglS were exhausted, and there were 131,072 possibilities; according to the examples 4 equations were used to calculate the GC3 and ΔG of each of the 131072 sequences and the theoretical value of PsfGFP, and according to the predicted value, 5 Bgls variants including wild type were selected: NCS+, NCS+', NCS-wt, NCS -', NCS-.
NCS+代表P sfGFP最大值变体;NCS+’代表P sfGFP的最大值与野生型之间的中间值变体;NCS-wt代表野生型;NCS-代表P sfGFP最小值变体,NCS-’代表介于P sfGFP的最小值与野生型之间的中间值变体,其具有连续降低的预测表达强度。 NCS+ represents the P sfGFP maximum variant; NCS+' represents the intermediate variant between the maximum and wild type of P sfGFP ; NCS-wt represents the wild type; NCS- represents the P sfGFP minimum variant, and NCS-' represents the intermediate variant Between the minimum of PsfGFP and the intermediate value variant of wild type, it has continuously decreasing predicted expression intensity.
利用与步骤(1)相同的方法,信号肽Bgls变体NCS+(核苷酸序列如SEQ ID NO.15所示)、NCS+’(核苷酸序列如SEQ ID NO.16所示)、NCS-’(核苷酸序列如SEQ ID NO.17所示)、NCS-(核苷酸序列如SEQ ID NO.18所示),分别连接至克隆至P43NMK-Lytr中的PLytr的下游,分别得到含有BglS信号肽同义突变序列的质粒;再将得到的质粒转化至表达宿主枯草芽孢杆菌WB600中,将转化后的单克隆接种至含有20mL LB培养基的250mL摇瓶中,37℃ 220rpm发酵8小时后,使得OD 600达到4以上,按照4mL/100mL的比例接种到含有25mL TB培养基的250mL摇瓶中,在37℃、250rpm发酵30小时后,测定普鲁兰酶的胞外酶活,结果如图9所示,发现其普鲁兰酶胞外酶活实现了预测的高中低5水平变化,并且与预测值有0.89的R 2水平。 Using the same method as step (1), signal peptide Bgls variants NCS+ (nucleotide sequence shown in SEQ ID NO. 15), NCS+' (nucleotide sequence shown in SEQ ID NO. 16), NCS- ' (the nucleotide sequence is shown in SEQ ID NO. 17), NCS- (the nucleotide sequence is shown in SEQ ID NO. 18), respectively connected to the downstream of the PLytr cloned into P43NMK-Lytr, respectively to obtain The plasmid containing the synonymous mutant sequence of BglS signal peptide; then transform the obtained plasmid into the expression host Bacillus subtilis WB600, inoculate the transformed single clone into a 250 mL shake flask containing 20 mL of LB medium, and ferment at 37°C and 220 rpm for 8 hours Then, make the OD 600 reach 4 or more, inoculate it into a 250 mL shake flask containing 25 mL of TB medium at a ratio of 4 mL/100 mL, and ferment at 37 °C and 250 rpm for 30 hours to measure the extracellular enzyme activity of pullulanase. As shown in Figure 9, it was found that its pullulanase extracellular enzyme activity achieved the predicted change in the high, medium and low 5 levels, and had an R 2 level of 0.89 with the predicted value.
表2 信号肽BglS的NCS突变体的预测及实际检测结果Table 2 Prediction and actual detection results of NCS mutants of signal peptide BglS
   预测荧光值(×10 3) Predicted fluorescence value (×10 3 ) 发酵酶活Fermentation enzyme activity
NCS+NCS+ 174.32174.32 41.34U/ml41.34U/ml
NCS+’NCS+’ 145.86145.86 28.18U/ml28.18U/ml
NCS-wtNCS-wt 117.64117.64 18.26U/ml18.26U/ml
NCS-’NCS-' 63.5363.53 13.44U/ml13.44U/ml
NCS-NCS- 9.419.41 5.83U/ml5.83U/ml
虽然本发明已以较佳实施例公开如上,但其并非用以限定本发明,任何熟悉此技术的人,在不脱离本发明的精神和范围内,都可做各种的改动与修饰,因此本发明的保护范围应该以权利要求书所界定的为准。Although the present invention has been disclosed above with preferred embodiments, it is not intended to limit the present invention. Anyone who is familiar with this technology can make various changes and modifications without departing from the spirit and scope of the present invention. Therefore, The protection scope of the present invention should be defined by the claims.

Claims (16)

  1. 一种调控普鲁兰酶表达量的方法,其特征在于,选取普鲁兰酶N端编码区前9n~10n个核苷酸,n=3,进行同义突变,构建突变体库,并计算P sfGFP值,根据P sfGFP值选择相应的同义突变序列;将目的蛋白的N端编码区进行相应突变,连接至表达载体,构建重组质粒; A method for regulating the expression level of pullulanase, characterized in that, selecting the first 9n to 10n nucleotides in the N-terminal coding region of pullulanase, n=3, performing synonymous mutation, constructing a mutant library, and calculating P sfGFP value, select the corresponding synonymous mutation sequence according to the P sfGFP value; carry out the corresponding mutation in the N-terminal coding region of the target protein, connect it to the expression vector, and construct a recombinant plasmid;
    所述P sfGFP值按照下述方式计算:P sfGFP=274497.657-108717.401×GC3+4886.529×ΔG; The P sfGFP value is calculated as follows: P sfGFP =274497.657-108717.401×GC3+4886.529×ΔG;
    所述GC3为编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸的同义密码子第三位碱基是GC的含量,n=3;The GC3 is a synonymous codon of the first 9n-10n nucleotides of the gene encoding the target protein close to the N-terminal coding region of ATG, and the third base is the content of GC, n=3;
    所述ΔG为编码目的蛋白的基因的任意启动子转录起始位点至N端编码区的第90~99bp区域间的mRNA二级结构的最小自由能;The ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the gene encoding the target protein and the 90-99 bp region of the N-terminal coding region;
    普鲁兰酶表达量需要上调的时候,选择突变库中的P sfGFP值处于前10%的核苷酸序列; When the expression level of pullulanase needs to be up-regulated, select the nucleotide sequence with the PsfGFP value in the top 10% in the mutation library;
    普鲁兰酶表达量需要下调的时候,选择突变库中的P sfGFP值处于后10%的核苷酸序列; When the expression level of pullulanase needs to be down-regulated, select the nucleotide sequence whose PsfGFP value in the mutation library is in the lower 10%;
    将重组质粒导入枯草芽孢杆菌,利用枯草芽孢杆菌生产蛋白;所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。The recombinant plasmid was introduced into Bacillus subtilis, and the Bacillus subtilis was used to produce the protein; the amino acid sequence of the pullulanase was shown in SEQ ID NO.19.
  2. 一种筛选编码不同表达量的蛋白的核苷酸序列的方法,其特征在于,测定GC3和ΔG的值,再应用下述方程式计算蛋白的相对表达量,即P sfGFP值,根据P sfGFP值筛选出对应的核苷酸序列;P sfGFP值与蛋白的实际表达量成正相关: A method for screening nucleotide sequences encoding proteins with different expression levels, characterized in that the values of GC3 and ΔG are measured, and the relative expression level of the protein, that is, the PsfGFP value, is calculated by applying the following equation, and screening is performed according to the PsfGFP value. The corresponding nucleotide sequence is obtained; the P sfGFP value is positively correlated with the actual expression of the protein:
    P sfGFP=274497.657-108717.401×GC3+4886.529×ΔG; P sfGFP = 274497.657-108717.401×GC3+4886.529×ΔG;
    所述GC3为编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸的同义密码子第三位碱基是GC的含量,n=3;所述ΔG为目的基因的任意启动子转录起始位点至N端编码区的第90~99bp区域间的mRNA二级结构的最小自由能。The GC3 is the synonymous codon of the first 9n-10n nucleotides of the gene encoding the target protein close to the N-terminal coding region of ATG. The third base is the content of GC, n=3; the ΔG is the amount of the target gene. The minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter and the 90-99 bp region of the N-terminal coding region.
  3. 根据权利要求2所述的方法,其特征在于,所述ΔG为目的基因的任意启动子转录起始位点至N端编码区的第96bp区域间的mRNA二级结构的最小自由能。The method according to claim 2, wherein the ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the target gene and the 96 bp region of the N-terminal coding region.
  4. 根据权利要求3所述的方法,其特征在于,所述蛋白为能够在枯草芽孢杆菌中表达的任意蛋白。The method of claim 3, wherein the protein is any protein that can be expressed in Bacillus subtilis.
  5. 根据权利要求4所述的方法,其特征在于,所述蛋白包括但不限于普鲁兰酶。The method of claim 4, wherein the protein includes but is not limited to pullulanase.
  6. 根据权利要求5所述的方法,其特征在于,所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。The method according to claim 5, wherein the amino acid sequence of the pullulanase is as shown in SEQ ID NO.19.
  7. 根据权利要求5所述的方法,其特征在于,根据P sfGFP值筛选相应的核苷酸序列。 The method according to claim 5, wherein the corresponding nucleotide sequence is screened according to the PsfGFP value.
  8. 一种调控基因工程菌蛋白表达量的方法,其特征在于,取目的蛋白N端编码区前9n~10n个核苷酸,n=3,建立同义突变库;计算同义突变库中的基因的参数GC3和ΔG,根据方程计算每个核苷酸序列的相对表达量,选择具有所需表达量的核苷酸序列,将目的蛋白N端编码区进行相应突变,并将其转化到宿主细胞中;A method for regulating the protein expression of genetically engineered bacteria, characterized in that, taking the first 9n to 10n nucleotides in the N-terminal coding region of a target protein, n=3, to establish a synonymous mutation library; and calculating the genes in the synonymous mutation library The parameters GC3 and ΔG, calculate the relative expression level of each nucleotide sequence according to the equation, select the nucleotide sequence with the required expression level, mutate the N-terminal coding region of the target protein accordingly, and transform it into host cells middle;
    所述方程为:P sfGFP=274497.657-108717.401×GC3+4886.529×ΔG; The equation is: P sfGFP =274497.657-108717.401×GC3+4886.529×ΔG;
    所述GC3为编码目的蛋白的基因靠近ATG的N端编码区前9n~10n个核苷酸的同义密码子第三位碱基是GC的含量,n=3;所述ΔG为目的基因的任意启动子转录起始位点至N端编码区的第90~99bp区域间的mRNA二级结构的最小自由能;The GC3 is the synonymous codon of the first 9n-10n nucleotides of the gene encoding the target protein close to the N-terminal coding region of ATG. The third base is the content of GC, n=3; the ΔG is the amount of the target gene. The minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter and the 90-99 bp region of the N-terminal coding region;
    蛋白表达量需要上调的时候,选择突变库中的P sfGFP值处于前10%的核苷酸序列; When the protein expression needs to be up-regulated, select the nucleotide sequence whose PsfGFP value in the mutation library is in the top 10%;
    蛋白表达量需要下调的时候,选择突变库中的P sfGFP值处于后10%的核苷酸序列。 When the protein expression needs to be down-regulated, select the nucleotide sequence whose PsfGFP value in the mutation library is in the lower 10%.
    所述ΔG为目的基因的任意启动子转录起始位点至N端编码区的第96bp区域间的mRNA二级结构的最小自由能。The ΔG is the minimum free energy of the mRNA secondary structure between the transcription initiation site of any promoter of the target gene and the 96 bp region of the N-terminal coding region.
  9. 根据权利要求8所述的方法,其特征在于,所述基因工程菌以枯草芽孢杆菌为宿主。The method according to claim 8, wherein the genetically engineered bacteria use Bacillus subtilis as a host.
  10. 根据权利要求9所述的方法,其特征在于,所述蛋白为能够在枯草芽孢杆菌中表达的任意蛋白。The method of claim 9, wherein the protein is any protein that can be expressed in Bacillus subtilis.
  11. 根据权利要求10所述的方法,其特征在于,所述蛋白包括但不限于普鲁兰酶。The method of claim 10, wherein the protein includes but is not limited to pullulanase.
  12. 根据权利要求11所述的方法,其特征在于,所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。The method according to claim 11, wherein the amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
  13. 一种调控普鲁兰酶表达量的方法,其特征在于,选取普鲁兰酶N端编码区前9n~10n个核苷酸,n=3,进行同义突变,构建突变体库,并计算P sfGFP值,根据P sfGFP值选择相应的同义突变序列;将目的蛋白的N端编码区进行相应突变,连接至表达载体,构建重组质粒; A method for regulating the expression level of pullulanase, characterized in that, selecting the first 9n to 10n nucleotides in the N-terminal coding region of pullulanase, n=3, performing synonymous mutation, constructing a mutant library, and calculating P sfGFP value, select the corresponding synonymous mutation sequence according to the P sfGFP value; carry out the corresponding mutation in the N-terminal coding region of the target protein, connect it to the expression vector, and construct a recombinant plasmid;
    普鲁兰酶表达量需要上调的时候,选择突变库中的P sfGFP值处于前10%的核苷酸序列; When the expression level of pullulanase needs to be up-regulated, select the nucleotide sequence with the PsfGFP value in the top 10% in the mutation library;
    普鲁兰酶表达量需要下调的时候,选择突变库中的P sfGFP值处于后10%的核苷酸序列。 When the expression level of pullulanase needs to be down-regulated, select the nucleotide sequence whose PsfGFP value in the mutant library is in the lower 10%.
  14. 根据权利要求13所述的方法,其特征在于,将重组质粒导入枯草芽孢杆菌,利用枯草芽孢杆菌生产蛋白。The method according to claim 13, wherein the recombinant plasmid is introduced into Bacillus subtilis, and the Bacillus subtilis is used to produce the protein.
  15. 根据权利要求14所述的方法,其特征在于,所述普鲁兰酶的氨基酸序列如SEQ ID NO.19所示。The method according to claim 14, wherein the amino acid sequence of the pullulanase is shown in SEQ ID NO.19.
  16. 权利要求1~12任一所述方法在调节目的蛋白表达量中的应用。Application of any one of the methods of claims 1 to 12 in regulating the expression level of a target protein.
PCT/CN2021/102986 2020-07-29 2021-06-29 N-terminus coding sequence-based method for modifying regulatory protein expression WO2022022197A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010742774.7A CN111850096B (en) 2020-07-29 2020-07-29 Method for modifying and regulating protein expression based on N-terminal coding sequence
CN202010742774.7 2020-07-29

Publications (1)

Publication Number Publication Date
WO2022022197A1 true WO2022022197A1 (en) 2022-02-03

Family

ID=72945069

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/102986 WO2022022197A1 (en) 2020-07-29 2021-06-29 N-terminus coding sequence-based method for modifying regulatory protein expression

Country Status (2)

Country Link
CN (1) CN111850096B (en)
WO (1) WO2022022197A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116072231A (en) * 2022-10-17 2023-05-05 中国医学科学院病原生物学研究所 Application of codon optimization based on amino acid sequence in mRNA vaccine research and development

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111850096B (en) * 2020-07-29 2022-02-01 江南大学 Method for modifying and regulating protein expression based on N-terminal coding sequence
CN113201052B (en) * 2021-04-21 2023-06-27 华东理工大学 Harpin Ea Efficient soluble expression of (2), production method and application

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102676480A (en) * 2012-06-08 2012-09-19 江南大学 Method for producing extracellular pullulanase by applying auto-induction culture medium and dual-temperature control strategy
CN102791854A (en) * 2009-12-22 2012-11-21 诺维信公司 Pullulanase variants and uses thereof
CN106190934A (en) * 2016-07-05 2016-12-07 江南大学 A kind of recombined bacillus subtilis producing pullulanase and structure thereof
CN111850096A (en) * 2020-07-29 2020-10-30 江南大学 Method for modifying and regulating protein expression based on N-terminal coding sequence

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8679790B2 (en) * 2010-01-05 2014-03-25 The Trustees Of The University Of Pennsylvania Leader sequence to boost gene expression
WO2015184466A1 (en) * 2014-05-30 2015-12-03 The Trustees Of Columbia University In The City Of New York Methods for altering polypeptide expression
CN104694452B (en) * 2015-03-30 2019-03-01 中国科学院上海高等研究院 A kind of recombined bacillus subtilis and its construction method of high yield Pullulanase
CN106754833B (en) * 2017-01-16 2020-06-09 广东溢多利生物科技股份有限公司 Method for efficiently expressing pullulanase in bacillus subtilis and recombinant bacillus subtilis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102791854A (en) * 2009-12-22 2012-11-21 诺维信公司 Pullulanase variants and uses thereof
CN102676480A (en) * 2012-06-08 2012-09-19 江南大学 Method for producing extracellular pullulanase by applying auto-induction culture medium and dual-temperature control strategy
CN106190934A (en) * 2016-07-05 2016-12-07 江南大学 A kind of recombined bacillus subtilis producing pullulanase and structure thereof
CN111850096A (en) * 2020-07-29 2020-10-30 江南大学 Method for modifying and regulating protein expression based on N-terminal coding sequence

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
DATABASE PROTEIN 23 March 2016 (2016-03-23), ANONYMOUS : "pullulanase [synthetic construct]", XP055890670, retrieved from NCBI Database accession no. AMQ67157 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116072231A (en) * 2022-10-17 2023-05-05 中国医学科学院病原生物学研究所 Application of codon optimization based on amino acid sequence in mRNA vaccine research and development
CN116072231B (en) * 2022-10-17 2024-02-13 中国医学科学院病原生物学研究所 Method for optimally designing mRNA vaccine based on codon of amino acid sequence

Also Published As

Publication number Publication date
CN111850096A (en) 2020-10-30
CN111850096B (en) 2022-02-01

Similar Documents

Publication Publication Date Title
WO2022022197A1 (en) N-terminus coding sequence-based method for modifying regulatory protein expression
Arai et al. Expression of the nir and nor genes for denitrification of Pseudomonas aeruginosa requires a novel CRP/FNR-related transcriptional regulator, DNR, in addition to ANR
JP5211559B2 (en) Mutant bilirubin oxidase with thermostability
CN107012130B (en) Glucose oxidase mutant and coding gene and application thereof
CN111826377B (en) Signal peptide for promoting extracellular expression of pullulanase
CN110628738B (en) Method for improving activity of glucose oxidase, mutant and application thereof
CN1974768A (en) Transcriptional control element of human lung carcinoma cell NGAL gene promoter region
CN111850008B (en) Signal peptides for promoting extracellular expression of proteins
CN113106103B (en) LDLR gene mutant and application thereof
CN110592084B (en) Recombinant strain transformed by rhtA gene promoter, construction method and application thereof
CN107236758B (en) Method for improving expression quantity of foreign protein by coexpression of heat shock protein
CN112391396A (en) Enterococcus faecalis quorum sensing gene switch system constructed in escherichia coli, expression vector, engineering bacteria and application thereof
CN116970067A (en) Strategy for improving recombinant expression level of human serum albumin
CN114410496B (en) Method for improving yield of exogenous protein of pichia pastoris
CN111808177B (en) Signal peptide for improving protein expression quantity and application thereof
CN111254143B (en) Construction method of simple Arthrobacter engineering strain with excellent stress tolerance, strain and application thereof
CN114525285B (en) Cloning and application of peanut high-affinity nitrate transporter gene AhNRT2.7
CN107083394B (en) Method for improving expression quantity of foreign protein by coexpression of ribosomal protein
JP7403590B2 (en) Ale beer and its manufacturing method
CN113699174B (en) Self-induction expression system and application thereof in promoting gene expression
CN113755474B (en) Carboxypeptidase, and coding gene and application thereof
CN116536346A (en) Method for improving yield of pichia pastoris extracellular glucose oxidase and application
CN116949042A (en) Alcohol oxidase 1 promoter (P) for increasing expression quantity of foreign protein of Fabry colt type yeast AOX1 ) And a method thereof
JP2024018971A (en) Method for producing graminoid lipoxygenase 3, and recombinant yeast
CN1305901C (en) Tumor transfer related protein and coding gene thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21849697

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21849697

Country of ref document: EP

Kind code of ref document: A1