CN108681658A - A kind of algorithm of optimization foreign gene translation speed in Escherichia coli - Google Patents
A kind of algorithm of optimization foreign gene translation speed in Escherichia coli Download PDFInfo
- Publication number
- CN108681658A CN108681658A CN201810493075.6A CN201810493075A CN108681658A CN 108681658 A CN108681658 A CN 108681658A CN 201810493075 A CN201810493075 A CN 201810493075A CN 108681658 A CN108681658 A CN 108681658A
- Authority
- CN
- China
- Prior art keywords
- codon
- escherichia coli
- amino acid
- effect
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
Landscapes
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The invention discloses a kind of algorithms of optimization foreign gene translation speed in Escherichia coli, include the following steps:All codon combinations situations of pairs of amino acid are all carried out sequence alignment, the preferred codons redefined using Escherichia coli SD similar structures effects by the effect for making gene translation minibreak firstly, for SD sequence similar structures with Escherichia coli SD sequences;Then, effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all converted to the preferred codons after redefining.Present invention incorporates codon preference effect, the recycling effect of tRNA and SD sequence similar structures to make translation minibreak effect scheduling theory knowledge, designs comprehensive three kinds of effects, rational algorithms for Escherichia coli to be optimized to the codon service condition of foreign gene.
Description
Technical field
The present invention relates to gene fields, and in particular to a kind of calculation of optimization foreign gene translation speed in Escherichia coli
Method.
Background technology
Currently, all being rested on single factor test mostly using the research for influencing gene translation about codon, mostly grind
Study carefully mode be theoretical research in combination with corresponding experimental verification, single factor can be determined to gene by control variate method
Translation efficiency is with the presence or absence of influence, and most of the work about gene translation speed is all the Preference about codon
's.To the combination effect of existing factor to probe into work then be fewer and fewer.
Invention content
To solve the above problems, the present invention provides a kind of calculations of optimization foreign gene translation speed in Escherichia coli
Method.
To achieve the above object, the technical solution that the present invention takes is:
A kind of algorithm of optimization foreign gene translation speed in Escherichia coli, includes the following steps:
The effect for making gene translation minibreak firstly, for SD sequence similar structures, by all of pairs of amino acid
Codon combinations situation all carries out sequence alignment with Escherichia coli SD sequences, wherein inevitable, there are one least similar password subgroups
It closes, then thering are optimal codon combinations to correspond the combination of 400 kinds of pairs of amino acid;It is optimal by this 400 kinds
Codon combinations, which sort out, to be come, and is then counted to the codon of each amino acid in this 400 kinds of optimal codon combinations,
For the codon of each amino acid, its is optimal for the most explanation of counts in this 400 kinds of optimal codons combinations
Possibility is maximum, as the optimal codon of this kind of amino acid, that is, uses Escherichia coli SD similar structures effect again fixed
The preferred codons of justice;
Then, effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all turned
The preferred codons being changed to after redefining.
Present invention incorporates codon preference effect, the recycling effect of tRNA and SD sequence similar structures to make translation
Minibreak effect scheduling theory knowledge designs comprehensive three kinds of effects, rational algorithms come to external source base for Escherichia coli
The codon service condition of cause optimizes.
Description of the drawings
Fig. 1 is the algorithm pattern that effect is recycled in conjunction with preferred codons effect and tRNA.
Fig. 2 is the algorithm pattern that SD sequence similar structures make translation minibreak effect.
Fig. 3 is the innovatory algorithm figure that SD sequence similar structures make translation minibreak effect
Fig. 4 is a kind of flow chart of optimization foreign gene algorithm of translation speed in Escherichia coli of the embodiment of the present invention.
Fig. 5 is the protein expression spirogram of three kinds of GFT.
Specific implementation mode
In order to make objects and advantages of the present invention be more clearly understood, the present invention is carried out with reference to embodiments further
It is described in detail.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not used to limit this hair
It is bright.
Effect algorithm is recycled in conjunction with preferred codons effect and tRNA
(1) preferred codons are found according to the preferred codons effect of Escherichia coli first, specific method is according to big
The tRNA of enterobacteria determines preferred codons using abundance, for same amino acid, corresponding to the higher tRNA of abundance
Codon is the preferred codons of the amino acid;
(2) and then effect is recycled according to tRNA to optimize sequence using the preferred codons that previous step determines.
Specific practice is that the codon that same amino acid can be expressed as in sequence is all converted to its corresponding preferred codons.Example
As occurred this codon of ATC in sequence, so that it may to be turned this codon of ATC according to the codon preference in Escherichia coli
The GTC codons that tRNA uses abundance relatively high are changed to,
(3) and so on whole section of sequence can be optimized.
Make the algorithm of gene translation minibreak effect based on SD sequence similar structures
By knowwhy it is known that ribosomal minibreak is can be with this during translation due to mRNA
Caused by 3 ' the end hybridization of 16SrRNA on ribosomes.And the structure of the similar SD sequences in mRNA sequence is more, class
Similitude like the structure and SD sequences of SD sequences is higher, then the number that ribosomes pauses is also more, the time of pause
It is longer, cause the rate of gene translation also slower.So for the translation rate of optimization gene, we will be as far as possible
Ground reduces the possibility that this phenomenon occurs.According to this conclusion, algorithm steps below are designed.
(1) first, two codons adjacent in sequence are taken out every time, is converted into corresponding two amino
Acid;
(2) and then to all codons corresponding to both amino acid be combined, allow these codon combinations all
Sequence alignment is carried out with the SD sequences of Escherichia coli, carries out calling ClustalW when sequence alignment, highest scoring person is most like
, score the lowest is least similar.Because we it is desirable that with the SD sequences of Escherichia coli least similar sequence, institute
Have using the minimum sequence of score, i.e., least similar codon combinations are as this optimization to adjacent codon
(3) and then again a pair adjacent codon adjacent to adjacent codon with this optimizes, in such processes successively
Whole section of sequence is optimized, you can whole section of sequence of optimization.
(4) when comparing number one, No. second codon, at the same by number one, No. second codon exchange position again with
The SD sequences of Escherichia coli are compared.Thus accomplish No. second codon after the SD sequence alignments with Escherichia coli
Also the comparison of sequence has been carried out in the case of half part with first half, (1 length bonus point branch of branch finally is calculated to score
2 length, along with branch 2 adds the length of branch 3, it is all and divided by 2), so that it may to obtain combining number one, second
Number codon and No. second, the similar effect result of third codon.
The present invention provides a kind of algorithms of optimization foreign gene translation speed in Escherichia coli, have combined above-mentioned calculation
Method includes the following steps:
The effect for making gene translation minibreak firstly, for SD sequence similar structures, by all of pairs of amino acid
Codon combinations situation all carries out sequence alignment with Escherichia coli SD sequences, wherein inevitable, there are one least similar password subgroups
It closes, then thering are optimal codon combinations to correspond the combination of 400 kinds of pairs of amino acid;It is optimal by this 400 kinds
Codon combinations, which sort out, to be come, and is then counted to the codon of each amino acid in this 400 kinds of optimal codon combinations,
For the codon of each amino acid, its is optimal for the most explanation of counts in this 400 kinds of optimal codons combinations
Possibility is maximum, as the optimal codon of this kind of amino acid, that is, uses Escherichia coli SD similar structures effect again fixed
The preferred codons of justice;
Then effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all converted
For the preferred codons after redefining.
Algorithm evaluation and verification:
We in genetic engineering it is most-often used to green fluorescence protein gene (GFP) carry out the proof of algorithm, pass through
Genotype of the two sequences (translation speed is accelerated in an optimization, and an optimization slows down translation speed) with wild type after optimization
It is compared and assesses.
Theoretical value is assessed:
What codon adaptation indexI (codon adaptation index, CAI) measured is some gene institute in organism
It is an important indicator for reflecting codon preference with the degree that is consistent of codon and codon used in cance high-expression gene.
And the important indicator in genetic engineering for reacting exogenous gene expression amount.We compare the CAI indexes of three kinds of codons,
Optimization improves the CAI of the GFP sequences of translation speed:0.877;
Wild type GFP CAI:0.611
Optimization reduces the CAI of the GFP sequences of translation speed:0.561;
As can be seen that the CAI after the algorithm optimization makes it
Experimental verification
We look for commercial company to synthesize corresponding sequence, are transferred in escherichia coli vector and observe according to the sequence of algorithm optimization
It is transferred to rear expression quantity, the fast expression quantity within the unit interval of translation speed is bigger, and the results are shown in Figure 5:
The result shows that the algorithm can be applied to successfully in genetic engineering, foreign gene can be effectively improved/reduced big
Translation speed in enterobacteria.
GFP-WT is the expressing quantity curve of wild type GFP;GFP-SLOW is the egg for the GFP that optimization reduces translation speed
White expression quantity curve;GFP-FAST is the expressing quantity curve for the GFP that optimization improves translation speed;PBAD is blank control
(no albumen).
The above is only a preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art
For member, without departing from the principle of the present invention, it can also make several improvements and retouch, these improvements and modifications are also answered
It is considered as protection scope of the present invention.
Claims (1)
1. a kind of algorithm of optimization foreign gene translation speed in Escherichia coli, it is characterised in that:Include the following steps:
The effect for making gene translation minibreak firstly, for SD sequence similar structures, by all passwords of pairs of amino acid
Sub-portfolio situation all carries out sequence alignment with Escherichia coli SD sequences, wherein least similar codon combinations there are one inevitable,
So there are optimal codon combinations to correspond the combination of 400 kinds of pairs of amino acid;By this 400 kinds of optimal passwords
Sub-portfolio, which sorts out, to be come, and is then counted to the codon of each amino acid in this 400 kinds of optimal codon combinations, for
The codon of each amino acid, most explanation its optimal possibility of counts in this 400 kinds of optimal codons combination
Property it is maximum, as the optimal codon of this kind of amino acid, i.e., redefined using Escherichia coli SD similar structures effects
Preferred codons;
Then, effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all converted to
Preferred codons after redefining.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810493075.6A CN108681658B (en) | 2018-05-22 | 2018-05-22 | Method for optimizing translation speed of exogenous gene in escherichia coli |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810493075.6A CN108681658B (en) | 2018-05-22 | 2018-05-22 | Method for optimizing translation speed of exogenous gene in escherichia coli |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108681658A true CN108681658A (en) | 2018-10-19 |
CN108681658B CN108681658B (en) | 2021-09-21 |
Family
ID=63807611
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810493075.6A Active CN108681658B (en) | 2018-05-22 | 2018-05-22 | Method for optimizing translation speed of exogenous gene in escherichia coli |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108681658B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109979539A (en) * | 2019-04-10 | 2019-07-05 | 电子科技大学 | Gene order optimization method, device and data processing terminal |
CN115960934A (en) * | 2022-08-24 | 2023-04-14 | 深圳柏垠生物科技有限公司 | Escherichia coli expression exogenous gene optimization method and sequence thereof |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1167151A (en) * | 1997-04-02 | 1997-12-10 | 中国科学院上海生物化学研究所 | Method for deciding relative translation initial rate of external source genes in colibacillus |
CN1381591A (en) * | 2001-03-30 | 2002-11-27 | 珀尔根科学公司 | Genome analytical method |
EP1582582A1 (en) * | 2002-11-28 | 2005-10-05 | Riken | CELL EXTRACT OF i ESCHERICHIA COLI /i HAVING MUTATION IN S12 RIBOSOMAL PROTEIN AND PROCESS FOR PRODUCING PROTEIN IN CELL-FREE SYSTEM USING THE SAME |
CN1914616A (en) * | 2003-12-05 | 2007-02-14 | 科学工业研究委员会 | A computer based versatile method for identifying protein coding DNA sequences useful as drug targets |
CN103074357A (en) * | 2012-08-20 | 2013-05-01 | 广东大华农动物保健品股份有限公司 | Method for exogenous gene expression optimization in salmonella |
US20140256570A1 (en) * | 2008-11-07 | 2014-09-11 | Industrial Technology Research Institute | Methods for accurate sequence data and modified base position determination |
CN104673802A (en) * | 2015-03-12 | 2015-06-03 | 山东大学第二医院 | Irisin protein encoded nucleic acid molecule and method utilizing nucleic acid molecule to efficiently express irisin protein |
CN106520779A (en) * | 2016-11-09 | 2017-03-22 | 华南农业大学 | Method for improving 1L-2 protein expression efficiency of chicken through codon optimization of chicken 1L-2 gene |
US20170321257A1 (en) * | 2016-05-09 | 2017-11-09 | The Board Of Trustees Of The Leland Stanford Junior University | Bacterial pathogen identification by high resolution melting analysis |
-
2018
- 2018-05-22 CN CN201810493075.6A patent/CN108681658B/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1167151A (en) * | 1997-04-02 | 1997-12-10 | 中国科学院上海生物化学研究所 | Method for deciding relative translation initial rate of external source genes in colibacillus |
CN1381591A (en) * | 2001-03-30 | 2002-11-27 | 珀尔根科学公司 | Genome analytical method |
EP1582582A1 (en) * | 2002-11-28 | 2005-10-05 | Riken | CELL EXTRACT OF i ESCHERICHIA COLI /i HAVING MUTATION IN S12 RIBOSOMAL PROTEIN AND PROCESS FOR PRODUCING PROTEIN IN CELL-FREE SYSTEM USING THE SAME |
CN1914616A (en) * | 2003-12-05 | 2007-02-14 | 科学工业研究委员会 | A computer based versatile method for identifying protein coding DNA sequences useful as drug targets |
US20140256570A1 (en) * | 2008-11-07 | 2014-09-11 | Industrial Technology Research Institute | Methods for accurate sequence data and modified base position determination |
CN103074357A (en) * | 2012-08-20 | 2013-05-01 | 广东大华农动物保健品股份有限公司 | Method for exogenous gene expression optimization in salmonella |
CN104673802A (en) * | 2015-03-12 | 2015-06-03 | 山东大学第二医院 | Irisin protein encoded nucleic acid molecule and method utilizing nucleic acid molecule to efficiently express irisin protein |
US20170321257A1 (en) * | 2016-05-09 | 2017-11-09 | The Board Of Trustees Of The Leland Stanford Junior University | Bacterial pathogen identification by high resolution melting analysis |
CN106520779A (en) * | 2016-11-09 | 2017-03-22 | 华南农业大学 | Method for improving 1L-2 protein expression efficiency of chicken through codon optimization of chicken 1L-2 gene |
Non-Patent Citations (7)
Title |
---|
FENG-BIAO GUO等: "Three Computational Tools for Predicting Bacterial Essential Genes", 《GENE ESSENTIALITY》 * |
YIZHAR LAVNER等: "Codon bias as a factor in regulating expression via translation rate in the human genome", 《GENE》 * |
常美会等: "翻译速率对重组蛋白表达影响的研究进展", 《中国农业科技导报》 * |
李玉权等: "短短芽孢杆菌GZDF3全基因组密码子偏好性分析", 《基因组学与应用生物学》 * |
牛丹丹等: "改善大肠杆菌胞内氨基酰tRNA池提高外源基因表达水平", 《微生物学杂志》 * |
王钢等: "大肠杆菌体系外源蛋白表达速度的调控策略", 《过程工程学报》 * |
郑振宇等: "《基因工程》", 31 March 2015, 华中科技大学出版社 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109979539A (en) * | 2019-04-10 | 2019-07-05 | 电子科技大学 | Gene order optimization method, device and data processing terminal |
CN109979539B (en) * | 2019-04-10 | 2020-10-02 | 电子科技大学 | Gene sequence optimization method and device and data processing terminal |
CN115960934A (en) * | 2022-08-24 | 2023-04-14 | 深圳柏垠生物科技有限公司 | Escherichia coli expression exogenous gene optimization method and sequence thereof |
Also Published As
Publication number | Publication date |
---|---|
CN108681658B (en) | 2021-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108681658A (en) | A kind of algorithm of optimization foreign gene translation speed in Escherichia coli | |
Gamble et al. | Adjacent codons act in concert to modulate translation efficiency in yeast | |
KR20190077568A (en) | Inhibitors of CRISPR-Cas9 | |
Osawa et al. | Recent evidence for evolution of the genetic code | |
Boycheva et al. | Codon pairs in the genome of Escherichia coli | |
CN105647968A (en) | Fast CRISPR-Cas9 working efficiency testing system and application thereof | |
Swart et al. | The Oxytricha trifallax mitochondrial genome | |
US8932859B2 (en) | Methods for engineering polypeptide variants via somatic hypermutation and polypeptide made thereby | |
Kollmar et al. | Nuclear codon reassignments in the genomics era and mechanisms behind their evolution | |
Cheng et al. | The piggyBac transposon-derived genes TPB1 and TPB6 mediate essential transposon-like excision during the developmental rearrangement of key genes in Tetrahymena thermophila | |
Shen et al. | Complete mitochondrial genome of the Japanese snapping shrimp Alpheus japonicus (Crustacea: Decapoda: Caridea): Gene rearrangement and phylogeny within Caridea | |
MX2021014057A (en) | Optimized cannabinoid synthase polypeptides. | |
Parker | Variations in reading the genetic code | |
CN114360645A (en) | Codon optimization method of protein expression system and protein expression system | |
Magni et al. | Mutagenesis of super-suppressors in yeast | |
EA200600554A1 (en) | METHOD OF OBTAINING RECOMBINANT PROTEINS | |
Lu et al. | Efficient construction of a stable linear gene based on a TNA loop modified primer pair for gene delivery | |
MX2022001859A (en) | Method for treating muscular dystrophy by targeting lama1 gene. | |
CN102333870B (en) | Method for increasing protein expression efficiency and expression vector | |
Marcelino et al. | Evolution of the genus Mimivirus based on translation protein homology and its implication in the tree of life | |
Yoshikawa et al. | Markerless bacterial artificial chromosome manipulation method by red proteins of phage λ mediated homologous recombination utilizing fluorescent proteins for both positive and counter selection | |
Lawrence et al. | Unusual codon bias occurring within insertion sequences in Escherichia coli | |
Perez | Peculiar evolution of the Monkeypox virus genomes | |
Von Heune | A theoretical study of the attenuation control mechanism | |
Jukes | Recent problems in the genetic code |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |