CN108681658A - A kind of algorithm of optimization foreign gene translation speed in Escherichia coli - Google Patents

A kind of algorithm of optimization foreign gene translation speed in Escherichia coli Download PDF

Info

Publication number
CN108681658A
CN108681658A CN201810493075.6A CN201810493075A CN108681658A CN 108681658 A CN108681658 A CN 108681658A CN 201810493075 A CN201810493075 A CN 201810493075A CN 108681658 A CN108681658 A CN 108681658A
Authority
CN
China
Prior art keywords
codon
escherichia coli
amino acid
effect
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810493075.6A
Other languages
Chinese (zh)
Other versions
CN108681658B (en
Inventor
叶远浓
张晓娅
黄梦雅
曾柱
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guizhou Medical University
Original Assignee
Guizhou Medical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guizhou Medical University filed Critical Guizhou Medical University
Priority to CN201810493075.6A priority Critical patent/CN108681658B/en
Publication of CN108681658A publication Critical patent/CN108681658A/en
Application granted granted Critical
Publication of CN108681658B publication Critical patent/CN108681658B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids

Landscapes

  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)

Abstract

The invention discloses a kind of algorithms of optimization foreign gene translation speed in Escherichia coli, include the following steps:All codon combinations situations of pairs of amino acid are all carried out sequence alignment, the preferred codons redefined using Escherichia coli SD similar structures effects by the effect for making gene translation minibreak firstly, for SD sequence similar structures with Escherichia coli SD sequences;Then, effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all converted to the preferred codons after redefining.Present invention incorporates codon preference effect, the recycling effect of tRNA and SD sequence similar structures to make translation minibreak effect scheduling theory knowledge, designs comprehensive three kinds of effects, rational algorithms for Escherichia coli to be optimized to the codon service condition of foreign gene.

Description

A kind of algorithm of optimization foreign gene translation speed in Escherichia coli
Technical field
The present invention relates to gene fields, and in particular to a kind of calculation of optimization foreign gene translation speed in Escherichia coli Method.
Background technology
Currently, all being rested on single factor test mostly using the research for influencing gene translation about codon, mostly grind Study carefully mode be theoretical research in combination with corresponding experimental verification, single factor can be determined to gene by control variate method Translation efficiency is with the presence or absence of influence, and most of the work about gene translation speed is all the Preference about codon 's.To the combination effect of existing factor to probe into work then be fewer and fewer.
Invention content
To solve the above problems, the present invention provides a kind of calculations of optimization foreign gene translation speed in Escherichia coli Method.
To achieve the above object, the technical solution that the present invention takes is:
A kind of algorithm of optimization foreign gene translation speed in Escherichia coli, includes the following steps:
The effect for making gene translation minibreak firstly, for SD sequence similar structures, by all of pairs of amino acid Codon combinations situation all carries out sequence alignment with Escherichia coli SD sequences, wherein inevitable, there are one least similar password subgroups It closes, then thering are optimal codon combinations to correspond the combination of 400 kinds of pairs of amino acid;It is optimal by this 400 kinds Codon combinations, which sort out, to be come, and is then counted to the codon of each amino acid in this 400 kinds of optimal codon combinations, For the codon of each amino acid, its is optimal for the most explanation of counts in this 400 kinds of optimal codons combinations Possibility is maximum, as the optimal codon of this kind of amino acid, that is, uses Escherichia coli SD similar structures effect again fixed The preferred codons of justice;
Then, effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all turned The preferred codons being changed to after redefining.
Present invention incorporates codon preference effect, the recycling effect of tRNA and SD sequence similar structures to make translation Minibreak effect scheduling theory knowledge designs comprehensive three kinds of effects, rational algorithms come to external source base for Escherichia coli The codon service condition of cause optimizes.
Description of the drawings
Fig. 1 is the algorithm pattern that effect is recycled in conjunction with preferred codons effect and tRNA.
Fig. 2 is the algorithm pattern that SD sequence similar structures make translation minibreak effect.
Fig. 3 is the innovatory algorithm figure that SD sequence similar structures make translation minibreak effect
Fig. 4 is a kind of flow chart of optimization foreign gene algorithm of translation speed in Escherichia coli of the embodiment of the present invention.
Fig. 5 is the protein expression spirogram of three kinds of GFT.
Specific implementation mode
In order to make objects and advantages of the present invention be more clearly understood, the present invention is carried out with reference to embodiments further It is described in detail.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, it is not used to limit this hair It is bright.
Effect algorithm is recycled in conjunction with preferred codons effect and tRNA
(1) preferred codons are found according to the preferred codons effect of Escherichia coli first, specific method is according to big The tRNA of enterobacteria determines preferred codons using abundance, for same amino acid, corresponding to the higher tRNA of abundance Codon is the preferred codons of the amino acid;
(2) and then effect is recycled according to tRNA to optimize sequence using the preferred codons that previous step determines. Specific practice is that the codon that same amino acid can be expressed as in sequence is all converted to its corresponding preferred codons.Example As occurred this codon of ATC in sequence, so that it may to be turned this codon of ATC according to the codon preference in Escherichia coli The GTC codons that tRNA uses abundance relatively high are changed to,
(3) and so on whole section of sequence can be optimized.
Make the algorithm of gene translation minibreak effect based on SD sequence similar structures
By knowwhy it is known that ribosomal minibreak is can be with this during translation due to mRNA Caused by 3 ' the end hybridization of 16SrRNA on ribosomes.And the structure of the similar SD sequences in mRNA sequence is more, class Similitude like the structure and SD sequences of SD sequences is higher, then the number that ribosomes pauses is also more, the time of pause It is longer, cause the rate of gene translation also slower.So for the translation rate of optimization gene, we will be as far as possible Ground reduces the possibility that this phenomenon occurs.According to this conclusion, algorithm steps below are designed.
(1) first, two codons adjacent in sequence are taken out every time, is converted into corresponding two amino Acid;
(2) and then to all codons corresponding to both amino acid be combined, allow these codon combinations all Sequence alignment is carried out with the SD sequences of Escherichia coli, carries out calling ClustalW when sequence alignment, highest scoring person is most like , score the lowest is least similar.Because we it is desirable that with the SD sequences of Escherichia coli least similar sequence, institute Have using the minimum sequence of score, i.e., least similar codon combinations are as this optimization to adjacent codon
(3) and then again a pair adjacent codon adjacent to adjacent codon with this optimizes, in such processes successively Whole section of sequence is optimized, you can whole section of sequence of optimization.
(4) when comparing number one, No. second codon, at the same by number one, No. second codon exchange position again with The SD sequences of Escherichia coli are compared.Thus accomplish No. second codon after the SD sequence alignments with Escherichia coli Also the comparison of sequence has been carried out in the case of half part with first half, (1 length bonus point branch of branch finally is calculated to score 2 length, along with branch 2 adds the length of branch 3, it is all and divided by 2), so that it may to obtain combining number one, second Number codon and No. second, the similar effect result of third codon.
The present invention provides a kind of algorithms of optimization foreign gene translation speed in Escherichia coli, have combined above-mentioned calculation Method includes the following steps:
The effect for making gene translation minibreak firstly, for SD sequence similar structures, by all of pairs of amino acid Codon combinations situation all carries out sequence alignment with Escherichia coli SD sequences, wherein inevitable, there are one least similar password subgroups It closes, then thering are optimal codon combinations to correspond the combination of 400 kinds of pairs of amino acid;It is optimal by this 400 kinds Codon combinations, which sort out, to be come, and is then counted to the codon of each amino acid in this 400 kinds of optimal codon combinations, For the codon of each amino acid, its is optimal for the most explanation of counts in this 400 kinds of optimal codons combinations Possibility is maximum, as the optimal codon of this kind of amino acid, that is, uses Escherichia coli SD similar structures effect again fixed The preferred codons of justice;
Then effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all converted For the preferred codons after redefining.
Algorithm evaluation and verification:
We in genetic engineering it is most-often used to green fluorescence protein gene (GFP) carry out the proof of algorithm, pass through Genotype of the two sequences (translation speed is accelerated in an optimization, and an optimization slows down translation speed) with wild type after optimization It is compared and assesses.
Theoretical value is assessed:
What codon adaptation indexI (codon adaptation index, CAI) measured is some gene institute in organism It is an important indicator for reflecting codon preference with the degree that is consistent of codon and codon used in cance high-expression gene. And the important indicator in genetic engineering for reacting exogenous gene expression amount.We compare the CAI indexes of three kinds of codons,
Optimization improves the CAI of the GFP sequences of translation speed:0.877;
Wild type GFP CAI:0.611
Optimization reduces the CAI of the GFP sequences of translation speed:0.561;
As can be seen that the CAI after the algorithm optimization makes it
Experimental verification
We look for commercial company to synthesize corresponding sequence, are transferred in escherichia coli vector and observe according to the sequence of algorithm optimization It is transferred to rear expression quantity, the fast expression quantity within the unit interval of translation speed is bigger, and the results are shown in Figure 5:
The result shows that the algorithm can be applied to successfully in genetic engineering, foreign gene can be effectively improved/reduced big Translation speed in enterobacteria.
GFP-WT is the expressing quantity curve of wild type GFP;GFP-SLOW is the egg for the GFP that optimization reduces translation speed White expression quantity curve;GFP-FAST is the expressing quantity curve for the GFP that optimization improves translation speed;PBAD is blank control (no albumen).
The above is only a preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, without departing from the principle of the present invention, it can also make several improvements and retouch, these improvements and modifications are also answered It is considered as protection scope of the present invention.

Claims (1)

1. a kind of algorithm of optimization foreign gene translation speed in Escherichia coli, it is characterised in that:Include the following steps:
The effect for making gene translation minibreak firstly, for SD sequence similar structures, by all passwords of pairs of amino acid Sub-portfolio situation all carries out sequence alignment with Escherichia coli SD sequences, wherein least similar codon combinations there are one inevitable, So there are optimal codon combinations to correspond the combination of 400 kinds of pairs of amino acid;By this 400 kinds of optimal passwords Sub-portfolio, which sorts out, to be come, and is then counted to the codon of each amino acid in this 400 kinds of optimal codon combinations, for The codon of each amino acid, most explanation its optimal possibility of counts in this 400 kinds of optimal codons combination Property it is maximum, as the optimal codon of this kind of amino acid, i.e., redefined using Escherichia coli SD similar structures effects Preferred codons;
Then, effect is recycled in conjunction with tRNA, the codon that amino acid of the same race can be expressed as in sequence is all converted to Preferred codons after redefining.
CN201810493075.6A 2018-05-22 2018-05-22 Method for optimizing translation speed of exogenous gene in escherichia coli Active CN108681658B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810493075.6A CN108681658B (en) 2018-05-22 2018-05-22 Method for optimizing translation speed of exogenous gene in escherichia coli

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810493075.6A CN108681658B (en) 2018-05-22 2018-05-22 Method for optimizing translation speed of exogenous gene in escherichia coli

Publications (2)

Publication Number Publication Date
CN108681658A true CN108681658A (en) 2018-10-19
CN108681658B CN108681658B (en) 2021-09-21

Family

ID=63807611

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810493075.6A Active CN108681658B (en) 2018-05-22 2018-05-22 Method for optimizing translation speed of exogenous gene in escherichia coli

Country Status (1)

Country Link
CN (1) CN108681658B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109979539A (en) * 2019-04-10 2019-07-05 电子科技大学 Gene order optimization method, device and data processing terminal
CN115960934A (en) * 2022-08-24 2023-04-14 深圳柏垠生物科技有限公司 Escherichia coli expression exogenous gene optimization method and sequence thereof

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1167151A (en) * 1997-04-02 1997-12-10 中国科学院上海生物化学研究所 Method for deciding relative translation initial rate of external source genes in colibacillus
CN1381591A (en) * 2001-03-30 2002-11-27 珀尔根科学公司 Genome analytical method
EP1582582A1 (en) * 2002-11-28 2005-10-05 Riken CELL EXTRACT OF i ESCHERICHIA COLI /i HAVING MUTATION IN S12 RIBOSOMAL PROTEIN AND PROCESS FOR PRODUCING PROTEIN IN CELL-FREE SYSTEM USING THE SAME
CN1914616A (en) * 2003-12-05 2007-02-14 科学工业研究委员会 A computer based versatile method for identifying protein coding DNA sequences useful as drug targets
CN103074357A (en) * 2012-08-20 2013-05-01 广东大华农动物保健品股份有限公司 Method for exogenous gene expression optimization in salmonella
US20140256570A1 (en) * 2008-11-07 2014-09-11 Industrial Technology Research Institute Methods for accurate sequence data and modified base position determination
CN104673802A (en) * 2015-03-12 2015-06-03 山东大学第二医院 Irisin protein encoded nucleic acid molecule and method utilizing nucleic acid molecule to efficiently express irisin protein
CN106520779A (en) * 2016-11-09 2017-03-22 华南农业大学 Method for improving 1L-2 protein expression efficiency of chicken through codon optimization of chicken 1L-2 gene
US20170321257A1 (en) * 2016-05-09 2017-11-09 The Board Of Trustees Of The Leland Stanford Junior University Bacterial pathogen identification by high resolution melting analysis

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1167151A (en) * 1997-04-02 1997-12-10 中国科学院上海生物化学研究所 Method for deciding relative translation initial rate of external source genes in colibacillus
CN1381591A (en) * 2001-03-30 2002-11-27 珀尔根科学公司 Genome analytical method
EP1582582A1 (en) * 2002-11-28 2005-10-05 Riken CELL EXTRACT OF i ESCHERICHIA COLI /i HAVING MUTATION IN S12 RIBOSOMAL PROTEIN AND PROCESS FOR PRODUCING PROTEIN IN CELL-FREE SYSTEM USING THE SAME
CN1914616A (en) * 2003-12-05 2007-02-14 科学工业研究委员会 A computer based versatile method for identifying protein coding DNA sequences useful as drug targets
US20140256570A1 (en) * 2008-11-07 2014-09-11 Industrial Technology Research Institute Methods for accurate sequence data and modified base position determination
CN103074357A (en) * 2012-08-20 2013-05-01 广东大华农动物保健品股份有限公司 Method for exogenous gene expression optimization in salmonella
CN104673802A (en) * 2015-03-12 2015-06-03 山东大学第二医院 Irisin protein encoded nucleic acid molecule and method utilizing nucleic acid molecule to efficiently express irisin protein
US20170321257A1 (en) * 2016-05-09 2017-11-09 The Board Of Trustees Of The Leland Stanford Junior University Bacterial pathogen identification by high resolution melting analysis
CN106520779A (en) * 2016-11-09 2017-03-22 华南农业大学 Method for improving 1L-2 protein expression efficiency of chicken through codon optimization of chicken 1L-2 gene

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
FENG-BIAO GUO等: "Three Computational Tools for Predicting Bacterial Essential Genes", 《GENE ESSENTIALITY》 *
YIZHAR LAVNER等: "Codon bias as a factor in regulating expression via translation rate in the human genome", 《GENE》 *
常美会等: "翻译速率对重组蛋白表达影响的研究进展", 《中国农业科技导报》 *
李玉权等: "短短芽孢杆菌GZDF3全基因组密码子偏好性分析", 《基因组学与应用生物学》 *
牛丹丹等: "改善大肠杆菌胞内氨基酰tRNA池提高外源基因表达水平", 《微生物学杂志》 *
王钢等: "大肠杆菌体系外源蛋白表达速度的调控策略", 《过程工程学报》 *
郑振宇等: "《基因工程》", 31 March 2015, 华中科技大学出版社 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109979539A (en) * 2019-04-10 2019-07-05 电子科技大学 Gene order optimization method, device and data processing terminal
CN109979539B (en) * 2019-04-10 2020-10-02 电子科技大学 Gene sequence optimization method and device and data processing terminal
CN115960934A (en) * 2022-08-24 2023-04-14 深圳柏垠生物科技有限公司 Escherichia coli expression exogenous gene optimization method and sequence thereof

Also Published As

Publication number Publication date
CN108681658B (en) 2021-09-21

Similar Documents

Publication Publication Date Title
CN108681658A (en) A kind of algorithm of optimization foreign gene translation speed in Escherichia coli
Gamble et al. Adjacent codons act in concert to modulate translation efficiency in yeast
KR20190077568A (en) Inhibitors of CRISPR-Cas9
Osawa et al. Recent evidence for evolution of the genetic code
Boycheva et al. Codon pairs in the genome of Escherichia coli
CN105647968A (en) Fast CRISPR-Cas9 working efficiency testing system and application thereof
Swart et al. The Oxytricha trifallax mitochondrial genome
US8932859B2 (en) Methods for engineering polypeptide variants via somatic hypermutation and polypeptide made thereby
Kollmar et al. Nuclear codon reassignments in the genomics era and mechanisms behind their evolution
Cheng et al. The piggyBac transposon-derived genes TPB1 and TPB6 mediate essential transposon-like excision during the developmental rearrangement of key genes in Tetrahymena thermophila
Shen et al. Complete mitochondrial genome of the Japanese snapping shrimp Alpheus japonicus (Crustacea: Decapoda: Caridea): Gene rearrangement and phylogeny within Caridea
MX2021014057A (en) Optimized cannabinoid synthase polypeptides.
Parker Variations in reading the genetic code
CN114360645A (en) Codon optimization method of protein expression system and protein expression system
Magni et al. Mutagenesis of super-suppressors in yeast
EA200600554A1 (en) METHOD OF OBTAINING RECOMBINANT PROTEINS
Lu et al. Efficient construction of a stable linear gene based on a TNA loop modified primer pair for gene delivery
MX2022001859A (en) Method for treating muscular dystrophy by targeting lama1 gene.
CN102333870B (en) Method for increasing protein expression efficiency and expression vector
Marcelino et al. Evolution of the genus Mimivirus based on translation protein homology and its implication in the tree of life
Yoshikawa et al. Markerless bacterial artificial chromosome manipulation method by red proteins of phage λ mediated homologous recombination utilizing fluorescent proteins for both positive and counter selection
Lawrence et al. Unusual codon bias occurring within insertion sequences in Escherichia coli
Perez Peculiar evolution of the Monkeypox virus genomes
Von Heune A theoretical study of the attenuation control mechanism
Jukes Recent problems in the genetic code

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant