CN117789825A - Gene source CDS sequence editing recording method, computer storage medium and electronic equipment - Google Patents
Gene source CDS sequence editing recording method, computer storage medium and electronic equipment Download PDFInfo
- Publication number
- CN117789825A CN117789825A CN202311856005.XA CN202311856005A CN117789825A CN 117789825 A CN117789825 A CN 117789825A CN 202311856005 A CN202311856005 A CN 202311856005A CN 117789825 A CN117789825 A CN 117789825A
- Authority
- CN
- China
- Prior art keywords
- cds sequence
- editing
- sequence
- history
- cds
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 66
- 238000000034 method Methods 0.000 title claims abstract description 51
- 230000008569 process Effects 0.000 claims abstract description 8
- 102000004169 proteins and genes Human genes 0.000 claims description 36
- 230000015654 memory Effects 0.000 claims description 35
- 108020004705 Codon Proteins 0.000 claims description 24
- 108091081062 Repeated sequence (DNA) Proteins 0.000 claims description 11
- 230000008859 change Effects 0.000 claims description 5
- 238000011144 upstream manufacturing Methods 0.000 claims description 4
- 238000012938 design process Methods 0.000 abstract description 9
- 238000013461 design Methods 0.000 abstract description 5
- 230000006870 function Effects 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 230000014509 gene expression Effects 0.000 description 5
- 238000005457 optimization Methods 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 150000001413 amino acids Chemical class 0.000 description 3
- 238000004590 computer program Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 125000003275 alpha amino acid group Chemical group 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides a recording method, a computer storage medium and electronic equipment for gene source CDS sequence editing, wherein the method comprises the following steps: s1, providing a CDS sequence; s2, editing the CDS sequence, and inserting the CDS sequence into a carrier; s3, in the inserting process, recording the editing history of the CDS sequence; s4, storing editing history information of the CDS sequence. According to the recording method for the gene source CDS sequence editing, which is disclosed by the embodiment of the invention, the history of the user editing the gene source CDS sequence can be recorded in the carrier design process, and the edited CDS sequence is not required to be analyzed by software in the carrier design process or after the carrier design is finished, so that the operation flow of the user is greatly simplified, the time and the energy are saved, and the working efficiency is improved.
Description
Technical Field
The present invention relates to the field of gene editing, and more particularly, to a recording method of gene source CDS sequence editing, a computer storage medium, and an electronic device.
Background
The CDS sequence is the coding region of a gene, i.e., the nucleotide sequence required to encode a protein, and is typically composed of four bases of ATCG, wherein each three bases form a codon, each codon corresponding to an amino acid. CDS sequences are critical to gene expression, determine the amino acid sequence of proteins, and are a very important part of the vector design process.
Currently, the analysis method of the commonly used gene source CDS sequence has BLAST software and the like, along with the continuous development of biological technology, tens of thousands of researchers are using a platform to construct a carrier, and users inserting the personalized edited CDS sequence into the carrier are more and more, and the time and the labor are wasted for the users, the operation is complicated, and the working efficiency is greatly influenced no matter after the carrier design is finished or the analysis software is used for analyzing the edited CDS sequence in the carrier design process.
Disclosure of Invention
In order to solve the technical problems, the invention provides a recording method for editing a gene source CDS sequence, a computer storage medium and electronic equipment, which can greatly improve the analysis efficiency of the edited CDS sequence.
The recording method for gene source CDS sequence editing according to the embodiment of the first aspect of the invention comprises the following steps: s1, providing a CDS sequence; s2, editing the CDS sequence, and inserting the CDS sequence into a carrier; s3, in the inserting process, recording the editing history of the CDS sequence; s4, storing editing history information of the CDS sequence.
According to the recording method for the gene source CDS sequence editing, which is disclosed by the embodiment of the invention, the history of the user editing the gene source CDS sequence can be recorded in the carrier design process, and the edited CDS sequence is not required to be analyzed by software in the carrier design process or after the carrier design is finished, so that the operation flow of the user is greatly simplified, the time and the energy are saved, and the working efficiency is improved.
According to one embodiment of the present invention, the editing history of the CDS sequence includes:
determining whether the sequence length of the CDS sequence after editing is a multiple of 3, and recording the editing history of the CDS sequence when the sequence length of the CDS sequence is not a multiple of 3.
According to an embodiment of the present invention, the editing history of the CDS sequence further includes:
when deleting the initial codon of the CDS sequence, recording the editing history of the CDS sequence;
when deleting the stop codon of the CDS sequence, recording the editing history of the CDS sequence;
and automatically displaying the interval position of the CDS sequence.
According to an embodiment of the present invention, the editing history of the CDS sequence further includes:
when a protein tag is added to the upstream of the gene of the CDS sequence, displaying five positions of the protein tag, and recording the editing history of the CDS sequence;
when a protein tag is added to the downstream of the gene of the CDS sequence, displaying five positions of the protein tag, and recording the editing history of the CDS sequence;
when the number of the protein tags is plural and the positions of the protein tags are changed, the change of the protein tags is displayed, and the editing history of the CDS sequence is recorded.
According to an embodiment of the present invention, the editing history of the CDS sequence further includes:
when a base C appears at a predetermined position of the CDS sequence, adding a base T after deleting the base C, marking the base T, and recording the editing history of the CDS sequence.
According to an embodiment of the present invention, the editing history of the CDS sequence further includes:
when the GC content in the CDS sequence is higher than 80%, the interval position of the GC content higher than 80% is displayed, and the editing history of the CDS sequence is recorded.
According to an embodiment of the present invention, the editing history of the CDS sequence further includes:
displaying the repeated sequence and interval position when the repeated sequence exists in the CDS sequence, and recording the editing history of the CDS sequence;
recording an edit history of the CDS sequence while codons in the CDS sequence are optimized.
According to an embodiment of the present invention, the editing position of the CDS sequence and the editing history of the CDS sequence are provided with corresponding marks, respectively, and when one of the editing position and the editing history is focused, the position corresponding to the other of the editing position and the editing history is displayed.
In a second aspect, embodiments of the present invention provide a computer storage medium comprising one or more computer instructions which, when executed, implement a method as described in the above embodiments.
An electronic device according to an embodiment of the third aspect of the present invention includes a memory for storing one or more computer instructions and a processor; the processor is configured to invoke and execute the one or more computer instructions to implement the method as described in any of the embodiments above.
Additional aspects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
The foregoing and/or additional aspects and advantages of the invention will become apparent and may be better understood from the following description of embodiments taken in conjunction with the accompanying drawings in which:
fig. 1 is a flowchart of a recording method of gene source CDS sequence editing according to an embodiment of the present invention;
fig. 2 is a schematic diagram of an electronic device according to an embodiment of the invention.
Reference numerals:
an electronic device 300;
a memory 310; an operating system 311; an application 312;
a processor 320; a network interface 330; an input device 340; a hard disk 350; and a display device 360.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative only and are not to be construed as limiting the invention.
In the description of the present invention, it should be understood that the terms "center", "longitudinal", "lateral", "length", "width", "thickness", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", "clockwise", "counterclockwise", "axial", "radial", "circumferential", etc. indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings are merely for convenience in describing the present invention and simplifying the description, and do not indicate or imply that the device or element being referred to must have a specific orientation, be configured and operated in a specific orientation, and therefore should not be construed as limiting the present invention. Furthermore, features defining "first", "second" may include one or more such features, either explicitly or implicitly. In the description of the present invention, unless otherwise indicated, the meaning of "a plurality" is two or more.
In the description of the present invention, it should be noted that, unless explicitly specified and limited otherwise, the terms "mounted," "connected," and "connected" are to be construed broadly, and may be either fixedly connected, detachably connected, or integrally connected, for example; can be mechanically or electrically connected; can be directly connected or indirectly connected through an intermediate medium, and can be communication between two elements. The specific meaning of the above terms in the present invention will be understood in specific cases by those of ordinary skill in the art.
The terminology mentioned in the present application is explained first.
CDS sequence: is a very important part of genomics research, and can be used for predicting the amino acid sequence of protein and further researching the structure and function of the protein.
Start codon: refers to the specific sequence recognized by RNA polymerase during transcription of DNA into RNA, which indicates the starting point for RNA polymerase to begin synthesizing mRNA at that location.
Stop codon: is a special termination flag that is used to identify whether the received message has transmitted the complete last character.
Tag: the system definition comprises common protein tags such as epitope tag proteins, protease cleavage sites and the like.
3xFLAG and FLAG: epitope Tag proteins, inside Tag.
GC content: is the proportion of (Guanine) and Cytosine (Cytosine) in the whole genome of the subject under study (e.g., actinomycetes).
Codon optimization: is a novel technique for increasing the expression level of a protein in an organism by increasing the translation efficiency of a target gene.
Interval position: the position of the modified sequence within this sequence, such as: 1-10.
The following describes a recording method of gene source CDS sequence editing according to an embodiment of the present invention in detail with reference to the accompanying drawings.
As shown in fig. 1, a recording method of gene source CDS sequence editing according to an embodiment of the present invention includes the steps of:
s1, providing a CDS sequence;
s2, editing the CDS sequence, and inserting the CDS sequence into a carrier;
s3, in the inserting process, recording the editing history of the CDS sequence;
s4, storing editing history information of the CDS sequence.
In other words, the recording method for the gene source CDS sequence editing according to the embodiment of the invention can record the editing history in the process of editing the gene source CDS sequence, each modification or editing can be provided with a button for canceling the modification, and the cancelled can be displayed by clicking the cancel. The editing history can be stored, the method does not need to use software to analyze the edited CDS sequence in the carrier design process or after the carrier design is finished, and the stored editing history can directly check all change information when a user enters editing again.
Therefore, according to the recording method for the gene source CDS sequence editing, which is disclosed by the embodiment of the invention, the history of the user editing the gene source CDS sequence in the carrier design process can be recorded, the edited CDS sequence does not need to be analyzed by software in the carrier design process or after the carrier design is finished, the operation flow of the user is greatly simplified, and the time and the energy are saved, so that the working efficiency is improved.
According to one embodiment of the present invention, the editing history of the CDS sequence includes:
determining whether the sequence length of the CDS sequence after editing is a multiple of 3, and recording the editing history of the CDS sequence when the sequence length of the CDS sequence is not a multiple of 3.
That is, when the length of the sequence after CDS sequence editing is not a multiple of 3, related information is automatically displayed, for example, a hint message is given, and the information is recorded, so that a sequence that cannot be translated into a protein can be recorded.
Optionally, the editing history of the CDS sequence further includes:
when deleting the initial codon of the CDS sequence, recording the editing history of the CDS sequence;
when deleting the stop codon of the CDS sequence, recording the editing history of the CDS sequence;
and automatically displaying the interval position of the CDS sequence.
In other words, when the ATG at the starting position is deleted in the CDS sequence editing process, the interval position of the sequence and the information of the starting codon are automatically displayed to form an editing history; correspondingly, when the last sequence TAA/TAG/TGA is deleted in the CDS sequence editing process, the interval position of the sequence and the information of the deleted termination codon are automatically displayed to form an editing history.
Since each sequence capable of normal translation starts at the start codon and ends at the stop codon. For example, a sequence that is translated from the first amino acid, i.e., the position of the ATG. If the foremost ATG is deleted, translation will begin with the next ATG, which will result in a translated protein that is shorter than the protein that was originally translated. Similarly, if the termination codon behind the gene is deleted, the original protein is additionally translated with a sequence.
Thus, by recording the editing history of the start codon and the stop codon, the user can quickly understand the difference between the translated protein and the protein which is originally required to be translated.
In some embodiments of the present invention, the editing history of the CDS sequence further includes:
when a protein tag is added to the upstream of the gene of the CDS sequence, displaying five positions of the protein tag, and recording the editing history of the CDS sequence;
when a protein tag is added to the downstream of the gene of the CDS sequence, displaying five positions of the protein tag, and recording the editing history of the CDS sequence;
when the number of the protein tags is plural and the positions of the protein tags are changed, the change of the protein tags is displayed, and the editing history of the CDS sequence is recorded.
Specifically, when Tag is added to the N-terminal of CDS sequence, the position of Tag added to the upstream of gene and the interval of corresponding sequence is automatically displayed; when Tag is added to the C-terminus of CDS sequence, the addition of Tag to the region downstream of the gene and the corresponding sequence is automatically shown.
When a plurality of tags, for example, a FLAG and a 3xFLAG are added to the CDS sequence, the positions of the tags are changed so that the FLAG is moved to the rear of the 3xFLAG, and at this time, the change of the tags is automatically displayed and the editing information is recorded in the editing history.
Optionally, according to an embodiment of the present invention, the editing history of the CDS sequence further includes:
when a base C appears at a predetermined position of the CDS sequence, adding a base T after deleting the base C, marking the base T, and recording the editing history of the CDS sequence.
For example, when the CDS sequence deletes base C at position 8, one base T is newly added and highlighted at position 8, and the editing history can be recorded as: automatic display of position 8 base C the mutation base T may affect the codon mutation, and when a stop codon appears in the middle of the sequence, automatic deletion is performed, and the position of the interval of the deleted sequence is displayed.
In some embodiments of the present invention, the editing history of the CDS sequence further includes:
when the GC content in the CDS sequence is higher than 80%, the interval position of the GC content higher than 80% is displayed, and the editing history of the CDS sequence is recorded.
In other words, if the GC content in the CDS sequence is higher than 80%, the section position having the GC content higher than 80% may be automatically displayed, and the detail button may be simultaneously configured, and the specific information may be displayed by clicking the detail button.
According to an embodiment of the present invention, the editing history of the CDS sequence further includes:
displaying the repeated sequence and interval position when the repeated sequence exists in the CDS sequence, and recording the editing history of the CDS sequence;
recording an edit history of the CDS sequence while codons in the CDS sequence are optimized.
Specifically, if there is a repeated sequence, the CDS sequence edits the history to automatically display the repeated sequence and display the section position of the repeated sequence, and may simultaneously configure a detail button, click the detail button, and may display specific information. In addition, the CDS sequence editing may use codon optimization online, and when codon optimization is used, the editing history may be recorded as a sequence for automatically displaying the codon optimization, and simultaneously configured with a detail button, a restore button, an applied button, etc., and specific information may be displayed after clicking.
High repeat/GC content may make vector construction difficult, with more difficult vector construction the more severe the repeat. Thus, by identifying the repeated sequence, the user can be made aware of the part at the time of analysis, so that different construction schemes can be replaced according to the extent of the repeated sequence.
Translation of the same amino acid from different species may use different codons and may have codon bias from species to species. Although the naturally occurring cause of codon preference is not known at present, the effect of this phenomenon on protein expression efficiency is significant. For recombinant protein expression, sequence optimization is generally required according to codon preference of the species for optimal expression.
Thus, by recording the edit history in which the codons are optimized, the translation in which the user has decrypted the codons can be made optimized.
Alternatively, in some embodiments of the present invention, the editing position of the CDS sequence and the editing history of the CDS sequence are provided with corresponding marks, respectively, and when one of the editing position and the editing history is focused, the position corresponding to the other of the editing position and the editing history is displayed.
For example, the mark may be set to a triangle, the triangle may be preset with a plurality of colors, and adjacent triangles may not be identical in color, and a triangle may be set for both the sequence editing position and the sequence editing history, and the corresponding sequence editing position and the corresponding editing history may be identical in color, and when a mouse is used to focus on a certain record of the sequence editing history, a line of the same color connects the triangles of the two.
In addition, the invention also provides a computer storage medium, which comprises one or more computer instructions, and the one or more computer instructions realize the analysis method of the biological information sequence when being executed.
That is, the computer storage medium stores a computer program which, when executed by a processor, causes the processor to perform the method of analyzing a biological information sequence of any one of the above.
As shown in fig. 2, an embodiment of the present invention provides an electronic device 300, including a memory 310 and a processor 320, where the memory 310 is configured to store one or more computer instructions, and the processor 320 is configured to invoke and execute the one or more computer instructions, thereby implementing any of the methods described above.
That is, the electronic device 300 includes: a processor 320 and a memory 310, in which memory 310 computer program instructions are stored which, when executed by the processor, cause the processor 320 to perform any of the methods described above.
Further, as shown in fig. 2, the electronic device 300 also includes a network interface 330, an input device 340, a hard disk 350, and a display device 360.
The interfaces and devices described above may be interconnected by a bus architecture. The bus architecture may be a bus and bridge that may include any number of interconnects. One or more Central Processing Units (CPUs), represented in particular by processor 320, and various circuits of one or more memories, represented by memory 310, are connected together. The bus architecture may also connect various other circuits together, such as peripheral devices, voltage regulators, and power management circuits. It is understood that a bus architecture is used to enable connected communications between these components. The bus architecture includes, in addition to a data bus, a power bus, a control bus, and a status signal bus, all of which are well known in the art and therefore will not be described in detail herein.
The network interface 330 may be connected to a network (e.g., the internet, a local area network, etc.), and may obtain relevant data from the network and store the relevant data in the hard disk 350.
The input device 340 may receive various instructions from an operator and transmit the instructions to the processor 320 for execution. The input device 340 may include a keyboard or pointing device (e.g., a mouse, a trackball, a touch pad, or a touch screen, among others).
The display device 360 may display results obtained by the processor 320 executing instructions.
The memory 310 is used for storing programs and data necessary for the operation of the operating system, and data such as intermediate results in the calculation process of the processor 320.
It will be appreciated that memory 310 in embodiments of the invention may be volatile memory or nonvolatile memory, or may include both volatile and nonvolatile memory. The nonvolatile memory may be Read Only Memory (ROM), programmable Read Only Memory (PROM), erasable Programmable Read Only Memory (EPROM), electrically Erasable Programmable Read Only Memory (EEPROM), or flash memory, among others. Volatile memory can be Random Access Memory (RAM), which acts as external cache memory. The memory 310 of the apparatus and methods described herein is intended to comprise, without being limited to, these and any other suitable types of memory.
In some implementations, the memory 310 stores the following elements, executable modules or data structures, or a subset thereof, or an extended set thereof: an operating system 311 and applications 312.
The operating system 311 includes various system programs, such as a framework layer, a core library layer, a driver layer, and the like, for implementing various basic services and processing hardware-based tasks. The application programs 312 include various application programs such as a Browser (Browser) and the like for implementing various application services. A program implementing the method of the embodiment of the present invention may be included in the application program 312.
The method disclosed in the above embodiment of the present invention may be applied to the processor 320 or implemented by the processor 320. Processor 320 may be an integrated circuit chip with signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuitry in hardware or instructions in software in processor 320. The processor 320 may be a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, or discrete hardware components, which may implement or perform the methods, steps, and logic blocks disclosed in embodiments of the present invention. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present invention may be embodied directly in the execution of a hardware decoding processor, or in the execution of a combination of hardware and software modules in a decoding processor. The software modules may be located in a random access memory, flash memory, read only memory, programmable read only memory, or electrically erasable programmable memory, registers, etc. as well known in the art. The storage medium is located in the memory 310 and the processor 320 reads the information in the memory 310 and in combination with its hardware performs the steps of the method described above.
It is to be understood that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or a combination thereof. For a hardware implementation, the processing units may be implemented within one or more Application Specific Integrated Circuits (ASICs), digital Signal Processors (DSPs), digital Signal Processing Devices (DSPDs), programmable Logic Devices (PLDs), field Programmable Gate Arrays (FPGAs), general purpose processors, controllers, micro-controllers, microprocessors, other electronic units designed to perform the functions described herein, or a combination thereof.
For a software implementation, the techniques described herein may be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. The software codes may be stored in a memory and executed by a processor. The memory may be implemented within the processor or external to the processor.
In particular, the processor 320 is further configured to read the computer program and execute any of the methods described above.
In the several embodiments provided in this application, it should be understood that the disclosed methods and apparatus may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of the units is merely a logical function division, and there may be additional divisions when actually implemented, e.g., multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical or other form.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may be physically included separately, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in hardware plus software functional units.
The integrated units implemented in the form of software functional units described above may be stored in a computer readable storage medium. The software functional unit is stored in a storage medium, and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform part of the steps of the transceiving method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (Random Access Memory, RAM), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
While the foregoing is directed to the preferred embodiments of the present invention, it will be appreciated by those skilled in the art that various modifications and adaptations can be made without departing from the principles of the present invention, and such modifications and adaptations are intended to be comprehended within the scope of the present invention.
Claims (10)
1. A recording method of gene source CDS sequence editing, characterized by comprising the steps of:
s1, providing a CDS sequence;
s2, editing the CDS sequence, and inserting the CDS sequence into a carrier;
s3, in the inserting process, recording the editing history of the CDS sequence;
s4, storing editing history information of the CDS sequence.
2. The method of claim 1, wherein the edit history of the CDS sequence comprises:
determining whether the sequence length of the CDS sequence after editing is a multiple of 3, and recording the editing history of the CDS sequence when the sequence length of the CDS sequence is not a multiple of 3.
3. The method of claim 1, wherein the edit history of the CDS sequence further comprises:
when deleting the initial codon of the CDS sequence, recording the editing history of the CDS sequence;
when deleting the stop codon of the CDS sequence, recording the editing history of the CDS sequence;
and automatically displaying the interval position of the CDS sequence.
4. The method of claim 1, wherein the edit history of the CDS sequence further comprises:
when a protein tag is added to the upstream of the gene of the CDS sequence, displaying five positions of the protein tag, and recording the editing history of the CDS sequence;
when a protein tag is added to the downstream of the gene of the CDS sequence, displaying five positions of the protein tag, and recording the editing history of the CDS sequence;
when the number of the protein tags is plural and the positions of the protein tags are changed, the change of the protein tags is displayed, and the editing history of the CDS sequence is recorded.
5. The method of claim 1, wherein the edit history of the CDS sequence further comprises:
when a base C appears at a predetermined position of the CDS sequence, adding a base T after deleting the base C, marking the base T, and recording the editing history of the CDS sequence.
6. The method of claim 1, wherein the edit history of the CDS sequence further comprises:
when the GC content in the CDS sequence is higher than 80%, the interval position of the GC content higher than 80% is displayed, and the editing history of the CDS sequence is recorded.
7. The method of claim 1, wherein the edit history of the CDS sequence further comprises:
displaying the repeated sequence and interval position when the repeated sequence exists in the CDS sequence, and recording the editing history of the CDS sequence;
recording an edit history of the CDS sequence while codons in the CDS sequence are optimized.
8. The method according to claim 1, wherein an editing position of the CDS sequence and an editing history of the CDS sequence are provided with corresponding marks, respectively, and when one of the editing position and the editing history is focused, a position corresponding to the other of the editing position and the editing history is displayed.
9. A computer storage medium comprising one or more computer instructions which, when executed, implement the method of any of claims 1-8.
10. An electronic device comprising a memory and a processor, characterized in that,
the memory is used for storing one or more computer instructions;
the processor is configured to invoke and execute the one or more computer instructions to implement the method of any of claims 1-8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311856005.XA CN117789825A (en) | 2023-12-29 | 2023-12-29 | Gene source CDS sequence editing recording method, computer storage medium and electronic equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311856005.XA CN117789825A (en) | 2023-12-29 | 2023-12-29 | Gene source CDS sequence editing recording method, computer storage medium and electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117789825A true CN117789825A (en) | 2024-03-29 |
Family
ID=90383180
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311856005.XA Pending CN117789825A (en) | 2023-12-29 | 2023-12-29 | Gene source CDS sequence editing recording method, computer storage medium and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117789825A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6303297B1 (en) * | 1992-07-17 | 2001-10-16 | Incyte Pharmaceuticals, Inc. | Database for storage and analysis of full-length sequences |
CN108004247A (en) * | 2017-12-06 | 2018-05-08 | 吉林大学 | A kind of ubiquitin-like substrate fusion protein and preparation method and medical application |
CN111145836A (en) * | 2019-12-30 | 2020-05-12 | 云舟生物科技(广州)有限公司 | Method for inserting gene nucleic acid sequence, computer storage medium and electronic device |
CN112164419A (en) * | 2020-09-04 | 2021-01-01 | 云舟生物科技(广州)有限公司 | Method for inserting protein tag into ORF and computer storage medium |
-
2023
- 2023-12-29 CN CN202311856005.XA patent/CN117789825A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6303297B1 (en) * | 1992-07-17 | 2001-10-16 | Incyte Pharmaceuticals, Inc. | Database for storage and analysis of full-length sequences |
CN108004247A (en) * | 2017-12-06 | 2018-05-08 | 吉林大学 | A kind of ubiquitin-like substrate fusion protein and preparation method and medical application |
CN111145836A (en) * | 2019-12-30 | 2020-05-12 | 云舟生物科技(广州)有限公司 | Method for inserting gene nucleic acid sequence, computer storage medium and electronic device |
CN112164419A (en) * | 2020-09-04 | 2021-01-01 | 云舟生物科技(广州)有限公司 | Method for inserting protein tag into ORF and computer storage medium |
Non-Patent Citations (1)
Title |
---|
赵慧 等: "cp4-epsps基因在毕赤酵母中的表达及鉴定", 《现代食品科技》, 11 September 2017 (2017-09-11), pages 55 - 62 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Vizueta et al. | BITACORA: A comprehensive tool for the identification and annotation of gene families in genome assemblies | |
Campbell et al. | Genome annotation and curation using MAKER and MAKER‐P | |
Pearson | Using the FASTA program to search protein and DNA sequence databases | |
Frazer et al. | Cross-species sequence comparisons: a review of methods and available resources | |
Meyer et al. | Gene structure conservation aids similarity based gene prediction | |
Uberbacher et al. | [16] Discovering and understanding genes in human DNA sequence using GRAIL | |
Wu et al. | GMAP: a genomic mapping and alignment program for mRNA and EST sequences | |
Florea et al. | Gene and alternative splicing annotation with AIR | |
Blakeley et al. | Investigating protein isoforms via proteomics: a feasibility study | |
CN111145836A (en) | Method for inserting gene nucleic acid sequence, computer storage medium and electronic device | |
US20100180196A1 (en) | Document management system, document management method, and document management program embodied on computer readable medium | |
Foissac et al. | Integrating alternative splicing detection into gene prediction | |
Hajarnavis et al. | A probabilistic model of 3′ end formation in Caenorhabditis elegans | |
Hu et al. | Using TWINSCAN to predict gene structures in genomic DNA sequences | |
CN117789825A (en) | Gene source CDS sequence editing recording method, computer storage medium and electronic equipment | |
JP6352804B2 (en) | Representation, visualization, comparison and reporting of genomic / proteomic sequences using bioinformatics character sets and mapped bioinformatics fonts | |
CN112164419A (en) | Method for inserting protein tag into ORF and computer storage medium | |
JP2000060553A (en) | Gene motif extractor, gene motif extraction treatment and recording medium that records gene motif extraction program | |
CN115458063A (en) | Carrier recommendation method, system, computer storage medium and electronic device | |
CN115982034B (en) | Test method of virtual terminal of carrier construction system, storage medium and electronic equipment | |
Hedges | RNA-seq fusion detection in clinical oncology | |
JP4411343B2 (en) | Information processing apparatus, information processing method, and program | |
CN112132637B (en) | Carrier price calculation method, computer storage medium, and electronic device | |
JP2006252541A (en) | Annotation method, annotation system, program, and computer readable recording medium | |
CN115881227B (en) | Carrier customization method and computer storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |