AU2020103826A4 - Whale dna sequence optimization method based on harmony search (hs) - Google Patents

Whale dna sequence optimization method based on harmony search (hs) Download PDF

Info

Publication number
AU2020103826A4
AU2020103826A4 AU2020103826A AU2020103826A AU2020103826A4 AU 2020103826 A4 AU2020103826 A4 AU 2020103826A4 AU 2020103826 A AU2020103826 A AU 2020103826A AU 2020103826 A AU2020103826 A AU 2020103826A AU 2020103826 A4 AU2020103826 A4 AU 2020103826A4
Authority
AU
Australia
Prior art keywords
whale
population
whales
dna sequence
constraints
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
AU2020103826A
Inventor
Xue LI
Bin Wang
Qiang Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University
Original Assignee
Dalian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University filed Critical Dalian University
Priority to AU2020103826A priority Critical patent/AU2020103826A4/en
Application granted granted Critical
Publication of AU2020103826A4 publication Critical patent/AU2020103826A4/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/123DNA computing
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Medical Informatics (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The disclosure provides a whale DNA sequence optimization method based on harmony search (HS), including the following steps: randomly generating an initial population; updating the population using the WOA algorithm; expanding a search range using HS to give a new population; adding a plurality of constraints to reduce a solution space; and sorting fitness values of results and outputting an optimal sequence. HS is highly discrete and exhibits a strong ability to search for local optimal solutions. Continuous HS increases the disturbance around an individual, thereby preventing the algorithm from falling into the local optimum and increasing the richness of a population at the same time. Initialize a population Calculate fitness values No p<0.5 Yes No Al<1 Yes Update by a spiral Use an individual with the Randomly select an method optimpdaftness for individual for updating Harmony search Use constraints to reduce a solution space Add 1 to the number of iterations No Whether a maximum number of iterations is reached Yes End FIG. I

Description

Initialize a population
Calculate fitness values
No p<0.5
Yes No Al<1
Yes
Update by a spiral Use an individual with the Randomly select an method optimpdaftness for individual for updating
Harmony search
Use constraints to reduce a solution space
Add 1 to the number of iterations
No Whether a maximum number of iterations is reached
Yes
End
FIG. I
WHALE DNA SEQUENCE OPTIMIZATION METHOD BASED ON HARMONY SEARCH (HS) TECHNICAL FIELD The disclosure belongs to the field of coding design in DNA computing, and in particular, relates to a whale DNA sequence optimization method based on harmony search (HS). BACKGROUND In 1994, professor Adelman of the University of Southern California solved the problem of seven-vertex Hamiltonian path by using DNA molecules as a computing medium. This finds a method for combining biotechnology with a computer and starts a new way to DNA computing. Utilizing the characteristic that DNA molecules can be uniquely identified during encoding, and following the principle of complementary base pairing, a problem to be solved turns into a collection of DNA molecules. A biochemical reaction is conducted on the generated DNA molecules to give all solution spaces for the problem, and the solution spaces are then separated and extracted to give a solution for the problem. The followed principle of complementary base pairing is that base A is paired with base T and base C is paired with base G. The biochemical reaction usually adopts hybridization of DNA molecules, so that encoded DNA molecules are completely hybridized, which is the key for accurately and reliably solving the problem. Therefore, it is a significant task to study high-quality DNA molecular sequences. SUMMARY By combining a whale algorithm with the harmony search algorithm (HSA), a whale DNA sequence optimization method based on HS is provided. Compared with the existing methods, this method adds a new constraint: pairing. By simulating this method, a DNA sequence with a higher sequence quality can be obtained. In order to achieve the above object, the disclosure adopts the following technical solution: a whale DNA sequence optimization method based on HS, which needs to obtain an optimized DNA sequence that meets a plurality of constraints. The method includes the following steps: randomly generating an initial population; updating the population using the whale optimization algorithm (WOA); expanding a search range using HS to give a new population; adding a plurality of constraints to reduce a solution space; and sorting fitness values of results and outputting an optimal sequence. The specific steps are as follows: step 1: randomly generating an initial population, and initializing parameters, where, Maxiter is a maximum number of iterations, and t is a current number of iterations; step 2: calculating fitness values for the current population, and taking a minimum of the sum of individual fitness values as a current optimal whale, and recording position information thereof; step 3: randomly generating a variable for a change in each iteration; step 4: judging the variable p(p E-[0,1]), and if p < 0.5, proceeding to step 5, otherwise, proceeding to step 6; step 5: determining whether JAI of this iteration is less than 1, and if it is less than 1, using the current optimal whale to update positions of the remaining whales, otherwise, randomly selecting a whale to update positions of other whales; step 6: using the current optimal whale to update positions of other whales by a spiral formula; step 7: subjecting all existing populations to HS, and expanding a search range to give a new outstanding population; step 8: evaluating all populations using constraints, deleting whales that do not meet the constraints, and using a fast non-dominated sorting approach to select the same number of whales as in the initial population; step 9: adding 1 to the number of iterations to determine whether a maximum number of iterations is reached, and if it is not reached, proceeding to step 2, otherwise, proceeding to step ; and step 10: sorting fitness values of results and outputting the results to give an optimized population. Through the above method, the disclosure can achieve the following effects: 1. When fitness values are calculated for the initial population, the minimum value of the sum of individual fitness values is determined as the current optimum, and the minimum value is recorded, so that individuals can better adapt to the environment when a whale algorithm population evolves, thereby enabling comprehensive optimization. 2. The whale algorithm has the disadvantage of easily falling into the local optimum. However, HS is highly discrete and exhibits a strong ability to search for local optimal solutions. Continuous HS increases the disturbance around an individual, thereby preventing the algorithm from falling into the local optimum and increasing the richness of a population at the same time. 3. The whale DNA sequence optimization algorithm based on HS provided by the disclosure can be used to acquire DNA coding sequences with high quality. BRIEF DESCRIPTION OF DRAWING FIG. 1 is a flowchart for implementing the disclosure. DETAILED DESCRIPTION The disclosure will be described in detail below by way of non-limiting example only, with reference to the accompanying drawing. The disclosure adopts the following 8 constraints: hairpin structure, H-measure, continuity, similarity, Hamming distance, melting temperature, GC content, and a newly proposed constraint: pairing. The first four constraints mentioned above are adopted as the objective function, and the rest are adopted as constraints. They are used in the second step of the claims to calculate the fitness of each individual. Hairpin structure indicates that a DNA strand has self-folding due to self-complementarity. H-measure represents the number of complementary base pairs in two complementary sequences, which is used to limit the unnecessary hybridization among sequences. Continuity means that the number of consecutive identical bases in a specified interval of a DNA sequence should be limited to a specific threshold. Similarity refers to the probability that two DNA sequences in the same direction have the same base on an allele. Hamming distance refers to the number of different bases on alleles of two different DNA sequences. Melting temperature indicates the temperature at which half of a DNA molecule changes from a double-stranded state to a single-stranded state. GC content indicates the percentage of bases G (guanine) and C (cytosine) contained in any DNA strand in all bases in this DNA strand, which is defined as 50% in the disclosure. Pairing means that any 3 consecutive bases are regarded as one unit, and then the unit is compared with all the remaining units to see if they are paired; and if they are completely paired, the positions of two different bases are adjusted. Example 1 The example of the disclosure is implemented on the premise of the technical solution of the disclosure, and the detailed implementations and specific operation processes are given, but the protection scope of the disclosure is not limited to the following examples. In the example, the length and dimension for DNA coding were both 20, and constraints such as hairpin structure, H measure, continuity, similarity, Hamming distance, melting temperature, GC content, and pairing were as described above. Step 1: An initial DNA coding sequence, with a dimension and a length both of 20, was initialized. Relevant parameters were initialized: the maximum number of iterations Maxiter: 300; the number of iterations t: starting from 0; and the Hamming distance L: 11. Step 2: Fitness values were calculated for an existing population. The individual with the minimum fitness value was adopted as the current optimal solution, and position information thereof was recorded. Step 3: A parameter value was defined for a change in each iteration. a was reduced to 0 from 2 using a linear method, i was any random number between [0,1]; 1 was any random number between [-1,1]; p was any random number between [0,1]; A was defined according to formula (6): = 2a -' - a (6) Step 4: A randomly selected p was judged, and if p < 0.5, it will proceed to step 5, otherwise, it will proceed to step 6. Step 5: It was determined whether JAI of this iteration is less than 1, and if it is less than 1, the current optimal whale will be used to update positions of the remaining whales according to formula (7), otherwise, a whale was randomly selected to update other whales according to formula (8); Z(t + 1) = X'(t) - A - D (7) where,(t) - r 2, F(t) is the position of the current optimal whale, and X is the current updated whale position; Z(t + 1) = Xrand - A D (8) where, D = C rand-I, and Xrand represents an randomly-selected whale position. Step 6: The current optimal whale was used to update positions of other whales by a spiral formula shown as follows: Z(t + 1)=7 - e' - cos(2rcl) + X(t) (9) where, D' = |X(t)- Z|. Step 7: All populations were subjected to HS. In the HS, a voice was transformed according to the current timbre, a transformed voice was compared with the original voice, and a sound with a better timbre was reserved, thereby acquiring a new outstanding population. Step 8: All populations were evaluated using constraints, whales that do not meet the constraints were deleted, and whales that meet the constraints were reserved. A fast non-dominated sorting approach was adopted to pick out the top 20 among whales for the next iteration. If the number of populations that meet the constraints is below 20, all will enter the next iteration. Step 9: 1 was added to the number of iterations to determine whether a maximum number of iterations is reached, and if it is not reached, it will proceed to step 2, otherwise, it will proceed to step 10. Step 10: Fitness values of results were sorted and the results were output to give an optimized population. Table 1 Initial DNA sequences AATATCCGAAATCCCCTGCT GCTTAAGATTTGTGAATGAT CGAAATCTTAATAGACACGA GGAAAAAATTATCATCAAAT AATGGCATCATGGCGACTTA CATGTAGCCTGGATACTTTG TATAATCTGTGTTTAAAATA ATACAATCGAACAAGATGGT GAACTATTAAATTCAAATCA GTATAAATCCATTAACAGGC CATGCTTTTGTAGCTATTAG
GCATCCGCTATTAATATATG GTAAATTGATTAATGGCATA TCCTAGTGCTTATACAGATT ACCGGCCTCTGTTATATATA CATTCACTTTTCAGTTCACC TAGTAAGTAATTAGGTTTAT GTTTTGAGAATTTTAATCTT AGAACCATAATTTAACTGTC GCTTTTCCTATAGTTCTTAT
Table 2 A set of the optimal DNA sequences CTCGTCTAACCTTCTTCAGC CTGTGTGGAATGCAAGGATG CGAGCGTAGTGTAGTCATCA AATTACAGGCCACCACCGAT CAGTAGCAGTCATAACGAGC GCATAGCACATCGTAGCGTA TGGACCTTGAGAGTGGAGAT The disclosure provides a whale DNA sequence optimization method based on HS, where, an initial population is searched with a whale algorithm. The populations are screened based on constraints such as hairpin structure, H-measure, continuity, similarity, Hamming distance, melting temperature, GC content, and pairing, and finally the top 7 among selected sequences as the final outstanding set for outputting. The disclosure runs under the environment of WinOIntel(R)CPU2.70GHz, ARM 8.00GB, and simulation is conducted with MATLAB2018a. An experiment shows that the optimized sequence obtained in this example is better than sequences obtained from other algorithms. In this specification, any reference to the prior art is not intended to be, and is not to be construed as, any admission, implication or suggestion that the prior art forms part of the common general knowledge in Australia or anywhere else. In this specification, the word "comprising" has the non-limiting meaning of "including at least" and not a limiting meaning such as "including only". The same applies, with necessary changes made, to other grammatical forms of the word such as "comprise", "comprised", and ''comprises". Although the invention is described above by way of specific disclosure and example, it is not limited thereto but may include other embodiments and examples limited only by the claims.

Claims (3)

  1. What is claimed is: 1. A whale DNA sequence optimization method based on harmony search (HS), comprising the following specific steps: step 1: randomly generating an initial population; step 2: calculating fitness values for the current population, and taking a minimum of the sum of individual fitness values as a current optimal whale, and recording position information thereof; step 3: randomly generating a variable for a change in each iteration; step 4: determining whether the variable is less than a set value, if so, proceeding to step 5, and if not, proceeding to step 6; step 5: determining whether the modulus JAI of a coefficient vector A of the iteration is less than 1, and if it is less than 1, using the current optimal whale to update positions of the remaining whales, otherwise, randomly selecting a whale to update positions of other whales; step 6: using the current optimal whale to update positions of other whales by a spiral formula; step 7: adding HS, and expanding a search range to give a new outstanding population; step 8: evaluating all populations using constraints, deleting whales that do not meet the constraints, and using a fast non-dominated sorting approach to select the same number of whales as in the initial population; step 9: adding 1 to the number of iterations to determine whether a maximum number of iterations is reached, and if it is not reached, proceeding to step 2, otherwise, proceeding to step ; and step 10: sorting fitness values of results and outputting the results to give an optimized population.
  2. 2. The whale DNA sequence optimization method based on HS according to claim 1, wherein, the calculation formula for updating whale positions is as follows:
    Z(t + 1) =f1 ifp<.5 (1) D' -e -cos(2rcl) + X*(t) if p 0.5 wherein, A 2 - ' - a, a is reduced to 0 from 2 using a linear method, and i is any random number between [0,1]; 1 is any random number between [-1,1]; p is any random number between
    [0,1]; X*(t) represents the position of the current optimal whale; X# is defined in formula (2):
    X* (t) |AI< 1 X# = Uad JI>1(2) ~Xrana |Al21 wherein, Xrand represents an randomly-selected whale position; D is defined in formula (3): , D = | CX(t -X( -I AI < 1 (3) C- Xrand -1 |JAI 1
    D' is defined in formula (4): D' = |X- (t) - XI (4)
  3. 3. The whale DNA sequence optimization method based on HS according to claim 1, wherein a new population constraint is added, and specifically: for a sequence x and a reverse sequence y thereof, 3 consecutive bases are regarded as a unit, namely, x' = (xi, xi+1 , xi+2 ) and y' = (Yj, Yj+1Y, y+2), for V i, j E [1, n - 2], which is subject to function (5) pair(x') subcb(x',y', k) 3 fpair(x) =x' subcb(x',y',k)*3 (5)
    wherein, the function subcb () aggregates whether x' and y' are completely complementary paired; and when subcb ()= 3, it means that bases in the two units are completely paired, and then any two different bases in x' are used for position exchange.
AU2020103826A 2020-12-01 2020-12-01 Whale dna sequence optimization method based on harmony search (hs) Active AU2020103826A4 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2020103826A AU2020103826A4 (en) 2020-12-01 2020-12-01 Whale dna sequence optimization method based on harmony search (hs)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
AU2020103826A AU2020103826A4 (en) 2020-12-01 2020-12-01 Whale dna sequence optimization method based on harmony search (hs)

Publications (1)

Publication Number Publication Date
AU2020103826A4 true AU2020103826A4 (en) 2021-02-11

Family

ID=74502298

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2020103826A Active AU2020103826A4 (en) 2020-12-01 2020-12-01 Whale dna sequence optimization method based on harmony search (hs)

Country Status (1)

Country Link
AU (1) AU2020103826A4 (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113034554A (en) * 2021-02-27 2021-06-25 西北大学 Chaotic reverse learning-based whale optimized broken warrior body fragment registration method
CN113054648A (en) * 2021-05-08 2021-06-29 东莞理工学院 Improved whale algorithm-based direct-current micro-grid droop coefficient optimization method and system
CN113177954A (en) * 2021-04-28 2021-07-27 中南大学 Image processing method and device
CN113435108A (en) * 2021-06-02 2021-09-24 中国人民解放军空军工程大学 Battlefield target grouping method based on improved whale optimization algorithm
CN113627071A (en) * 2021-06-26 2021-11-09 西安科技大学 Coal-fired boiler NO based on whale algorithm optimization long-time memory networkXSoft measurement method
CN113759333A (en) * 2021-07-14 2021-12-07 中国人民解放军空军预警学院 Wind turbine multipath echo micromotion parameter estimation method based on whale optimization algorithm
CN113887691A (en) * 2021-08-24 2022-01-04 杭州电子科技大学 Whale evolution system and method for service combination problem
CN114021745A (en) * 2021-10-22 2022-02-08 三峡大学 Virtual overhaul flow optimization method for hydropower station equipment
CN114330839A (en) * 2021-12-13 2022-04-12 重庆邮电大学 Landslide displacement prediction method based on WOA-LSTM model
CN114489061A (en) * 2022-01-17 2022-05-13 湖北工业大学 Whale optimization algorithm-based smooth path planning method
CN114692345A (en) * 2022-06-02 2022-07-01 河南科技学院 Intelligent scheduling method for crane boom production
CN115080402A (en) * 2022-06-22 2022-09-20 河南工业大学 Whale optimization algorithm-based software defect positioning method and processing device
CN115828120A (en) * 2023-02-17 2023-03-21 山东科技大学 Self-adaptive identification method and system for ship traffic behavior pattern and computer equipment
CN116015967A (en) * 2023-01-06 2023-04-25 重庆邮电大学 Industrial Internet intrusion detection method based on improved whale algorithm optimization DELM
CN118013864A (en) * 2024-04-09 2024-05-10 江西泰德工程有限公司 Viscose fiber yellowing optimization method based on whale algorithm
CN114021745B (en) * 2021-10-22 2024-07-05 三峡大学 Virtual overhaul flow optimization method for hydropower station equipment

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113034554A (en) * 2021-02-27 2021-06-25 西北大学 Chaotic reverse learning-based whale optimized broken warrior body fragment registration method
CN113034554B (en) * 2021-02-27 2024-03-29 西北大学 Whale optimized broken warrior body fragment registration method based on chaos reverse learning
CN113177954A (en) * 2021-04-28 2021-07-27 中南大学 Image processing method and device
CN113054648A (en) * 2021-05-08 2021-06-29 东莞理工学院 Improved whale algorithm-based direct-current micro-grid droop coefficient optimization method and system
CN113054648B (en) * 2021-05-08 2023-11-03 东莞理工学院 Direct-current micro-grid droop coefficient optimization method and system based on improved whale algorithm
CN113435108A (en) * 2021-06-02 2021-09-24 中国人民解放军空军工程大学 Battlefield target grouping method based on improved whale optimization algorithm
CN113435108B (en) * 2021-06-02 2024-02-02 中国人民解放军空军工程大学 Battlefield target grouping method based on improved whale optimization algorithm
CN113627071B (en) * 2021-06-26 2023-06-20 西安科技大学 Coal-fired boiler NO based on whale algorithm optimization long-short-term memory network X Soft measurement method
CN113627071A (en) * 2021-06-26 2021-11-09 西安科技大学 Coal-fired boiler NO based on whale algorithm optimization long-time memory networkXSoft measurement method
CN113759333A (en) * 2021-07-14 2021-12-07 中国人民解放军空军预警学院 Wind turbine multipath echo micromotion parameter estimation method based on whale optimization algorithm
CN113759333B (en) * 2021-07-14 2024-04-02 中国人民解放军空军预警学院 Wind turbine multipath echo jiggle parameter estimation method based on whale optimization algorithm
CN113887691A (en) * 2021-08-24 2022-01-04 杭州电子科技大学 Whale evolution system and method for service combination problem
CN114021745A (en) * 2021-10-22 2022-02-08 三峡大学 Virtual overhaul flow optimization method for hydropower station equipment
CN114021745B (en) * 2021-10-22 2024-07-05 三峡大学 Virtual overhaul flow optimization method for hydropower station equipment
CN114330839A (en) * 2021-12-13 2022-04-12 重庆邮电大学 Landslide displacement prediction method based on WOA-LSTM model
CN114330839B (en) * 2021-12-13 2024-05-28 重庆邮电大学 Landslide displacement prediction method based on WOA-LSTM model
CN114489061B (en) * 2022-01-17 2023-05-16 湖北工业大学 Smooth path planning method based on whale optimization algorithm
CN114489061A (en) * 2022-01-17 2022-05-13 湖北工业大学 Whale optimization algorithm-based smooth path planning method
CN114692345A (en) * 2022-06-02 2022-07-01 河南科技学院 Intelligent scheduling method for crane boom production
CN115080402A (en) * 2022-06-22 2022-09-20 河南工业大学 Whale optimization algorithm-based software defect positioning method and processing device
CN116015967A (en) * 2023-01-06 2023-04-25 重庆邮电大学 Industrial Internet intrusion detection method based on improved whale algorithm optimization DELM
CN116015967B (en) * 2023-01-06 2024-05-31 重庆软江图灵人工智能科技有限公司 Industrial Internet intrusion detection method based on improved whale algorithm optimization DELM
CN115828120A (en) * 2023-02-17 2023-03-21 山东科技大学 Self-adaptive identification method and system for ship traffic behavior pattern and computer equipment
CN118013864A (en) * 2024-04-09 2024-05-10 江西泰德工程有限公司 Viscose fiber yellowing optimization method based on whale algorithm

Similar Documents

Publication Publication Date Title
AU2020103826A4 (en) Whale dna sequence optimization method based on harmony search (hs)
Wang et al. DMfold: a novel method to predict RNA secondary structure with pseudoknots based on deep learning and improved base pair maximization principle
Shrikumar et al. Technical note on transcription factor motif discovery from importance scores (TF-MoDISco) version 0.5. 6.5
AU2015331619A1 (en) Haplotype phasing models
JP2021509572A (en) How to Align Target Nucleic Acid Sequencing Data
CN108052696B (en) Three-value FPRM circuit area and delay optimization method using particle swarm optimization
CN113724790B (en) PiRNA-disease association relation prediction method based on convolution denoising self-encoder
CN109801681B (en) SNP (Single nucleotide polymorphism) selection method based on improved fuzzy clustering algorithm
CN110534154B (en) Whale DNA sequence optimization method based on harmony search
CN109390032B (en) Method for exploring disease-related SNP (single nucleotide polymorphism) combination in data of whole genome association analysis based on evolutionary algorithm
Zhang et al. On the application of BERT models for nanopore methylation detection
Saeed et al. Biological sequence analysis
El-Badawy et al. On the use of pseudo-EIIP mapping scheme for identifying exons locations in DNA sequences
Oluoch et al. A review on RNA secondary structure prediction algorithms
Phogat et al. Disease single nucleotide polymorphism selection using hybrid feature selection technique
US20230073973A1 (en) Deep learning based system and method for prediction of alternative polyadenylation site
Csuros Maximum-scoring segment sets
CN116343908A (en) Method, medium and device for predicting protein coding region by fusing DNA shape characteristics
Shehzadi et al. Intelligent predictor using cancer-related biologically information extraction from cancer transcriptomes
Yi et al. Learning from data-rich problems: a case study on genetic variant calling
NL2013120B1 (en) A method for finding associated positions of bases of a read on a reference genome.
Li et al. Seeding with minimized subsequence
Tsaftaris et al. DNA-based matching of digital signals
CN111833964A (en) Method for mining superior locus of Bayesian network optimized by integer linear programming
US20240153582A1 (en) Systems and methods for myopic estimation of nucleic acid binding

Legal Events

Date Code Title Description
FGI Letters patent sealed or granted (innovation patent)