CN114512185A - Donkey population natural selection classification system for variant data dimension reduction input - Google Patents
Donkey population natural selection classification system for variant data dimension reduction input Download PDFInfo
- Publication number
- CN114512185A CN114512185A CN202210038022.1A CN202210038022A CN114512185A CN 114512185 A CN114512185 A CN 114512185A CN 202210038022 A CN202210038022 A CN 202210038022A CN 114512185 A CN114512185 A CN 114512185A
- Authority
- CN
- China
- Prior art keywords
- donkey
- data
- genome sequence
- sequence data
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 241000283074 Equus asinus Species 0.000 title claims abstract description 118
- 230000009467 reduction Effects 0.000 title claims abstract description 11
- 238000012545 processing Methods 0.000 claims abstract description 27
- 230000004927 fusion Effects 0.000 claims abstract description 26
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 17
- 238000000034 method Methods 0.000 claims abstract description 12
- 238000007781 pre-processing Methods 0.000 claims abstract description 10
- 238000013145 classification model Methods 0.000 claims abstract description 4
- 238000010276 construction Methods 0.000 claims abstract description 4
- 230000035772 mutation Effects 0.000 claims description 26
- 239000011159 matrix material Substances 0.000 claims description 19
- 238000012549 training Methods 0.000 claims description 13
- 230000006870 function Effects 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 9
- 238000012360 testing method Methods 0.000 claims description 9
- 230000004913 activation Effects 0.000 claims description 6
- 238000004364 calculation method Methods 0.000 claims description 6
- 239000012634 fragment Substances 0.000 claims description 6
- 238000012512 characterization method Methods 0.000 claims description 3
- 238000004140 cleaning Methods 0.000 claims description 3
- 238000007418 data mining Methods 0.000 abstract description 2
- 238000013528 artificial neural network Methods 0.000 description 6
- 238000009395 breeding Methods 0.000 description 4
- 230000001488 breeding effect Effects 0.000 description 4
- 238000012271 agricultural production Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 241000251468 Actinopterygii Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/20—Supervised data analysis
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biophysics (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- General Engineering & Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- Biotechnology (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Epidemiology (AREA)
- Genetics & Genomics (AREA)
- Public Health (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biomedical Technology (AREA)
- Bioethics (AREA)
- Computing Systems (AREA)
- Analytical Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention belongs to the field of biological information data mining, and particularly discloses a donkey population natural selection classification system for variable data dimension reduction input. The system comprises: the donkey genome sequence data processing module comprises an input module, a donkey genome sequence data processing module and a classification module; the input module is used for acquiring donkey genome sequence data; donkey genome sequence data processing module, including: the donkey genome sequence data preprocessing unit and the donkey genome sequence data fusion unit are used for processing donkey genome sequence data acquired by the input module and converting the donkey genome sequence data into variable locus fusion data; the classification module comprises a model construction unit and a model prediction unit, and utilizes a convolutional neural network to construct a natural selection classification model, utilizes the variation site fusion data to perform data dimension reduction, and then performs natural selection classification on the donkey population. The method has the functional advantage of analyzing the influence of natural selection on the donkey population by excavating the donkey population genome data, and has few model parameters and high accuracy.
Description
Technical Field
The invention relates to the field of biological population genome data mining, in particular to a natural selection classification system based on a neural network.
Background
Group genetics is the life science of studying the genetic characteristics and rules of biological groups. In agricultural production, the method has great economic value for pest and disease management and seed selection breeding; in medical treatment, the medicine has great contribution to the infection law of diseases; has great scientific significance for biodiversity protection and research.
At present, some systems for natural selection and classification of drosophila and fish appear at home and abroad, the systems separately process variation node position information and variation matrix data to form a multi-input network, and the variation node position information is combined through a fully-connected network, so that the problems of overlarge parameter quantity and overfitting are caused, the accuracy of a model is influenced, and the system is difficult to apply to specific production practice. In addition, there are few natural selection classification systems for donkey populations, and these systems do not solve the problem of natural selection classification of donkey populations well. The problems restrict the agricultural production, seed selection and breeding efficiency of donkey and restrict the scientific breeding of donkey.
Disclosure of Invention
The invention provides a donkey population natural selection classification system for variable data dimension reduction input, which aims at solving the problems of difficult model training and low model accuracy caused by overlarge neural network parameters acting on natural selection classification and the condition that natural selection research on donkey populations by current domestic and foreign models is little, and aims at researching the natural selection influence on donkey populations and improving the condition of model accuracy and applying the conditions to specific production practice. And (4) outputting a natural selection classification result through the variable locus fusion data and the neural network. The system adopts a CNN neural network to carry out natural selection judgment.
The technical scheme adopted by the invention is as follows:
a donkey population natural selection classification system for variant data dimension reduction input comprises: the donkey genome sequence data processing system comprises an input module, a donkey genome sequence data processing module and a classification module;
the input module is used for acquiring donkey genome sequence data;
the donkey genome sequence data processing module is connected with the input module and is used for processing the genome sequence data acquired by the input module and outputting the variation site position data and the variation matrix data of the donkey genome sequence;
the donkey genome sequence data processing module comprises: the donkey genome sequence data preprocessing unit and the donkey genome sequence data fusion unit;
the donkey genome sequence data preprocessing unit is used for dividing and cleaning donkey genome sequence data;
the donkey genome sequence data preprocessing unit comprises: the donkey genome sequence data slicer, the mutation node position calculator, the donkey genome sequence data converter and the donkey genome sequence data washer;
the donkey genome sequence data slicer is used for slicing donkey genome sequences into a plurality of equal-size fragments;
the variant node position calculator is used for calculating the relative position of the locus in the donkey gene segment in the corresponding segment;
the donkey genome sequence data converter is used for converting the divided donkey genome segment data into a 0,1 binary data matrix, namely variation matrix data, wherein 0 represents an ancestor gene, and 1 represents a variation gene;
the donkey genome sequence data cleaner is used for deleting over-short and over-long data, merging repeated locus data and carrying out OR operation on the repeated locus data to obtain a result;
the donkey genome sequence data fusion unit is used for calculating the position data of the mutation sites and the position data of the mutation matrix to generate mutation site fusion data, and the specific calculation process comprises the following steps:
Mij=Hij*posj
wherein HijRefers to the ith row and jth column data, pos, of the mutation matrixjRefers to the relative position of the jth mutation site in the fragment interval, which represents multiplication, MijRefers to the ith row and jth column data of the mutation site fusion data;
the classification module comprises: the donkey genome sequence model building unit and the donkey genome sequence model classifying unit are characterized in that the classifying module builds a donkey population natural selection classifying model by using a convolutional neural network and performs natural selection classification on donkey populations by using data output by the donkey genome sequence data processing module;
the donkey genome sequence model building unit adopts a convolutional neural network to build a natural selection model, and the model building is carried out according to the following sequence: calling an Input layer, a CNN layer and a Dropout layer to build a classification model; the CNN layer is used for vector characterization learning, the Dropout layer is used for preventing model overfitting, then the weight is adjusted according to each feature, and finally the weight and the feature value vector are multiplied and then summed and output;
the calculation process of the CNN layer is as follows:
V=conv(W,X)+b
w is the weight matrix, X is the mutation site fusion data, b is the bias,is the activation function, conv is the convolution function, V is the convolution function output, Y is the activation function output;
the donkey genome sequence model classification unit inputs the variable locus fusion data obtained by the donkey genome sequence data processing module into the model constructed by the genome sequence model construction unit, trains the model by using the training set data, and reads the test set into the trained model for natural selection classification; wherein, the natural selection prediction classification process is carried out according to the following sequence: (1) reading in the variable locus fusion data output by the donkey genome sequence data processing module, dividing the read-in data into a training set and a test set according to the ratio of 8:2, (2) coding the discrete type data by adopting a single-hot coding mode to finally obtain the vector representation of the variable locus data, (3) inputting the training set data converted into the vector representation into a model for model training, (4) reading in the test set data by utilizing the trained model, and performing natural selection classification.
Compared with the prior art, the invention has the beneficial effects that:
the donkey population natural selection classification system with the dimensionality reduction input of the variation data can analyze the natural selection classification of donkey populations, the variation site fusion data can reduce the input size so as to reduce neural network parameters, and the variation site fusion data of the donkey populations are automatically extracted and analyzed through the neural network, so that high-accuracy and reliable results are obtained, and the system has application values in the aspects of analyzing the natural selection effect received by the variation nodes of the donkey populations, agricultural production, seed selection and breeding, scientific feeding and the like.
Drawings
FIG. 1 is a donkey population natural selection classification system with variable data dimension reduction input.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
Fig. 1 shows a donkey population natural selection classification system with variable data dimension reduction input according to an embodiment of the present invention.
Referring to fig. 1, the system for natural selection and classification of donkey populations with variant data input in a dimensionality reduction manner according to the embodiment of the present invention includes: the system comprises an input module, a data processing module and a classification module;
the input module is used for acquiring donkey genome sequence data;
the donkey genome sequence data processing module is connected with the input module and is used for processing the genome sequence data acquired by the input module and outputting the mutation site position data and the mutation matrix data of the donkey genome sequence;
the donkey genome sequence data processing module comprises: the donkey genome sequence data preprocessing unit and the donkey genome sequence data fusion unit;
the donkey genome sequence data preprocessing unit is used for dividing and cleaning donkey genome sequence data;
the donkey genome sequence data preprocessing unit comprises: the donkey genome sequence data slicer, the mutation node position calculator, the donkey genome sequence data converter and the donkey genome sequence data washer;
the donkey genome sequence data slicer is used for slicing donkey genome sequences into a plurality of fragments with equal sizes;
the variant node position calculator is used for calculating the relative position of the locus in the donkey gene segment in the corresponding segment;
the donkey genome sequence data converter is used for converting the divided donkey genome segment data into a 0,1 binary data matrix, namely variation matrix data, wherein 0 represents an ancestor gene, and 1 represents a variation gene;
the donkey genome sequence data cleaner is used for deleting over-short and over-long data, merging repeated locus data and carrying out OR operation on the repeated locus data to obtain a result;
the donkey genome sequence data fusion unit is used for calculating the position data of the mutation sites and the position data of the mutation matrix to generate mutation site fusion data, and the specific calculation process comprises the following steps:
Mij=Hij*posj
wherein HijRefers to the ith row and jth column data, pos, of the mutation matrixjRefers to the relative position of the jth mutation site in the fragment interval, which represents multiplication, MijRefers to the ith row and jth column data of the mutation site fusion data;
the classification module comprises: the donkey genome sequence model building unit and the donkey genome sequence model classifying unit are characterized in that the classifying module builds a donkey population natural selection classifying model by using a convolutional neural network and performs natural selection classification on donkey populations by using data output by the donkey genome sequence data processing module;
the donkey genome sequence model building unit adopts a convolutional neural network to build a natural selection model, and the model building is carried out according to the following sequence: calling an Input layer, a CNN layer and a Dropout layer to build a classification model; the CNN layer is used for vector characterization learning, the Dropout layer is used for preventing model overfitting, then the weight is adjusted according to each feature, and finally the weight and the feature value vector are multiplied and then summed and output;
the calculation process of the CNN layer is as follows:
V=conv(W,X)+b
w is the weight matrix, X is the mutation site fusion data, b is the bias,is the activation function, conv is the convolution function, V is the convolution function output, Y is the activation function output;
the donkey genome sequence model classification unit inputs the variable locus fusion data obtained by the donkey genome sequence data processing module into the model constructed by the genome sequence model construction unit, trains the model by using the training set data, and reads the test set into the trained model for natural selection classification; wherein, the natural selection prediction classification process is carried out according to the following sequence: (1) reading in the variable locus fusion data output by the donkey genome sequence data processing module, dividing the read-in data into a training set and a test set according to the ratio of 8:2, (2) coding the discrete type data by adopting a single-hot coding mode to finally obtain the vector representation of the variable locus data, (3) inputting the training set data converted into the vector representation into a model for model training, (4) reading in the test set data by utilizing the trained model, and performing natural selection classification.
The above description is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above embodiments, and all technical solutions belonging to the idea of the present invention belong to the protection scope of the present invention. It should be noted that modifications and embellishments within the scope of the invention may occur to those skilled in the art without departing from the principle of the invention, and are considered to be within the scope of the invention.
Claims (1)
1. A donkey population natural selection classification system for variant data dimension reduction input is characterized in that the classification system comprises: the donkey genome sequence data processing system comprises an input module, a donkey genome sequence data processing module and a classification module;
the input module is used for acquiring donkey genome sequence data;
the donkey genome sequence data processing module is connected with the input module and is used for processing the genome sequence data acquired by the input module and outputting the variation site position data and the variation matrix data of the donkey genome sequence;
the donkey genome sequence data processing module comprises: the donkey genome sequence data preprocessing unit and the donkey genome sequence data fusion unit;
the donkey genome sequence data preprocessing unit is used for dividing and cleaning donkey genome sequence data;
the donkey genome sequence data preprocessing unit comprises: the donkey genome sequence data slicer, the mutation node position calculator, the donkey genome sequence data converter and the donkey genome sequence data washer;
the donkey genome sequence data slicer is used for slicing donkey genome sequences into a plurality of equal-size fragments;
the variant node position calculator is used for calculating the relative position of the locus in the donkey gene segment in the corresponding segment;
the donkey genome sequence data converter is used for converting the divided donkey genome segment data into a 0,1 binary data matrix, namely variation matrix data, wherein 0 represents an ancestor gene, and 1 represents a variation gene;
the donkey genome sequence data cleaner is used for deleting over-short and over-long data, merging repeated locus data and carrying out OR operation on the repeated locus data to obtain a result;
the donkey genome sequence data fusion unit is used for calculating the position data of the mutation sites and the position data of the mutation matrix to generate mutation site fusion data, and the specific calculation process comprises the following steps:
Mij=Hij*posj
wherein HijRefers to the ith row and jth column data, pos, of the mutation matrixjRefers to the relative position of the jth mutation site in the fragment interval, which represents multiplication, MijFinger mutation site fusionThe ith row and the jth column of data;
the classification module comprises: the donkey genome sequence model building unit and the donkey genome sequence model classifying unit are characterized in that the classifying module builds a donkey population natural selection classifying model by using a convolutional neural network and performs natural selection classification on donkey populations by using data output by the donkey genome sequence data processing module;
the donkey genome sequence model building unit adopts a convolutional neural network to build a natural selection model, and the model building is carried out according to the following sequence: calling an Input layer, a CNN layer and a Dropout layer to build a classification model; the CNN layer is used for vector characterization learning, the Dropout layer is used for preventing model overfitting, then the weight is adjusted according to each feature, and finally the weight and the feature value vector are multiplied and then summed and output;
the calculation process of the CNN layer is as follows:
V=conv(W,X)+b
w is a weight matrix, X is mutation site fusion data, b is an offset,is the activation function, conv is the convolution function, V is the convolution function output, Y is the activation function output;
the donkey genome sequence model classification unit inputs the variable locus fusion data obtained by the donkey genome sequence data processing module into the model constructed by the genome sequence model construction unit, trains the model by using the training set data, and reads the test set into the trained model for natural selection classification; wherein, the natural selection prediction classification process is carried out according to the following sequence: (1) reading in the variable locus fusion data output by the donkey genome sequence data processing module, dividing the read-in data into a training set and a test set according to the ratio of 8:2, (2) coding the discrete type data by adopting a single-hot coding mode to finally obtain the vector representation of the variable locus data, (3) inputting the training set data converted into the vector representation into a model for model training, (4) reading in the test set data by utilizing the trained model, and performing natural selection classification.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210038022.1A CN114512185B (en) | 2022-01-13 | 2022-01-13 | Donkey population natural selection classification system for variable data dimension reduction input |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210038022.1A CN114512185B (en) | 2022-01-13 | 2022-01-13 | Donkey population natural selection classification system for variable data dimension reduction input |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114512185A true CN114512185A (en) | 2022-05-17 |
CN114512185B CN114512185B (en) | 2024-04-05 |
Family
ID=81549378
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210038022.1A Active CN114512185B (en) | 2022-01-13 | 2022-01-13 | Donkey population natural selection classification system for variable data dimension reduction input |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114512185B (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108052796A (en) * | 2017-12-26 | 2018-05-18 | 云南大学 | Global human mtDNA development tree classification querying methods based on integrated study |
CN108509860A (en) * | 2018-03-09 | 2018-09-07 | 西安电子科技大学 | HOh Xil Tibetan antelope detection method based on convolutional neural networks |
CN110111901A (en) * | 2019-05-16 | 2019-08-09 | 湖南大学 | Transportable patient classification system based on RNN neural network |
US20190318806A1 (en) * | 2018-04-12 | 2019-10-17 | Illumina, Inc. | Variant Classifier Based on Deep Neural Networks |
US20200194098A1 (en) * | 2018-12-14 | 2020-06-18 | Merck Sharp & Dohme Corp. | Identifying biosynthetic gene clusters |
CN112182247A (en) * | 2020-10-15 | 2021-01-05 | 华中农业大学 | Genetic population map construction method and system, storage medium and electronic equipment |
CN113128685A (en) * | 2021-04-25 | 2021-07-16 | 湖南大学 | Natural selection classification and population scale change analysis system based on neural network |
WO2021211840A1 (en) * | 2020-04-15 | 2021-10-21 | Chan Zuckerberg Biohub, Inc. | Local-ancestry inference with machine learning model |
-
2022
- 2022-01-13 CN CN202210038022.1A patent/CN114512185B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108052796A (en) * | 2017-12-26 | 2018-05-18 | 云南大学 | Global human mtDNA development tree classification querying methods based on integrated study |
CN108509860A (en) * | 2018-03-09 | 2018-09-07 | 西安电子科技大学 | HOh Xil Tibetan antelope detection method based on convolutional neural networks |
US20190318806A1 (en) * | 2018-04-12 | 2019-10-17 | Illumina, Inc. | Variant Classifier Based on Deep Neural Networks |
US20200194098A1 (en) * | 2018-12-14 | 2020-06-18 | Merck Sharp & Dohme Corp. | Identifying biosynthetic gene clusters |
CN110111901A (en) * | 2019-05-16 | 2019-08-09 | 湖南大学 | Transportable patient classification system based on RNN neural network |
WO2021211840A1 (en) * | 2020-04-15 | 2021-10-21 | Chan Zuckerberg Biohub, Inc. | Local-ancestry inference with machine learning model |
CN112182247A (en) * | 2020-10-15 | 2021-01-05 | 华中农业大学 | Genetic population map construction method and system, storage medium and electronic equipment |
CN113128685A (en) * | 2021-04-25 | 2021-07-16 | 湖南大学 | Natural selection classification and population scale change analysis system based on neural network |
Non-Patent Citations (2)
Title |
---|
WANG C等: "Donkey genomes provide new insights into dom estication and selection for coat color", 《NATURE COMMUNICATIONS》, 31 December 2020 (2020-12-31) * |
施怿;李海鹏;: "群体基因组学方法:从经典统计学到有监督学习", 中国科学:生命科学, no. 04, 25 March 2019 (2019-03-25) * |
Also Published As
Publication number | Publication date |
---|---|
CN114512185B (en) | 2024-04-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11837324B2 (en) | Deep learning-based aberrant splicing detection | |
CN113519028B (en) | Methods and compositions for estimating or predicting genotypes and phenotypes | |
Akay | A study on particle swarm optimization and artificial bee colony algorithms for multilevel thresholding | |
Zhang et al. | Evolving optimal neural networks using genetic algorithms with Occam's razor | |
CN111898689B (en) | Image classification method based on neural network architecture search | |
Johnson et al. | Automating configuration of convolutional neural network hyperparameters using genetic algorithm | |
JP7522936B2 (en) | Gene phenotype prediction based on graph neural networks | |
CN114639446B (en) | Method for estimating aquatic animal genome breeding value based on MCP sparse deep neural network model | |
CN113053459A (en) | Hybrid prediction method for integrating parental phenotypes based on Bayesian model | |
CN116168766A (en) | Variety identification method, system and terminal based on ensemble learning | |
CN116340726A (en) | Energy economy big data cleaning method, system, equipment and storage medium | |
CN114512185B (en) | Donkey population natural selection classification system for variable data dimension reduction input | |
CN113128685B (en) | Natural selection classification and group scale change analysis system based on neural network | |
CN115618272A (en) | Method for automatically identifying single cell type based on depth residual error generation algorithm | |
CN109034392A (en) | The selection and system of a kind of Tilapia mossambica corss combination system | |
KR20230043071A (en) | Variant Pathogenicity Scoring and Classification and Use Thereof | |
Polushina et al. | A cross-entropy method for change-point detection in four-letter dna sequences | |
Houssein et al. | Salp swarm algorithm: modification and application | |
Whitehouse et al. | Tree sequences as a general-purpose tool for population genetic inference | |
Luh et al. | Classification of generalist or specialist life styles of predaceous phytoseiid mites using a computer genetic algorithm, information theory, and life history traits | |
Eyüpoğlu | Clustering of mitochondrial D-loop sequences using similarity matrix, PCA and K-means algorithm | |
US20240119314A1 (en) | Gene coding breeding prediction method and device based on graph clustering | |
CN118609667A (en) | Crop phenotype association regulation and control network optimization method and system | |
Jayapriya et al. | ENHANCED BIO-INSPIRED ALGORITHM FOR CONSTRUCTING PHYLOGENETIC TREE. | |
Pham et al. | Fuzzy clustering of stochastic models for molecular phylogenetics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |