CN113128685A - Natural selection classification and population scale change analysis system based on neural network - Google Patents

Natural selection classification and population scale change analysis system based on neural network Download PDF

Info

Publication number
CN113128685A
CN113128685A CN202110446165.1A CN202110446165A CN113128685A CN 113128685 A CN113128685 A CN 113128685A CN 202110446165 A CN202110446165 A CN 202110446165A CN 113128685 A CN113128685 A CN 113128685A
Authority
CN
China
Prior art keywords
data
classification
population
neural network
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110446165.1A
Other languages
Chinese (zh)
Other versions
CN113128685B (en
Inventor
彭绍亮
黄浩
辛彬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan University
Original Assignee
Hunan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan University filed Critical Hunan University
Priority to CN202110446165.1A priority Critical patent/CN113128685B/en
Publication of CN113128685A publication Critical patent/CN113128685A/en
Application granted granted Critical
Publication of CN113128685B publication Critical patent/CN113128685B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/126Evolutionary algorithms, e.g. genetic algorithms or genetic programming
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Physiology (AREA)
  • Genetics & Genomics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a natural selection classification and population scale change analysis system based on a neural network, which comprises the following steps: an input module for obtaining genome sequence data; the data processing module is used for processing the genome sequence data acquired by the input module and outputting summary statistics of the genome sequence data; the classification fitting module is used for constructing a population genetic classification and parameter fitting model by combining a cyclic neural network with a convolutional neural network, and performing natural selection classification and fitting population change on the population by using data output by the data processing module; the input module is connected with the data processing module, and the data processing module is connected with the classification fitting module. The invention can simultaneously analyze the scale change of the population and the natural selection classification, thereby eliminating the influence of the population scale change on the classification judgment of the natural selection, and automatically extracting and analyzing various summary statistics of the population through the neural network, thereby obtaining a result with high accuracy and reliability.

Description

Natural selection classification and population scale change analysis system based on neural network
Technical Field
The invention relates to the field of biological population genomes, in particular to a natural selection classification and population scale change analysis system based on a neural network.
Background
Group genetics is the life science of studying the genetic characteristics and rules of biological groups. In agricultural production, the method has great economic value for pest and disease management and seed selection breeding; in medical treatment, the medicine has great contribution to the infection law of diseases; has great scientific significance for biodiversity protection and research.
At present, some natural selection classification systems appear at home and abroad, but the influence of population scale change on natural selection judgment is not considered, and the population scale change may leave signals similar to natural selection on a population genome, so that the judgment of natural selection classification is influenced.
Disclosure of Invention
The invention aims to provide a natural selection classification and population scale change analysis system based on a neural network, so as to overcome the defects in the prior art.
In order to achieve the purpose, the technical scheme adopted by the invention is as follows:
a neural network based natural choice classification and population scale variation analysis system, comprising:
an input module for obtaining genome sequence data;
the data processing module is used for processing the genome sequence data acquired by the input module and outputting summary statistics of the genome sequence data;
the classification fitting module is used for constructing a population genetic classification and parameter fitting model by combining a cyclic neural network with a convolutional neural network, and performing natural selection classification and fitting population change on the population by using data output by the data processing module;
the input module is connected with the data processing module, and the data processing module is connected with the classification fitting module.
Further, the data processing module comprises,
the data preprocessing unit is used for dividing and cleaning the genome sequence data;
the data summarizing statistic generation unit is used for dividing each piece of output data of the data preprocessing unit into a set number of windows and calculating the population genetics summarizing statistic of each window;
the data preprocessing unit is connected with the data summarizing statistic generation unit.
Further, the number of the windows is 3.
Further, the statistics of genetic summary for each window population include number of loci, locus folding spectrum, length distribution among loci, length distribution of state identification region, linkage disequilibrium distribution, Tajima's D statistics.
Further, the data preprocessing unit includes,
the data slice divider is used for dividing the genome sequence into a plurality of segments with equal size;
a data position calculator for calculating the relative position of a site in a gene fragment at the fragment;
a data converter for converting the divided genome fragment data into binary data;
the data cleaner is used for deleting data with length less than a first set length and data with length greater than a second set length, combining the data of the repeated sites and carrying out OR operation on the data of the repeated sites to obtain a result, wherein the second set length is greater than the first set length;
the data slice divider, the data position calculator, the data converter and the data cleaner are sequentially connected.
Further, 0 in the binary data represents an ancestor gene and 1 represents a variant gene.
Further, the class fitting module includes,
the model building unit is used for building a natural selection classification and population change parameter fitting model by adopting a gate control cycle unit in a cyclic neural network (RNN) and combining a convolutional neural network;
the model prediction unit is used for inputting the group genetic summary statistic sequence obtained by the data processing module into the natural selection classification and group change parameter fitting model constructed by the model construction unit, training the model by using training set data, and reading the test set into the trained model to perform natural selection classification and group change parameter fitting;
the model building unit is connected with the model prediction unit.
Further, the model construction unit constructs a natural selection classification and group change parameter fitting model according to the following sequence, firstly calls an Input layer, a BilSTM layer, a CNN layer and a Dropout layer, and constructs a natural selection classification and group change parameter fitting model; the BiLSTM layer and the CNN layer are used for vector characterization learning, and the Dropout layer is used for preventing overfitting of the model; then adjusting the weight according to the correlation of each characteristic and the population genetic variable; and finally, multiplying the weight and the characteristic value vector, and summing and outputting.
Further, the calculation process of the gating cycle unit is,
ft=σ(Wf·[ht-1,xt]+bf)
it=σ(Wi·[ht-1,xt]+bi)
Figure BDA0003036996500000021
Figure BDA0003036996500000022
ot=σ(Wo·[ht-1,xt]+bo)
ht=ot*tanh(Ct);
wherein f istIs a forgetting gate, itIs a memory door, and the memory door is provided with a memory,
Figure BDA0003036996500000023
is in a temporary state, CtIs the current time state, otIs an input gate, htIs a hidden state, σ is an activation function, Wf、Wi、WC、WoAre different weight matrices, bf、bi、bC、boAre different offsets, ht-1Is a concealment of the upper layerState, xtIs the current input, '-' stands for point product, and tanh is the tangent function. (ii) a
V=conv(W,X)+b
The calculation process of the CNN layer is as follows:
Figure BDA0003036996500000031
w is the weight matrix, X is the BiLSTM layer output, b is the offset,
Figure BDA0003036996500000032
is an activation function.
Further, the natural choice prediction classification process in the model prediction unit proceeds in the following order: firstly, reading a group genetic summary statistic sequence output by a data processing module, and dividing the read data into a training set and a test set according to a set proportion; then, coding the dispersed type data by adopting a single-hot coding mode to obtain vector representation of a group genetic summary statistical sequence; secondly, inputting training set data converted into vector representation into a model for model training; and finally, reading in test set data by using the trained model, and performing natural selection classification and population scale change parameter fitting.
Compared with the prior art, the invention has the advantages that: the invention can simultaneously analyze the scale change of the population and the natural selection classification, thereby eliminating the influence of the population scale change on the natural selection classification judgment, and automatically extracting and analyzing various summary statistics of the population through the neural network, thereby obtaining a beneficial result with high accuracy and reliability.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a block diagram of a neural network-based natural choice classification and population-scale variation analysis system of the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings so that the advantages and features of the present invention can be more easily understood by those skilled in the art, and the scope of the present invention will be more clearly and clearly defined.
Referring to fig. 1, the present invention discloses a natural selection classification and population scale change analysis system based on neural network, comprising: an input module for obtaining genome sequence data; the data processing module is used for processing the genome sequence data acquired by the input module and outputting summary statistics of the genome sequence data; the classification fitting module is used for constructing a population genetic classification and parameter fitting model by combining a cyclic neural network with a convolutional neural network, and performing natural selection classification and fitting population change on the population by using data output by the data processing module; the input module is connected with the data processing module, and the data processing module is connected with the classification fitting module.
In this embodiment, the data processing module includes: the data preprocessing unit is used for dividing and cleaning the genome sequence data; the data summarizing statistic generation unit is used for dividing each piece of output data of the data preprocessing unit into a set number of windows (3 in the embodiment) and calculating the population genetics summarizing statistic of each window; the data preprocessing unit is connected with the data summarizing statistic generation unit.
Preferably, the genetic summary statistics of each window population include site number, site fold spectra, site-to-site length distribution, state signature region length distribution, linkage disequilibrium distribution, Tajima's D statistics.
In this embodiment, the data preprocessing unit includes: the data slice divider is used for dividing the genome sequence into a plurality of segments with equal size; a data position calculator for calculating the relative position of a site in a gene fragment at the fragment; a data converter for converting the divided genome fragment data into 0, 1 binary data (0 in the binary data represents an ancestor gene, 1 represents a variant gene); the data cleaner is used for deleting data with length less than a first set length and data with length greater than a second set length, combining the data of the repeated sites and carrying out OR operation on the data of the repeated sites to obtain a result, wherein the second set length is greater than the first set length; the data slice divider, the data position calculator, the data converter and the data cleaner are connected in sequence.
In this embodiment, the classification fitting module includes: the model building unit is used for building a natural selection classification and population change parameter fitting model by adopting a gate control cycle unit in a cyclic neural network (RNN) and combining a convolutional neural network; the model prediction unit is used for inputting the group genetic summary statistic sequence obtained by the data processing module into the natural selection classification and group change parameter fitting model constructed by the model construction unit, training the model by using training set data, and reading the test set into the trained model to perform natural selection classification and group change parameter fitting; the model building unit is connected with the model prediction unit.
The model construction unit constructs a natural selection classification and group change parameter fitting model according to the following sequence, firstly, an Input layer, a BilSTM layer, a CNN layer and a Dropout layer are called, and the natural selection classification and group change parameter fitting model is constructed; the BiLSTM layer and the CNN layer are used for vector characterization learning, and the Dropout layer is used for preventing overfitting of the model; then adjusting the weight according to the correlation of each characteristic and the population genetic variable; and finally, multiplying the weight and the characteristic value vector, and summing and outputting.
The calculation process of the gating cycle unit is that,
ft=σ(Wf·[ht-1,xt]+bf)
it=σ(Wi·[ht-1,xt]+bi)
Figure BDA0003036996500000041
Figure BDA0003036996500000042
ot=σ(Wo·[ht-1,xt]+bo)
ht=ot*tanh(Ct);
wherein f istIs a forgetting gate, itIs a memory door, and the memory door is provided with a memory,
Figure BDA0003036996500000043
is in a temporary state, CtIs the current time state, otIs an input gate, htIs a hidden state, σ is an activation function, Wf、Wi、WC、WoAre different weight matrices, bf、bi、bC、boAre different offsets, ht-1Is a hidden state of the previous layer, xtIs the current input, '-' stands for point product, and tanh is the tangent function. (ii) a
V=conv(W,X)+b
The calculation process of the CNN layer is as follows:
Figure BDA0003036996500000051
w is the weight matrix, X is the BiLSTM layer output, b is the offset,
Figure BDA0003036996500000052
is an activation function.
In this embodiment, the natural selection prediction classification process in the model prediction unit is performed in the following order: firstly, reading a group genetic summary statistic sequence output by a data processing module, and dividing the read data into a training set and a test set according to a set proportion; then, coding the dispersed type data by adopting a single-hot coding mode to obtain vector representation of a group genetic summary statistical sequence; secondly, inputting training set data converted into vector representation into a model for model training; and finally, reading in test set data by using the trained model, and performing natural selection classification and population scale change parameter fitting.
The invention provides a natural selection classification and group scale change analysis system based on a neural network, which utilizes the characteristic that the neural network can have multiple outputs, simultaneously outputs a group scale change parameter and a natural selection classification result, and can simultaneously analyze the group scale change and the natural selection classification, thereby eliminating the influence of the group scale change on the natural selection classification judgment, and automatically extracting and analyzing various summary statistics of the group through the neural network, thereby obtaining a beneficial result with high accuracy and reliability.
Although the embodiments of the present invention have been described with reference to the accompanying drawings, various changes or modifications may be made by the patentees within the scope of the appended claims, and within the scope of the invention, as long as they do not exceed the scope of the invention described in the claims.

Claims (10)

1. A natural selection classification and population scale change analysis system based on a neural network is characterized in that: comprises the steps of (a) preparing a mixture of a plurality of raw materials,
an input module for obtaining genome sequence data;
the data processing module is used for processing the genome sequence data acquired by the input module and outputting summary statistics of the genome sequence data;
the classification fitting module is used for constructing a population genetic classification and parameter fitting model by combining a cyclic neural network with a convolutional neural network, and performing natural selection classification and fitting population change on the population by using data output by the data processing module;
the input module is connected with the data processing module, and the data processing module is connected with the classification fitting module.
2. The neural network-based natural choice classification and population-scale variation analysis system of claim 1, wherein: the data processing module comprises a data processing module and a data processing module,
the data preprocessing unit is used for dividing and cleaning the genome sequence data;
the data summarizing statistic generation unit is used for dividing each piece of output data of the data preprocessing unit into a set number of windows and calculating the population genetics summarizing statistic of each window;
the data preprocessing unit is connected with the data summarizing statistic generation unit.
3. The neural network-based natural choice classification and population-scale variation analysis system of claim 2, wherein: the number of the windows is 3.
4. The neural network-based natural choice classification and population-scale variation analysis system of claim 2, wherein: the statistics of genetics summary of each window group comprise the number of sites, site folding frequency spectrum, length distribution among the sites, length distribution of state identification regions, linkage disequilibrium distribution and Tajima's D statistics.
5. The neural network-based natural choice classification and population-scale variation analysis system of claim 2, wherein: the data pre-processing unit comprises a data pre-processing unit,
the data slice divider is used for dividing the genome sequence into a plurality of segments with equal size;
a data position calculator for calculating the relative position of a site in a gene fragment at the fragment;
a data converter for converting the divided genome fragment data into binary data;
the data cleaner is used for deleting data with length less than a first set length and data with length greater than a second set length, combining the data of the repeated sites and carrying out OR operation on the data of the repeated sites to obtain a result, wherein the second set length is greater than the first set length;
the data slice divider, the data position calculator, the data converter and the data cleaner are sequentially connected.
6. The neural network-based natural choice classification and population-scale variation analysis system of claim 2, wherein: in the binary data, 0 represents an ancestral gene and 1 represents a mutated gene.
7. The neural network-based natural choice classification and population-scale variation analysis system of claim 1, wherein: the class-fitting module includes a class-fitting module,
the model building unit is used for building a natural selection classification and population change parameter fitting model by adopting a gate control cycle unit in a cyclic neural network (RNN) and combining a convolutional neural network;
the model prediction unit is used for inputting the group genetic summary statistic sequence obtained by the data processing module into the natural selection classification and group change parameter fitting model constructed by the model construction unit, training the model by using training set data, and reading the test set into the trained model to perform natural selection classification and group change parameter fitting;
the model building unit is connected with the model prediction unit.
8. The neural network-based natural choice classification and population-scale variation analysis system of claim 7, wherein: the model construction unit constructs a natural selection classification and group change parameter fitting model according to the following sequence, firstly calls an Input layer, a BilSTM layer, a CNN layer and a Dropout layer, and constructs the natural selection classification and group change parameter fitting model; the BiLSTM layer and the CNN layer are used for vector characterization learning, and the Dropout layer is used for preventing overfitting of the model; then adjusting the weight according to the correlation of each characteristic and the population genetic variable; and finally, multiplying the weight and the characteristic value vector, and summing and outputting.
9. The neural network-based natural choice classification and population-scale variation analysis system of claim 8, wherein: the calculation process of the gating cycle unit is that,
ft=σ(Wf·[ht-1,xt]+bf)
it=σ(Wi·[ht-1,xt]+bi)
Figure FDA0003036996490000021
Figure FDA0003036996490000022
ot=σ(Wo·[ht-1,xt]+bo)
ht=ot*tanh(Ct);
wherein f istIs a forgetting gate, itIs a memory door, and the memory door is provided with a memory,
Figure FDA0003036996490000023
is in a temporary state, CtIs the current time state, otIs an input gate, htIs a hidden state, σ is an activation function, Wf、Wi、WC、WoAre different weight matrices, bf、bi、bC、boAre different offsets, ht-1Is a hidden state of the previous layer, xtIs the current input, '-' stands for point product, and tanh is the tangent function. (ii) a
The calculation process of the CNN layer is as follows:
Figure FDA0003036996490000024
w is the weight matrix, X is the BiLSTM layer output, b is the offset,
Figure FDA0003036996490000025
is an activation function。
10. The neural network-based natural choice classification and population-scale variation analysis system of claim 7, wherein: the natural selection prediction classification process in the model prediction unit is carried out according to the following sequence: firstly, reading a group genetic summary statistic sequence output by a data processing module, and dividing the read data into a training set and a test set according to a set proportion; then, coding the dispersed type data by adopting a single-hot coding mode to obtain vector representation of a group genetic summary statistical sequence; secondly, inputting training set data converted into vector representation into a model for model training; and finally, reading in test set data by using the trained model, and performing natural selection classification and population scale change parameter fitting.
CN202110446165.1A 2021-04-25 2021-04-25 Natural selection classification and group scale change analysis system based on neural network Active CN113128685B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110446165.1A CN113128685B (en) 2021-04-25 2021-04-25 Natural selection classification and group scale change analysis system based on neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110446165.1A CN113128685B (en) 2021-04-25 2021-04-25 Natural selection classification and group scale change analysis system based on neural network

Publications (2)

Publication Number Publication Date
CN113128685A true CN113128685A (en) 2021-07-16
CN113128685B CN113128685B (en) 2023-04-07

Family

ID=76779703

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110446165.1A Active CN113128685B (en) 2021-04-25 2021-04-25 Natural selection classification and group scale change analysis system based on neural network

Country Status (1)

Country Link
CN (1) CN113128685B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114512185A (en) * 2022-01-13 2022-05-17 湖南大学 Donkey population natural selection classification system for variant data dimension reduction input

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622535A (en) * 2012-02-27 2012-08-01 上海电机学院 Processing method and processing device based on multiple sequence alignment genetic algorithm
CN107025386A (en) * 2017-03-22 2017-08-08 杭州电子科技大学 A kind of method that gene association analysis is carried out based on deep learning algorithm
US20190180186A1 (en) * 2017-12-13 2019-06-13 Sentient Technologies (Barbados) Limited Evolutionary Architectures For Evolution of Deep Neural Networks
CN110111848A (en) * 2019-05-08 2019-08-09 南京鼓楼医院 A kind of human cyclin expressing gene recognition methods based on RNN-CNN neural network fusion algorithm
CN110832510A (en) * 2018-01-15 2020-02-21 因美纳有限公司 Variant classifier based on deep learning
CN110870019A (en) * 2017-10-16 2020-03-06 因美纳有限公司 Semi-supervised learning for training deep convolutional neural network sets
US10657447B1 (en) * 2018-11-29 2020-05-19 SparkCognition, Inc. Automated model building search space reduction
WO2021035164A1 (en) * 2019-08-22 2021-02-25 Inari Agriculture, Inc. Methods and systems for assessing genetic variants
WO2021072165A1 (en) * 2019-10-10 2021-04-15 Pioneer Hi-Bred International, Inc. Synchronized breeding and agronomic methods to improve crop plants

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622535A (en) * 2012-02-27 2012-08-01 上海电机学院 Processing method and processing device based on multiple sequence alignment genetic algorithm
CN107025386A (en) * 2017-03-22 2017-08-08 杭州电子科技大学 A kind of method that gene association analysis is carried out based on deep learning algorithm
CN110870019A (en) * 2017-10-16 2020-03-06 因美纳有限公司 Semi-supervised learning for training deep convolutional neural network sets
US20190180186A1 (en) * 2017-12-13 2019-06-13 Sentient Technologies (Barbados) Limited Evolutionary Architectures For Evolution of Deep Neural Networks
CN110832510A (en) * 2018-01-15 2020-02-21 因美纳有限公司 Variant classifier based on deep learning
US10657447B1 (en) * 2018-11-29 2020-05-19 SparkCognition, Inc. Automated model building search space reduction
CN110111848A (en) * 2019-05-08 2019-08-09 南京鼓楼医院 A kind of human cyclin expressing gene recognition methods based on RNN-CNN neural network fusion algorithm
WO2021035164A1 (en) * 2019-08-22 2021-02-25 Inari Agriculture, Inc. Methods and systems for assessing genetic variants
WO2021072165A1 (en) * 2019-10-10 2021-04-15 Pioneer Hi-Bred International, Inc. Synchronized breeding and agronomic methods to improve crop plants

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
LIN K ET AL.: "Distinguishing positive selection from neutral evolution:boosting the performance of summary statistics", 《GENETICS》 *
XIAOMING LIU: "Exploring Population Size Changes Using SNP Frequency Spectra", 《NATURE》 *
文子龙等: "群体遗传学下动物驯化研究进展", 《遗传》 *
施怪等: "群体基因组学方法:从经典统计学到有监督学习", 《中国科学:生命科学》 *
郑萍萍: "基于全基因组测序探究大熊猫种群历史及适应", 《国家科技图书文献中心》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114512185A (en) * 2022-01-13 2022-05-17 湖南大学 Donkey population natural selection classification system for variant data dimension reduction input
CN114512185B (en) * 2022-01-13 2024-04-05 湖南大学 Donkey population natural selection classification system for variable data dimension reduction input

Also Published As

Publication number Publication date
CN113128685B (en) 2023-04-07

Similar Documents

Publication Publication Date Title
Kim et al. Tree-guided group lasso for multi-response regression with structured sparsity, with an application to eQTL mapping
CN112863696A (en) Drug sensitivity prediction method and device based on transfer learning and graph neural network
CN113517066B (en) Depression assessment method and system based on candidate gene methylation sequencing and deep learning
WO2023217290A1 (en) Genophenotypic prediction based on graph neural network
CN115689008A (en) CNN-BilSTM short-term photovoltaic power prediction method and system based on ensemble empirical mode decomposition
CN113889192B (en) Single-cell RNA-seq data clustering method based on deep noise reduction self-encoder
CN113128685B (en) Natural selection classification and group scale change analysis system based on neural network
CN116798652A (en) Anticancer drug response prediction method based on multitasking learning
Hilmarsson et al. High resolution ancestry deconvolution for next generation genomic data
Oriol Sabat et al. SALAI-Net: species-agnostic local ancestry inference network
Zhang et al. SLRRSC: single-cell type recognition method based on similarity and graph regularization constraints
Schofield et al. A unified capture-recapture framework
CN113505651A (en) Mosquito identification method based on convolutional neural network
CN112992268A (en) SNP locus sequence feature extraction method
CN109671468B (en) Characteristic gene selection and cancer classification method
CN110070070B (en) Action recognition method
Hejase et al. Sia: Selection inference using the ancestral recombination graph
CN114512185B (en) Donkey population natural selection classification system for variable data dimension reduction input
Chen et al. Microarray gene expression
CN109887543B (en) Differential methylation site recognition method for low methylation level
AU2021207383B2 (en) Ancestry inference based on convolutional neural network
Comes et al. Probabilistic neural network classification for microarraydata
Babichev et al. Technique of Gene Regulatory Networks Reconstruction Based on ARACNE Inference Algorithm.
Hu Modeling the gene regulatory dynamics in neural differentiation with single cell data using a machine learning approach
CN117198426B (en) Multi-scale medicine-medicine response interpretable prediction method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant