WO2021011990A8 - An iterative regression method for genomic prediction - Google Patents

An iterative regression method for genomic prediction Download PDF

Info

Publication number
WO2021011990A8
WO2021011990A8 PCT/AU2020/000072 AU2020000072W WO2021011990A8 WO 2021011990 A8 WO2021011990 A8 WO 2021011990A8 AU 2020000072 W AU2020000072 W AU 2020000072W WO 2021011990 A8 WO2021011990 A8 WO 2021011990A8
Authority
WO
WIPO (PCT)
Prior art keywords
trait
estimate
block
starting
bayesian
Prior art date
Application number
PCT/AU2020/000072
Other languages
French (fr)
Other versions
WO2021011990A1 (en
Inventor
Edmond Joseph Breen
Hans Dieter Daetwyler
Michael Edward GODDARD
Original Assignee
Agriculture Victoria Services Pty Ltd
Dairy Australia Limited
Geoffrey Gardiner Dairy Foundation Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2019902659A external-priority patent/AU2019902659A0/en
Application filed by Agriculture Victoria Services Pty Ltd, Dairy Australia Limited, Geoffrey Gardiner Dairy Foundation Limited filed Critical Agriculture Victoria Services Pty Ltd
Priority to AU2020318571A priority Critical patent/AU2020318571A1/en
Publication of WO2021011990A1 publication Critical patent/WO2021011990A1/en
Publication of WO2021011990A8 publication Critical patent/WO2021011990A8/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/40Population genetics; Linkage disequilibrium
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6881Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for tissue or cell typing, e.g. human leukocyte antigen [HLA] probes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/124Animal traits, i.e. production traits, including athletic performance or the like
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Landscapes

  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Theoretical Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Medical Informatics (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Bioethics (AREA)
  • Ecology (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Zoology (AREA)
  • Pure & Applied Mathematics (AREA)
  • Wood Science & Technology (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Physiology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Public Health (AREA)
  • Immunology (AREA)

Abstract

A method for improving the computational efficiency of estimation of Bayesian methods, such as Bayesian MCMC models, for performing genomic analysis of SNP data to estimate breeding values, Quantitative Trait Locus (QTLs), or genomic locations associated with a disease or trait. The method extends the BayesR method by using a blocked Gibbs sampling approach to estimate SNO effects comprising breaking the SNPS into non overlapping blocks of markers and sequentially processing each block according to a block ordering. Each regression coefficient in the block is then sequentially estimated, starting from a starting index. The method significantly reduces the computational time to estimate model parameters and can be applied to both single trait and multi-trait analyses.
PCT/AU2020/000072 2019-07-25 2020-07-23 An iterative regression method for genomic prediction WO2021011990A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2020318571A AU2020318571A1 (en) 2019-07-25 2020-07-23 An iterative regression method for genomic prediction

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
AU2019902659 2019-07-25
AU2019902659A AU2019902659A0 (en) 2019-07-25 An iterative regression method for genomic prediction

Publications (2)

Publication Number Publication Date
WO2021011990A1 WO2021011990A1 (en) 2021-01-28
WO2021011990A8 true WO2021011990A8 (en) 2021-12-30

Family

ID=74192407

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/AU2020/000072 WO2021011990A1 (en) 2019-07-25 2020-07-23 An iterative regression method for genomic prediction

Country Status (2)

Country Link
AU (1) AU2020318571A1 (en)
WO (1) WO2021011990A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113555063A (en) * 2021-07-28 2021-10-26 仲恺农业工程学院 Threshold character genome breeding value estimation method based on SNP chip and application
CN113808660B (en) * 2021-09-13 2024-02-13 复旦大学附属华山医院 Natural selection and database-based hereditary rare disease prevalence Bayes calculation model, construction method and application thereof

Also Published As

Publication number Publication date
WO2021011990A1 (en) 2021-01-28
AU2020318571A1 (en) 2022-02-24

Similar Documents

Publication Publication Date Title
CN109524059B (en) Rapid and stable animal individual genome breeding value evaluation method
Therkildsen et al. Practical low‐coverage genomewide sequencing of hundreds of individually barcoded samples for population and evolutionary genomics in nonmodel species
Madsen et al. DMU-a package for analyzing multivariate mixed models in quantitative genetics and genomics
Verbyla et al. Accuracy of genomic selection using stochastic search variable selection in Australian Holstein Friesian dairy cattle
WO2021011990A8 (en) An iterative regression method for genomic prediction
Nadachowska-Brzyska et al. Demographic divergence history of pied flycatcher and collared flycatcher inferred from whole-genome re-sequencing data
Culumber et al. Replicated hybrid zones of Xiphophorus swordtails along an elevational gradient
Yamasaki et al. Genome-wide patterns of divergence and introgression after secondary contact between Pungitius sticklebacks
TW200636411A (en) Automated throughput control system and method of operating the same
PH12016501255A1 (en) Improved molecular breeding methods
Heller et al. Mid‐Holocene decline in African buffalos inferred from Bayesian coalescent‐based analyses of microsatellites and mitochondrial DNA
Ronen et al. Predicting carriers of ongoing selective sweeps without knowledge of the favored allele
Masuda et al. 331 Efficient quality control methods for genomic and pedigree data used in routine genomic evaluation
Rodríguez-Ramilo et al. Artificial selection with traditional or genomic relationships: consequences in coancestry and genetic diversity
US20160283655A1 (en) Computation pipeline of location-dependent variant calls
US10424395B2 (en) Computation pipeline of single-pass multiple variant calls
Kao et al. naiveBayesCall: An efficient model-based base-calling algorithm for high-throughput sequencing
WO2011146788A3 (en) Methods of assessing a risk of developing necrotizing meningoencephalitis
Coscia et al. Genetic homogeneity in the deep-sea grenadier Macrourus berglax across the North Atlantic Ocean
Chang et al. A rapid and efficient linear mixed model approach using the score test and its application to GWAS
WO2014137893A3 (en) Interactive healthcare modeling
Rodriguez et al. An accurate method for inferring relatedness in large datasets of unphased genotypes via an embedded likelihood-ratio test
Söylev et al. CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data
Wang Genetic association tests in the presence of epistasis or gene‐environment interaction
Janss Disentangling pleiotropy along the genome using sparse latent variable models

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20845017

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2020318571

Country of ref document: AU

Date of ref document: 20200723

Kind code of ref document: A

122 Ep: pct application non-entry in european phase

Ref document number: 20845017

Country of ref document: EP

Kind code of ref document: A1