AU2022259667A1 - Efficient voxelization for deep learning - Google Patents

Efficient voxelization for deep learning Download PDF

Info

Publication number
AU2022259667A1
AU2022259667A1 AU2022259667A AU2022259667A AU2022259667A1 AU 2022259667 A1 AU2022259667 A1 AU 2022259667A1 AU 2022259667 A AU2022259667 A AU 2022259667A AU 2022259667 A AU2022259667 A AU 2022259667A AU 2022259667 A1 AU2022259667 A1 AU 2022259667A1
Authority
AU
Australia
Prior art keywords
amino acid
voxel
computer
atoms
atom
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
AU2022259667A
Other languages
English (en)
Inventor
Kai-How FARH
Hong Gao
Tobias HAMP
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Illumina Cambridge Ltd
Illumina Inc
Original Assignee
Illumina Cambridge Ltd
Illumina Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/703,935 external-priority patent/US20220336056A1/en
Application filed by Illumina Cambridge Ltd, Illumina Inc filed Critical Illumina Cambridge Ltd
Publication of AU2022259667A1 publication Critical patent/AU2022259667A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Genetics & Genomics (AREA)
  • Databases & Information Systems (AREA)
  • Bioethics (AREA)
  • Public Health (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Epidemiology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Image Processing (AREA)
  • Image Generation (AREA)
  • Magnetic Resonance Imaging Apparatus (AREA)
AU2022259667A 2021-04-15 2022-04-14 Efficient voxelization for deep learning Pending AU2022259667A1 (en)

Applications Claiming Priority (9)

Application Number Priority Date Filing Date Title
US202163175495P 2021-04-15 2021-04-15
US63/175,495 2021-04-15
US202163175767P 2021-04-16 2021-04-16
US63/175,767 2021-04-16
US17/703,935 US20220336056A1 (en) 2021-04-15 2022-03-24 Multi-channel protein voxelization to predict variant pathogenicity using deep convolutional neural networks
US17/703,958 2022-03-24
US17/703,958 US20220336057A1 (en) 2021-04-15 2022-03-24 Efficient voxelization for deep learning
US17/703,935 2022-03-24
PCT/US2022/024918 WO2022221593A1 (fr) 2021-04-15 2022-04-14 Voxélisation efficace pour apprentissage en profondeur

Publications (1)

Publication Number Publication Date
AU2022259667A1 true AU2022259667A1 (en) 2023-10-26

Family

ID=81448684

Family Applications (2)

Application Number Title Priority Date Filing Date
AU2022259667A Pending AU2022259667A1 (en) 2021-04-15 2022-04-14 Efficient voxelization for deep learning
AU2022258691A Pending AU2022258691A1 (en) 2021-04-15 2022-04-14 Multi-channel protein voxelization to predict variant pathogenicity using deep convolutional neural networks

Family Applications After (1)

Application Number Title Priority Date Filing Date
AU2022258691A Pending AU2022258691A1 (en) 2021-04-15 2022-04-14 Multi-channel protein voxelization to predict variant pathogenicity using deep convolutional neural networks

Country Status (9)

Country Link
EP (2) EP4323991A1 (fr)
JP (2) JP2024513995A (fr)
KR (2) KR20230170680A (fr)
AU (2) AU2022259667A1 (fr)
BR (2) BR112023021266A2 (fr)
CA (2) CA3215520A1 (fr)
IL (2) IL307661A (fr)
MX (2) MX2023012227A (fr)
WO (2) WO2022221591A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116153404B (zh) * 2023-02-28 2023-08-15 成都信息工程大学 一种单细胞ATAC-seq数据分析方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2018350891B9 (en) * 2017-10-16 2022-05-19 Illumina, Inc. Deep learning-based techniques for training deep convolutional neural networks
WO2019084559A1 (fr) * 2017-10-27 2019-05-02 Apostle, Inc. Prédiction d'impact pathogène lié au cancer de mutations somatiques à l'aide de procédés basés sur un apprentissage profond
CN110245685B (zh) * 2019-05-15 2022-03-25 清华大学 基因组单位点变异致病性的预测方法、系统及存储介质

Also Published As

Publication number Publication date
WO2022221593A1 (fr) 2022-10-20
EP4323991A1 (fr) 2024-02-21
CA3215514A1 (fr) 2022-10-20
MX2023012226A (es) 2024-01-08
KR20230170679A (ko) 2023-12-19
CA3215520A1 (fr) 2022-10-20
AU2022258691A1 (en) 2023-10-26
BR112023021266A2 (pt) 2023-12-12
KR20230170680A (ko) 2023-12-19
BR112023021343A2 (pt) 2023-12-19
EP4323989A1 (fr) 2024-02-21
JP2024513995A (ja) 2024-03-27
IL307667A (en) 2023-12-01
WO2022221591A1 (fr) 2022-10-20
MX2023012227A (es) 2024-01-08
JP2024514894A (ja) 2024-04-03
IL307661A (en) 2023-12-01

Similar Documents

Publication Publication Date Title
US20230045003A1 (en) Deep learning-based use of protein contact maps for variant pathogenicity prediction
WO2023014912A1 (fr) Utilisation basée sur l'apprentissage de transfert de cartes de contact de protéine pour une prédiction de pathogénicité de variant
US20220336057A1 (en) Efficient voxelization for deep learning
US20230108368A1 (en) Combined and transfer learning of a variant pathogenicity predictor using gapped and non-gapped protein samples
US11515010B2 (en) Deep convolutional neural networks to predict variant pathogenicity using three-dimensional (3D) protein structures
AU2022259667A1 (en) Efficient voxelization for deep learning
WO2022221587A1 (fr) Analyse basée sur l'intelligence artificielle de structures tridimensionnelles (3d) de protéine
US20230047347A1 (en) Deep neural network-based variant pathogenicity prediction
US20230343413A1 (en) Protein structure-based protein language models
EP4413575A1 (fr) Apprentissage combiné et par transfert d'un prédicteur de pathogénicité de variants au moyen d'échantillons de protéines à brèche et sans brèche
WO2023059750A1 (fr) Apprentissage combiné et par transfert d'un prédicteur de pathogénicité de variants au moyen d'échantillons de protéines à brèche et sans brèche
JP2024538478A (ja) ギャップ付き及び非ギャップタンパク質サンプルを使用した変異体病原性予測器の複合学習及び転移学習
JP2024538475A (ja) 三次元(3d)タンパク質構造ボクセルを用いた進化的保存からの変異体病原性の予測
JP2024538477A (ja) タンパク質構造に基づくタンパク質言語モデル
CN117178327A (zh) 使用深度卷积神经网络来预测变体致病性的多通道蛋白质体素化
CN117581302A (zh) 使用有缺口和非缺口的蛋白质样品的变体致病性预测器的组合学习和迁移学习