BR112023021266A2 - MULTI-CHANNEL PROTEIN VOXELIZATION TO PREDICT VARIANT PATHOGENICITY USING DEEP CONVOLUTIONAL NEURAL NETWORKS - Google Patents

MULTI-CHANNEL PROTEIN VOXELIZATION TO PREDICT VARIANT PATHOGENICITY USING DEEP CONVOLUTIONAL NEURAL NETWORKS

Info

Publication number
BR112023021266A2
BR112023021266A2 BR112023021266A BR112023021266A BR112023021266A2 BR 112023021266 A2 BR112023021266 A2 BR 112023021266A2 BR 112023021266 A BR112023021266 A BR 112023021266A BR 112023021266 A BR112023021266 A BR 112023021266A BR 112023021266 A2 BR112023021266 A2 BR 112023021266A2
Authority
BR
Brazil
Prior art keywords
convolutional neural
dimensional
sequence
evolutionary conservation
alternative allele
Prior art date
Application number
BR112023021266A
Other languages
Portuguese (pt)
Inventor
Hong Gao
Kai-How FARH
Tobias Hamp
Original Assignee
Illumina Cambridge Ltd
Illumina Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/703,935 external-priority patent/US20220336056A1/en
Application filed by Illumina Cambridge Ltd, Illumina Inc filed Critical Illumina Cambridge Ltd
Publication of BR112023021266A2 publication Critical patent/BR112023021266A2/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Genetics & Genomics (AREA)
  • Databases & Information Systems (AREA)
  • Bioethics (AREA)
  • Public Health (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Epidemiology (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Processing (AREA)
  • Image Generation (AREA)
  • Magnetic Resonance Imaging Apparatus (AREA)

Abstract

trata-se de um sistema que inclui pelo menos um voxelizador, um codificador de alelo alternativo, um codificador de conservação evolucionária e uma rede neural convolucional. o voxelizador acessa uma estrutura tridimensional de uma sequência de aminoácidos de referência de uma proteína e encaixa uma grade tridimensional de voxels em átomos na estrutura tridimensional por aminoácidos para gerar canais de distância em aminoácidos. o codificador de alelo alternativo codifica uma sequência de alelos alternativa para cada voxel na grade tridimensional de voxels. o codificador de conservação evolucionária codifica uma sequência de conservação evolucionária para cada voxel na grade tridimensional de voxels. a rede neural convolucional aplica convoluções tridimensionais a um tensor que inclui os canais de distância em aminoácidos codificados com a sequência de alelos alternativa e as respectivas sequências de conservação evolucionárias, e determina uma patogenicidade de um nucleotídeo variante com base, pelo menos em parte, no tensor.it is a system that includes at least a voxelizer, an alternative allele encoder, an evolutionary conservation encoder and a convolutional neural network. The voxelizer accesses a three-dimensional structure of a reference amino acid sequence of a protein and fits a three-dimensional grid of voxels into atoms into the three-dimensional amino acid structure to generate distance channels in amino acids. the alternative allele encoder encodes an alternative allele sequence for each voxel in the three-dimensional grid of voxels. the evolutionary conservation encoder encodes an evolutionary conservation sequence for each voxel in the three-dimensional grid of voxels. The convolutional neural network applies three-dimensional convolutions to a tensor that includes the distance channels in amino acids encoded with the alternative allele sequence and the respective evolutionary conservation sequences, and determines a pathogenicity of a nucleotide variant based, at least in part, on the tensioner.

BR112023021266A 2021-04-15 2022-04-14 MULTI-CHANNEL PROTEIN VOXELIZATION TO PREDICT VARIANT PATHOGENICITY USING DEEP CONVOLUTIONAL NEURAL NETWORKS BR112023021266A2 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202163175495P 2021-04-15 2021-04-15
US202163175767P 2021-04-16 2021-04-16
US17/703,935 US20220336056A1 (en) 2021-04-15 2022-03-24 Multi-channel protein voxelization to predict variant pathogenicity using deep convolutional neural networks
US17/703,958 US20220336057A1 (en) 2021-04-15 2022-03-24 Efficient voxelization for deep learning
PCT/US2022/024916 WO2022221591A1 (en) 2021-04-15 2022-04-14 Multi-channel protein voxelization to predict variant pathogenicity using deep convolutional neural networks

Publications (1)

Publication Number Publication Date
BR112023021266A2 true BR112023021266A2 (en) 2023-12-12

Family

ID=81448684

Family Applications (2)

Application Number Title Priority Date Filing Date
BR112023021266A BR112023021266A2 (en) 2021-04-15 2022-04-14 MULTI-CHANNEL PROTEIN VOXELIZATION TO PREDICT VARIANT PATHOGENICITY USING DEEP CONVOLUTIONAL NEURAL NETWORKS
BR112023021343A BR112023021343A2 (en) 2021-04-15 2022-04-14 EFFICIENT VOXELIZATION FOR DEEP LEARNING

Family Applications After (1)

Application Number Title Priority Date Filing Date
BR112023021343A BR112023021343A2 (en) 2021-04-15 2022-04-14 EFFICIENT VOXELIZATION FOR DEEP LEARNING

Country Status (9)

Country Link
EP (2) EP4323991A1 (en)
JP (2) JP2024514894A (en)
KR (2) KR20230170680A (en)
AU (2) AU2022258691A1 (en)
BR (2) BR112023021266A2 (en)
CA (2) CA3215514A1 (en)
IL (2) IL307661A (en)
MX (2) MX2023012226A (en)
WO (2) WO2022221593A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116153404B (en) * 2023-02-28 2023-08-15 成都信息工程大学 Single-cell ATAC-seq data analysis method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3622521A1 (en) * 2017-10-16 2020-03-18 Illumina, Inc. Deep convolutional neural networks for variant classification
EP3704640A4 (en) * 2017-10-27 2021-08-18 Apostle, Inc. Predicting cancer-related pathogenic impact of somatic mutations using deep learning-based methods
CN110245685B (en) * 2019-05-15 2022-03-25 清华大学 Method, system and storage medium for predicting pathogenicity of genome single-site variation

Also Published As

Publication number Publication date
CA3215520A1 (en) 2022-10-20
EP4323991A1 (en) 2024-02-21
JP2024514894A (en) 2024-04-03
WO2022221593A1 (en) 2022-10-20
MX2023012227A (en) 2024-01-08
MX2023012226A (en) 2024-01-08
CA3215514A1 (en) 2022-10-20
IL307667A (en) 2023-12-01
WO2022221591A1 (en) 2022-10-20
EP4323989A1 (en) 2024-02-21
JP2024513995A (en) 2024-03-27
KR20230170680A (en) 2023-12-19
KR20230170679A (en) 2023-12-19
BR112023021343A2 (en) 2023-12-19
AU2022259667A1 (en) 2023-10-26
AU2022258691A1 (en) 2023-10-26
IL307661A (en) 2023-12-01

Similar Documents

Publication Publication Date Title
BR112023021266A2 (en) MULTI-CHANNEL PROTEIN VOXELIZATION TO PREDICT VARIANT PATHOGENICITY USING DEEP CONVOLUTIONAL NEURAL NETWORKS
BR112019006757A2 (en) variable number of intramodes for video encoding
CL2016000297A1 (en) Procedure for the manufacture of a hydrophobic or superhydrophobic surface topography
BR112018070876A2 (en) system and method for beam adjustment request
BRPI0520370B8 (en) SYSTEM AND METHOD FOR MEASURING AND MAPPING A SURFACE IN RELATION TO A REFERENCE
BR112018077188A2 (en) innovative thermostable fructose-6-phosphate-3-epimerase and a method for producing alulose using it
BR112013008057A2 (en) method of identifying and limiting the base motifs of the tire tread carving
BR112015024512A2 (en) method and system for determining the spatial structure of an object
BR112014011053A2 (en) significant coding coefficient information in transform hop mode
BR112012009154A2 (en) method for improving a geological model of a subsurface region, computer program product, and method for controlling hydrocarbons in a subsurface region
BR112022002163A2 (en) Methods and systems to determine part wear using a limiting model
BR112017017498A2 (en) well curvature profiling system and method for profiling well curvature
BR112018071430A2 (en) optical coherence tomography scan compensation
BR112018012996A2 (en) ? method for modeling an underground formation, and profiling system?
BR112018069801A2 (en) layer generation time setting in additive manufacturing
BR112017021452A2 (en) process for determining the abrasion resistance of at least one wear layer disposed on a support plate
BR112019025744A8 (en) METHOD TO IMPROVE PLANT TRANSFORMATION EFFICIENCY AND METHOD TO TRANSFORM PLANT
BR112021023490A2 (en) Determining the weight of an animal based on a 3d image
BR112021022865A2 (en) Matrix and method for detecting spatial information from nucleic acids
CY1121726T1 (en) METHOD OF CALCULATING THE SURFACE SPEED OF A MINIMUM SHIP AND METHOD OF PRODUCING EACH ELEMENT OF COURSE DEVIATION AT ANY POINT ON THE SHIP'S COURSE
BR112019011365A2 (en) methods to treat cancer
BR112022001579A2 (en) Method, apparatus, and computer-readable medium for adaptive normalization of analyte levels
BR112014011629A2 (en) one piece design made of 3d fabric composite material
BR112022024817A2 (en) INFORMATION PROCESSING DEVICE AND METHOD
BR112020004536A8 (en) DYNAMIC POSITIONING CONTROL