KR20210010488A - 기계 학습 가능한 생물학적 중합체 어셈블리 - Google Patents

기계 학습 가능한 생물학적 중합체 어셈블리 Download PDF

Info

Publication number
KR20210010488A
KR20210010488A KR1020207035288A KR20207035288A KR20210010488A KR 20210010488 A KR20210010488 A KR 20210010488A KR 1020207035288 A KR1020207035288 A KR 1020207035288A KR 20207035288 A KR20207035288 A KR 20207035288A KR 20210010488 A KR20210010488 A KR 20210010488A
Authority
KR
South Korea
Prior art keywords
assembly
nucleotide
learning model
locations
location
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
KR1020207035288A
Other languages
English (en)
Korean (ko)
Inventor
민 둑 카오
Original Assignee
퀀텀-에스아이 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 퀀텀-에스아이 인코포레이티드 filed Critical 퀀텀-에스아이 인코포레이티드
Publication of KR20210010488A publication Critical patent/KR20210010488A/ko
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional [2D] or three-dimensional [3D] molecular structures, e.g. structural or functional relations or structure alignment
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20Sequence assembly
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/30Unsupervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Chemical & Material Sciences (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Addition Polymer Or Copolymer, Post-Treatments, Or Chemical Modifications (AREA)
KR1020207035288A 2018-05-14 2019-05-13 기계 학습 가능한 생물학적 중합체 어셈블리 Ceased KR20210010488A (ko)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201862671260P 2018-05-14 2018-05-14
US62/671,260 2018-05-14
US201862671884P 2018-05-15 2018-05-15
US62/671,884 2018-05-15
PCT/US2019/032065 WO2019222120A1 (en) 2018-05-14 2019-05-13 Machine learning enabled biological polymer assembly

Publications (1)

Publication Number Publication Date
KR20210010488A true KR20210010488A (ko) 2021-01-27

Family

ID=66669118

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020207035288A Ceased KR20210010488A (ko) 2018-05-14 2019-05-13 기계 학습 가능한 생물학적 중합체 어셈블리

Country Status (10)

Country Link
US (1) US20190348152A1 (https=)
EP (1) EP3794596A1 (https=)
JP (1) JP2021523479A (https=)
KR (1) KR20210010488A (https=)
CN (1) CN112437961A (https=)
AU (1) AU2019270961A1 (https=)
BR (1) BR112020022257A2 (https=)
CA (1) CA3098876A1 (https=)
MX (1) MX2020012278A (https=)
WO (1) WO2019222120A1 (https=)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3624068A1 (en) * 2018-09-14 2020-03-18 Covestro Deutschland AG Method for improving prediction relating to the production of a polymer-ic produc
US11664090B2 (en) * 2020-06-11 2023-05-30 Life Technologies Corporation Basecaller with dilated convolutional neural network
EP4211691A1 (en) * 2020-09-11 2023-07-19 F. Hoffmann-La Roche AG Deep-learning-based techniques for generating a consensus sequence from multiple noisy sequences
CA3214755A1 (en) * 2021-04-09 2022-10-13 Natalie CASTELLANA Method for antibody identification from protein mixtures

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010127045A2 (en) * 2009-04-29 2010-11-04 Complete Genomics, Inc. Method and system for calling variations in a sample polynucleotide sequence with respect to a reference polynucleotide sequence
EP2718862B1 (en) * 2011-06-06 2018-10-31 Koninklijke Philips N.V. Method for assembly of nucleic acid sequence data
EP3084002A4 (en) * 2013-12-16 2017-08-23 Complete Genomics, Inc. Basecaller for dna sequencing using machine learning
CA2894317C (en) * 2015-06-15 2023-08-15 Deep Genomics Incorporated Systems and methods for classifying, prioritizing and interpreting genetic variants and therapies using a deep neural network

Also Published As

Publication number Publication date
MX2020012278A (es) 2021-01-29
JP2021523479A (ja) 2021-09-02
WO2019222120A1 (en) 2019-11-21
AU2019270961A1 (en) 2020-11-19
BR112020022257A2 (pt) 2021-02-23
CN112437961A (zh) 2021-03-02
EP3794596A1 (en) 2021-03-24
US20190348152A1 (en) 2019-11-14
CA3098876A1 (en) 2019-11-21

Similar Documents

Publication Publication Date Title
KR102539188B1 (ko) 심층 컨볼루션 신경망을 트레이닝하기 위한 심층 학습-기반 기술
US11817180B2 (en) Systems and methods for analyzing nucleic acid sequences
US20200176082A1 (en) Analysis of nanopore signal using a machine-learning technique
KR20210010488A (ko) 기계 학습 가능한 생물학적 중합체 어셈블리
CN114300036A (zh) 遗传变异致病性预测方法、装置、存储介质及计算机设备
US20240120027A1 (en) Machine-learning model for refining structural variant calls
US20250069699A1 (en) Methods and Systems for Discovery of Embedded Target Genes in Biosynthetic Gene Clusters
JP2021523479A5 (https=)
US20260004878A1 (en) Method for assuming organism or host, method for obtaining model for assuming organism or host, and computer device for performing the same
EP4584789A1 (en) Pathogenicity prediction for protein mutations using amino acid score distributions
US10937523B2 (en) Methods, systems and computer readable storage media for generating accurate nucleotide sequences
Van der Borght et al. QQ-SNV: single nucleotide variant detection at low frequency by comparing the quality quantiles
Bravo et al. The TELCoMB protocol for high‐sensitivity detection of ARG‐MGE colocalizations in complex microbial communities
CN117995283B (zh) 一种单样本宏基因组聚类方法、系统、终端及存储介质
JPWO2019222120A5 (https=)
Dai Mining Molecular Mechanisms of Gene Expression Regulation Using DNA Sequence Features Based on Deep Learning
Eraslan Enriching the characterization of complex clinical and molecular phenotypes with deep learning
Shaw Prediction of Isoform Functions and Interactions with ncRNAs via Deep Learning
Hartmann Regulatory motif discovery using PWMs and the architecture of eukaryotic core promoters
WIJAYA Integrative methods for discovering generic cis-regulatory motifs

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

A201 Request for examination
E13-X000 Pre-grant limitation requested

St.27 status event code: A-2-3-E10-E13-lim-X000

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

P22-X000 Classification modified

St.27 status event code: A-2-2-P10-P22-nap-X000

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

E601 Decision to refuse application
PE0601 Decision on rejection of patent

St.27 status event code: N-2-6-B10-B15-exm-PE0601

R18 Changes to party contact information recorded

Free format text: ST27 STATUS EVENT CODE: A-3-3-R10-R18-OTH-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

R18-X000 Changes to party contact information recorded

St.27 status event code: A-3-3-R10-R18-oth-X000