JP2021523479A - Machine learning enabled biological polymer assembly - Google Patents

Machine learning enabled biological polymer assembly

Info

Publication number
JP2021523479A
Authority
JP
Japan
Prior art keywords
assembly
positions
nucleotide
learning model
nucleotides
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2020564123A
Other languages
English (en)
Japanese (ja)
Other versions
JP2021523479A5
Inventor
ドゥック ツァオ、ミン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Quantum Si Inc
Original Assignee
Quantum Si Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Quantum Si Inc
Publication of JP2021523479A
Publication of JP2021523479A5
Legal status: Pending (current)

Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00 ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/20 Sequence assembly
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00 ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00 ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20 Supervised data analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00 ICT specially adapted for analysing two-dimensional [2D] or three-dimensional [3D] molecular structures, e.g. structural or functional relations or structure alignment
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00 ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00 ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10 Sequence alignment; Homology search
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00 ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/30 Unsupervised data analysis
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00 ICT programming tools or database systems specially adapted for bioinformatics

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Chemical & Material Sciences (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Addition Polymer Or Copolymer, Post-Treatments, Or Chemical Modifications (AREA)
JP2020564123A 2018-05-14 2019-05-13 Machine learning enabled biological polymer assembly Pending JP2021523479A (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201862671260P 2018-05-14 2018-05-14
US62/671,260 2018-05-14
US201862671884P 2018-05-15 2018-05-15
US62/671,884 2018-05-15
PCT/US2019/032065 WO2019222120A1 (en) 2018-05-14 2019-05-13 Machine learning enabled biological polymer assembly

Publications (2)

Publication Number Publication Date
JP2021523479A (ja) 2021-09-02
JP2021523479A5 2022-05-26

Family

ID=66669118

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2020564123A Pending JP2021523479A (ja) 2018-05-14 2019-05-13 Machine learning enabled biological polymer assembly

Country Status (10)

Country Link
US (1) US20190348152A1
EP (1) EP3794596A1
JP (1) JP2021523479A
KR (1) KR20210010488A
CN (1) CN112437961A
AU (1) AU2019270961A1
BR (1) BR112020022257A2
CA (1) CA3098876A1
MX (1) MX2020012278A
WO (1) WO2019222120A1

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3624068A1 (en) * 2018-09-14 2020-03-18 Covestro Deutschland AG Method for improving prediction relating to the production of a polymeric product
US11664090B2 (en) * 2020-06-11 2023-05-30 Life Technologies Corporation Basecaller with dilated convolutional neural network
EP4211691A1 (en) * 2020-09-11 2023-07-19 F. Hoffmann-La Roche AG Deep-learning-based techniques for generating a consensus sequence from multiple noisy sequences
WO2022216795A1 (en) * 2021-04-09 2022-10-13 Abterra Biosciences, Inc. Method for antibody identification from protein mixtures

Citations (1)

Publication number Priority date Publication date Assignee Title
US20150169824A1 (en) * 2013-12-16 2015-06-18 Complete Genomics, Inc. Basecaller for dna sequencing using machine learning

Family Cites Families (3)

Publication number Priority date Publication date Assignee Title
EP2511843B1 (en) * 2009-04-29 2016-12-21 Complete Genomics, Inc. Method and system for calling variations in a sample polynucleotide sequence with respect to a reference polynucleotide sequence
EP2718862B1 (en) * 2011-06-06 2018-10-31 Koninklijke Philips N.V. Method for assembly of nucleic acid sequence data
CA2894317C (en) * 2015-06-15 2023-08-15 Deep Genomics Incorporated Systems and methods for classifying, prioritizing and interpreting genetic variants and therapies using a deep neural network

Non-Patent Citations (1)

Title
NICHOLAS J. LOMAN ET AL.: "A complete bacterial genome assembled de novo using only nanopore sequencing data", BIORXIV [ONLINE], JPN6023031396, 2015, pages 1 - 21, ISSN: 0005119110 *

Also Published As

Publication number Publication date
KR20210010488A (ko) 2021-01-27
WO2019222120A1 (en) 2019-11-21
CN112437961A (zh) 2021-03-02
CA3098876A1 (en) 2019-11-21
EP3794596A1 (en) 2021-03-24
BR112020022257A2 (pt) 2021-02-23
AU2019270961A1 (en) 2020-11-19
MX2020012278A (es) 2021-01-29
US20190348152A1 (en) 2019-11-14

Similar Documents

Publication Publication Date Title
US12264360B2 (en) Analysis of nanopore signal using a machine-learning technique
US11817180B2 (en) Systems and methods for analyzing nucleic acid sequences
KR102416048B1 (ko) Deep convolutional neural networks for variant classification
JP2021523479A (ja) Machine learning enabled biological polymer assembly
Gross et al. CONTRAST: a discriminative, phylogeny-free approach to multiple informant de novo gene prediction
CN112837747A (zh) Protein binding site prediction method based on an attention Siamese network
WO2023197718A9 (zh) Method for predicting circular RNA IRES
Gao et al. RicENN: prediction of rice enhancers with neural network based on DNA sequences
AU2022383192A1 (en) Methods and systems for discovery of embedded target genes in biosynthetic gene clusters
Balvert et al. Ogre: overlap graph-based metagenomic read clustering
US10971249B2 (en) Systems and methods for off-target sequence detection
CN103793625A (zh) Base sequence alignment system and method
JP2021523479A5
CN119183596A (zh) Method of deep artificial neural networks for signal error correction
US20250253012A1 (en) Error Correction of Nucleic Acid Sequencing Reads
Mechelke et al. A probabilistic model for secondary structure prediction from protein chemical shifts
NL2013120B1 (en) A method for finding associated positions of bases of a read on a reference genome.
US10937523B2 (en) Methods, systems and computer readable storage media for generating accurate nucleotide sequences
Grassi et al. A functional strategy to characterize expression Quantitative Trait Loci
KR20210109207A (ko) Method and apparatus for gene selection
Fujimoto et al. Learning the language of genes: representing global codon bias with deep language models
John et al. Tools for sequence assembly and annotation
Zhao et al. Identifying TF Binding Motifs from a Partial Set of Target Genes and its Application to Regulatory Network Inference
Guo et al. The prediction of human genes in DNA based on a generalized hidden Markov model
JP2013094149A (ja) DNA sequence decoding system, DNA sequence decoding method, and program

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220513

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20220513

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220516

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230801

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20231031

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240201

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20240514

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240902

A911 Transfer to examiner for re-examination before appeal (zenchi)

Free format text: JAPANESE INTERMEDIATE CODE: A911

Effective date: 20240909

A912 Re-examination (zenchi) completed and case transferred to appeal board

Free format text: JAPANESE INTERMEDIATE CODE: A912

Effective date: 20241108