BR112023019465A2 - Modelo de aprendizado de máquina para detectar uma bolha dentro de uma lâmina de amostra de nucleotídeo para sequenciamento - Google Patents

Modelo de aprendizado de máquina para detectar uma bolha dentro de uma lâmina de amostra de nucleotídeo para sequenciamento

Info

Publication number
BR112023019465A2
BR112023019465A2 BR112023019465A BR112023019465A BR112023019465A2 BR 112023019465 A2 BR112023019465 A2 BR 112023019465A2 BR 112023019465 A BR112023019465 A BR 112023019465A BR 112023019465 A BR112023019465 A BR 112023019465A BR 112023019465 A2 BR112023019465 A2 BR 112023019465A2
Authority
BR
Brazil
Prior art keywords
bubbles
sequencing
data
bubble
machine learning
Prior art date
Application number
BR112023019465A
Other languages
English (en)
Inventor
Tyler Westerberg Brandon
Derek Parnaby Gavin
Junqi Yuan
David Hahm Mark
Ezra Langlois Robert
Thomas Gros
Original Assignee
Illumina Software Inc
Illumina Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Illumina Software Inc, Illumina Inc filed Critical Illumina Software Inc
Publication of BR112023019465A2 publication Critical patent/BR112023019465A2/pt

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Chemical & Material Sciences (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Organic Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Analytical Chemistry (AREA)
  • Databases & Information Systems (AREA)
  • Public Health (AREA)
  • Epidemiology (AREA)
  • Bioethics (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biochemistry (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

trata-se de métodos, sistemas e mídia não transitória legível por computador para detectar com precisão e eficiência quando bolhas impactam as rodadas de sequenciamento de ácido nucleico com base em dados capturados durante (ou derivados de) chamadas de base durante rodadas de sequenciamento. em particular, em uma ou mais modalidades, os sistemas revelados recebem dados que identificam chamadas de nucleobase e dados que identificam métricas de qualidade para as chamadas de nucleobase durante ciclos de sequenciamento. com base em chamadas específicas de nucleobase e marcadores de limite para as métricas de qualidade, o sistema revelado utiliza um modelo de aprendizado de máquina para detectar a presença de uma bolha em uma lâmina de amostra de nucleotídeo. além de simplesmente detectar a presença de uma bolha, o sistema revelado pode também classificar diferentes bolhas detectadas, como bolhas de ar, bolhas de óleo ou bolhas fantasma, ou outras saídas durante o sequenciamento. utilizando dados de chamada e métricas de qualidade, o sistema da revelação pode usar dados de sequenciamento prontamente disponíveis em uma abordagem independente de plataforma para detectar bolhas usando um modelo de aprendizado de máquina treinado exclusivamente.
BR112023019465A 2021-04-02 2022-03-23 Modelo de aprendizado de máquina para detectar uma bolha dentro de uma lâmina de amostra de nucleotídeo para sequenciamento BR112023019465A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163170072P 2021-04-02 2021-04-02
PCT/US2022/071297 WO2022213027A1 (en) 2021-04-02 2022-03-23 Machine-learning model for detecting a bubble within a nucleotide-sample slide for sequencing

Publications (1)

Publication Number Publication Date
BR112023019465A2 true BR112023019465A2 (pt) 2023-12-05

Family

ID=81308122

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023019465A BR112023019465A2 (pt) 2021-04-02 2022-03-23 Modelo de aprendizado de máquina para detectar uma bolha dentro de uma lâmina de amostra de nucleotídeo para sequenciamento

Country Status (10)

Country Link
US (1) US20220319641A1 (pt)
EP (1) EP4315342A1 (pt)
JP (1) JP2024512651A (pt)
KR (1) KR20230167028A (pt)
CN (1) CN117043867A (pt)
BR (1) BR112023019465A2 (pt)
CA (1) CA3214148A1 (pt)
IL (1) IL307378A (pt)
MX (1) MX2023011659A (pt)
WO (1) WO2022213027A1 (pt)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11520844B2 (en) * 2021-04-13 2022-12-06 Casepoint, Llc Continuous learning, prediction, and ranking of relevancy or non-relevancy of discovery documents using a caseassist active learning and dynamic document review workflow

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0450060A1 (en) 1989-10-26 1991-10-09 Sri International Dna sequencing
US5846719A (en) 1994-10-13 1998-12-08 Lynx Therapeutics, Inc. Oligonucleotide tags for sorting and identification
US5750341A (en) 1995-04-17 1998-05-12 Lynx Therapeutics, Inc. DNA sequencing by parallel oligonucleotide extensions
GB9620209D0 (en) 1996-09-27 1996-11-13 Cemu Bioteknik Ab Method of sequencing DNA
GB9626815D0 (en) 1996-12-23 1997-02-12 Cemu Bioteknik Ab Method of sequencing DNA
EP3034626A1 (en) 1997-04-01 2016-06-22 Illumina Cambridge Limited Method of nucleic acid sequencing
US6969488B2 (en) 1998-05-22 2005-11-29 Solexa, Inc. System and apparatus for sequential processing of analytes
US6274320B1 (en) 1999-09-16 2001-08-14 Curagen Corporation Method of sequencing a nucleic acid
US7001792B2 (en) 2000-04-24 2006-02-21 Eagle Research & Development, Llc Ultra-fast nucleic acid sequencing device and a method for making and using the same
AU2001282881B2 (en) 2000-07-07 2007-06-14 Visigen Biotechnologies, Inc. Real-time sequence determination
EP1354064A2 (en) 2000-12-01 2003-10-22 Visigen Biotechnologies, Inc. Enzymatic nucleic acid synthesis: compositions and methods for altering monomer incorporation fidelity
US7057026B2 (en) 2001-12-04 2006-06-06 Solexa Limited Labelled nucleotides
EP2607369B1 (en) 2002-08-23 2015-09-23 Illumina Cambridge Limited Modified nucleotides for polynucleotide sequencing
GB0321306D0 (en) 2003-09-11 2003-10-15 Solexa Ltd Modified polymerases for improved incorporation of nucleotide analogues
EP3175914A1 (en) 2004-01-07 2017-06-07 Illumina Cambridge Limited Improvements in or relating to molecular arrays
US7315019B2 (en) 2004-09-17 2008-01-01 Pacific Biosciences Of California, Inc. Arrays of optical confinements and uses thereof
WO2006064199A1 (en) 2004-12-13 2006-06-22 Solexa Limited Improved method of nucleotide detection
US8623628B2 (en) 2005-05-10 2014-01-07 Illumina, Inc. Polymerases
GB0514936D0 (en) 2005-07-20 2005-08-24 Solexa Ltd Preparation of templates for nucleic acid sequencing
US7405281B2 (en) 2005-09-29 2008-07-29 Pacific Biosciences Of California, Inc. Fluorescent nucleotide analogs and uses therefor
EP3722409A1 (en) 2006-03-31 2020-10-14 Illumina, Inc. Systems and devices for sequence by synthesis analysis
WO2008051530A2 (en) 2006-10-23 2008-05-02 Pacific Biosciences Of California, Inc. Polymerase enzymes and reagents for enhanced nucleic acid sequencing
GB2457851B (en) 2006-12-14 2011-01-05 Ion Torrent Systems Inc Methods and apparatus for measuring analytes using large scale fet arrays
US8349167B2 (en) 2006-12-14 2013-01-08 Life Technologies Corporation Methods and apparatus for detecting molecular interactions using FET arrays
US8262900B2 (en) 2006-12-14 2012-09-11 Life Technologies Corporation Methods and apparatus for measuring analytes using large scale FET arrays
WO2008092150A1 (en) * 2007-01-26 2008-07-31 Illumina, Inc. Nucleic acid sequencing system and method
WO2010039553A1 (en) 2008-10-03 2010-04-08 Illumina, Inc. Method and system for determining the accuracy of dna base identifications
US20100137143A1 (en) 2008-10-22 2010-06-03 Ion Torrent Systems Incorporated Methods and apparatus for measuring analytes
US8951781B2 (en) 2011-01-10 2015-02-10 Illumina, Inc. Systems, methods, and apparatuses to image a sample for biological or chemical analysis
CA2859660C (en) 2011-09-23 2021-02-09 Illumina, Inc. Methods and compositions for nucleic acid sequencing
BR112014024789B1 (pt) 2012-04-03 2021-05-25 Illumina, Inc aparelho de detecção e método para formação de imagem de um substrato
EP3844477A4 (en) * 2018-08-28 2023-01-04 Essenlix Corporation IMPROVING THE ACCURACY OF A DOSAGE
WO2020206464A1 (en) * 2019-04-05 2020-10-08 Essenlix Corporation Assay accuracy and reliability improvement

Also Published As

Publication number Publication date
CN117043867A (zh) 2023-11-10
JP2024512651A (ja) 2024-03-19
EP4315342A1 (en) 2024-02-07
US20220319641A1 (en) 2022-10-06
IL307378A (en) 2023-11-01
CA3214148A1 (en) 2022-10-06
WO2022213027A1 (en) 2022-10-06
MX2023011659A (es) 2023-10-11
KR20230167028A (ko) 2023-12-07

Similar Documents

Publication Publication Date Title
BR112023019465A2 (pt) Modelo de aprendizado de máquina para detectar uma bolha dentro de uma lâmina de amostra de nucleotídeo para sequenciamento
BR112017020363A2 (pt) método para determinar a presença de uma variante em um ou mais genes em um indivíduo, sistema, e, meio legível por computador não transitório
PH12021550223A1 (en) Determination Of Base Modifications Of Nucleic Acids
Lycett et al. Estimating reassortment rates in co-circulating Eurasian swine influenza viruses
HRP20191108T1 (hr) Postupak dijagnosticiranja kolorektalnog raka iz uzorka ljudskog izmeta kvantitivnim pcr-om
ATE297032T1 (de) Verfahren zum adressieren der teilnehmer eines bussystems mittels identifizierungsströmen
BRPI0403907A (pt) Sistema e método para detectar uma lista em entrada de tinta
TW201612855A (en) Object detecting device, object detecting method and object detecting system
BR112017002884A2 (pt) ?análise computacional de dados biológicos por meio do uso de variedade e hiperplano?
BR112015010012A2 (pt) método; e sistema
BR112018076983A8 (pt) Sistema e método para análise secundária de dados de sequenciamento de nucleotídeos
BRPI0607576A8 (pt) Sistema e método para pesquisar uma rede ponto-a-ponto
EP3760737A3 (en) Platform for discovery and analysis of therapeutic agents
BR112018077404A2 (pt) sistema e método de marcação em grupo baseados na aprendizagem
BR102019021121A2 (pt) sistema e método para executar a detecção automatizada de defeitos
MX2017015263A (es) Sistema y metodo de verificacion de seguridad.
PH12018502144A1 (en) Playing card dealing shoe activation device
RU2017106892A (ru) Автоматизированные модернизации участников арендуемой среды для услуг с несколькими участниками арендуемой среды
ATE489673T1 (de) Verfahren und vorrichtung zur eingabe von texten
WO2019035691A3 (ko) 유전자 정보에 기초하여 서비스를 제공하기 위한 방법, 시스템 및 비일시성의 컴퓨터 판독 가능 기록 매체
BR102017028319A8 (pt) Método e sistema para identificar uma parte de um veículo e sistema de inspeção de veículos
SE1750356A1 (sv) Systems and methods for identifying individual animals in a group of animals
BR112016003057A2 (pt) composições e métodos para análise multimodal de ácidos nucleicos cmet
BR112022016397A2 (pt) Método, sistema e programas de computador para rastreabilidade de espécimes vivos
CO2018012378A2 (es) Método, sistema de comunicación y programa informático para proporcionar información indicativa de la concentración de alérgenos en el medio ambiente