CN113705585A - 基于神经网络实现的方法和系统 - Google Patents

基于神经网络实现的方法和系统 Download PDF

Info

Publication number
CN113705585A
CN113705585A CN202111113164.1A CN202111113164A CN113705585A CN 113705585 A CN113705585 A CN 113705585A CN 202111113164 A CN202111113164 A CN 202111113164A CN 113705585 A CN113705585 A CN 113705585A
Authority
CN
China
Prior art keywords
training
sequence
pair
amino acid
benign
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111113164.1A
Other languages
English (en)
Chinese (zh)
Inventor
H·高
K-H·法尔
S·雷迪帕迪格帕蒂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inmair Ltd
Illumina Inc
Original Assignee
Inmair Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US16/160,986 external-priority patent/US11315016B2/en
Priority claimed from US16/407,149 external-priority patent/US10540591B2/en
Application filed by Inmair Ltd filed Critical Inmair Ltd
Publication of CN113705585A publication Critical patent/CN113705585A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/30Detection of binding sites or motifs
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • General Physics & Mathematics (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Biotechnology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Genetics & Genomics (AREA)
  • Bioethics (AREA)
  • Multimedia (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Image Analysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Machine Translation (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
CN202111113164.1A 2018-10-15 2019-05-09 基于神经网络实现的方法和系统 Pending CN113705585A (zh)

Applications Claiming Priority (15)

Application Number Priority Date Filing Date Title
USPCT/US2018/055881 2018-10-15
US16/160903 2018-10-15
US16/160,986 US11315016B2 (en) 2017-10-16 2018-10-15 Deep convolutional neural networks for variant classification
US16/160,968 US11798650B2 (en) 2017-10-16 2018-10-15 Semi-supervised learning for training an ensemble of deep convolutional neural networks
USPCT/US2018/055878 2018-10-15
USPCT/US2018/055840 2018-10-15
US16/160968 2018-10-15
PCT/US2018/055881 WO2019079182A1 (en) 2017-10-16 2018-10-15 SEMI-SUPERVISED APPRENTICESHIP FOR THE LEARNING OF A SET OF NEURONAL NETWORKS WITH DEEP CONVOLUTION
PCT/US2018/055878 WO2019079180A1 (en) 2017-10-16 2018-10-15 NEURONAL NETWORKS WITH DEEP CONVOLUTION OF VARIANT CLASSIFICATION
PCT/US2018/055840 WO2019079166A1 (en) 2017-10-16 2018-10-15 TECHNIQUES BASED ON DEEP LEARNING LEARNING OF NEURONAL NETWORKS WITH DEEP CONVOLUTION
US16/160986 2018-10-15
US16/160,903 US10423861B2 (en) 2017-10-16 2018-10-15 Deep learning-based techniques for training deep convolutional neural networks
US16/407,149 US10540591B2 (en) 2017-10-16 2019-05-08 Deep learning-based techniques for pre-training deep convolutional neural networks
US16/407149 2019-05-08
CN201980003263.9A CN111328419B (zh) 2018-10-15 2019-05-09 基于神经网络实现的方法和系统

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201980003263.9A Division CN111328419B (zh) 2018-10-15 2019-05-09 基于神经网络实现的方法和系统

Publications (1)

Publication Number Publication Date
CN113705585A true CN113705585A (zh) 2021-11-26

Family

ID=70283180

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202111113164.1A Pending CN113705585A (zh) 2018-10-15 2019-05-09 基于神经网络实现的方法和系统
CN201980003263.9A Active CN111328419B (zh) 2018-10-15 2019-05-09 基于神经网络实现的方法和系统

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201980003263.9A Active CN111328419B (zh) 2018-10-15 2019-05-09 基于神经网络实现的方法和系统

Country Status (8)

Country Link
JP (3) JP6888123B2 (ko)
KR (1) KR102165734B1 (ko)
CN (2) CN113705585A (ko)
AU (2) AU2019272062B2 (ko)
IL (2) IL271091B (ko)
NZ (1) NZ759665A (ko)
SG (2) SG11201911777QA (ko)
WO (1) WO2020081122A1 (ko)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109543761B (zh) * 2018-11-27 2020-07-07 成都工业学院 植物适生地的分类方法及装置
KR102418073B1 (ko) * 2020-06-08 2022-07-06 고려대학교 산학협력단 인공지능 기반 비디오 투시 연하검사 자동화 분석 장치 및 방법
CN111830408B (zh) * 2020-06-23 2023-04-18 朗斯顿科技(北京)有限公司 一种基于边缘计算和深度学习的电机故障诊断系统及方法
CN112003735B (zh) * 2020-07-28 2021-11-09 四川大学 一种感知风险的深度学习驱动的极限传输容量调整方法
CN112183088B (zh) * 2020-09-28 2023-11-21 云知声智能科技股份有限公司 词语层级确定的方法、模型构建方法、装置及设备
KR102279056B1 (ko) * 2021-01-19 2021-07-19 주식회사 쓰리빌리언 지식전이를 이용한 유전자변이의 병원성 예측 시스템
CN113299345B (zh) * 2021-06-30 2024-05-07 中国人民解放军军事科学院军事医学研究院 病毒基因分类的方法、装置及电子设备
CN113539354B (zh) * 2021-07-19 2023-10-27 浙江理工大学 一种高效预测革兰氏阴性菌ⅲ型和ⅳ型效应蛋白的方法
CN113822342B (zh) * 2021-09-02 2023-05-30 湖北工业大学 一种安全图卷积网络的文献分类方法及系统
CN113836892B (zh) * 2021-09-08 2023-08-08 灵犀量子(北京)医疗科技有限公司 样本量数据提取方法、装置、电子设备及存储介质
CN113963746B (zh) * 2021-09-29 2023-09-19 西安交通大学 一种基于深度学习的基因组结构变异检测系统及方法
US20240087683A1 (en) * 2022-09-14 2024-03-14 Microsoft Technology Licensing, Llc Classification using a machine learning model trained with triplet loss
CN115662520B (zh) * 2022-10-27 2023-04-14 黑龙江金域医学检验实验室有限公司 Bcr/abl1融合基因的检测方法及相关设备
CN116153396A (zh) * 2023-04-21 2023-05-23 鲁东大学 一种基于迁移学习的非编码变异预测方法
CN117688785B (zh) * 2024-02-02 2024-04-16 东北大学 一种基于种植思想的全张量重力梯度数据反演方法

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0450060A1 (en) 1989-10-26 1991-10-09 Sri International Dna sequencing
US5641658A (en) 1994-08-03 1997-06-24 Mosaic Technologies, Inc. Method for performing amplification of nucleic acid with two primers bound to a single solid support
ATE269908T1 (de) 1997-04-01 2004-07-15 Manteia S A Methode zur sequenzierung von nukleinsäuren
AR021833A1 (es) 1998-09-30 2002-08-07 Applied Research Systems Metodos de amplificacion y secuenciacion de acido nucleico
GB0006153D0 (en) * 2000-03-14 2000-05-03 Inpharmatica Ltd Database
WO2002004680A2 (en) 2000-07-07 2002-01-17 Visigen Biotechnologies, Inc. Real-time sequence determination
US7211414B2 (en) 2000-12-01 2007-05-01 Visigen Biotechnologies, Inc. Enzymatic nucleic acid synthesis: compositions and methods for altering monomer incorporation fidelity
AR031640A1 (es) 2000-12-08 2003-09-24 Applied Research Systems Amplificacion isotermica de acidos nucleicos en un soporte solido
US7057026B2 (en) 2001-12-04 2006-06-06 Solexa Limited Labelled nucleotides
US20040002090A1 (en) 2002-03-05 2004-01-01 Pascal Mayer Methods for detecting genome-wide sequence variations associated with a phenotype
EP3795577A1 (en) 2002-08-23 2021-03-24 Illumina Cambridge Limited Modified nucleotides
JP2008513782A (ja) 2004-09-17 2008-05-01 パシフィック バイオサイエンシーズ オブ カリフォルニア, インコーポレイテッド 分子解析のための装置及び方法
GB0427236D0 (en) 2004-12-13 2005-01-12 Solexa Ltd Improved method of nucleotide detection
DK1907571T3 (en) 2005-06-15 2017-08-21 Complete Genomics Inc NUCLEIC ACID ANALYSIS USING INCIDENTAL MIXTURES OF NON-OVERLAPPING FRAGMENTS
GB0514910D0 (en) 2005-07-20 2005-08-24 Solexa Ltd Method for sequencing a polynucleotide template
US7405281B2 (en) 2005-09-29 2008-07-29 Pacific Biosciences Of California, Inc. Fluorescent nucleotide analogs and uses therefor
GB0522310D0 (en) 2005-11-01 2005-12-07 Solexa Ltd Methods of preparing libraries of template polynucleotides
EP2021503A1 (en) 2006-03-17 2009-02-11 Solexa Ltd. Isothermal methods for creating clonal single molecule arrays
EP3373174A1 (en) 2006-03-31 2018-09-12 Illumina, Inc. Systems and devices for sequence by synthesis analysis
US7754429B2 (en) 2006-10-06 2010-07-13 Illumina Cambridge Limited Method for pair-wise sequencing a plurity of target polynucleotides
AU2007309504B2 (en) 2006-10-23 2012-09-13 Pacific Biosciences Of California, Inc. Polymerase enzymes and reagents for enhanced nucleic acid sequencing
KR20110052627A (ko) * 2008-07-16 2011-05-18 다나-파버 캔서 인스티튜트 인크. 전립선암과 관련된 시그너처 및 pc결정인자 및 그의 사용 방법
RU2011117576A (ru) * 2008-10-02 2012-11-10 Конинклейке Филипс Электроникс Н.В. (Nl) Способ определения показателя достоверности для отличительных черт, полученных из клинических данных, и применение показателя достоверности для предпочтения одной отличительной черты над другой
JP5773406B2 (ja) 2010-07-28 2015-09-02 学校法人明治大学 Gpiアンカー型タンパク質の判定装置、判定方法及び判定プログラム
EP2663656B1 (en) 2011-01-13 2016-08-24 Decode Genetics EHF Genetic variants as markers for use in urinary bladder cancer risk assessment
ES2875892T3 (es) * 2013-09-20 2021-11-11 Spraying Systems Co Boquilla de pulverización para craqueo catalítico fluidizado
US20160085910A1 (en) 2014-09-18 2016-03-24 Illumina, Inc. Methods and systems for analyzing nucleic acid sequencing data
KR20180008572A (ko) * 2015-05-15 2018-01-24 파이어니어 하이 부렛드 인터내쇼날 인코포레이팃드 Cas 엔도뉴클레아제 시스템, pam 서열 및 가이드 rna 요소의 신속한 특성화
EP3311299A4 (en) * 2015-06-22 2019-02-20 Myriad Women's Health, Inc. METHODS OF PREDICTING PATHOGENICITY OF GENETIC SEQUENCE VARIANTS
CN107622182B (zh) * 2017-08-04 2020-10-09 中南大学 蛋白质局部结构特征的预测方法及系统
CN108197427B (zh) * 2018-01-02 2020-09-04 山东师范大学 基于深度卷积神经网络的蛋白质亚细胞定位方法和装置
CN108595909A (zh) * 2018-03-29 2018-09-28 山东师范大学 基于集成分类器的ta蛋白靶向预测方法

Also Published As

Publication number Publication date
SG11201911777QA (en) 2020-05-28
IL282689A (en) 2021-06-30
JP2021152907A (ja) 2021-09-30
CN111328419B (zh) 2021-10-19
JP6888123B2 (ja) 2021-06-16
JP7200294B2 (ja) 2023-01-06
WO2020081122A1 (en) 2020-04-23
NZ759665A (en) 2022-07-01
KR20200044731A (ko) 2020-04-29
AU2021269351B2 (en) 2023-12-14
IL271091B (en) 2021-05-31
JP2021501923A (ja) 2021-01-21
AU2019272062A1 (en) 2020-04-30
IL271091A (en) 2020-04-30
CN111328419A (zh) 2020-06-23
JP2023052011A (ja) 2023-04-11
SG10202108013QA (en) 2021-09-29
KR102165734B1 (ko) 2020-10-14
AU2021269351A1 (en) 2021-12-09
AU2019272062B2 (en) 2021-08-19

Similar Documents

Publication Publication Date Title
CN111328419B (zh) 基于神经网络实现的方法和系统
EP3659143B1 (en) Deep learning-based techniques for pre-training deep convolutional neural networks
US10540591B2 (en) Deep learning-based techniques for pre-training deep convolutional neural networks
JP7247253B2 (ja) 経験的バリアントスコア(evs)ベースの深層学習バリアントコーラ
WO2020014280A1 (en) DEEP LEARNING-BASED FRAMEWORK FOR IDENTIFYING SEQUENCE PATTERNS THAT CAUSE SEQUENCE-SPECIFIC ERRORS (SSEs)
CN110832597A (zh) 基于深度神经网络的变体分类器
AU2019272065C1 (en) Deep learning-based framework for identifying sequence patterns that cause sequence-specific errors (SSEs)
CA3064223A1 (en) Deep learning-based techniques for pre-training deep convolutional neural networks
NZ788839A (en) Deep learning-based techniques for pre-training deep convolutional neural networks
NZ791625A (en) Variant classifier based on deep neural networks
NZ788045A (en) Deep convolutional neural networks for variant classification

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination