KR20220019778A - 머신 학습을 이용한 단백질 식별을 위한 기법들 및 관련된 시스템들 및 방법들 - Google Patents
머신 학습을 이용한 단백질 식별을 위한 기법들 및 관련된 시스템들 및 방법들 Download PDFInfo
- Publication number
- KR20220019778A KR20220019778A KR1020227000689A KR20227000689A KR20220019778A KR 20220019778 A KR20220019778 A KR 20220019778A KR 1020227000689 A KR1020227000689 A KR 1020227000689A KR 20227000689 A KR20227000689 A KR 20227000689A KR 20220019778 A KR20220019778 A KR 20220019778A
- Authority
- KR
- South Korea
- Prior art keywords
- data
- learning model
- machine learning
- amino acids
- training
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/20—Sequence assembly
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
- G06N3/0442—Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
-
- G06N3/0445—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G06N3/0454—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G06N3/0481—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/088—Non-supervised learning, e.g. competitive learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/0895—Weakly supervised learning, e.g. semi-supervised or self-supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G06N7/005—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/30—Unsupervised data analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B5/00—ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Computational Linguistics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physiology (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Bioethics (AREA)
- Public Health (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Algebra (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201962860750P | 2019-06-12 | 2019-06-12 | |
| US62/860,750 | 2019-06-12 | ||
| PCT/US2020/037541 WO2020252345A1 (en) | 2019-06-12 | 2020-06-12 | Techniques for protein identification using machine learning and related systems and methods |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20220019778A true KR20220019778A (ko) | 2022-02-17 |
Family
ID=71409529
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020227000689A Withdrawn KR20220019778A (ko) | 2019-06-12 | 2020-06-12 | 머신 학습을 이용한 단백질 식별을 위한 기법들 및 관련된 시스템들 및 방법들 |
Country Status (10)
| Country | Link |
|---|---|
| US (1) | US20200395099A1 (https=) |
| EP (1) | EP3966824A1 (https=) |
| JP (1) | JP2022536343A (https=) |
| KR (1) | KR20220019778A (https=) |
| CN (1) | CN115989545A (https=) |
| AU (1) | AU2020290510A1 (https=) |
| BR (1) | BR112021024915A2 (https=) |
| CA (1) | CA3142888A1 (https=) |
| MX (1) | MX2021015347A (https=) |
| WO (1) | WO2020252345A1 (https=) |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| BR112021008098A2 (pt) | 2018-11-15 | 2021-08-10 | Quantum-Si Incorporated | métodos e composições para o sequenciamento de proteínas |
| US11126890B2 (en) * | 2019-04-18 | 2021-09-21 | Adobe Inc. | Robust training of large-scale object detectors with a noisy dataset |
| CN114929897A (zh) * | 2019-10-28 | 2022-08-19 | 宽腾矽公司 | 制备用于多肽测序的富集样品的方法 |
| JP7592737B2 (ja) | 2020-03-06 | 2024-12-02 | ボストンジーン コーポレイション | 多重免疫蛍光イメージングを使用する組織特性の決定 |
| WO2021236983A2 (en) | 2020-05-20 | 2021-11-25 | Quantum-Si Incorporated | Methods and compositions for protein sequencing |
| CA3227592A1 (en) * | 2021-09-22 | 2023-03-30 | Gregory KAPP | Methods and systems for determining polypeptide interactions |
| CN114118366B (zh) * | 2021-11-15 | 2025-04-22 | 国网浙江省电力有限公司电力科学研究院 | 基于长短期记忆神经网络的生命体触电电流检测方法 |
| CN114093415B (zh) * | 2021-11-19 | 2022-06-03 | 中国科学院数学与系统科学研究院 | 肽段可检测性预测方法及系统 |
| CN114456926A (zh) * | 2022-03-15 | 2022-05-10 | 常州市环境科学研究院 | 一种可预测生长趋势的浮游植物自动培养装置及方法 |
| US12587274B2 (en) | 2023-03-28 | 2026-03-24 | Quantum Generative Materials Llc | Satellite optimization management system based on natural language input and artificial intelligence |
| WO2025057424A1 (ja) | 2023-09-15 | 2025-03-20 | 富士通株式会社 | 情報処理プログラム,情報処理装置および情報処理方法 |
| WO2025128525A1 (en) * | 2023-12-11 | 2025-06-19 | Research Development Foundation | System and method for predicting microproteins |
| WO2025123211A1 (zh) * | 2023-12-12 | 2025-06-19 | 深圳华大生命科学研究院 | 一种多肽分类器的构建与应用 |
| WO2025123212A1 (zh) * | 2023-12-12 | 2025-06-19 | 深圳华大生命科学研究院 | 一种基于目标检测模型的多肽信号提取方法 |
| US12368503B2 (en) | 2023-12-27 | 2025-07-22 | Quantum Generative Materials Llc | Intent-based satellite transmit management based on preexisting historical location and machine learning |
| US12603701B2 (en) | 2023-12-27 | 2026-04-14 | Quantum Generative Materials Llc | Distributed satellite constellation management and control system |
| CN117744748B (zh) * | 2024-02-20 | 2024-04-30 | 北京普译生物科技有限公司 | 一种神经网络模型训练、碱基识别方法及装置、电子设备 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050119454A1 (en) * | 2000-01-24 | 2005-06-02 | The Cielo Institute, Inc. | Algorithmic design of peptides for binding and/or modulation of the functions of receptors and/or other proteins |
| CA2466792A1 (en) * | 2003-05-16 | 2004-11-16 | Affinium Pharmaceuticals, Inc. | Evaluation of spectra |
| EP2389585A2 (en) * | 2009-01-22 | 2011-11-30 | Li-Cor, Inc. | Single molecule proteomics with dynamic probes |
| US20120015825A1 (en) * | 2010-07-06 | 2012-01-19 | Pacific Biosciences Of California, Inc. | Analytical systems and methods with software mask |
| WO2016069124A1 (en) * | 2014-09-15 | 2016-05-06 | Board Of Regents, The University Of Texas System | Improved single molecule peptide sequencing |
| EP4414988A3 (en) * | 2013-01-31 | 2024-11-06 | Codexis, Inc. | Methods, systems, and software for identifying bio-molecules using models of multiplicative form |
| US9212996B2 (en) * | 2013-08-05 | 2015-12-15 | Tellspec, Inc. | Analyzing and correlating spectra, identifying samples and their ingredients, and displaying related personalized information |
| BR112016006284B1 (pt) * | 2013-09-27 | 2022-07-26 | Codexis, Inc | Método implementado por computador, produto de programa de computador, e, sistema de computador |
| MX384725B (es) * | 2014-08-08 | 2025-03-14 | Quantum Si Inc | Dispositivo integrado con fuente de luz externa para el sondeo, detección y análisis de moléculas. |
| WO2017214320A1 (en) * | 2016-06-07 | 2017-12-14 | Edico Genome, Corp. | Bioinformatics systems, apparatus, and methods for performing secondary and/or tertiary processing |
| EP3568782A1 (en) * | 2017-01-13 | 2019-11-20 | Massachusetts Institute Of Technology | Machine learning based antibody design |
| EA201992476A1 (ru) * | 2017-04-18 | 2020-02-25 | Икс-Чем, Инк. | Способы идентификации соединений |
| US11573239B2 (en) * | 2017-07-17 | 2023-02-07 | Bioinformatics Solutions Inc. | Methods and systems for de novo peptide sequencing using deep learning |
| US11587644B2 (en) * | 2017-07-28 | 2023-02-21 | The Translational Genomics Research Institute | Methods of profiling mass spectral data using neural networks |
| WO2019152943A1 (en) * | 2018-02-02 | 2019-08-08 | Arizona Board Of Regents, For And On Behalf Of, Arizona State University | Methods, systems, and media for predicting functions of molecular sequences |
| KR102885910B1 (ko) * | 2018-02-17 | 2025-11-13 | 리제너론 파마슈티칼스 인코포레이티드 | Mhc 펩티드 결합 예측을 위한 gan-cnn |
| US20210151123A1 (en) * | 2018-03-08 | 2021-05-20 | Jungla Inc. | Interpretation of Genetic and Genomic Variants via an Integrated Computational and Experimental Deep Mutational Learning Framework |
| US20210239705A1 (en) * | 2018-06-06 | 2021-08-05 | Nautilus Biotechnology, Inc. | Methods and applications of protein identification |
| BR112021008098A2 (pt) * | 2018-11-15 | 2021-08-10 | Quantum-Si Incorporated | métodos e composições para o sequenciamento de proteínas |
-
2020
- 2020-06-12 KR KR1020227000689A patent/KR20220019778A/ko not_active Withdrawn
- 2020-06-12 MX MX2021015347A patent/MX2021015347A/es unknown
- 2020-06-12 WO PCT/US2020/037541 patent/WO2020252345A1/en not_active Ceased
- 2020-06-12 CN CN202080057353.9A patent/CN115989545A/zh active Pending
- 2020-06-12 JP JP2021573337A patent/JP2022536343A/ja active Pending
- 2020-06-12 AU AU2020290510A patent/AU2020290510A1/en not_active Abandoned
- 2020-06-12 BR BR112021024915A patent/BR112021024915A2/pt not_active Application Discontinuation
- 2020-06-12 EP EP20735761.7A patent/EP3966824A1/en not_active Withdrawn
- 2020-06-12 US US16/900,582 patent/US20200395099A1/en not_active Abandoned
- 2020-06-12 CA CA3142888A patent/CA3142888A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| CA3142888A1 (en) | 2020-12-17 |
| WO2020252345A9 (en) | 2022-02-10 |
| BR112021024915A2 (pt) | 2022-01-18 |
| JP2022536343A (ja) | 2022-08-15 |
| CN115989545A (zh) | 2023-04-18 |
| MX2021015347A (es) | 2022-04-06 |
| US20200395099A1 (en) | 2020-12-17 |
| WO2020252345A1 (en) | 2020-12-17 |
| AU2020290510A1 (en) | 2022-02-03 |
| EP3966824A1 (en) | 2022-03-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR20220019778A (ko) | 머신 학습을 이용한 단백질 식별을 위한 기법들 및 관련된 시스템들 및 방법들 | |
| US12469581B2 (en) | Machine learning enabled pulse and base calling for sequencing devices | |
| US20230207068A1 (en) | Methods of Profiling Mass Spectral Data Using Neural Networks | |
| US20230114905A1 (en) | Highly multiplexable analysis of proteins and proteomes | |
| Pierleoni et al. | PredGPI: a GPI-anchor predictor | |
| US20220277811A1 (en) | Detecting False Positive Variant Calls In Next-Generation Sequencing | |
| US20230360732A1 (en) | Systems and methods for assessing and improving the quality of multiplex molecular assays | |
| Smith et al. | Estimating error rates for single molecule protein sequencing experiments | |
| KR20210010488A (ko) | 기계 학습 가능한 생물학적 중합체 어셈블리 | |
| CN120808899B (zh) | 基于图因果学习的抗菌肽功能可解释预测方法及系统 | |
| US20250052649A1 (en) | Method and system for analysing a biological sample | |
| US20260088131A1 (en) | System and method for evaluating molecular properties | |
| US20240094215A1 (en) | Characterizing accessibility of macromolecule structures | |
| Wong et al. | The Next Generation of Protein Sequencing and Analysis Methods | |
| Sadat et al. | Can genome-based Large Language Models predict gene expression? | |
| Chin et al. | Optimized local protein structure with support vector machine to predict protein secondary structure | |
| Bhumika et al. | Interpretable Multi-Task Learning Framework for DNA Intelligence | |
| Fai et al. | Protein secondary structure prediction using optimal local protein structure and support vector machine |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| P22-X000 | Classification modified |
St.27 status event code: A-2-2-P10-P22-nap-X000 |
|
| A201 | Request for examination | ||
| E13-X000 | Pre-grant limitation requested |
St.27 status event code: A-2-3-E10-E13-lim-X000 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PA0201 | Request for examination |
St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| P22-X000 | Classification modified |
St.27 status event code: A-2-2-P10-P22-nap-X000 |
|
| PC1202 | Submission of document of withdrawal before decision of registration |
St.27 status event code: N-1-6-B10-B11-nap-PC1202 |
|
| WITB | Written withdrawal of application | ||
| R18 | Changes to party contact information recorded |
Free format text: ST27 STATUS EVENT CODE: A-3-3-R10-R18-OTH-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| R18-X000 | Changes to party contact information recorded |
St.27 status event code: A-3-3-R10-R18-oth-X000 |