CN113412519B - 机器学习引导的多肽分析 - Google Patents

机器学习引导的多肽分析 Download PDF

Info

Publication number
CN113412519B
CN113412519B CN202080013315.3A CN202080013315A CN113412519B CN 113412519 B CN113412519 B CN 113412519B CN 202080013315 A CN202080013315 A CN 202080013315A CN 113412519 B CN113412519 B CN 113412519B
Authority
CN
China
Prior art keywords
model
layers
protein
amino acid
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202080013315.3A
Other languages
English (en)
Chinese (zh)
Other versions
CN113412519A (zh
Inventor
J·D·菲拉
A·L·彼姆
M·K·吉布森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Flagship Development And Innovation Vi Co
Original Assignee
Flagship Development And Innovation Vi Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Flagship Development And Innovation Vi Co filed Critical Flagship Development And Innovation Vi Co
Publication of CN113412519A publication Critical patent/CN113412519A/zh
Application granted granted Critical
Publication of CN113412519B publication Critical patent/CN113412519B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional [2D] or three-dimensional [3D] molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/20Protein or domain folding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • G06N3/0442Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0475Generative networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0895Weakly supervised learning, e.g. semi-supervised or self-supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/094Adversarial learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/096Transfer learning
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/30Unsupervised data analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
CN202080013315.3A 2019-02-11 2020-02-10 机器学习引导的多肽分析 Active CN113412519B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962804034P 2019-02-11 2019-02-11
US201962804036P 2019-02-11 2019-02-11
US62/804,036 2019-02-11
US62/804,034 2019-02-11
PCT/US2020/017517 WO2020167667A1 (en) 2019-02-11 2020-02-10 Machine learning guided polypeptide analysis

Publications (2)

Publication Number Publication Date
CN113412519A CN113412519A (zh) 2021-09-17
CN113412519B true CN113412519B (zh) 2024-05-21

Family

ID=70005699

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080013315.3A Active CN113412519B (zh) 2019-02-11 2020-02-10 机器学习引导的多肽分析

Country Status (8)

Country Link
US (1) US20220122692A1 (https=)
EP (1) EP3924971A1 (https=)
JP (1) JP7492524B2 (https=)
KR (1) KR20210125523A (https=)
CN (1) CN113412519B (https=)
CA (1) CA3127965A1 (https=)
IL (1) IL285402A (https=)
WO (1) WO2020167667A1 (https=)

Families Citing this family (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018125928A1 (en) 2016-12-29 2018-07-05 DeepScale, Inc. Multi-channel sensor simulation for autonomous control systems
WO2018176000A1 (en) 2017-03-23 2018-09-27 DeepScale, Inc. Data synthesis for autonomous control systems
US11409692B2 (en) 2017-07-24 2022-08-09 Tesla, Inc. Vector computational unit
US11157441B2 (en) 2017-07-24 2021-10-26 Tesla, Inc. Computational array microprocessor system using non-consecutive data formatting
US10671349B2 (en) 2017-07-24 2020-06-02 Tesla, Inc. Accelerated mathematical engine
US11893393B2 (en) 2017-07-24 2024-02-06 Tesla, Inc. Computational array microprocessor system with hardware arbiter managing memory requests
US12307350B2 (en) 2018-01-04 2025-05-20 Tesla, Inc. Systems and methods for hardware-based pooling
US11561791B2 (en) 2018-02-01 2023-01-24 Tesla, Inc. Vector computational unit receiving data elements in parallel from a last row of a computational array
US11215999B2 (en) 2018-06-20 2022-01-04 Tesla, Inc. Data pipeline and deep learning system for autonomous driving
US11361457B2 (en) 2018-07-20 2022-06-14 Tesla, Inc. Annotation cross-labeling for autonomous control systems
US11636333B2 (en) 2018-07-26 2023-04-25 Tesla, Inc. Optimizing neural network structures for embedded systems
US11562231B2 (en) 2018-09-03 2023-01-24 Tesla, Inc. Neural networks for embedded devices
KR20250078625A (ko) 2018-10-11 2025-06-02 테슬라, 인크. 증강 데이터로 기계 모델을 훈련하기 위한 시스템 및 방법
US11196678B2 (en) 2018-10-25 2021-12-07 Tesla, Inc. QOS manager for system on a chip communications
US11816585B2 (en) 2018-12-03 2023-11-14 Tesla, Inc. Machine learning models operating at different frequencies for autonomous vehicles
US11537811B2 (en) 2018-12-04 2022-12-27 Tesla, Inc. Enhanced object detection for autonomous vehicles based on field view
US11610117B2 (en) 2018-12-27 2023-03-21 Tesla, Inc. System and method for adapting a neural network model on a hardware platform
US10997461B2 (en) 2019-02-01 2021-05-04 Tesla, Inc. Generating ground truth for machine learning from time series elements
US11150664B2 (en) 2019-02-01 2021-10-19 Tesla, Inc. Predicting three-dimensional features for autonomous driving
US11567514B2 (en) 2019-02-11 2023-01-31 Tesla, Inc. Autonomous and user controlled vehicle summon to a target
US10956755B2 (en) 2019-02-19 2021-03-23 Tesla, Inc. Estimating object properties using visual image data
US12040050B1 (en) * 2019-03-06 2024-07-16 Nabla Bio, Inc. Systems and methods for rational protein engineering with deep representation learning
EP4008006A1 (en) * 2019-08-02 2022-06-08 Flagship Pioneering Innovations VI, LLC Machine learning guided polypeptide design
US11455540B2 (en) * 2019-11-15 2022-09-27 International Business Machines Corporation Autonomic horizontal exploration in neural networks transfer learning
US20210249105A1 (en) * 2020-02-06 2021-08-12 Salesforce.Com, Inc. Systems and methods for language modeling of protein engineering
US11049590B1 (en) 2020-02-12 2021-06-29 Peptilogics, Inc. Artificial intelligence engine architecture for generating candidate drugs
IL300826A (en) 2020-08-25 2023-04-01 Seer Inc Compositions and methods for testing proteins and nucleic acids
EP4205125A4 (en) * 2020-08-28 2024-02-21 Just-Evotec Biologics, Inc. IMPLEMENTATION OF A GENERATIVE MACHINE LEARNING ARCHITECTURE TO GENERATE TRAINING DATA FOR A CLASSIFICATION MODEL
US11948664B2 (en) * 2020-09-21 2024-04-02 Just-Evotec Biologics, Inc. Autoencoder with generative adversarial network to generate protein sequences
US11403316B2 (en) 2020-11-23 2022-08-02 Peptilogics, Inc. Generating enhanced graphical user interfaces for presentation of anti-infective design spaces for selecting drug candidates
KR102569987B1 (ko) * 2021-03-10 2023-08-24 삼성전자주식회사 생체정보 추정 장치 및 방법
CN112951341B (zh) * 2021-03-15 2024-04-30 江南大学 一种基于复杂网络的多肽分类方法
US11512345B1 (en) 2021-05-07 2022-11-29 Peptilogics, Inc. Methods and apparatuses for generating peptides by synthesizing a portion of a design space to identify peptides having non-canonical amino acids
CN113257361B (zh) * 2021-05-31 2021-11-23 中国科学院深圳先进技术研究院 自适应蛋白质预测框架的实现方法、装置及设备
KR20240018606A (ko) * 2021-06-10 2024-02-13 바스프 아그리컬쳐럴 솔루션즈 시드 유에스 엘엘씨 단백질의 기공 형성 능력을 예측하기 위한 심층 학습 모델
US12462575B2 (en) 2021-08-19 2025-11-04 Tesla, Inc. Vision-based machine learning model for autonomous driving with adjustable virtual camera
WO2023023265A1 (en) 2021-08-19 2023-02-23 Tesla, Inc. Vision-based system training with simulated content
CN113971992B (zh) * 2021-10-26 2024-03-29 中国科学技术大学 针对分子属性预测图网络的自监督预训练方法与系统
CN114333982B (zh) 2021-11-26 2023-09-26 北京百度网讯科技有限公司 蛋白质表示模型预训练、蛋白质相互作用预测方法和装置
US20230268026A1 (en) 2022-01-07 2023-08-24 Absci Corporation Designing biomolecule sequence variants with pre-specified attributes
WO2023133564A2 (en) * 2022-01-10 2023-07-13 Aether Biomachines, Inc. Systems and methods for engineering protein activity
CN114678061A (zh) * 2022-02-09 2022-06-28 浙江大学杭州国际科创中心 基于预训练语言模型的蛋白质构象感知表示学习方法
CA3250670A1 (en) * 2022-05-09 2023-11-16 Leidos, Inc. SYSTEM AND METHOD FOR GENERING SYNTHETIC DATA USING HIDDEN TRANSFORMERS
EP4310726A1 (en) * 2022-07-20 2024-01-24 Nokia Solutions and Networks Oy Apparatus and method for channel impairment estimations using transformer-based machine learning model
CN114927165B (zh) * 2022-07-20 2022-12-02 深圳大学 泛素化位点的识别方法、装置、系统和存储介质
EP4573554A1 (en) * 2022-08-15 2025-06-25 Microsoft Technology Licensing, LLC Machine learning solution to predict protein characteristics
WO2024040189A1 (en) * 2022-08-18 2024-02-22 Seer, Inc. Methods for using a machine learning algorithm for omic analysis
CN115169543A (zh) * 2022-09-05 2022-10-11 广东工业大学 一种基于迁移学习的短期光伏功率预测方法及系统
CN120188223A (zh) * 2022-11-02 2025-06-20 巴斯夫欧洲公司 用于使用自然语言处理(nlp)来预测蛋白质功能相似度的系统和方法
CN115966249B (zh) * 2023-02-15 2023-05-26 北京科技大学 基于分数阶神经网的蛋白质-atp结合位点预测方法及装置
AU2024222132A1 (en) * 2023-02-15 2025-08-28 Insitro, Inc. Machine-learning-enabled predictive biomarker discovery and patient stratification using standard-of-care data
EP4668279A4 (en) 2023-02-16 2026-03-25 Fujitsu Ltd INFORMATION PROCESSING PROGRAM, INFORMATION PROCESS, AND INFORMATION PROCESSING DEVICE
CN116312834B (zh) * 2023-03-03 2026-02-24 大连理工大学 一种基于机器学习预测酰胺键分子合成转化率的方法
CN116072227B (zh) 2023-03-07 2023-06-20 中国海洋大学 海洋营养成分生物合成途径挖掘方法、装置、设备和介质
CN116805508A (zh) * 2023-03-22 2023-09-26 东北林业大学 一种组织特异性eRNA的识别方法
CN116206690B (zh) * 2023-05-04 2023-08-08 山东大学齐鲁医院 一种抗菌肽生成和识别方法及系统
EP4730341A4 (en) * 2023-07-26 2026-05-06 Fujifilm Corp Medicine development support device, operation method for medicine development support device, operation program for medicine development support device, learning device, operation method for learning device, and operation program for learning device
WO2025036438A1 (zh) * 2023-08-15 2025-02-20 上海金斯康生物科技有限公司 预测突变对蛋白酶活性的影响
CN117334325B (zh) * 2023-09-26 2024-04-16 中山大学肿瘤防治中心(中山大学附属肿瘤医院、中山大学肿瘤研究所) 一种lcat在肝细胞癌诊断、治疗和预测复发的应用
CN120010238B (zh) * 2023-11-15 2025-11-11 中国石油化工股份有限公司 裂解炉操作的优化方法、装置、存储介质及处理器
US12554753B2 (en) * 2023-11-30 2026-02-17 Microsoft Technology Licensing, Llc Technical data enrichment through language models
CN117352043B (zh) * 2023-12-06 2024-03-05 江苏正大天创生物工程有限公司 基于神经网络的蛋白设计方法及系统
WO2025128525A1 (en) * 2023-12-11 2025-06-19 Research Development Foundation System and method for predicting microproteins
US20250218545A1 (en) * 2023-12-27 2025-07-03 X Development Llc Large language model driven data augmentation for protein machine learning
CN118335182B (zh) * 2024-01-10 2025-09-09 中国科学院天津工业生物技术研究所 基于深度学习的同源寡聚体亚基数量预测方法
CN117854591A (zh) * 2024-01-10 2024-04-09 中国人民解放军军事科学院军事医学研究院 一种基于自然语言处理技术的抗癌肽识别方法
WO2025191795A1 (ja) * 2024-03-14 2025-09-18 Ntt株式会社 学習装置、信号生成装置、学習方法、信号生成方法及びプログラム
WO2025259965A1 (en) * 2024-06-13 2025-12-18 Seattle Children's Hospital D/B/A Seattle Children's Research Institute Predicting nanobody stability and generating stability-modulated sequences
CN119513721B (zh) * 2024-11-04 2025-09-23 北京理工大学 一种用于旋转机械迁移诊断的无监督多源信息域自适应方法、设备、介质及产品

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107742061A (zh) * 2017-09-19 2018-02-27 中山大学 一种蛋白质相互作用预测方法、系统和装置
CN108601731A (zh) * 2015-12-16 2018-09-28 磨石肿瘤生物技术公司 新抗原的鉴别、制造及使用
CN109036571A (zh) * 2014-12-08 2018-12-18 20/20基因系统股份有限公司 用于预测患有癌症的可能性或风险的方法和机器学习系统

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2020501240A (ja) * 2016-11-18 2020-01-16 ナントミクス,エルエルシー 汎がんゲノムにおけるdnaアクセシビリティを予測するための方法及びシステム
US11631236B2 (en) * 2017-03-14 2023-04-18 Samsung Electronics Co., Ltd. System and method for deep labeling
US10592725B2 (en) * 2017-04-21 2020-03-17 General Electric Company Neural network systems
EP3486816A1 (en) * 2017-11-16 2019-05-22 Institut Pasteur Method, device, and computer program for generating protein sequences with autoregressive neural networks
US20190259470A1 (en) * 2018-02-19 2019-08-22 Protabit LLC Artificial intelligence platform for protein engineering

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109036571A (zh) * 2014-12-08 2018-12-18 20/20基因系统股份有限公司 用于预测患有癌症的可能性或风险的方法和机器学习系统
CN108601731A (zh) * 2015-12-16 2018-09-28 磨石肿瘤生物技术公司 新抗原的鉴别、制造及使用
CN107742061A (zh) * 2017-09-19 2018-02-27 中山大学 一种蛋白质相互作用预测方法、系统和装置

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding;Jacob Devlin等;arXiv e-prints;全文 *
Deep Recurrent Neural Network for Protein Function Prediction from Sequence;Xueliang Leon Liu等;arXiv e-prints;全文 *
Deep Variational Transfer: Transfer Learning through Semi-supervised Deep Generative Models;Marouan Belhaj等;arXiv e-prints;全文 *
DeepDTA: Deep Drug-Target Binding Affinity Prediction;Hakime ¨ Oztu¨rk等;arXiv e-prints;全文 *
Method of extracting sentences about protein interaction from the literature on protein structure analysis using selective transfer learning;Koyabu, S等;IEEE 12TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS & BIOENGINEERING;20121231;第46-51页 *
Multi-task Deep Neural Networks in Automated Protein Function Prediction;Sureyya Rifaioglu, Ahmet等;arXiv e-prints;全文 *
Seq3seq Fingerprint: Towards End-to-end Semi-supervised Deep Drug Discovery;Xiaoyu Zhang等;9th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB);第404-413页 *
基于模糊支持向量机的膜蛋白分类研究;邹凌云;王正志;王勇献;;生物医学工程研究(第04期);第6-11页 *

Also Published As

Publication number Publication date
JP7492524B2 (ja) 2024-05-29
WO2020167667A1 (en) 2020-08-20
EP3924971A1 (en) 2021-12-22
CN113412519A (zh) 2021-09-17
JP2022521686A (ja) 2022-04-12
IL285402A (en) 2021-09-30
US20220122692A1 (en) 2022-04-21
KR20210125523A (ko) 2021-10-18
CA3127965A1 (en) 2020-08-20

Similar Documents

Publication Publication Date Title
CN113412519B (zh) 机器学习引导的多肽分析
CN115136246B (zh) 机器学习引导的多肽设计
Vilhekar et al. Artificial intelligence in genetics
Kalfon et al. scPRINT: pre-training on 50 million cells allows robust gene network predictions
Yoshida et al. Bayesian learning in sparse graphical factor models via variational mean-field annealing
Chakraborty et al. Artificial intelligence in biological data
Zhao et al. Predicting protein functions based on heterogeneous graph attention technique
Yamada et al. De novo profile generation based on sequence context specificity with the long short-term memory network
Wang et al. Lm-gvp: A generalizable deep learning framework for protein property prediction from sequence and structure
US20250378311A1 (en) Machine learning architecture for modeling local and global features
Jiang et al. Modeling microbiome-trait associations with taxonomy-adaptive neural networks
TWI874059B (zh) 一種深度學習模型系統及其方法
KR102482302B1 (ko) 인공지능 기술을 사용하여 클러스터 데이터에 대응되는 주조직 적합성 복합체를 결정하기 위한 방법 및 장치
Mathai et al. DataDriven Approaches for Early Detection and Prediction of Chronic Kidney Disease Using Machine Learning
Seigneuric et al. Decoding artificial intelligence and machine learning concepts for cancer research applications
Yang et al. DPGOK: A Deep Learning-Based Method for Protein Function Prediction by Fusing GO Knowledge With Protein Features
CA3145875C (en) Machine learning guided polypeptide design
KR102547975B1 (ko) 인공지능 기술을 사용하여 클러스터 데이터에 대응되는 주조직 적합성 복합체를 결정하기 위한 방법 및 장치
HK40076311A (zh) 机器学习引导的多肽设计
CN119446335B (zh) 基于三元比较专业知识决策的药物-靶标相互作用机制预测方法和系统
Tandon et al. Artificial Intelligence and Machine Learning for Exploring PROTAC in Underutilized Cells
Yang et al. DeepGDel: Deep Learning-based Gene Deletion Prediction Framework for Growth-Coupled Production in Genome-Scale Metabolic Models
Bouamrane Obkml-go: Optimized clustering combination with biological knowledge for dna microarray expression data
Vieira Developing Deep Learnig Methods to Predict Phenotypes and Clinical Outcomes From Transcriptomics Data
Ganie et al. Enhancing Early Cancer Detection: An Investigation of DNA Sequences and Machine Learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant