JP7492524B2 - 機械学習支援ポリペプチド解析 - Google Patents

機械学習支援ポリペプチド解析 Download PDF

Info

Publication number
JP7492524B2
JP7492524B2 JP2021546841A JP2021546841A JP7492524B2 JP 7492524 B2 JP7492524 B2 JP 7492524B2 JP 2021546841 A JP2021546841 A JP 2021546841A JP 2021546841 A JP2021546841 A JP 2021546841A JP 7492524 B2 JP7492524 B2 JP 7492524B2
Authority
JP
Japan
Prior art keywords
layers
model
protein
aspects
amino acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021546841A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022521686A5 (https=
JP2022521686A (ja
Inventor
フィーラ・ジェイコブ・ディー.
ビーム・アンドリュー・レーン
ギブソン・モリー・クリサン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Flagship Pioneering Innovations VI Inc
Original Assignee
Flagship Pioneering Innovations VI Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Flagship Pioneering Innovations VI Inc filed Critical Flagship Pioneering Innovations VI Inc
Publication of JP2022521686A publication Critical patent/JP2022521686A/ja
Publication of JP2022521686A5 publication Critical patent/JP2022521686A5/ja
Application granted granted Critical
Publication of JP7492524B2 publication Critical patent/JP7492524B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional [2D] or three-dimensional [3D] molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/20Protein or domain folding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • G06N3/0442Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0475Generative networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0895Weakly supervised learning, e.g. semi-supervised or self-supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/094Adversarial learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/096Transfer learning
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/30Unsupervised data analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
JP2021546841A 2019-02-11 2020-02-10 機械学習支援ポリペプチド解析 Active JP7492524B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962804034P 2019-02-11 2019-02-11
US201962804036P 2019-02-11 2019-02-11
US62/804,036 2019-02-11
US62/804,034 2019-02-11
PCT/US2020/017517 WO2020167667A1 (en) 2019-02-11 2020-02-10 Machine learning guided polypeptide analysis

Publications (3)

Publication Number Publication Date
JP2022521686A JP2022521686A (ja) 2022-04-12
JP2022521686A5 JP2022521686A5 (https=) 2023-02-16
JP7492524B2 true JP7492524B2 (ja) 2024-05-29

Family

ID=70005699

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021546841A Active JP7492524B2 (ja) 2019-02-11 2020-02-10 機械学習支援ポリペプチド解析

Country Status (8)

Country Link
US (1) US20220122692A1 (https=)
EP (1) EP3924971A1 (https=)
JP (1) JP7492524B2 (https=)
KR (1) KR20210125523A (https=)
CN (1) CN113412519B (https=)
CA (1) CA3127965A1 (https=)
IL (1) IL285402A (https=)
WO (1) WO2020167667A1 (https=)

Families Citing this family (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018125928A1 (en) 2016-12-29 2018-07-05 DeepScale, Inc. Multi-channel sensor simulation for autonomous control systems
WO2018176000A1 (en) 2017-03-23 2018-09-27 DeepScale, Inc. Data synthesis for autonomous control systems
US11409692B2 (en) 2017-07-24 2022-08-09 Tesla, Inc. Vector computational unit
US11157441B2 (en) 2017-07-24 2021-10-26 Tesla, Inc. Computational array microprocessor system using non-consecutive data formatting
US10671349B2 (en) 2017-07-24 2020-06-02 Tesla, Inc. Accelerated mathematical engine
US11893393B2 (en) 2017-07-24 2024-02-06 Tesla, Inc. Computational array microprocessor system with hardware arbiter managing memory requests
US12307350B2 (en) 2018-01-04 2025-05-20 Tesla, Inc. Systems and methods for hardware-based pooling
US11561791B2 (en) 2018-02-01 2023-01-24 Tesla, Inc. Vector computational unit receiving data elements in parallel from a last row of a computational array
US11215999B2 (en) 2018-06-20 2022-01-04 Tesla, Inc. Data pipeline and deep learning system for autonomous driving
US11361457B2 (en) 2018-07-20 2022-06-14 Tesla, Inc. Annotation cross-labeling for autonomous control systems
US11636333B2 (en) 2018-07-26 2023-04-25 Tesla, Inc. Optimizing neural network structures for embedded systems
US11562231B2 (en) 2018-09-03 2023-01-24 Tesla, Inc. Neural networks for embedded devices
KR20250078625A (ko) 2018-10-11 2025-06-02 테슬라, 인크. 증강 데이터로 기계 모델을 훈련하기 위한 시스템 및 방법
US11196678B2 (en) 2018-10-25 2021-12-07 Tesla, Inc. QOS manager for system on a chip communications
US11816585B2 (en) 2018-12-03 2023-11-14 Tesla, Inc. Machine learning models operating at different frequencies for autonomous vehicles
US11537811B2 (en) 2018-12-04 2022-12-27 Tesla, Inc. Enhanced object detection for autonomous vehicles based on field view
US11610117B2 (en) 2018-12-27 2023-03-21 Tesla, Inc. System and method for adapting a neural network model on a hardware platform
US10997461B2 (en) 2019-02-01 2021-05-04 Tesla, Inc. Generating ground truth for machine learning from time series elements
US11150664B2 (en) 2019-02-01 2021-10-19 Tesla, Inc. Predicting three-dimensional features for autonomous driving
US11567514B2 (en) 2019-02-11 2023-01-31 Tesla, Inc. Autonomous and user controlled vehicle summon to a target
US10956755B2 (en) 2019-02-19 2021-03-23 Tesla, Inc. Estimating object properties using visual image data
US12040050B1 (en) * 2019-03-06 2024-07-16 Nabla Bio, Inc. Systems and methods for rational protein engineering with deep representation learning
EP4008006A1 (en) * 2019-08-02 2022-06-08 Flagship Pioneering Innovations VI, LLC Machine learning guided polypeptide design
US11455540B2 (en) * 2019-11-15 2022-09-27 International Business Machines Corporation Autonomic horizontal exploration in neural networks transfer learning
US20210249105A1 (en) * 2020-02-06 2021-08-12 Salesforce.Com, Inc. Systems and methods for language modeling of protein engineering
US11049590B1 (en) 2020-02-12 2021-06-29 Peptilogics, Inc. Artificial intelligence engine architecture for generating candidate drugs
IL300826A (en) 2020-08-25 2023-04-01 Seer Inc Compositions and methods for testing proteins and nucleic acids
EP4205125A4 (en) * 2020-08-28 2024-02-21 Just-Evotec Biologics, Inc. IMPLEMENTATION OF A GENERATIVE MACHINE LEARNING ARCHITECTURE TO GENERATE TRAINING DATA FOR A CLASSIFICATION MODEL
US11948664B2 (en) * 2020-09-21 2024-04-02 Just-Evotec Biologics, Inc. Autoencoder with generative adversarial network to generate protein sequences
US11403316B2 (en) 2020-11-23 2022-08-02 Peptilogics, Inc. Generating enhanced graphical user interfaces for presentation of anti-infective design spaces for selecting drug candidates
KR102569987B1 (ko) * 2021-03-10 2023-08-24 삼성전자주식회사 생체정보 추정 장치 및 방법
CN112951341B (zh) * 2021-03-15 2024-04-30 江南大学 一种基于复杂网络的多肽分类方法
US11512345B1 (en) 2021-05-07 2022-11-29 Peptilogics, Inc. Methods and apparatuses for generating peptides by synthesizing a portion of a design space to identify peptides having non-canonical amino acids
CN113257361B (zh) * 2021-05-31 2021-11-23 中国科学院深圳先进技术研究院 自适应蛋白质预测框架的实现方法、装置及设备
KR20240018606A (ko) * 2021-06-10 2024-02-13 바스프 아그리컬쳐럴 솔루션즈 시드 유에스 엘엘씨 단백질의 기공 형성 능력을 예측하기 위한 심층 학습 모델
US12462575B2 (en) 2021-08-19 2025-11-04 Tesla, Inc. Vision-based machine learning model for autonomous driving with adjustable virtual camera
WO2023023265A1 (en) 2021-08-19 2023-02-23 Tesla, Inc. Vision-based system training with simulated content
CN113971992B (zh) * 2021-10-26 2024-03-29 中国科学技术大学 针对分子属性预测图网络的自监督预训练方法与系统
CN114333982B (zh) 2021-11-26 2023-09-26 北京百度网讯科技有限公司 蛋白质表示模型预训练、蛋白质相互作用预测方法和装置
US20230268026A1 (en) 2022-01-07 2023-08-24 Absci Corporation Designing biomolecule sequence variants with pre-specified attributes
WO2023133564A2 (en) * 2022-01-10 2023-07-13 Aether Biomachines, Inc. Systems and methods for engineering protein activity
CN114678061A (zh) * 2022-02-09 2022-06-28 浙江大学杭州国际科创中心 基于预训练语言模型的蛋白质构象感知表示学习方法
CA3250670A1 (en) * 2022-05-09 2023-11-16 Leidos, Inc. SYSTEM AND METHOD FOR GENERING SYNTHETIC DATA USING HIDDEN TRANSFORMERS
EP4310726A1 (en) * 2022-07-20 2024-01-24 Nokia Solutions and Networks Oy Apparatus and method for channel impairment estimations using transformer-based machine learning model
CN114927165B (zh) * 2022-07-20 2022-12-02 深圳大学 泛素化位点的识别方法、装置、系统和存储介质
EP4573554A1 (en) * 2022-08-15 2025-06-25 Microsoft Technology Licensing, LLC Machine learning solution to predict protein characteristics
WO2024040189A1 (en) * 2022-08-18 2024-02-22 Seer, Inc. Methods for using a machine learning algorithm for omic analysis
CN115169543A (zh) * 2022-09-05 2022-10-11 广东工业大学 一种基于迁移学习的短期光伏功率预测方法及系统
CN120188223A (zh) * 2022-11-02 2025-06-20 巴斯夫欧洲公司 用于使用自然语言处理(nlp)来预测蛋白质功能相似度的系统和方法
CN115966249B (zh) * 2023-02-15 2023-05-26 北京科技大学 基于分数阶神经网的蛋白质-atp结合位点预测方法及装置
AU2024222132A1 (en) * 2023-02-15 2025-08-28 Insitro, Inc. Machine-learning-enabled predictive biomarker discovery and patient stratification using standard-of-care data
EP4668279A4 (en) 2023-02-16 2026-03-25 Fujitsu Ltd INFORMATION PROCESSING PROGRAM, INFORMATION PROCESS, AND INFORMATION PROCESSING DEVICE
CN116312834B (zh) * 2023-03-03 2026-02-24 大连理工大学 一种基于机器学习预测酰胺键分子合成转化率的方法
CN116072227B (zh) 2023-03-07 2023-06-20 中国海洋大学 海洋营养成分生物合成途径挖掘方法、装置、设备和介质
CN116805508A (zh) * 2023-03-22 2023-09-26 东北林业大学 一种组织特异性eRNA的识别方法
CN116206690B (zh) * 2023-05-04 2023-08-08 山东大学齐鲁医院 一种抗菌肽生成和识别方法及系统
EP4730341A4 (en) * 2023-07-26 2026-05-06 Fujifilm Corp Medicine development support device, operation method for medicine development support device, operation program for medicine development support device, learning device, operation method for learning device, and operation program for learning device
WO2025036438A1 (zh) * 2023-08-15 2025-02-20 上海金斯康生物科技有限公司 预测突变对蛋白酶活性的影响
CN117334325B (zh) * 2023-09-26 2024-04-16 中山大学肿瘤防治中心(中山大学附属肿瘤医院、中山大学肿瘤研究所) 一种lcat在肝细胞癌诊断、治疗和预测复发的应用
CN120010238B (zh) * 2023-11-15 2025-11-11 中国石油化工股份有限公司 裂解炉操作的优化方法、装置、存储介质及处理器
US12554753B2 (en) * 2023-11-30 2026-02-17 Microsoft Technology Licensing, Llc Technical data enrichment through language models
CN117352043B (zh) * 2023-12-06 2024-03-05 江苏正大天创生物工程有限公司 基于神经网络的蛋白设计方法及系统
WO2025128525A1 (en) * 2023-12-11 2025-06-19 Research Development Foundation System and method for predicting microproteins
US20250218545A1 (en) * 2023-12-27 2025-07-03 X Development Llc Large language model driven data augmentation for protein machine learning
CN118335182B (zh) * 2024-01-10 2025-09-09 中国科学院天津工业生物技术研究所 基于深度学习的同源寡聚体亚基数量预测方法
CN117854591A (zh) * 2024-01-10 2024-04-09 中国人民解放军军事科学院军事医学研究院 一种基于自然语言处理技术的抗癌肽识别方法
WO2025191795A1 (ja) * 2024-03-14 2025-09-18 Ntt株式会社 学習装置、信号生成装置、学習方法、信号生成方法及びプログラム
WO2025259965A1 (en) * 2024-06-13 2025-12-18 Seattle Children's Hospital D/B/A Seattle Children's Research Institute Predicting nanobody stability and generating stability-modulated sequences
CN119513721B (zh) * 2024-11-04 2025-09-23 北京理工大学 一种用于旋转机械迁移诊断的无监督多源信息域自适应方法、设备、介质及产品

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018094360A2 (en) 2016-11-18 2018-05-24 Nantomics, Llc Methods and systems for predicting dna accessibility in the pan-cancer genome

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016094330A2 (en) * 2014-12-08 2016-06-16 20/20 Genesystems, Inc Methods and machine learning systems for predicting the liklihood or risk of having cancer
ES2970865T3 (es) * 2015-12-16 2024-05-31 Gritstone Bio Inc Identificación, fabricación y uso de neoantígenos
US11631236B2 (en) * 2017-03-14 2023-04-18 Samsung Electronics Co., Ltd. System and method for deep labeling
US10592725B2 (en) * 2017-04-21 2020-03-17 General Electric Company Neural network systems
CN107742061B (zh) * 2017-09-19 2021-06-01 中山大学 一种蛋白质相互作用预测方法、系统和装置
EP3486816A1 (en) * 2017-11-16 2019-05-22 Institut Pasteur Method, device, and computer program for generating protein sequences with autoregressive neural networks
US20190259470A1 (en) * 2018-02-19 2019-08-22 Protabit LLC Artificial intelligence platform for protein engineering

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018094360A2 (en) 2016-11-18 2018-05-24 Nantomics, Llc Methods and systems for predicting dna accessibility in the pan-cancer genome

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Xiaoyu Zhang et al.,Seq3seq fingerprint: towards end-to-end semi-supervised deep drug discovery,[onlin], ACM SIGBioinformatics Record, Volume 8, Issue 1,pp 1-10,[2023年12月5日検索],インターネット,<URL:https://arxiv.org/ftp/arxiv/papers/1701/1701.08318.pdf>,Article No.1, URL:https://doi.org/10.1145/3284959.3284960
Xueliang Liu,Deep Recurrent Neural Network for Protein Function Prediction from Sequence,[online], ,[2023年12月5日検索],インターネット,<URL:https://arxiv.org/abs/1701.08318>,arXiv:1701.08318

Also Published As

Publication number Publication date
WO2020167667A1 (en) 2020-08-20
EP3924971A1 (en) 2021-12-22
CN113412519A (zh) 2021-09-17
JP2022521686A (ja) 2022-04-12
CN113412519B (zh) 2024-05-21
IL285402A (en) 2021-09-30
US20220122692A1 (en) 2022-04-21
KR20210125523A (ko) 2021-10-18
CA3127965A1 (en) 2020-08-20

Similar Documents

Publication Publication Date Title
JP7492524B2 (ja) 機械学習支援ポリペプチド解析
US20220270711A1 (en) Machine learning guided polypeptide design
Han et al. Develop machine learning-based regression predictive models for engineering protein solubility
Li et al. Evaluation of machine learning-assisted directed evolution across diverse combinatorial landscapes
Zhao et al. Predicting protein functions based on heterogeneous graph attention technique
Wang et al. Lm-gvp: A generalizable deep learning framework for protein property prediction from sequence and structure
TWI874059B (zh) 一種深度學習模型系統及其方法
Jia et al. im5C-DSCGA: a proposed hybrid framework based on improved DenseNet and attention mechanisms for identifying 5-methylcytosine sites in human RNA
CA3145875C (en) Machine learning guided polypeptide design
Fung et al. Few-shot in-context learning with large language models for antibody characterization
Yang et al. DPGOK: A Deep Learning-Based Method for Protein Function Prediction by Fusing GO Knowledge With Protein Features
Ly-Trong et al. TreeFormer: A transformer-based tree rearrangement operation for phylogenetic reconstruction
HK40076311A (zh) 机器学习引导的多肽设计
Mathai et al. DataDriven Approaches for Early Detection and Prediction of Chronic Kidney Disease Using Machine Learning
Meehl et al. Efficient Protein Engineering via Integrated Language Models and Bayesian Optimization
Rocks et al. Dual-encoder contrastive learning accelerates enzyme discovery
Guo et al. scTrace+: Enhancing cell fate inference by integrating the lineage-tracing and multi-faceted transcriptomic similarity information
Leske et al. Alignment-free Bacterial Taxonomy Classification with Genomic Language Models
CN119446335B (zh) 基于三元比较专业知识决策的药物-靶标相互作用机制预测方法和系统
Yang et al. DeepGDel: Deep Learning-based Gene Deletion Prediction Framework for Growth-Coupled Production in Genome-Scale Metabolic Models
Bouamrane Obkml-go: Optimized clustering combination with biological knowledge for dna microarray expression data
Vieira Developing Deep Learnig Methods to Predict Phenotypes and Clinical Outcomes From Transcriptomics Data
Ataei et al. TooTranslator: Zero-Shot Classification of Specific Substrates for Transport Proteins by Language Embedding Alignment of Proteins and Chemicals
Chakraborty et al. An Explainable AI Framework Integrating Machine and Deep Learning Models for Multi-Species DNA Functional Group Classification
Ganie et al. Enhancing Early Cancer Detection: An Investigation of DNA Sequences and Machine Learning

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230207

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20230207

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20231212

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240308

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20240423

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20240517

R150 Certificate of patent or registration of utility model

Ref document number: 7492524

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150