JP2024538478A - ギャップ付き及び非ギャップタンパク質サンプルを使用した変異体病原性予測器の複合学習及び転移学習 - Google Patents

ギャップ付き及び非ギャップタンパク質サンプルを使用した変異体病原性予測器の複合学習及び転移学習 Download PDF

Info

Publication number
JP2024538478A
JP2024538478A JP2023580573A JP2023580573A JP2024538478A JP 2024538478 A JP2024538478 A JP 2024538478A JP 2023580573 A JP2023580573 A JP 2023580573A JP 2023580573 A JP2023580573 A JP 2023580573A JP 2024538478 A JP2024538478 A JP 2024538478A
Authority
JP
Japan
Prior art keywords
amino acid
gapped
pathogenicity
protein
computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2023580573A
Other languages
English (en)
Japanese (ja)
Inventor
トビアス・ハンプ
ホン・ガオ
カイ-ハウ・ファー
Original Assignee
イルミナ インコーポレイテッド
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US17/533,091 external-priority patent/US11538555B1/en
Application filed by イルミナ インコーポレイテッド filed Critical イルミナ インコーポレイテッド
Priority claimed from PCT/US2022/045823 external-priority patent/WO2023059750A1/en
Publication of JP2024538478A publication Critical patent/JP2024538478A/ja
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/20Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/20Heterogeneous data integration

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • Bioethics (AREA)
  • Databases & Information Systems (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Epidemiology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Public Health (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Probability & Statistics with Applications (AREA)
  • Physiology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
JP2023580573A 2021-10-06 2022-10-05 ギャップ付き及び非ギャップタンパク質サンプルを使用した変異体病原性予測器の複合学習及び転移学習 Pending JP2024538478A (ja)

Applications Claiming Priority (13)

Application Number Priority Date Filing Date Title
US202163253122P 2021-10-06 2021-10-06
US63/253,122 2021-10-06
US202163281592P 2021-11-19 2021-11-19
US202163281579P 2021-11-19 2021-11-19
US63/281,579 2021-11-19
US63/281,592 2021-11-19
US17/533,091 2021-11-22
US17/533,091 US11538555B1 (en) 2021-10-06 2021-11-22 Protein structure-based protein language models
US17/953,286 2022-09-26
US17/953,293 2022-09-26
US17/953,293 US20230108368A1 (en) 2021-10-06 2022-09-26 Combined and transfer learning of a variant pathogenicity predictor using gapped and non-gapped protein samples
US17/953,286 US20230108241A1 (en) 2021-10-06 2022-09-26 Predicting variant pathogenicity from evolutionary conservation using three-dimensional (3d) protein structure voxels
PCT/US2022/045823 WO2023059750A1 (en) 2021-10-06 2022-10-05 Combined and transfer learning of a variant pathogenicity predictor using gapped and non-gapped protein samples

Publications (1)

Publication Number Publication Date
JP2024538478A true JP2024538478A (ja) 2024-10-23

Family

ID=89808344

Family Applications (3)

Application Number Title Priority Date Filing Date
JP2023580573A Pending JP2024538478A (ja) 2021-10-06 2022-10-05 ギャップ付き及び非ギャップタンパク質サンプルを使用した変異体病原性予測器の複合学習及び転移学習
JP2023580572A Pending JP2024538477A (ja) 2021-10-06 2022-10-05 タンパク質構造に基づくタンパク質言語モデル
JP2023579826A Pending JP2024538475A (ja) 2021-10-06 2022-10-05 三次元(3d)タンパク質構造ボクセルを用いた進化的保存からの変異体病原性の予測

Family Applications After (2)

Application Number Title Priority Date Filing Date
JP2023580572A Pending JP2024538477A (ja) 2021-10-06 2022-10-05 タンパク質構造に基づくタンパク質言語モデル
JP2023579826A Pending JP2024538475A (ja) 2021-10-06 2022-10-05 三次元(3d)タンパク質構造ボクセルを用いた進化的保存からの変異体病原性の予測

Country Status (4)

Country Link
EP (3) EP4413577A1 (https=)
JP (3) JP2024538478A (https=)
KR (3) KR20240088641A (https=)
CN (2) CN117546242A (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117178327A (zh) * 2021-04-15 2023-12-05 因美纳有限公司 使用深度卷积神经网络来预测变体致病性的多通道蛋白质体素化
CN118629516B (zh) * 2024-05-17 2025-09-16 安徽农业大学 一种基于多模态特征和孪生网络的神经肽预测方法及系统
CN119560009B (zh) * 2025-01-22 2025-06-24 浙江工业大学 一种蛋白质翻译后修饰与疾病关联预测系统及方法

Also Published As

Publication number Publication date
EP4413575A1 (en) 2024-08-14
KR20240082270A (ko) 2024-06-10
KR20240082269A (ko) 2024-06-10
EP4413577A1 (en) 2024-08-14
EP4413576A1 (en) 2024-08-14
CN117642824A (zh) 2024-03-01
KR20240088641A (ko) 2024-06-20
JP2024538477A (ja) 2024-10-23
CN117546242A (zh) 2024-02-09
JP2024538475A (ja) 2024-10-23

Similar Documents

Publication Publication Date Title
US12444482B2 (en) Multi-channel protein voxelization to predict variant pathogenicity using deep convolutional neural networks
US11515010B2 (en) Deep convolutional neural networks to predict variant pathogenicity using three-dimensional (3D) protein structures
JP2024538478A (ja) ギャップ付き及び非ギャップタンパク質サンプルを使用した変異体病原性予測器の複合学習及び転移学習
JP2024514894A (ja) 深層学習のための効率的なボクセル化
JP7755105B2 (ja) 3次元(3d)タンパク質構造を用いて変異体病原性を予測する深層畳み込みニューラルネットワーク
US20230343413A1 (en) Protein structure-based protein language models
US20230108368A1 (en) Combined and transfer learning of a variant pathogenicity predictor using gapped and non-gapped protein samples
US20230047347A1 (en) Deep neural network-based variant pathogenicity prediction
CN117581302A (zh) 使用有缺口和非缺口的蛋白质样品的变体致病性预测器的组合学习和迁移学习
CN117178327A (zh) 使用深度卷积神经网络来预测变体致病性的多通道蛋白质体素化
WO2023059752A1 (en) Protein structure-based protein language models

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240412