CN116508017A - 计算药物靶标选择 - Google Patents

计算药物靶标选择 Download PDF

Info

Publication number
CN116508017A
CN116508017A CN202180074273.9A CN202180074273A CN116508017A CN 116508017 A CN116508017 A CN 116508017A CN 202180074273 A CN202180074273 A CN 202180074273A CN 116508017 A CN116508017 A CN 116508017A
Authority
CN
China
Prior art keywords
public
drug
target
drug target
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202180074273.9A
Other languages
English (en)
Chinese (zh)
Inventor
丹尼尔·詹姆斯·克劳瑟
大卫·纳加内斯·卡隆
吉列尔莫·塞拉诺·纳赫拉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Aix Saianxia Artificial Intelligence Co ltd
Original Assignee
Aix Saianxia Artificial Intelligence Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Aix Saianxia Artificial Intelligence Co ltd filed Critical Aix Saianxia Artificial Intelligence Co ltd
Publication of CN116508017A publication Critical patent/CN116508017A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/40ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3325Reformulation based on results of preceding query
    • G06F16/3326Reformulation based on results of preceding query using relevance feedback from the user, e.g. relevance feedback on documents, documents sets, document terms or passages
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Creation or modification of classes or clusters
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/382Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using citations
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • G06N3/0442Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0895Weakly supervised learning, e.g. semi-supervised or self-supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/091Active learning
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • G16B40/20Supervised data analysis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/70Machine learning, data mining or chemometrics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/20ICT specially adapted for the handling or processing of medical references relating to practices or guidelines

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Epidemiology (AREA)
  • Public Health (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Chemical & Material Sciences (AREA)
  • Primary Health Care (AREA)
  • Bioethics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Toxicology (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Biotechnology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Library & Information Science (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
CN202180074273.9A 2020-10-29 2021-10-29 计算药物靶标选择 Pending CN116508017A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB2017177.3 2020-10-29
GB2017177.3A GB2600687A (en) 2020-10-29 2020-10-29 Computational drug target selection
PCT/GB2021/052813 WO2022096861A2 (en) 2020-10-29 2021-10-29 Computational drug target selection

Publications (1)

Publication Number Publication Date
CN116508017A true CN116508017A (zh) 2023-07-28

Family

ID=73776466

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180074273.9A Pending CN116508017A (zh) 2020-10-29 2021-10-29 计算药物靶标选择

Country Status (7)

Country Link
US (1) US20230352193A1 (https=)
EP (1) EP4238097A2 (https=)
JP (1) JP2023547964A (https=)
KR (1) KR20230128266A (https=)
CN (1) CN116508017A (https=)
GB (1) GB2600687A (https=)
WO (1) WO2022096861A2 (https=)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB202304213D0 (en) 2023-03-23 2023-05-10 Exscientia Ai Ltd Computational drug target selection
KR20250045561A (ko) 2023-09-25 2025-04-02 주식회사 엘지에너지솔루션 충전 관리 장치 및 그것의 동작 방법

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107533563A (zh) * 2015-05-29 2018-01-02 英特尔公司 用于动态自动化内容发现的技术
CN108427702A (zh) * 2017-10-23 2018-08-21 平安科技(深圳)有限公司 目标文档获取方法及应用服务器

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5347878B2 (ja) * 2009-09-29 2013-11-20 富士通株式会社 文献間関係解析装置、該プログラム、及び該方法
CN109074420B (zh) * 2016-05-12 2022-03-08 豪夫迈·罗氏有限公司 用于预测靶向药物治疗疾病的效果的系统
JP7237574B2 (ja) * 2018-12-27 2023-03-13 オムロンヘルスケア株式会社 血圧測定装置
US11721441B2 (en) * 2019-01-15 2023-08-08 Merative Us L.P. Determining drug effectiveness ranking for a patient using machine learning

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107533563A (zh) * 2015-05-29 2018-01-02 英特尔公司 用于动态自动化内容发现的技术
CN108427702A (zh) * 2017-10-23 2018-08-21 平安科技(深圳)有限公司 目标文档获取方法及应用服务器

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BOB JA SCHIJVENAARS ET AL.: "Thesaurus-based disambiguation of gene symbols", 《BMC BIOINFORMATICS》, vol. 6, 16 June 2005 (2005-06-16), pages 1 - 9 *
FARAG SAAD ET AL.: "Improving Named Entity Recognition for Biomedical and Patent Data Using Bi-LSTM Deep Neural Network Models", 《NLDB 2020》, 17 July 2020 (2020-07-17), pages 25 - 26 *
MARTIN KRALLINGER ET AL.: ""Information Retrieval and Text Mining Technologies for Chemistry"", 《CHEMICAL REVIEWS》, vol. 117, no. 12, 5 May 2017 (2017-05-05), pages 7726 - 7736 *
QI WANG: ""A Bibiliometric Model for Identifying Emerging Research Topics"", HTTPS://ARXIV.ORG/ABS/1707.03599, 12 July 2017 (2017-07-12), pages 2 - 9 *
UDO HAHN ET AL.: "Mining the pharmacogenomics literatureça survey of the state of the art", 《BRIEFINGS IN BIOINFORMATICS》, vol. 3, no. 4, 24 July 2012 (2012-07-24), pages 460 - 494 *

Also Published As

Publication number Publication date
GB202017177D0 (en) 2020-12-16
JP2023547964A (ja) 2023-11-14
GB2600687A (en) 2022-05-11
US20230352193A1 (en) 2023-11-02
KR20230128266A (ko) 2023-09-04
WO2022096861A3 (en) 2022-08-25
EP4238097A2 (en) 2023-09-06
WO2022096861A2 (en) 2022-05-12

Similar Documents

Publication Publication Date Title
Yan et al. Protein fold recognition based on multi-view modeling
Shatkay et al. Multi-dimensional classification of biomedical text: Toward automated, practical provision of high-utility text to diverse users
Koskinen et al. PANNZER: high-throughput functional annotation of uncharacterized proteins in an error-prone environment
Hristovski et al. Using literature-based discovery to identify disease candidate genes
Boyack et al. Clustering more than two million biomedical publications: Comparing the accuracies of nine text-based similarity approaches
Noh et al. Keyword selection and processing strategy for applying text mining to patent analysis
Elbasir et al. DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction
Li et al. PGCN: Disease gene prioritization by disease and gene embedding through graph convolutional neural networks
Jung et al. TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes
Li et al. Evaluating disease similarity based on gene network reconstruction and representation
Zhang et al. MATCHA: probing multi-way chromatin interaction with hypergraph representation learning
US20230352193A1 (en) Computational Drug Target Selection
Khan et al. DextMP: deep dive into text for predicting moonlighting proteins
Yones et al. Genome-wide pre-miRNA discovery from few labeled examples
Hong et al. CrepHAN: cross-species prediction of enhancers by using hierarchical attention networks
Blei et al. Statistical modeling of biomedical corpora: mining the caenorhabditis genetic center bibliography for genes related to life span
Jiang et al. Integrating image caption information into biomedical document classification in support of biocuration
Van Landeghem et al. Exploring biomolecular literature with EVEX: connecting genes through events, homology, and indirect associations
Zhong et al. Benchmarking gene embeddings from sequence, expression, network, and text models for functional prediction tasks
Karnyoto et al. Lda topic modeling for bioinformatics terms in arXiv documents
Turina et al. ThermoScan: semi-automatic identification of protein stability data from Pubmed
Dehghani et al. BioBERT-based SNP-trait associations extraction from biomedical literature
Chen et al. BERTE: High-precision hierarchical classification of transposable elements by a transfer learning method with BERT pre-trained model and convolutional neural network
Sehgal et al. Retrieval with gene queries
Roy et al. Nonnegative tensor factorization of biomedical literature for analysis of genomic data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination