WO2022190215A1 - 機械学習プログラム、予測プログラム、装置、及び方法 - Google Patents

機械学習プログラム、予測プログラム、装置、及び方法 Download PDF

Info

Publication number
WO2022190215A1
WO2022190215A1 PCT/JP2021/009344 JP2021009344W WO2022190215A1 WO 2022190215 A1 WO2022190215 A1 WO 2022190215A1 JP 2021009344 W JP2021009344 W JP 2021009344W WO 2022190215 A1 WO2022190215 A1 WO 2022190215A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
nodes
edges
machine learning
triple
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2021/009344
Other languages
English (en)
French (fr)
Japanese (ja)
Inventor
孝典 鵜飼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to PCT/JP2021/009344 priority Critical patent/WO2022190215A1/ja
Priority to JP2023504926A priority patent/JPWO2022190215A1/ja
Priority to EP21930072.0A priority patent/EP4307183A4/en
Publication of WO2022190215A1 publication Critical patent/WO2022190215A1/ja
Priority to US18/457,023 priority patent/US20230401455A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/046Forward inferencing; Production systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Definitions

  • the information processing device generates a node indicating each value of each item included in the case data as described above, and from each "ID" node, the attributes, drugs, diseases, and side effects of the patient indicated by the ID.
  • Graph data is generated by connecting the edges to the nodes that represent each.
  • FIG. 2 shows an example of graph data generated from the case data shown in FIG. 2 , nodes indicated by circles with respective values inscribed therein are nodes indicating attributes, drugs, and diseases, respectively, and nodes indicated by rounded squares in which side effects are indicated are nodes indicating side effects. , and the arrows connecting the nodes are edges.
  • FIG. 2 shows a case where an ontology (broken line in FIG. 2) indicating background knowledge is connected to graph data indicating case data.
  • An ontology that indicates background knowledge is a systematization of background knowledge in the field of interest. etc.
  • nodes related to the ontology are represented by ellipses. Converting case data into graph data facilitates such ontology connections. Similar side effects may occur when diseases are similar or when drugs containing the same ingredients are administered. improves.
  • the information processing apparatus does not recalculate the embedding vector from 1 and update the graph data, but executes machine learning using additional data to obtain the embedding vector, which is a parameter of the machine-learned graph data. We will update it from time to time.
  • the processing load per machine learning is low, there is no need to store all the data used in past execution of machine learning, and changes in data can be immediately dealt with.
  • the machine learning device 10 functionally includes an acquisition unit 12 , an update unit 14 and a prediction unit 18 .
  • Graph data 16 is stored in a predetermined storage area of the machine learning device 10 .
  • the graph data 16 is machine-learned graph data in a state before additional data is input and update processing is executed.
  • Graph data 20 is graph data that has been updated.
  • the updating unit 14 selects a specific embedding vector from among the plurality of embedding vectors based on the additional data and the plurality of embedding vectors representing the plurality of nodes and the plurality of edges of the machine-learned graph data 16.
  • a specific embedding vector is an embedding vector representing nodes and edges included in the range determined as the range of online learning.
  • the specific embedding vector is the embedding vector of each of one or more nodes and one or more edges that are connected to the triple representing the additional data under a specific condition in the machine-learned graph data 16. is.
  • step S50 the prediction unit 18 acquires the input data input to the machine learning device 10, converts it into graph data, and calculates embedding vectors for each of the nodes and edges included in the graph data representing the input data.
  • step S52 the prediction unit 18 uses the similarity between the embedding vector of the graph data 20 and the embedding vector of the input data to determine whether there is a possibility of connecting to an edge to be predicted in the graph data 20. Identify a node.
  • step S54 the prediction unit 18 outputs as a prediction result whether or not the edge to be predicted is connected to the identified node, and the prediction process ends.
  • the updating unit 214 updates the embedding vector of the graph data 16 based on the additional data to generate the updated graph data 20, like the updating unit 14 in the first embodiment. Since the update unit 214 in the second embodiment has a different method for determining the range of online learning than the update unit 14 in the first embodiment, the method for determining the range by the update unit 214 in the second embodiment will be described below. .
  • range determination processing in machine learning processing differs from that in the first embodiment, so the range determination processing in the second embodiment will be described with reference to FIG. 15 .
  • processing similar to the range determination processing (FIG. 11) in the first embodiment is denoted by the same step number, and detailed description thereof will be omitted.
  • the updating unit 314 updates the embedding vector of the graph data 16 based on the additional data to generate the updated graph data 20, like the updating unit 14 in the first embodiment. Since the update unit 314 in the third embodiment has a different method for determining the range of online learning than the update unit 14 in the first embodiment, the method for determining the range by the update unit 314 in the third embodiment will be described below. .
  • the range is expanded to include the path from one node of the triple representing the additional data to the prediction node.
  • the updating unit adjusts the range to include the nodes and edges included in the triple representing the additional data from the other node to the node having the same distance as the distance from the one node to the prediction node. may decide.
  • the range is determined so as to include the path from the subject node to the prediction node, and the length of this path (corresponding to the distance) is two. Therefore, as shown in FIG. 19, the updating unit may extend the range 24 so that the edges and nodes included in the path of length 2 from the node that is the object are also included.
  • the imbalance in the scope of the online learning can be eliminated between the node side that is the subject of the triple representing the additional data and the node side that is the object.
  • a machine learning device including an acquisition unit and an update unit and a prediction device including a prediction unit may be implemented by different computers.
  • the storage unit of the machine learning device stores a machine learning program for executing the machine learning process in each of the above embodiments.
  • a prediction program for executing the prediction processing in each of the above embodiments is stored in the storage unit of the prediction device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Databases & Information Systems (AREA)
  • Pathology (AREA)
  • Epidemiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
PCT/JP2021/009344 2021-03-09 2021-03-09 機械学習プログラム、予測プログラム、装置、及び方法 Ceased WO2022190215A1 (ja)

Priority Applications (4)

Application Number Priority Date Filing Date Title
PCT/JP2021/009344 WO2022190215A1 (ja) 2021-03-09 2021-03-09 機械学習プログラム、予測プログラム、装置、及び方法
JP2023504926A JPWO2022190215A1 (https=) 2021-03-09 2021-03-09
EP21930072.0A EP4307183A4 (en) 2021-03-09 2021-03-09 MACHINE LEARNING PROGRAM, PREDICTION PROGRAM, DEVICE AND METHOD
US18/457,023 US20230401455A1 (en) 2021-03-09 2023-08-28 Storage medium, prediction device, and prediction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2021/009344 WO2022190215A1 (ja) 2021-03-09 2021-03-09 機械学習プログラム、予測プログラム、装置、及び方法

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/457,023 Continuation US20230401455A1 (en) 2021-03-09 2023-08-28 Storage medium, prediction device, and prediction method

Publications (1)

Publication Number Publication Date
WO2022190215A1 true WO2022190215A1 (ja) 2022-09-15

Family

ID=83226432

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2021/009344 Ceased WO2022190215A1 (ja) 2021-03-09 2021-03-09 機械学習プログラム、予測プログラム、装置、及び方法

Country Status (4)

Country Link
US (1) US20230401455A1 (https=)
EP (1) EP4307183A4 (https=)
JP (1) JPWO2022190215A1 (https=)
WO (1) WO2022190215A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240420391A1 (en) * 2023-06-16 2024-12-19 The Toronto-Dominion Bank Intelligent dashboard search engine

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016212853A (ja) * 2015-04-30 2016-12-15 富士通株式会社 類似性計算装置、薬の類似性を計算し及び類似性を用いて副作用を推定する副作用決定装置及びシステム
JP2018085116A (ja) 2016-11-23 2018-05-31 富士通株式会社 知識グラフを完成させるための方法および装置
JP2018206374A (ja) 2017-05-19 2018-12-27 富士通株式会社 予測される部位特異的なタンパク質リン酸化候補の発見のためのシステムおよび方法
US20190220524A1 (en) * 2018-01-16 2019-07-18 Accenture Global Solutions Limited Determining explanations for predicted links in knowledge graphs
JP2019125364A (ja) 2018-01-03 2019-07-25 エヌイーシー ラボラトリーズ ヨーロッパ ゲーエムベーハー マルチモーダルレコメンデーション方法及びシステム

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2016212853A (ja) * 2015-04-30 2016-12-15 富士通株式会社 類似性計算装置、薬の類似性を計算し及び類似性を用いて副作用を推定する副作用決定装置及びシステム
JP2018085116A (ja) 2016-11-23 2018-05-31 富士通株式会社 知識グラフを完成させるための方法および装置
JP2018206374A (ja) 2017-05-19 2018-12-27 富士通株式会社 予測される部位特異的なタンパク質リン酸化候補の発見のためのシステムおよび方法
JP2019125364A (ja) 2018-01-03 2019-07-25 エヌイーシー ラボラトリーズ ヨーロッパ ゲーエムベーハー マルチモーダルレコメンデーション方法及びシステム
US20190220524A1 (en) * 2018-01-16 2019-07-18 Accenture Global Solutions Limited Determining explanations for predicted links in knowledge graphs

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MURAKAMI KATSUHIKO: "Knowledge graph embedding using network structure ", THE 34TH ANNUAL CONFERENCE FOR THE JAPANESE SOCIETY OF ARTIFICIAL INTELLIGENCE, 9 June 2020 (2020-06-09), XP055968606 *
See also references of EP4307183A4

Also Published As

Publication number Publication date
EP4307183A4 (en) 2024-05-01
US20230401455A1 (en) 2023-12-14
EP4307183A1 (en) 2024-01-17
JPWO2022190215A1 (https=) 2022-09-15

Similar Documents

Publication Publication Date Title
Ripley et al. Manual for RSIENA
US12505358B2 (en) Methods and systems for approximating embeddings of out-of-knowledge-graph entities for link prediction in knowledge graph
JP2022024102A (ja) 検索モデルのトレーニング方法、目標対象の検索方法及びその装置
CN108292310A (zh) 用于数字实体相关的技术
JP6608972B2 (ja) ソーシャルネットワークに基づいてグループを探索する方法、デバイス、サーバ及び記憶媒体
JP2021500692A (ja) 系図エンティティ解決システムおよび方法
CN118779340A (zh) 自然语言查询语句的处理方法和装置
US20160378765A1 (en) Concept expansion using tables
CN113129053A (zh) 信息推荐模型训练方法、信息推荐方法及存储介质
CN113536040A (zh) 信息查询方法、装置以及存储介质
CN114118310A (zh) 基于综合相似度的聚类方法和装置
US10509800B2 (en) Visually interactive identification of a cohort of data objects similar to a query based on domain knowledge
US20230401455A1 (en) Storage medium, prediction device, and prediction method
JP6321845B1 (ja) 付与装置、付与方法および付与プログラム
CN111523048A (zh) 社交网络中好友的推荐方法、装置、存储介质及终端
US10943353B1 (en) Handling untrainable conditions in a network architecture search
CN119917674A (zh) 基于带属性多层图模型的第三方库的表示查询和推荐方法
JP2018156332A (ja) 生成装置、生成方法および生成プログラム
US20100115068A1 (en) Methods and systems for intelligent reconfiguration of information handling system networks
CN103477339B (zh) 模式识别
JP6541737B2 (ja) 選択装置、選択方法、選択プログラム、モデルおよび学習データ
JP7444280B2 (ja) 機械学習プログラム、推定プログラム、装置、及び方法
CN117555950A (zh) 基于数据中台的数据血缘关系构建方法
JP7491405B2 (ja) 点過程学習方法、点過程学習装置及びプログラム
CN116631642A (zh) 一种临床发现事件的抽取方法及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21930072

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2023504926

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2021930072

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2021930072

Country of ref document: EP

Effective date: 20231009