JP2021089523A

JP2021089523A - Health-medical care data analysis system and health-medical care data analysis method

Info

Publication number: JP2021089523A
Application number: JP2019218895A
Authority: JP
Inventors: 大崎　高伸; Takanobu Osaki; 高伸大崎; 利昇三好; Toshinori Miyoshi; 信二垂水; Shinji Tarumi; 伴　秀行; Hideyuki Ban; 伴　　秀行
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2019-12-03
Filing date: 2019-12-03
Publication date: 2021-06-10
Anticipated expiration: 2039-12-03
Also published as: JP7300973B2

Abstract

To assist in creating a new policy for medical care, medical checkup or nursing care.SOLUTION: A health-and-medical care data analysis system for analyzing health-and-medical care data comprises: a first database for managing the first health-and-medical care data of a first group; a knowledge vector creation unit for outputting a correlation between the first health-and-medical care data as vector information; a second database for storing knowledge vector information; a third database for managing the second health-and-medical care data of a second group and policy information relating to the policy carried out in the second group; a first vector conversion unit for generating vector information by vector-converting the second health-and-medical care data using knowledge vector information; and a machine learning unit for outputting model information in which a relationship between the vector information and the policy information is modeled by machine learning and storing it in a fourth database. The second group is composed of low-order groups constituting the first group, and the policy information includes the subjects of the policy of health-and-medical care and the results of the policy in the second group.SELECTED DRAWING: Figure 1

Description

本発明は、医療や介護等のヘルスケア情報を分析するヘルスケアデータ分析システム及び方法に関する。 The present invention relates to a healthcare data analysis system and method for analyzing healthcare information such as medical care and long-term care.

近年、健康寿命や労働寿命が延伸する一方で、少子高齢化などの人口構造の変化によって、医療費や介護費の増大が懸案事項となっている。このため、健康、医療、介護などの発生や重篤化の予防に対する適切な施策が要求されている。 In recent years, while healthy life expectancy and working life expectancy have been extended, changes in the population structure such as the declining birthrate and aging population have made increasing medical and long-term care costs a concern. For this reason, appropriate measures are required to prevent the occurrence and aggravation of health, medical care, long-term care, etc.

医療などに対する施策を決定するためには、将来の予測と過去の実績を検討する必要がある。将来の健康状態を精度よく予測する手法としては、例えば、特許文献１が知られている。特許文献１では、対象者の受診記録を表現した入力ベクトルを正規化してからその入力ベクトルを学習アルゴリズムに入力することで、対象者の将来の状態の予測を示す出力ベクトルを得ている。 In order to decide measures for medical care, it is necessary to consider future forecasts and past achievements. For example, Patent Document 1 is known as a method for accurately predicting a future health condition. In Patent Document 1, an output vector indicating a prediction of the future state of the subject is obtained by normalizing the input vector expressing the consultation record of the subject and then inputting the input vector into the learning algorithm.

また、過去の実績を検討する手法としては、特許文献２が知られている。特許文献２では、健診値と問診結果などの異なる種類のデータが混在する医療データを、潜在トピック分析を用いることで適切なクラスタリングを行うことが可能となるようなデータ表現を得る技術が開示されている。 Further, Patent Document 2 is known as a method for examining past achievements. Patent Document 2 discloses a technique for obtaining a data representation that enables appropriate clustering of medical data in which different types of data such as medical examination values and interview results are mixed by using latent topic analysis. Has been done.

特開２０１８−１９４９０４号公報JP-A-2018-194904 特開２０１７−０２７３０７号公報Japanese Unexamined Patent Publication No. 2017-027307

上記従来技術では、健康状態の予測や医療情報の分類などを行うことは可能であるが、医療や健診又は介護に対する施策の立案を支援することが難しい。 With the above-mentioned conventional technology, it is possible to predict the health condition and classify medical information, but it is difficult to support the planning of measures for medical treatment, medical examination, or long-term care.

特に、特定の地域（自治体や地方）や職域（集団）で実施した施策に対するヘルスケアデータから予測や分類を行った場合、その分析結果はその地域に適用することは可能であるが、特定の地域等で実施した施策を、異なる地域で実施する場合、同様の効果が得られるとは限らず、また、元の地域で分析した結果である予測モデルや分類方法などを適用できるとは限らない。 In particular, if predictions and classifications are made from healthcare data for measures implemented in a specific area (local government or region) or occupational area (group), the analysis results can be applied to that area, but they are specific. When measures implemented in a region are implemented in different regions, the same effect may not always be obtained, and the prediction model and classification method that are the results of analysis in the original region may not always be applicable. ..

すなわち、地域毎に環境（気候、風土）や人口構成や産業構造が異なるため、健康、医療、介護の予防等に関する施策の効果は同一になるとは限らない。このため、上記従来技術では、一部の地域のデータの分析結果である予測モデルや分類手法などから、全国的な医療等の施策や、他の地域の施策の作成を支援することが難しい、という問題があった。 In other words, the effects of measures related to health, medical care, prevention of long-term care, etc. are not always the same because the environment (climate, climate), population composition, and industrial structure differ from region to region. Therefore, with the above-mentioned conventional technology, it is difficult to support the creation of national medical measures and measures in other regions from the prediction model and classification method which are the analysis results of data in some regions. There was a problem.

そこで本発明は、上記問題点に鑑みてなされたもので、地域等の施策に対するヘルスケアデータを利用して、医療や健診又は介護に対する新たな施策の作成を支援することを目的とする。 Therefore, the present invention has been made in view of the above problems, and an object of the present invention is to support the creation of new measures for medical care, medical examination, or long-term care by using healthcare data for measures such as the community.

本発明は、プロセッサとメモリを有する計算機システムが、保健医療データを分析する保健医療データ分析システムであって、第１の集団の第１の保健医療データを管理する第１のデータベースと、前記第１の保健医療データを入力して、前記第１の保健医療データ間の相互の関係をベクトルに変換して知識ベクトル情報として出力する知識ベクトル作成部と、前記知識ベクトル情報を格納する第２のデータベースと、第２の集団の第２の保健医療データと、前記第２の集団で実施された施策に関する施策情報と、を管理する第３のデータベースと、前記第３のデータベースの第２の保健医療データと、前記第２のデータベースの知識ベクトル情報を入力し、前記第２の集団の第２の保健医療データを前記知識ベクトル情報を用いてベクトル変換によりベクトル情報を生成する第１のベクトル変換部と、前記変換されたベクトル情報と、前記第３のデータベースの施策情報を入力し、前記ベクトル情報と施策情報との関係を機械学習によりモデル化したモデル情報を出力する機械学習部と、前記出力されたモデル情報を管理する第４のデータベースと、を有し、前記第２の集団は、前記第１の集団を構成する下位の集団で構成され、前記施策情報は、前記第２の集団における保健医療の施策の対象者と、施策の結果を含む。 In the present invention, a computer system having a processor and a memory is a health care data analysis system that analyzes health care data, and has a first database that manages a first health care data of a first group, and the above-mentioned first. A knowledge vector creation unit that inputs 1 health care data, converts the mutual relationship between the first health care data into a vector and outputs it as knowledge vector information, and a second knowledge vector information storage unit. A third database that manages a database, a second health care data of the second group, and measure information regarding measures implemented in the second group, and a second health of the third database. A first vector transformation in which medical data and knowledge vector information of the second database are input, and vector information is generated by vector conversion of the second health care data of the second group using the knowledge vector information. A machine learning unit that inputs the converted vector information and the measure information of the third database and outputs model information that models the relationship between the vector information and the measure information by machine learning. It has a fourth database that manages the output model information, the second group is composed of lower groups constituting the first group, and the measure information is the second group. Includes the target persons of the health care measures in Japan and the results of the measures.

したがって、本発明は、全国（第１の集団）的な保健医療（ヘルスケア）データからベクトル情報を算出し、このベクトル情報と、特定の地域（第２の集団）で実施した施策及び施策に対するヘルスケアデータのベクトル情報からモデル情報（予測モデル）を生成する。このモデル情報に他の地域（第３の集団）のヘルスケアデータを適用することで、特定の地域（第２の集団）で行った施策を他の地域（第３の集団）で実施した場合の効果を予測することが可能となり、ヘルスケア（医療や健診又は介護）に対する新たな施策の作成を支援することが可能となる。 Therefore, the present invention calculates vector information from national (first group) health care data, and for this vector information and measures and measures implemented in a specific area (second group). Model information (predictive model) is generated from vector information of healthcare data. When the measures taken in a specific area (second group) are implemented in another area (third group) by applying the healthcare data of another area (third group) to this model information. It becomes possible to predict the effect of the above, and it becomes possible to support the creation of new measures for healthcare (medical care, medical examination or long-term care).

本明細書において開示される主題の、少なくとも一つの実施の詳細は、添付されている図面と以下の記述の中で述べられる。開示される主題のその他の特徴、態様、効果は、以下の開示、図面、請求項により明らかにされる。 Details of at least one practice of the subject matter disclosed herein are set forth in the accompanying drawings and in the description below. Other features, aspects, and effects of the disclosed subject matter are manifested in the disclosures, drawings, and claims below.

本発明の実施例を示し、ヘルスケアデータ分析システムの概要を示す図である。It is a figure which shows the Example of this invention and shows the outline of the healthcare data analysis system. 本発明の実施例を示し、ヘルスケアデータ分析システムの構成の一例を示すブロック図である。It is a block diagram which shows the Example of this invention and shows an example of the structure of the healthcare data analysis system. 本発明の実施例を示し、知識作成装置で行われる知識ベクトル情報の作成処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the knowledge vector information creation processing performed by the knowledge creation apparatus. 本発明の実施例を示し、モデル生成装置で行われるモデル生成処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the model generation processing performed by the model generation apparatus. 本発明の実施例を示し、シミュレーション装置＃２で行われるシミュレーション処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the simulation process performed by the simulation apparatus # 2. 本発明の実施例を示し、シミュレーション装置＃１で行われるシミュレーション処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the simulation process performed by the simulation apparatus # 1. 本発明の実施例を示し、データベースＤＢ＃１の健診情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the medical examination information of database DB # 1. 本発明の実施例を示し、データベースＤＢ＃３の健診情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the medical examination information of database DB # 3. 本発明の実施例を示し、データベースＤＢ＃５の健診情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the medical examination information of database DB # 5. 本発明の実施例を示し、データベースＤＢ＃１のレセプト情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the receipt information of the database DB # 1. 本発明の実施例を示し、データベースＤＢ＃３のレセプト情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the receipt information of the database DB # 3. 本発明の実施例を示し、データベースＤＢ＃５のレセプト情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the receipt information of the database DB # 5. 本発明の実施例を示し、データベースＤＢ＃１の介護情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the care information of database DB # 1. 本発明の実施例を示し、データベースＤＢ＃３の介護情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the care information of database DB # 3. 本発明の実施例を示し、データベースＤＢ＃５の介護情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the care information of database DB # 5. 本発明の実施例を示し、データベースＤＢ＃３の施策情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the measure information of the database DB # 3. 本発明の実施例を示し、データベースＤＢ＃３の施策効果情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the measure effect information of the database DB # 3. 本発明の実施例を示し、知識作成装置で生成される突き合わせ情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the matching information generated by the knowledge-creating apparatus. 本発明の実施例を示し、知識作成装置で生成されるベクトル情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the vector information generated by the knowledge-creating apparatus. 本発明の実施例を示し、データベースＤＢ＃１に格納される知識ベクトル情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the knowledge vector information stored in the database DB # 1. 本発明の実施例を示し、データベースＤＢ＃４に格納されるモデル情報の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the model information stored in the database DB # 4. 本発明の実施例を示し、モデル選択画面の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the model selection screen. 本発明の実施例を示し、シミュレーション結果画面の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the simulation result screen. 本発明の実施例を示し、シミュレーション装置の構成の一例を示すブロック図である。It is a block diagram which shows the Example of this invention and shows an example of the structure of the simulation apparatus. 本発明の実施例を示し、Ｓｋｉｐ−ｇｒａｍモデルを用いた用語のベクトル表現の生成の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the generation of the vector representation of a term using a Skip-gram model.

以下、本発明の実施形態を添付図面に基づいて説明する。 Hereinafter, embodiments of the present invention will be described with reference to the accompanying drawings.

＜概要＞
図１は、本発明の実施例を示し、ヘルスケア（保健医療）データ分析システムの概要を示す図である。本実施例のヘルスケアデータは、医療情報と、健診情報と、介護情報の少なくともひとつを含む情報である。 <Overview>
FIG. 1 is a diagram showing an embodiment of the present invention and showing an outline of a healthcare data analysis system. The healthcare data of this embodiment is information including at least one of medical information, medical examination information, and long-term care information.

ヘルスケアデータ分析システムでは、国（又は上位の集団）Ａ１が収集したヘルスケアデータは、医療のレセプト情報と、健診情報と、介護情報を含んでデータベース（ＤＢ＃１）１に格納される。国Ａ１が収集したヘルスケアデータは、匿名のデータベースであり、個人の特定はできないが、レセプト情報と、健診情報及び介護情報は、共通の個人ＩＤによって同一人物の情報を突き合わせることができる。 In the healthcare data analysis system, the healthcare data collected by the country (or higher group) A1 is stored in the database (DB # 1) 1 including medical receipt information, medical examination information, and long-term care information. .. The healthcare data collected by Country A1 is an anonymous database, and it is not possible to identify an individual, but the receipt information, medical examination information, and long-term care information can be matched with the same person's information by a common personal ID. ..

そして、国Ａ１のレベルでは、後述するように、ヘルスケアデータをベクトル変換することで知識ベクトル情報２１及びベクトル情報２３を生成し、データベース（ＤＢ＃２）２に格納される。なお、ベクトル変換の一例としては、分散表現を用いることができる。 Then, at the level of country A1, as will be described later, the knowledge vector information 21 and the vector information 23 are generated by vector-converting the healthcare data and stored in the database (DB # 2) 2. As an example of vector transformation, a distributed representation can be used.

一方、国（又は集団）Ａ１を構成する下位の地域（又は下位の集団）Ａ２は、当該地域Ａ２に居住又は勤務する個人のヘルスケアデータ（医療のレセプト情報、健診情報、介護情報）と、当該地域で実施した施策情報が、データベース（ＤＢ＃３）３に格納される。 On the other hand, the subordinate area (or subordinate group) A2 that constitutes the country (or group) A1 is the health care data (medical receipt information, medical examination information, care information) of the individual who lives or works in the area A2. , The information on the measures implemented in the area is stored in the database (DB # 3) 3.

地域Ａ２では、データベース３のヘルスケアデータと、施策情報を、国Ａ１が保有する知識ベクトル情報２１を用いて後述するようにベクトル化する。そして、地域Ａ２では、ベクトル化した情報で機械学習を実施して、ヘルスケアに関する予測を行うモデル情報４１を生成し、国Ａ１が管理するデータベース（ＤＢ＃４）４に登録する。モデル情報４１は、地域Ａ２で実施した施策情報毎に生成され、施策を行った結果（効果）を含むことができる。 In the area A2, the healthcare data of the database 3 and the measure information are vectorized using the knowledge vector information 21 held by the country A1 as described later. Then, in the region A2, machine learning is performed with the vectorized information to generate model information 41 for making predictions about healthcare, and the model information 41 is registered in the database (DB # 4) 4 managed by the country A1. The model information 41 is generated for each measure information implemented in the area A2, and can include the result (effect) of the measure.

また、国（又は集団）Ａ１を構成する他の地域（下位の集団）Ａ３は、当該地域Ａ３に居住又は勤務する個人のヘルスケアデータをデータベース（ＤＢ＃５）５に格納する。 In addition, another region (subgroup) A3 constituting the country (or group) A1 stores the healthcare data of the individual who lives or works in the region A3 in the database (DB # 5) 5.

国Ａ１が管理するデータベース４は、地域Ａ２と他の地域Ａ３で共有される。データベース４に格納されたモデル情報４１は、国Ａ１と他の地域Ａ３で利用される。国Ａ１では、データベース４から選択したモデル情報４１で、データベース１のヘルスケアデータについてシミュレーションを実施して、地域Ａ２で実施した施策を全国規模で実施した場合の効果を予測することができる。 The database 4 managed by country A1 is shared by region A2 and other regions A3. The model information 41 stored in the database 4 is used in the country A1 and other regions A3. In country A1, the model information 41 selected from database 4 can be used to simulate the healthcare data in database 1 and predict the effect of implementing the measures implemented in region A2 on a nationwide scale.

同様に、他の地域Ａ３では、データベース４から選択したモデル情報４１で、データベース５のヘルスケアデータについてシミュレーションを実施して、地域Ａ２で実施した施策を地域Ａ３で実施した場合の効果を予測することができる。 Similarly, in the other region A3, the model information 41 selected from the database 4 is used to simulate the healthcare data in the database 5 to predict the effect of implementing the measures implemented in the region A2 in the region A3. be able to.

以上のように、本実施例のヘルスケアデータ分析システムでは、国Ａ１が所有する全国レベルのヘルスケアデータをベクトル変換して知識ベクトル情報２１及びベクトル情報２３を生成し、施策を実施した地域Ａ２のヘルスケアデータと施策情報に知識ベクトル情報２１を用いてベクトル化した情報を機械学習してモデル情報４１（予測モデル）を生成する。なお、ベクトル情報２３が、ベクトル値の実体であり、知識ベクトル情報２１は、ベクトル情報２３を管理するテーブルである。 As described above, in the healthcare data analysis system of this embodiment, the national level healthcare data owned by country A1 is vector-converted to generate knowledge vector information 21 and vector information 23, and the area A2 where the measures are implemented. Model information 41 (prediction model) is generated by machine learning the vectorized information using the knowledge vector information 21 for the healthcare data and the measure information of the above. The vector information 23 is an entity of the vector value, and the knowledge vector information 21 is a table that manages the vector information 23.

国Ａ１と他の地域Ａ３では、このモデル情報４１を用いてそれぞれのヘルスケアデータでシミュレーションを実施することで、地域Ａ２で実施した様々な施策を国Ａ１や他の地域Ａ３で実施した場合の効果を予測することが可能となる。 In country A1 and other regions A3, by implementing simulations with their respective healthcare data using this model information 41, when various measures implemented in region A2 are implemented in country A1 and other regions A3. The effect can be predicted.

国Ａ１や他の地域Ａ３では、シミュレーションの結果に基づいて、新たな施策の立案や、既存の施策の流用を行うことで、ヘルスケアに関する施策の立案や決定を効率よく実施することができる。特に、国Ａ１や地域Ａ３では、地域Ａ２で実施した施策の結果に関するデータを持たないが、地域Ａ２で作成した予測モデルを、ＤＢ４にモデル情報として集約することで、国Ａ１や地域Ａ３で当該の施策を実施した場合の効果を予測することができる。さらに、地域Ａ２で予測モデルを作成するときに、国Ａ１のような全体のデータの特徴から作成した知識ベクトル情報２１を用いて、地域Ａ２のデータをベクトル化する。これにより、地域Ａ２で作成した予測モデルは、地域Ａ２のデータ特性だけでなく、国Ａ１の全体データにおける傾向を反映したモデルを作成することができる。そして、地域Ａ２で作成したモデルを国Ａ１や地域Ａ３に適用する場合にも、国Ａ１や地域Ａ３のデータを同じ知識ベクトル情報２１でベクトル化した上で予測モデルに適用することで、国Ａ１や地域Ａ３で適用した場合を精度よく予測することができる効果がある。このように、本実施例のヘルスケアデータ分析システムは、ヘルスケアに関する施策の立案や決定を支援することができる。 In country A1 and other regions A3, by planning new measures and diverting existing measures based on the results of simulations, it is possible to efficiently plan and decide measures related to healthcare. In particular, country A1 and region A3 do not have data on the results of measures implemented in region A2, but by aggregating the forecast models created in region A2 into DB4 as model information, the country A1 and region A3 are concerned. It is possible to predict the effect of implementing the above measures. Further, when the prediction model is created in the region A2, the data in the region A2 is vectorized by using the knowledge vector information 21 created from the characteristics of the entire data such as the country A1. As a result, the prediction model created in the region A2 can create a model that reflects not only the data characteristics of the region A2 but also the tendency in the overall data of the country A1. Then, even when the model created in the region A2 is applied to the country A1 or the region A3, the data of the country A1 or the region A3 is vectorized by the same knowledge vector information 21 and then applied to the prediction model. It has the effect of being able to accurately predict the case where it is applied in the area A3. In this way, the healthcare data analysis system of this embodiment can support the planning and decision-making of measures related to healthcare.

なお、上記では最上位の集団を国Ａ１として、下位の集団（Ａ２、Ａ３）を地域とした例を示したが、これに限定されるものではなく、最上位の集団（Ａ１）を会社全体とし、集団（Ａ２、Ａ３）を事業部等とすることができる。 In the above, an example is shown in which the highest group is country A1 and the lower group (A2, A3) is a region, but the present invention is not limited to this, and the highest group (A1) is the entire company. Then, the group (A2, A3) can be a business division or the like.

＜システム構成＞
図２は、ヘルスケアデータ分析システムの構成の一例を示すブロック図である。本実施例のヘルスケアデータ分析システムは、国（集団）Ａ１が管理するデータセンタＤＣ＃１と、地域（集団）Ａ２が管理するデータセンタＤＣ＃２と、地域（集団）Ａ３が管理するデータセンタＤＣ＃３と、をネットワーク（図示省略）で構成した例を示す。 <System configuration>
FIG. 2 is a block diagram showing an example of the configuration of a healthcare data analysis system. The healthcare data analysis system of this embodiment has data center DC # 1 managed by country (group) A1, data center DC # 2 managed by region (group) A2, and data managed by region (group) A3. An example in which the center DC # 3 and the center DC # 3 are configured by a network (not shown) is shown.

国（集団）Ａ１が管理するデータセンタＤＣ＃１は、匿名のヘルスケアデータを格納するデータベース１と、知識ベクトル情報２１及びベクトル情報２３を生成する知識作成装置９と、地域Ａ２で生成されたモデル情報４１を格納するデータベース４と、データベース４からモデル情報４１を選択してデータベース１のヘルスケアデータと知識ベクトル情報２１及びベクトル情報２３でシミュレーションを実施するシミュレーション装置（＃１）４とを含む。 The data center DC # 1 managed by the country (group) A1 is generated in the database 1 that stores anonymous healthcare data, the knowledge creation device 9 that generates the knowledge vector information 21 and the vector information 23, and the area A2. The database 4 for storing the model information 41 and the simulation device (# 1) 4 for selecting the model information 41 from the database 4 and performing the simulation with the healthcare data and the knowledge vector information 21 and the vector information 23 of the database 1 are included. ..

地域（集団）Ａ２が管理するデータセンタＤＣ＃２は、地域Ａ２居住する個人情報を含むヘルスケアデータと施策情報を格納するデータベース３と、データセンタＤＣ＃１の知識ベクトル情報２１及びベクトル情報２３を用いて、地域Ａ２のヘルスケアデータと施策情報をベクトル化して、予測モデルを生成してデータセンタＤＣ＃１のデータベース４にモデル情報４１として格納するモデル生成装置７と、を含む。 The data center DC # 2 managed by the area (group) A2 is a database 3 that stores healthcare data including personal information resident in the area A2 and measure information, and knowledge vector information 21 and vector information 23 of the data center DC # 1. Includes a model generator 7 that vectorizes healthcare data and measure information in region A2, generates a prediction model, and stores it in database 4 of data center DC # 1 as model information 41.

地域（集団）Ａ３が管理するデータセンタＤＣ＃３は、地域Ａ３に居住する個人情報を含むヘルスケアデータを格納するデータベース５と、データセンタＤＣ＃１のデータベース４から選択したモデル情報４１と知識ベクトル情報２１及びベクトル情報２３を用いてデータベース５のヘルスケアデータでシミュレーションを実施するシミュレーション装置８と、を含む。 The data center DC # 3 managed by the area (group) A3 has a database 5 that stores healthcare data including personal information residing in the area A3, and model information 41 and knowledge selected from the database 4 of the data center DC # 1. Includes a simulation device 8 that performs simulations with healthcare data in database 5 using vector information 21 and vector information 23.

データセンタＤＣ＃１のデータベース１には、健康診断などの情報を匿名で格納した健診情報１１と、医療機関を受診した際のレセプトを匿名で格納したレセプト情報１２と、介護施設等を利用した際の情報を匿名で格納した介護情報１３と、が格納される。 Database 1 of data center DC # 1 uses medical examination information 11 that anonymously stores information such as medical examinations, receipt information 12 that anonymously stores receipts when visiting a medical institution, and nursing care facilities. The nursing care information 13 and the care information 13 in which the information at the time of this is anonymously stored are stored.

知識作成装置９は、ＣＰＵ９１と、メモリ９２と、通信装置（図示省略）を含む計算機である。メモリ９２には、知識ベクトル作成部９３がプログラムとしてロードされて、ＣＰＵ９１によって実行される。 The knowledge creation device 9 is a computer including a CPU 91, a memory 92, and a communication device (not shown). The knowledge vector creation unit 93 is loaded into the memory 92 as a program and executed by the CPU 91.

知識ベクトル作成部９３は、データベース１の健診情報１１、レセプト情報１２、介護情報１３のいずれかを用いて分散表現によってベクトル変換を行い、ベクトル情報２３及び知識ベクトル情報２１を生成してデータベース２へ格納する。なお、本実施例では、ベクトル情報２３のベクトル化する次元数をｍ次元とする。 The knowledge vector creation unit 93 performs vector conversion by distributed representation using any one of the medical examination information 11, the receipt information 12, and the long-term care information 13 of the database 1, generates the vector information 23 and the knowledge vector information 21, and generates the database 2 Store in. In this embodiment, the number of vectorized dimensions of the vector information 23 is m-dimensional.

データベース１の健診情報１１、レセプト情報１２、介護情報１３は、匿名化されているが、同一人物には同一の個人ＩＤ（識別子）を付与しているため、レセプト情報１２と健診情報１１及び介護情報１３の個人ＩＤを突き合わせて、各情報（テーブル）を結合してからベクトル変換を行うことができる。 The medical examination information 11, the receipt information 12, and the long-term care information 13 of the database 1 are anonymized, but since the same person is given the same personal ID (identifier), the receipt information 12 and the medical examination information 11 And the personal IDs of the long-term care information 13 can be matched, and each information (table) can be combined before vector conversion can be performed.

シミュレーション装置（＃１）６は、図２３で示すように、ＣＰＵ６１と、メモリ６２と、入力装置６７と、出力装置６８と、通信装置６９と、ストレージ装置６６を含む計算機である。 As shown in FIG. 23, the simulation device (# 1) 6 is a computer including a CPU 61, a memory 62, an input device 67, an output device 68, a communication device 69, and a storage device 66.

入力装置６７は、キーボードやマウスあるいはタッチパネル等で構成される。出力装置６８は、ディスプレイで構成される。通信装置６９は、図示しないネットワークを介して他の計算機と通信を行う。ストレージ装置６６は、不揮発性の記憶媒体で構成されてデータやプログラムを格納する。 The input device 67 is composed of a keyboard, a mouse, a touch panel, or the like. The output device 68 is composed of a display. The communication device 69 communicates with other computers via a network (not shown). The storage device 66 is composed of a non-volatile storage medium and stores data and programs.

メモリ６２には、出力部６３と、ベクトル変換部６４と、推定演算部６５が、プログラムとしてロードされてＣＰＵ６１によって実行される。 The output unit 63, the vector conversion unit 64, and the estimation calculation unit 65 are loaded into the memory 62 as a program and executed by the CPU 61.

シミュレーション装置（＃１）６は、利用者がデータベース４に蓄積された複数のモデル情報４１の中から選択し、データベース１のヘルスケアデータと知識ベクトル情報２１からベクトル変換部６４でベクトル化してから、推定演算部６５で選択したモデル情報４１とのベクトル情報２３によってシミュレーションを実施する。 The simulation device (# 1) 6 is selected by the user from a plurality of model information 41 stored in the database 4, and then vectorized from the healthcare data and knowledge vector information 21 of the database 1 by the vector conversion unit 64. , The simulation is performed by the vector information 23 with the model information 41 selected by the estimation calculation unit 65.

ベクトル変換部６４は、利用者が選択したデータベース１のレセプト情報１２と健診情報１１と介護情報１３のうちの少なくともひとつをベクトル化する。ベクトル変換部６４は、利用者が選択したテーブルを個人ＩＤで突き合わせてからベクトル化することができる。 The vector conversion unit 64 vectorizes at least one of the receipt information 12, the medical examination information 11, and the long-term care information 13 of the database 1 selected by the user. The vector conversion unit 64 can match the table selected by the user with the personal ID and then vectorize the table.

推定演算部６５は、ベクトル化されたデータと、知識ベクトル情報２１のベクトル情報２３を入力として、上記選択されたモデル情報４１でシミュレーションを実施して、出力装置６８へ結果を出力する。 The estimation calculation unit 65 takes the vectorized data and the vector information 23 of the knowledge vector information 21 as inputs, performs a simulation with the selected model information 41, and outputs the result to the output device 68.

ＣＰＵ６１は、各機能部のプログラムに従って処理を実行することによって、所定の機能を提供する機能部として稼働する。例えば、ＣＰＵ６１は、ベクトル変換プログラムに従って処理を実行することでベクトル変換部６４として機能する。他のプログラムについても同様である。さらに、ＣＰＵ６１は、各プログラムが実行する複数の処理のそれぞれの機能を提供する機能部としても稼働する。計算機及び計算機システムは、これらの機能部を含む装置及びシステムである。 The CPU 61 operates as a functional unit that provides a predetermined function by executing a process according to a program of each functional unit. For example, the CPU 61 functions as the vector conversion unit 64 by executing the process according to the vector conversion program. The same applies to other programs. Further, the CPU 61 also operates as a functional unit that provides each function of a plurality of processes executed by each program. A computer and a computer system are devices and systems including these functional parts.

次に、国（集団）Ａ１を構成する地域（集団）Ａ２が管理するデータセンタＤＣ＃２は、データベース３とモデル生成装置７を含む。データベース３には、当該地域Ａ２に居住又は勤務する個人のヘルスケアデータとして、レセプト情報３２と、健診情報３１と、介護情報３３と、当該地域で実施した施策情報３４及び施策効果情報３４１０と、が格納される。地域Ａ２のデータベース３は、国Ａ１とは異なり、個人情報を含むヘルスケアデータを格納する。 Next, the data center DC # 2 managed by the region (group) A2 constituting the country (group) A1 includes the database 3 and the model generator 7. Database 3 contains receipt information 32, medical examination information 31, long-term care information 33, measure information 34 implemented in the area, and measure effect information 3410 as healthcare data of individuals living or working in the area A2. , Is stored. The database 3 of the region A2, unlike the country A1, stores healthcare data including personal information.

健診情報３１は、健康診断などの情報を格納し、レセプト情報３２は、医療機関を受診した際のレセプトを格納し、介護情報３３は、介護施設等を利用した際の情報を格納する。 The medical examination information 31 stores information such as a medical examination, the receipt information 32 stores a receipt when a medical institution is consulted, and the long-term care information 33 stores information when a long-term care facility or the like is used.

施策情報３４には、地域Ａ２で実施した施策の内容が格納される。後述するように、施策情報３４には、保健医療の施策の対象者と、施策結果（施策の効果）が格納される。 The measure information 34 stores the contents of the measures implemented in the area A2. As will be described later, the measure information 34 stores the target person of the health care measure and the measure result (effect of the measure).

モデル生成装置７は、ＣＰＵ７１と、メモリ７２と、通信装置（図示省略）を含む計算機である。メモリ７２には、ベクトル変換部７３と機械学習部７４がプログラムとしてロードされてＣＰＵ７１によって実行される。 The model generation device 7 is a computer including a CPU 71, a memory 72, and a communication device (not shown). The vector conversion unit 73 and the machine learning unit 74 are loaded into the memory 72 as a program and executed by the CPU 71.

ベクトル変換部７３は、分散表現を用いてデータベース３のヘルスケアデータ（健診情報３１、レセプト情報３２、介護情報３３）と施策情報３４をベクトル化し、ベクトル化したデータを機械学習部７４で機械学習を実施して予測モデルを生成し、データセンタＤＣ＃１のデータベース４にモデル情報４１として格納する。 The vector conversion unit 73 vectorizes the healthcare data (medical examination information 31, receipt information 32, care information 33) and the measure information 34 of the database 3 using the distributed representation, and the machine learning unit 74 machines the vectorized data. The training is performed to generate a prediction model, which is stored as model information 41 in the database 4 of the data center DC # 1.

次に、国（集団）Ａ１を構成する地域（集団）Ａ３が管理するデータセンタＤＣ＃３は、データベース５とシミュレーション装置（＃２）８を含む。データベース５には、当該地域Ａ３に居住又は勤務する個人のヘルスケアデータとして、レセプト情報５２と、健診情報５１と、介護情報５３が格納される。地域Ａ３のデータベース５は、国Ａ１とは異なり、個人情報を含むヘルスケアデータである。 Next, the data center DC # 3 managed by the region (group) A3 constituting the country (group) A1 includes the database 5 and the simulation device (# 2) 8. The database 5 stores the receipt information 52, the medical examination information 51, and the long-term care information 53 as the healthcare data of the individual who lives or works in the area A3. The database 5 of the region A3 is different from the country A1 in that it is healthcare data including personal information.

健診情報５１は、健康診断などの情報を格納し、レセプト情報５２は、医療機関を受診した際のレセプトを格納し、介護情報５３は、介護施設等を利用した際の情報を格納する。 The medical examination information 51 stores information such as a medical examination, the receipt information 52 stores a receipt when a medical institution is consulted, and the long-term care information 53 stores information when a long-term care facility or the like is used.

シミュレーション装置（＃２）８は、ＣＰＵ８１と、メモリ８２と、通信装置（図示省略）を含む計算機である。メモリ８２には、出力部８３と、ベクトル変換部８４と、推定演算部８５がプログラムとしてロードされてＣＰＵ８１によって実行される。 The simulation device (# 2) 8 is a computer including a CPU 81, a memory 82, and a communication device (not shown). The output unit 83, the vector conversion unit 84, and the estimation calculation unit 85 are loaded into the memory 82 as a program and executed by the CPU 81.

シミュレーション装置（＃２）８は、利用者がデータセンタＤＣ＃１のデータベース４に蓄積された複数のモデル情報４１の中から選択し、データベース５のヘルスケアデータをベクトル変換部８４でベクトル化してから、推定演算部８５で選択したモデル情報４１と知識ベクトル情報２１のベクトル情報２３によってシミュレーションを実施する。 The simulation device (# 2) 8 is selected by the user from a plurality of model information 41 stored in the database 4 of the data center DC # 1, and the healthcare data of the database 5 is vectorized by the vector conversion unit 84. Therefore, the simulation is performed by the model information 41 selected by the estimation calculation unit 85 and the vector information 23 of the knowledge vector information 21.

ベクトル変換部８４は、利用者が選択したデータベース１のレセプト情報５２と健診情報５１と介護情報５３のうちの少なくともひとつをベクトル化する。ベクトル変換部８４は、利用者が選択したテーブルを個人ＩＤで突き合わせてからベクトル化することができる。 The vector conversion unit 84 vectorizes at least one of the receipt information 52, the medical examination information 51, and the long-term care information 53 of the database 1 selected by the user. The vector conversion unit 84 can match the table selected by the user with the personal ID and then vectorize the table.

推定演算部８５は、ベクトル化されたデータと、知識ベクトル情報２１のベクトル情報２３を入力として、上記選択されたモデル情報４１でシミュレーションを実施して、出力装置（図示省略）へ結果を出力する。 The estimation calculation unit 85 takes the vectorized data and the vector information 23 of the knowledge vector information 21 as inputs, performs a simulation with the selected model information 41, and outputs the result to an output device (not shown). ..

なお、図２では、国Ａ１を構成する地域が２つの例を示したが、これに限定されるものではない。また、地域Ａ２にモデル生成装置７を配置し、地域Ａ３にシミュレーション装置８を配置する例を示したが、これに限定されるものではない。例えば、各地域Ａ２、Ａ３にモデル生成装置７と、シミュレーション装置８をそれぞれ配置して、自地域の施策とヘルスケアデータによるモデル情報４１を生成してデータセンタＤＣ＃１のデータベース４に登録し、他の地域のモデル情報４１でシミュレーションを実施することができる。 In FIG. 2, two examples are shown in which the regions constituting the country A1 are not limited to these. Further, although an example in which the model generation device 7 is arranged in the area A2 and the simulation device 8 is arranged in the area A3 is shown, the present invention is not limited to this. For example, a model generator 7 and a simulation device 8 are arranged in each of the regions A2 and A3, and model information 41 based on the measures and healthcare data of the own region is generated and registered in the database 4 of the data center DC # 1. , The simulation can be carried out with the model information 41 of other regions.

また、知識作成装置９と、モデル生成装置７と、シミュレーション装置（＃２）８は、図２４に示したシミュレーション装置（＃１）６と同様に、入力装置と出力装置を有し、利用者の操作を入力装置で受け付けて、処理結果を出力装置に表示する。 Further, the knowledge creation device 9, the model generation device 7, and the simulation device (# 2) 8 have an input device and an output device as in the simulation device (# 1) 6 shown in FIG. 24, and are users. Is accepted by the input device and the processing result is displayed on the output device.

＜データ＞
次に、各データセンタＤＣ＃１〜ＤＣ＃３で使用されるデータについて説明する。図７は、データベース（ＤＢ＃１）１の健診情報１１の一例を示す図である。健診情報１１は国Ａ１が管理するテーブルである。 <Data>
Next, the data used in each of the data centers DC # 1 to DC # 3 will be described. FIG. 7 is a diagram showing an example of the medical examination information 11 of the database (DB # 1) 1. The medical examination information 11 is a table managed by the country A1.

健診情報１１は、健診ＩＤ１１０と、地域ＩＤ１１１と、個人ＩＤ１１２と、受診日１１３と、受信時年齢１１４と、腹囲１１５と、最低血圧１１６と、最高血圧１１７と、血糖値１１８と、中性脂肪１１９を一つのレコードに含む。 The medical examination information 11 includes the medical examination ID 110, the area ID 111, the individual ID 112, the examination date 113, the age at reception 114, the abdominal circumference 115, the diastolic blood pressure 116, the systolic blood pressure 117, and the blood glucose level 118. Includes triglyceride 119 in one record.

健診ＩＤ１１０には、健診を行った機関などを示す識別子が格納される。地域ＩＤ１１１には、健診を実施した地域を示す識別子が格納される。個人ＩＤ１１２には、受診者の識別子が付与される。腹囲１１５〜中性脂肪１１９には、測定値が格納される。 The medical examination ID 110 stores an identifier indicating the institution that performed the medical examination. The area ID 111 stores an identifier indicating the area where the medical examination was performed. The personal ID 112 is given an identifier of the examinee. Measured values are stored in the abdominal circumference 115 to triglyceride 119.

国Ａ１が管理するデータベース１は、匿名のデータベースであり、氏名や住所などの個人を特定する情報は無く、個人ＩＤ１１２は同一人物か否かを識別する情報として機能する。 The database 1 managed by the country A1 is an anonymous database, does not have personally identifiable information such as a name and an address, and the personal ID 112 functions as information for identifying whether or not the person is the same person.

次に、図８は、データベース（ＤＢ＃３）３の健診情報３１の一例を示す図である。健診情報３１は、地域Ａ２が管理するテーブルであり、個人を特定する個人情報を含む。 Next, FIG. 8 is a diagram showing an example of the medical examination information 31 of the database (DB # 3) 3. The medical examination information 31 is a table managed by the area A2, and includes personal information that identifies an individual.

健診情報３１は、健診ＩＤ３１０と、地域ＩＤ３１１と、個人ＩＤ３１２と、受診日３１３と、受信時年齢３１４と、腹囲３１５と、最低血圧３１６と、最高血圧３１７と、血糖値３１８と、中性脂肪３１９を一つのレコードに含む。 The medical examination information 31 includes a medical examination ID 310, an area ID 311 and an individual ID 312, a consultation date 313, a reception age 314, an abdominal circumference 315, a minimum blood pressure 316, a systolic blood pressure 317, and a blood glucose level 318. Includes triglyceride 319 in one record.

健診ＩＤ３１０には、健診を行った機関などを示す識別子が格納される。地域ＩＤ３１１には、健診を実施した地域を示す識別子が格納される。個人ＩＤ３１２には、受診者の識別子が付与される。 The medical examination ID 310 stores an identifier indicating the institution that performed the medical examination. The area ID 311 stores an identifier indicating the area where the medical examination was performed. The personal ID 312 is given an identifier of the examinee.

地域Ａ２が管理するデータベース３は、氏名や住所等の個人情報を含むデータベースであり、個人ＩＤ１１２は個人情報に対応付けられている。腹囲３１５〜中性脂肪３１９には測定値が格納される。 The database 3 managed by the area A2 is a database including personal information such as a name and an address, and the personal ID 112 is associated with the personal information. Measured values are stored in the abdominal circumference 315 to triglyceride 319.

次に、図９は、データベース（ＤＢ＃５）５の健診情報５１の一例を示す図である。健診情報５１は、他の地域Ａ３が管理するテーブルであり、個人を特定する個人情報を含む。 Next, FIG. 9 is a diagram showing an example of the medical examination information 51 of the database (DB # 5) 5. The medical examination information 51 is a table managed by another area A3, and includes personal information that identifies an individual.

健診情報５１は、健診ＩＤ５１０と、地域ＩＤ５１１と、個人ＩＤ５１２と、受診日５１３と、受信時年齢５１４と、腹囲５１５と、最低血圧５１６と、最高血圧５１７と、血糖値５１８と、中性脂肪５１９を一つのレコードに含む。 The medical examination information 51 includes a medical examination ID 510, an area ID 511, an individual ID 512, a consultation date 513, a reception age 514, an abdominal circumference 515, a minimum blood pressure 516, a systolic blood pressure 517, and a blood glucose level 518. One record contains triglyceride 519.

健診ＩＤ５１０には、健診を行った機関などを示す識別子が格納される。地域ＩＤ５１１には、健診を実施した地域を示す識別子が格納される。個人ＩＤ５１２には、受診者の識別子が付与される。 The medical examination ID 510 stores an identifier indicating the institution that performed the medical examination. The area ID 511 stores an identifier indicating the area where the medical examination was performed. The personal ID 512 is given an identifier of the examinee.

地域Ａ２が管理するデータベース３は、氏名や住所等の個人情報を含むデータベースであり、個人ＩＤ１１２は個人情報に対応付けられている。腹囲５１５〜中性脂肪５１９には測定値が格納される。 The database 3 managed by the area A2 is a database including personal information such as a name and an address, and the personal ID 112 is associated with the personal information. Measured values are stored in the abdominal circumference 515 to triglyceride 519.

図１０は、データベース（ＤＢ＃１）１のレセプト情報１２の一例を示す図である。レセプト情報１２は国Ａ１が管理するテーブルである。 FIG. 10 is a diagram showing an example of the receipt information 12 of the database (DB # 1) 1. Receipt information 12 is a table managed by country A1.

レセプト情報１２は、レセプトＩＤ１２０と、地域ＩＤ１２１と、個人ＩＤ１２２と、診療年月１２３と、医療機関番号１２４と、傷病名１２５と、診療行為１２６と、医薬品１２７と、請求点数１２８と、を一つのレコードに含む。 The receipt information 12 includes a receipt ID 120, an area ID 121, an individual ID 122, a medical treatment date 123, a medical institution number 124, an injury / illness name 125, a medical treatment practice 126, a drug 127, and a claim point 128. Included in one record.

レセプトＩＤ１２０には、医療請求の識別子が格納される。地域ＩＤ１１１には、診療を実施した地域を示す識別子が格納される。個人ＩＤ１１２には、受診者の識別子が付与される。 The medical claim identifier is stored in the receipt ID 120. The area ID 111 stores an identifier indicating the area where the medical treatment was performed. The personal ID 112 is given an identifier of the examinee.

医療機関番号１２４には、診療を行った医療機関の識別子が格納される。傷病名１２５には、傷病の名称又はコードが格納される。診療行為１２６には、患者に適用した診療内容のコードが格納される。医薬品１２７は、処方された医薬品が格納される。請求点数１２８には保健の点数が格納される。 The medical institution number 124 stores the identifier of the medical institution that performed the medical treatment. The injury / illness name 125 stores the name or code of the injury / illness. The medical practice 126 stores a code of medical care content applied to the patient. The drug 127 stores the prescribed drug. Health points are stored in the billing points 128.

国Ａ１が管理するデータベース１は、匿名のデータベースであり、氏名や住所などの個人を特定する情報は無く、個人ＩＤ１２２は同一人物か否かを識別する情報として機能する。 The database 1 managed by the country A1 is an anonymous database, does not have personally identifiable information such as a name and an address, and the personal ID 122 functions as information for identifying whether or not the person is the same person.

次に、図１１は、データベース（ＤＢ＃３）３のレセプト情報３２の一例を示す図である。レセプト情報３２は、地域Ａ２が管理するテーブルであり、個人を特定する個人情報を含む。 Next, FIG. 11 is a diagram showing an example of the receipt information 32 of the database (DB # 3) 3. The receipt information 32 is a table managed by the area A2, and includes personal information that identifies an individual.

レセプト情報３２は、レセプトＩＤ３２０と、地域ＩＤ３２１と、個人ＩＤ３２２と、診療年月３２３と、医療機関番号３２４と、傷病名３２５と、診療行為３２６と、医薬品３２７と、請求点数３２８と、を一つのレコードに含む。 The receipt information 32 includes a receipt ID 320, an area ID 321, an individual ID 322, a medical treatment date 323, a medical institution number 324, an injury / illness name 325, a medical practice 326, a drug 327, and a number of claims 328. Included in one record.

レセプトＩＤ３２０には、医療請求の識別子が格納される。地域ＩＤ１１１には、診療を実施した地域を示す識別子が格納される。個人ＩＤ１３２には、受診者の識別子が付与される。 The medical claim identifier is stored in the receipt ID 320. The area ID 111 stores an identifier indicating the area where the medical treatment was performed. An identifier of the examinee is given to the personal ID 132.

医療機関番号３２４には、診療を行った医療機関の識別子が格納される。傷病名３２５には、傷病の名称又はコードが格納される。診療行為３２６には、患者に適用した診療内容のコードが格納される。医薬品３２７は、処方された医薬品が格納される。請求点数３２８には保健の点数が格納される。 The medical institution number 324 stores the identifier of the medical institution that performed the medical treatment. The name or code of the injury or illness is stored in the injury or illness name 325. The medical practice 326 stores a code of medical care content applied to the patient. The drug 327 stores the prescribed drug. Health points are stored in the billing points 328.

地域Ａ２が管理するデータベース３は、氏名や住所等の個人情報を含むデータベースであり、個人ＩＤ３２２は個人情報に対応付けられている。 The database 3 managed by the area A2 is a database including personal information such as a name and an address, and the personal ID 322 is associated with the personal information.

次に、図１２は、データベース（ＤＢ＃５）５のレセプト情報５２の一例を示す図である。レセプト情報５２は、他の地域Ａ３が管理するテーブルであり、個人を特定する個人情報を含む。 Next, FIG. 12 is a diagram showing an example of the receipt information 52 of the database (DB # 5) 5. The receipt information 52 is a table managed by another area A3, and includes personal information that identifies an individual.

レセプト情報５２は、レセプトＩＤ５２０と、地域ＩＤ５２１と、個人ＩＤ５２２と、診療年月５２３と、医療機関番号５２４と、傷病名５２５と、診療行為５２６と、医薬品５２７と、請求点数５２８と、を一つのレコードに含む。 The receipt information 52 includes a receipt ID 520, an area ID 521, an individual ID 522, a medical treatment date 523, a medical institution number 524, an injury / illness name 525, a medical practice 526, a drug 527, and a number of claims 528. Included in one record.

レセプトＩＤ５２０には、医療請求の識別子が格納される。地域ＩＤ１１１には、診療を実施した地域を示す識別子が格納される。個人ＩＤ１５２には、受診者の識別子が付与される。 The medical claim identifier is stored in the receipt ID 520. The area ID 111 stores an identifier indicating the area where the medical treatment was performed. The personal ID 152 is given an identifier of the examinee.

医療機関番号５２４には、診療を行った医療機関の識別子が格納される。傷病名５２５には、傷病の名称又はコードが格納される。診療行為５２６には、患者に適用した診療内容のコードが格納される。医薬品５２７は、処方された医薬品が格納される。請求点数５２８には保健の点数が格納される。 The medical institution number 524 stores the identifier of the medical institution that performed the medical treatment. The name or code of the injury or illness is stored in the injury or illness name 525. The medical practice 526 stores a code of medical care content applied to the patient. The drug 527 stores the prescribed drug. Health points are stored in the billing points 528.

他の地域Ａ３が管理するデータベース５は、氏名や住所等の個人情報を含むデータベースであり、個人ＩＤ５２２は個人情報に対応付けられている。 The database 5 managed by the other area A3 is a database including personal information such as a name and an address, and the personal ID 522 is associated with the personal information.

図１３は、データベース（ＤＢ＃１）１の介護情報１３の一例を示す図である。介護情報１３は国Ａ１が管理するテーブルである。 FIG. 13 is a diagram showing an example of the long-term care information 13 of the database (DB # 1) 1. The long-term care information 13 is a table managed by the country A1.

介護情報１３は、介護情報ＩＤ１３０と、地域ＩＤ１３１と、個人ＩＤ１３２と、年月１３３と、要介護度１３４と、事業所番号１３５と、サービスコード１３６と、請求額１３７と、を一つのレコードに含む。 The long-term care information 13 contains the long-term care information ID 130, the area ID 131, the personal ID 132, the date 133, the degree of long-term care 134, the business establishment number 135, the service code 136, and the billing amount 137 in one record. Including.

介護情報ＩＤ１３０には、介護請求の識別子が格納される。地域ＩＤ１３１には、介護サービスを実施した地域を示す識別子が格納される。個人ＩＤ１３２には、介護者の識別子が付与される。年月１３３には、介護サービスを受けた年月が格納される。要介護度１３４には、要介護認定で判定された介護の必要性の程度が格納される。 The long-term care information ID 130 stores the long-term care request identifier. The area ID 131 stores an identifier indicating the area where the long-term care service is provided. A caregiver identifier is assigned to the personal ID 132. The year and month 133 stores the year and month when the long-term care service was received. The degree of need for long-term care 134 stores the degree of need for long-term care determined by the certification for long-term care.

事業所番号１３５には、介護サービスを実施した事業所の識別子が格納される。サービスコード１３６には、実施した介護サービスのコードが格納される。請求額１３７には、介護サービスの請求額が格納される。 The business establishment number 135 stores the identifier of the business establishment that provided the long-term care service. The service code 136 stores the code of the implemented long-term care service. The billed amount of the long-term care service is stored in the billed amount 137.

国Ａ１が管理するデータベース１は、匿名のデータベースであり、氏名や住所などの個人を特定する情報は無く、個人ＩＤ１３２は同一人物か否かを識別する情報として機能する。 The database 1 managed by the country A1 is an anonymous database, does not have personally identifiable information such as a name and an address, and the personal ID 132 functions as information for identifying whether or not the person is the same person.

図１４は、データベース（ＤＢ＃３）３の介護情報３３の一例を示す図である。介護情報３３は、地域Ａ２が管理するテーブルである。 FIG. 14 is a diagram showing an example of the long-term care information 33 of the database (DB # 3) 3. The long-term care information 33 is a table managed by the area A2.

介護情報３３は、介護情報ＩＤ３３０と、地域ＩＤ３３１と、個人ＩＤ３３２と、年月３３３と、要介護度３３４と、事業所番号３３５と、サービスコード３３６と、請求額３３７と、を一つのレコードに含む。 The long-term care information 33 contains the long-term care information ID 330, the area ID 331, the personal ID 332, the date 333, the degree of long-term care 334, the business office number 335, the service code 336, and the billing amount 337 in one record. Including.

介護情報ＩＤ３３０には、介護請求の識別子が格納される。地域ＩＤ３３１には、介護サービスを実施した地域を示す識別子が格納される。個人ＩＤ３３２には、介護者の識別子が付与される。年月３３３には、介護サービスを受けた年月が格納される。要介護度３３４には、要介護認定で判定された介護の必要性の程度が格納される。 The long-term care information ID 330 stores the long-term care request identifier. The area ID 331 stores an identifier indicating the area where the long-term care service is provided. A caregiver identifier is assigned to the personal ID 332. The year and month 333 stores the year and month when the long-term care service was received. The degree of need for long-term care 334 stores the degree of need for long-term care determined by the certification for long-term care.

事業所番号３３５には、介護サービスを実施した事業所の識別子が格納される。サービスコード３３６には、実施した介護サービスのコードが格納される。請求額３３７には、介護サービスの請求額が格納される。 The business establishment number 335 stores the identifier of the business establishment that provided the long-term care service. The service code 336 stores the code of the long-term care service provided. The billed amount of the long-term care service is stored in the billed amount 337.

地域Ａ２が管理するデータベース３は、氏名や住所等の個人情報を含むデータベースであり、個人ＩＤ３３２は個人情報に対応付けられている。 The database 3 managed by the area A2 is a database including personal information such as a name and an address, and the personal ID 332 is associated with the personal information.

図１５は、データベース（ＤＢ＃５）５の介護情報５３の一例を示す図である。介護情報５３は、他の地域Ａ３が管理するテーブルである。 FIG. 15 is a diagram showing an example of the long-term care information 53 of the database (DB # 5) 5. The long-term care information 53 is a table managed by another area A3.

介護情報５３は、介護情報ＩＤ５３０と、地域ＩＤ５３１と、個人ＩＤ５３２と、年月５３３と、要介護度５３４と、事業所番号５３５と、サービスコード５３６と、請求額５３７と、を一つのレコードに含む。 The long-term care information 53 includes a long-term care information ID 530, an area ID 531, an individual ID 532, a date 533, a degree of long-term care 534, a business establishment number 535, a service code 536, and a billing amount 537 in one record. Including.

介護情報ＩＤ５３０には、介護情報の識別子が格納される。地域ＩＤ５３１には、介護サービスを実施した地域を示す識別子が格納される。個人ＩＤ５３２には、介護者の識別子が付与される。年月５３３には、介護サービスを受けた年月が格納される。要介護度５３４には、要介護認定で判定された介護の必要性の程度が格納される。 The long-term care information ID 530 stores an identifier of the long-term care information. The area ID 531 stores an identifier indicating the area where the long-term care service is provided. A caregiver identifier is assigned to the personal ID 532. The year and month 533 stores the year and month when the long-term care service was received. The degree of need for long-term care 534 stores the degree of need for long-term care determined by the certification for long-term care.

事業所番号５３５には、介護サービスを実施した事業所の識別子が格納される。サービスコード５３６には、実施した介護サービスのコードが格納される。請求額５３７には、介護サービスの請求額が格納される。 The business establishment number 535 stores the identifier of the business establishment that provided the long-term care service. The service code 536 stores the code of the long-term care service provided. The billed amount of the long-term care service is stored in the billed amount 537.

他の地域Ａ３が管理するデータベース５は、氏名や住所等の個人情報を含むデータベースであり、個人ＩＤ５３２は個人情報に対応付けられている。 The database 5 managed by the other area A3 is a database including personal information such as a name and an address, and the personal ID 532 is associated with the personal information.

図１６Ａは、データベース（ＤＢ＃３）３の施策情報３４の一例を示す図である。施策情報３４は、地域Ａ２が管理するテーブルである。 FIG. 16A is a diagram showing an example of the measure information 34 of the database (DB # 3) 3. The measure information 34 is a table managed by the area A2.

施策情報３４は、地域ＩＤ３４０と、施策ＩＤ３４１と、施策名３４２と、対象者ＩＤ３４３と、目標達成３４４、を一つのレコードに含む。地域ＩＤ３４０には、当該施策を実施した地域の識別子が格納される。施策ＩＤ３４１には、施策を識別するための識別子が格納される。 The measure information 34 includes the area ID 340, the measure ID 341, the measure name 342, the target person ID 343, and the goal achievement 344 in one record. The area ID 340 stores the identifier of the area where the measure is implemented. An identifier for identifying the measure is stored in the measure ID 341.

施策名３４２には、施策の名称が格納される。対象者ＩＤ３４３には、施策に参加した個人ＩＤが格納される。目標達成３４４には、対象者が施策の目標を達成したか否かを示す値が格納され、達成した場合には「１」が格納され、達成できなかった場合には「０」が格納される。 The name of the measure is stored in the measure name 342. The individual ID who participated in the measure is stored in the target person ID 343. In the goal achievement 344, a value indicating whether or not the target person has achieved the goal of the measure is stored, "1" is stored when the target person has achieved the goal, and "0" is stored when the target person cannot achieve the goal. To.

施策情報３４の対象者ＩＤ３４３は、データベース３の健診情報３１、レセプト情報３２又は介護情報３３の個人ＩＤに対応している。施策情報３４は、施策に参加した個人が、施策毎に設定された目標に到達したか否かを管理することができる。 The target person ID 343 of the measure information 34 corresponds to the personal ID of the medical examination information 31, the receipt information 32, or the long-term care information 33 of the database 3. The measure information 34 can manage whether or not the individual who participated in the measure has reached the goal set for each measure.

図１６Ｂは、データベース（ＤＢ＃３）３の施策効果情報３４１０の一例を示す図である。施策効果情報３４１０は、地域Ａ２が管理するテーブルである。 FIG. 16B is a diagram showing an example of the measure effect information 3410 of the database (DB # 3) 3. The measure effect information 3410 is a table managed by the area A2.

施策効果情報３４１０は、施策ＩＤ３４１１と、施策名称３４１２と、実施年度３４１３と、効果指標３４１４と、実測効果３４１５とを一つのレコードに含む。施策ＩＤ３４１１には、施策の識別子が格納され、施策情報３４の施策ＩＤ３４１に対応する。 The measure effect information 3410 includes the measure ID 3411, the measure name 3412, the implementation year 3413, the effect index 3414, and the actually measured effect 3415 in one record. The measure ID 3411 stores the measure identifier and corresponds to the measure ID 341 of the measure information 34.

施策名称３４１２には、施策の名称が格納され、施策情報３４の施策名３４２に対応する。実施年度３４１３には、施策を実施した年度が格納される。効果指標３４１４には、施策の効果を測定する指標が格納される。実測効果３４１５には、効果指標３４１４に設定された指標の値が格納される。施策効果情報３４１０は、施策毎の実測効果３４１５を管理する。 The name of the measure is stored in the measure name 3412, and corresponds to the measure name 342 of the measure information 34. The year in which the measure was implemented is stored in the implementation year 3413. The effect index 3414 stores an index for measuring the effect of the measure. The measured effect 3415 stores the value of the index set in the effect index 3414. The measure effect information 3410 manages the actual measurement effect 3415 for each measure.

図１７は、データセンタＤＣ＃１の知識作成装置９が生成する突き合わせ情報２２の一例を示す図である。知識作成装置９は、健診情報１１、レセプト情報１２、介護情報１３についてそれぞれ単独で知識ベクトル情報２１のベクトル情報２３を作成することもできるが、健診情報１１とレセプト情報１２及び介護情報１３を結合してから知識ベクトル情報２１及びベクトル情報２３を生成することもできる。 FIG. 17 is a diagram showing an example of the matching information 22 generated by the knowledge creation device 9 of the data center DC # 1. The knowledge creation device 9 can independently create the vector information 23 of the knowledge vector information 21 for the medical examination information 11, the receipt information 12, and the care information 13, but the medical examination information 11, the receipt information 12, and the care information 13 can be created independently. It is also possible to generate the knowledge vector information 21 and the vector information 23 after combining the above.

図１７の例は、健診情報１１とレセプト情報１２と介護情報１３を、個人ＩＤ１１２、１２２、１３２を突き合わせて結合した例を示す。事例２２１には、個人ＩＤ１１２、１２２、１３２の値が格納される。 The example of FIG. 17 shows an example in which the medical examination information 11, the receipt information 12, and the long-term care information 13 are combined by matching the personal IDs 112, 122, and 132. In the case 221, the values of the personal IDs 112, 122, and 132 are stored.

レセプト情報２２２には、事例２２１の個人ＩＤ（１２２）に対応するレセプト情報１２のレコードの値が格納される。介護情報２２３には、事例２２１の個人ＩＤ（１３２）に対応する介護情報１３のレコードの値が格納される。健診情報２２４には、事例２２１の個人ＩＤ（１１２）に対応する健診情報１１のレコードの値が格納される。 The receipt information 222 stores the value of the record of the receipt information 12 corresponding to the personal ID (122) of the case 221. In the long-term care information 223, the value of the record of the long-term care information 13 corresponding to the personal ID (132) of the case 221 is stored. The medical examination information 224 stores the value of the record of the medical examination information 11 corresponding to the personal ID (112) of the case 221.

知識作成装置９は、健診情報１１とレセプト情報１２及び介護情報１３を、個人ＩＤで突き合わせて結合した突き合わせ情報２２を生成してから、ベクトル変換を実施する。なお、本実施例では、突き合わせ情報２２の、レセプト情報２２２と介護情報２２３と健診情報２２４のカラムの数ｎを、ベクトル変換前の次元数ｎとする。 The knowledge creation device 9 performs vector conversion after generating the matching information 22 in which the medical examination information 11, the receipt information 12, and the nursing care information 13 are matched by the personal ID and combined. In this embodiment, the number n of the columns of the receipt information 222, the long-term care information 223, and the medical examination information 224 of the matching information 22 is set to the number of dimensions n before vector conversion.

なお、知識作成装置９は、ベクトル情報２３を生成する際に、データベース１から読み込むデータソースと、年度（又は期間）と、地域等を指定し、これらの指定を知識ベクトル情報２１に設定する。 The knowledge creation device 9 specifies a data source to be read from the database 1, a year (or period), a region, and the like when generating the vector information 23, and sets these designations in the knowledge vector information 21.

図１８は、知識作成装置９で生成されるベクトル情報２３の一例を示す図である。ベクトル情報２３は、知識ベクトル作成部９３が分散表現により、突き合わせ情報２２をベクトル値に変換した情報である。 FIG. 18 is a diagram showing an example of the vector information 23 generated by the knowledge creation device 9. The vector information 23 is information obtained by converting the matching information 22 into a vector value by the knowledge vector creating unit 93 by a distributed representation.

ベクトル情報２３は、知識ＩＤ２３１と、項目２３２と、ベクトル値２３３を一つのレコードに含む。知識ＩＤ２３１は、知識ベクトル情報２１が管理するグループを示す識別子が格納される。項目２３２は、突き合わせ情報２２のフィールド名（レセプト情報２２２、介護情報２２３、健診情報２２４の項目名）が格納される。 The vector information 23 includes the knowledge ID 231 and the item 232 and the vector value 233 in one record. The knowledge ID 231 stores an identifier indicating a group managed by the knowledge vector information 21. Item 232 stores the field name of the matching information 22 (item name of receipt information 222, long-term care information 223, and medical examination information 224).

図示の例では、図１７に示したｎ次元の突き合わせ情報２２を、ｍ次元の分散表現によりベクトル化した例を示す。 In the illustrated example, the n-dimensional matching information 22 shown in FIG. 17 is vectorized by an m-dimensional distributed representation.

図１９は、知識作成装置９で生成される知識ベクトル情報２１の一例を示す図である。知識ベクトル情報２１は、知識ＩＤ２１１と、データソース２１２と、年度２１３と、地域２１４を一つのレコードに含む。 FIG. 19 is a diagram showing an example of the knowledge vector information 21 generated by the knowledge creation device 9. The knowledge vector information 21 includes the knowledge ID 211, the data source 212, the year 213, and the region 214 in one record.

知識ＩＤ２１１は、知識ベクトル作成部９３が付与したベクトル情報２３のグループを示す識別子で、図２１の知識ＩＤ２３１に対応する。データソース２１２は、知識ＩＤ２１１のベクトル値を生成する際の入力となったデータベース１の情報を格納する。年度２１３には、データベース１のデータソースからデータを選択した年度が格納される。地域２１４には、データベース１のデータソースからデータを選択した地域が格納される。 The knowledge ID 211 is an identifier indicating a group of vector information 23 assigned by the knowledge vector creation unit 93, and corresponds to the knowledge ID 231 in FIG. The data source 212 stores the information of the database 1 that is input when generating the vector value of the knowledge ID 211. In year 213, the year in which data is selected from the data source of database 1 is stored. The area 214 stores the area in which data is selected from the data source of the database 1.

知識ベクトル情報２１は、ベクトル情報２３のグループを知識ＩＤ２３１によって、ベクトル情報２３のデータソースと年度と地域を管理する。シミュレーション装置６、８からベクトル情報２３を利用する際には、知識ベクトル情報２１を参照して、データソース２１２と、年度２１３と、地域２１４等から所望の知識ＩＤ２１１を選択し、選択した知識ＩＤ２１１の値に一致するベクトル情報２３の知識ＩＤ２３１のレコードを取得すればよい。 The knowledge vector information 21 manages the data source, the year, and the region of the vector information 23 by the knowledge ID 231 for the group of the vector information 23. When the vector information 23 is used from the simulation devices 6 and 8, the desired knowledge ID 211 is selected from the data source 212, the year 213, the region 214, and the like with reference to the knowledge vector information 21, and the selected knowledge ID 211 is selected. The record of the knowledge ID 231 of the vector information 23 that matches the value of may be acquired.

なお、本実施例では、知識ベクトル情報２１とベクトル情報２３を分ける例を示したが、データソース２１２と、年度２１３と、地域２１４をベクトル情報２３に加えて、一つのテーブルとしてもよい。 In this embodiment, the knowledge vector information 21 and the vector information 23 are separated from each other, but the data source 212, the year 213, and the area 214 may be added to the vector information 23 to form one table.

図２０は、モデル生成装置７で生成されるモデル情報４１の一例を示す図である。モデル情報４１は、モデルＩＤ４１１と、モデル４１２と、知識ＩＤ４１３と、施策ＩＤ４１４と、実績４１５とを一つのレコードに含む。 FIG. 20 is a diagram showing an example of model information 41 generated by the model generation device 7. The model information 41 includes the model ID 411, the model 412, the knowledge ID 413, the measure ID 414, and the actual result 415 in one record.

モデルＩＤ４１１には、モデル生成装置７が設定した識別子が格納される。モデル４１２には、モデル生成装置７で生成されたモデルが格納される。知識ＩＤ４１３には、モデルの生成時にデータベース２から取得した知識ベクトル情報２１の知識ＩＤ２１１の値が格納される。 The model ID 411 stores an identifier set by the model generation device 7. The model 412 stores the model generated by the model generation device 7. The knowledge ID 413 stores the value of the knowledge ID 211 of the knowledge vector information 21 acquired from the database 2 when the model is generated.

施策ＩＤ４１４には、地域Ａ２で実施された施策の識別子が格納される。施策ＩＤ４１４は、図１６Ａの施策情報３４の施策ＩＤ３４１に対応する。実績４１５には、施策を実施した結果が格納される。実績４１５は、図１６Ａの目標達成３４４に対応し、目標達成３４４の値が「１」であれば実績４１５には「目標達成」が設定され、目標達成３４４の値が「０」であれば実績４１５には「目標未達」が設定される。 The measure ID 414 stores the identifier of the measure implemented in the area A2. The measure ID 414 corresponds to the measure ID 341 of the measure information 34 of FIG. 16A. The result of implementing the measure is stored in the result 415. The actual result 415 corresponds to the goal achievement 344 of FIG. 16A, and if the value of the goal achievement 344 is "1", "target achievement" is set for the actual result 415, and if the value of the goal achievement 344 is "0". "Unachieved target" is set for the achievement 415.

シミュレーション装置６、８は、モデル情報４１を参照することで、目標を達成した施策とそうでない施策を判断することができる。 By referring to the model information 41, the simulation devices 6 and 8 can determine which measures have achieved the target and which measures have not.

＜処理＞
図３は、データセンタＤＣ＃１の知識作成装置９で行われる知識ベクトル情報２１の作成処理の一例を示すフローチャートである。この処理は、知識作成装置９の利用者の指令に応じて開始される。 <Processing>
FIG. 3 is a flowchart showing an example of the knowledge vector information 21 creation process performed by the knowledge creation device 9 of the data center DC # 1. This process is started in response to a command from the user of the knowledge creation device 9.

まず、知識作成装置９は、データベース１のヘルスケアデータ（保健医療情報）から利用者が指定したデータを取得する（Ｓ１）。知識作成装置９は、データベース１から読み込むテーブルと、年度（又は期間）と、対象とする地域などを受け付けて、指定された内容のデータ（ヘルスケアデータ）をデータベース２から取得する。 First, the knowledge creation device 9 acquires data specified by the user from the healthcare data (health care information) of the database 1 (S1). The knowledge creation device 9 receives a table to be read from the database 1, a year (or period), a target area, and the like, and acquires data (healthcare data) having a specified content from the database 2.

なお、知識作成装置９は、データソースが複数指定された場合には、図１７に示したように、個人ＩＤ（事例ＩＤ２２１）を突き合わせて、複数のテーブルを結合した突き合わせ情報２２を生成する。 When a plurality of data sources are specified, the knowledge creation device 9 collates personal IDs (case IDs 221) and generates matching information 22 by combining a plurality of tables, as shown in FIG.

次に、知識作成装置９の知識ベクトル作成部９３は、データベース２から取得したデータソース（突き合わせ情報２２）について、保健医療データの間の相互の関係を分散表現によってベクトル化して、ベクトル情報２３を生成する（Ｓ２）。知識作成装置９は、図１８で示したように、ｎ次元の項目２３２に対してｍ次元のベクトル値を算出し、各項目２３２に知識ＩＤ２３２を付与する。 Next, the knowledge vector creation unit 93 of the knowledge creation device 9 vectorizes the mutual relationship between the health care data with respect to the data source (matching information 22) acquired from the database 2 by a distributed representation, and obtains the vector information 23. Generate (S2). As shown in FIG. 18, the knowledge creation device 9 calculates an m-dimensional vector value for the n-dimensional item 232 and assigns the knowledge ID 232 to each item 232.

知識作成装置９は、生成したベクトル情報２３の知識ＩＤ２３１とデータソースと年度と地域などの入力情報から、知識ベクトル情報２１を生成する。そして、知識作成装置９は、生成した知識ベクトル情報２１とベクトル情報２３をデータベース２へ格納する（Ｓ３）。 The knowledge creation device 9 generates knowledge vector information 21 from the knowledge ID 231 of the generated vector information 23, data source, and input information such as year and region. Then, the knowledge creation device 9 stores the generated knowledge vector information 21 and vector information 23 in the database 2 (S3).

保健医療データの間の相互の関係を分散表現によってベクトル化する手法の一例として、Ｓｋｉｐ−ｇｒａｍモデルなどが知られている。本実施例でも、Ｓｋｉｐ−ｇｒａｍモデルを用語のベクトル表現生成に用いることができる。 The Skip-gram model is known as an example of a method for vectorizing mutual relationships between health care data by a distributed representation. Also in this embodiment, the Skip-gram model can be used to generate a vector representation of terms.

図２４は、Ｓｋｉｐ−ｇｒａｍモデルを用いた用語のベクトル表現の生成の一例を示す模式図である。 FIG. 24 is a schematic diagram showing an example of generating a vector representation of terms using the Skip-gram model.

図２４に、本実施例で用いるモデルの模式図を示した。知識ベクトル作成部９３は、このモデルのパラメータの初期値を設定する。以下では、まず、図２を用いて、本実施例で用いるモデルについて説明する。本実施例では、図２４に示す２層のモデルを用いる。 FIG. 24 shows a schematic diagram of the model used in this embodiment. The knowledge vector creation unit 93 sets the initial values of the parameters of this model. In the following, first, the model used in this embodiment will be described with reference to FIG. In this embodiment, the two-layer model shown in FIG. 24 is used.

図２４の左側にｘで示すワンホットベクトル４０１は、用語を示す一つの要素のみが１、他の要素は全て０となるベクトルである。ここでは、ｘは列ベクトルとする。次元数は、用語表現ベクトルを作成する対象となる用語の数と等しくなる。すなわち、対象となる全用語をｚ１、ｚ２、…、ｚｎとすると、ベクトルｘはｎ次元となる、ｚｐを表すワンホットベクトルは、ｐ番目の要素のみが１、他の要素が０である列ベクトルｘ＝（０、…、０、１、０、…、０）＾Ｔとなる。ここで、ｘ＾Ｔは、ベクトルｘの転置を表す。 The one-hot vector 401 indicated by x on the left side of FIG. 24 is a vector in which only one element indicating a term is 1 and all other elements are 0. Here, x is a column vector. The number of dimensions is equal to the number of terms for which the term representation vector is created. That is, if all the target terms are z1, z2, ..., Zn, the vector x is n-dimensional. The one-hot vector representing zp is a matrix in which only the p-th element is 1 and the other elements are 0. The vector x = (0, ..., 0, 1, 0, ..., 0) ^ T. Here, x ^ T represents the transpose of the vector x.

いま、簡単のため、対象となる用語が医薬品Ａ、医薬品Ｂ、医薬品Ｃの三つとすると、ベクトルｘは３次元となる。医薬品Ａ、医薬品Ｂ、医薬品Ｃのそれぞれが対応する要素を第一要素、第二要素、第三要素とすると、医薬品Ａはｘ＝（１、０、０）＾Ｔ、医薬品Ｂはｘ＝（０、１、０）＾Ｔ、医薬品Ｃはｘ＝（０、０、１）＾Ｔと表される。 Now, for the sake of simplicity, assuming that the target terms are drug A, drug B, and drug C, the vector x is three-dimensional. Assuming that the corresponding elements of drug A, drug B, and drug C are the first element, the second element, and the third element, drug A is x = (1, 0, 0) ^ T, and drug B is x = ( 0, 1, 0) ^ T and drug C are represented as x = (0, 0, 1) ^ T.

前述では、用語が三つの例を示したが、実際には、全ての傷病名、医薬品名、診療行為名は膨大な数となるため、ワンホットベクトル４０１の次元数も大きくなる。 In the above, three examples of terms are shown, but in reality, all the names of injuries and illnesses, drug names, and medical practice names are enormous, so the number of dimensions of the one-hot vector 401 is also large.

いま、データベース２から取得したデータソース（突き合わせ情報２２）から一つのレコードを選択する。突き合わせ情報２２のレセプトの傷病名、医薬品、診療行為の全用語をｗ１、ｗ２、…、ｗＴとする。また、それぞれに対応するワンホットベクトルをｘ１、ｘ２、…、ｘＴとする。このとき、図2４は、各用語ｗｉのワンホットベクトルｘｋから、それと共起するｗｉ以外の用語ｗ１、…、ｗｉ−１、ｗｉ＋１、…、ｗＴのワンホットベクトルｘ１、…、ｘｉ−１、ｘｉ＋１、…、ｘＴを予測するモデルである。行列ＷとＷ'がこのモデルのパラメータであり、これらのパラメータをレセプトデータに基づいて予測の精度が向上するように更新する。 Now, one record is selected from the data source (matching information 22) acquired from the database 2. All terms of the injury / illness name, medicine, and medical practice of the receipt of the matching information 22 are w1, w2, ..., WT. Further, the one-hot vectors corresponding to each are x1, x2, ..., XT. At this time, FIG. 24 shows the one-hot vector xk of each term wi, and the terms w1, ..., wi-1, wi + 1, ..., wT one-hot vector x1, ..., xi-1, which co-occur with it. It is a model that predicts xi + 1, ..., XT. The matrices W and W'are the parameters of this model, and these parameters are updated based on the receipt data to improve the accuracy of the prediction.

図２４では、まず、用語ｗｉのｎ次元のワンホットベクトルｘｉを、ｍ×ｎ行列Ｗにより、ｍ次元ベクトルＷｘｉに変換する。ｍは、用語表現ベクトルの次元数であり、予め定めておく。数百次元程度の大きさに定めるとよい。ｍは、全用語数と等しいｎと比較して、一般的に小さい値であり、Ｗｘｉは、ｘｉをｍ次元ベクトルで、エンコードしたものとみなすことができる。これを用いて、ｘｋが表す用語と共起する用語ｗ１、…、ｗｉ−１、ｗｉ＋１、…、ｗＴのｎ次元のワンホットベクトルｘ１、…、ｘｉ−１、ｘｉ＋１、…、ｘＴを復元することを考える。そのために、ｎ×ｍ行列Ｗ’によりｎ次元ベクトルＷ’Ｗｘに変換する。 In FIG. 24, first, the n-dimensional one-hot vector xi of the term wi is converted into the m-dimensional vector Wxi by the m × n matrix W. m is the number of dimensions of the term expression vector and is predetermined. It is recommended to set the size to several hundred dimensions. m is generally a small value compared to n, which is equal to the total number of terms, and Wxi can be regarded as encoding xi with an m-dimensional vector. Using this, the terms w1, ..., wi-1, wi + 1, ..., wT n-dimensional one-hot vector x1, ..., xi-1, xi + 1, ..., XT that co-occur with the term represented by xk are restored. Think about it. Therefore, it is converted into an n-dimensional vector W'Wx by the n × m matrix W'.

ここで、ｗｊのワンホットベクトルｘｊとＷ’Ｗｘｉとの類似度として、これらの内積を計算する。ｘｉはＩｉ番目の要素が１で他の要素が０であるとする。また、ｍ次元列ベクトルｖｋを用いて、Ｗ＝（ｖ１、ｖ２、…、ｖｎ）とする。また、ｍ次元列ベクトルｖ’ｋを用いて、Ｗ’＝（ｖ’１、ｖ’２、…、ｖ’ｎ）＾Ｔとする。このとき、Ｗｘｉ＝ｖＩｉとなる。また、ｘｊはＩｊ番目の要素が１で他の要素が０であるとする。このとき、ｘｊとＷ’Ｗｘｊの内積は、ｖ’ＩｊとｖＩｉの内積ｖ’Ｉｊ・ｖＩｉに等しくなる。このように、ｘｊとＷ’Ｗｘｉの内積を計算するには、実際には、ｍ次元ベクトル同士の内積ｖ’Ｉｊ・ｖＩｉを計算すればよい。 Here, the inner product of these is calculated as the degree of similarity between the one-hot vector xj of wj and W'Wxi. As for xi, it is assumed that the Iith element is 1 and the other elements are 0. Further, using the m-dimensional matrix vector vk, W = (v1, v2, ..., Vn). Further, using the m-dimensional matrix vector v'k, W'= (v'1, v'2, ..., V'n) ^ T. At this time, Wxi = vIi. Further, it is assumed that the Ijth element of xj is 1 and the other elements are 0. At this time, the inner product of xj and W'Wxj is equal to the inner product v'Ij · vIi of v'Ij and vIi. In this way, in order to calculate the inner product of xj and W'Wxi, the inner product v'Ij · vIi of the m-dimensional vectors may actually be calculated.

いま、ソフトマックス関数を用いて下記のように定める。ここで、Σ＿ｐは、ｐについて１からｎまで加算するものである。この計算は、全ての用語に対して行う必要があるので、計算量が大きく、実際には、ＨｉｅｒａｒｃｈｉｃａｌＳｏｆｔｍａｘやＮＥＧ（Ｎｅｇａｔｉｖｅｓａｍｐｌｉｎｇ）を用いるとよい。
ｐ（ｗｊ｜ｗｉ）＝ｅｘｐ（ｖ’Ｉｊ・ｖＩｉ）／Σ＿ｐｅｘｐ（ｖ’ｐ・ｖＩｉ） Now, using the softmax function, it is defined as follows. Here, Σ_p adds 1 to n with respect to p. Since this calculation needs to be performed for all terms, the amount of calculation is large, and in practice, it is preferable to use Hierarchical Softmax or NEG (Negative sampling).
p (wj | wi) = exp (v'Ij · vIi) / Σ_p exp (v'p · vIi)

このモデルでは、突き合わせ情報２２を用いて、共起する用語を再現できる対数確率値である下記の関数を最大化するように、ＷとＷ’を更新する。なお、下記で、Σ＿ｉは、ｉについて１からＴまで、Σ＿ｊ≠ｉはｊについてｉを除く１からＴまでを加算するものである。
（１／Ｔ）Σ＿ｉ Σ＿ｊ≠ｉｌｏｇ｛ｐ（ｗｊ｜ｗｉ）｝ In this model, the matching information 22 is used to update W and W'so as to maximize the following function, which is a logarithmic probability value that can reproduce co-occurrence terms. In the following, Σ_i adds 1 to T for i, and Σ_j ≠ i adds 1 to T excluding i for j.
(1 / T) Σ_i Σ_j ≠ i log {p (wj | wi)}

突き合わせ情報２２ごとに、上記の対数確率値を最大化するように、勾配降下法によってＷとＷ’を繰り返し更新する。このとき、用語ｗのワンホットベクトルがｉ番目の値が１で他の要素が０となるベクトルｘであるとき、Ｗｘ＝ｖｉが用語ｗの用語表現ベクトルとなる。 For each matching information 22, W and W'are repeatedly updated by the gradient descent method so as to maximize the above logarithmic probability value. At this time, when the one-hot vector of the term w is a vector x in which the i-th value is 1 and the other elements are 0, Wx = vi is the term expression vector of the term w.

ここでは、１つ目の層を入力層、２つ目の層を出力層と呼び、Ｗを入力層行列、Ｗ’を出力層行列、ｘを入力ベクトル、Ｗｘを隠れベクトルを呼ぶ。 Here, the first layer is called an input layer, the second layer is called an output layer, W is called an input layer matrix, W'is called an output layer matrix, x is called an input vector, and Wx is called a hidden vector.

上記処理により、指定されたヘルスケアデータからｍ次元のベクトル値を有するベクトル情報２３と、ベクトル情報２３のカタログ情報としての知識ベクトル情報２１が生成されて、データベース２に格納される。 By the above processing, vector information 23 having an m-dimensional vector value and knowledge vector information 21 as catalog information of the vector information 23 are generated from the designated healthcare data and stored in the database 2.

図４は、データセンタＤＣ＃２のモデル生成装置７で行われるモデル生成処理の一例を示すフローチャートである。この処理は、モデル生成装置７の利用者の指令に応じて開始される。 FIG. 4 is a flowchart showing an example of a model generation process performed by the model generation device 7 of the data center DC # 2. This process is started in response to a command from the user of the model generator 7.

モデル生成装置７は、データセンタＤＣ＃１のデータベース２から利用者が選択した知識ベクトル情報２１と、データセンタＤＣ＃２のデータベース３から利用者が選択したヘルスケアデータ情報（健診情報３１、レセプト情報３２又は介護情報３３）と施策情報３４を受け付ける（Ｓ１１）。 The model generator 7 includes knowledge vector information 21 selected by the user from database 2 of data center DC # 1 and healthcare data information selected by the user from database 3 of data center DC # 2 (medical examination information 31, Receives receipt information 32 or care information 33) and measure information 34 (S11).

モデル生成装置７は、データセンタＤＣ＃１の知識ベクトル情報２１の知識ＩＤ２１１に対応する知識ＩＤ２３１のデータをベクトル情報２３から取得する。また、モデル生成装置７は、利用者が選択したテーブルと年度と地域に該当するヘルスケアデータをデータベース３から読み込む。なお、モデル生成装置７は、複数のヘルスケアデータを指定された場合には、図１７で示したように、個人ＩＤ（事例ＩＤ２２１）で突き合わせを行ってテーブルを結合した突き合わせ情報２２を生成する。また、モデル生成装置７は、利用者が選択した施策情報３４の施策ＩＤ４１４を取得する。 The model generation device 7 acquires the data of the knowledge ID 231 corresponding to the knowledge ID 211 of the knowledge vector information 21 of the data center DC # 1 from the vector information 23. In addition, the model generator 7 reads the table selected by the user and the healthcare data corresponding to the year and region from the database 3. When a plurality of healthcare data are specified, the model generation device 7 performs matching with an individual ID (case ID 221) and generates matching information 22 in which tables are combined, as shown in FIG. .. Further, the model generation device 7 acquires the measure ID 414 of the measure information 34 selected by the user.

次に、モデル生成装置７のベクトル変換部７３が、上記取得したベクトル情報２３を用いて、取得したヘルスケアデータ（健診情報３１、レセプト情報３２、介護情報３３又は突き合わせ情報２２）をベクトル化する（Ｓ１２）。 Next, the vector conversion unit 73 of the model generation device 7 vectorizes the acquired healthcare data (health examination information 31, receipt information 32, nursing care information 33 or matching information 22) using the acquired vector information 23. (S12).

ベクトル変換部７３は、例えば、ベクトル情報２３の項目と一致するヘルスケアデータの項目について、ｍ次元のベクトル値２３３を与えて、分散表現によりベクトル値を算出する。なお、データベース３のヘルスケアデータ（健診情報３１、レセプト情報３２、介護情報３３）のベクトル化は、これに限定されるものではなく、ベクトル情報２３（知識ベクトル情報２１）を用いてベクトル化すればよく、周知又は公知の技術を適用すればよい。 For example, the vector conversion unit 73 gives an m-dimensional vector value 233 to the item of healthcare data that matches the item of the vector information 23, and calculates the vector value by the distributed representation. The vectorization of the healthcare data (medical examination information 31, receipt information 32, care information 33) in the database 3 is not limited to this, and is vectorized using the vector information 23 (knowledge vector information 21). This may be done, and a well-known or known technique may be applied.

次に、モデル生成装置７の機械学習部７４が、ベクトル化されたヘルスケアデータを入力して、ヘルスケアデータと施策情報３４の関係のモデルを生成する（Ｓ１３）。機械学習部７４は、例えば、施策情報３４の目標達成３４４の比率（「１」の割合）や施策効果情報３４１０の実測効果３４１５を目的変数として、学習を実行してモデルを生成する。 Next, the machine learning unit 74 of the model generation device 7 inputs the vectorized healthcare data to generate a model of the relationship between the healthcare data and the measure information 34 (S13). For example, the machine learning unit 74 executes learning and generates a model by using the ratio of the target achievement 344 of the measure information 34 (the ratio of “1”) and the actually measured effect 3415 of the measure effect information 3410 as objective variables.

モデル生成装置７は、生成されたモデルをデータセンタＤＣ＃１のデータベース４にモデル情報４１として格納し（Ｓ１４）、データセンタＤＣ＃１、ＤＣ＃２のシミュレーション装置６、８でモデル情報４１を共有する。 The model generation device 7 stores the generated model in the database 4 of the data center DC # 1 as model information 41 (S14), and stores the model information 41 in the simulation devices 6 and 8 of the data centers DC # 1 and DC # 2. share it.

上記処理によって、モデル生成装置７は、データセンタＤＣ＃１で生成したベクトル情報２３を使用して、データセンタＤＣ＃２のデータベース２に格納されたヘルスケアデータをベクトル化し、機械学習部７４は指定された施策情報３４を目的変数としてベクトル化したヘルスケアデータからモデル情報４１を生成することができる。 By the above processing, the model generator 7 uses the vector information 23 generated in the data center DC # 1 to vectorize the healthcare data stored in the database 2 of the data center DC # 2, and the machine learning unit 74 uses the vector information 23. Model information 41 can be generated from healthcare data vectorized using the designated measure information 34 as an objective variable.

また、全国レベルのベクトル情報２３を、地域Ａ２のヘルスケアデータのベクトル化に適用することで、生成されるモデル情報４１（予測モデル）の精度を向上させることができる。 Further, by applying the national level vector information 23 to the vectorization of the healthcare data of the region A2, the accuracy of the generated model information 41 (prediction model) can be improved.

図５は、データセンタＤＣ＃３のシミュレーション装置８で行われるシミュレーション処理の一例を示すフローチャートである。この処理は、シミュレーション装置８の利用者の指令に応じて開始される。 FIG. 5 is a flowchart showing an example of simulation processing performed by the simulation device 8 of the data center DC # 3. This process is started in response to a command from the user of the simulation device 8.

シミュレーション装置８は、データセンタＤＣ＃１のデータベース２から利用者が選択した知識ベクトル情報２１及びモデル情報４１と、データセンタＤＣ＃３のデータベース５から利用者が選択したヘルスケアデータ（健診情報５１、レセプト情報５２又は介護情報５３）とを受け付ける（Ｓ２１）。 The simulation device 8 includes knowledge vector information 21 and model information 41 selected by the user from the database 2 of the data center DC # 1, and healthcare data (medical examination information) selected by the user from the database 5 of the data center DC # 3. 51, receipt information 52 or care information 53) is accepted (S21).

シミュレーション装置８は、データセンタＤＣ＃１の知識ベクトル情報２１の知識ＩＤ２１１に対応する知識ＩＤ２３１のベクトル情報２３を取得する。また、シミュレーション装置８は、利用者が選択したテーブルと年度と地域に該当するヘルスケアデータをデータベース３から読み込む。 The simulation device 8 acquires the vector information 23 of the knowledge ID 231 corresponding to the knowledge ID 211 of the knowledge vector information 21 of the data center DC # 1. In addition, the simulation device 8 reads the table selected by the user and the healthcare data corresponding to the year and region from the database 3.

なお、シミュレーション装置８は、データベース５のデータソースが複数指定された場合には、図１７に示したように、個人ＩＤ（事例ＩＤ２２１）を突き合わせて、複数のテーブルを結合した突き合わせ情報２２を生成する。また、シミュレーション装置８は、上記選択されたモデル情報４１をデータセンタＤＣ＃１のデータベース４から取得する。なお、モデル情報４１の取得は、シミュレーション装置８の利用者が、出力装置（図示省略）に表示されたモデル選択画面５５０から選択したモデル情報４１を使用する。 When a plurality of data sources of the database 5 are specified, the simulation device 8 matches the personal IDs (case IDs 221) and generates the matching information 22 by combining the plurality of tables, as shown in FIG. To do. Further, the simulation device 8 acquires the selected model information 41 from the database 4 of the data center DC # 1. The model information 41 is acquired by using the model information 41 selected by the user of the simulation device 8 from the model selection screen 550 displayed on the output device (not shown).

図２１は、シミュレーション装置８が出力装置（図示省略）へ表示するモデル選択画面５５０の一例を示す図である。モデル選択画面５５０は、モデル情報４１を生成した地域のコードを入力する地域５５１と、モデル情報４１を生成したデータベースの期間を入力する年度５５２と、モデル選択領域５６０と、シミュレーション実行ボタン５５３を含む。 FIG. 21 is a diagram showing an example of the model selection screen 550 displayed on the output device (not shown) by the simulation device 8. The model selection screen 550 includes an area 551 for inputting the code of the area for generating the model information 41, a year 552 for inputting the period of the database for generating the model information 41, a model selection area 560, and a simulation execution button 553. ..

モデル選択領域５６０は、チェックボックス５６１と、施策ＩＤ５６２と、施策名称５６３と、実施年度５６４と、地域番号５６５と、効果指標５６６と、実測効果５６７を有する。シミュレーション装置８の利用者は、所望の施策ＩＤ５６２のモデルのチェックボックス５６１をチェックすることで、モデル情報４１を選択することができる。 The model selection area 560 has a check box 561, a measure ID 562, a measure name 563, an implementation year 564, an area number 565, an effect index 566, and an actual measurement effect 567. The user of the simulation device 8 can select the model information 41 by checking the check box 561 of the model of the desired measure ID 562.

なお、施策ＩＤ５６２は、モデル情報４１の施策ＩＤ４１４に対応し、施策名称５６３は、施策情報３４の施策名３４２に対応し、実施年度５６４は、施策効果情報３４１０の実施年度３４１３に対応し、地域番号５６５は、施策情報３４の地域ＩＤ３４０に対応し、効果指標５６６は、施策効果情報３４１０の効果指標３４１４に対応し、実測効果５６７は施策効果情報３４１０の実測効果３４１５に対応する。 The measure ID 562 corresponds to the measure ID 414 of the model information 41, the measure name 563 corresponds to the measure name 342 of the measure information 34, and the implementation year 564 corresponds to the implementation year 3413 of the measure effect information 3410. The number 565 corresponds to the area ID 340 of the measure information 34, the effect index 566 corresponds to the effect index 3414 of the measure effect information 3410, and the actual measurement effect 567 corresponds to the actual measurement effect 3415 of the measure effect information 3410.

次に、シミュレーション装置８のベクトル変換部８４が、上記取得したベクトル情報２３を用いて、取得したヘルスケアデータ（又は突き合わせ情報２２）をベクトル化する（Ｓ２２）。この処理は、上記ステップＳ１２のベクトル変換部７３の処理と同様である。 Next, the vector conversion unit 84 of the simulation device 8 vectorizes the acquired healthcare data (or matching information 22) using the acquired vector information 23 (S22). This process is the same as the process of the vector conversion unit 73 in step S12.

次に、シミュレーション装置８の推定演算部８５が、ベクトル化されたヘルスケアデータ（又は突き合わせ情報２２）を入力として、上記取得したモデル情報４１を用いてシミュレーションを実施する（Ｓ２３）。 Next, the estimation calculation unit 85 of the simulation device 8 inputs the vectorized healthcare data (or matching information 22) and executes a simulation using the acquired model information 41 (S23).

シミュレーション装置８の出力部８３は、推定演算部８５のシミュレーション結果を出力装置（図示省略）へ表示する。この処理によって、地域Ａ３のシミュレーション装置８では、データベース５のヘルスケアデータで、モデル情報４１の施策ＩＤ４１４に対応する地域Ａ２の施策を実施した場合の効果を推定することができる。 The output unit 83 of the simulation device 8 displays the simulation result of the estimation calculation unit 85 on the output device (not shown). By this processing, in the simulation device 8 of the area A3, it is possible to estimate the effect when the measure of the area A2 corresponding to the measure ID 414 of the model information 41 is implemented from the healthcare data of the database 5.

図２２は、シミュレーション装置８が出力装置（図示省略）へ表示するシミュレーション結果６１０の画面の一例を示す図である。シミュレーション結果６１０は、地域のコードを表示する地域６１１と、データソースの期間を示す年度６１２と、施策毎の目標達成率を表示するグラフ領域６２０と、施策毎の抑制金額を表示するグラフ領域６３０と、グラフ領域へ出力するシミュレーション結果を選択する領域６４０を含む。目標達成とは、例えば、減量プログラムの目標体重を達成した人数、禁煙プログラムで禁煙を達成できた人数、リハビリや介護の予防などで活動量を向上させることができた人数など、施策が目指す改善の指標の目標達成の人数や割合などである。 FIG. 22 is a diagram showing an example of a screen of the simulation result 610 displayed on the output device (not shown) by the simulation device 8. The simulation result 610 includes a region 611 that displays the region code, a year 612 that indicates the period of the data source, a graph area 620 that displays the target achievement rate for each measure, and a graph area 630 that displays the amount of restraint for each measure. And the area 640 for selecting the simulation result to be output to the graph area. Achieving the goal means, for example, the number of people who achieved the target weight of the weight loss program, the number of people who achieved smoking cessation in the smoking cessation program, the number of people who could improve the amount of activity by preventing rehabilitation and long-term care, etc. The number and percentage of people who achieved the target of the index.

領域６４０は、チェックボックス６４１と、施策ＩＤ６４２と、施策名称６４３と、適用人数６４４と、目標達成６４５と、効果指標６４６と、予測値６４７を含む。シミュレーション装置８の利用者は、所望の施策ＩＤ６４２のシミュレーション結果のチェックボックス６４１をチェックすることで、シミュレーション結果のグラフをグラフ領域６２０、６３０へ表示することができる。 Area 640 includes a check box 641, a measure ID 642, a measure name 643, an applicable number of people 644, a target achievement 645, an effect index 646, and a predicted value 647. The user of the simulation device 8 can display the graph of the simulation result in the graph areas 620 and 630 by checking the check box 641 of the simulation result of the desired measure ID 642.

なお、施策ＩＤ６４２は、モデル情報４１の施策ＩＤ４１４に対応し、施策名称６４３は、施策情報３４の施策名３４２に対応し、適用人数６４４は、シミュレーションに使用したデータベース５のヘルスケアデータの人数に対応し、目標達成（予測）６４５は、適用人数６４４の中で目標達成が可能な人数に対応し、効果指標６４６は、施策効果情報３４１０の効果指標３４１４に対応し、予測値６４７は効果指標６４６の値（シミュレーション結果の金額）に対応する。このように、図２１、図２２で示したように、ある地域で実施した減量プログラムについて、自分の地域（もしくは国）で適用した場合に減量の目標達成や医療費抑制などの効果がどの程度得られるかを、確認することができる。 The measure ID 642 corresponds to the measure ID 414 of the model information 41, the measure name 643 corresponds to the measure name 342 of the measure information 34, and the applicable number 644 corresponds to the number of healthcare data of the database 5 used for the simulation. Correspondingly, the target achievement (forecast) 645 corresponds to the number of people who can achieve the target among the applicable number of people 644, the effect index 646 corresponds to the effect index 3414 of the measure effect information 3410, and the predicted value 647 corresponds to the effect index. It corresponds to a value of 646 (amount of simulation result). In this way, as shown in FIGS. 21 and 22, when the weight loss program implemented in a certain area is applied in one's own area (or country), the effect of achieving the weight loss target and suppressing medical expenses is to what extent. You can check if you can get it.

図６は、データセンタＤＣ＃１のシミュレーション装置６で行われるシミュレーション処理の一例を示すフローチャートである。この処理は、シミュレーション装置６の利用者の指令に応じて開始される。 FIG. 6 is a flowchart showing an example of simulation processing performed by the simulation device 6 of the data center DC # 1. This process is started in response to a command from the user of the simulation device 6.

シミュレーション装置６は、データセンタＤＣ＃１のデータベース２から利用者が選択した知識ベクトル情報２１及びモデル情報４１と、データセンタＤＣ＃１のデータベース１から利用者が選択したヘルスケアデータ（健診情報１１、レセプト情報１２又は介護情報１３）とを受け付ける（Ｓ３１）。 The simulation device 6 includes knowledge vector information 21 and model information 41 selected by the user from the database 2 of the data center DC # 1, and healthcare data (medical examination information) selected by the user from the database 1 of the data center DC # 1. 11. Accepts receipt information 12 or care information 13) (S31).

シミュレーション装置６は、データセンタＤＣ＃１の知識ベクトル情報２１の知識ＩＤ２１１に対応する知識ＩＤ２３１のベクトル情報２３を取得する。また、シミュレーション装置６は、利用者が選択したテーブルと年度と地域に該当するヘルスケアデータをデータベース１から読み込む。 The simulation device 6 acquires the vector information 23 of the knowledge ID 231 corresponding to the knowledge ID 211 of the knowledge vector information 21 of the data center DC # 1. In addition, the simulation device 6 reads the table selected by the user and the healthcare data corresponding to the year and region from the database 1.

なお、シミュレーション装置６は、データベース１のデータソースが複数指定された場合には、図１７に示したように、個人ＩＤ（事例ＩＤ２２１）を突き合わせて、複数のテーブルを結合した突き合わせ情報２２を生成する。また、シミュレーション装置６は、上記図２１と同様にして、選択されたモデル情報４１をデータセンタＤＣ＃１のデータベース４から選択する。 When a plurality of data sources of the database 1 are specified, the simulation device 6 matches the personal IDs (case IDs 221) and generates the matching information 22 by combining the plurality of tables, as shown in FIG. To do. Further, the simulation device 6 selects the selected model information 41 from the database 4 of the data center DC # 1 in the same manner as in FIG. 21.

次に、シミュレーション装置６のベクトル変換部６４が、上記取得したベクトル情報２３を用いて、取得したヘルスケアデータ（又は突き合わせ情報２２）をベクトル化する（Ｓ３２）。この処理は、上記ステップＳ１２のベクトル変換部７３の処理と同様である。 Next, the vector conversion unit 64 of the simulation device 6 vectorizes the acquired healthcare data (or matching information 22) using the acquired vector information 23 (S32). This process is the same as the process of the vector conversion unit 73 in step S12.

次に、シミュレーション装置６の推定演算部６５が、ベクトル化されたヘルスケアデータ（又は突き合わせ情報２２）を入力として、上記取得したモデル情報４１を用いてシミュレーションを実施する（Ｓ３３）。 Next, the estimation calculation unit 65 of the simulation device 6 inputs the vectorized healthcare data (or matching information 22) and performs a simulation using the acquired model information 41 (S33).

シミュレーション装置６の出力部６３は、上記図２２と同様に、推定演算部６５のシミュレーション結果を出力装置（図示省略）へ表示する。この処理によって、国Ａ１のシミュレーション装置６では、データベース１のヘルスケアデータで、モデル情報４１の施策ＩＤ４１４に対応する地域Ａ２の施策を実施した場合の効果を推定することができる。 The output unit 63 of the simulation device 6 displays the simulation result of the estimation calculation unit 65 on the output device (not shown) in the same manner as in FIG. 22. By this processing, in the simulation apparatus 6 of the country A1, the effect when the measure of the region A2 corresponding to the measure ID 414 of the model information 41 is implemented can be estimated from the healthcare data of the database 1.

以上のように、本実施例のヘルスケアデータ分析システムは、全国的なヘルスケアデータからベクトル情報２３及び知識ベクトル情報２１を算出し、このベクトル情報２３と、特定の地域Ａ２の施策情報３４とヘルスケアデータからモデル情報（予測モデル）４１を生成する。このモデル情報４１に他の地域Ａ３（又は国Ａ１）のヘルスケアデータを適用することで、特定の地域Ａ２の施策を他の地域Ａ３（又は国Ａ１）で実施した場合の効果を予測することが可能となる。これにより、ヘルスケア（医療や健診又は介護）に対する新たな施策の作成を支援することが可能となる。特に、国Ａ１や地域Ａ３では、地域Ａ２で実施した施策の結果に関するデータを持たないが、地域Ａ２で作成した予測モデルを、ＤＢ４にモデル情報として集約することで、国Ａ１や地域Ａ３で当該の施策を実施した場合の効果を予測することができる。さらに、地域Ａ２で予測モデルを作成するときに、国Ａ１のような全体のデータの特徴から作成した知識ベクトル情報２１を用いて、地域Ａ２のデータをベクトル化する。これにより、地域Ａ２で作成した予測モデルは、地域Ａ２のデータ特性だけでなく、国Ａ１の全体データにおける傾向を反映したモデルを作成することができる。そして、地域Ａ２で作成したモデルを国Ａ１や地域Ａ３に適用する場合にも、国Ａ１や地域Ａ３のデータを同じ知識ベクトル情報２１でベクトル化した上で予測モデルに適用することで、国Ａ１や地域Ａ３で適用した場合を精度よく予測することができる効果がある。 As described above, the healthcare data analysis system of this embodiment calculates the vector information 23 and the knowledge vector information 21 from the national healthcare data, and the vector information 23 and the measure information 34 of the specific area A2 are used. Model information (prediction model) 41 is generated from healthcare data. By applying the healthcare data of another region A3 (or country A1) to this model information 41, it is possible to predict the effect when the measures of a specific region A2 are implemented in another region A3 (or country A1). Is possible. This makes it possible to support the creation of new measures for health care (medical care, medical examination or long-term care). In particular, country A1 and region A3 do not have data on the results of measures implemented in region A2, but by aggregating the forecast models created in region A2 into DB4 as model information, the country A1 and region A3 are concerned. It is possible to predict the effect of implementing the above measures. Further, when the prediction model is created in the region A2, the data in the region A2 is vectorized by using the knowledge vector information 21 created from the characteristics of the entire data such as the country A1. As a result, the prediction model created in the region A2 can create a model that reflects not only the data characteristics of the region A2 but also the tendency in the overall data of the country A1. Then, even when the model created in the region A2 is applied to the country A1 or the region A3, the data of the country A1 or the region A3 is vectorized by the same knowledge vector information 21 and then applied to the prediction model. It has the effect of being able to accurately predict the case where it is applied in the area A3.

なお、上記実施例では、最上位の地域（又は集団）として国Ａ１のデータセンタＤＣ＃１と、下位の集団として地域Ａ２のデータセンタＤＣ＃２及び地域Ａ３のデータセンタＤＣ＃３の３つのデータセンタを有する例を示したが、これに限定されるものではない。 In the above embodiment, there are three data centers DC # 1 in country A1 as the highest region (or group), data center DC # 2 in region A2 as a lower group, and data center DC # 3 in region A3. An example of having a data center is shown, but the present invention is not limited to this.

例えば、１つのデータセンタに、知識作成装置９、シミュレーション装置６、モデル生成装置７、シミュレーション装置８と、各データベースを配置してもよい。あるいは、知識作成装置９と、シミュレーション装置６、モデル生成装置７、シミュレーション装置８の機能をひとつの計算機で実現し、知識作成部、第１のシミュレーション部、モデル生成部、第２のシミュレーション部としてもよい。 For example, a knowledge creation device 9, a simulation device 6, a model generation device 7, a simulation device 8, and each database may be arranged in one data center. Alternatively, the functions of the knowledge creation device 9, the simulation device 6, the model generation device 7, and the simulation device 8 are realized by one computer, and as the knowledge creation unit, the first simulation unit, the model generation unit, and the second simulation unit. May be good.

＜結び＞
以上のように、上記実施例のヘルスケア（保健医療）データ分析システムは、以下のような構成とすることができる。 <Conclusion>
As described above, the healthcare data analysis system of the above embodiment can have the following configuration.

（１）プロセッサとメモリを有する計算機システムが、保健医療データを分析する保健医療データ分析システムであって、第１の集団（国Ａ１）の第１の保健医療データ（健診情報１１、レセプト情報１２、介護情報１３）を管理する第１のデータベース（ＤＢ＃１）と、前記第１の保健医療データを入力して、前記第１の保健医療データ間の相互の関係をベクトルに変換して知識ベクトル情報（２１）として出力する知識ベクトル作成部（９３）と、前記知識ベクトル情報（２１）を格納する第２のデータベース（ＤＢ＃２）と、第２の集団（地域Ａ２）の第２の保健医療データ（健診情報３１、レセプト情報３２、介護情報３３）と、前記第２の集団（Ａ２）で実施された施策に関する施策情報（３４）と、を管理する第３のデータベース（ＤＢ＃３）と、前記第３のデータベース（ＤＢ＃３）の第２の保健医療データと、前記第２のデータベース（ＤＢ＃２）の知識ベクトル情報（２１）を入力し、前記第２の集団（Ａ２）の第２の保健医療データを前記知識ベクトル情報（２１）を用いてベクトル変換によりベクトル情報を生成する第１のベクトル変換部（７３）と、前記変換されたベクトル情報と、前記第３のデータベース（ＤＢ＃３）の施策情報（３４）を入力し、前記ベクトル情報と施策情報（３４）との関係を機械学習によりモデル化したモデル情報（４１）を出力する機械学習部（７４）と、前記出力されたモデル情報（４１）を管理する第４のデータベース（ＤＢ＃４）と、を有し、前記第２の集団（Ａ２）は、前記第１の集団（Ａ１）を構成する下位の集団で構成され、前記施策情報（３４）は、前記第２の集団（Ａ２）における保健医療の施策の対象者と、施策の結果を含むことを特徴とする保健医療データ分析システム。 (1) A computer system having a processor and a memory is a health care data analysis system that analyzes health care data, and is the first health care data (medical examination information 11, receipt information) of the first group (country A1). 12. The first database (DB # 1) that manages the care information 13) and the first health care data are input, and the mutual relationship between the first health care data is converted into a vector. A knowledge vector creation unit (93) that outputs as knowledge vector information (21), a second database (DB # 2) that stores the knowledge vector information (21), and a second group (region A2). A third database (DB) that manages health and medical data (medical examination information 31, receipt information 32, care information 33) and measure information (34) regarding measures implemented in the second group (A2). # 3), the second health care data of the third database (DB # 3), and the knowledge vector information (21) of the second database (DB # 2) are input, and the second group The first vector conversion unit (73) that generates vector information by vector conversion of the second health care data of (A2) using the knowledge vector information (21), the converted vector information, and the first. The machine learning unit (74) that inputs the measure information (34) of the database (DB # 3) of 3 and outputs the model information (41) that models the relationship between the vector information and the measure information (34) by machine learning. ) And a fourth database (DB # 4) that manages the output model information (41), and the second group (A2) constitutes the first group (A1). The measure information (34) is a health data analysis system characterized by including the target persons of the health care measures in the second group (A2) and the results of the measures.

上記構成により、ヘルスケアデータ分析システムは、全国的なヘルスケアデータからベクトル情報２３及び知識ベクトル情報２１を算出し、このベクトル情報２３と、特定の地域Ａ２の施策情報３４と地域Ａ２のヘルスケアデータからモデル情報（予測モデル）４１を生成する。このモデル情報４１に他の地域Ａ３（又は国Ａ１）のヘルスケアデータを適用することで、特定の地域Ａ２の施策を他の地域Ａ３（又は国Ａ１）で実施した場合の効果を予測することが可能となる。これにより、ヘルスケア（医療や健診又は介護）に対する新たな施策の作成を支援することが可能となる。 With the above configuration, the healthcare data analysis system calculates vector information 23 and knowledge vector information 21 from national healthcare data, and this vector information 23, measure information 34 of specific region A2, and healthcare of region A2. Model information (prediction model) 41 is generated from the data. By applying the healthcare data of another region A3 (or country A1) to this model information 41, it is possible to predict the effect when the measures of a specific region A2 are implemented in another region A3 (or country A1). Is possible. This makes it possible to support the creation of new measures for health care (medical care, medical examination or long-term care).

（２）上記（１）に記載の保健医療データ分析システムであって、第３の集団（Ａ３）の第３の保健医療データ（健診情報５１、レセプト情報５２、介護情報５３）を管理する第５のデータベース（ＤＢ＃５）と、前記第３の保健医療データを前記知識ベクトル情報（２１）を用いてベクトル変換によりベクトル情報を生成する第２のベクトル変換部（８４）と、前記ベクトル変換したベクトル情報を前記モデル情報（４１）に適用してシミュレーションを実施する第１の推定演算部（８５）と、前記シミュレーションの結果を出力する第１の出力部（８３）と、をさらに有し、前記第３の集団（Ａ３）は、前記第１の集団（Ａ１）を構成する下位の集団で、前記第２の集団（Ａ２）とは異なる集団で構成され、前記第１の推定演算部（８５）は、前記第２の集団（Ａ２）で実施した保健医療の施策を、前記第３の集団（Ａ３）で実行した結果を推定することを特徴とする保健医療データ分析システム。 (2) The health care data analysis system according to (1) above manages the third health care data (health care information 51, receipt information 52, care information 53) of the third group (A3). A fifth database (DB # 5), a second vector conversion unit (84) that generates vector information by vector conversion of the third health care data using the knowledge vector information (21), and the vector. It further has a first estimation calculation unit (85) that applies the converted vector information to the model information (41) to perform simulation, and a first output unit (83) that outputs the result of the simulation. The third group (A3) is a lower group that constitutes the first group (A1), is composed of a group different from the second group (A2), and is the first estimation operation. The department (85) is a health care data analysis system characterized by estimating the result of executing the health care measures implemented in the second group (A2) in the third group (A3).

上記構成により、ヘルスケアデータ分析システムは、ベクトル情報２３と、特定の地域Ａ２（第２の集団）の施策情報３４と地域Ａ２（第２の集団）のヘルスケアデータからモデル情報（予測モデル）４１を生成する。このモデル情報４１に他の地域Ａ３（第３の集団）のヘルスケアデータを適用することで、特定の地域Ａ２（第２の集団）の施策を他の地域Ａ３（第３の集団）で実施した場合の効果を予測することが可能となる。 With the above configuration, the healthcare data analysis system uses vector information 23, measure information 34 of a specific area A2 (second group), and model information (prediction model) from healthcare data of area A2 (second group). 41 is generated. By applying the healthcare data of another region A3 (third group) to this model information 41, the measures of a specific region A2 (second group) are implemented in the other region A3 (third group). It is possible to predict the effect of this.

（３）上記（１）に記載の保健医療データ分析システムであって、前記第１の保健医療データを前記知識ベクトル情報（２１）を用いてベクトル変換によりベクトル情報を生成する第３のベクトル変換部（６４）と、前記ベクトル変換したベクトル情報を前記モデル情報（４１）に適用してシミュレーションを実施する第２の推定演算部（６２）と、前記シミュレーションの結果を出力する第２の出力部（６３）と、をさらに有し、前記第２の推定演算部（６５）は、前記第２の集団（Ａ２）で実施した保健医療の施策を、前記第１の集団（Ａ１）で実行した結果を推定することを特徴とする保健医療データ分析システム。 (3) In the health care data analysis system according to (1) above, a third vector conversion that generates vector information by vector conversion of the first health care data using the knowledge vector information (21). A unit (64), a second estimation calculation unit (62) that applies the vector-converted vector information to the model information (41) to perform a simulation, and a second output unit that outputs the result of the simulation. The second estimation calculation unit (65) further has (63) and, and the second estimation calculation unit (65) executed the health care measures implemented in the second group (A2) in the first group (A1). A health care data analysis system characterized by estimating results.

上記構成により、ヘルスケアデータ分析システムは、ベクトル情報２３と、特定の地域Ａ２（第２の集団）の施策情報３４と地域Ａ２（第２の集団）のヘルスケアデータからモデル情報（予測モデル）４１を生成する。このモデル情報４１に国Ａ１（第１の集団）のヘルスケアデータを適用することで、特定の地域Ａ２（第２の集団）の施策を国Ａ１（第１の集団）で実施した場合の効果を予測することが可能となる。 With the above configuration, the healthcare data analysis system uses vector information 23, measure information 34 of a specific area A2 (second group), and model information (prediction model) from healthcare data of area A2 (second group). 41 is generated. By applying the healthcare data of country A1 (first group) to this model information 41, the effect when the measures of a specific area A2 (second group) are implemented in country A1 (first group). Can be predicted.

（４）上記（１）に記載の保健医療データ分析システムであって、前記第１の集団（Ａ１）の第１の保健医療データは、匿名の情報であることを特徴とする保健医療データ分析システム。 (4) The health care data analysis system according to (1) above, wherein the first health care data of the first group (A1) is anonymous information. system.

上記構成により、全国レベルの保健医療データでは、個人情報を隠蔽してデータの利用を行うことで、個人情報の保護を推進することができる。 With the above configuration, it is possible to promote the protection of personal information by concealing personal information and using the data in national level health care data.

（５）上記（２）に記載の保健医療データ分析システムであって、前記第４のデータベース（ＤＢ＃４）のモデル情報（４１）は、前記施策情報（３４）の識別子を含み、前記第１の推定演算部（８５）は、複数のモデル情報（４１）のそれぞれに対する施策の結果を出力し、前記モデル情報（４１）の選択を受け付けて、前記選択されたモデル情報（４１）で第３の集団（Ａ３）のヘルスケアデータのシミュレーションを実施することを特徴とする保健医療データ分析システム。 (5) In the health and medical data analysis system according to (2) above, the model information (41) of the fourth database (DB # 4) includes the identifier of the measure information (34), and the first item. The estimation calculation unit (85) of 1 outputs the result of the measure for each of the plurality of model information (41), accepts the selection of the model information (41), and uses the selected model information (41) as the first. A health care data analysis system characterized by performing simulations of healthcare data of 3 groups (A3).

上記構成により、ヘルスケアデータ分析システムの利用者は、複数のモデル情報（４１）のそれぞれに対する施策の結果を出力装置へ表示させて、所望のモデル情報（４１）を選択し、選択されたモデル情報４１で、他の地域Ａ３（第３の集団）のヘルスケアデータのシミュレーションを行うことができる。 With the above configuration, the user of the healthcare data analysis system displays the result of the measure for each of the plurality of model information (41) on the output device, selects the desired model information (41), and selects the selected model. With information 41, it is possible to simulate the healthcare data of another region A3 (third group).

（６）上記（３）に記載の保健医療データ分析システムであって、前記第４のデータベース（ＤＢ＃４）のモデル情報（４１）は、前記施策情報（３４）の識別子を含み、前記第２の推定演算部（６５）は、複数の前記モデル情報（４１）のそれぞれに対する施策の結果を出力し、前記モデル情報（４１）の選択を受け付けて、前記選択されたモデル情報（４１）でシミュレーションを実施することを特徴とする保健医療データ分析システム。 (6) In the health and medical data analysis system according to (3) above, the model information (41) of the fourth database (DB # 4) includes the identifier of the measure information (34), and the first item. The estimation calculation unit (65) of 2 outputs the result of the measure for each of the plurality of the model information (41), accepts the selection of the model information (41), and uses the selected model information (41). A health and medical data analysis system characterized by performing simulations.

上記構成により、ヘルスケアデータ分析システムの利用者は、複数のモデル情報（４１）のそれぞれに対する施策の結果を出力装置へ表示させて、所望のモデル情報（４１）を選択し、選択されたモデル情報４１で、国Ａ１（第１の集団）のヘルスケアデータのシミュレーションを行うことができる。 With the above configuration, the user of the healthcare data analysis system displays the result of the measure for each of the plurality of model information (41) on the output device, selects the desired model information (41), and selects the selected model. Information 41 can be used to simulate country A1 (first group) healthcare data.

なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に記載したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加、削除、又は置換のいずれもが、単独で、又は組み合わせても適用可能である。 The present invention is not limited to the above-described examples, and includes various modifications. For example, the above-described embodiment is described in detail in order to explain the present invention in an easy-to-understand manner, and is not necessarily limited to the one including all the configurations described. Further, it is possible to replace a part of the configuration of one embodiment with the configuration of another embodiment, and it is also possible to add the configuration of another embodiment to the configuration of one embodiment. Further, for a part of the configurations of each embodiment, any of addition, deletion, or replacement of other configurations can be applied alone or in combination.

また、上記の各構成、機能、処理部、及び処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、上記の各構成、及び機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行することによりソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記録装置、又は、ＩＣカード、ＳＤカード、ＤＶＤ等の記録媒体に置くことができる。 Further, each of the above configurations, functions, processing units, processing means and the like may be realized by hardware by designing a part or all of them by, for example, an integrated circuit. Further, each of the above configurations, functions, and the like may be realized by software by the processor interpreting and executing a program that realizes each function. Information such as programs, tables, and files that realize each function can be stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.

また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際には殆ど全ての構成が相互に接続されていると考えてもよい。 In addition, the control lines and information lines indicate those that are considered necessary for explanation, and do not necessarily indicate all the control lines and information lines in the product. In practice, it can be considered that almost all configurations are interconnected.

１、２、３、４、５データベース
６、８シミュレーション装置
７モデル生成装置
９知識作成装置
１１、３１、５１健診情報
１２、３２、５２レセプト情報
１３、３３、５３介護情報
２１知識ベクトル情報
２３ベクトル情報
３４施策情報
４１モデル情報
６１、７１、８１、９１ＣＰＵ
５２、６２、７２、８２、８２メモリ
６３、８３出力部
６４、８４ベクトル変換部
６５、８２推定演算部
９３知識ベクトル作成部 1, 2, 3, 4, 5 Database 6, 8 Simulation device 7 Model generation device 9 Knowledge creation device 11, 31, 51 Medical examination information 12, 32, 52 Receipt information 13, 33, 53 Nursing information 21 Knowledge vector information 23 Vector information 34 Measure information 41 Model information 61, 71, 81, 91 CPU
52, 62, 72, 82, 82 Memory 63, 83 Output unit 64, 84 Vector conversion unit 65, 82 Estimation calculation unit 93 Knowledge vector creation unit

Claims

A computer system with a processor and memory is a health care data analysis system that analyzes health care data.
A first database that manages the first health care data of the first group,
A knowledge vector creation unit that inputs the first health care data, converts the mutual relationship between the first health care data into a vector, and outputs it as knowledge vector information.
A second database that stores the knowledge vector information,
A third database that manages the second health care data of the second group and the measure information regarding the measures implemented in the second group.
The second health care data of the third database and the knowledge vector information of the second database are input, and the second health care data of the second group is vector-converted using the knowledge vector information. The first vector conversion unit that generates vector information,
A machine learning unit that inputs the converted vector information and the measure information of the third database and outputs model information that models the relationship between the vector information and the measure information by machine learning.
It has a fourth database that manages the output model information, and has.
The second group is composed of lower groups constituting the first group.
The measure information is a health care data analysis system characterized by including the target persons of the health care measures in the second group and the results of the measures.

The health care data analysis system according to claim 1.
A fifth database that manages the third health care data of the third group,
A second vector conversion unit that generates vector information by vector conversion of the third health care data using the knowledge vector information, and
A first estimation calculation unit that applies the vector-converted vector information to the model information to perform a simulation, and
It further has a first output unit that outputs the result of the simulation, and
The third group is
It is a lower group that constitutes the first group, and is composed of a group different from the second group.
The first estimation calculation unit is
A health care data analysis system characterized in that the results of implementing the health care measures implemented in the second group in the third group are estimated.

The health care data analysis system according to claim 1.
A third vector conversion unit that generates vector information by vector conversion of the first health care data using the knowledge vector information,
A second estimation calculation unit that applies the vector-converted vector information to the model information to perform a simulation, and
It further has a second output unit that outputs the result of the simulation.
The second estimation calculation unit is
A health care data analysis system characterized in that the results of implementing the health care measures implemented in the second group in the first group are estimated.

The health care data analysis system according to claim 1.
The first health care data of the first group is a health care data analysis system characterized in that it is anonymous information.

The health care data analysis system according to claim 2.
The model information of the fourth database includes an identifier of the measure information.
The first estimation calculation unit is
Health and medical data characterized by outputting the results of measures for each of a plurality of model information, accepting the selection of the model information, and performing the simulation of the third health and medical data with the selected model information. Analysis system.

The health care data analysis system according to claim 3.
The model information of the fourth database includes an identifier of the measure information.
The second estimation calculation unit is
Health medical care characterized in that the results of measures for each of the plurality of the model information are output, the selection of the model information is accepted, and the simulation of the first health care data is performed with the selected model information. Data analysis system.

A computer with a processor and memory is a health care data analysis method that analyzes health care data.
The computer inputs the first health care data from the first database that manages the first health care data of the first group, and vectorizes the mutual relationship between the first health care data. A knowledge vector creation step of converting and outputting as knowledge vector information and storing the knowledge vector information in a second database, and
The computer inputs the second health care data from the third database that manages the second health care data of the second group and the measure information regarding the measures implemented in the second group. , The first vector conversion step of inputting knowledge vector information from the second database and generating vector information by vector conversion of the second health care data of the second group using the knowledge vector information.
The computer inputs the converted vector information and the measure information of the third database, and stores the model information in which the relationship between the vector information and the measure information is modeled by machine learning in the fourth database. Including machine learning steps,
The second group is composed of lower groups constituting the first group.
The measure information is a health care data analysis method characterized by including the target persons of the health care measures in the second group and the results of the measures.

The health care data analysis method according to claim 7.
The computer inputs the third health care data from the fifth database that manages the third health care data of the third group, inputs the knowledge vector information from the second database, and the third The second vector conversion step of generating vector information by vector conversion of the health and medical data of the above using the knowledge vector information, and
A first estimation calculation step in which the computer applies the vector-converted vector information to the model information to perform a simulation.
The computer further includes a first output step that outputs the result of the simulation.
The third group is
It is a lower group that constitutes the first group, and is composed of a group different from the second group.
The first estimation calculation step is
A method for analyzing health care data, which estimates the result of implementing the health care measures implemented in the second group in the third group.

The health care data analysis method according to claim 7.
A third vector conversion step in which the computer generates vector information by vector conversion of the first health care data using the knowledge vector information.
A second estimation calculation step in which the computer applies the vector-converted vector information to the model information to perform a simulation.
The computer further includes a second output step, which outputs the result of the simulation.
The second estimation calculation step is
A method for analyzing health care data, which estimates the result of implementing the health care measures implemented in the second group in the first group.

The health care data analysis method according to claim 7.
A method for analyzing health care data, wherein the first health care data of the first group is anonymous information.

The health care data analysis method according to claim 8.
The model information of the fourth database includes an identifier of the measure information.
The first estimation calculation step is
Health medical care characterized in that the results of measures for each of the plurality of the model information are output, the selection of the model information is accepted, and the simulation of the third health care data is performed with the selected model information. Data analysis method.

The health care data analysis method according to claim 9.
The model information of the fourth database includes an identifier of the measure information.
The second estimation calculation step is
Health medical care characterized in that the results of measures for each of the plurality of the model information are output, the selection of the model information is accepted, and the simulation of the first health care data is performed with the selected model information. Data analysis method.