JP6929124B2

JP6929124B2 - Forecasting systems, forecasting methods, and forecasting programs

Info

Publication number: JP6929124B2
Application number: JP2017095794A
Authority: JP
Inventors: 悠輔巣籠; 亮一草間; 聖吾原
Original assignee: Micin Inc
Current assignee: Micin Inc
Priority date: 2017-05-12
Filing date: 2017-05-12
Publication date: 2021-09-01
Anticipated expiration: 2037-05-12
Also published as: JP2018194904A

Description

本発明の一側面は予測システム、予測方法、および予測プログラムに関する。 One aspect of the invention relates to prediction systems, prediction methods, and prediction programs.

医療または人の健康を管理するための様々なコンピュータシステムが知られている。例えば下記の特許文献１には、対象者の健康状態の推移を予測する医療データ解析装置が記載されている。この解析装置は、異なる種類のデータが頻度要素として混在するバグオブワードの形で与えられた一連の対象者の一連の年代における医療データを、対象者及び年代ごとの個別データの集まりとして、深層学習を適用する。そして、解析装置は、各中間層における当該個別データの表現データを、異なる種類のデータが頻度要素として混在するバグオブワードの圧縮表現として出力する。 Various computer systems for managing medical or human health are known. For example, Patent Document 1 below describes a medical data analysis device that predicts changes in the health condition of a subject. This analyzer deeply records medical data in a series of ages of a series of subjects given in the form of a bug-of-word in which different types of data are mixed as frequency factors, as a collection of individual data for each subject and age. Apply learning. Then, the analysis device outputs the representation data of the individual data in each intermediate layer as a compressed representation of a bug of word in which different types of data are mixed as frequency elements.

特開２０１７−２７３０７号公報JP-A-2017-27307

人の健康状態を予測する手法は知られている。しかし、その状態をより精度良く予測することができれば便利であり、さらに、その予測結果に基づいて様々な更なる情報を得ることも可能になる。そのため、対象者の将来の状態をより精度良く予測する仕組みが望まれている。 Methods for predicting human health are known. However, it is convenient if the state can be predicted more accurately, and further, various further information can be obtained based on the prediction result. Therefore, a mechanism for more accurately predicting the future state of the subject is desired.

本発明の一側面に係る予測システムは、対象者の過去の受診記録を所定のデータベースから取得する取得部であって、該受診記録が少なくとも医療処置を示す、該取得部と、受診記録に基づいて複数の入力ベクトルを生成し、各入力ベクトルの要素群を所定の数値範囲内で正規化する変換部であって、要素群が、医療処置を示す要素を少なくとも含む、該変換部と、正規化された各入力ベクトルを学習アルゴリズムに入力することで、将来の所定の時期における対象者の状態の予測を示す出力ベクトルを求める予測部であって、出力ベクトルの要素群が入力ベクトルの要素群に対応する、該予測部とを備える。 The prediction system according to one aspect of the present invention is an acquisition unit that acquires the past consultation record of the subject from a predetermined database, and is based on the acquisition unit and the consultation record in which the consultation record indicates at least medical treatment. A conversion unit that generates a plurality of input vectors and normalizes the element group of each input vector within a predetermined numerical range, wherein the element group includes at least an element indicating a medical treatment. It is a prediction unit that obtains an output vector that indicates the prediction of the state of the target person at a predetermined time in the future by inputting each of the converted input vectors to the learning algorithm, and the element group of the output vector is the element group of the input vector. It is provided with the prediction unit corresponding to the above.

本発明の一側面に係る予測方法は、少なくとも一つのプロセッサを備える予測システムにより実行される予測方法であって、対象者の過去の受診記録を所定のデータベースから取得する取得ステップであって、該受診記録が少なくとも医療処置を示す、該取得ステップと、受診記録に基づいて複数の入力ベクトルを生成し、各入力ベクトルの要素群を所定の数値範囲内で正規化する変換ステップであって、要素群が、医療処置を示す要素を少なくとも含む、該変換ステップと、正規化された各入力ベクトルを学習アルゴリズムに入力することで、将来の所定の時期における対象者の状態の予測を示す出力ベクトルを求める予測ステップであって、出力ベクトルの要素群が入力ベクトルの要素群に対応する、該予測ステップとを含む。 The prediction method according to one aspect of the present invention is a prediction method executed by a prediction system including at least one processor, and is an acquisition step of acquiring a past consultation record of a subject from a predetermined database. An acquisition step in which the consultation record indicates at least a medical procedure, and a conversion step in which a plurality of input vectors are generated based on the consultation record and the element group of each input vector is normalized within a predetermined numerical range. By inputting the transformation step and each normalized input vector into the learning algorithm, the group includes at least an element indicating the medical treatment, so that an output vector indicating a prediction of the subject's state at a predetermined time in the future is obtained. It is a prediction step to be obtained, and includes the prediction step in which the element group of the output vector corresponds to the element group of the input vector.

本発明の一側面に係る予測プログラムは、対象者の過去の受診記録を所定のデータベースから取得する取得ステップであって、該受診記録が少なくとも医療処置を示す、該取得ステップと、受診記録に基づいて複数の入力ベクトルを生成し、各入力ベクトルの要素群を所定の数値範囲内で正規化する変換ステップであって、要素群が、医療処置を示す要素を少なくとも含む、該変換ステップと、正規化された各入力ベクトルを学習アルゴリズムに入力することで、将来の所定の時期における対象者の状態の予測を示す出力ベクトルを求める予測ステップであって、出力ベクトルの要素群が入力ベクトルの要素群に対応する、該予測ステップとをコンピュータシステムに実行させる。 The prediction program according to one aspect of the present invention is an acquisition step of acquiring a past consultation record of a subject from a predetermined database, and is based on the acquisition step and the consultation record in which the consultation record indicates at least a medical treatment. A conversion step that generates a plurality of input vectors and normalizes the element group of each input vector within a predetermined numerical range, wherein the element group includes at least an element indicating a medical treatment. It is a prediction step to obtain an output vector indicating the prediction of the state of the target person at a predetermined time in the future by inputting each of the converted input vectors to the learning algorithm, and the element group of the output vector is the element group of the input vector. The computer system is made to perform the prediction step corresponding to the above.

このような側面においては、対象者の受診記録を表現した入力ベクトルを正規化してからその入力ベクトルを学習アルゴリズムに入力することで、対象者の将来の状態の予測を示す出力ベクトルが得られる。入力ベクトルは医療処置を示す要素を少なくとも含み、出力ベクトルはその入力ベクトルに対応する。医療処置を示す要素を少なくとも含む出力ベクトルを出力することで、対象者の状態をより精度良く予測することができる。 In such an aspect, by normalizing the input vector expressing the consultation record of the subject and then inputting the input vector into the learning algorithm, an output vector showing the prediction of the future state of the subject can be obtained. The input vector contains at least an element indicating a medical procedure, and the output vector corresponds to the input vector. By outputting an output vector that includes at least an element indicating a medical procedure, the condition of the subject can be predicted more accurately.

本発明の一側面によれば、対象者の将来の状態をより精度良く予測することができる。 According to one aspect of the present invention, the future state of the subject can be predicted more accurately.

実施形態に係る予測システムで用いられるコンピュータのハードウェア構成を示す図である。It is a figure which shows the hardware configuration of the computer used in the prediction system which concerns on embodiment. 実施形態に係る予測システムの機能構成を示す図である。It is a figure which shows the functional structure of the prediction system which concerns on embodiment. 実施形態に係る予測システムの動作を示すフローチャートである。It is a flowchart which shows the operation of the prediction system which concerns on embodiment. 実施形態に係る予測プログラムの構成を示す図である。It is a figure which shows the structure of the prediction program which concerns on embodiment.

以下、添付図面を参照しながら本発明の実施形態を詳細に説明する。なお、図面の説明において同一または同等の要素には同一の符号を付し、重複する説明を省略する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the description of the drawings, the same or equivalent elements are designated by the same reference numerals, and duplicate description will be omitted.

実施形態に係る予測システム１０は、対象者の将来の状態を予測するコンピュータシステムである。例えば、予測システム１０は、対象者がいつどのような医療処置を受けるかを予測したり、対象者の身体の状態が将来どのようになるかを予測したりする。「医療処置」とは医療機関が患者に対して行う処置であって、例えば診察、治療、薬の処方の少なくとも一つを含む。より具体的には、医療処置は医療に関する資格を有する者（例えば、医者、看護師、薬剤師など）により施される。人（患者）が医療処置を受けることを「受診」という。「対象者」とは、予測処理の対象となる人（患者）である。 The prediction system 10 according to the embodiment is a computer system that predicts the future state of the subject. For example, the prediction system 10 predicts when and what kind of medical treatment the subject will receive, and predicts what the physical condition of the subject will be in the future. A "medical procedure" is a procedure performed by a medical institution on a patient and includes, for example, at least one of medical examination, treatment, and prescription of a drug. More specifically, medical procedures are performed by medically qualified persons (eg, doctors, nurses, pharmacists, etc.). When a person (patient) receives medical treatment, it is called "visiting". The "subject" is a person (patient) who is the target of the prediction process.

図１は予測システム１０を構成するコンピュータ１００の一般的なハードウェア構成を示す。例えば、コンピュータ１００はプロセッサ１０１、主記憶部１０２、補助記憶部１０３、通信制御部１０４、入力装置１０５、および出力装置１０６を備える。プロセッサ１０１はオペレーティングシステムおよびアプリケーション・プログラムを実行する。主記憶部１０２は例えばＲＯＭおよびＲＡＭで構成される。補助記憶部１０３は例えばハードディスクまたはフラッシュメモリで構成され、一般に主記憶部１０２よりも大量のデータを記憶する。通信制御部１０４は例えばネットワークカードまたは無線通信モジュールで構成される。入力装置１０５は例えばキーボード、マウス、タッチパネルなどで構成される。出力装置１０６は例えばモニタおよびスピーカで構成される。 FIG. 1 shows a general hardware configuration of the computer 100 constituting the prediction system 10. For example, the computer 100 includes a processor 101, a main storage unit 102, an auxiliary storage unit 103, a communication control unit 104, an input device 105, and an output device 106. Processor 101 executes operating systems and application programs. The main storage unit 102 is composed of, for example, a ROM and a RAM. The auxiliary storage unit 103 is composed of, for example, a hard disk or a flash memory, and generally stores a larger amount of data than the main storage unit 102. The communication control unit 104 is composed of, for example, a network card or a wireless communication module. The input device 105 is composed of, for example, a keyboard, a mouse, a touch panel, and the like. The output device 106 is composed of, for example, a monitor and a speaker.

予測システム１０の各機能要素は、プロセッサ１０１または主記憶部１０２の上に所定のソフトウェア（例えば、後述する予測プログラムＰ１）を読み込ませてそのソフトウェアを実行させることで実現される。プロセッサ１０１はそのソフトウェアに従って、通信制御部１０４、入力装置１０５、または出力装置１０６を動作させ、主記憶部１０２または補助記憶部１０３におけるデータの読み出しおよび書き込みを行う。処理に必要なデータまたはデータベースは主記憶部１０２または補助記憶部１０３内に格納される。 Each functional element of the prediction system 10 is realized by loading a predetermined software (for example, a prediction program P1 described later) on the processor 101 or the main storage unit 102 and executing the software. The processor 101 operates the communication control unit 104, the input device 105, or the output device 106 according to the software, and reads and writes data in the main storage unit 102 or the auxiliary storage unit 103. The data or database required for processing is stored in the main storage unit 102 or the auxiliary storage unit 103.

予測システム１０は１台のコンピュータで構成されてもよいし、複数台のコンピュータで構成されてもよい。複数台のコンピュータを用いる場合には、これらのコンピュータがインターネットやイントラネットなどの通信ネットワークを介して接続されることで、論理的に一つの予測システム１０が構築される。 The prediction system 10 may be composed of one computer or a plurality of computers. When a plurality of computers are used, one prediction system 10 is logically constructed by connecting these computers via a communication network such as the Internet or an intranet.

予測システム１０はインターネットやイントラネットなどの通信ネットワークを介してデータベース群２０にアクセスすることができる。データベース群２０は、人々の受診記録を記憶するデータベースの集合である。データベースは、プロセッサまたは外部のコンピュータからの任意のデータ操作（例えば、抽出、追加、削除、上書きなど）に対応できるようにデータ集合を記憶する機能要素または装置である。データベースの実装方法は限定されないが、例えばデータベース管理システム（ＤＢＭＳ）が用いられてもよい。「受診記録」とは、人が過去に受けた医療処置を少なくとも示すデータである。 The prediction system 10 can access the database group 20 via a communication network such as the Internet or an intranet. The database group 20 is a set of databases for storing people's consultation records. A database is a functional element or device that stores a set of data to accommodate any data manipulation (eg, extract, add, delete, overwrite, etc.) from a processor or external computer. The method of implementing the database is not limited, but a database management system (DBMS) may be used, for example. A "visit record" is data that at least indicates the medical procedure that a person has received in the past.

図２に示すように、本実施形態ではデータベース群２０はレセプトデータベース２１、健診データベース２２、およびアンケートデータベース２３を含む。レセプトデータベース２１、健診データベース２２、およびアンケートデータベース２３が設けられる位置は限定されない。例えば、これらのデータベースの少なくとも一部が予測システム１０とは異なるコンピュータシステムにより管理されてもよいし、予測システム１０の一部であってもよい。また、各データベースの管理主体も限定されない。例えば、予測システム１０の運営者が他の機関からデータを提供されて、これらのデータの集合をデータベース群２０として管理してもよい。 As shown in FIG. 2, in the present embodiment, the database group 20 includes a receipt database 21, a medical examination database 22, and a questionnaire database 23. The location where the receipt database 21, the medical examination database 22, and the questionnaire database 23 are provided is not limited. For example, at least a part of these databases may be managed by a computer system different from the prediction system 10, or may be a part of the prediction system 10. In addition, the management entity of each database is not limited. For example, the operator of the prediction system 10 may be provided with data from another institution and manage a set of these data as a database group 20.

レセプトデータベース２１は、人が過去に受けた医療処置を示すレセプトデータを記憶するデータベースである。例えば、レセプトデータは医療機関で生成されてレセプトデータベース２１に蓄積される。レセプトデータベース２１は各医療機関のレセプトデータベースの集合であってもよいし、各医療機関のレセプトデータが集約された一つのデータベースであってもよい。 The medical receipt database 21 is a database that stores medical receipt data indicating medical treatments that a person has received in the past. For example, the receipt data is generated by a medical institution and stored in the receipt database 21. The receipt database 21 may be a set of receipt databases of each medical institution, or may be one database in which the receipt data of each medical institution is aggregated.

レセプトデータの各レコードは対象者ＩＤ、対象者の属性、医療機関ＩＤ、受診日、および医療処置の詳細を含む。対象者ＩＤは医療処置を受けた人を一意に特定する識別子である。対象者の属性は、対象者の性質または特徴を示すデータであり、例えば性別および年齢である。医療機関ＩＤは医療処置を施した医療機関を一意に特定する識別子である。受診日は医療処置が実施された日にちである。医療処置の詳細は１以上の診察項目、１以上の治療項目、および１以上の処方薬を示すデータである。処方薬を示すデータは、薬の種類、用量、および投与日数を含む。診察、治療、および処方薬はいずれも非常に多くの種類が存在するので、医療処置の詳細は非常に多くの（例えば数千、数万の）データ項目を含み得る。 Each record of the receipt data includes the subject ID, the subject's attributes, the medical institution ID, the date of consultation, and the details of the medical procedure. The subject ID is an identifier that uniquely identifies a person who has undergone medical treatment. The subject's attributes are data that indicate the subject's nature or characteristics, such as gender and age. The medical institution ID is an identifier that uniquely identifies the medical institution that has performed the medical procedure. The date of consultation is the date on which the medical procedure was performed. The details of the medical procedure are data showing one or more medical examination items, one or more treatment items, and one or more prescription drugs. Data showing prescription drugs include drug type, dose, and days of administration. Since there are so many types of examinations, treatments, and prescription drugs, the details of a medical procedure can contain so many (eg, thousands, tens of thousands) of data items.

健診データベース２２は、人が過去に受けた健康診断の結果を示す健診データを記憶するデータベースである。例えば、健診データは医療機関で生成されて健診データベース２２に蓄積される。レセプトデータベース２１と同様に、健診データベース２２は各医療機関の健診データベースの集合であってもよいし、各医療機関の健診データが集約された一つのデータベースであってもよい。 The medical examination database 22 is a database that stores medical examination data showing the results of medical examinations that a person has received in the past. For example, medical examination data is generated by a medical institution and stored in the medical examination database 22. Similar to the medical receipt database 21, the medical examination database 22 may be a set of medical examination databases of each medical institution, or may be one database in which medical examination data of each medical institution is aggregated.

健診データの各レコードは対象者ＩＤ、対象者の属性、医療機関ＩＤ、健診日、および検査結果を含む。対象者ＩＤは健康診断を受けた人を一意に特定する識別子である。対象者の属性についてはレセプトデータと同様である。医療機関ＩＤは健康診断を実施した医療機関を一意に特定する識別子である。健診日は健康診断が実施された日にちである。検査結果は健康診断の結果であり、より具体的には複数の検査値の集合である。健康診断は様々な検査項目（例えば、法定の各種項目および非法定の各種項目の少なくとも一方）を含むので、検査結果は多数の（例えば数十、数百の）データ項目を含み得る。 Each record of the medical examination data includes the subject ID, the subject's attributes, the medical institution ID, the medical examination date, and the test result. The subject ID is an identifier that uniquely identifies the person who has undergone the medical examination. The attributes of the target person are the same as those of the receipt data. The medical institution ID is an identifier that uniquely identifies the medical institution that performed the medical examination. The medical examination date is the date when the medical examination was carried out. The test result is the result of a medical examination, and more specifically, it is a set of a plurality of test values. Since the health examination includes various test items (eg, at least one of the legal and non-statutory items), the test result can include a large number (eg, tens or hundreds) of data items.

アンケートデータベース２３は、人が過去に回答した健康に関するアンケート結果を示すアンケートデータを記憶するデータベースである。健康に関するアンケートの例として、企業で実施されるストレスチェック、健康診断時に実施される生活習慣に関するアンケートなどが挙げられるが、アンケートの内容はこれらに限定されない。アンケートデータは企業または医療機関で生成されてアンケートデータベース２３に蓄積される。レセプトデータベース２１と同様に、アンケートデータベース２３は各企業または各医療機関のアンケートデータベースの集合であってもよいし、各企業または各医療機関のアンケートデータが集約された一つのデータベースであってもよい。 The questionnaire database 23 is a database that stores questionnaire data showing the results of questionnaires related to health that people have answered in the past. Examples of health-related questionnaires include, but are not limited to, stress checks conducted by companies and lifestyle-related questionnaires conducted during health examinations. The questionnaire data is generated by a company or a medical institution and stored in the questionnaire database 23. Similar to the receipt database 21, the questionnaire database 23 may be a set of questionnaire databases of each company or each medical institution, or may be one database in which the questionnaire data of each company or each medical institution are aggregated. ..

アンケートデータの各レコードは対象者ＩＤ、実施機関ＩＤ、回答日、およびアンケート結果を含む。対象者ＩＤはアンケートに回答した人を一意に特定する識別子である。実施機関ＩＤはアンケートを実施した機関（例えば企業または医療機関）を一意に特定する識別子である。回答日はアンケートの回答が記入された日である。アンケート結果は複数の回答を示すデータであり、例えばストレスチェック、生活習慣、意識調査などについての様々な回答を含む。 Each record of the questionnaire data includes the subject ID, the executing agency ID, the response date, and the questionnaire result. The target person ID is an identifier that uniquely identifies the person who responded to the questionnaire. The executing agency ID is an identifier that uniquely identifies the institution (for example, a company or medical institution) that conducted the questionnaire. The answer date is the date when the answer to the questionnaire is entered. Questionnaire results are data showing multiple answers, including various answers such as stress checks, lifestyle habits, and awareness surveys.

図２は予測システム１０の機能構成を示す。図２に示すように、予測システム１０は機能的構成要素として取得部１１、変換部１２、および予測部１３を備える。 FIG. 2 shows the functional configuration of the prediction system 10. As shown in FIG. 2, the prediction system 10 includes an acquisition unit 11, a conversion unit 12, and a prediction unit 13 as functional components.

取得部１１は、対象者の過去の受診記録を取得する機能要素である。自動的にまたは予測システム１０のオペレータにより対象者ＩＤが入力されると、取得部１１はこの対象者ＩＤに対応する受診記録をデータベース群２０から読み出す。具体的には、取得部１１はその対象者ＩＤに対応するレセプトデータ、健診データ、およびアンケートデータをレセプトデータベース２１、健診データベース２２、およびアンケートデータベース２３からそれぞれ読み出す。そして、取得部１１はこれらのデータから成る過去の受診記録を変換部１２に出力する。 The acquisition unit 11 is a functional element that acquires the past consultation record of the subject. When the target person ID is automatically or by the operator of the prediction system 10, the acquisition unit 11 reads out the consultation record corresponding to the target person ID from the database group 20. Specifically, the acquisition unit 11 reads the receipt data, the medical examination data, and the questionnaire data corresponding to the subject ID from the receipt database 21, the medical examination database 22, and the questionnaire database 23, respectively. Then, the acquisition unit 11 outputs the past consultation record composed of these data to the conversion unit 12.

変換部１２は、過去の受診記録に基づいて複数の入力ベクトルを生成する機能要素である。入力ベクトルはＮ行１列（Ｎ＞１）の行列で示されるデータであり、後述する学習アルゴリズムに入力される。一つの入力ベクトルは過去の一時点における受診記録に対応し、この入力ベクトルはその時点におけるレセプトデータ、健診データ、およびアンケートデータを示す。医療処置の詳細（診察、治療、および処方薬）を含むレセプトデータがＰ個のデータ項目から成り、検査結果（検査値）を含む健診データがＱ個のデータ項目から成り、アンケート結果（回答）を含むアンケートデータがＲ個のデータ項目から成るとする。この場合、ある一時点ｔにおける入力ベクトルｖ_ｔを下記の式（１）で表すことができる。要素ａ_１〜ａ_Ｐはレセプトデータの各データ項目を示し、要素ｂ_１〜ｂ_Ｑは健診データの各データ項目を示し、要素ｃ_１〜ｃ_Ｒはアンケートデータの各データ項目を示す。各入力ベクトルの要素数（行数）は各データのデータ項目数に応じて膨大になり得る。

The conversion unit 12 is a functional element that generates a plurality of input vectors based on past consultation records. The input vector is data represented by a matrix of N rows and 1 column (N> 1), and is input to a learning algorithm described later. One input vector corresponds to the medical examination record at one time point in the past, and this input vector indicates the receipt data, the medical examination data, and the questionnaire data at that time point. Receipt data including details of medical treatment (examination, treatment, and prescription drug) consists of P data items, and medical examination data including test results (test values) consists of Q data items, and questionnaire results (answers) ) Is included in the questionnaire data, which consists of R data items. In this case, the input vector v _t at a certain temporary point t can be expressed by the following equation (1). Elements a _{1 to} a _P indicate each data item of the receipt data, elements b _{1 to} b _Q indicate each data item of the medical examination data, and elements c _{1 to} c _R indicate each data item of the questionnaire data. The number of elements (number of rows) of each input vector can be enormous depending on the number of data items of each data.

一般に、人は医療処置を複数回受け、健康診断を定期的に受け、健康に関するアンケートにも定期的に回答する。例えば、人は医療処置を特定の期間において毎月受けたり、健康診断を年に一度受けたり、アンケートに年に一度（例えば健康診断の際に）回答したりする。したがって、取得部１１は一つの対象者ＩＤについて、過去の複数の時点におけるレセプトデータ、健診データ、またはアンケートデータを取得し、複数の時点での受診記録を示す複数の入力ベクトルをこれらのデータから生成する。得られる入力ベクトルの個数をＭとすると、対象者の過去の受診記録は、下記の式（２）で示される入力ベクトルの集合Ｖで表すことができる。集合Ｖ内の入力ベクトルは時系列に並べられる。したがって、この集合Ｖは時系列データである。
Ｖ＝｛ｖ_１，ｖ_２，…，ｖ_Ｍ｝ …（２） In general, a person receives multiple medical procedures, undergoes regular health examinations, and regularly answers health questionnaires. For example, a person receives medical treatment monthly for a specific period of time, receives a health checkup once a year, and answers a questionnaire once a year (for example, during a health checkup). Therefore, the acquisition unit 11 acquires the receipt data, the medical examination data, or the questionnaire data at a plurality of past time points for one subject ID, and obtains a plurality of input vectors indicating the consultation records at the plurality of time points. Generate from. Assuming that the number of input vectors obtained is M, the past consultation record of the subject can be represented by a set V of input vectors represented by the following equation (2). The input vectors in the set V are arranged in chronological order. Therefore, this set V is time series data.
V = {v ₁ , v ₂ , ..., v _M } ... (2)

入力ベクトルの要素群は数値で表される。一方、レセプトデータ、健診データ、およびアンケートデータの中には数値で表されていないデータ項目（例えば、テキストで表現されたデータ項目）が存在し得る。この場合には、変換部１２はそのデータ項目を所定の規則に基づいて数値に変換する。非数値を数値に変換する規則は、変換部１２を実現するプログラムコードまたは設定ファイルに予め記述され、変換部１２はその記述に従って変換を実行する。 The element group of the input vector is represented by a numerical value. On the other hand, there may be data items that are not represented numerically (for example, data items expressed in text) in the receipt data, the medical examination data, and the questionnaire data. In this case, the conversion unit 12 converts the data item into a numerical value based on a predetermined rule. The rules for converting non-numeric values into numerical values are described in advance in the program code or setting file that realizes the conversion unit 12, and the conversion unit 12 executes the conversion according to the description.

非数値を数値に変換する方法は限定されず、またその変換方法はデータ項目毎に異なってもよい。例えば、変換部１２は非数値データを二値（０または１）に変換してもよいし、所定の範囲内（例えば０〜１、０〜１０、１〜１００、−５〜＋５など）の数値に変換してもよいし、元データに含まれている値を用いた変換を行ってもよい。一例として、レセプトデータのある項目Ａが、特定の成分Ｂを含む注射を示すものとする。変換部１２はこの項目Ａに対応する要素に、注射の有無を示す二値（０：注射なし、１：注射あり）を代入してもよいし、適用された成分Ｂの量を代入してもよい。 The method of converting a non-numeric value into a numerical value is not limited, and the conversion method may differ for each data item. For example, the conversion unit 12 may convert non-numeric data into binary values (0 or 1), or within a predetermined range (for example, 0 to 1, 0 to 10, 1 to 100, -5 to +5, etc.). It may be converted into a numerical value, or may be converted using the value contained in the original data. As an example, it is assumed that item A with receipt data indicates an injection containing a specific component B. The conversion unit 12 may substitute a binary value (0: no injection, 1: with injection) indicating the presence or absence of injection into the element corresponding to this item A, or substitute the amount of the applied component B into the element. May be good.

レセプトデータ、健診データ、またはアンケートデータが示す日にち（受診日、健診日、および回答日）は互いに異なり得る。また、これらのデータの個数も互いに異なり得る。例えば、受診日の間隔が健診日の間隔および回答日の間隔よりも小さい場合には、少なくとも一つの入力ベクトルにおいて、健診データおよびアンケートデータに対応する要素が空値になる。このように空値が発生する場合には、取得部１１はその空値を所定の有意な数値（例えば０）に置き換える。これをパディングといい、この処理によりすべての入力ベクトルのすべての要素に有意な数値が設定される。 The dates indicated by the receipt data, medical examination data, or questionnaire data (examination date, medical examination date, and response date) can be different from each other. Also, the number of these data can be different from each other. For example, when the consultation day interval is smaller than the medical examination day interval and the response date interval, the elements corresponding to the medical examination data and the questionnaire data are empty values in at least one input vector. When an empty value is generated in this way, the acquisition unit 11 replaces the empty value with a predetermined significant numerical value (for example, 0). This is called padding, and this process sets significant numerical values for all elements of all input vectors.

変換部１２は、生成した各入力ベクトルの要素群を所定の数値範囲内で正規化する。その数値範囲は限定されないが、例えば変換部１２は０以上１以下の範囲で各要素を正規化してもよい。正規化の規則も、変換部１２を実現するプログラムコードまたは設定ファイルに予め記述され、変換部１２はその記述に従って正規化を実行する。 The conversion unit 12 normalizes the element group of each generated input vector within a predetermined numerical range. The numerical range is not limited, but for example, the conversion unit 12 may normalize each element in the range of 0 or more and 1 or less. The normalization rules are also described in advance in the program code or the setting file that realizes the conversion unit 12, and the conversion unit 12 executes the normalization according to the description.

正規化の具体的な規則は限定されず、またその正規化方法は要素ごとに異なってもよい。例えば、変換部１２はある要素を二値（０または１）に変換してもよいし、０から１の間の任意の値に変換してもよい。なお、データ項目によっては正規化しても値が変わらない場合があり得る。下記の式（３）は正規化された入力ベクトルの集合の具体的な一例を示す。

The specific rules of normalization are not limited, and the normalization method may differ from element to element. For example, the conversion unit 12 may convert an element into a binary value (0 or 1), or may convert it into an arbitrary value between 0 and 1. Depending on the data item, the value may not change even if it is normalized. Equation (3) below shows a specific example of a set of normalized input vectors.

変換部１２は正規化された入力ベクトルの任意の要素についてスパース正則化を実行することで、入力ベクトル内で０の要素を増やしてもよい。このスパース正則化により、過学習を避けつつ学習の精度を上げることが期待できる。スパース正則化の規則も、変換部１２を実現するプログラムコードまたは設定ファイルに予め記述され、変換部１２はその記述に従ってスパース正則化を実行する。 The conversion unit 12 may increase the number of 0 elements in the input vector by performing sparse regularization on any element of the normalized input vector. This sparse regularization can be expected to improve learning accuracy while avoiding overfitting. The rules for sparse regularization are also described in advance in the program code or the setting file that realizes the conversion unit 12, and the conversion unit 12 executes the sparse regularization according to the description.

どの要素をスパース正則化の対象とするかは任意に決めてよく、変換部１２は入力ベクトルの全要素についてスパース正則化を実行してもよいし、一部の要素についてのみスパース正則化を実行してもよい。例えば、変換部１２はレセプトデータのうち医療処置を示す要素に対してのみスパース正則化を実行してもよい。あるいは、変換部１２はレセプトデータのうち医療処置に対応する要素と、アンケートデータに対応する要素とに対してのみスパース正則化を実行してもよい。変換部１２は、精度が要求される要素（例えば健診データの検査結果に対応する要素）にはスパース正則化を実行することなく、他の要素の一部または全部に対してスパース正則化を実行してもよい。 Which element is subject to sparse regularization may be arbitrarily determined, and the conversion unit 12 may execute sparse regularization for all elements of the input vector, or execute sparse regularization for only some elements. You may. For example, the conversion unit 12 may execute sparse regularization only for the element indicating the medical procedure in the receipt data. Alternatively, the conversion unit 12 may execute sparse regularization only for the element corresponding to the medical procedure and the element corresponding to the questionnaire data in the receipt data. The conversion unit 12 does not execute sparse regularization for elements that require accuracy (for example, elements corresponding to the test results of medical examination data), but performs sparse regularization for some or all of the other elements. You may do it.

変換部１２は、正規化された入力ベクトル、または正規化およびスパース正則化の双方が適用された入力ベクトルを圧縮してもよい。圧縮とは入力ベクトルの次元数を減らす処理である。変換部１２は集合Ｖ内のすべての入力ベクトルにおいて０である要素（行）をこれらの入力ベクトルから取り除くことで、各入力ベクトルの次元数を減らす。 The transforming unit 12 may compress the normalized input vector or the input vector to which both normalization and sparse normalization are applied. Compression is the process of reducing the number of dimensions of the input vector. The conversion unit 12 reduces the number of dimensions of each input vector by removing elements (rows) that are 0 in all the input vectors in the set V from these input vectors.

このように、変換部１２は複数の入力ベクトルを生成し（必要であればパディングを実行する）、各入力ベクトルを正規化する。必要であれば、変換部１２はさらにその入力ベクトルに対してスパース正規化および圧縮の少なくとも一方を実行する。変換部１２はこの一連の処理で得られた入力ベクトルの集合Ｖを予測部１３に出力する。 In this way, the conversion unit 12 generates a plurality of input vectors (executes padding if necessary) and normalizes each input vector. If necessary, the transforming unit 12 further performs at least one of sparse normalization and compression on the input vector. The conversion unit 12 outputs the set V of the input vectors obtained by this series of processing to the prediction unit 13.

予測部１３は、少なくとも正規化が施された各入力ベクトルを学習アルゴリズムに入力することで、将来の所定の時期における対象者の状態の予測（例えば受診予測）を示す出力ベクトルを得る機能要素である。学習アルゴリズム（学習モデル）の例としてＬＳＴＭ（ＬｏｎｇＳｈｏｒｔ−ＴｅｒｍＭｅｍｏｒｙ）、ＧＲＵ（ＧａｔｅｄＲｅｃｕｒｒｅｎｔＵｎｉｔ）、Ａｔｔｅｎｔｉｏｎ（アテンション）が挙げられる。しかし、予測部１３が用いる学習アルゴリズムはこれらに限定されず、任意のアルゴリズムが用いられてよい。 The prediction unit 13 is a functional element that obtains an output vector indicating a prediction of the state of the subject (for example, a consultation prediction) at a predetermined time in the future by inputting at least each normalized input vector to the learning algorithm. be. Examples of the learning algorithm (learning model) include LSTM (Long Short-Term Memory), GRU (Gated Recurrent Unit), and Attention. However, the learning algorithm used by the prediction unit 13 is not limited to these, and any algorithm may be used.

予測部１３は時系列に並べられた複数の入力ベクトルを前半と後半の二つに分ける。続いて、予測部１３はまず前半の入力ベクトルを学習アルゴリズムに処理させることで学習アルゴリズムを学習させる。続いて、予測部１３は後半の入力ベクトルを学習アルゴリズムに処理させることで学習アルゴリズムを検証し、最も計算制度が高いと推定される学習アルゴリズム（以下ではこれを「最良の学習アルゴリズム」ともいう）を得る。すなわち、前半の入力ベクトルの集合は訓練データであり、後半の入力ベクトルの集合はテストデータである。 The prediction unit 13 divides a plurality of input vectors arranged in time series into the first half and the second half. Subsequently, the prediction unit 13 first causes the learning algorithm to process the input vector in the first half to learn the learning algorithm. Subsequently, the prediction unit 13 verifies the learning algorithm by letting the learning algorithm process the input vector in the latter half, and the learning algorithm estimated to have the highest calculation system (hereinafter, this is also referred to as the "best learning algorithm"). To get. That is, the set of input vectors in the first half is training data, and the set of input vectors in the second half is test data.

続いて、予測部１３は得られた最良の学習アルゴリズムを用いて、将来の所定の時期（以下では「対象時期」ともいう）における対象者の状態の予測を示す出力ベクトルを求める。予測したい将来の時期は予め予測システム１０に入力され、例えば対象者ＩＤと共に入力される。予測部１３は現在から対象時期までの期間を複数の単位期間に分割する。そして、予測部１３は現在から対象時期に向かって、単位期間が経過した時点での出力ベクトルを順番に求める。 Subsequently, the prediction unit 13 uses the obtained best learning algorithm to obtain an output vector indicating the prediction of the state of the target person at a predetermined time in the future (hereinafter, also referred to as “target time”). The future time to be predicted is input to the prediction system 10 in advance, and is input together with, for example, the target person ID. The prediction unit 13 divides the period from the present to the target time into a plurality of unit periods. Then, the prediction unit 13 sequentially obtains the output vector at the time when the unit period elapses from the present to the target time.

一例として、過去の期間に対応する入力ベクトルの集合Ｖ＝｛ｖ_１，ｖ_２，…，ｖ_Ｍ｝を用いて、対象時期ｔ_ｙにおける出力ベクトルｖ_ｙを算出するとする。まず、予測部１３は集合Ｖの前半Ｖ_Ｆ＝｛ｖ_１，…，ｖ_ｉ｝を用いて学習アルゴリズムを学習させ、集合Ｖの後半Ｖ_Ｌ＝｛ｖ_ｉ＋１，…，ｖ_Ｍ｝を用いて学習アルゴリズムを検証することで、最良の学習アルゴリズムを得る。 As an example, the set _{_{V = {v 1, v 2}} , ..., v M} of input vectors corresponding to the past time with, and to calculate the output vector _{v y} in the target time _{t y.} First, half _{_{V F = {v 1, ...}} , v i} of the prediction unit 13 is set V to learn a learning algorithm using a second half _{_{V L = {v i + 1}} , ..., v M} of set V with By verifying the learning algorithm, the best learning algorithm is obtained.

続いて、予測部１３は現時点ｔ_ｃから対象時期ｔ_ｙまでの期間をＮ個の単位期間ｋに分割する。続いて、予測部１３は入力ベクトルの集合Ｖ＝｛ｖ_１，…，ｖ_Ｍ｝を最良の学習アルゴリズムに入力することで、将来の時期（ｔ_ｃ＋１ｋ）における状態の予測を示す出力ベクトルｖ´_１を得る。 Then, the estimating unit 13 divides the current time t _c a period until the target time t _y to N unit period k. Subsequently, the prediction unit 13 inputs the set V = {v ₁ , ..., V _M } of the input vectors to the best learning algorithm, so that the output vector v indicating the prediction of the state _{in the future time (t c + 1k).} ´ ₁ is obtained.

続いて、予測部１３は集合Ｖの中で最も古い入力ベクトルｖ_１を除き、代わりにその出力ベクトルｖ´_１を集合Ｖに加える。この結果、入力ベクトルの集合Ｖは｛ｖ_２，…，ｖ_Ｍ，ｖ´_１｝に変わる。予測部１３はこの集合Ｖ＝｛ｖ_２，…，ｖ_Ｍ，ｖ´_１｝を最良の学習アルゴリズムに入力することで、将来の時期（ｔ_ｃ＋２ｋ）における状態の予測を示す出力ベクトルｖ´_２を得る。 Then, the estimating unit 13 except for the oldest input vector v ₁ in the set V, is added instead its output vector v _'1 to set V in. As a result, the set V of the input vectors changes to _{{v 2} , ..., v _M , _{v ′ 1}.} Prediction unit 13 the set _{_{V = {v 2, ...,}} v M, v'1} by inputting the best learning algorithm, the output vector indicating the predicted state at a future time _(t c + 2k) v' _{Get 2} .

続いて、予測部１３は集合Ｖの中で最も古い入力ベクトルｖ_２を除き、代わりにその出力ベクトルｖ´_２を集合Ｖに加えることで、入力ベクトルの集合Ｖ＝｛ｖ_３，…，ｖ_Ｍ，ｖ´_１，ｖ´_２｝を得る。予測部１３はこの集合Ｖ＝｛ｖ_３，…，ｖ_Ｍ，ｖ´_１，ｖ´_２｝を最良の学習アルゴリズムに入力することで、将来の時期（ｔ_ｃ＋３ｋ）における状態の予測を示す出力ベクトルｖ´_３を得る。 Then, the estimating unit 13 except for the oldest input vector v ₂ in the set V, by adding the output vectors v _'2 to set V instead, the set of input vectors _{V = {v 3, ...,} v _M, _{v '1,} obtain v' _2}. Prediction unit 13 the set _{_{V = {v 3, ...,}} v M, v'1, v'2} indicates a by entering the best learning algorithms, prediction of the state in the future time _(t c + 3k) obtain an output vector v _'3.

このように、予測部１３は入力ベクトルの集合で示される期間を将来に向かってシフトさせながら出力ベクトルを求め続け、最終的に、対象時期ｔ_ｙ（＝ｔ_ｃ＋Ｎｋ）における出力ベクトルｖ´_Ｎを求める。最終的に得られる出力ベクトルの要素群は入力ベクトルの要素群に対応する。したがって、出力ベクトルは、医療処置を示す１以上の要素（レセプトデータの項目に関する予測）を少なくとも含み、人の身体の状態に関する他の要素（健診データデータまたはアンケートデータの項目に関する予測。例えば各種の検査値。）をさらに含み得る。 Thus, prediction unit 13 continue to seek an output vector while shifting towards a period shown by a set of input vectors in the future, finally, the target timing _{_{t y (= t c + Nk}} ) output vectors in v _'N Ask for. The finally obtained output vector element group corresponds to the input vector element group. Therefore, the output vector includes at least one or more elements indicating the medical procedure (prediction regarding the item of the receipt data) and other elements regarding the physical condition of the person (prediction regarding the item of the medical examination data data or the questionnaire data, for example, various types). Test value of.) Can be further included.

最終的に得られた出力ベクトルから、対象者が将来の所定の時期においてどのような医療処置を受けるかを知ったり、対象者の身体の将来の状態を知ったりことができる。ただし、この出力ベクトルの各要素は正規化されているので、その正規化された値をそのまま出力しても人には理解しづらい。そこで、予測部１３はこの出力ベクトルを加工してもよい。例えば、予測部１３はその出力ベクトルの正規化を解除することで、出力ベクトルの要素の値を、正規化されていない値に戻してもよい（非正規化）。あるいは、予測部１３は正規化されたままの状態にある出力ベクトルまたは非正規化された出力ベクトルの要素の少なくとも一部をテキストに変換してもよい。 From the finally obtained output vector, it is possible to know what kind of medical treatment the subject will receive at a predetermined time in the future, and to know the future state of the subject's body. However, since each element of this output vector is normalized, it is difficult for humans to understand even if the normalized value is output as it is. Therefore, the prediction unit 13 may process this output vector. For example, the prediction unit 13 may return the value of the element of the output vector to the unnormalized value by canceling the normalization of the output vector (non-normalization). Alternatively, the prediction unit 13 may convert at least a part of the elements of the output vector or the denormalized output vector in the normalized state into text.

あるいは、予測部１３は出力ベクトルの要素から導出可能な更なるデータを生成してもよい。例えば、予測部１３は医療処置に関する各要素の値と、各医療処置（各診察、各治療、および各処方薬）の単価（医療報酬）とに基づいて、将来の所定の時期における医療費の推定値を求めてもよい。 Alternatively, the prediction unit 13 may generate additional data that can be derived from the elements of the output vector. For example, the prediction unit 13 determines the medical expenses at a predetermined time in the future based on the value of each element related to the medical procedure and the unit price (medical fee) of each medical procedure (each medical examination, each treatment, and each prescription drug). Estimates may be obtained.

予測部１３は出力ベクトルそのもの、加工された出力ベクトル、および出力ベクトルから導出されたデータのうちの少なくとも一部を予測結果として出力する。出力方法は限定されない。例えば、予測部１３は予測結果をモニタ上に表示させたりプリンタで印刷したりしてもよいし、所定のデータベースに予測結果を格納してもよいし、他のコンピュータシステムに予測結果を送信してもよい。 The prediction unit 13 outputs at least a part of the output vector itself, the processed output vector, and the data derived from the output vector as the prediction result. The output method is not limited. For example, the prediction unit 13 may display the prediction result on a monitor or print it with a printer, may store the prediction result in a predetermined database, or transmit the prediction result to another computer system. You may.

このように、予測結果の形式および出力方法は限定されない。いずれにしても、予測システム１０のユーザはその予測結果を参照することで、対象者が将来いつどのような医療処置を受けるかの見当を付けることができる。また、ユーザは対象者がいつ治療を止めるか、将来どのくらい医療費が掛かるかなども知ることが可能である。さらに、ユーザは対象者の身体の状態を示す検査値が将来どのように推移するかを知ることも可能である。 As described above, the format and output method of the prediction result are not limited. In any case, the user of the prediction system 10 can estimate when and what kind of medical treatment the subject will receive in the future by referring to the prediction result. The user can also know when the subject will stop treatment and how much medical expenses will be incurred in the future. Furthermore, the user can also know how the test value indicating the physical condition of the subject will change in the future.

次に、図３を参照しながら、予測システム１０の動作を説明するとともに本実施形態に係る予測方法について説明する。図３は予測システム１０の動作を示すフローチャートである。 Next, the operation of the prediction system 10 will be described with reference to FIG. 3, and the prediction method according to the present embodiment will be described. FIG. 3 is a flowchart showing the operation of the prediction system 10.

まず、取得部１１が対象者の過去の受診記録（レセプトデータ、健診データ、およびアンケートデータ）をデータベース群２０から取得する（ステップＳ１１、取得ステップ）。 First, the acquisition unit 11 acquires the past consultation records (receipt data, medical examination data, and questionnaire data) of the subject from the database group 20 (step S11, acquisition step).

続いて、変換部１２がその受診記録に基づいて複数の入力ベクトルを生成する（ステップＳ１２、変換ステップ）。まず、変換部１２はその受診記録に基づいて入力ベクトルの集合を生成する（ステップＳ１２１）。必要であれば、この処理において変換部１２は非数値のデータを数値に変換したりパディングを実行したりする。続いて、変換部１２は各入力ベクトルの要素群を正規化し（ステップＳ１２２）、必要に応じて、正規化された入力ベクトルに対してスパース正則化または圧縮を実行する（ステップＳ１２３）。 Subsequently, the conversion unit 12 generates a plurality of input vectors based on the consultation record (step S12, conversion step). First, the conversion unit 12 generates a set of input vectors based on the consultation record (step S121). If necessary, in this process, the conversion unit 12 converts non-numeric data into numerical values and executes padding. Subsequently, the conversion unit 12 normalizes the element group of each input vector (step S122), and executes sparse regularization or compression on the normalized input vector as needed (step S123).

続いて、予測部１３が、少なくとも正規化が施された各入力ベクトルを学習アルゴリズムに入力することで、将来の所定の時期（対象時期）における対象者の状態の予測（例えば受診予測）を示す出力ベクトルを求める（ステップＳ１３）。予測部１３は入力ベクトルの集合を用いて学習アルゴリズムを学習させ、さらにその学習結果を検証することで、最良の学習アルゴリズムを得る（ステップＳ１３１）。そして、予測部１３は現在から対象時期に向かって、所定の単位期間ごとに出力ベクトルを求めていく。現時点をｔ_ｃ、単位期間をｋとし、将来の時期をｔ_ｃ＋ｉｋ（ｉ＝１，…，Ｎ）で表すとする。予測部１３はまず将来の時期（ｔ_ｃ＋１ｋ）における状態の予測（例えば受診予測）を示す出力ベクトルを求め、対象時期（ｔ_ｃ＋Ｎｋ）に至るまで変数ｉを増分しながら出力ベクトルの計算を繰り返す（ステップＳ１３２〜Ｓ１３５）。最後に、予測部１３はその対象時期における出力ベクトルに基づく予測結果を出力する（ステップＳ１４）。 Subsequently, the prediction unit 13 inputs at least each normalized input vector into the learning algorithm to show the prediction of the state of the subject (for example, consultation prediction) at a predetermined time (target time) in the future. Obtain the output vector (step S13). The prediction unit 13 learns a learning algorithm using a set of input vectors, and further verifies the learning result to obtain the best learning algorithm (step S131). Then, the prediction unit 13 obtains the output vector for each predetermined unit period from the present to the target time. Let t _{c be} the current time, k be the unit period, and be t _c + ik (i = 1, ..., N) for the future time. The prediction unit 13 first obtains an output vector indicating a state prediction (for example, consultation prediction) in a _{future time (t c} + 1k), and calculates the output vector while incrementing the variable i until the _{target time (t c + Nk).} Repeat (steps S132 to S135). Finally, the prediction unit 13 outputs a prediction result based on the output vector at the target time (step S14).

次に、図４を参照しながら、少なくとも１台のコンピュータを予測システム１０として機能させるための予測プログラムＰ１を説明する。図４は予測プログラムＰ１の構成を示す図である。 Next, the prediction program P1 for making at least one computer function as the prediction system 10 will be described with reference to FIG. FIG. 4 is a diagram showing the configuration of the prediction program P1.

予測プログラムＰ１はメインモジュールＰ１０、取得モジュールＰ１１、変換モジュールＰ１２、および予測モジュールＰ１３を含む。メインモジュールＰ１０は受診予測を統括的に管理する部分である。取得モジュールＰ１１、変換モジュールＰ１２、および予測モジュールＰ１３を実行することで、取得部１１、変換部１２、および予測部１３が実現する。 The prediction program P1 includes a main module P10, an acquisition module P11, a conversion module P12, and a prediction module P13. The main module P10 is a part that comprehensively manages the consultation prediction. By executing the acquisition module P11, the conversion module P12, and the prediction module P13, the acquisition unit 11, the conversion unit 12, and the prediction unit 13 are realized.

予測プログラムＰ１は、例えば、ＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、半導体メモリなどの有形の記録媒体に固定的に記録された上で提供されてもよい。あるいは、予測プログラムＰ１は、搬送波に重畳されたデータ信号として通信ネットワークを介して提供されてもよい。 The prediction program P1 may be provided after being fixedly recorded on a tangible recording medium such as a CD-ROM, a DVD-ROM, or a semiconductor memory. Alternatively, the prediction program P1 may be provided via a communication network as a data signal superimposed on a carrier wave.

以上説明したように、本発明の一側面に係る予測システムは、対象者の過去の受診記録を所定のデータベースから取得する取得部であって、該受診記録が少なくとも医療処置を示す、該取得部と、受診記録に基づいて複数の入力ベクトルを生成し、各入力ベクトルの要素群を所定の数値範囲内で正規化する変換部であって、要素群が、医療処置を示す要素を少なくとも含む、該変換部と、正規化された各入力ベクトルを学習アルゴリズムに入力することで、将来の所定の時期における対象者の状態の予測を示す出力ベクトルを求める予測部であって、出力ベクトルの要素群が入力ベクトルの要素群に対応する、該予測部とを備える。 As described above, the prediction system according to one aspect of the present invention is an acquisition unit that acquires past consultation records of a subject from a predetermined database, and the acquisition unit indicates at least medical treatment. A conversion unit that generates a plurality of input vectors based on a medical examination record and normalizes the element group of each input vector within a predetermined numerical range, and the element group includes at least an element indicating a medical treatment. It is a prediction unit that obtains an output vector that indicates the prediction of the state of the target person at a predetermined time in the future by inputting the conversion unit and each normalized input vector to the learning algorithm, and is an element group of the output vector. Is provided with the prediction unit corresponding to the element group of the input vector.

このような側面においては、対象者の受診記録を表現した入力ベクトルを正規化してからその入力ベクトルを学習アルゴリズムに入力することで、対象者の将来の状態の予測を示す出力ベクトルが得られる。入力ベクトルは医療処置を示す要素を少なくとも含み、出力ベクトルはその入力ベクトルに対応する。医療処置を示す要素を少なくとも含む出力ベクトルを出力することで、人の状態をより精度良く予測することができる。例えば、ユーザは、出力ベクトルを参照または利用することで、将来の所定の時期において対象者がどのような医療処置を受けるかを知ることができる（すなわち、対象者の受診の予測が可能になる）。 In such an aspect, by normalizing the input vector expressing the consultation record of the subject and then inputting the input vector into the learning algorithm, an output vector showing the prediction of the future state of the subject can be obtained. The input vector contains at least an element indicating a medical procedure, and the output vector corresponds to the input vector. By outputting an output vector containing at least an element indicating a medical procedure, a person's condition can be predicted more accurately. For example, the user can refer to or use the output vector to know what kind of medical treatment the subject will receive at a predetermined time in the future (that is, it is possible to predict the visit of the subject. ).

医療処置を示す値はテキストや数値などの様々な形式で表現されるので、これらの値を単に数値化して学習アルゴリズムに入力するだけでは、計算の精度が上がらなかったり計算に時間が掛かったりする可能性がある。しかし、医療処置を示す要素を含む入力ベクトルを正規化することで、多種多様なデータを使って求める必要がある対象者の将来の状態の予測を精度良くかつ短時間で行うことが可能になる。 Values indicating medical procedures are expressed in various formats such as texts and numerical values, so simply digitizing these values and inputting them into the learning algorithm may not improve the accuracy of the calculation or take a long time to calculate. there is a possibility. However, by normalizing the input vector containing the elements indicating the medical procedure, it becomes possible to accurately and quickly predict the future state of the subject, which needs to be obtained using a wide variety of data. ..

他の側面に係る予測システムでは、変換部が、正規化された入力ベクトルの要素群のうち、医療処置を示す要素に対してスパース正則化を実行してもよい。医療処置については非常に多くの種類が存在するので、医療処置を示す要素の数も非常に多くなる。この要素をスパース正則化することで、値が０である要素が増えるので、過学習を避けつつ学習の精度を上げたり、計算時間を短縮したりすることができる。 In the prediction system according to the other aspect, the transforming unit may perform sparse regularization on the elements indicating the medical procedure in the elements of the normalized input vector. Since there are so many types of medical procedures, the number of elements that indicate medical procedures is also very large. By making this element sparse regularized, the number of elements having a value of 0 increases, so that it is possible to improve the learning accuracy and shorten the calculation time while avoiding overfitting.

他の側面に係る予測システムでは、予測部が、出力ベクトルに基づいて、将来の所定の時期における医療費を求めてもよい。この場合には、将来の医療費を予測することができる。 In the prediction system according to the other aspect, the prediction unit may obtain the medical expenses at a predetermined time in the future based on the output vector. In this case, future medical costs can be predicted.

以上、本発明をその実施形態に基づいて詳細に説明した。しかし、本発明は上記実施形態に限定されるものではない。本発明は、その要旨を逸脱しない範囲で様々な変形が可能である。 The present invention has been described in detail above based on the embodiment. However, the present invention is not limited to the above embodiment. The present invention can be modified in various ways without departing from the gist thereof.

上記実施形態では受診記録がレセプトデータ、健診データ、およびアンケートデータで構成されたが、健診データおよびアンケートデータの少なくとも一方が省略されてもよい。あるいは、受診記録は、少なくとも、レセプトデータと、人が過去に受けた運動能力測定の結果を示す運動データとから構成されてもよい。したがって、受診記録はレセプトデータ、健診データ、アンケートデータ、および運動データから構成されてもよい。 In the above embodiment, the consultation record is composed of the receipt data, the medical examination data, and the questionnaire data, but at least one of the medical examination data and the questionnaire data may be omitted. Alternatively, the consultation record may be composed of at least the receipt data and the exercise data showing the result of the exercise ability measurement that the person has received in the past. Therefore, the medical examination record may be composed of medical receipt data, medical examination data, questionnaire data, and exercise data.

少なくとも一つのプロセッサにより実行される予測方法の処理手順は上記実施形態での例に限定されない。例えば、上述したステップ（処理）の一部が省略されてもよいし、別の順序で各ステップが実行されてもよい。また、上述したステップのうちの任意の２以上のステップが組み合わされてもよいし、ステップの一部が修正又は削除されてもよい。あるいは、上記の各ステップに加えて他のステップが実行されてもよい。 The processing procedure of the prediction method executed by at least one processor is not limited to the example in the above embodiment. For example, some of the steps (processes) described above may be omitted, or each step may be executed in a different order. Further, any two or more steps among the above-mentioned steps may be combined, or a part of the steps may be modified or deleted. Alternatively, other steps may be performed in addition to each of the above steps.

予測システム内で二つの数値の大小関係を比較する際には、「以上」および「よりも大きい」という二つの基準のどちらを用いてもよく、「以下」および「未満」という二つの基準のうちのどちらを用いてもよい。このような基準の選択は、二つの数値の大小関係を比較する処理についての技術的意義を変更するものではない。 When comparing the magnitude relations of two numbers in a forecasting system, either of the two criteria "greater than or equal to" and "greater than" may be used, and the two criteria "less than or equal to" and "less than" Either of them may be used. The selection of such criteria does not change the technical significance of the process of comparing the magnitude relations of two numbers.

１０…予測システム、１１…取得部、１２…変換部、１３…予測部、２０…データベース群、２１…レセプトデータベース、２２…健診データベース、２３…アンケートデータベース、Ｐ１…予測プログラム、Ｐ１０…メインモジュール、Ｐ１１…取得モジュール、Ｐ１２…変換モジュール、Ｐ１３…予測モジュール。 10 ... Prediction system, 11 ... Acquisition unit, 12 ... Conversion unit, 13 ... Prediction unit, 20 ... Database group, 21 ... Receipt database, 22 ... Medical examination database, 23 ... Questionnaire database, P1 ... Prediction program, P10 ... Main module , P11 ... Acquisition module, P12 ... Conversion module, P13 ... Prediction module.

Claims

An acquisition unit that acquires the past consultation record of the subject from a predetermined database, and the acquisition unit that the consultation record indicates at least a medical procedure.
A conversion unit that generates a plurality of input vectors based on the consultation record and normalizes the element group of each input vector within a predetermined numerical range, and the element group includes at least an element indicating the medical procedure. , The conversion unit and
Wherein by inputting the normalized set a learning algorithm for each input vector has been, a prediction unit for obtaining the final output vector indicating a prediction of the subject's status at a future predetermined period, the final The element group of the output vector corresponds to the element group of the input vector, and the prediction unit is provided .
The prediction unit
The period from the present to the predetermined time is divided into a plurality of unit periods.
An intermediate output vector showing a prediction of the state of the subject at the time when the unit period elapses while changing the set for each of the plurality of unit periods in order from the present to the predetermined time. Is executed to obtain the final output vector, and the final output vector is obtained.
In the iterative process, the set is input to the learning algorithm to acquire the intermediate output vector at the elapse of the i-th unit period, and then the oldest input vector in the set is removed and the set is removed. By inputting the set modified by adding the acquired intermediate output vector to the learning algorithm, the process of acquiring the intermediate output vector at the elapse of the (i + 1) th unit period is included.
Prediction system.

The transforming unit performs sparse regularization on the element indicating the medical procedure in the element group of the normalized input vector.
The prediction system according to claim 1.

The prediction unit obtains medical expenses at a predetermined time in the future based on the final output vector.
The prediction system according to claim 1 or 2.

A prediction method performed by a prediction system with at least one processor.
An acquisition step of acquiring a subject's past consultation record from a predetermined database, wherein the consultation record indicates at least a medical procedure.
A conversion step in which a plurality of input vectors are generated based on the consultation record and the element groups of each input vector are normalized within a predetermined numerical range, wherein the element group includes at least an element indicating the medical procedure. , The conversion step and
By inputting the set of each normalized input vector into the learning algorithm, it is a prediction step of obtaining a final output vector indicating the prediction of the state of the subject at a predetermined time in the future , which is the final step. element group of minimal output vector corresponding to the element group of the input vector, see contains the said prediction step,
In the prediction step
The period from the present to the predetermined time is divided into a plurality of unit periods.
An intermediate output vector showing a prediction of the state of the subject at the time when the unit period elapses while changing the set for each of the plurality of unit periods in order from the present to the predetermined time. Is executed to obtain the final output vector, and the final output vector is obtained.
In the iterative process, the set is input to the learning algorithm to acquire the intermediate output vector at the elapse of the i-th unit period, and then the oldest input vector in the set is removed and the set is removed. By inputting the set modified by adding the acquired intermediate output vector to the learning algorithm, the process of acquiring the intermediate output vector at the elapse of the (i + 1) th unit period is included.
Prediction method.

An acquisition step of acquiring a subject's past consultation record from a predetermined database, wherein the consultation record indicates at least a medical procedure.
A conversion step in which a plurality of input vectors are generated based on the consultation record and the element groups of each input vector are normalized within a predetermined numerical range, wherein the element group includes at least an element indicating the medical procedure. , The conversion step and
By inputting the set of each normalized input vector into the learning algorithm, it is a prediction step of obtaining a final output vector indicating the prediction of the state of the subject at a predetermined time in the future , which is the final step. The computer system is made to perform the prediction step in which the elements of the output vector correspond to the elements of the input vector.
In the prediction step
The period from the present to the predetermined time is divided into a plurality of unit periods.
An intermediate output vector showing a prediction of the state of the subject at the time when the unit period elapses while changing the set for each of the plurality of unit periods in order from the present to the predetermined time. Is executed to obtain the final output vector, and the final output vector is obtained.
In the iterative process, the set is input to the learning algorithm to acquire the intermediate output vector at the elapse of the i-th unit period, and then the oldest input vector in the set is removed and the set is removed. By inputting the set modified by adding the acquired intermediate output vector to the learning algorithm, the process of acquiring the intermediate output vector at the elapse of the (i + 1) th unit period is included.
Prediction program.