JP2018140172A

JP2018140172A - Data collection device and data collection method

Info

Publication number: JP2018140172A
Application number: JP2018032317A
Authority: JP
Inventors: 越智　大介; Daisuke Ochi; 大介越智; 隆史山内; Takashi Yamauchi; 淑希恒元; Yoshiki Tsunemoto; 檜山　聡; Satoshi Hiyama; 聡檜山; 準一菅原; Junichi Sugawara; 理宇山下; Riu Yamashita; 修田邉; Osamu Tanabe; 正朗長崎; Masao Nagasaki
Original assignee: Tohoku University NUC; NTT Docomo Inc
Current assignee: Tohoku University NUC; NTT Docomo Inc
Priority date: 2017-02-28
Filing date: 2018-02-26
Publication date: 2018-09-13

Abstract

PROBLEM TO BE SOLVED: To provide a data collection technique for improving accuracy of predicting a premature delivery.SOLUTION: One embodiment of this invention relates to a data collection device which has: a data acquisition function of acquiring data on a pregnant woman from plural data source devices; a premature delivery prediction function of executing prediction of a premature delivery by the pregnant woman from the acquired data on the basis of a premature delivery prediction model; and a result report function of reporting a result of the premature delivery prediction. The data acquisition function acquires the data at plural points of time in a data acquisition period from any time around the pregnancy of the pregnant woman to any time around a delivery.SELECTED DRAWING: Figure 1

Description

本発明は、一般に医療診断技術に関し、より詳細には、非侵襲的に早産の発症予測又は異常検出を行うための早産発症有無予測のためのデータ収集装置及び方法に関する。 The present invention generally relates to medical diagnostic techniques, and more particularly to a data collection apparatus and method for predicting the onset of premature birth for noninvasively predicting the onset or detecting abnormalities of premature birth.

妊娠関連の疾患は、従来は予測が困難であった。つまり、妊婦関連疾患の発症は、遺伝的要因だけでなく環境要因が複雑に関与するとされているが、環境要因の把握が困難であったためである。そのため、未だに原因が特定されずに予防方法が確立されていない妊婦関連疾患が多い。 Pregnancy-related diseases have been difficult to predict. In other words, the onset of maternally related diseases is not only genetic factors but also environmental factors are involved in a complicated manner, but it is difficult to grasp environmental factors. For this reason, there are many pregnant-related diseases for which no cause has yet been identified and no preventive method has been established.

現時点の産科診療においては、受診時に取得される検査情報、病歴聴取及び／又は診察情報による発症予測が主体であり、在宅中などの連続した生活環境情報を取得することは困難であった。疾患発症に関する研究において、妊婦の生活環境情報を取得する方法として従来は調査票による調査を実施することが主な手法であった。しかしこの方法には、妊婦の記憶が不正確で回答に誤りがあること（想起バイアス）による精度の問題があり、また実施間隔が早くて半年に１回程度と、取得頻度の問題等があった。そのため、妊婦の生活の実態を捉えて疾患との因果関係を明らかにするには不十分な方法であった。今後、疾患の発症予測を精緻に行うためには、受診時以外の連続環境情報を取得し、それらと受診時情報を統合的に解析することが必要であるが、これらの情報についてはこれまで入手困難だったため、妊娠に関連した疾患の発症予測には利用されてこなかった。 In the present obstetric practice, the onset prediction is mainly based on examination information, medical history and / or examination information obtained at the time of consultation, and it is difficult to obtain continuous living environment information such as at home. In the research on the onset of disease, the main method for obtaining the living environment information of pregnant women has been to conduct surveys using questionnaires. However, this method has problems with accuracy due to inaccurate memory of pregnant women and incorrect answers (recall bias), and there is a problem with the frequency of acquisition, such as once every six months, because the implementation interval is early. It was. Therefore, it was an inadequate method for capturing the actual life of pregnant women and clarifying the causal relationship with the disease. In the future, in order to accurately predict the onset of disease, it will be necessary to acquire continuous environmental information other than at the time of consultation and to analyze the information at the time of consultation in an integrated manner. Because it was difficult to obtain, it was not used to predict the onset of pregnancy related diseases.

早産は、原因不明が約3分の1を占める周産期医療上の代表的な疾患で、遺伝的要因と環境因子による複雑な相互作用により発症する多因子疾患である。 Premature birth is a typical perinatal medical disease whose cause is unknown by about one third, and is a multifactorial disease that develops through a complex interaction of genetic and environmental factors.

妊婦の生活環境に基づいた、疫学的な早産発症研究としては、例えば、妊娠前ＢＭＩ（ＢｏｄｙＭａｓｓＩｎｄｅｘ）を測定し、当該ＢＭＩの低さが早産のリスクを高めるという報告がある（先行技術文献１）。しかし、この研究では、単一のパラメータかつ、１時点の計測結果に基づいているため予測精度が調整オッズ比で２．９１であり、臨床的には十分な精度とは言えない。また、妊婦の睡眠の質に着目し、妊娠初期（妊娠１４週〜１６週）と妊娠後期（３０−３２週）においてその睡眠の質が悪いと早産のリスクを高めるという報告がある（先行技術文献２）。しかし、この研究も調査票による調査であり、想起バイアスが含まれるうえ、予測精度が妊娠初期のオッズ比で１．２５、妊娠後期のオッズ比で１．１８であり、臨床上の発症予測には不十分である。 As an epidemiological study of premature birth based on the living environment of pregnant women, for example, there is a report that BMI (Body Mass Index) before pregnancy is measured, and that the low BMI increases the risk of preterm birth (prior art documents). 1). However, in this study, since the prediction accuracy is 2.91 in terms of the adjusted odds ratio because it is based on a single parameter and the measurement result at one time point, it cannot be said that the accuracy is clinically sufficient. In addition, focusing on the quality of sleep of pregnant women, there is a report that if the quality of sleep is poor in early pregnancy (14-16 weeks of pregnancy) and late pregnancy (30-32 weeks), the risk of preterm birth is increased (prior art) Reference 2). However, this study is also a questionnaire survey, which includes recall bias, and the prediction accuracy is 1.25 for the odds of early pregnancy and 1.18 for the odds of late pregnancy. Is insufficient.

また、先行技術文献１および先行技術文献２はともにオッズ比の比較をしているのみであり、早産の発症リスクについては議論できるが、その妊婦の最終的な早産の有無を予測できているわけではない。 Prior art reference 1 and prior art reference 2 both compare only odds ratios and can discuss the risk of premature birth, but can predict the premature birth of the pregnant woman. is not.

"Prepregnancy body mass index is an independent risk factor for gestational hypertension, gestational diabetes, preterm labor, and small- and large-for-gestational-age infants", Dayeon Shin and Won O. Song, The Journal of Maternal-Fetal & Neonatal Medicine, Volume 28, 2015 (http://www.tandfonline.com/doi/abs/10.3109/14767058.2014.964675)"Prepregnancy body mass index is an independent risk factor for gestational hypertension, gestational diabetes, preterm labor, and small- and large-for-gestational-age infants", Dayeon Shin and Won O. Song, The Journal of Maternal-Fetal & Neonatal Medicine, Volume 28, 2015 (http://www.tandfonline.com/doi/abs/10.3109/14767058.2014.964675) """Poor Sleep Quality is Associated with Preterm Birth", Michele L. Okun and et al. (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3198204/)"" "Poor Sleep Quality is Associated with Preterm Birth", Michele L. Okun and et al. (Https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3198204/)

上述した問題点を鑑み、本発明の課題は、早産の予測精度を向上させるためのデータ収集技術を提供することである。 In view of the above-described problems, an object of the present invention is to provide a data collection technique for improving the accuracy of prediction of premature birth.

上記課題を解決するため、本発明の一態様は、複数のデータソース装置から妊婦に関するデータを取得するデータ取得機能と、早産予測モデルに基づき、前記取得したデータから前記妊婦の早産予測を実行する早産予測機能と、前記早産予測の結果を通知する結果通知機能と、を有し、前記データ取得機能は、前記妊婦の妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間において、前記データを複数の時点にて取得するデータ収集装置に関する。 In order to solve the above problems, according to one aspect of the present invention, based on a data acquisition function for acquiring data on a pregnant woman from a plurality of data source devices and a preterm birth prediction model, the preterm birth prediction of the pregnant woman is executed from the acquired data. A premature birth prediction function and a result notification function for notifying the result of the preterm birth prediction, wherein the data acquisition function acquires data from any time before and after pregnancy of the pregnant woman to any time before and after childbirth The present invention relates to a data collection device that acquires the data at a plurality of points in time.

本発明によると、早産の予測精度を向上させるためのデータ収集技術を提供することができる。 ADVANTAGE OF THE INVENTION According to this invention, the data collection technique for improving the prediction precision of premature birth can be provided.

図１は、本発明の一実施例によるシステム構成を示すブロック図である。FIG. 1 is a block diagram showing a system configuration according to an embodiment of the present invention. 図２は、本発明の一実施例による測定値転送装置の機能構成を示すブロック図である。FIG. 2 is a block diagram showing a functional configuration of the measurement value transfer apparatus according to one embodiment of the present invention. 図３は、本発明の一実施例による体調登録装置の機能構成を示すブロック図である。FIG. 3 is a block diagram showing a functional configuration of the physical condition registration device according to one embodiment of the present invention. 図４は、本発明の一実施例による気象情報取得装置の機能構成を示すブロック図である。FIG. 4 is a block diagram showing a functional configuration of a weather information acquisition apparatus according to an embodiment of the present invention. 図５は、本発明の一実施例による調査票登録装置の機能構成を示すブロック図である。FIG. 5 is a block diagram showing a functional configuration of the survey form registration apparatus according to an embodiment of the present invention. 図６は、本発明の一実施例による診療記録登録装置の機能構成を示すブロック図である。FIG. 6 is a block diagram showing a functional configuration of a medical record registration apparatus according to an embodiment of the present invention. 図７は、本発明の一実施例によるデータ収集装置の機能構成を示すブロック図である。FIG. 7 is a block diagram showing a functional configuration of the data collection device according to one embodiment of the present invention. 図８は、本発明の一実施例によるデータ収集処理を示すフローチャートである。FIG. 8 is a flowchart showing data collection processing according to an embodiment of the present invention. 図９は、本発明の一実施例による早産予測モデル生成処理を示すフローチャートである。FIG. 9 is a flowchart showing premature birth prediction model generation processing according to an embodiment of the present invention. 図１０は、妊娠２４週にて早産発症有無予測を実施した結果を示すグラフである。FIG. 10 is a graph showing the results of predicting the onset of preterm birth at 24 weeks of gestation. 図１１は、妊娠２８週にて早産発症有無予測を実施した結果を示すグラフである。FIG. 11 is a graph showing the results of predicting the onset of preterm birth at 28 weeks of gestation. 図１２は、妊娠３２週にて早産発症有無予測を実施した結果を示すグラフである。FIG. 12 is a graph showing the results of predicting the onset of preterm birth at 32 weeks of gestation. 図１３は、本発明の一実施例による特徴量抽出・分類モデル作成処理を示すフローチャートである。FIG. 13 is a flowchart showing a feature amount extraction / classification model creation process according to an embodiment of the present invention. 図１４は、本発明の一実施例による特徴量抽出・回帰モデル作成処理を示すフローチャートである。FIG. 14 is a flowchart showing a feature amount extraction / regression model creation process according to an embodiment of the present invention. 図１５は、本発明の一実施例による特徴量抽出・リスク予測モデル作成処理を示すフローチャートである。FIG. 15 is a flowchart showing a feature amount extraction / risk prediction model creation process according to an embodiment of the present invention.

以下、図面に基づいて本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

後述される実施例では、早産予測のためのデータ収集装置が開示される。ＢＭＩや睡眠の質など単一の指標による従来の早産予測では予測精度が低いため、以下の実施例によるデータ収集装置は、妊婦の生活環境のデータのうち、複数のデータを組み合わせることで予測精度を向上させる。また、データ収集装置は、取得するデータを１つの時点の測定値ではなく、例えば、連続的または定期的など複数の時点にて取得し、取得したデータから想起バイアスを極力排除することによって客観的なデータを利用する。 In an embodiment described later, a data collection device for predicting preterm birth is disclosed. Since the prediction accuracy is low in the conventional preterm birth prediction based on a single index such as BMI and sleep quality, the data collection device according to the following embodiment predicts the accuracy by combining a plurality of data among the data on the living environment of the pregnant woman. To improve. In addition, the data collection device acquires the data to be acquired at a plurality of time points, for example, continuously or periodically, instead of the measurement value at one time point, and objectively eliminates the recall bias from the acquired data as much as possible. Use simple data.

具体的には、以下の実施例によるデータ収集システムによると、妊婦の生活環境をより客観的により高頻度で取得することで妊婦関連疾患発症の環境要因を明らかにするために、妊娠初期または妊娠前より、以下のような多様なデータが取得される。
１）血圧計、体重計、活動量計、体温計などの計測装置の日々のデータ
２）日々の睡眠、便通、吐き気、痛み、子宮収縮、動悸、胎動間隔、食事内容、服薬内容の体調データ
３）妊婦の居住地の気温、天候、湿度、気圧、潮汐などの日々の生活環境データ
４）食習慣、睡眠状況、喫煙等の生活習慣に関する定期的な調査票データ
５）妊婦健診等の定期的な診療記録データ
これらのようなデータを取得することで、多種類のデータを利用可能となり、単一の指標に基づく従来の早産予測より予測精度を向上させることができる。また、上記１）〜３）に関して妊娠期間中やその前後において、１日数回から１週間に数回など高頻度に、または期間中連続的にデータを取得し、さらに、上記１）、３）などによる客観的なデータ取得と、２）による日々の登録において妊婦が記憶するべき間隔を短くすることによる想起バイアスの排除又は低減を実現することができる。なお、上記のデータのうち、１）〜３）のデータ項目はライフログと定義される。 Specifically, according to the data collection system according to the following embodiment, in order to clarify the environmental factors of pregnancy-related diseases by acquiring the living environment of pregnant women more objectively and more frequently, The following various data are acquired from before.
1) Daily data of blood pressure meters, weight scales, activity meters, thermometers and other measuring devices 2) Physical condition data of daily sleep, bowel movement, nausea, pain, uterine contraction, palpitation, fetal movement interval, meal content, medication content 3 ) Daily living environment data such as temperature, weather, humidity, barometric pressure, tide, etc. of the pregnant woman's residence 4) Regular questionnaire data on lifestyle habits such as eating habits, sleeping conditions, smoking 5) Regular pregnancy checkups, etc. Medical record data By acquiring such data, it is possible to use many kinds of data, and the prediction accuracy can be improved compared to the conventional preterm birth prediction based on a single index. In addition, during 1) to 3) above, during and after the pregnancy period, data is acquired frequently, such as several times a day to several times a week, or continuously during the period. It is possible to eliminate or reduce the recall bias by shortening the interval that a pregnant woman should memorize in objective data acquisition by the above and daily registration in 2). Of the above data, the data items 1) to 3) are defined as life logs.

これにより、本発明では、早産予測モデルを作成するために、これらのライフログや調査票、診療記録を取得し、妊婦の環境情報を充実させることができる。つまり、本発明は、非侵襲的に早産の発症予測又は異常検出のために、妊婦の妊娠期間中や産後に日々のライフログとして、例えば、血圧計、体重体組成計、活動量計、体温計等で計測した客観データ、日々の睡眠の質、便通、吐き気、痛み、子宮収縮、動悸、胎動、食事、服薬などの妊婦の体調に関するデータを取得すると共に、妊婦の居住地の気温、天候、湿度、気圧、潮汐などの客観データなどのうち１つ以上のデータ項目に関するデータを取得し、これらを組み合わせて解析することによって、より高精度な早産予測を実現することができる。 Thereby, in this invention, in order to create a preterm birth prediction model, these life logs, survey forms, and medical records can be acquired, and environmental information of pregnant women can be enriched. That is, the present invention is a non-invasive method for predicting the onset of premature birth or detecting abnormalities, such as a blood pressure meter, a body composition monitor, an activity meter, and a thermometer as daily life logs during pregnancy and after delivery of pregnant women. Objective data measured by, etc., daily sleep quality, bowel movement, nausea, pain, uterine contraction, palpitations, fetal movement, diet, medication, etc. By acquiring data related to one or more data items from objective data such as humidity, atmospheric pressure, and tide, and analyzing them in combination, it is possible to realize more accurate preterm birth prediction.

まず、図１を参照して、本発明の一実施例によるデータ収集システムを説明する。図１は、本発明の一実施例によるシステム構成を示すブロック図である。 First, a data collection system according to an embodiment of the present invention will be described with reference to FIG. FIG. 1 is a block diagram showing a system configuration according to an embodiment of the present invention.

図１に示されるように、データ収集システム１０は、計測装置２０、測定値転送装置３０、体調登録装置４０、気象情報取得装置５０、調査票登録装置６０、診療記録登録装置７０及びデータ収集装置１００を有する。なお、図示されたシステム構成は一例にすぎず、一部の装置は省かれてもよいし、あるいは、他の装置が追加されてもよい。例えば、データ収集システム１０は、計測装置２０又は体調登録装置４０のいずれか一方又は両方、診療記録登録装置７０及びデータ収集装置１００を有し、任意的に、測定値転送装置３０、気象情報取得装置５０及び／又は調査票登録装置６０を有してもよい。 As shown in FIG. 1, the data collection system 10 includes a measurement device 20, a measurement value transfer device 30, a physical condition registration device 40, a weather information acquisition device 50, a survey form registration device 60, a medical record registration device 70, and a data collection device. 100. The illustrated system configuration is merely an example, and some devices may be omitted, or other devices may be added. For example, the data collection system 10 includes one or both of the measurement device 20 and the physical condition registration device 40, the medical record registration device 70, and the data collection device 100. Optionally, the measurement value transfer device 30, the weather information acquisition You may have the apparatus 50 and / or the survey form registration apparatus 60. FIG.

計測装置２０は、血圧計、体重計、活動量計、体温計などを含み、妊婦の血圧等の生理データを計測する。 The measuring device 20 includes a sphygmomanometer, a weight scale, an activity meter, a thermometer, and the like, and measures physiological data such as blood pressure of a pregnant woman.

測定値転送装置３０は、計測装置２０とデータ収集装置１００との間でデータを転送する。なお、計測装置２０自体が測定値転送装置３０の機能を含み、データ収集装置１００に計測値を直接送信するようにしてもよい。 The measurement value transfer device 30 transfers data between the measurement device 20 and the data collection device 100. Note that the measurement device 20 itself may include the function of the measurement value transfer device 30, and the measurement value may be directly transmitted to the data collection device 100.

体調登録装置４０は、妊婦が日々の体調情報を登録するための装置であり、例えば、スマートフォン及びアプリケーションに実装可能である。具体的には、妊婦は、自らの体調情報を体調登録装置４０に入力する。ここで、体調情報は、例えば、睡眠、便通、吐き気、痛み、子宮収縮、動悸、胎動間隔、食事内容、服薬内容などのデータ項目を含むものであってもよいが、これらに限定されない。 The physical condition registration device 40 is a device for a pregnant woman to register daily physical condition information, and can be implemented in, for example, a smartphone and an application. Specifically, the pregnant woman inputs her physical condition information into the physical condition registration apparatus 40. Here, the physical condition information may include, for example, data items such as sleep, bowel movement, nausea, pain, uterine contraction, palpitation, fetal movement interval, meal content, and medication content, but is not limited thereto.

気象情報取得装置５０は、一般に公開されている気象情報のデータベース（図示せず）から、妊婦の居住地における気象情報を取得する。ここで、気象情報は、例えば、所定の期間毎（１時間単位、１日単位など）の天候、降水確率、気温、湿度、風速、風向、気圧、潮汐などのデータ項目を含むものであってもよいが、これらに限定されない。 The meteorological information acquisition device 50 acquires meteorological information on a pregnant woman's residence from a publicly available weather information database (not shown). Here, the weather information includes, for example, data items such as weather, precipitation probability, temperature, humidity, wind speed, wind direction, atmospheric pressure, tide, etc. every predetermined period (one hour unit, one day unit, etc.). However, it is not limited to these.

調査票登録装置６０は、妊婦からの定期的な調査票の情報を登録するための装置である。具体的には、入力担当者が、妊婦から取得した調査票の情報を調査票登録装置６０に登録する。ここで、調査票の情報は、例えば、妊婦やその家族の家族構成、妊婦や家族の既往歴、妊娠出産歴、食生活、睡眠状況、服薬状況、喫煙飲酒状況、運動状況、勤務状況、精神的状況などのデータ項目を含むものであってもよいが、これらに限定されない。 The survey form registration device 60 is a device for registering information on regular survey forms from pregnant women. Specifically, the person in charge registers information on the survey form acquired from the pregnant woman in the survey form registration device 60. Here, the information on the questionnaire includes, for example, the family composition of pregnant women and their families, the history of pregnant women and their families, pregnancy and childbirth history, dietary habits, sleep status, medication status, smoking and drinking status, exercise status, work status, mental status. It may include data items such as a target situation, but is not limited thereto.

診療記録登録装置７０は、妊婦の出産時の診療記録を登録するための装置である。また、任意で妊婦の定期的な妊婦健診等の診療情報が更に診療記録登録装置７０に登録されてもよい。 The medical record registration device 70 is a device for registering a medical record at the time of delivery of a pregnant woman. In addition, medical information such as a regular pregnant medical checkup of a pregnant woman may optionally be registered in the medical record registration apparatus 70.

データ収集装置１００は、以下で詳細に説明されるように、計測装置２０、測定値転送装置３０、体調登録装置４０、気象情報取得装置５０、調査票登録装置６０及び診療記録登録装置７０の１つ以上からネットワーク２００を介し上述したデータ項目に関するデータを収集し、早産の発症予測モデルを生成し、予測を実施する。ネットワーク２００については単一のネットワークではなく、複数から構成されていても良い。 As will be described in detail below, the data collection device 100 is one of the measurement device 20, the measurement value transfer device 30, the physical condition registration device 40, the weather information acquisition device 50, the survey form registration device 60, and the medical record registration device 70. Data on the above-described data items is collected from the network 200 through the network 200, a premature birth onset prediction model is generated, and prediction is performed. The network 200 may be composed of a plurality of networks instead of a single network.

なお、上述した各装置は、典型的には、コンピュータ、スマートフォン、タブレット、サーバなどの通信機能を備えた情報処理装置により実現されてもよい。例えば、情報処理装置に搭載されたプロセッサが、メモリ装置に格納されたデータやプログラムを処理及び実行することによって、後述される各種機能及び処理を実行する。しかしながら、各装置は、何れか特定のハードウェア構成に限定されるものでなく、適切なハードウェア構成により実現されてもよい。 Each device described above may be typically realized by an information processing device having a communication function such as a computer, a smartphone, a tablet, or a server. For example, a processor mounted on the information processing apparatus executes various functions and processes to be described later by processing and executing data and programs stored in the memory device. However, each device is not limited to any specific hardware configuration, and may be realized by an appropriate hardware configuration.

次に、図２を参照して、本発明の一実施例による測定値転送装置を説明する。図２は、本発明の一実施例による測定値転送装置の機能構成を示すブロック図である。 Next, a measured value transfer apparatus according to an embodiment of the present invention will be described with reference to FIG. FIG. 2 is a block diagram showing a functional configuration of the measurement value transfer apparatus according to one embodiment of the present invention.

図２に示されるように、測定値転送装置３０は、データ取得機能３１、データ転送機能３２、計測装置認証機能３３及び利用者認証機能３４を有する。 As shown in FIG. 2, the measurement value transfer device 30 includes a data acquisition function 31, a data transfer function 32, a measurement device authentication function 33, and a user authentication function 34.

データ取得機能３１は、１つ以上の計測装置２０と通信し、計測装置２０からデータ及び測定値を取得する。なお、測定値転送装置３０と計測装置２０とは、以下に限定することなく、Ｂｌｕｅｔｏｏｔｈ（登録商標）、Ｗｉ−Ｆｉ、ＮＦＣ、ＺｉｇＢｅｅ、ＵＳＢ接続、モバイル通信（３Ｇ、ＬＴＥ、４Ｇ、５Ｇなど）などの通信方式により実現されてもよい。例えば、計測装置２０から取得されるデータ項目は、以下に限定されることなく、血圧計、体重計、活動量計、体温計などの妊婦に関する生理データであってもよい。 The data acquisition function 31 communicates with one or more measurement devices 20 and acquires data and measurement values from the measurement devices 20. The measurement value transfer device 30 and the measurement device 20 are not limited to the following, but are not limited to Bluetooth (registered trademark), Wi-Fi, NFC, ZigBee, USB connection, mobile communication (3G, LTE, 4G, 5G, etc.) It may be realized by a communication method such as For example, the data items acquired from the measurement device 20 are not limited to the following, and may be physiological data related to pregnant women such as a blood pressure monitor, a weight scale, an activity meter, and a thermometer.

データ転送機能３２は、取得したデータ及び測定値をネットワーク２００を介しデータ収集装置１００に転送する。当該データは、妊娠期間中やその前後において、１日数回から１週間に数回など高頻度に、または期間中連続的に取得され、データ収集装置１００に転送されてもよい。 The data transfer function 32 transfers the acquired data and measurement values to the data collection device 100 via the network 200. The data may be acquired at a high frequency such as several times a day to several times a week during or around the pregnancy period, or continuously during the period, and may be transferred to the data collection device 100.

計測装置認証機能３３は、通信先の計測装置２０を認証する。 The measurement device authentication function 33 authenticates the communication device 20 as a communication destination.

利用者認証機能３４は、妊婦などの測定値転送装置３０の利用者を認証する。 The user authentication function 34 authenticates a user of the measurement value transfer device 30 such as a pregnant woman.

次に、図３を参照して、本発明の一実施例による体調登録装置を説明する。図３は、本発明の一実施例による体調登録装置の機能構成を示すブロック図である。 Next, a physical condition registration device according to an embodiment of the present invention will be described with reference to FIG. FIG. 3 is a block diagram showing a functional configuration of the physical condition registration device according to one embodiment of the present invention.

図３に示されるように、体調登録装置４０は、データ登録機能４１、データ転送機能４２及び利用者認証機能４３を有する。 As shown in FIG. 3, the physical condition registration device 40 includes a data registration function 41, a data transfer function 42, and a user authentication function 43.

データ登録機能４１は、妊婦などの利用者からの入力内容をデータとして登録する。例えば、妊婦から入力されるデータ項目は、以下に限定されることなく、日々の睡眠、便通、吐き気、痛み、子宮収縮、動悸、胎動間隔、食事内容、服薬内容などの妊婦の体調データであってもよい。 The data registration function 41 registers input contents from a user such as a pregnant woman as data. For example, data items input from pregnant women are not limited to the following, but are physical condition data of pregnant women such as daily sleep, bowel movements, nausea, pain, uterine contraction, palpitation, fetal movement interval, meal content, medication content, etc. May be.

データ転送機能４２は、登録されたデータをデータ収集装置１００に転送する。当該データは、妊娠期間中やその前後において、１日数回から１週間に数回など高頻度に、または期間中連続的に取得され、データ収集装置１００に転送されてもよい。 The data transfer function 42 transfers the registered data to the data collection device 100. The data may be acquired at a high frequency such as several times a day to several times a week during or around the pregnancy period, or continuously during the period, and may be transferred to the data collection device 100.

利用者認証機能４３は、妊婦などの体調登録装置の利用者を認証する。 The user authentication function 43 authenticates a user of a physical condition registration device such as a pregnant woman.

次に、図４を参照して、本発明の一実施例による気象情報取得装置を説明する。図４は、本発明の一実施例による気象情報取得装置の機能構成を示すブロック図である。 Next, a weather information acquisition apparatus according to an embodiment of the present invention will be described with reference to FIG. FIG. 4 is a block diagram showing a functional configuration of a weather information acquisition apparatus according to an embodiment of the present invention.

図４に示されるように、気象情報取得装置５０は、データ取得機能５１及びデータ転送機能５２を有する。 As shown in FIG. 4, the weather information acquisition apparatus 50 has a data acquisition function 51 and a data transfer function 52.

データ取得機能５１は、気象情報データベースから気象情報を取得する。気象情報データベースには、例えば、一般公開されているデータベースがある。例えば、気象情報は、以下に限定されることなく、妊婦の居住地の気温、天候、湿度、気圧、潮汐などの生活環境データであってもよい。 The data acquisition function 51 acquires weather information from the weather information database. As the weather information database, for example, there is a publicly available database. For example, the weather information is not limited to the following, and may be living environment data such as the temperature, weather, humidity, atmospheric pressure, and tide of a pregnant woman's residence.

データ転送機能５２は、取得したデータをデータ収集装置１００に転送する。当該データは、妊娠期間中やその前後において、１日数回から１週間に数回など高頻度に、または期間中連続的に取得され、データ収集装置１００に転送されてもよい。 The data transfer function 52 transfers the acquired data to the data collection device 100. The data may be acquired at a high frequency such as several times a day to several times a week during or around the pregnancy period, or continuously during the period, and may be transferred to the data collection device 100.

次に、図５を参照して、本発明の一実施例による調査票登録装置を説明する。図５は、本発明の一実施例による調査票登録装置の機能構成を示すブロック図である。 Next, a survey form registration apparatus according to an embodiment of the present invention will be described with reference to FIG. FIG. 5 is a block diagram showing a functional configuration of the survey form registration apparatus according to an embodiment of the present invention.

図５に示されるように、調査票登録装置６０は、データ登録機能６１及びデータ転送機能６２を有する。 As shown in FIG. 5, the survey form registration device 60 has a data registration function 61 and a data transfer function 62.

データ登録機能６１は、調査票情報の入力担当者が入力するデータを登録する。例えば、妊婦に関する調査票情報は、以下に限定されることなく、妊婦やその家族の家族構成、妊婦や家族の既往歴、妊娠出産歴、食生活、睡眠状況、服薬状況、喫煙飲酒状況、運動状況、勤務状況、精神的状況などの妊婦やその家族の生活習慣に関するデータ項目であってもよい。 The data registration function 61 registers data input by the person in charge of survey form information. For example, questionnaire information on pregnant women includes, but is not limited to, the family composition of pregnant women and their families, the history of pregnant women and their families, pregnancy and childbirth history, dietary habits, sleeping status, medication status, smoking and drinking status, exercise It may be a data item regarding the lifestyle of pregnant women and their families, such as status, work status, and mental status.

データ転送機能６２は、登録したデータをデータ収集装置１００に転送する。 The data transfer function 62 transfers the registered data to the data collection device 100.

次に、図６を参照して、本発明の一実施例による診療記録登録装置を説明する。図６は、本発明の一実施例による診療記録登録装置の機能構成を示すブロック図である。 Next, a medical record registration apparatus according to an embodiment of the present invention will be described with reference to FIG. FIG. 6 is a block diagram showing a functional configuration of a medical record registration apparatus according to an embodiment of the present invention.

図６に示されるように、診療記録登録装置７０は、データ登録機能７１及びデータ転送機能７２を有する。 As shown in FIG. 6, the medical record registration device 70 has a data registration function 71 and a data transfer function 72.

データ登録機能７１は、診療記録の入力担当者が入力するデータを登録する。例えば、診療記録は、以下に限定されることなく、医師などにより記録された妊婦健診などの診療記録であってもよい。 The data registration function 71 registers data input by the person in charge of medical record input. For example, the medical record is not limited to the following, and may be a medical record such as a pregnant woman medical checkup recorded by a doctor or the like.

データ転送機能７２は、登録したデータをデータ収集装置１００に転送する。 The data transfer function 72 transfers the registered data to the data collection device 100.

次に、図７を参照して、本発明の一実施例によるデータ収集装置を説明する。図７は、本発明の一実施例によるデータ収集装置の機能構成を示すブロック図である。 Next, a data collection apparatus according to an embodiment of the present invention will be described with reference to FIG. FIG. 7 is a block diagram showing a functional configuration of the data collection device according to one embodiment of the present invention.

図７に示されるように、データ収集装置１００は、データ取得機能１１０、データ保存機能１２０、早産予測モデル生成機能１３０、早産予測機能１４０及び結果通知機能１５０を有する。 As illustrated in FIG. 7, the data collection device 100 includes a data acquisition function 110, a data storage function 120, a preterm birth prediction model generation function 130, a preterm birth prediction function 140, and a result notification function 150.

データ取得機能１１０は、複数のデータソース装置から妊婦に関するデータを取得する。また、データ取得機能１１０は、妊婦の妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間において、妊婦に関するデータを複数の時点にて取得する。すなわち、データ取得機能１１０は、上述した計測装置２０、測定値転送装置３０、体調登録装置４０、気象情報取得装置５０、調査票登録装置６０及び／又は診療記録登録装置７０などのデータソース装置から、妊婦の妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間において、妊婦に関する１つ以上のデータ項目のデータを、例えば、定期的など複数の時点にて取得する。 The data acquisition function 110 acquires data regarding pregnant women from a plurality of data source devices. In addition, the data acquisition function 110 acquires data on a pregnant woman at a plurality of points in a data acquisition period from any time before and after pregnancy of the pregnant woman to any time before and after childbirth. That is, the data acquisition function 110 is received from a data source device such as the measurement device 20, the measurement value transfer device 30, the physical condition registration device 40, the weather information acquisition device 50, the survey form registration device 60, and / or the medical record registration device 70 described above. In the data acquisition period from any time before and after pregnancy of the pregnant woman to any time before and after childbirth, data of one or more data items related to the pregnant woman are acquired at a plurality of times such as regularly.

ここで、妊婦に関するデータは、例えば、妊娠時年齢、血圧、脈拍、体重、ＢＭＩ、歩数、カロリー、体温、睡眠時間、就寝中に起きた回数、睡眠の質、便通の回数、便通の柔らかさ、子宮収縮、頭の痛み、歯の痛み、上腹部の痛み、下腹部の痛み、腰の痛み、吐き気、嘔吐、動悸、胎動間隔、食事内容、服薬内容、過去の早産歴、妊娠状況、気温、室温、湿度、気圧、天気概況、潮汐及び月齢の１つ以上のデータ項目を含むものであってもよい。 Here, data on pregnant women include, for example, pregnancy age, blood pressure, pulse rate, body weight, BMI, number of steps, calories, body temperature, sleep time, number of waking up during sleep, quality of sleep, number of bowel movements, softness of bowel movements Uterine contraction, head pain, tooth pain, upper abdominal pain, lower abdominal pain, lower back pain, nausea, vomiting, palpitations, fetal interval, meal content, medication content, past premature birth history, pregnancy status, temperature , Room temperature, humidity, barometric pressure, weather conditions, tides, and age.

データ保存機能１２０は、取得した妊婦に関するデータを保存する。 The data storage function 120 stores the acquired data regarding pregnant women.

早産予測モデル生成機能１３０は、早産予測モデルを生成及び保持する。例えば、早産予測モデル生成機能１３０は、正期産により出産した妊婦と早産により出産した妊婦とを含む複数の妊婦から取得した妊婦に関するデータに基づき、早産予測モデルを生成及び更新してもよい。具体的には、早産予測モデルは、以下で詳細に説明されるようなモデル導出方法に従って生成されてもよい。 The preterm birth prediction model generation function 130 generates and holds a preterm birth prediction model. For example, the preterm birth prediction model generation function 130 may generate and update the preterm birth prediction model based on data on pregnant women acquired from a plurality of pregnant women including a pregnant woman who delivered by term birth and a pregnant woman who delivered by premature birth. Specifically, the preterm birth prediction model may be generated according to a model derivation method as described in detail below.

早産予測機能１４０は、早産予測モデルに基づき、取得した妊婦に関するデータから妊婦の早産予測を実行する。具体的には、早産予測機能１４０は、取得した妊婦に関するデータを早産予測モデルに投入し、早産予測モデルの出力結果に基づき当該妊婦の早産発症有無予測、出産日予測又は早産発症リスク算出を実行してもよい。すなわち、早産予測において、早産予測機能１４０は、当該妊婦に早産発症の可能性があるか予測し（早産発症有無予測）、当該妊婦の出産日を予測し（出産日予測）、あるいは、当該妊婦の早産発症リスクを算出してもよい（早産発症リスク算出）。なお、早産予測機能１４０は、妊婦の週数に応じて異なるデータ項目に基づき、当該妊婦の早産発症有無予測、出産日予測又は早産発症リスク算出を実行してもよい。 The preterm birth prediction function 140 executes the preterm birth prediction of the pregnant woman from the acquired data on the pregnant woman based on the preterm birth prediction model. Specifically, the preterm birth prediction function 140 inputs the acquired data on the pregnant woman into the preterm birth prediction model, and executes the preterm birth onset prediction, the birth date prediction, or the preterm birth risk calculation based on the output result of the preterm birth prediction model. May be. That is, in preterm birth prediction, the preterm birth prediction function 140 predicts whether or not the pregnant woman has the possibility of premature birth (predicting the onset of premature birth), predicts the date of birth of the pregnant woman (prediction of birth date), or the pregnant woman The risk of preterm birth may be calculated (risk calculation of preterm birth). The preterm birth prediction function 140 may execute premature birth onset prediction, birth date prediction, or premature birth risk calculation based on different data items depending on the number of weeks of the pregnant woman.

結果通知機能１５０は、早産予測機能１４０によって実行された早産予測の結果を妊婦、医療関係者などに通知する。 The result notification function 150 notifies a pregnant woman, a medical staff, and the like of the result of the preterm birth prediction executed by the preterm birth prediction function 140.

なお、上述した装置は、それぞれが単独の装置であることもあれば、ある装置が他の装置の機能の一部又は全部を備えることもある。
（実施例１）
次に、図８〜１０を参照して、本発明の一実施例によるデータ収集装置１００におけるデータ収集処理を説明する。図８は、本発明の一実施例によるデータ収集処理を示すフローチャートである。 Note that each of the above-described devices may be a single device, or a certain device may have some or all of the functions of another device.
Example 1
Next, a data collection process in the data collection apparatus 100 according to an embodiment of the present invention will be described with reference to FIGS. FIG. 8 is a flowchart showing data collection processing according to an embodiment of the present invention.

図８に示されるように、ステップＳ１００において、データ収集装置１００は、上述した計測装置２０、測定値転送装置３０、体調登録装置４０、気象情報取得装置５０、調査票登録装置６０及び／又は診療記録登録装置７０などのデータソース装置の全て又は一部からデータを取得し、取得したデータを保存する。具体的には、データ収集装置１００は、妊婦の妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間においてデータを定期的に取得する。 As shown in FIG. 8, in step S <b> 100, the data collection device 100 performs the above-described measurement device 20, measurement value transfer device 30, physical condition registration device 40, weather information acquisition device 50, survey form registration device 60, and / or medical care. Data is acquired from all or part of the data source device such as the record registration device 70, and the acquired data is stored. Specifically, the data collection device 100 periodically acquires data in a data acquisition period from any time before and after pregnancy of the pregnant woman to any time before and after childbirth.

まず、妊婦は、妊娠前もしくは妊娠中の任意の時点から、出産もしくは産後の任意の時点までの間、計測装置２０を使用する。計測装置２０は、例えば、血圧計、体重計、活動量計、体温計などを含む。血圧計は、収縮期血圧、拡張期血圧、脈拍数、室温などを計測する。体重計は、体重、ＢＭＩ、体脂肪率、骨格筋率などを計測する。活動量計は、歩数、消費カロリーなどを計測する。体温計は、体温などを計測する。計測装置２０による測定は、血圧計については１日朝晩の２回、その他は１日朝１回を想定するが、その限りではなく、任意の間隔で計測することも可能である。測定値転送装置３０は、各計測装置２０から計測データを取得し、データ収集装置１００に転送する。 First, the pregnant woman uses the measuring device 20 from an arbitrary time point before or during pregnancy to an arbitrary time point after giving birth or after giving birth. The measurement device 20 includes, for example, a sphygmomanometer, a weight scale, an activity meter, a thermometer, and the like. The sphygmomanometer measures systolic blood pressure, diastolic blood pressure, pulse rate, room temperature, and the like. The weight scale measures body weight, BMI, body fat percentage, skeletal muscle percentage, and the like. The activity meter measures the number of steps, calorie consumption, and the like. A thermometer measures body temperature and the like. The measurement by the measuring device 20 is assumed to be twice a day for morning and evening for the sphygmomanometer and once a morning for the other, but not limited thereto, and can be measured at an arbitrary interval. The measurement value transfer device 30 acquires measurement data from each measurement device 20 and transfers the measurement data to the data collection device 100.

また、妊婦は、体調登録装置４０に自身の体調を登録する。体調登録装置４０として、スマートフォンに体調登録用アプリやＷｅｂページを用いる場合を想定するが、その限りではない。体調情報として、就寝日時、起床日時、睡眠時間、就寝中に起きた回数、睡眠の質、便通の回数、便痛の軟らかさ、吐き気を感じた総時間、嘔吐の回数、空嘔吐の回数、頭の痛み、歯の痛み、腰の痛み、上腹部の痛み、下腹部の痛み、痛みを伴わない子宮収縮回数、痛みを伴う子宮収縮回数、動悸、胎動間隔、食事内容、服薬内容の１つ以上のデータ項目が登録されることが想定されるが、これらに限定されるものでない。体調情報の登録期間は、妊娠前もしくは妊娠中の任意の時点から、出産もしくは産後の任意の時点までの間を想定する。また、体調登録間隔は、主なデータ項目は１日１回を想定するが、食事内容や服薬内容など一部のデータ項目については、１日複数回の入力を想定する。また、これらのデータ項目は毎日ではなく、任意の間隔で計測することも可能である。 In addition, the pregnant woman registers her physical condition in the physical condition registration device 40. Although it is assumed that the physical condition registration device 40 uses a physical condition registration application or a web page on a smartphone, the present invention is not limited thereto. Physical condition information includes bedtime, wakeup time, sleep time, number of times you wake up during sleep, quality of sleep, number of bowel movements, softness of fecal pain, total time of nausea, number of vomiting, number of vomiting, One of head pain, tooth pain, lower back pain, upper abdominal pain, lower abdominal pain, painless uterine contractions, painful uterine contractions, palpitation, fetal interval, meal content, medication content It is assumed that the above data items are registered, but the present invention is not limited to these. The registration period of physical condition information is assumed to be from any time point before or during pregnancy to any time point after childbirth or postpartum. As for the physical condition registration interval, main data items are assumed to be once a day, but some data items such as meal contents and medication contents are assumed to be input multiple times a day. Further, these data items can be measured at arbitrary intervals instead of every day.

また、気象情報取得装置５０は、毎日の気象情報を収集する。例えば、気象庁のＷｅｂサイトから取得可能である。取得されるデータ項目は、日付、降水量、平均気温、最高気温、最低気温、全天日射量、平均雲量、降雪の深さ、最深積雪、最大瞬間風速、潮汐及び／又は月齢を想定し、さらには１時間ごとの、気温、降水量、風向き、風速、日照時間、湿度、気圧、天気概況を想定するが、これらに限定されるものでない。また、データの単位も１日や時間に限定されず、週単位、月単位、年単位も取り得る。 Moreover, the weather information acquisition apparatus 50 collects daily weather information. For example, it can be obtained from the website of the Japan Meteorological Agency. The data items to be acquired assume date, precipitation, average temperature, maximum temperature, minimum temperature, total solar radiation, average cloud cover, snow depth, maximum snow depth, maximum instantaneous wind speed, tide and / or age, Furthermore, although hourly temperature, precipitation, wind direction, wind speed, sunshine duration, humidity, atmospheric pressure, and weather conditions are assumed, the present invention is not limited to these. Further, the data unit is not limited to one day or time, and may be a week unit, a month unit, or a year unit.

また、調査票登録装置６０は、妊婦が記入した定期的な調査票の情報を取得する。調査票の情報は、妊婦やその家族の家族構成、妊婦や家族の既往歴、妊娠出産歴、食生活、睡眠状況、服薬状況、喫煙飲酒状況、運動状況、勤務状況、精神的状況などのデータ項目を含むが、これらに限定されるものでない。取得の頻度については、調査票の頻度を想定する。 In addition, the survey form registration device 60 acquires information on a regular survey form entered by a pregnant woman. The questionnaire information includes data such as the family structure of pregnant women and their families, the history of pregnant women and their families, pregnancy and childbirth history, dietary habits, sleeping status, medication status, smoking and drinking status, exercise status, working status, and mental status. Including but not limited to items. As for the frequency of acquisition, the frequency of survey forms is assumed.

また、診療記録登録装置７０は、妊婦が妊婦健診などで受診した際の検査値や所見などの記録を取得する。取得の頻度については、受診の頻度を想定する。 In addition, the medical record registration device 70 acquires records such as test values and findings when a pregnant woman has received a medical checkup for pregnant women. As for the frequency of acquisition, the frequency of consultation is assumed.

ステップｓ１０１において、データ収集装置１００は、取得したデータに対して異常値を検出及び除去し、欠測値を補間する。例えば、日々の収縮期血圧が１２０ｍｍＨｇである妊婦が、ある１日だけ６０ｍｍＨｇであった場合、データ収集装置１００は、当該測定値を実際の値ではなく計測エラーとみなして、当該測定値を除去する。この除去基準の一例として、データ収集装置１００は、その妊婦のこれまでの収縮期血圧の平均μと標準偏差σを計算し、ある計測値が平均μよりｎσ以上解離していた場合に、当該計測値を除去する（ｎは任意の数値）、などの方法があるが、これに限定されるものでない。また、欠測値に関しては、前後の点を用いた線形補間やスプライン補間や、欠測値以外の計測点にフィッティングしたモデルを用いて補完する方法などがあるが、それらに限定されない。 In step s101, the data collection device 100 detects and removes abnormal values from the acquired data, and interpolates missing values. For example, when a pregnant woman whose daily systolic blood pressure is 120 mmHg is 60 mmHg for one day, the data collection device 100 regards the measurement value as a measurement error instead of an actual value, and removes the measurement value. To do. As an example of this removal criterion, the data collection device 100 calculates the mean μ and standard deviation σ of the systolic blood pressure of the pregnant woman so far, and when a certain measured value is dissociated more than nσ from the average μ, There is a method of removing a measured value (n is an arbitrary numerical value), but is not limited thereto. Further, regarding missing values, there are linear interpolation and spline interpolation using front and rear points, and a method of complementing using a model fitted to measurement points other than missing values, but it is not limited thereto.

ステップｓ１０２において、データ収集装置１００は、既に出産して出産状況が確定している妊婦のデータを用いて、早産群とそれ以外の群でデータ分布の違いが大きいデータ項目を特徴量として抽出する。データ分布の違いの大きさの定義には、例えば「データの平均値が同じとは言えない確率が大きい」ことや「データの分布が同じとは言えない確率が大きい」という指標が利用可能であるが、それらに限定されるものではない。これには、ステューデントのｔ検定、ウェルチのｔ検定、多変量回帰、多変量ロジスティック回帰、カイ二乗検定、フィッシャーの正確確率検定などを用いた方法や、Ｌａｓｓｏ回帰やＲｉｄｇｅ回帰やＥｌａｓｔｉｃＮｅｔなどにより有用なデータ項目を限定する方法や主成分分析の寄与率などからデータ項目を限定する方法などがあるが、それらに限定されるものでない。 In step s102, the data collection apparatus 100 extracts data items having a large difference in data distribution between the premature birth group and the other groups as feature amounts using the data of pregnant women who have already given birth and the delivery status is determined. . To define the magnitude of the difference in data distribution, for example, indicators such as “the probability that the average value of the data cannot be said to be the same” or “the probability that the data distribution cannot be said to be the same” are available. There are, but are not limited to them. This is useful by methods using Student's t-test, Welch's t-test, multivariate regression, multivariate logistic regression, chi-square test, Fisher's exact test, Lasso regression, Ridge regression, and Elastic Net. There are a method of limiting data items and a method of limiting data items based on the contribution ratio of principal component analysis, but the method is not limited to these.

ステップｓ１０３、ｓ１０５及びｓ１０７において、データ収集装置１００は、抽出した特徴量を用いて早産予測モデルを作成する。具体的には、データ収集装置１００は、ステップｓ１０３において、早産になるかどうかを予測する分類モデルとして早産予測モデルを生成する。また、データ収集装置１００は、ステップｓ１０５において、各妊婦の出産日を予測する回帰モデルとして出産日予測モデルを生成する。また、データ収集装置１００は、ｓ１０７において、早産発症のリスクを算出する早産発症リスク算出モデルを生成する。回帰モデルには、線形回帰モデル、ロジスティック回帰モデル、ＳＶＭによる回帰モデル、多層ニューラルネットワークによる回帰モデルなどがあるが、それらに限定されるものでない。また、分類モデルにおいても、線形分類、ロジスティック回帰、ベイジアンネットワーク、ＳＶＭ、ｋ近傍法、ランダムフォレスト、多層ニューラルネットワークによる分類モデルなどがあるが、それらに限定されるものでない。早産発症リスク算出モデルについては、オッズ比、寄与危険度、相対危険度などがあるが、それらに限定されるものでない。また、ステップｓ１０３、ｓ１０５、ｓ１０７は、ｓ１０２と切り離さずに一体で処理されてもよい。 In steps s103, s105, and s107, the data collection device 100 creates a preterm birth prediction model using the extracted feature amount. Specifically, in step s103, the data collection device 100 generates a preterm birth prediction model as a classification model for predicting whether or not preterm birth occurs. In step s105, the data collection device 100 generates a birth date prediction model as a regression model for predicting the birth date of each pregnant woman. In s107, the data collection device 100 generates a premature labor risk calculation model for calculating the risk of premature labor. Examples of the regression model include, but are not limited to, a linear regression model, a logistic regression model, a regression model using SVM, and a regression model using a multilayer neural network. The classification model includes linear classification, logistic regression, Bayesian network, SVM, k-nearest neighbor method, random forest, classification model by multilayer neural network, and the like, but is not limited thereto. The preterm birth risk calculation model includes, but is not limited to, odds ratio, contribution risk, relative risk. In addition, steps s103, s105, and s107 may be integrally processed without being separated from s102.

ステップｓ１０４、ｓ１０６及びｓ１０８において、データ収集装置１００は、生成された早産予測モデル、出産日予測モデル及び早産発症のリスク算出モデルに予測対象の妊婦のデータをそれぞれ投入し、早産予測を実行する。具体的には、データ収集装置１００は、ステップｓ１０４において、早産予測モデルを利用して予測対象の妊婦のデータから将来の早産の有無を予測する。また、データ収集装置１００は、ステップｓ１０６において、出産日予測モデルを利用して予測対象の妊婦のデータから出産日を予測する。また、データ収集装置１００は、ステップｓ１０８において、早産発症リスク算出モデルを利用して予測対象の妊婦のデータから早産リスクを算出する。なお、早産有無予測は、最終的な早産有無の予測、この先ｎ１週間（ｎ１は任意の数値）で早産を発症しない予測、この先ｎ２週間（ｎ２は任意の数値）で早産を発症する予測、などを含むが、それらに限定されるものでない。 In steps s104, s106, and s108, the data collection device 100 inputs the data of the pregnant woman to be predicted into the generated preterm birth prediction model, birth date prediction model, and preterm birth risk calculation model, and executes preterm birth prediction. Specifically, in step s104, the data collection device 100 uses the preterm birth prediction model to predict the presence or absence of future preterm birth from the data of the pregnant woman to be predicted. In step s106, the data collection apparatus 100 predicts the date of birth from the data of the pregnant woman to be predicted using the birth date prediction model. Further, in step s108, the data collection device 100 uses the preterm birth risk calculation model to calculate the preterm birth risk from the predicted pregnant woman data. Prediction of premature birth, prediction of final premature birth, prediction that premature birth will not occur in the next n1 weeks (n1 is an arbitrary value), prediction of premature birth in the next n2 weeks (n2 is an arbitrary value), etc. Including, but not limited to.

ステップｓ１０９において、データ収集装置１００は、早産予測結果、出産日予測結果及び／又は早産リスク計算結果を妊婦又は医療機関に通知し、当該データ収集処理を終了する。
（実施例２）
なお、上述した実施例１のうち、早産予測モデル、出産日予測モデル又は早産発症リスク算出モデルを生成及び／又は投入するのに利用するデータ項目は、妊娠時年齢、血圧、脈拍、体重、ＢＭＩ、歩数、消費カロリー、体温、睡眠時間、就寝中に起きた回数、睡眠の質、便通の回数、便通の柔らかさ、子宮収縮、頭の痛み、歯の痛み、上腹部の痛み、下腹部の痛み、腰の痛み、吐き気、嘔吐、動悸、胎動間隔、食事内容、服薬内容、過去の早産歴、妊娠状況（多胎など）、気温、室温、湿度、気圧、天気概況、潮汐及び月齢の１つ以上に限定されてもよい。これにより、より少ないデータ数で効率的にモデルの作成及び利用が可能となる。
（実施例３）
図９は、本発明の一実施例による早産予測モデル生成処理を示すフローチャートである。上述した実施例２に関して、後述される早産予測モデル生成処理では、早産と正期産のデータ分布の違いの大きさを利用し、更に少ないデータ項目数によって、妊娠週数に応じて早産予測モデルに必要なデータ項目を変更して早産有無予測モデルが生成される。当該早産予測モデル生成処理は、上述された図８のステップｓ１０２，ｓ１０３において実行可能であり、後述されるステップｓ１０２−１〜６がステップｓ１０２の特徴量の抽出処理に対応し、ステップｓ１０３−１〜１１がステップｓ１０３の早産有無予測モデルの生成処理に対応する。実施例３では、早産予測モデル生成機能１３０は、取得したデータを早産群と非早産群に分け、各データ項目のうち二群でデータ分布の違いが大きい（すなわち、区別する、判別する、分離するなど）データ項目を抽出し、さらに各データ項目において相関の高いデータ項目同士のうち一つを選択することでデータ項目数を削減することによって、早産予測モデルの生成に利用する特徴量を抽出する。また、早産予測モデル生成機能１３０は、早産群と非早産群でデータ分布の違いが大きい特徴量から順次特徴量数を増やすことによって、予測精度の高い早産予測モデルを選定する。さらに、早産予測モデル生成機能１３０は、ある期間のデータから抽出した特徴量を用いて作成した早産予測モデルと、別の期間のデータから抽出した特徴量を用いて作成した早産予測モデル群から予測精度が高い早産予測モデルを選定する。なお、上記は早産予測モデルに利用する特徴量の抽出方法や早産予測モデルの選定方法の一例であって、早産予測生成モデル１３０はこれに制限されない。例えば、多層ニューラルネットワークまたはＬａｓｓｏ回帰などによって特徴量の抽出を抽出し、早産予測モデルを作成する方法もある。 In step s109, the data collection device 100 notifies the pregnant woman or the medical institution of the premature birth prediction result, the birth date prediction result, and / or the premature birth risk calculation result, and ends the data collection process.
(Example 2)
In addition, in Example 1 mentioned above, the data items used to generate and / or input the preterm birth prediction model, the birth date prediction model or the preterm birth risk calculation model are the age at pregnancy, blood pressure, pulse, body weight, BMI. , Steps, calories burned, body temperature, sleeping time, number of times you wake up during sleep, quality of sleep, number of bowel movements, softness of bowel movement, uterine contraction, head pain, tooth pain, upper abdominal pain, lower abdominal pain Pain, low back pain, nausea, vomiting, palpitation, fetal interval, meal content, medication content, past premature birth history, pregnancy status (multiple births, etc.), temperature, room temperature, humidity, barometric pressure, weather conditions, tide and age It may be limited to the above. This makes it possible to efficiently create and use a model with a smaller number of data.
(Example 3)
FIG. 9 is a flowchart showing premature birth prediction model generation processing according to an embodiment of the present invention. With regard to the above-described second embodiment, the preterm birth prediction model generation process described later uses the magnitude of the difference in data distribution between preterm birth and full term birth, and further reduces the number of data items, depending on the number of gestation weeks. The data items necessary for this are changed and a preterm birth prediction model is generated. The premature birth prediction model generation process can be executed in steps s102 and s103 of FIG. 8 described above. Steps s102-1 to s6 to be described later correspond to the feature amount extraction process of step s102, and step s103-1 ˜11 corresponds to the premature birth prediction model generation process in step s103. In the third embodiment, the preterm birth prediction model generation function 130 divides the acquired data into the preterm birth group and the non-preterm birth group, and there is a large difference in data distribution between the two groups of each data item (that is, distinguish, discriminate, separate) Extract data items, and then select one of the highly correlated data items in each data item to reduce the number of data items, thereby extracting features used to generate the preterm birth prediction model To do. Further, the preterm birth prediction model generation function 130 selects a preterm birth prediction model with high prediction accuracy by sequentially increasing the number of feature amounts from the feature amount having a large difference in data distribution between the preterm birth group and the non-preterm birth group. Furthermore, the preterm birth prediction model generation function 130 predicts from the preterm birth prediction model created using the feature amount extracted from the data of a certain period and the preterm birth prediction model group created using the feature amount extracted from the data of another period. Select a model for predicting preterm birth with high accuracy. Note that the above is an example of a feature quantity extraction method and a preterm birth prediction model selection method used for the preterm birth prediction model, and the preterm birth prediction generation model 130 is not limited thereto. For example, there is a method in which extraction of feature amounts is extracted by a multilayer neural network or Lasso regression to create a preterm birth prediction model.

図９に示されるように、ステップｓ１０２−１において、データ収集装置１００は、取得したデータ項目の一部のデータ項目を加工する。具体的には、データ収集装置１００は、一部のデータ項目について、以下に限定することなく、数日前のデータとの差、数日間の平均及び分散、計測値の近似モデルを作成したその微分、二階微分などを導出する。 As illustrated in FIG. 9, in step s102-1, the data collection device 100 processes some data items of the acquired data items. Specifically, the data collection device 100 is not limited to the following, for some data items, the difference from the data of several days ago, the average and variance of several days, and the derivative of the approximation model of the measured value Deriving second derivative etc.

まず、特徴量の抽出処理について説明する。 First, feature amount extraction processing will be described.

ステップｓ１０２−２において、データ収集装置１００は、元のデータとｓ１０２−１で作成したデータとを結合し、モデル作成のために、既に出産済みの妊婦のデータを抽出する。 In step s102-2, the data collection device 100 combines the original data and the data created in s102-1, and extracts the data of pregnant women who have already given birth for model creation.

ステップｓ１０２−３において、データ収集装置１００は、早産予測に有用なデータ項目が妊娠週数ごとに変化することを前提とし、各週数において有用なデータ項目（特徴量）を抽出する処理のループを開始する。例えば、１６週から３２週まで各週で特徴量を抽出する場合、妊娠日数を示すパラメータｔ１をｔ１＝１６×７（日）でループ内処理を実行し、次にｔ１＝１７×７として実行し、最終的にｔ１＝３２×７として実行する。 In step s102-3, the data collection device 100 assumes that the data items useful for preterm birth change for each pregnancy week, and performs a processing loop for extracting data items (features) useful for each week number. Start. For example, when extracting feature quantities in each week from week 16 to week 32, the in-loop processing is executed with t1 = 16 × 7 (days) as the parameter t1 indicating the number of days of pregnancy, and then t1 = 17 × 7. Finally, t1 = 32 × 7 is executed.

ステップｓ１０２−４において、データ収集装置１００は、ループ内処理の最初に、ｓ１０２−２で抽出したデータから、期間が（ｔ１−ａ）日から（ｔ１＋ｂ）日となるデータを選択し、それらのデータに紐付く妊婦が早産になった群と、そうでなかった群にデータを分ける。ａやｂは任意の数値で、例えば１週間単位でモデルを作成する場合は、ａ＝０、ｂ＝６である。また、ａ＝２８など大きくすることで、ｔ１より２８日前のデータからｂ日後の長期のデータを利用してその後のデータ項目の抽出が可能である。その後、データ収集装置１００は、ｓ１０２で説明するような方法を用いて、二群でデータ分布の違いが大きいデータ項目を抽出する。ここでは、例えばウェルチのｔ検定にてｐ値がｃ未満となるデータ項目を抽出する。ここでｃは任意の数値で、例えばｃ＝０．０５である。しかしながら、本発明は、これに限定されず、早産群と正産群とを分離可能な何れか適切な判別モデル（例えば、統計モデル、学習モデルなど）が適用されてもよい。 In step s102-4, the data collection device 100 selects data whose period is from (t1-a) day to (t1 + b) day from the data extracted in s102-2 at the beginning of the in-loop processing, The data is divided into a group in which the pregnant women associated with the data had a premature birth and a group that did not. a and b are arbitrary numerical values. For example, when a model is created in units of one week, a = 0 and b = 6. Further, by increasing a, such as a = 28, it is possible to extract data items after that by using long-term data after b days from data 28 days before t1. Thereafter, the data collection apparatus 100 extracts data items having a large difference in data distribution between the two groups, using the method described in s102. Here, for example, data items whose p value is less than c are extracted by Welch's t-test. Here, c is an arbitrary numerical value, for example, c = 0.05. However, the present invention is not limited to this, and any appropriate discrimination model (for example, a statistical model, a learning model, or the like) that can separate the preterm birth group and the normal birth group may be applied.

ステップｓ１０２−５において、データ収集装置１００は、抽出されたデータ項目の中で、相関が高いものをまとめ込むことでデータ項目数を絞る。ここでは、各データ項目の相関係数を算出し、それがｄ以上のものを同じクラスタとして分類し、各クラスタで最もｐ値が低い項目を残してそれ以外を除外することで、データ項目数をクラスタ数に削減する。ここで、ｄは任意の数値であり、例えば、ｄ＝０．７５である。クラスタの構成方法としては、同一クラスタ内の相互のデータ項目が全部ｄ以上であるとする場合を想定するが、それに限定されるものでない。なお、ｓ１０２−５を実行してからｓ１０２−４を実行する場合もあれば、ｓ１０２−４とｓ１０２−５とを繰り返して実行する場合もある。 In step s102-5, the data collection device 100 narrows down the number of data items by collecting the extracted data items having high correlation. Here, the correlation coefficient of each data item is calculated, and those having d or more are classified as the same cluster, and the items having the lowest p-value are left in each cluster and the others are excluded. Is reduced to the number of clusters. Here, d is an arbitrary numerical value, for example, d = 0.75. As a cluster configuration method, it is assumed that the mutual data items in the same cluster are all d or more, but is not limited thereto. Note that s102-4 may be executed after executing s102-5, or s102-4 and s102-5 may be executed repeatedly.

ステップｓ１０２−６において、データ収集装置１００は、上記で残ったデータ項目をｐ値が低い順に並べ替え、これらを時点ｔ１のデータから作成した特徴量とする。 In step s102-6, the data collection device 100 rearranges the remaining data items in ascending order of the p value, and sets these as feature amounts created from the data at the time point t1.

次に、早産予測モデルの作成方法について説明する。上記で作成した特徴量の全てを使ったモデルが最適とは限らないため、特徴量の数を徐々に増やしてモデルを作成し、予測精度が高いモデルを選定することを考える。 Next, a method for creating a preterm birth prediction model will be described. Since the model using all of the feature quantities created above is not necessarily optimal, consider creating a model by gradually increasing the number of feature quantities and selecting a model with high prediction accuracy.

ステップｓ１０３−１において、データ収集装置１００は、モデル作成の特徴量数を１から変化させてパラメータｆに特徴量名のリストを格納し、後段のループ内処理を実行する。 In step s103-1, the data collection device 100 changes the number of feature values for model creation from 1, stores a list of feature value names in the parameter f, and executes the subsequent in-loop processing.

ステップｓ１０３−２において、データ収集装置１００は、時点ｔ１で算出した特徴量を用いて、時点（ｔ１−ｇ×７）から（ｔ１＋ｈ×７）の時点のデータを使ってモデルを生成する。ここで、ｇとｈは任意の数値であり、例えば、ｇ＝４、ｈ＝４であってもよい。これは、例えば、２５週のデータを使って予測をする際に、２４週で選定した特徴量（例：「体重、夜血圧」）を使って作成したモデルの方が、２５週で選定した特徴量（例：「朝血圧、睡眠時間」）を使って作成したモデルよりも予測精度が高いという場合もあるためである。なお、ｓ１０３−２では、時点ｔ２をパラメータとし、（ｔ１−ｇ×７）から（ｔ１＋ｈ×７）まで７日ずつずらすことで週ごとにループ内処理を実行する。 In step s103-2, the data collection device 100 generates a model using data from the time point (t1−g × 7) to (t1 + h × 7) using the feature amount calculated at the time point t1. Here, g and h are arbitrary numerical values, and may be g = 4 and h = 4, for example. This is because, for example, when making predictions using 25-week data, the model created using the feature values selected in the 24th week (eg “weight, nocturnal blood pressure”) was selected in the 25th week. This is because the prediction accuracy may be higher than a model created using feature quantities (eg, “morning blood pressure, sleep time”). In s103-2, the in-loop processing is executed every week by shifting the time t2 as a parameter and shifting from (t1−g × 7) to (t1 + h × 7) by 7 days.

ステップｓ１０３−３において、データ収集装置１００は、モデル作成の準備として、学習用データと評価用データを作成する。具体的には、データ収集装置１００は、期間（ｔ２−ａ）から（ｔ２＋ｂ）のデータを抽出し、まず早産群Ｄ１とそうでなかった群Ｄ２とに分ける。また、学習用データと評価用データに分ける比率をｅとして、データ収集装置１００は更に、Ｄ１をＤ１ａとＤ１ｂとに分け、Ｄ２をＤ２ａとＤ２ｂとに分け、学習用データをＤ１ａ＋Ｄ２ａとし、評価用データをＤ１ｂ＋Ｄ２ｂとする。ここで、ｅは任意の数値で、例えば、ｅ＝０．５であってもよい。また、Ｄ２として使用する妊婦の数を調節することが可能であり、例えばＤ２の妊婦の数をＤ１の妊婦の数の２倍まで、とすることも可能である。 In step s103-3, the data collection device 100 creates learning data and evaluation data as preparation for model creation. Specifically, the data collection device 100 extracts data from the period (t2-a) to (t2 + b), and first divides the data into the preterm birth group D1 and the group D2 that did not. The data collection apparatus 100 further divides D1 into D1a and D1b, D2 into D2a and D2b, and the learning data as D1a + D2a, where e is the ratio of dividing the learning data and the evaluation data into evaluative data. The data is D1b + D2b. Here, e is an arbitrary numerical value, and may be e = 0.5, for example. Further, the number of pregnant women used as D2 can be adjusted. For example, the number of pregnant women of D2 can be up to twice the number of pregnant women of D1.

ステップｓ１０３−４において、データ収集装置１００は、特徴量を特徴量名リストｆに限定してモデルを学習する。例えば、この時点の予測モデルとしてサポートベクターマシン（ＳＶＭ）を使用し、学習用データを用いて学習させる。予測モデルは最終的に多数できるため、区別のためにこの予測モデルをＭ（ｔ１，ｆ，ｔ２）とする。 In step s103-4, the data collection device 100 learns the model by limiting the feature quantity to the feature quantity name list f. For example, a support vector machine (SVM) is used as a prediction model at this time, and learning is performed using learning data. Since a large number of prediction models can finally be formed, this prediction model is set to M (t1, f, t2) for distinction.

ステップｓ１０３−５において、データ収集装置１００は、評価用データを用いて、モデルが早産を予測できる精度を評価する。評価方法には、例えば、横軸に擬陽性率をとり、縦軸に感度をとるＲＯＣ（ＲｅｃｅｉｖｅｒＯｐｅｒａｔｉｎｇＣｈａｒａｃｔｅｒｉｓｔｉｃ）曲線を作成し、そのカーブ下の面積であるＡＵＣ（ＡｒｅａＵｎｄｅｒｔｈｅＣｕｒｖｅ）を計算する。ＡＵＣが高いほど予測精度が良いとされる。データ収集装置１００は、この評価結果ＡＵＣを予測モデルＭ（ｔ１，ｆ，ｔ２）とともに記録する。なお、評価方法はこれに限定されるものでなく、図８のｓ１０７にてｓ１０２−１〜ｓ１０３−１１と同様の方法で算出されるような早産リスクの値を組み合わせることも可能である。また、上述の説明ではｆを１つの特徴量名リストとしているが、複数の特徴量名リストの組合せで予測モデルを作成することも可能である。例えば、ｔ１が２４週（ｔ１＝２４×７）のときに、２３週のデータから抽出した特徴量名リストｆ１と２４週のデータから抽出した特徴量名リストｆ２とを用い、それぞれ違う特徴量としてｆ１にある特徴量に２３週のデータを学習させ、ｆ２にある特徴量に２４週のデータを学習させるような予測モデルＭ（ｔ１，ｆ１＋ｆ２，ｔ２）を作成し評価することもできる。 In step s103-5, the data collection device 100 evaluates the accuracy with which the model can predict premature birth using the evaluation data. In the evaluation method, for example, an ROC (Receiver Operating Characteristic) curve having a false positive rate on the horizontal axis and sensitivity on the vertical axis is created, and AUC (Area Under the Curve) which is an area under the curve is calculated. The higher the AUC, the better the prediction accuracy. The data collection device 100 records this evaluation result AUC together with the prediction model M (t1, f, t2). Note that the evaluation method is not limited to this, and it is possible to combine premature birth risk values calculated in the same manner as s102-1 to s103-11 in s107 of FIG. In the above description, f is one feature quantity name list, but a prediction model can be created by combining a plurality of feature quantity name lists. For example, when t1 is 24 weeks (t1 = 24 × 7), the feature quantity name list f1 extracted from the 23-week data and the feature quantity name list f2 extracted from the 24-week data are used, and different feature quantities are used. It is also possible to create and evaluate a prediction model M (t1, f1 + f2, t2) in which the 23-week data is learned from the feature quantity in f1, and the 24-week data is learned from the feature quantity in f2.

ステップｓ１０３−６、ｓ１０３−７及びｓ１０３−８において、データ収集装置１００は、ｔ２、ｆ、ｔ１の分だけモデルを作成し、各ループを終了する。 In steps s103-6, s103-7, and s103-8, the data collection device 100 creates models for t2, f, and t1, and ends each loop.

ステップｓ１０３−９において、データ収集装置１００は、作成した多数のモデルの中から、各週数で最良のモデルを選択する。このため、データ収集装置１００は、まずｔ２をパラメータとし、１６×７（日），１７×７などの値を取って、各週数でのループ内処理を実行する。 In step s103-9, the data collection device 100 selects the best model for each number of weeks from among the created many models. For this reason, the data collection apparatus 100 first performs processing in the loop for each number of weeks using t2 as a parameter, taking values such as 16 × 7 (days), 17 × 7, and the like.

ステップｓ１０３−１０において、データ収集装置１００は、時点ｔ２における最適なモデルを選定するために、全ｔ１及び全ｆにおいて作成済みのモデルＭ（ｔ１，ｆ，ｔ２）を選択し、その中で最もＡＵＣが高いモデルを、時点ｔ２における最適なモデルとして選定する。なお、この選定方法はこれに限定されるものでなく、図８のｓ１０７にてｓ１０２−１〜ｓ１０３−１１と同様の方法で算出されるような早産リスクの値をＡＵＣと組み合わせることも可能である。 In step s103-10, the data collection device 100 selects models M (t1, f, t2) that have been created at all t1 and all f in order to select the optimum model at time t2, and most of them. A model having a high AUC is selected as an optimum model at time t2. Note that this selection method is not limited to this, and it is also possible to combine the value of the risk of premature birth as calculated by the same method as s102-1 to s103-11 in s107 of FIG. 8 with AUC. is there.

ステップｓ１０３−１１において、データ収集装置１００は、これをｔ２について全時点の処理を実行し、ループを終了する。 In step s103-11, the data collection device 100 executes processing at all points in time for t2, and ends the loop.

以上により、各週数での最適な早産有無予測モデルを作成できる。例えば、３２週のデータに対する早産予測モデルにおいては、特徴量として消費カロリー、室温、頭の痛み、朝晩の室温の差、便通の回数、朝の体温が利用される。２８週の早産予測モデルにおいては、特徴量として消費カロリー、室温、便通の柔らかさ、便通の回数、頭の痛み、痛みを伴わない収縮回数、朝晩の室温差、就寝中に起きた回数、朝晩の収縮期血圧の差が利用される。２４週の早産予測モデルにおいては、特徴量として嘔吐の回数、消費カロリー、体温、腰の痛みが利用される。２０週の早産予測モデルでは、脈拍、消費カロリー、気温が利用される。１６週の早産予測モデルでは、気温、分娩時年齢、拡張期血圧が利用される。 As described above, it is possible to create an optimal preterm birth prediction model for each number of weeks. For example, in the preterm birth prediction model for data for 32 weeks, calorie consumption, room temperature, head pain, room temperature difference between morning and evening, number of bowel movements, body temperature in the morning are used as characteristic quantities. In the 28-week premature birth prediction model, calorie consumption, room temperature, softness of bowel movement, number of bowel movements, head pain, number of contractions without pain, room temperature difference between morning and evening, number of times that occurred during bedtime, morning and evening The difference in systolic blood pressure is used. In the 24-week preterm birth prediction model, the number of vomiting, calorie consumption, body temperature, and lower back pain are used as characteristic quantities. In the 20-week preterm birth prediction model, pulse, calorie consumption, and temperature are used. In the 16-week preterm birth prediction model, temperature, age at delivery, and diastolic blood pressure are used.

なお、上述した実施例では、各週数に応じて異なるデータ項目に基づき早産有無予測モデルを生成する処理を説明したが、同様の処理によって出産日予測モデル及び早産発症リスク算出モデルが生成可能であることは明らかであろう。 In the above-described embodiment, the process of generating the preterm birth prediction model based on the data items that differ according to the number of weeks has been described. However, the birth date prediction model and the premature birth risk calculation model can be generated by the same process. It will be clear.

また、上述した実施例から理解されるように、早産予測モデル生成機能１３０は、特徴量からモデルを作成するにあたり、早産群と非早産群でデータ分布の違いが大きい特徴量から順次特徴量数を増やすことで予測精度が高いモデルを選定してもよい。また、早産予測モデル生成機能１３０は、ある期間におけるデータから早産を予測するモデルを作成するにあたり、当該期間のデータから抽出した特徴量を用いて作成した早産予測モデルと、別の期間のデータから抽出した特徴量を用いて作成した早産予測モデル群から予測精度が高いモデルを選定してもよい。これらにおいてモデルで使用した特徴量数を横軸にして、ＡＵＣの値を確認すると、図１０〜１２に示されるような実施結果が得られる。２８週の結果を見ると、特徴量数が増加することでＡＵＣが高値になり早産の予測精度が高まっていることがわかる。これにより、複数のデータを組み合わせることによって予測精度が向上することが確認できる。また、３２週の結果を見ると、ステップｓ１０２−６で６個の特徴量が抽出されたにも関わらず、最も予測精度が高かったものはそのうち上位５個を使った場合であり、ステップｓ１０３−１におけるパラメータｆのループの効果があることが理解できる。同様に２４週の結果でも上位４個の特徴量を使った場合が最も予測精度が高い。また、３２週の予測モデルは、３２週のデータから抽出された特徴量群を用いてモデルを作成するより、２８週のデータから抽出された特徴量群を用いてモデルを作成する方がより良い予測精度を実現できることがわかり、ステップｓ１０３−２におけるパラメータｔ２のループの効果があることがわかる。結果として、各週のデータに対する予測モデルが示すＡＵＣは、１６週のデータにおいて０．８２、２０週で０．７７、２４週で０．８３、２８週で０．８１、３２週で０．８３であり、概ね０．８の予測精度であった。また、各週のデータに対する予測モデルが示す早産に対するオッズ比は、１６週のデータからの予測で３．１、２０週で８．６、２４週で２６．６、２８週で７．３、３２週で８．７であり、先行技術文献１の示すオッズ比２．９１および、先行技術文献２の示すオッズ比１．２５や１．１８を大きく上回っており、本発明のデータ収集システムを用いることによって、早産発症の予測が改善していることがわかる。
（実施例４）
本発明によるデータ収集装置１０の早産予測モデル生成機能１３０及び早産予測機能１４０は、対象疾患を早産以外にも設定可能である。例えば、妊娠高血圧症候群、妊娠糖尿病、SGA(Small-for-gestational age)、FGR(Fetal growth restriction）、死産、切迫流産、切迫早産、前期破水、pPROM(preterm premature rapture of the membranes)、羊水過多、羊水過少、子癇、急性妊娠脂肪肝、HELLP症候群、児のアトピー、児の精神関連疾患、児の心循環系疾患、児の代謝異常、その他妊婦や胎児に関する疾患のうち一つまたは複数を設定可能であり、これらの発症有無予測、発症日予測又は発症リスク算出を行うことができる。また、正期産となる予測にも可能である。 Further, as can be understood from the above-described embodiment, the premature birth prediction model generation function 130, in creating a model from the feature amount, sequentially determines the number of feature amounts from the feature amount having a large difference in data distribution between the preterm birth group and the non-preterm birth group. A model with high prediction accuracy may be selected by increasing. Further, when creating a model for predicting premature birth from data in a certain period, the premature birth prediction model generation function 130 uses a premature birth prediction model created using the feature amount extracted from the data in the period and data from another period. A model with high prediction accuracy may be selected from the premature birth prediction model group created using the extracted feature quantity. In these, when the number of feature quantities used in the model is set on the horizontal axis and the value of AUC is confirmed, the results shown in FIGS. 10 to 12 are obtained. Looking at the results for the 28th week, it can be seen that the number of features increases, the AUC becomes higher, and the accuracy of predicting preterm birth is increasing. Thereby, it can confirm that prediction accuracy improves by combining a plurality of data. Looking at the results for 32 weeks, even though six feature values were extracted in step s102-6, the one with the highest prediction accuracy was when the top five were used, and step s103. It can be understood that there is a loop effect of the parameter f at -1. Similarly, the prediction accuracy is highest when the top four feature values are used in the results for 24 weeks. In addition, for the 32 week prediction model, it is better to create a model using a feature quantity group extracted from 28 week data than to create a model using a feature quantity group extracted from 32 week data. It can be seen that good prediction accuracy can be realized, and it can be seen that there is an effect of the loop of the parameter t2 in step s103-2. As a result, the AUC indicated by the prediction model for each week's data is 0.82 in the 16-week data, 0.77 in the 20th week, 0.83 in the 24th week, 0.81 in the 28th week, 0.83 in the 32nd week. The prediction accuracy was approximately 0.8. The odds ratio for preterm birth shown by the prediction model for each week's data is 3.1 for the prediction from the 16-week data, 8.6 for the 20th week, 26.6 for the 24th week, 7.3 for the 28th week, 32 It is 8.7 per week, which greatly exceeds the odds ratio of 2.91 shown in the prior art document 1 and the odds ratios of 1.25 and 1.18 shown in the prior art document 2, and uses the data collection system of the present invention. This shows that the prediction of the onset of preterm birth has improved.
(Example 4)
The preterm birth prediction model generation function 130 and the preterm birth prediction function 140 of the data collection device 10 according to the present invention can set the target disease other than premature birth. For example, gestational hypertension syndrome, gestational diabetes, SGA (Small-for-gestational age), FGR (Fetal growth restriction), stillbirth, imminent miscarriage, imminent premature birth, early rupture, pPROM (preterm premature rapture of the membranes), excessive amniotic fluid, One or more of amniotic fluid deficiency, eclampsia, acute fatty liver of pregnancy, HELLP syndrome, infant atopy, infant mental related disorders, infant cardiovascular disease, infant metabolic disorders, and other maternal and fetal disorders It is possible to predict the presence / absence of onset, the onset date, or the risk of onset. It is also possible to predict full-term production.

このように、本発明によれば、従来の調査票や診療記録に加え、妊娠期間中やその前後にわたり非侵襲的に日々のライフログの収集が可能であり、早産有無予測において、生活環境のデータ項目の多様化、データ取得期間と取得時点数の増加、データの客観性や想起バイアスの最小化が可能である。さらに、これら得られた各種データを組み合わせて解析することにより早産の発症予測が可能となり、事前の予防や迅速な対応につながる。また、日々のライフログからの予測や異常検出が可能となり、数週間に一度の間隔の妊婦健診を待つことなく必要に応じた受診を促すことで、症状の早期発見や重篤化の防止、迅速な対処につながる。
（実施例５）
図１３は、本発明の一実施例による特徴量抽出・早産発症有無予測モデル生成処理を示すフローチャートである。上述した実施例３と比べ、後述される処理では、複数の週数のデータを用いることで、早産と正期産のデータ分布の違いの大きさだけでなく、データ分布の時系列的変化の違いの大きさを利用し、より性能の高い予測モデルを生成可能である。データ分布の時系列的変化の違いの大きさの定義には、例えば、早産と正期産においてそれぞれのあるデータ項目のデータ分布が妊娠週数に従って変化するなかで、その「変化のしかたが同じとは言えない確率が大きい」という指標が利用可能であるが、それに限定されるものではない。例えば正期産ではあるデータ項目が妊娠週数に従って低下するのに対し、早産では低下が見られない、などの場合である。これには、ステューデントのｔ検定、ウェルチのｔ検定、多変量回帰、多変量ロジスティック回帰、カイ二乗検定、フィッシャーの正確確率検定などを用いた方法や、Ｌａｓｓｏ回帰やＲｉｄｇｅ回帰やＥｌａｓｔｉｃＮｅｔなどにより有用なデータ項目を限定する方法や主成分分析の寄与率などからデータ項目を限定する方法などがあるが、それらに限定されるものでない。 As described above, according to the present invention, in addition to the conventional survey form and medical records, daily life logs can be collected non-invasively during and before the pregnancy period. It is possible to diversify data items, increase the data acquisition period and the number of acquisition points, minimize data objectivity and recall bias. Furthermore, by combining and analyzing these various data obtained, it is possible to predict the onset of premature birth, leading to advance prevention and quick response. In addition, prediction from daily life logs and detection of abnormalities are possible, and early detection of symptoms and prevention of seriousness are promoted by prompting consultations as needed without waiting for a pregnant woman medical checkup once every several weeks. , Leading to quick action.
(Example 5)
FIG. 13 is a flowchart showing the feature amount extraction / premature birth occurrence prediction model generation process according to an embodiment of the present invention. Compared to Example 3 described above, in the processing described later, by using data of a plurality of weeks, not only the magnitude of the difference in data distribution between premature delivery and full-term delivery but also the time-series change of the data distribution. A prediction model with higher performance can be generated by using the magnitude of the difference. The definition of the magnitude of the difference in time-series changes in the data distribution includes, for example, the fact that the data distribution of a certain data item in preterm birth and full-term birth changes according to the number of gestation weeks. However, this is not a limitation. For example, a certain data item for full-term birth declines according to the number of gestational weeks, whereas no decline is seen for preterm birth. This is useful by methods using Student's t-test, Welch's t-test, multivariate regression, multivariate logistic regression, chi-square test, Fisher's exact test, Lasso regression, Ridge regression, and Elastic Net. There are a method of limiting data items and a method of limiting data items based on the contribution ratio of principal component analysis, but the method is not limited to these.

当該特徴量抽出・早産発症有無予測モデル作成処理は、上述された図８のステップｓ１０２，ｓ１０３において実行可能であり、後述されるステップｓ２０２−１〜７がステップｓ１０２の特徴量の抽出処理に対応し、ステップｓ２０３−１〜６がステップｓ１０３の早産発症有無予測モデルの生成処理に対応する。 The feature quantity extraction / premature birth occurrence prediction model creation process can be executed in steps s102 and s103 in FIG. 8 described above, and steps s202-1 to s7-7 described later correspond to the feature quantity extraction process in step s102. Steps s203-1 to s <b> 203-6 correspond to the generation process of the preterm birth onset prediction model in step s <b> 103.

実施例５では、早産予測モデル生成機能１３０は、取得したデータを早産群と正期産群に分け、各データ項目のうち二群でデータ分布が異なる（すなわち、区別する、判別する、分離するなど）データ項目を抽出するが、複数の週別データや静的データ（年齢、非妊時BMI、身長、既往歴、分娩歴など）を含めて交絡因子を抽出し、その影響を調整のうえ、抽出する。さらに各データ項目において相関の高いデータ項目同士のうち一つを選択することでデータ項目数を削減することによって、早産予測モデルの生成に利用する特徴量を抽出する。また、早産予測モデル生成機能１３０は、早産群と正期産群でデータ分布が異なる特徴量のうち上位の所定数の特徴量を抽出し、抽出された特徴量を用いて早産予測モデルを作成し、最も性能の良い特徴量による早産予測モデルを特定する。次に、早産予測モデル生成機能１３０は、当該特徴量を除いた残りの特徴量と特定された特徴量とを併せた２変数の予測モデルを作成し、これらのモデルを含めて最適なモデルを選択する。早産予測モデル生成機能１３０は、上述した一連の処理を再帰的に実行し、最終的に早産予測モデルを特定する。なお、上記は早産予測モデルに利用する特徴量の抽出方法や早産予測モデルの選定方法の一例であって、早産予測モデル生成機能１３０はこれに制限されない。例えば、多層ニューラルネットワークまたはＬａｓｓｏ回帰などによって特徴量の抽出を抽出し、早産予測モデルを作成する方法もある。 In the fifth embodiment, the preterm birth prediction model generation function 130 divides the acquired data into the preterm birth group and the full term birth group, and the data distribution is different (ie, distinguishes, discriminates, separates) in two groups among the respective data items. Extract data items, but extract confounding factors including multiple weekly data and static data (age, non-pregnant BMI, height, medical history, delivery history, etc.) and adjust the effects ,Extract. Furthermore, the feature-value utilized for the production | generation of a premature delivery prediction model is extracted by reducing the number of data items by selecting one among data items with high correlation in each data item. In addition, the preterm birth prediction model generation function 130 extracts a predetermined number of high-order feature quantities from among the feature quantities having different data distributions in the preterm birth group and the regular birth group, and creates a preterm birth prediction model using the extracted feature quantities. Then, the premature birth prediction model with the best performance feature quantity is specified. Next, the premature birth prediction model generation function 130 creates a two-variable prediction model that combines the remaining feature quantity excluding the feature quantity and the identified feature quantity, and selects an optimal model including these models. select. The preterm birth prediction model generation function 130 recursively executes the series of processes described above, and finally identifies the preterm birth prediction model. The above is an example of a feature quantity extraction method and a premature birth prediction model selection method used for the premature birth prediction model, and the premature birth prediction model generation function 130 is not limited thereto. For example, there is a method in which extraction of feature amounts is extracted by a multilayer neural network or Lasso regression to create a preterm birth prediction model.

図１３に示されるように、ステップｓ２０２−１において、データ収集装置１００は、取得したデータ項目の一部のデータ項目を加工する。具体的には、データ収集装置１００は、一部のデータ項目について、以下に限定することなく、数日前のデータとの差、数日間の平均及び分散、計測値の近似モデルを作成したその微分、二階微分などを導出する。 As illustrated in FIG. 13, in step s202-1, the data collection device 100 processes some data items of the acquired data items. Specifically, the data collection device 100 is not limited to the following, for some data items, the difference from the data of several days ago, the average and variance of several days, and the derivative of the approximation model of the measured value Deriving second derivative etc.

ステップｓ２０２−２において、データ収集装置１００は、元のデータとｓ２０２−１で作成したデータとを結合し、モデル作成のために、既に出産済みの妊婦の妊娠時データと経時データを抽出する。妊娠時データとは、妊娠週数に依らない静的なデータ項目のデータであり、妊娠時年齢、非妊時BMI、身長、体重、経産歴、既往症、家族歴などを含む。経時データとは、妊娠期間中の特定の期間の計測値を集計したデータの羅列であり、例えば特定の期間として一週間を指定可能であり、ユーザID、16週の血圧、16週の脈拍、16週の体温、・・・、17週の血圧、17週の脈拍、17週の体温、・・・、などのように時間方向に各妊婦の集計値を記述したものである。なお、特定の期間には任意の期間を指定可能である。また、特定の期間での計測値の集計方法としては、平均値、中央値など任意の方法を指定可能である。 In step s202-2, the data collection device 100 combines the original data and the data created in s202-1 and extracts pregnancy data and time-lapse data of pregnant women who have already given birth for model creation. Pregnancy data is data of static data items that do not depend on the number of weeks of pregnancy, and includes gestational age, non-pregnant BMI, height, weight, birth history, past medical history, family history, and the like. The time-lapse data is an enumeration of data obtained by aggregating the measurement values of a specific period during pregnancy, for example, one week can be specified as a specific period, user ID, blood pressure of 16 weeks, pulse of 16 weeks, The total values of each pregnant woman are described in the time direction such as body temperature at 16 weeks,..., Blood pressure at 17 weeks, pulse at 17 weeks, body temperature at 17 weeks,. An arbitrary period can be specified as the specific period. Moreover, as a totaling method of the measured values in a specific period, an arbitrary method such as an average value or a median value can be designated.

ステップｓ２０２−３において、上記妊娠時データに対して、それらのデータに紐付く妊婦が早産になった群と、正期産になった群のデータを抽出する。その後、データ収集装置１００は、ｓ１０２で説明するような方法を用いて、二群間でデータ分布が異なるデータ項目を抽出し、交絡データ項目とする。ここでは、例えば多変量ロジスティック回帰にてｐ値がａ（例えばa=0.01）未満となるデータ項目を抽出する。さらに、交絡データ項目に季節項を入れても良い。本発明は、これに限定されず、早産群と正期産群とを分離可能な何れか適切な判別モデル（例えば、統計モデル、学習モデルなど）が適用されてもよい。 In step s202-3, with respect to the data at the time of pregnancy, data of a group in which the pregnant women associated with the data are prematurely delivered and a group in which the premature delivery is established are extracted. Thereafter, the data collection device 100 extracts data items having different data distributions between the two groups by using the method described in s102, and sets the data items as confounded data items. Here, for example, data items whose p-value is less than a (for example, a = 0.01) are extracted by multivariate logistic regression. Further, a seasonal term may be included in the confounding data item. The present invention is not limited to this, and any appropriate discrimination model (for example, a statistical model, a learning model, or the like) that can separate the preterm birth group and the full-term birth group may be applied.

ステップｓ２０２−４において、それらの交絡データ項目間の相関係数を計算し、相関がｂ（例えばb=0.75）以上のものを同じクラスタに分け、各クラスタでもっともｐ値が低い項目を残してそれ以外を除外することで交絡データ項目数をクラスタの数に削減する。クラスタの構成方法としては、同一クラスタ内の相互のデータ項目が全部ｂ以上であるとする場合を想定するが、それに限定されるものでない。なお、ｓ２０２−４を実行してからｓ２０２−３を実行する場合もあれば、ｓ２０２−３とｓ２０２−４を繰り返して実行する場合もある。 In step s202-4, the correlation coefficient between the confounded data items is calculated, and those having a correlation of b (for example, b = 0.75) or more are divided into the same cluster, and the item having the lowest p value is left in each cluster. By excluding others, the number of confounding data items is reduced to the number of clusters. As a cluster configuration method, it is assumed that all data items in the same cluster are b or more, but the method is not limited to this. Note that s202-3 may be executed after executing s202-4, or s202-3 and s202-4 may be executed repeatedly.

次に、ステップｓ２０２−５において、データ収集装置１００は、妊娠時データおよび経時データに対し、各データ項目１つずつに交絡データ項目を加えてロジスティック回帰によるp値とオッズ比を算出し、p値がc（例えばc=0.01）未満となるデータ項目を選択することで、交絡因子の影響を除いたうえで二群を有意に分けるデータ項目を特定する。 Next, in step s202-5, the data collection device 100 calculates the p-value and odds ratio by logistic regression by adding confounding data items to each data item one by one with respect to the pregnancy data and the time-lapse data, and p By selecting the data item whose value is less than c (for example, c = 0.01), the data item that significantly divides the two groups is specified after removing the influence of the confounding factor.

ステップｓ２０２−６において、それらのデータ項目間の相関係数を計算し、相関がｄ（例えばd=0.75）以上のものを同じクラスタに分け、各クラスタでもっともp値が低い項目を残してそれ以外を除外することでデータ項目数をクラスタの数に削減する。クラスタの構成方法については、ｓ２０２−４の場合と同様である。 In step s202-6, a correlation coefficient between these data items is calculated, and those having a correlation of d (for example, d = 0.75) or more are divided into the same cluster, and the item having the lowest p-value is left in each cluster. The number of data items is reduced to the number of clusters by excluding other items. The cluster configuration method is the same as in s202-4.

次にステップｓ２０２−７において、データ収集装置１００は、上記で残ったデータ項目をｐ値が低い順に並べ替え、これらを早産予測モデル作成のための特徴量とする。また、特徴量表として、データ項目名、オッズ比、p値、変回帰係数βなどの情報を保存する。 Next, in step s202-7, the data collection device 100 rearranges the remaining data items in ascending order of the p-value, and sets these as feature amounts for creating a preterm birth prediction model. In addition, information such as data item names, odds ratios, p values, and regression coefficients β are stored as feature quantity tables.

次に、早産予測モデルの作成方法について説明する。 Next, a method for creating a preterm birth prediction model will be described.

上記で作成した特徴量の全てを使ったモデルが最適とは限らないため、特徴量の数を徐々に増やしてモデルを作成し、予測精度が高いモデルを選定することを考える。 Since the model using all of the feature quantities created above is not necessarily optimal, consider creating a model by gradually increasing the number of feature quantities and selecting a model with high prediction accuracy.

ステップｓ２０３−１において、データ収集装置１００は、妊娠の進み具合によってその後の早産発症のリスクが変化することを想定し、一定の時間単位ごとに早産発症の予測モデルを構築するものとする。一定の時間単位とは、例えば一週間を単位とすることが可能であるが、その限りではない。ここでは、妊娠１６週から３６週の各週（ｔ週）においての予測モデルを構築するものとする。 In step s203-1, the data collection device 100 assumes that the risk of subsequent preterm onset changes according to the progress of pregnancy, and constructs a predictive model for the onset of premature birth every certain time unit. The certain time unit can be, for example, a week, but is not limited thereto. Here, it is assumed that a prediction model for each week (week t) from the 16th to 36th weeks of pregnancy is constructed.

ステップｓ２０３−２において、データ収集装置１００は、ｓ２０２−２の妊娠時データおよび経時データにおいて、ｓ２０２−７で特定した特徴量のうち、妊娠時データおよびt週以前の経時データを抽出する。 In step s203-2, the data collection device 100 extracts the pregnancy time data and the time data before t weeks from the feature amount specified in s202-7 from the pregnancy time data and time data of s202-2.

ステップｓ２０３−３において、データ収集装置１００は、モデル作成の準備として、学習用データと評価用データを作成する。具体的には、データ収集装置１００は、まず早産群Ｄ１と正期産群Ｄ２とに分ける。また、学習用データと評価用データに分ける比率をｅとして、データ収集装置１００は更に、Ｄ１をＤ１ａとＤ１ｂとに分け、Ｄ２をＤ２ａとＤ２ｂとに分け、学習用データをＤ１ａ＋Ｄ２ａとし、評価用データをＤ１ｂ＋Ｄ２ｂとする。ここで、ｅは任意の数値で、例えば、ｅ＝０．５であってもよい。また、Ｄ２として使用する妊婦の数を調節することが可能であり、例えばＤ２の妊婦の数をＤ１の妊婦の数の２倍まで、とすることも可能である。 In step s203-3, the data collection device 100 creates learning data and evaluation data as preparation for model creation. Specifically, the data collection device 100 first divides into the preterm birth group D1 and the full term birth group D2. The data collection apparatus 100 further divides D1 into D1a and D1b, D2 into D2a and D2b, and the learning data as D1a + D2a, where e is the ratio of dividing the learning data and the evaluation data into evaluative data. The data is D1b + D2b. Here, e is an arbitrary numerical value, and may be e = 0.5, for example. Further, the number of pregnant women used as D2 can be adjusted. For example, the number of pregnant women of D2 can be up to twice the number of pregnant women of D1.

ステップｓ２０３−４において、予測装置１００は、予測性能を向上させるため、学習用データD1a, D2aにおいてD1aをオーバーサンプリングしてもよく、及び／又はD2aをアンダーサンプリングしてもよい。 In step s203-4, the prediction apparatus 100 may oversample D1a and / or undersample D2a in the learning data D1a and D2a in order to improve prediction performance.

ステップｓ２０３−５において、データ収集装置１００は、学習用データD1a, D2aを用いてforward stepwise selectionによって特徴量を選択し、t週時点での早産予測モデルM_tを学習する。例えば、この時点の予測モデルとしてサポートベクターマシン（ＳＶＭ）を使用し、学習用データを用いて学習させる。例えば、最初に１００個の特徴量（妊娠時データおよび経時データ）について、予測装置１００は、１つの特徴量だけを用いてｔ時点の早産予測暫定モデルM_t (1)〜M_t (100)を作成する。そして、予測装置１００は、評価データ用D1b, D2bを用いて、最も性能の良いM_t (A)（特徴量Aを使用）を特定する。性能スコアとして、例えば、AUC(Area under the curve)、f-score、感度と特異度の調和平均などが利用されてもよい。次に、予測装置１００は、学習用データD1a, D2aを用いて、当該特徴量を除いた残りの９９個の特徴量から１つの特徴量を選択してAと併せた２変数の早産予測モデルM_t (A,1)〜M_t (A,99)を作成し、評価用データD1b, D2bを用いて、M_t (A)を含めて最適なモデルM_t(A,B)を選択する。予測装置１００は、上述した一連の処理を再帰的に実行し、最終的に早産予測モデルM_t (A,B,C,...)を特定する。 In step s203-5, the data collection device 100 selects feature amounts by forward stepwise selection using the learning data D1a and D2a, and learns the preterm birth prediction model M _t at the time point _t . For example, a support vector machine (SVM) is used as a prediction model at this time, and learning is performed using learning data. For example, for the first 100 feature quantities (pregnancy data and time-lapse data), the prediction apparatus 100 uses only one feature quantity to predict the premature birth prediction model M _t (1) to M _t (100) at time _t. Create Then, the prediction apparatus 100 identifies M _t (A) (using the feature amount A) with the best performance using the evaluation data D1b and D2b. As the performance score, for example, AUC (Area under the curve), f-score, harmonic average of sensitivity and specificity may be used. Next, the prediction device 100 selects one feature amount from the remaining 99 feature amounts excluding the feature amount using the learning data D1a and D2a, and uses a two-variable premature birth prediction model combined with A. create an _{M t (a, 1) ~M} t (a, 99), evaluation data D1b, with D2b, including M _t (a) selecting an optimum model M _t (a, B) . The prediction device 100 recursively executes the series of processes described above, and finally identifies the preterm birth prediction model M _t (A, B, C,...).

なお、早産予測モデルMは、SVM、線形ロジスティック回帰、ニューラルネットワーク、ランダムフォレストなどに基づき生成されてもよい。また、最適な特徴量を選択する方法としてはforward step selectionに限定されず、backward stepwise selection又はbidirectional stepwise selectionが利用されてもよい。また、全データ項目を利用し、PCAやMDS、NMDSなどで次元削減した後、PC1, PC2, ..., PCNのN個の特徴量に対してforward/backward/bidirectional stepwise selectionを適用してもよい。 Note that the preterm birth prediction model M may be generated based on SVM, linear logistic regression, neural network, random forest, or the like. Further, the method for selecting the optimum feature quantity is not limited to forward step selection, and backward stepwise selection or bidirectional stepwise selection may be used. Also, after using all data items and reducing dimensions with PCA, MDS, NMDS, etc., apply forward / backward / bidirectional stepwise selection to N features of PC1, PC2, ..., PCN. Also good.

ステップｓ２０３−６において、データ収集装置１００は、ｔについて全時点のモデルを作成し、ループを終了する。
（実施例６）
図１４は、本発明の他の実施例による特徴量抽出・出産日予測モデル作成処理を示すフローチャートである。複数の週数のデータを用いることで、早産と正期産のデータ分布の違いだけでなく、データ分布の時系列的変化の違いを利用し、より性能の高い予測モデルを生成可能である。当該特徴量抽出・モデル作成処理は、上述された図８のステップｓ１０２，ｓ１０５において実行可能であり、前述のステップｓ２０２−１〜７がステップｓ１０２の特徴量の抽出処理に対応し、後述のステップｓ２０５−１〜６がステップｓ１０５の出産日予測モデルの生成処理に対応する。なお、図１４のステップｓ２０２−１〜７は図１３のステップｓ２０２−１〜７と同じであり、ステップｓ２０５−１〜４，ｓ２０５−６は図１３のステップｓ２０３−１〜４，ｓ２０３−６と同じであるため、その説明は省く。 In step s203-6, the data collection device 100 creates models at all points in time for t, and ends the loop.
(Example 6)
FIG. 14 is a flowchart showing a feature quantity extraction / birth date prediction model creation process according to another embodiment of the present invention. By using the data of multiple weeks, it is possible to generate a prediction model with higher performance using not only the difference in data distribution between premature delivery and full-term delivery but also the difference in time-series changes in data distribution. The feature quantity extraction / model creation process can be executed in steps s102 and s105 of FIG. 8 described above, and the above-described steps s202-1 to s20-7 correspond to the feature quantity extraction process of step s102. Steps s205-1 to 6 correspond to the birth date prediction model generation process in step s105. 14 is the same as steps s202-1 to s72-1 to 7 in FIG. 13, and steps s205-1 to s205-6 are the same as steps s203-1 to s203-6 in FIG. The explanation is omitted because it is the same.

図１４のステップｓ２０５−５において、データ収集装置１００は、学習用データD1a, D2aを用いてforward stepwise selectionによって特徴量を選択し、t週時点での出産日予測モデルM_tを学習する。例えば、この時点の予測モデルとしてサポートベクターマシン（ＳＶＭ）を使用し、学習用データを用いて学習させる。例えば、最初に１００個の特徴量（妊娠時データおよび経時データ）について、データ収集装置１００は、１つの特徴量だけを用いてｔ時点の早産予測暫定モデルM_t (1)〜M_t (100)を作成する。そして、データ収集装置１００は、評価データ用D1b, D2bを用いて、最も性能の良いM_t (A)（特徴量Aを使用）を特定する。性能スコアとして、例えば、MSE（平均平方誤差）又はR²（決定係数）が利用されてもよい。次に、データ収集装置１００は、学習用データD1a, D2aを用いて、当該特徴量を除いた残りの９９個の特徴量から１つの特徴量を選択してAと併せた２変数の早産予測モデルM_t (A,1)〜M_t (A,99)を作成し、評価用データD1b, D2bを用いて、M_t (A)を含めて最適なモデルM_t(A,B)を選択する。データ収集装置１００は、上述した一連の処理を再帰的に実行し、最終的に早産予測モデルM_t (A,B,C,...)を特定する。 In step s205-5 in FIG. 14, the data collection device 100, the learning data D1a, select the feature quantity by forward stepwise selection with D2a, learns the birth date predictive model M _t at t weeks time. For example, a support vector machine (SVM) is used as a prediction model at this time, and learning is performed using learning data. For example, for the first 100 feature quantities (pregnancy data and time-lapse data), the data collection device 100 uses only one feature quantity to predict the premature birth prediction model M _t (1) to M _t (100 ). Then, the data collection device 100 identifies M _t (A) (using the feature amount A) with the best performance using the evaluation data D1b and D2b. As the performance score, for example, MSE (mean square error) or R ² (determination coefficient) may be used. Next, the data collection device 100 selects one feature amount from the remaining 99 feature amounts excluding the feature amount using the learning data D1a and D2a, and predicts premature birth of two variables combined with A Create models M _t (A, 1) to M _t (A, 99), and use evaluation data D1b and D2b to select the optimal model M _t (A, B) including M _t (A) To do. The data collection device 100 recursively executes the series of processes described above, and finally identifies the preterm birth prediction model M _t (A, B, C,...).

なお、早産予測モデルMは、SVM、線形回帰、ニューラルネットワークなどに基づき生成されてもよい。また、最適な特徴量を選択する方法としてはforward step selectionに限定されず、backward stepwise selection又はbidirectional stepwise selectionが利用されてもよい。また、全データ項目を利用し、PCAやMDS、NMDSなどで次元削減した後、PC1, PC2, ..., PCNのN個の特徴量に対してforward/backward/bidirectional stepwise selectionを適用してもよい。
（実施例７）
図１５は、本発明の他の実施例による特徴量抽出・早産リスク予測モデル作成処理を示すフローチャートである。複数の週数のデータを用いることで、早産と正期産のデータ分布の違いだけでなく、データ分布の時系列的変化の違いを利用し、より性能の高い予測モデルを生成可能である。当該特徴量抽出・モデル作成処理は、上述された図８のステップｓ１０２，ｓ１０７において実行可能であり、前述のステップｓ２０２−１〜７がステップｓ１０２の特徴量の抽出処理に対応し、後述のステップｓ２０７−１〜６がステップｓ１０７の早産リスク予測モデルの生成処理に対応する。なお、図１５のステップｓ２０２−１〜７は図１３のステップｓ１０２−１〜７と同じであり、ステップｓ２０７−１〜４，ｓ２０７−６は図１３のステップｓ２０３−１〜４，ｓ２０３−６と同じであるため、その説明は省く。 Note that the preterm birth prediction model M may be generated based on SVM, linear regression, neural network, or the like. Further, the method for selecting the optimum feature quantity is not limited to forward step selection, and backward stepwise selection or bidirectional stepwise selection may be used. Also, after using all data items and reducing dimensions with PCA, MDS, NMDS, etc., apply forward / backward / bidirectional stepwise selection to N features of PC1, PC2, ..., PCN. Also good.
(Example 7)
FIG. 15 is a flowchart showing a feature quantity extraction / premature birth risk prediction model creation process according to another embodiment of the present invention. By using the data of multiple weeks, it is possible to generate a prediction model with higher performance using not only the difference in data distribution between premature delivery and full-term delivery but also the difference in time-series changes in data distribution. The feature quantity extraction / model creation process can be executed in steps s102 and s107 of FIG. 8 described above, and the above-described steps s202-1 to s20-7 correspond to the feature quantity extraction process of step s102, and will be described later. Steps s207-1 to 6 correspond to the process for generating a preterm birth risk prediction model in step s107. Steps s202-1 to s7-7 in FIG. 15 are the same as steps s102-1 to s7-7 in FIG. 13, and steps s207-1 to s207-6 are steps s203-1 to s203-6 in FIG. The explanation is omitted because it is the same.

図１５のステップｓ２０７−５において、データ収集装置１００は、ｓ２０３−５と同様な方法で週数ｔにおける早産リスク予測モデルM_ｔを作成するが、この際に用いるモデルには多変量ロジスティック回帰とする。 In step s207-5 of FIG. 15, the data collection device 100 creates a premature birth risk prediction model M _t for the number of weeks t by the same method as in s203-5. The model used at this time includes multivariate logistic regression. To do.

図８のデータ収集装置１００は、ｓ１０８にて、妊娠週数ｔ１週の未出産の妊婦において、早産リスク予測モデルM_t1に特徴量のデータを当てはめ、早産となる確率ｐを計算することができる。以上、本発明の実施例について詳述したが、本発明は上述した特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 The data collection device 100 of FIG. 8 can calculate the probability p of premature birth by applying feature data to the preterm birth risk prediction model M _t1 in a pregnant woman who has not given birth in t1 gestation week in s108. . As mentioned above, although the Example of this invention was explained in full detail, this invention is not limited to the specific embodiment mentioned above, In the range of the summary of this invention described in the claim, various deformation | transformation・ Change is possible.

１０データ収集システム
２０計測装置
３０測定値転送装置
４０体調登録装置
５０気象情報取得装置
６０調査票登録装置
７０診療記録登録装置
１００データ収集装置
２００ネットワーク DESCRIPTION OF SYMBOLS 10 Data collection system 20 Measurement apparatus 30 Measurement value transfer apparatus 40 Physical condition registration apparatus 50 Weather information acquisition apparatus 60 Survey form registration apparatus 70 Medical record registration apparatus 100 Data collection apparatus 200 Network

Claims

A data acquisition function for acquiring data about pregnant women from multiple data source devices;
Based on a preterm birth prediction model, a preterm birth prediction function for performing the preterm birth prediction of the pregnant woman from the acquired data;
A result notification function for notifying the result of the preterm birth prediction;
Have
The data acquisition function is a data collection device that acquires the data at a plurality of time points in a data acquisition period from any time before and after pregnancy of the pregnant woman to any time before and after childbirth.

Data on the pregnant woman include age at pregnancy, blood pressure, pulse rate, body weight, BMI, number of steps, calorie consumption, body temperature, sleep time, number of times awakened during sleep, quality of sleep, number of bowel movements, softness of bowel movements, uterine contraction , Head pain, tooth pain, upper abdominal pain, lower abdominal pain, lower back pain, nausea, vomiting, palpitation, fetal interval, meal content, medication content, past premature birth history, pregnancy status, temperature, room temperature, The data collection device according to claim 1, comprising one or more data items of humidity, atmospheric pressure, weather conditions, tide, and age.

Data on the pregnant woman include age at pregnancy, blood pressure, pulse rate, body weight, BMI, number of steps, calorie consumption, body temperature, sleep time, number of times awakened during sleep, quality of sleep, number of bowel movements, softness of bowel movements, uterine contraction , Head pain, tooth pain, upper abdominal pain, lower abdominal pain, lower back pain, nausea, vomiting, palpitation, fetal interval, meal content, medication content, past premature birth history, pregnancy status, temperature, room temperature, The data collection device according to claim 1 or 2, comprising a combination of two or more data items of humidity, atmospheric pressure, weather conditions, tide, and age.

The premature birth prediction function uses the acquired data, and uses the magnitude of the difference in data distribution between preterm birth and full-term birth or / and the magnitude of the difference in time series change in data distribution in the preterm birth prediction model. The data collection device according to any one of claims 1 to 3, wherein the premature birth occurrence prediction, birth date prediction, or preterm birth risk calculation of the pregnant woman is executed.

The data collection device according to claim 4, wherein the preterm birth predicting function executes premature birth occurrence prediction, birth date prediction, or preterm birth risk calculation based on data items that differ depending on the number of weeks of the pregnant woman.

Based on data on the pregnant women obtained from multiple pregnant women, including pregnant women who delivered at term and those who delivered prematurely, the magnitude of the difference in data distribution between preterm and term products and / or time series of data distribution The data collection device according to any one of claims 1 to 5, further comprising a premature birth prediction model generation function for generating a preterm birth prediction model using a magnitude of a difference in a global change.

The premature birth prediction model generation function determines a data item for discriminating a pregnant woman who gave birth by the term birth and a pregnant woman who gave birth by the premature birth, and the data distribution difference between preterm delivery and term delivery is determined by the determined data item. The data collection apparatus according to claim 6, wherein the premature birth prediction model is generated using a magnitude or / and a magnitude of a difference in time-series change in data distribution.

Obtaining data on pregnant women from a plurality of data source devices;
Performing a preterm birth prediction of the pregnant woman from the acquired data based on a preterm birth prediction model;
Notifying the result of the preterm birth prediction;
Have
The step of executing the preterm birth prediction of the pregnant woman includes obtaining the data at a plurality of time points in a data obtaining period from any time before and after pregnancy of the pregnant woman to any time before and after giving birth. Collection method.