CN115186918A

CN115186918A - Fetal birth weight prediction method based on ensemble learning

Info

Publication number: CN115186918A
Application number: CN202210874763.3A
Authority: CN
Inventors: 高婧; 石虎伟; 程蔚蔚; 姚昱君; 徐捷; 陈锐遥; 许肖娜
Original assignee: International Peace Maternity & Child Health Hospital Of China Welfare Institute; Shanghai AI Innovation Center
Current assignee: International Peace Maternity & Child Health Hospital Of China Welfare Institute; Shanghai AI Innovation Center
Priority date: 2022-07-22
Filing date: 2022-07-22
Publication date: 2022-10-14

Abstract

The invention relates to the technical field of fetal birth weight prediction, and provides a fetal birth weight prediction method based on ensemble learning, which comprises the following steps: preprocessing and data balancing processing are carried out on case data; determining a standard model; averaging the standard models to generate an ensemble learning model; training the ensemble learning model through the case data; and inputting feature information into the ensemble learning model to generate a predicted fetal weight.

Description

Fetal birth weight prediction method based on ensemble learning

Technical Field

The invention relates to the technical field of fetal birth weight prediction in general. In particular, the invention relates to a fetus birth weight prediction method based on ensemble learning.

Background

In modern prenatal care, it is essential to predict fetal weight in order to monitor fetal growth. Since growth abnormalities are associated with adverse consequences for the parturient and fetus, such as the birth of a giant fetus associated with shoulder dystocia, postpartum hemorrhage and fracture at birth, infants with low birth weight are at increased risk of receiving Neonatal Intensive Care Unit (NICU) and lifelong disease. Thus, accurate prediction of fetal weight facilitates clinical decisions, such as timely prenatal intervention and selection of a reasonable delivery style to improve pregnancy outcome.

In the prior art, an ultrasonic evaluation method based on biological characteristic measurement and a regression equation is the most preferred method in obstetrical practice due to objectivity and convenience. However, most ultrasound formulas are established based on the western population, and since the fetal weight varies from race to race after 20 weeks, there is a deviation when the ultrasound formula is applied to chinese. In addition, the accuracy of fetal weight prediction for large and low birth weight infants is low. Taking the Hadlock equation as an example, the overall sensitivity of estimating fetal weight is only 0.56 (95% Cl, 0.49-0.62), while inaccurate estimates may lead to inappropriate prenatal intervention. In addition, the prior art generally depends on professional ultrasonic instruments, so that a lot of inconvenience exists in use.

At present, a method for predicting the birth weight of a fetus with high convenience and accuracy needs to be provided.

Disclosure of Invention

To at least partially solve the above problems in the prior art, the present invention provides a method for predicting birth weight of fetus based on ensemble learning, comprising the following steps:

preprocessing and data balancing processing are carried out on case data;

determining a standard model;

averaging the standard models to generate an ensemble learning model;

training the ensemble learning model through the case data; and

inputting feature information into the ensemble learning model to generate a fetal predicted weight.

In one embodiment of the invention, it is provided that the preprocessing of the case data comprises:

deleting the missing value; and/or

Confirming the deletion value of one of height, weight and BMI before pregnancy according to the BMI formula; and/or

Converting the number of gestational days into the number of gestational weeks; and/or

Converting a first feature into a second feature', wherein the first feature comprises obstetrics related features and ultrasound related features, and the second feature is related to pregnancy time and is represented by the following formula:

in one embodiment of the present invention, it is provided that the data equalization processing of the case data includes:

classifying newborns weighing less than 2500g as low-weight newborns, newborns weighing 2500g or more and 4000g or less as normal-weight newborns, and newborns weighing 4000g or more as large-weight newborns; and

the number of low-weight and large-weight newborns is increased by the up-sampling method.

In one embodiment of the invention, it is provided that the upsampling method comprises a SMOTE method.

In one embodiment of the invention, the standard model comprises a Ridge model, an SVM model, a Random Forest model, an XGboost model and an MLP model.

In one embodiment of the invention, the standard model is averaged by a Bagging method to generate an ensemble learning model.

In one embodiment of the invention, it is provided that the characteristic information comprises: pre-pregnancy weight, last parity weight, last ultrasound births day, pre-pregnancy BMI, weight gain during pregnancy, last parity births day, gestational diabetes mellitus, parity, parietal diameter, abdominal circumference, left and right abdominal diameters, anteroposterior abdominal diameter, and amniotic fluid index.

The present invention further provides a computer system, comprising:

a processor configured to execute machine executable instructions; and

a memory having stored thereon machine executable instructions which, when executed by the processor, perform the steps according to the ensemble learning-based fetal birth weight prediction method.

The invention also proposes a computer readable storage medium having stored thereon a computer program which, when being executed by a processor, performs the steps according to the ensemble learning based fetal birth weight prediction method.

It should be noted that the present invention is not related to methods for diagnosis and treatment of disease, but merely provides information related to medical treatment, and pertains to an intelligent system, i.e. the present invention is neither intended to determine the disease of a patient, nor to provide a parameter or index for diagnosing a disease, nor is it a method for prescreening a disease. In contrast, the information provided by the solution of the invention cannot be used for diagnosis and treatment of diseases, but the corresponding diagnosis and treatment should be provided to the user by the hospital/doctor.

The invention has at least the following beneficial effects: the invention provides a fetus birth weight prediction method based on ensemble learning, and the prediction effect of the method in different weight ranges greatly exceeds that of an ultrasonic formula. The accuracy of final prediction of the fetal birth weight can reach 82.25%. In addition, in the method, the user can input the characteristic information through the mobile terminal, so that the use convenience is greatly improved.

Drawings

To further clarify the advantages and features that may be present in various embodiments of the present invention, a more particular description of various embodiments of the invention will be rendered by reference to the appended drawings. It is appreciated that these drawings depict only typical embodiments of the invention and are therefore not to be considered limiting of its scope. In the drawings, the same or corresponding parts will be denoted by the same or similar reference numerals for clarity.

Fig. 1 shows a computer system implementing the method according to the invention.

Fig. 2 is a flow chart of a method for predicting birth weight of a fetus based on ensemble learning according to an embodiment of the present invention.

Fig. 3 shows a schematic diagram of a multi-layer perceptron model.

Detailed Description

It should be noted that the components in the figures may be exaggerated and not necessarily to scale for illustrative purposes. In the figures, identical or functionally identical components are provided with the same reference symbols.

In the present invention, "disposed on" \ 8230 "", "disposed over" \823030 "", and "disposed over" \8230 "", do not exclude the presence of an intermediate therebetween, unless otherwise specified. Furthermore, "arranged on or above" \\8230 ", merely indicates a relative positional relationship between two components, and in certain cases, such as after reversing the product direction, may also be converted to" arranged under or below \8230 ", and vice versa.

In the present invention, the embodiments are only intended to illustrate the aspects of the present invention, and should not be construed as limiting.

In the present invention, the terms "a" and "an" do not exclude the presence of a plurality of elements, unless otherwise specified.

It is further noted herein that in embodiments of the present invention, only a portion of the components or assemblies may be shown for clarity and simplicity, but those of ordinary skill in the art will appreciate that, given the teachings of the present invention, required components or assemblies may be added as needed for a particular situation. Furthermore, features from different embodiments of the invention may be combined with each other, unless otherwise indicated. For example, a feature of the second embodiment may be substituted for a corresponding or functionally equivalent or similar feature of the first embodiment, and the resulting embodiments are likewise within the scope of the disclosure or recitation of the present application.

It is also noted herein that, within the scope of the present invention, the terms "same", "equal", and the like do not mean that the two values are absolutely equal, but allow some reasonable error, that is, the terms also encompass "substantially the same", "substantially equal". By analogy, in the present disclosure, the terms "perpendicular," parallel, "and the like in the directions of the tables also encompass the meanings of" substantially perpendicular, "" substantially parallel.

The numbering of the steps of the methods of the present invention does not limit the order in which the method steps are performed. Unless specifically stated, the method steps may be performed in a different order.

The invention is further elucidated with reference to the following description, in conjunction with the detailed description, and with reference to the accompanying drawings.

FIG. 1 illustrates a computer system 100 implementing systems and/or methods in accordance with the present invention. Unless specifically stated otherwise, a method and/or system in accordance with the present invention may be implemented in the computer system 100 shown in FIG. 1 for purposes of the present invention, or the present invention may be implemented in a distributed fashion across a network, such as a local area network or the Internet, among multiple computer systems 100 in accordance with the present invention. Computer system 100 of the present invention may comprise various types of computer systems, such as hand-held devices, laptop computers, personal Digital Assistants (PDAs), multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, network servers, tablet computers, and the like.

As shown in FIG. 1, computer system 100 comprises processor 111, system bus 101, system memory 102, video adapter 105, audio adapter 107, hard drive interface 109, optical drive interface 113, network interface 114, and Universal Serial Bus (USB) interface 112. The system bus 101 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures. The system bus 101 is used for communication between the respective bus devices. In addition to the bus devices or interfaces shown in fig. 1, other bus devices or interfaces are also contemplated. The system memory 102 includes a Read Only Memory (ROM) 103 and a Random Access Memory (RAM) 104, where the ROM 103 may store, for example, basic input/output system (BIOS) data used to implement basic routines for information transfer at start-up, and the RAM 104 is used to provide operating memory for the system that is accessed quickly. The computer system 100 further includes a hard disk drive 109 for reading from and writing to a hard disk 110, an optical drive interface 113 for reading from or writing to optical media such as a CD-ROM, and the like. Hard disk 110 may store, for example, an operating system and application programs. The drives and their associated computer-readable media provide nonvolatile storage of computer readable instructions, data structures, program modules and other data for the computer system 100. Computer system 100 may also include a video adapter 105 for image processing and/or image output for connecting an output device such as a display 106. Computer system 100 may also include an audio adapter 107 for audio processing and/or audio output, for connecting output devices such as speakers 108. In addition, the computer system 100 may also include a network interface 114 for network connections, where the network interface 114 may connect to the Internet 116 through a network device, such as a router 115, where the connection may be wired or wireless. Additionally, computer system 100 may also include a universal serial bus interface (USB) 112 for connecting peripheral devices, including, for example, a keyboard 117, a mouse 118, and other peripheral devices, such as a microphone, a camera, and the like.

When the invention is implemented on the computer system 100 shown in fig. 1, the fetal birth weight can be predicted in different weight ranges, the prediction effect can greatly exceed the ultrasonic formula, and the accuracy of the fetal birth weight prediction can finally reach 82.25%. In addition, in the method, the user can input the characteristic information through the mobile terminal, so that the use convenience is greatly improved.

Furthermore, embodiments may be provided as a computer program product that may include one or more machine-readable media having stored thereon machine-executable instructions that, when executed by one or more machines such as a computer, network of computers, or other electronic devices, may result in the one or more machines performing operations in accordance with embodiments of the present invention. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs (compact disc read-only memories), and magneto-optical disks, ROMs (read-only memories), RAMs (random access memories), EPROMs (erasable programmable read-only memories), EEPROMs (electrically erasable programmable read-only memories), magnetic or optical cards, flash memory, or other type of media/machine-readable medium suitable for storing machine-executable instructions.

Moreover, embodiments may be downloaded as a computer program product, wherein the program may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of one or more data signals embodied in and/or modulated by a carrier wave or other propagation medium via a communication link (e.g., a modem and/or network connection). Thus, a machine-readable medium as used herein may include, but is not necessarily required to be, such a carrier wave.

Fig. 2 is a flow chart of a method for predicting birth weight of a fetus based on ensemble learning according to an embodiment of the present invention. As shown in fig. 2, the method may include the steps of:

step 201, preprocessing and data balancing are performed on the case data.

Step 202, determining a standard model.

And step 203, averaging the standard models to generate an ensemble learning model.

And step 204, training the ensemble learning model through the case data.

Step 205, inputting the characteristic information into the ensemble learning model to generate the predicted fetal weight.

In step 201, for the missing values in the case data, the corresponding missing values can be filled as much as possible according to the calculation formula BMI = weight/[ height ] 2, and the relationship between height, weight and BMI before pregnancy. For other missing values, they can be deleted directly because the amount of original data is sufficient and the missing value ratio is not high. 17036 samples remained after the null was removed. New features of the BMI classification can be added based on chinese body mass index and in combination with the corresponding BMI to consider whether the weight gain during pregnancy is within the appropriate range. In addition, the number of gestational days may be converted into the number of gestational weeks. Processing the characteristics into a form related to pregnancy time, including obstetric related characteristics and ultrasound related characteristics, as represented by the formula:

the proportion of low and large births is very small relative to normal-weight newborns, and we classify the samples into three categories by boundaries of 2500g and 4000g and then upsample to increase the number of extreme-weight newborns so that the model can be used for any weight of newborn weight prediction. A SMOTE (Synthetic minimum Oversampling Technique) method can be used as the upsampling method, and the SMOTE method is based on an improved scheme of a random Oversampling method, so that the problem of model overfitting caused by repeated sampling can be solved. In the SMOTE method, comprising:

calculating the distance from each sample x in the minority class to all samples in the minority class sample set Smin by taking the Euclidean distance as a standard, and obtaining k nearest neighbors of the sample x;

the sampling rate is set according to the sample imbalance rate to determine the sampling rate N. For each minority class sample x, randomly selecting a plurality of samples from k nearest neighbors centered on the minority class sample x, assuming that the selected nearest neighbors are xn; and

for each randomly selected neighbor xn, a new sample is constructed from the original sample according to the following formula:

x _new ＝x+rand(0，1)*|x-x _n |。

ridge, SVM, random-Forest, XGboost, MLP may be used as reference models in step 202.

In the Ridge model, the first term of the cost function of the regression is consistent with the standard linear regression, which is the sum of squared euclidean distances, except that the L2 norm is added as a penalty term. The cost function is expressed as:

the XGBoost model is an improvement on a (GBDT) Gradient Boosting Decision Tree. The objective function (loss function and regularization) needed to minimize iteration t in the XGBoost model is expressed as:

the method is operated in a Random Forest model by constructing a plurality of decision trees during training and outputting an average prediction class, wherein the method comprises the following steps:

generating n samples (same number as original training set) from the sample set by resampling (first bagging);

assuming that the feature number is k, randomly selecting t features from the k features of the n samples, and obtaining an optimal segmentation point by constructing a decision tree (bagging for the second time); repeating m times to generate m decision trees; and

the prediction is made by a majority voting mechanism.

An SVM (support vector machine) model is a generalized linear classifier, binary classification is carried out on data according to supervised learning, and a decision boundary is the maximum margin for solving a learning sample. The maximum margin hyperplane can also be used for regression problems. A linear kernel can be used in the regression problem.

As shown in fig. 3, a four-layer neural network may be constructed in an MLP (multi layer Perceptron) model, in which Batch norms are used, and Batch norm and a leak ReLU function are used in multiple neural layers

In step 203, a plurality of reference models can be combined by an ensemble learning method to obtain a model with better prediction effect. Ensemble learning is generally classified into three types, bagging, boosting, and Stacking. In the invention, only Bagging is used for constructing the ensemble learning model, and the ensemble learning model is averaged on the basis of the reference model result to obtain the final ensemble learning model result.

In step 205, 16 feature variables are extracted from the information transmitted by the user and the uploaded picture information, and input into the prediction model, which gives the predicted birth weight of the fetus, and records the relevant information according to the number of the doctor card of the user as an identification code, so as to perform later query and verification. Wherein the characteristic variables and sources are as follows: pregnancy weight (user input), last birth sample weight (user input), last ultrasound distance delivery day (due date-date of last delivery ultrasound), last BMI { due weight of pregnancy/(square of height (m) }, pregnancy weight gain (weight of last delivery-due weight of last delivery), last delivery distance delivery day (due date-last delivery), gestational diabetes (user input), birth time (user input), double-vertex diameter (obtained by OCR recognizing ultrasound information, range of values [50, 120 ]) mm), head circumference (obtained by OCR recognizing ultrasound information, range of values [150, 400 ]) mm), femur length (obtained by recognizing ultrasound information, range of values [20, 80 ]) mm), humerus length (obtained by OCR recognizing ultrasound information, range of values [5, 80 ]) mm), abdominal circumference (obtained by OCR recognizing ultrasound information, range of values [150, 400 ]) mm, abdominal circumference (obtained by OCR recognizing ultrasound information, range of values [40, 140 mm ], sheep circumference range of values [ OCR, 40, 140 mm ], abdominal circumference of values [ OCR information, OCR range of values [ 140, 140 mm ], abdominal circumference of values [ 140 mm ], abdominal circumference of values (obtained by OCR recognizing ultrasound information, 140 mm ], abdominal circumference of values [150, 140 mm ], abdominal circumference of values obtained by OCR recognizing ultrasound information, 140 mm, and abdominal circumference. And (3) outputting a model: it is estimated that [82.44]% of your fetal weight has a probability of [2790-3410] g, and the closest value is [3100] g. [] The result value given by the model is in the inner part, and the rest characters are fixed.

The calculation formula of the expected delivery period is as follows: adding 9 or subtracting 3 in the month of the last menstrual date to obtain the number of the month of the edd; the number of days plus 7 is the predicted date of delivery. For example, the last menstruation is 2 month 1, month 2+9, date 1+7, expected date of delivery =11 month 8 (85 years) in 1985; the last menstruation is 15 months 4 and 15 days 4-3 of 1985, day 15+7, and edd = 22 days 1 and 22 days (86 years).

The final prediction result is evaluated in a segmented manner, and the division values of the three segments are 2500g and 4000g respectively. To select an index with a float range of 10%. The prediction effects of different models in different weight intervals are greatly different, and some models have better prediction effects in low weight intervals, such as XGboost and Random Forest; some models have better prediction effect in high-weight intervals, such as Ridge and SVM. In addition, the multi-layer perceptron has better prediction effect in the normal weight range. The integrated model integrates the advantages and the disadvantages of the different algorithm models, the result has no particularly outstanding disadvantages, and the prediction effect in different weight ranges can greatly exceed that of an ultrasonic formula. The accuracy of final prediction of the fetal birth weight can reach 82.25%.

While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example only, and not limitation. It will be apparent to persons skilled in the relevant art that various combinations, modifications, and changes can be made thereto without departing from the spirit and scope of the invention. Thus, the breadth and scope of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.

Claims

1. A fetus birth weight prediction method based on ensemble learning is characterized by comprising the following steps:

preprocessing and data balancing processing are carried out on case data;

determining a standard model;

averaging the standard models to generate an ensemble learning model;

training the ensemble learning model through the case data; and

inputting feature information into the ensemble learning model to generate a predicted fetal weight.

2. The ensemble learning-based fetal birth weight prediction method of claim 1, wherein pre-processing the case data comprises:

deleting the missing value; and/or

Converting the gestational days into gestational weeks; and/or

Converting a first feature into a second feature', wherein the first feature comprises an obstetric related feature and an ultrasound related feature, and the second feature is related to a pregnancy time and is represented by the following formula:

3. the integrated learning-based fetal birth weight prediction method of claim 1 wherein the data equalization processing of case data comprises:

classifying newborns with a weight of less than 2500g as low-weight newborns, classifying newborns with a weight of 2500g or more and 4000g or less as normal-weight newborns, and classifying newborns with a weight of 4000g or more as large-weight newborns; and

4. The ensemble learning-based fetal birth weight prediction method of claim 3, wherein the up-sampling method comprises a SMOTE method.

5. The ensemble learning-based fetal birth weight prediction method according to claim 4, wherein the standard model comprises a Ridge model, an SVM model, a Random Forest model, an XGboost model and an MLP model.

6. The ensemble learning-based fetal birth weight prediction method of claim 5, wherein the standard model is averaged by Bagging to generate an ensemble learning model.

7. The ensemble learning-based fetal birth weight prediction method of claim 1, wherein the characteristic information comprises: pre-pregnancy weight, last parity weight, last ultrasound births day, pre-pregnancy BMI, weight gain during pregnancy, last parity births day, gestational diabetes mellitus, parity, parietal diameter, abdominal circumference, left and right abdominal diameters, anteroposterior abdominal diameter, and amniotic fluid index.

8. A computer system, comprising:

a processor configured to execute machine-executable instructions; and

memory having stored thereon machine executable instructions which, when executed by the processor, perform the steps of the method according to one of claims 1 to 7.

9. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to one of claims 1 to 7.