CN111276229A

CN111276229A - Outpatient quantity prediction method and system based on deep belief network

Info

Publication number: CN111276229A
Application number: CN202010110803.8A
Authority: CN
Inventors: 谷兴龙; 李向阳
Original assignee: Shandong Health Medical Big Data Co ltd
Current assignee: Shandong Health Medical Big Data Co ltd
Priority date: 2020-02-24
Filing date: 2020-02-24
Publication date: 2020-06-12

Abstract

The invention discloses an outpatient quantity prediction method and system based on a deep belief network, belonging to the technical field of data mining analysis, aiming at solving the technical problem of more effectively and accurately predicting the outpatient quantity of a hospital according to the historical outpatient quantity, and the technical scheme is as follows: the method is based on a restricted Boltzmann machine and a mode of establishing a logistic regression layer, and utilizes historical outpatient data of a hospital to predict daily outpatient quantity, weekly outpatient quantity and monthly outpatient quantity, so that the outpatient quantity of the hospital at the current time can be predicted more accurately and effectively; the method comprises the following specific steps: s1, constructing a deep confidence network; s2, establishing a logistic regression layer: adding a logistic regression layer on the upper layer of the deep confidence network as a prediction layer, and performing supervised prediction on the clinic diagnosis amount data by combining the extracted data characteristics of the historical clinic diagnosis amount; s3, training the network: training of the neural network is performed using a greedy layer-by-layer algorithm. The invention also discloses an outpatient quantity prediction system based on the deep belief network.

Description

Outpatient quantity prediction method and system based on deep belief network

Technical Field

The invention relates to the technical field of data mining analysis, in particular to an outpatient quantity prediction method and system based on a deep belief network.

Background

The outpatient service of the hospital is the first link of the patients receiving the treatment work of the hospital, is a window of the hospital facing the society, and is favorable for the smooth expansion of the subsequent medical work by accurately predicting the outpatient service quantity of the hospital. The hospital outpatient quantity can be used as an important index for evaluating the medical working condition of the modern hospital, and is also beneficial to the hospital to have important reference value for medical resources and distribution, so that the effective method is used for carrying out feature extraction on the historical outpatient quantity data of the hospital, and is further used for predicting the hospital outpatient quantity, and more attention is paid.

The prediction method for the hospital clinic volume is multiple, the basic idea is to analyze the relation and the rule among the historical clinic volume data and predict the clinic volume in a certain time period in the future, and the main methods can be roughly divided into three categories:

(1) and a time series analysis model based on historical data: a representative method is an autoregressive integrated moving average model (ARIMA), which analyzes the rule of time variation of hospital clinic volume so as to predict hospital clinic volume, and the model has better performance in predicting monthly clinic volume and quarterly clinic volume, but the ARIMA model assumes that data are linear in advance, and if the data have a nonlinear relationship, the performance of the model is reduced;

(2) and an artificial intelligence-based model: representative methods include Artificial Neural Networks (ANNs), Support Vector Machines (SVM), Genetic Algorithms (GA), and the like, and the methods can better simulate the nonlinear relationship existing among hospital outpatient data.

(3) The time series analysis model based on historical data and the model based on artificial intelligence are mixed models of two methods: the hybrid model combines the two methods to predict the hospital clinic amount, on one hand, the advantages of the different methods are utilized, and the defects of the two methods are avoided, so that the prediction result of the general hybrid model is more accurate compared with that of a single method, but the method is complex to operate.

In summary, how to more effectively and accurately predict the outpatient quantity of a hospital according to the historical outpatient quantity is an urgent problem to be solved at present

Disclosure of Invention

The technical task of the invention is to provide a clinic volume prediction method and a clinic volume prediction system based on a deep belief network, so as to solve the problem of how to more effectively and accurately predict the clinic volume of a hospital according to the historical clinic volume.

The technical task of the invention is realized in the following way, the method for predicting the number of outpatients based on the deep confidence network is based on a Restricted Boltzmann Machine (RBM) and a way of establishing a logistic regression layer, and the method utilizes the historical outpatient data of a hospital to predict the number of daily outpatients, the number of weekly outpatients and the number of monthly outpatients, so as to realize more accurate and effective prediction of the number of outpatients of the hospital at the current time; the method comprises the following specific steps:

s1, constructing a Deep Belief Network (DBN): forming a deep confidence network (DBN) based on a plurality of limited Boltzmann machines (RBMs), wherein the output trained by the limited Boltzmann machine (RBM) of each layer is used as the input of the limited Boltzmann machine (RBM) of the next layer to form the deep confidence network (DBN), and the whole deep confidence network (DBN) is used as a data feature extraction layer for extracting the data features of the historical clinic amount;

s2, establishing a logistic regression layer: a logistic regression layer is added on the upper layer of a Deep Belief Network (DBN) to serve as a prediction layer, and the clinic volume data is subjected to supervised prediction by combining the extracted data characteristics of the historical clinic volume, so that the prediction precision is improved;

s3, training the network: training of the neural network is performed using a greedy layer-by-layer algorithm.

Preferably, the Deep Belief Network (DBN) is constructed in step S1 as follows:

s101, the lowest layer of the depth confidence network is a Restricted Boltzmann Machine (RBM), and the Restricted Boltzmann Machine (RBM) consists of a visible layer (data input layer) and a hidden layer (feature extraction layer);

s102, setting the number of neurons in a visible layer to be 6 (the number can be adjusted according to actual conditions), setting the number of neurons in a hidden layer to be 3, and setting the number of layers in the hidden layer to be 3;

s103, taking the output of a Restricted Boltzmann Machine (RBM) formed by the first visible layer and the first hidden layer as the input of the next hidden layer, thereby obtaining the output of the next hidden layer, and sequentially iterating to construct a finished depth confidence network.

Preferably, the logistic regression layer in step S2 is a regression prediction layer superimposed on the deep confidence network, and supervised prediction is performed on the hospital outpatient quantity by using the data features extracted by the deep confidence network and the selected label data; the data features extracted by using the deep belief network are feature vectors converted from original input data through a neural network; the selection of tag data is related to a time interval; the method comprises the following specific steps:

s201, initializing W, b, wherein W is initialized to a number close to 0, and b is initialized to 0; wherein W represents a weight matrix;

s202, calculating the predicted output result

S20201, calculating data characteristics Z, wherein the formula is as follows:

Z＝W^TX+b；

wherein X represents the number of samples and W represents a weight matrix; b is a constant;

s20202, calculating a result matrix A of prediction, wherein a formula is as follows:

wherein Y represents an actual value; x, A, Z are each vectorized matrices of samples;

s203, calculating a loss function according to a formula, wherein the formula is as follows:

s204, calculating a gradient, namely, performing derivation on the loss function in the step S203;

s205, updating W, b, and repeating steps S202 to S205 within the number of iterations until the derivative results in the minimized cost function J (w, b).

Preferably, the training network in step S3 is specifically as follows:

s301, training a network layer by using label-free data;

s302, after the pre-training is completed, adjusting and optimizing the parameters from top to bottom by using a BP algorithm.

Preferably, in the step S301, a contrast divergence algorithm (CD-k) is used to train the network in a layer-by-layer training network using unlabeled data, specifically as follows:

s30101, sample data x is assigned to the visible layer according to a formula P (h)_j|v)＝σ(b_j+Σ_iW_i,jx_i) Calculating the probability P (h) of each neuron in the hidden layer being activated₁|v₁) (ii) a Wherein σ represents a sigmoid activation function;

s30102, extracting a sample h from the calculated probability distribution by adopting Gibbs sampling₁～P(h₁|v₁)；

S30103, use₁Reconstructing the visible layer, i.e. back-pushing the visible layer by the hidden layer, according to the formula P (v)_i|h)＝σ(c_i+Σ_jW_i,jh_j) Calculating the probability P (v) that each neuron in the visible layer is activated₂|h₁) (ii) a Wherein, sigma also represents sigmoid activation function;

s30104, similarly, a Gibbs sample is taken from the probability distribution obtained by calculation to extract a sample v₂～P(v₂|h₁)；

S30105, passing through v₂Calculating the activated probability of each neuron in the hidden layer again to obtain probability distribution P (h)₂|v₂)；

S30106, update the weight, W ← W + λ (P (h)₁|v₁)v₁-P(h₂|v₂)v₂)，b←b+λ(v₁-v₂)，c←c+λ(h₁-h₂)。

Preferably, the daily out-patient volume prediction is specifically as follows:

(1) the input data is the outpatient service volume of each day in a certain selected time period, and the features among the outpatient service volume data are extracted through the constructed deep belief network, namely the input original data are converted into a group of feature vectors;

(2) after the characteristic vector is obtained, predicting the clinic diagnosis amount of the next day by utilizing a logistic regression layer and combining the characteristic vector, thereby obtaining a predicted value;

(3) comparing the label data (namely the actual value of the clinic volume data in the future day) with the predicted value to obtain the difference between the label data and the predicted value, finely adjusting the parameters through back propagation, and obtaining the optimal group of parameters after limited iteration to realize the prediction of the clinic volume in a certain period of time in the future.

An outpatient quantity prediction system based on a deep belief network, the system comprising,

the depth confidence network (DBN) construction unit is used for forming a depth confidence network (DBN) based on a plurality of limited Boltzmann machines (RBMs), the output trained by the limited Boltzmann machines (RBMs) of each layer is used as the input of the limited Boltzmann machines (RBMs) of the next layer to form the depth confidence network (DBN), and the whole depth confidence network (DBN) is used as a data feature extraction layer for extracting the data features of the historical clinic quantity;

establishing a logistic regression layer unit for adding a logistic regression layer on the upper layer of a Deep Belief Network (DBN) as a prediction layer, and performing supervised prediction on the clinic volume data by combining the extracted data characteristics of the historical clinic volume to improve the prediction precision;

and the network training unit is used for training the neural network by using a greedy layer-by-layer algorithm.

Preferably, the lowest layer of the deep confidence network (DBN) is a Restricted Boltzmann Machine (RBM) which is composed of a visible layer (data input layer) and a hidden layer (feature extraction layer); the number of the neurons in the visible layer is set to be 6 (can be adjusted according to actual conditions), the number of the neurons in the hidden layer is set to be 3, and the number of the layers in the hidden layer is set to be 3.

Preferably, the logistic regression layer is a regression prediction layer superposed on the deep confidence network, and the supervised prediction is carried out on the hospital clinic volume by using the data features extracted by the deep confidence network and the selected label data; the data features extracted by using the deep belief network are feature vectors converted from original input data through a neural network; the selection of tag data is related to the time interval.

Preferably, the network training unit comprises,

the data layer-by-layer training network module is used for training a network layer by using unlabeled data;

and the tuning module is used for tuning the parameters from top to bottom by using a BP algorithm after the pre-training is finished.

The data layer-by-layer training network module trains the network by using a contrast divergence algorithm (CD-k), and the specific process is as follows:

①, sample data x is assigned to the visible layer according to the formula P (h)_j|v)＝σ(b_j+Σ_iW_i,jx_i) Calculating the probability P (h) of each neuron in the hidden layer being activated₁|v₁) (ii) a Wherein σ represents a sigmoid activation function;

②, extracting a sample h from the calculated probability distribution by Gibbs sampling₁～P(h₁|v₁)；

③, use h₁Reconstructing the visible layer, i.e. back-pushing the visible layer by the hidden layer, according to the formula P (v)_i|h)＝σ(c_i+Σ_jW_i,jh_j) Calculating the probability P (v) that each neuron in the visible layer is activated₂|h₁) (ii) a Wherein, sigma also represents sigmoid activation function;

④, similarly, a sample v is extracted from the computed probability distribution by Gibbs sampling₂～P(v₂|h₁)；

⑤, passing v₂Calculating the activated probability of each neuron in the hidden layer again to obtain probability distribution P (h)₂|v₂)；

⑥, update weight, W ← W + lambda (P (h)₁|v₁)v₁-P(h₂|v₂)v₂)，b←b+λ(v₁-v₂)，c←c+λ(h₁-h₂)。

The outpatient quantity prediction method and the outpatient quantity prediction system based on the deep belief network have the following advantages that:

the method has the advantages that the characteristics in the hospital historical clinic amount data are automatically extracted based on the neural network, the logistic regression layer is additionally arranged on the top layer, the clinic amount is supervised and predicted, the hospital clinic amount can be more effectively and accurately predicted in a certain time period in the future, the characteristics contained in the data are manually analyzed, a great deal of energy consumed by extracting the characteristics is avoided, and the universality of the use in various hospitals is improved;

the invention is based on a RBM, combines a plurality of hidden layers and constructs a deep confidence network, namely a feature extraction layer, which can effectively simulate the nonlinear relation among hospital outpatient data and realize the feature extraction of the data;

thirdly, a logistic regression layer is added on the top layer of the deep confidence network, so that supervised outpatient quantity prediction can be performed by utilizing label data, and the linear relation among hospital outpatient quantity data is effectively used for supervised regression prediction;

and after the pre-training of the deep network, relevant parameters of the whole network are optimized, then the BP algorithm is used for fine adjustment, and only one local search needs to be carried out on the known parameters, so that the training efficiency of the neural network is improved.

Drawings

The invention is further described below with reference to the accompanying drawings.

FIG. 1 is a flow chart of an outpatient quantity prediction method based on a deep belief network;

FIG. 2 is a schematic diagram of a basic structure of a deep belief network;

FIG. 3 is a basic block diagram of a Restricted Boltzmann Machine (RBM)

Detailed Description

The method and system for predicting out-patient quantity based on deep belief network of the present invention will be described in detail with reference to the drawings and the detailed description.

Example 1:

as shown in fig. 1, the outpatient quantity prediction method based on the deep belief network of the present invention is based on a Restricted Boltzmann Machine (RBM) and a manner of establishing a logistic regression layer, and predicts daily outpatient quantity, weekly outpatient quantity and monthly outpatient quantity by using historical outpatient data of a hospital, so as to realize more accurate and effective prediction of outpatient quantity of the hospital at the current time; the method comprises the following specific steps:

s1, constructing a Deep Belief Network (DBN): forming a deep confidence network (DBN) based on a plurality of limited Boltzmann machines (RBMs), wherein the output trained by the limited Boltzmann machine (RBM) of each layer is used as the input of the limited Boltzmann machine (RBM) of the next layer to form the deep confidence network (DBN), and the whole deep confidence network (DBN) is used as a data feature extraction layer for extracting the data features of the historical clinic amount; the method comprises the following specific steps:

S2, establishing a logistic regression layer: a logistic regression layer is added on the upper layer of a Deep Belief Network (DBN) to serve as a prediction layer, and the clinic volume data is subjected to supervised prediction by combining the extracted data characteristics of the historical clinic volume, so that the prediction precision is improved; the logistic regression layer is a regression prediction layer superposed on the deep confidence network, and supervised prediction is carried out on the hospital outpatient quantity by utilizing the data characteristics extracted by the deep confidence network and the selected label data; the data features extracted by using the deep belief network are feature vectors converted from original input data through a neural network; the selection of tag data is related to a time interval; for example, if the selected time interval is 6, the number of neurons to be output is finally predicted to be 1 by using the outpatient volume of the first 6 days (weeks, months) as an input and the outpatient volume of the 7 th day (weeks, months) as label data.

The method comprises the following specific steps:

s202, calculating the predicted output result

S20201, calculating data characteristics Z, wherein the formula is as follows:

Z＝W^TX+b；

S3, training the network: training a neural network by using a greedy layer-by-layer algorithm; the method comprises the following specific steps:

s301, training a network layer by using label-free data; the high-level feature representation y generated from the feature representation x from bottom to top and the feature representation x' generated from the high-level feature representation y from top to bottom are consistent as much as possible, and for a sample data x, a contrast divergence algorithm (CD-k) is adopted to train the network, which is specifically as follows:

S302, after the pre-training is completed, adjusting and optimizing the parameters from top to bottom by using a BP algorithm (back propagation algorithm).

Because a plurality of RBMs are connected in series, a DBN is formed, wherein the hidden layer of the previous RBM is used as the display layer of the next RBM, and the output of the previous RBM is the input of the next RBM. In the training process, the RBMs of the previous layer are required to be fully trained, and then the RBMs of the current layer can be trained until the final layer, namely the essence of the greedy layer-by-layer training algorithm.

The specific prediction of the daily outpatient quantity is as follows:

Example 2:

the invention relates to a clinic quantity forecasting system based on a deep belief network, which comprises,

the depth confidence network (DBN) construction unit is used for forming a depth confidence network (DBN) based on a plurality of limited Boltzmann machines (RBMs), the output trained by the limited Boltzmann machines (RBMs) of each layer is used as the input of the limited Boltzmann machines (RBMs) of the next layer to form the depth confidence network (DBN), and the whole depth confidence network (DBN) is used as a data feature extraction layer for extracting the data features of the historical clinic quantity; the bottom layer of the deep confidence network (DBN) is a Restricted Boltzmann Machine (RBM), and the Restricted Boltzmann Machine (RBM) consists of a visible layer (data input layer) and a hidden layer (feature extraction layer); the number of the neurons in the visible layer is set to be 6 (can be adjusted according to actual conditions), the number of the neurons in the hidden layer is set to be 3, and the number of the layers in the hidden layer is set to be 3.

Establishing a logistic regression layer unit for adding a logistic regression layer on the upper layer of a Deep Belief Network (DBN) as a prediction layer, and performing supervised prediction on the clinic volume data by combining the extracted data characteristics of the historical clinic volume to improve the prediction precision; the logistic regression layer is a regression prediction layer superposed on the deep confidence network, and supervised prediction is carried out on the hospital clinic quantity by using the data characteristics extracted by the deep confidence network and the selected label data; the data features extracted by using the deep belief network are feature vectors converted from original input data through a neural network; the selection of tag data is related to the time interval.

The network training unit is used for training the neural network by using a greedy layer-by-layer algorithm; the network training unit comprises a network training unit,

Wherein, the high-level feature representation y generated from the feature representation x from bottom to top and the feature representation x' generated from the high-level feature representation y from top to bottom are consistent as much as possible, and for a sample data x, the data layer-by-layer training network module trains the network by using a contrast divergence algorithm (CD-k), and the specific process is as follows:

Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims

1. The out-patient quantity prediction method based on the deep belief network is characterized in that the method is based on a limited Boltzmann machine and a mode of establishing a logistic regression layer, and the historical out-patient data of a hospital is utilized to predict the daily out-patient quantity, the weekly out-patient quantity and the monthly out-patient quantity, so that the out-patient quantity of the hospital at the current time can be more accurately and effectively predicted; the method comprises the following specific steps:

s1, constructing a deep confidence network: forming a depth confidence network based on a plurality of limited Boltzmann machines, wherein the output trained by the limited Boltzmann machine of each layer is used as the input of the limited Boltzmann machine of the next layer to form the depth confidence network, and the whole depth confidence network is used as a data feature extraction layer for extracting the data features of the historical clinic amount;

s2, establishing a logistic regression layer: a logistic regression layer is added on the upper layer of the deep confidence network to serve as a prediction layer, and the clinic volume data is subjected to supervised prediction by combining the extracted data characteristics of the historical clinic volume, so that the prediction precision is improved;

2. The out-patient prediction method based on the deep belief network as claimed in claim 1, wherein the deep belief network is constructed in the step S1 as follows:

s101, a limited Boltzmann machine is arranged at the bottom layer of the depth confidence network, and consists of a visible layer and a hidden layer;

s102, setting the number of neurons in a visible layer to be 6, setting the number of neurons in a hidden layer to be 3, and setting the number of layers in the hidden layer to be 3;

s103, taking the output of the restricted Boltzmann machine consisting of the first visible layer and the first hidden layer as the input of the next hidden layer, so as to obtain the output of the next hidden layer, and sequentially iterating to construct a finished depth confidence network.

3. The method for outpatient quantity prediction based on deep belief network as claimed in claim 1, wherein the logistic regression layer in step S2 is a regression prediction layer superimposed on the deep belief network, and the hospital outpatient quantity is supervised predicted by using the data features extracted by the deep belief network and the selected tag data; the data features extracted by using the deep belief network are feature vectors converted from original input data through a neural network; the selection of tag data is related to a time interval; the method comprises the following specific steps:

s202, calculating the predicted output result

S20201, calculating data characteristics Z, wherein the formula is as follows:

Z＝W^TX+b；

4. The out-patient prediction method based on deep belief network as claimed in claim 1, wherein the training network in step S3 is specifically as follows:

s301, training a network layer by using label-free data;

5. The outpatient quantity prediction method based on the deep belief network as claimed in claim 4, wherein in the step S301, the network is trained by using a contrast divergence algorithm in a layer-by-layer training network using unlabeled data, specifically as follows:

s30101, sample data x is assigned to the visible layer according to a formula P (h)_j|v)＝σ(b_j+Σ_iW_i,jx_i) Calculating the probability P (h) of each neuron in the hidden layer being activated₁|v₁) (ii) a Wherein σ represents a sigmoid activation function；

S30103, use₁Reconstructing the visible layer, i.e. back-pushing the visible layer by the hidden layer, according to the formula P (v)_i|h)＝σ(c_i+Σ_jW_i,jh_j) Calculating the probability P (v) that each neuron in the visible layer is activated₂|h₁)；

Wherein, sigma also represents sigmoid activation function;

6. The out-patient volume prediction method based on the deep belief network as claimed in claim 1, wherein the daily out-patient volume prediction is as follows:

(3) and comparing the label data with the predicted value to obtain the difference between the label data and the predicted value, finely adjusting the parameters through back propagation, and obtaining the optimal group of parameters after limited iteration to realize the prediction of the clinic amount in a certain period of time in the future.

7. An outpatient quantity prediction system based on a deep belief network, the system comprising,

the depth confidence network construction unit is used for forming a depth confidence network based on the plurality of limited Boltzmann machines, the output trained by the limited Boltzmann machine of each layer is used as the input of the limited Boltzmann machine of the next layer to form the depth confidence network, and the whole depth confidence network is used as a data feature extraction layer for extracting the data features of the historical clinic amount;

a logistic regression layer establishing unit is used for adding a logistic regression layer on the upper layer of the deep confidence network to serve as a prediction layer, and carrying out supervised prediction on the clinic diagnosis amount data by combining the extracted data characteristics of the historical clinic diagnosis amount, so that the prediction precision is improved;

8. The depth-confidence-network-based outpatient quantity prediction system of claim 7, wherein the bottom layer of the depth confidence network is a constrained Boltzmann machine, which consists of a visible layer and a hidden layer; the number of the neurons in the visible layer is set to be 6, the number of the neurons in the hidden layer is set to be 3, and the number of the layers in the hidden layer is set to be 3.

9. The depth-confidence-network-based outpatient prediction system of claim 7, wherein the logistic regression layer is a regression prediction layer superimposed on the depth confidence network, and the hospital outpatient is supervised predicted by using the data features extracted by the depth confidence network and the selected label data; the data features extracted by using the deep belief network are feature vectors converted from original input data through a neural network; the selection of tag data is related to the time interval.

10. The deep belief network-based outpatient prediction system of claim 7 or 8 or 9, wherein the network training unit comprises,

The data layer-by-layer training network module trains the network by using a contrast divergence algorithm, and the specific process is as follows:

③, use h₁Reconstructing the visible layer, i.e. back-pushing the visible layer by the hidden layer, according to the formula P (v)_i|h)＝σ(c_i+Σ_jW_i,jh_j) Calculating the probability P (v) that each neuron in the visible layer is activated₂|h₁)；

Wherein, sigma also represents sigmoid activation function;