CN113571200A - Prediction method of infectious disease spread based on graph filter-vector autoregressive model - Google Patents
Prediction method of infectious disease spread based on graph filter-vector autoregressive model Download PDFInfo
- Publication number
- CN113571200A CN113571200A CN202110868810.9A CN202110868810A CN113571200A CN 113571200 A CN113571200 A CN 113571200A CN 202110868810 A CN202110868810 A CN 202110868810A CN 113571200 A CN113571200 A CN 113571200A
- Authority
- CN
- China
- Prior art keywords
- graph
- formula
- model
- time
- filter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 208000035473 Communicable disease Diseases 0.000 title claims abstract description 33
- 208000015181 infectious disease Diseases 0.000 title claims abstract description 29
- 238000000034 method Methods 0.000 title claims abstract description 26
- 238000013461 design Methods 0.000 claims abstract description 5
- 239000011159 matrix material Substances 0.000 claims description 22
- 238000005457 optimization Methods 0.000 claims description 20
- 230000002265 prevention Effects 0.000 abstract description 4
- 230000005541 medical transmission Effects 0.000 description 5
- 208000025721 COVID-19 Diseases 0.000 description 3
- 241000711573 Coronaviridae Species 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 3
- 206010035664 Pneumonia Diseases 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 208000001490 Dengue Diseases 0.000 description 2
- 206010012310 Dengue fever Diseases 0.000 description 2
- 208000025729 dengue disease Diseases 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 206010003757 Atypical pneumonia Diseases 0.000 description 1
- 206010061192 Haemorrhagic fever Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/80—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Optimization (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Pure & Applied Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- Public Health (AREA)
- Software Systems (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Operations Research (AREA)
- Algebra (AREA)
- Medical Informatics (AREA)
- Development Economics (AREA)
- Computing Systems (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Quality & Reliability (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Geometry (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Computer Hardware Design (AREA)
- Pathology (AREA)
- Epidemiology (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明公开了基于图滤波器‑向量自回归模型的传染病传播预测方法,其特征在于,包括如下步骤:1)构建图模型;2)预测模型设计;3)图滤波器参数优化设计;4)利用优化后的VAR模型对时变图信号进行预测。这种方法实用性好,能提高传染病预测的准确率,进而为预防提高决策依据。
The invention discloses a method for predicting the spread of infectious diseases based on a graph filter-vector autoregressive model, which is characterized by comprising the following steps: 1) constructing a graph model; 2) designing a prediction model; 3) optimizing the design of graph filter parameters; 4) ) using the optimized VAR model to predict the time-varying graph signal. This method has good practicability and can improve the accuracy of infectious disease prediction, thereby improving the decision-making basis for prevention.
Description
Technical Field
The invention relates to the field of infectious disease prediction, in particular to an infectious disease propagation prediction method based on a graph filter-vector autoregressive model.
Background
Infectious diseases can be rapidly spread in a large range, so that great influence and harm are brought to human beings, and the infectious diseases further become one of the main factors endangering human health. Infectious atypical pneumonia in 2003, influenza a H1N1 in 2009, and novel coronavirus pneumonia outbreaks in 2019 are all examples of very typical infectious disease transmission. The spreading process is predicted according to the monitored infectious disease data and early warning is carried out in time, so that countermeasures are taken, and the harm of infectious diseases is greatly reduced. Infectious diseases are global public health problems threatening human society, so that the research related to prediction of transmission of infectious diseases is of great significance to prevention and control of infectious diseases.
At present, the models and methods for predicting infectious diseases are widely applied mainly to a propagation dynamics model, a markov model, a time series method, a multiple regression model, and the like. The classical susceptibility-infection (SI) model proposed by kermanck and Mckendrick can reasonably predict the transmission of infectious diseases from person to person; gomez et al propose a discrete markov chain method for contagious disease transmission based on contact, solving the problem of the number of people in contact during the transmission process that depends on one node; siregar et al have studied the onset of hemorrhagic fever of dengue fever under the influence of climate by adopting a time series regression method, provide the basis for prevention and control of dengue fever; vector Autoregressive (VAR) models, which are a type of multivariate regression model, are also often used in time series prediction problems. The infectious disease transmission process is mainly affected by human-to-human interaction in time and space, and the crowd flow between regions is a very critical influencing factor. However, in the conventional prediction method, much attention is paid to the time correlation between data as a time series, and the intrinsic topology of the data, that is, the correlation between regions is ignored. Therefore, how to better depict the relationship between data is also an important issue, and if infectious disease data is modeled as a graph signal and a region is modeled as a node on the graph, the time correlation between time series data and the correlation between regions can be well described.
Nowadays, Graph Signal Processing (GSP) is receiving attention from researchers as an emerging field. GSP models data on irregular discrete domains as graphs, and uses edges on the graph models to depict the relationship between data. In many practical applications, for example, data of a wireless sensor network may change with time, have a time-varying characteristic, and can be modeled as a time-varying graph signal. Tay et al denoise the time-varying image signal using an efficient image filter; isufi et al predict the time-varying graph signals by using an approximate time-vertex stationarity hypothesis and an extended VAR model, and solve the dimension problem existing in multivariate time sequence evolution prediction, but the method only considers the correlation between the prediction time and the previous time signals in the parameter estimation problem, and does not consider the correlation between each time before the prediction time and the previous time signals.
Disclosure of Invention
The invention aims to provide an infectious disease propagation prediction method based on a graph filter-vector autoregressive model, aiming at the defects of the prior art. The method has good practicability, can improve the accuracy of infectious disease prediction, and further improves decision basis for prevention.
The technical scheme for realizing the purpose of the invention is as follows:
the infectious disease propagation prediction method based on the graph filter-vector autoregressive model comprises the following steps:
1) constructing a graph model: infectious disease transmission time series dataEach row in the system represents one area, N areas are in total, each column represents the number of cases at a certain moment in all areas, and T is in totalpThe data of each moment takes the region as the nodes on the graph, and the nodes are connected by edges according to the characteristics of different data to construct the topological structure of the graph and transmit the infectious disease dataModelling as a time-varying graph signalNamely, the data at each moment is a graph signal;
2) designing a prediction model: when a Graph Filter-Vector Autoregressive (GF-VAR) model predicts infection propagation, a time-varying Graph signal X is predicted by the VAR model, then coefficients of the VAR model are designed by the aid of concepts of a Graph Laplace matrix and a Graph Filter, and the signal value at the time t can be representedShown as front TpFunction of the signal at 1 instant, TpAnd 2, modeling the time-varying graph signal by adopting a VAR model, wherein the expression is shown as formula (1):
wherein epsilontThe error vector is a random vector with mean value of 0 and positive definite covariance matrix, and the graph filterAs a coefficient matrix, aq,pIs the coefficient of the Laplace matrix of order p, and Q is the highest order of the Laplace matrix;
3) and (3) optimally designing parameters of the graph filter: analyzing the prediction model according to step 2), considering the correlation between each moment of the time-varying graph signal X and the previous moment, and using the parameter a of the GF-VAR model graph filterq,pIs expressed as an optimization problem as shown in equation (2):
equation (2) is summarized as an unconstrained optimization problem as shown in equation (3):
the inputs in the problem shown in equation (3) are the signal x and the Laplace matrix LGCan be solved to Q × (T)p-1) parameters aq,p;
4) Predicting a time-varying graph signal by using the optimized VAR model, and quantizing by using a normalized minimum mean square error root (rNMSE) formula to estimate the performance of the model, wherein the error formula is shown as a formula (4):
The unconstrained optimization problem described in step 3) as shown in equation (3) can be transformed into a vector of coefficients for a graph filterThe least squares problem of (2) is shown in equation (5):
the parameters involved in equation (6) are shown in equation (7):
the theoretical solution to the least squares problem in equation (5) is as shown in equation (8):
thereby obtaining Qx (T)p-1) map filter parameters aq,pAnd finally obtaining the graph filter.
The unconstrained optimization problem shown in the formula (3) in the step 3) is solved by adopting a convex optimization toolkit cvx, and then the convex optimization toolkit cvx is solved to obtain the graph filter.
Compared with the existing infectious disease prediction model and method, the technical scheme is that the graph filter is used as a coefficient matrix of the VAR model, the graph topology is embedded into the model, and the optimization problem of the parameters of the graph filter is proposed based on the correlation between each moment before the prediction moment and the previous moment signals.
Drawings
FIG. 1 is a graph comparing 6-step prediction errors of susceptible-infected SI data in examples;
FIG. 2 is a comparison of 6-step prediction error for susceptibility-infection but not disease-infection-recovery-susceptibility SEIRS data in the examples;
FIG. 3 is a graph comparing 10-step prediction errors of the data of the German novel coronavirus pneumonia COVID-19 in the example.
Detailed Description
The invention will be further elucidated with reference to the drawings and examples, without however being limited thereto.
Example (b):
the infectious disease propagation prediction method based on the graph filter-vector autoregressive model comprises the following steps:
1) constructing a graph model: infectious disease transmission time series dataEach row in the system represents one area, N areas are in total, each column represents the number of cases at a certain moment in all areas, and T is in totalpThe data of each moment takes the region as the nodes on the graph, and the nodes are connected by edges according to the characteristics of different data to construct the topological structure of the graph and transmit the infectious disease dataModelling as a time-varying graph signalNamely, the data at each moment is a graph signal;
2) designing a prediction model: when the GF-VAR model predicts the spread of infectious diseases, the VAR model is firstly adopted to predict a time-varying graph signal X, then the concept of a graph Laplace matrix and a graph filter is utilized to design the coefficient of the VAR model, and the signal value at the moment T can be represented as the previous TpFunction of the signal at 1 instant, TpAnd 2, modeling the time-varying graph signal by adopting a VAR model, wherein the expression is shown as formula (1):
wherein epsilontThe error vector is a random vector with mean value of 0 and positive definite covariance matrix, and the graph filterAs a coefficient matrix, the time correlation between time series data and the correlation between regions can be well described, graph topology is embedded into a model, aq,pThe coefficient of a p-order Laplace matrix is adopted, Q is the highest order of the Laplace matrix, the formula (1) is used as a one-step prediction result expression, and n-step prediction results can be obtained by performing iterative computation on the one-step prediction result expression for n times;
3) and (3) optimally designing parameters of the graph filter: analyzing the prediction model according to step 2), considering the correlation between each moment of the time-varying graph signal X and the previous moment, and using the parameter a of the GF-VAR model graph filterq,pIs expressed as an optimization problem as shown in equation (2):
equation (2) is summarized as an unconstrained optimization problem as shown in equation (3):
the inputs in the problem shown in equation (3) are the signal x and the Laplace matrix LGCan be solved to Q × (T)p-1) parameters aq,p;
4) Predicting a time-varying graph signal by using the optimized VAR model, and quantizing by using a normalized minimum mean square error root (rNMSE) formula to estimate the performance of the model, wherein the error formula is shown as a formula (4):
The unconstrained optimization problem described in step 3) as shown in equation (3) can be transformed into a vector of coefficients for a graph filterThe least squares problem of (2) is shown in equation (5):
the parameters involved in equation (6) are shown in equation (7):
the theoretical solution to the least squares problem in equation (5) is as shown in equation (8):
thereby obtaining Qx (T)p-1) map filter parameters aq,pAnd finally obtaining the graph filter.
The unconstrained optimization problem shown in the formula (3) in the step 3) is solved by adopting a convex optimization toolkit cvx, and then the convex optimization toolkit cvx is solved to obtain the graph filter.
Simulation experiment: simulations the performance of the proposed model was tested using mock-generated disease data (data generated by the susceptibility-infection (SI) model, data generated by the susceptibility-infection but not the pathogenesis-infection-recovery-susceptibility (SEIRS) model) and german new coronavirus pneumonia (COVID-19) data for simulated disease data based on flight network propagation at 487 days at 125 international airports: constructing graphs with the node number N being 125 and constructing a weight matrix W based on flight information by using SI data and SEIRS dataGAnd using the data from which the mean value within the sample has been subtracted as a signal, constructing a 6-neighbor graph in which the number of nodes is N100 for the german covi-19 data including 100 infected persons in 507 days of the region, and setting a parameter TpConstructing a weighted adjacency matrix W with 3 and Q with 5GNormalized Laplace matrix LnorSelecting successive TpData of dayxt:As initial data, then, the optimization problem equation (3) is solved by the CVX toolkit to obtain Qx (T)p-1) parameters aq,pAnd finally, obtaining the parameter aq,pAnd dataSubstituting the prediction model to obtain the predicted valueAnd will beAnd one-step prediction of resultsAs new continuous TpData of day xtThen, the next time is predicted.
Comparing the method with a G-VARMA model and a GP-VAR model, and quantifying by a normalized minimum mean square error root rNMSE formula to evaluate the performance of the model, wherein the error is shown as a formula (4):
In the simulation experiment process, the method of the embodiment is compared with a G-VARMA model and a GP-VAR model proposed by Isufi and the like, SI data and SEIRS data are subjected to 6-step prediction, German COVID-19 data are subjected to 10-step prediction, the prediction result of each data is visualized to obtain the prediction effect shown in figures 1, 2 and 3, different data are observed, and compared with the method, the GF-VAR model of the method of the embodiment has better stability, and the good prediction effect can be kept on different types of data.
Claims (3)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110868810.9A CN113571200B (en) | 2021-07-30 | 2021-07-30 | Infectious disease transmission prediction method based on graph filter-vector autoregressive model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110868810.9A CN113571200B (en) | 2021-07-30 | 2021-07-30 | Infectious disease transmission prediction method based on graph filter-vector autoregressive model |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113571200A true CN113571200A (en) | 2021-10-29 |
CN113571200B CN113571200B (en) | 2023-09-19 |
Family
ID=78169306
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110868810.9A Active CN113571200B (en) | 2021-07-30 | 2021-07-30 | Infectious disease transmission prediction method based on graph filter-vector autoregressive model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113571200B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114978928A (en) * | 2022-04-24 | 2022-08-30 | 重庆邮电大学 | A dynamic message passing method for social contagion in coupled networks with time-varying properties |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112700885A (en) * | 2021-01-13 | 2021-04-23 | 大连海事大学 | Method for identifying new coronavirus propagation model parameters based on Kalman filtering |
CN112865748A (en) * | 2021-01-13 | 2021-05-28 | 西南大学 | Method for constructing online distributed multitask graph filter based on recursive least squares |
CN113035368A (en) * | 2021-04-13 | 2021-06-25 | 桂林电子科技大学 | Disease propagation prediction method based on differential migration diagram neural network |
-
2021
- 2021-07-30 CN CN202110868810.9A patent/CN113571200B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112700885A (en) * | 2021-01-13 | 2021-04-23 | 大连海事大学 | Method for identifying new coronavirus propagation model parameters based on Kalman filtering |
CN112865748A (en) * | 2021-01-13 | 2021-05-28 | 西南大学 | Method for constructing online distributed multitask graph filter based on recursive least squares |
CN113035368A (en) * | 2021-04-13 | 2021-06-25 | 桂林电子科技大学 | Disease propagation prediction method based on differential migration diagram neural network |
Non-Patent Citations (1)
Title |
---|
朱莞琪;李娟生;孟蕾;杨玫;刘新凤;牛丽霞;于德山;蒋小娟;王琪;张蕾洁;: "基于向量自回归模型分析呼吸道病原感染与气象因素的动态关系", 中国卫生统计, no. 02 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114978928A (en) * | 2022-04-24 | 2022-08-30 | 重庆邮电大学 | A dynamic message passing method for social contagion in coupled networks with time-varying properties |
Also Published As
Publication number | Publication date |
---|---|
CN113571200B (en) | 2023-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111524611B (en) | Method, device and equipment for constructing infectious disease trend prediction model | |
de Freitas | Bayesian methods for neural networks | |
CN109977098A (en) | Non-stationary time-series data predication method, system, storage medium and computer equipment | |
JP2020525872A (en) | Influenza prediction model generation method, device, and computer-readable storage medium | |
CN106777935A (en) | A kind of disease dynamic prediction method based on network structure | |
CN109040027A (en) | The active predicting method of network vulnerability node based on gray model | |
CN112700885A (en) | Method for identifying new coronavirus propagation model parameters based on Kalman filtering | |
CN113486303A (en) | Long-time sequence prediction method based on modification model integration | |
CN113357138B (en) | Method, device and terminal equipment for predicting remaining service life of hydraulic pump | |
CN113571200A (en) | Prediction method of infectious disease spread based on graph filter-vector autoregressive model | |
CN116415482A (en) | Wing flow field analysis method based on graph neural network MeshGraphNet | |
Li et al. | Study on prediction model of HIV incidence based on GRU neural network optimized by MHPSO | |
CN114564487B (en) | Meteorological raster data update method combined with forecast and forecast | |
CN117077771B (en) | Causal association driven aviation traffic network sweep effect prediction method | |
CN114372401A (en) | A Hierarchical Stochastic Kriging Surrogate Model Construction Method | |
Nambiar et al. | Optimization of structure and system latency in evolvable block-based neural networks using genetic algorithm | |
CN117436318B (en) | Intelligent building management method and system based on Internet of things | |
JP7400819B2 (en) | Prediction device, prediction method, and prediction program | |
CN117408171B (en) | Hydrologic set forecasting method of Copula multi-model condition processor | |
US6807652B2 (en) | Method of robust semiconductor circuit products design using rational robust optimization | |
Vanbrackle et al. | A study of the average run length characteristics of the National Notifiable Diseases Surveillance System | |
CN117217359A (en) | Adaptive Fourier decomposition and multi-scale time convolution network day runoff prediction method | |
Kuryliak et al. | Efficient and realistic stochastic simulation of the dynamics of epidemic processes on complex networks | |
Liu et al. | Rolling Iterative Prediction for Correlated Multivariate Time Series | |
Ngoy et al. | Combining epidemic model and deep learning to study cyber attacks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |