CN113571200A - Infectious disease propagation prediction method based on graph filter-vector autoregressive model - Google Patents
Infectious disease propagation prediction method based on graph filter-vector autoregressive model Download PDFInfo
- Publication number
- CN113571200A CN113571200A CN202110868810.9A CN202110868810A CN113571200A CN 113571200 A CN113571200 A CN 113571200A CN 202110868810 A CN202110868810 A CN 202110868810A CN 113571200 A CN113571200 A CN 113571200A
- Authority
- CN
- China
- Prior art keywords
- graph
- model
- equation
- filter
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/80—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for detecting, monitoring or modelling epidemics or pandemics, e.g. flu
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Optimization (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Pure & Applied Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Business, Economics & Management (AREA)
- Health & Medical Sciences (AREA)
- Public Health (AREA)
- Software Systems (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Operations Research (AREA)
- Algebra (AREA)
- Medical Informatics (AREA)
- Development Economics (AREA)
- Computing Systems (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Quality & Reliability (AREA)
- Marketing (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Geometry (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Computer Hardware Design (AREA)
- Pathology (AREA)
- Epidemiology (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses an infectious disease propagation prediction method based on a graph filter-vector autoregressive model, which is characterized by comprising the following steps of: 1) constructing a graph model; 2) designing a prediction model; 3) optimally designing parameters of a graph filter; 4) and predicting the time-varying graph signal by using the optimized VAR model. The method has good practicability, can improve the accuracy of infectious disease prediction, and further improves decision basis for prevention.
Description
Technical Field
The invention relates to the field of infectious disease prediction, in particular to an infectious disease propagation prediction method based on a graph filter-vector autoregressive model.
Background
Infectious diseases can be rapidly spread in a large range, so that great influence and harm are brought to human beings, and the infectious diseases further become one of the main factors endangering human health. Infectious atypical pneumonia in 2003, influenza a H1N1 in 2009, and novel coronavirus pneumonia outbreaks in 2019 are all examples of very typical infectious disease transmission. The spreading process is predicted according to the monitored infectious disease data and early warning is carried out in time, so that countermeasures are taken, and the harm of infectious diseases is greatly reduced. Infectious diseases are global public health problems threatening human society, so that the research related to prediction of transmission of infectious diseases is of great significance to prevention and control of infectious diseases.
At present, the models and methods for predicting infectious diseases are widely applied mainly to a propagation dynamics model, a markov model, a time series method, a multiple regression model, and the like. The classical susceptibility-infection (SI) model proposed by kermanck and Mckendrick can reasonably predict the transmission of infectious diseases from person to person; gomez et al propose a discrete markov chain method for contagious disease transmission based on contact, solving the problem of the number of people in contact during the transmission process that depends on one node; siregar et al have studied the onset of hemorrhagic fever of dengue fever under the influence of climate by adopting a time series regression method, provide the basis for prevention and control of dengue fever; vector Autoregressive (VAR) models, which are a type of multivariate regression model, are also often used in time series prediction problems. The infectious disease transmission process is mainly affected by human-to-human interaction in time and space, and the crowd flow between regions is a very critical influencing factor. However, in the conventional prediction method, much attention is paid to the time correlation between data as a time series, and the intrinsic topology of the data, that is, the correlation between regions is ignored. Therefore, how to better depict the relationship between data is also an important issue, and if infectious disease data is modeled as a graph signal and a region is modeled as a node on the graph, the time correlation between time series data and the correlation between regions can be well described.
Nowadays, Graph Signal Processing (GSP) is receiving attention from researchers as an emerging field. GSP models data on irregular discrete domains as graphs, and uses edges on the graph models to depict the relationship between data. In many practical applications, for example, data of a wireless sensor network may change with time, have a time-varying characteristic, and can be modeled as a time-varying graph signal. Tay et al denoise the time-varying image signal using an efficient image filter; isufi et al predict the time-varying graph signals by using an approximate time-vertex stationarity hypothesis and an extended VAR model, and solve the dimension problem existing in multivariate time sequence evolution prediction, but the method only considers the correlation between the prediction time and the previous time signals in the parameter estimation problem, and does not consider the correlation between each time before the prediction time and the previous time signals.
Disclosure of Invention
The invention aims to provide an infectious disease propagation prediction method based on a graph filter-vector autoregressive model, aiming at the defects of the prior art. The method has good practicability, can improve the accuracy of infectious disease prediction, and further improves decision basis for prevention.
The technical scheme for realizing the purpose of the invention is as follows:
the infectious disease propagation prediction method based on the graph filter-vector autoregressive model comprises the following steps:
1) constructing a graph model: infectious disease transmission time series dataEach row in the system represents one area, N areas are in total, each column represents the number of cases at a certain moment in all areas, and T is in totalpThe data of each moment takes the region as the nodes on the graph, and the nodes are connected by edges according to the characteristics of different data to construct the topological structure of the graph and transmit the infectious disease dataModelling as a time-varying graph signalNamely, the data at each moment is a graph signal;
2) designing a prediction model: when a Graph Filter-Vector Autoregressive (GF-VAR) model predicts infection propagation, a time-varying Graph signal X is predicted by the VAR model, then coefficients of the VAR model are designed by the aid of concepts of a Graph Laplace matrix and a Graph Filter, and the signal value at the time t can be representedShown as front TpFunction of the signal at 1 instant, TpAnd 2, modeling the time-varying graph signal by adopting a VAR model, wherein the expression is shown as formula (1):
wherein epsilontThe error vector is a random vector with mean value of 0 and positive definite covariance matrix, and the graph filterAs a coefficient matrix, aq,pIs the coefficient of the Laplace matrix of order p, and Q is the highest order of the Laplace matrix;
3) and (3) optimally designing parameters of the graph filter: analyzing the prediction model according to step 2), considering the correlation between each moment of the time-varying graph signal X and the previous moment, and using the parameter a of the GF-VAR model graph filterq,pIs expressed as an optimization problem as shown in equation (2):
equation (2) is summarized as an unconstrained optimization problem as shown in equation (3):
the inputs in the problem shown in equation (3) are the signal x and the Laplace matrix LGCan be solved to Q × (T)p-1) parameters aq,p;
4) Predicting a time-varying graph signal by using the optimized VAR model, and quantizing by using a normalized minimum mean square error root (rNMSE) formula to estimate the performance of the model, wherein the error formula is shown as a formula (4):
The unconstrained optimization problem described in step 3) as shown in equation (3) can be transformed into a vector of coefficients for a graph filterThe least squares problem of (2) is shown in equation (5):
the parameters involved in equation (6) are shown in equation (7):
the theoretical solution to the least squares problem in equation (5) is as shown in equation (8):
thereby obtaining Qx (T)p-1) map filter parameters aq,pAnd finally obtaining the graph filter.
The unconstrained optimization problem shown in the formula (3) in the step 3) is solved by adopting a convex optimization toolkit cvx, and then the convex optimization toolkit cvx is solved to obtain the graph filter.
Compared with the existing infectious disease prediction model and method, the technical scheme is that the graph filter is used as a coefficient matrix of the VAR model, the graph topology is embedded into the model, and the optimization problem of the parameters of the graph filter is proposed based on the correlation between each moment before the prediction moment and the previous moment signals.
Drawings
FIG. 1 is a graph comparing 6-step prediction errors of susceptible-infected SI data in examples;
FIG. 2 is a comparison of 6-step prediction error for susceptibility-infection but not disease-infection-recovery-susceptibility SEIRS data in the examples;
FIG. 3 is a graph comparing 10-step prediction errors of the data of the German novel coronavirus pneumonia COVID-19 in the example.
Detailed Description
The invention will be further elucidated with reference to the drawings and examples, without however being limited thereto.
Example (b):
the infectious disease propagation prediction method based on the graph filter-vector autoregressive model comprises the following steps:
1) constructing a graph model: infectious disease transmission time series dataEach row in the system represents one area, N areas are in total, each column represents the number of cases at a certain moment in all areas, and T is in totalpThe data of each moment takes the region as the nodes on the graph, and the nodes are connected by edges according to the characteristics of different data to construct the topological structure of the graph and transmit the infectious disease dataModelling as a time-varying graph signalNamely, the data at each moment is a graph signal;
2) designing a prediction model: when the GF-VAR model predicts the spread of infectious diseases, the VAR model is firstly adopted to predict a time-varying graph signal X, then the concept of a graph Laplace matrix and a graph filter is utilized to design the coefficient of the VAR model, and the signal value at the moment T can be represented as the previous TpFunction of the signal at 1 instant, TpAnd 2, modeling the time-varying graph signal by adopting a VAR model, wherein the expression is shown as formula (1):
wherein epsilontThe error vector is a random vector with mean value of 0 and positive definite covariance matrix, and the graph filterAs a coefficient matrix, the time correlation between time series data and the correlation between regions can be well described, graph topology is embedded into a model, aq,pThe coefficient of a p-order Laplace matrix is adopted, Q is the highest order of the Laplace matrix, the formula (1) is used as a one-step prediction result expression, and n-step prediction results can be obtained by performing iterative computation on the one-step prediction result expression for n times;
3) and (3) optimally designing parameters of the graph filter: analyzing the prediction model according to step 2), considering the correlation between each moment of the time-varying graph signal X and the previous moment, and using the parameter a of the GF-VAR model graph filterq,pIs expressed as an optimization problem as shown in equation (2):
equation (2) is summarized as an unconstrained optimization problem as shown in equation (3):
the inputs in the problem shown in equation (3) are the signal x and the Laplace matrix LGCan be solved to Q × (T)p-1) parameters aq,p;
4) Predicting a time-varying graph signal by using the optimized VAR model, and quantizing by using a normalized minimum mean square error root (rNMSE) formula to estimate the performance of the model, wherein the error formula is shown as a formula (4):
The unconstrained optimization problem described in step 3) as shown in equation (3) can be transformed into a vector of coefficients for a graph filterThe least squares problem of (2) is shown in equation (5):
the parameters involved in equation (6) are shown in equation (7):
the theoretical solution to the least squares problem in equation (5) is as shown in equation (8):
thereby obtaining Qx (T)p-1) map filter parameters aq,pAnd finally obtaining the graph filter.
The unconstrained optimization problem shown in the formula (3) in the step 3) is solved by adopting a convex optimization toolkit cvx, and then the convex optimization toolkit cvx is solved to obtain the graph filter.
Simulation experiment: simulations the performance of the proposed model was tested using mock-generated disease data (data generated by the susceptibility-infection (SI) model, data generated by the susceptibility-infection but not the pathogenesis-infection-recovery-susceptibility (SEIRS) model) and german new coronavirus pneumonia (COVID-19) data for simulated disease data based on flight network propagation at 487 days at 125 international airports: constructing graphs with the node number N being 125 and constructing a weight matrix W based on flight information by using SI data and SEIRS dataGAnd using the data from which the mean value within the sample has been subtracted as a signal, constructing a 6-neighbor graph in which the number of nodes is N100 for the german covi-19 data including 100 infected persons in 507 days of the region, and setting a parameter TpConstructing a weighted adjacency matrix W with 3 and Q with 5GNormalized Laplace matrix LnorSelecting successive TpData of dayxt:As initial data, then, the optimization problem equation (3) is solved by the CVX toolkit to obtain Qx (T)p-1) parameters aq,pAnd finally, obtaining the parameter aq,pAnd dataSubstituting the prediction model to obtain the predicted valueAnd will beAnd one-step prediction of resultsAs new continuous TpData of day xtThen, the next time is predicted.
Comparing the method with a G-VARMA model and a GP-VAR model, and quantifying by a normalized minimum mean square error root rNMSE formula to evaluate the performance of the model, wherein the error is shown as a formula (4):
In the simulation experiment process, the method of the embodiment is compared with a G-VARMA model and a GP-VAR model proposed by Isufi and the like, SI data and SEIRS data are subjected to 6-step prediction, German COVID-19 data are subjected to 10-step prediction, the prediction result of each data is visualized to obtain the prediction effect shown in figures 1, 2 and 3, different data are observed, and compared with the method, the GF-VAR model of the method of the embodiment has better stability, and the good prediction effect can be kept on different types of data.
Claims (3)
1. The infectious disease propagation prediction method based on the graph filter-vector autoregressive model is characterized by comprising the following steps of:
1) constructing a graph model: infectious diseasesPropagation time series dataEach row in the system represents one area, N areas are in total, each column represents the number of cases at a certain moment in all areas, and T is in totalpThe data of each moment takes the region as the nodes on the graph, and the nodes are connected by edges according to the characteristics of different data to construct the topological structure of the graph and transmit the infectious disease dataModelling as a time-varying graph signalNamely, the data at each moment is a graph signal;
2) designing a prediction model: firstly, a VAR model is adopted to predict a time-varying graph signal X, then the coefficient of the VAR model is designed by utilizing the concepts of a graph Laplacian matrix and a graph filter, and the signal value at the time T is assumed to be represented as the previous TpFunction of the signal at 1 instant, TpAnd (2) modeling the time-varying graph signal by adopting a VAR model, wherein the expression is shown as a formula (1):
wherein epsilontThe error vector is a random vector with mean value of 0 and positive definite covariance matrix, and the graph filterAs a coefficient matrix, aq,pIs the coefficient of the Laplace matrix of order p, and Q is the highest order of the Laplace matrix;
3) and (3) optimally designing parameters of the graph filter: filtering parameter a of GF-VAR model diagramq,pIs expressed as an optimization problem as shown in equation (2):
equation (2) is summarized as an unconstrained optimization problem as shown in equation (3):
the inputs in the problem shown in equation (3) are the signal x and the Laplace matrix LGSolved to Q × (T)p-1) parameters aq,p;
4) Predicting a time-varying graph signal by using the optimized VAR model, quantizing by adopting a normalized minimum mean square error root (rNMSE) formula, and evaluating the performance of the model, wherein the error formula is shown as a formula (4):
2. The method of claim 1, wherein the unconstrained optimization problem of step 3) is transformed into a vector of coefficients for the graph filter as shown in equation (3)The least squares problem of (2) is shown in equation (5):
the parameters involved in equation (6) are shown in equation (7):
the theoretical solution to the least squares problem in equation (5) is as shown in equation (8):
thereby obtaining Qx (T)p-1) map filter parameters aq,pAnd finally obtaining the graph filter.
3. The infectious disease propagation prediction method based on graph filter-vector autoregressive model according to claim 1, wherein the unconstrained optimization problem in step 3) as shown in formula (3) is solved by using a convex optimization tool set cvx, and then the graph filter is obtained.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110868810.9A CN113571200B (en) | 2021-07-30 | 2021-07-30 | Infectious disease transmission prediction method based on graph filter-vector autoregressive model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110868810.9A CN113571200B (en) | 2021-07-30 | 2021-07-30 | Infectious disease transmission prediction method based on graph filter-vector autoregressive model |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113571200A true CN113571200A (en) | 2021-10-29 |
CN113571200B CN113571200B (en) | 2023-09-19 |
Family
ID=78169306
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110868810.9A Active CN113571200B (en) | 2021-07-30 | 2021-07-30 | Infectious disease transmission prediction method based on graph filter-vector autoregressive model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113571200B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114978928A (en) * | 2022-04-24 | 2022-08-30 | 重庆邮电大学 | Dynamic message transmission method for social infection in coupling network with time-varying characteristic |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112700885A (en) * | 2021-01-13 | 2021-04-23 | 大连海事大学 | Method for identifying new coronavirus propagation model parameters based on Kalman filtering |
CN112865748A (en) * | 2021-01-13 | 2021-05-28 | 西南大学 | Method for constructing online distributed multitask graph filter based on recursive least squares |
CN113035368A (en) * | 2021-04-13 | 2021-06-25 | 桂林电子科技大学 | Disease propagation prediction method based on differential migration diagram neural network |
-
2021
- 2021-07-30 CN CN202110868810.9A patent/CN113571200B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112700885A (en) * | 2021-01-13 | 2021-04-23 | 大连海事大学 | Method for identifying new coronavirus propagation model parameters based on Kalman filtering |
CN112865748A (en) * | 2021-01-13 | 2021-05-28 | 西南大学 | Method for constructing online distributed multitask graph filter based on recursive least squares |
CN113035368A (en) * | 2021-04-13 | 2021-06-25 | 桂林电子科技大学 | Disease propagation prediction method based on differential migration diagram neural network |
Non-Patent Citations (1)
Title |
---|
朱莞琪;李娟生;孟蕾;杨玫;刘新凤;牛丽霞;于德山;蒋小娟;王琪;张蕾洁;: "基于向量自回归模型分析呼吸道病原感染与气象因素的动态关系", 中国卫生统计, no. 02 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114978928A (en) * | 2022-04-24 | 2022-08-30 | 重庆邮电大学 | Dynamic message transmission method for social infection in coupling network with time-varying characteristic |
Also Published As
Publication number | Publication date |
---|---|
CN113571200B (en) | 2023-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111524611B (en) | Method, device and equipment for constructing infectious disease trend prediction model | |
de Freitas | Bayesian methods for neural networks | |
Gillijns et al. | What is the ensemble Kalman filter and how well does it work? | |
Wang et al. | Dynamic poisson autoregression for influenza-like-illness case count prediction | |
KR102254817B1 (en) | Method and apparatus for real-time ensemble streamflow forecasting | |
Ghanem | Hybrid stochastic finite elements and generalized Monte Carlo simulation | |
CN111768875A (en) | Infectious disease epidemic situation prediction method, system, device and storage medium | |
Golchi et al. | Sequentially constrained monte carlo | |
CN113571200A (en) | Infectious disease propagation prediction method based on graph filter-vector autoregressive model | |
CN115830865A (en) | Vehicle flow prediction method and device based on adaptive hypergraph convolution neural network | |
Lee et al. | Markov chain approximation algorithm for event-based state estimation | |
Chafaa et al. | Fuzzy modelling using Kalman filter | |
KR20210044052A (en) | Method and apparatus for predicting streamflow | |
JP7400819B2 (en) | Prediction device, prediction method, and prediction program | |
Shin et al. | Dynamic ICAR Spatiotemporal Factor Models | |
CN117153423A (en) | Bayesian inference-based method for predicting outbreak time of new-born infectious disease | |
Renukadevi et al. | Covid-19 Forecasting with Deep Learning-based Half-binomial Distribution Cat Swarm Optimization. | |
Shahtori et al. | Sequential monte carlo filtering estimation of ebola progression in west africa | |
Roziqin et al. | A comparison of montecarlo linear and dynamic polynomial regression in predicting dengue fever case | |
Vanbrackle et al. | A study of the average run length characteristics of the National Notifiable Diseases Surveillance System | |
US6807652B2 (en) | Method of robust semiconductor circuit products design using rational robust optimization | |
Chao et al. | Performance modeling using additive regression splines | |
CN114566048A (en) | Traffic control method based on multi-view self-adaptive space-time diagram network | |
Chen et al. | A Novel Point Process Model for COVID-19: Multivariate Recursive Hawkes Process | |
Ramesh et al. | Statistical model checking for dynamical processes on networks: a healthcare application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |