CN114647819A - Grid point processing method of environmental data based on graph convolution network - Google Patents

Grid point processing method of environmental data based on graph convolution network Download PDF

Info

Publication number
CN114647819A
CN114647819A CN202210325115.2A CN202210325115A CN114647819A CN 114647819 A CN114647819 A CN 114647819A CN 202210325115 A CN202210325115 A CN 202210325115A CN 114647819 A CN114647819 A CN 114647819A
Authority
CN
China
Prior art keywords
matrix
data
grid point
monitoring data
monitoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210325115.2A
Other languages
Chinese (zh)
Inventor
张晓霞
周鹏程
胡峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University of Post and Telecommunications
Original Assignee
Chongqing University of Post and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University of Post and Telecommunications filed Critical Chongqing University of Post and Telecommunications
Priority to CN202210325115.2A priority Critical patent/CN114647819A/en
Publication of CN114647819A publication Critical patent/CN114647819A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The invention relates to the field of data processing, in particular to a grid point processing method of environmental data based on a graph convolution network; the method comprises the steps of obtaining air quality monitoring data and meteorological monitoring data in a target area; performing deletion processing on all monitoring data, and mapping the monitoring data of the station to a grid point matrix divided by a target area; generating a dynamic wind field graph by using wind direction data and wind speed data, and calculating a wind field adjacency matrix by using a Dijkstra algorithm; constructing a mask matrix at each moment according to the air quality concentration data, and constructing a characteristic vector set Z at each moment according to the wind field adjacency matrix, the mask matrix and the meteorological monitoring data; generating a target matrix Y at each moment according to the mask matrix and the air mass concentration data; and inputting the characteristic vector set Z matrix into the trained graph convolution neural network model to obtain an estimation matrix P of the target matrix. The invention can improve the grid-point precision of the environmental data.

Description

Grid point processing method of environmental data based on graph convolution network
Technical Field
The invention relates to the field of data processing, in particular to a grid point processing method of environment data based on a graph convolution network.
Background
In recent years, the problem of environmental pollution is emphasized by people, the density of automatic environmental monitoring sites is greatly improved, and under the background of the rapid development of intelligent grid forecasting technology and the requirement of human production activities on air quality service based on positions, grid data generated by monitoring data with irregular spatial resolution and discrete distribution of the monitoring sites to be regular has important social service and business application values.
Common methods for processing site data into grid points include objective analysis, remote sensing inversion, data assimilation, statistical interpolation, kriging interpolation, tensor completion and the like, but the existing technology processes single environmental data without considering the influence of other factors on target data, so that the grid point effect is not ideal.
Disclosure of Invention
Aiming at the defects in the technology, the invention provides a grid point processing method of environmental data based on a graph convolution network, which is used for gridding the environmental data by using meteorological monitoring data, and fitting the influence of the meteorological data on the air quality data and the change rule of the air quality data in the grid point process by using the strong fitting capacity of a neural network, thereby improving the grid point accuracy of the environmental data.
The grid point processing method of the environment data based on the graph convolution network comprises the following steps:
s1: acquiring air quality monitoring data of N air quality monitoring stations and meteorological monitoring data of M meteorological monitoring stations in a target area; the air quality monitoring data comprises air quality concentration data, and the meteorological monitoring data comprises humidity, temperature, wind direction and wind speed data;
s2: preprocessing the missing parts of all monitoring data, and mapping the monitoring data of the corresponding station to a grid point matrix divided by a target area according to the scale according to the coordinates of the monitored station;
s3: generating a dynamic wind field graph by using wind direction data and wind speed data in meteorological monitoring data in the lattice point matrix, and calculating a wind field adjacency matrix by using a Dijkstra algorithm;
s4: constructing a mask matrix at each moment according to the air quality concentration data in the lattice point matrix, and constructing a characteristic vector set Z at each moment according to the wind field adjacency matrix, the mask matrix and the meteorological monitoring data;
s5: generating a target matrix Y at each moment according to the mask matrix and the air quality concentration data;
s6: and inputting the characteristic vector set Z matrix into a trained graph convolution neural network model to obtain an estimation matrix P of the target matrix.
The grid point processing method of the environmental data based on the graph convolution network has the following beneficial effects:
1. according to the method, wind speed data and wind direction data are converted into a dynamic wind field matrix, a wind field adjacent matrix is calculated according to the dynamic wind field matrix, and finally the adjacent matrix is input into a model, so that the influence mapping relation of a wind field on the air quality can be learned by utilizing a neural network model, and the air quality data lattice precision is improved;
2. according to the method, the GCN layer based on the spectrum is added into the graph convolution neural network model, adverse factors influencing gridding precision can be filtered in the training process, and the grid point processing effect is better and more accurate;
3. according to the method, the mask matrix after random zero setting is used for carrying out random initialization on the target matrix, so that the robustness of the model is improved, and the lattice point effect of the model on the air quality is more stable;
4. when the environment quality data is subjected to lattice transformation, F nodes before the current time node are used for providing information for the current node, so that the lattice transformation effect is more reasonably distributed;
5. the invention also fully considers the influence of temperature and humidity on the air quality, and the lattice point effect is more accurate.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
FIG. 1 is a schematic overall flow diagram of the present invention;
FIG. 2 is a schematic diagram of the generation of a mask and a mask2 matrix according to an embodiment of the invention;
fig. 3 is a schematic diagram of generating a feature vector set Z according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of generating a target matrix Y according to an embodiment of the present invention;
fig. 5 is a main data flow diagram provided by an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a schematic overall flow chart of the present invention, and as shown in fig. 1, the method includes:
s1: acquiring air quality monitoring data of N air quality monitoring stations and meteorological monitoring data of M meteorological monitoring stations in a target area; the air quality monitoring data comprises air quality concentration data, and the meteorological monitoring data comprises humidity, temperature, wind direction and wind speed data;
in this embodiment of the present invention, the step S1 may include:
s11: acquiring data of each time node with one hour of monitoring time frequency of N air quality monitoring stations in a target area;
s12: acquiring data of each time node with one hour of humidity, temperature, wind direction and wind speed time frequency monitored by M meteorological stations in a target area;
s13: the air mass concentration data and the meteorological monitoring data are sequenced in time, the data are aligned in time, and the default part is filled with-1.
And sequencing the air quality concentration data and the meteorological monitoring data according to time to form a time sequence, and processing the time sequence in the subsequent process to realize lattice point processing of the time sequence.
It is to be understood that, in the embodiment of the present invention, the air quality monitoring data of the air quality monitoring station may select PM2.5 concentration data, and may also be other air quality concentration data such as PM10, CO, SO2, NO2, O3, AQI, and for convenience of description, the PM2.5 concentration data is exemplified in the embodiment of the present invention.
S2: preprocessing the missing parts of all monitoring data, and mapping the monitoring data of the corresponding station to a grid point matrix divided by a target area according to the scale according to the coordinates of the monitored station;
in this embodiment of the present invention, the step S2 includes the following steps:
s21: obtaining the boundary of a target area, dividing the target area into fishing grid points (grids) according to a certain scale, and obtaining the center point coordinate l of each gridijForming a central point coordinate matrix L, namely a lattice point matrix, wherein i and j are row and column coordinates of the lattice point matrix respectively;
in this embodiment, the scale may be divided according to 1km × 1km, and certainly, in an actual situation, a person skilled in the art may select different scales to divide according to actual needs, so as to divide the target area into different grids; the target area can be rectangular, circular or other irregular figures, and if the target area is non-rectangular, the non-rectangular target area needs to be filled outwards to be rectangular, so that the target area is conveniently divided into grids with the same size, and the subsequent grid point processing on the environmental data is facilitated.
After a target area is divided into a plurality of grids, the coordinates of the central point of each grid are used as the coordinates of the grid, and a central point coordinate matrix is formed; the row and column coordinates of the coordinate matrix can reflect the position of the grid in the target area.
S22: according to the coordinates and the longitude and latitude distances of the air quality monitoring station, traversing and calculating a first distance from the air quality monitoring station to each grid point in the grid point matrix, determining the grid point position when the first distance is shortest, and recording the grid point position in a first mapping table D1Performing the following steps;
wherein the first mapping table D1Comprises N mapping coordinates of air quality monitoring stations which are respectively mapped to the shortest distance in a grid point matrix,
Figure BDA0003573090620000041
Figure BDA0003573090620000042
the row and column marks representing the mapping of the ith air quality monitoring station to the data matrix are shown, i belongs to [1, N ∈]。
S23: reading the monitoring data of all the air quality monitoring stations, and supplementing the default monitoring data of the current moment into the average value of the monitoring data of the previous moment and the next moment of the monitoring station or the average value of the monitoring data of all the moments of the monitoring station for the same grid point;
in the embodiment of the invention, the monitoring data of all N air quality monitoring stations are read, and if the monitoring data at the current moment is found to have default, the monitoring data is supposed to be in
Figure BDA0003573090620000043
By default, i.e.
Figure BDA0003573090620000044
Then order
Figure BDA0003573090620000045
If a number of consecutive default values occur, it will be filled directly with the average value for that station, where
Figure BDA0003573090620000046
Represents the air mass concentration value at the time t of the nth station, t-1 represents the previous time before the time t, t +1 represents the later time after the time t,wherein t is the (0, L) and L is the time sequence length of the monitoring data.
S24: according to the coordinates and the longitude and latitude distances of the weather monitoring station, traversing and calculating a second distance from the weather monitoring station to each grid point in the grid point matrix, determining the grid point position when the second distance is shortest, and recording the grid point position in a second mapping table D2The preparation method comprises the following steps of (1) performing;
wherein the first mapping table D2Comprises M weather monitoring stations which are respectively mapped to mapping coordinates with the shortest distance in a grid point matrix,
Figure BDA0003573090620000051
Figure BDA0003573090620000052
the row and column marks representing the ith weather monitoring station mapped into the data matrix are shown, i belongs to [1, M ∈]。
S25: reading the meteorological monitoring data of all meteorological monitoring sites, and supplementing the default monitoring data of the current moment into the average value of the monitoring data of the previous and subsequent moments of the monitoring sites or the average value of the monitoring data of all moments of the monitoring sites for the same grid point;
in the embodiment of the invention, the meteorological monitoring data of all M meteorological monitoring sites are read, and for each acquired meteorological data, the meteorological monitoring data are
Figure BDA0003573090620000053
By default, i.e.
Figure BDA0003573090620000054
Then order
Figure BDA0003573090620000055
Figure BDA0003573090620000056
If a number of consecutive default values occur, it will be filled directly with the average value for that station, where
Figure BDA0003573090620000057
The method comprises the steps of representing a meteorological data value of an mth meteorological monitoring station at a time t, representing a previous time of the time t by t-1, representing a later time of the time t by t +1, wherein t belongs to (0, L), and L is the length of a monitoring data time series.
In the embodiment, for the data default part, namely the meteorological monitoring data, the generated data matrix is optimized by using the Cressman filling algorithm, so that a new optimized matrix is obtained and the original matrix is covered.
S26: at the current moment t, the air quality concentration data of each air quality monitoring station is mapped according to a first mapping table D1Mapping to matrix VtAt the corresponding position, the humidity, temperature, wind direction and wind speed data of each meteorological monitoring station are mapped according to a second mapping table D2Mapping to matrix Ht、Tt
Figure BDA0003573090620000058
At the corresponding position;
wherein, VtAir mass concentration data matrix, H, representing time tt、Tt
Figure BDA0003573090620000059
And sequentially obtaining a humidity, temperature, wind direction and wind speed data matrix at the moment t, wherein t belongs to (0, L), and L is the time sequence length of the monitoring data.
In the embodiment of the invention, for any time t, PM2.5 data of each air quality monitoring station at the time t can be mapped according to the first mapping table D1Mapping to matrix VtAt the corresponding position, the humidity, temperature, wind direction and wind speed data of each weather monitoring station at the moment t are mapped according to a second mapping table D2Mapping to matrix Ht、Tt
Figure BDA0003573090620000061
At a corresponding position where VtThe monitoring value representing PM2.5 at the time t is mapped to a target area and divided into fishing nets according to the scale of 1km multiplied by 1km, and H is similar to the monitoring valuet、Tt
Figure BDA0003573090620000062
The monitored values of humidity, temperature, wind direction and wind speed at the time t are mapped to a target area and divided into fishing nets according to the scale of 1km multiplied by 1 km.
S3: generating a dynamic wind field graph by using wind direction data and wind speed data in meteorological monitoring data in the lattice point matrix, and calculating a wind field adjacency matrix by using a Dijkstra algorithm;
in this embodiment of the present invention, the step S3 may include:
s31: using wind direction data matrices
Figure BDA0003573090620000063
And wind speed data matrix
Figure BDA0003573090620000064
Generating a dynamic wind farm map matrix
Figure BDA0003573090620000065
Wherein the content of the first and second substances,
Figure BDA0003573090620000066
representing the distance from the ith grid point to the jth grid point in the grid point matrix, wherein thetaijRepresenting the wind direction included angle from the ith grid point to the jth grid point, wherein t belongs to (0, L), and L is the length of the monitoring data time sequence;
s32: according to the dynamic wind field diagram matrix DWtCalculating the shortest path a from the ith lattice point to the jth lattice point in the lattice point matrix by using Dijkstra algorithmijRespectively calculating the shortest path for each lattice point in the lattice point matrix, and forming all the shortest paths into a wind field adjacency matrix At
S4: constructing a mask matrix at each moment according to the air quality concentration data in the lattice point matrix, and constructing a characteristic vector set Z at each moment according to the wind field adjacency matrix, the mask matrix and the meteorological monitoring data;
in this embodiment of the present invention, the step S4 includes the following steps:
s41: initializing the first mask matrix mask at time ttAnd a second mask matrix mask2tValue of air mass concentration data matrix at time t
Figure BDA0003573090620000067
When the air is not empty, the air conditioner is not empty,
Figure BDA0003573090620000068
otherwise
Figure BDA0003573090620000069
Order to
Figure BDA00035730906200000610
And from mask2tRandomly selecting K grid points with the numerical value of 1, resetting the grid points to be 0, updating a second mask matrix, wherein i and j are horizontal coordinates and vertical coordinates of the mask matrix;
s42: v, W for each of the truncated time t and the first F timess、WdH, T, A, mask and mask2 matrixes, and constructing feature vector base D of F moments before the moment tfIs shown as Df=[x1,x2,x3,x4,x5,x6,x7];
S43: constructing a characteristic vector set Z according to the length of the monitoring data time sequence and the characteristic vector base number, so that Z is [ D ═ D1,D2,…,Dn,…,DJ];
Wherein x is1,x2,x3,x4,x5,x6,x7Sequentially represents the air pollution concentration matrix V mask2 and the wind speed matrix W of the first F time nodes at the time tsWind direction matrix WdHumidity matrix H, temperature matrix T, wind field adjacency matrix A and mask matrix; f, t, F are integers, F belongs to [1, J ]]T is equal to (0, L), and F is less than L; j represents a feature vector set dimension; (n, J) epsilon (0, L-F), and L is the time sequence length of the monitoring data.
FIG. 2 shows a mask matrix according to an embodiment of the present inventionAs shown in fig. 2, in the embodiment of the present invention, first, a first mask matrix needs to be generated according to the air mass concentration data matrix at time t, where the air mass concentration data matrix V at time ttThe method comprises the steps of determining a matrix coordinate with an actual numerical value, setting the matrix coordinate to be 1 at a corresponding position of a first mask matrix and setting the matrix coordinate to be 0 at a corresponding position of being empty, thereby forming the first mask matrixtConsidering that the first mask matrix is completely determined by the air quality concentration data and has weak anti-interference capability, the invention partially randomizes the matrix coordinates set to 1 in the first mask matrix, selects partial matrix coordinates to be reset to 0, and thereby forms the second mask matrix 2tThe two mask matrices will play different roles in subsequent graph convolution networks.
It can be understood that, in the embodiment of the present invention, the first mask matrix is obtained according to the wind field adjacency matrix, and the second mask matrix is obtained according to the first mask matrix and the random initialization, in the embodiment of the present invention, the first mask matrix and the second mask matrix before and after the random initialization are respectively used, which can enhance the robustness of the graph convolution network model, so that the lattice localization effect of the model on the air quality is more stable.
Fig. 3 is a schematic diagram illustrating feature vector set generation in an embodiment of the present invention, and as shown in fig. 3, in the embodiment of the present invention, it is first necessary to obtain an air pollution concentration matrix V mask2 and a wind speed matrix W at the first F times of each time tsWind direction matrix WdThe humidity matrix H, the temperature matrix T, the wind field adjacency matrix A and the mask matrix are integrated at each moment T to form a characteristic vector set with the length J.
S5: generating a target matrix Y at each moment according to the mask matrix and the air quality concentration data;
in this embodiment of the present invention, the step S5 includes the following steps:
s51: according to the second mask matrix and air qualityThe volume concentration data matrix generates a target matrix at time t, denoted as Yt=Vt·mask2t
S52: integrating the target matrixes at different moments to construct a target matrix set Y ═ Y1,Y2,…,Yn,…,YJ];
Wherein, VtMask2 showing the mapping of the air mass concentration data at time t to the grid point position of the target regiontA second mask matrix representing time t; t belongs to (0, L), and J represents a characteristic vector set dimension; (n, J) epsilon (0, L-F), and L is the time sequence length of the monitoring data.
Fig. 4 is a schematic diagram showing generation of a target matrix in an embodiment of the present invention, and as shown in fig. 4, in the embodiment of the present invention, an air mass concentration data matrix V at each time t needs to be obtained firsttAnd a second mask matrix mask2tThe two matrixes are subjected to dot multiplication to obtain a matrix which is a target matrix Y at the moment tt
S6: and inputting the characteristic vector set Z matrix into a trained graph convolution neural network model to obtain an estimation matrix P of the target matrix.
FIG. 5 is a diagram convolution neural network model according to an embodiment of the present invention, and as shown in FIG. 5, the diagram convolution neural network model includes a time convolution network layer, a spectrum-based diagram convolution network layer, a space-based diagram convolution network layer, and a convolution neural network output layer, which are sequentially arranged according to an embodiment of the present invention; the time convolution network layer extracts time sequence characteristics of an air pollution concentration matrix, a wind speed matrix, a wind direction matrix, a humidity matrix and a temperature matrix, the spectrum-based graph convolution network layer carries out noise disturbance denoising on a wind field adjacent matrix, the space-based graph convolution network layer maps the interaction relation between the time sequence characteristics of the time convolution network layer and the wind field characteristics of the spectrum-based graph convolution network layer, and the convolution neural network output layer processes the interaction relation and the mask matrix of the space-based graph convolution network layer and restores implicit characteristics into a target matrix.
In the invention, the TCN can be used for extracting the sequence relation of the time characteristics, the change trend of data in the lattice point process can be better drawn, the spectrum-based GCN filters random noise in the lattice point process, the GCN simulates the influence of wind on air quality pollutants based on the space, and the CNN generates a target matrix, and the influence of humidity and temperature on air quality lattice point is fully considered, so that the lattice point effect is more accurate. Meanwhile, random disturbance is added to the environment data lattice model in the training process based on the graph convolution neural network, so that the stability of the model is higher.
In the embodiment of the invention, the training process of the graph convolution neural network model comprises the steps of inputting the characteristic vector set Z as a model to obtain an estimation matrix P of a target matrix, carrying out counterpoint multiplication on the estimation matrix P of the target matrix and a mask matrix, comparing a counterpoint multiplication result with the target matrix Y, and optimizing the parameters of the graph convolution neural network model according to the comparison result until the error is within a set range.
It can be understood that, in the embodiment of the present invention, only in the training process of the model, the result of the bit multiplication needs to be compared with the target matrix Y, and the neural network model parameters are optimized according to the comparison result, and in the application process of the model, that is, when the air quality monitoring data of N air quality monitoring sites and the weather monitoring data of M weather monitoring sites in the target area are directly processed, only the estimation matrix P of the target matrix needs to be output, where the estimation matrix P is a dense matrix and the target matrix is a sparse matrix.
In this example, in step S6, the graph-convolution-based neural network uses MAE as a loss function for the environment data lattice model, and the calculation formula is
Figure BDA0003573090620000091
The error threshold is set to 5 and the model stops training when the model average loss is less than 5.
In the description of the present invention, it is to be understood that the terms "coaxial", "bottom", "one end", "top", "middle", "other end", "upper", "one side", "top", "inner", "outer", "front", "center", "both ends", and the like, indicate orientations or positional relationships based on those shown in the drawings, and are only for convenience of description and simplicity of description, and do not indicate or imply that the devices or elements referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, are not to be construed as limiting the present invention.
In the present invention, unless otherwise expressly stated or limited, the terms "mounted," "disposed," "connected," "fixed," "rotated," and the like are to be construed broadly, e.g., as meaning fixedly connected, detachably connected, or integrally formed; can be mechanically or electrically connected; the terms may be directly connected or indirectly connected through an intermediate, and may be communication between two elements or interaction relationship between two elements, unless otherwise specifically limited, and the specific meaning of the terms in the present invention will be understood by those skilled in the art according to specific situations.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (8)

1. A grid point processing method of environment data based on graph convolution network is characterized by comprising the following steps:
s1: acquiring air quality monitoring data of N air quality monitoring stations and meteorological monitoring data of M meteorological monitoring stations in a target area; the air quality monitoring data comprises air quality concentration data, and the meteorological monitoring data comprises humidity, temperature, wind direction and wind speed data;
s2: preprocessing the missing parts of all monitoring data, and mapping the monitoring data of the corresponding station to a grid point matrix divided by a target area according to the scale according to the coordinates of the monitored station;
s3: generating a dynamic wind field graph by using wind direction data and wind speed data in meteorological monitoring data in the lattice point matrix, and calculating a wind field adjacency matrix by using a Dijkstra algorithm;
s4: constructing a mask matrix at each moment according to the air quality concentration data in the lattice point matrix, and constructing a characteristic vector set Z at each moment according to the wind field adjacency matrix, the mask matrix and the meteorological monitoring data;
s5: generating a target matrix Y at each moment according to the mask matrix and the air quality concentration data;
s6: and inputting the characteristic vector set Z matrix into a trained graph convolution neural network model to obtain an estimation matrix P of the target matrix.
2. The grid point processing method of environment data based on graph convolution network as claimed in claim 1, wherein: the step S1 includes respectively obtaining air quality concentration data of the air quality monitoring station at each moment in a first frequency cycle and humidity, temperature, wind direction and wind speed data of the weather monitoring station in the first frequency cycle; and sequencing the corresponding air quality monitoring data and the meteorological monitoring data according to time sequence, and aligning the corresponding monitoring data according to time.
3. The grid point processing method of environment data based on graph convolution network as claimed in claim 1, wherein: the step S2 includes the following steps:
s21: obtaining the boundary of a target area, dividing the target area into fishing grid points, namely grids according to a certain scale, and obtaining the center point coordinate l of each gridijForming a central point coordinate matrix P, i.e. a grid point matrix, wherein i and j are rows of the grid point matrix respectivelyAnd column coordinates;
s22: according to the coordinates and the longitude and latitude distances of the air quality monitoring station, traversing and calculating a first distance from the air quality monitoring station to each grid point in the grid point matrix, determining the grid point position when the first distance is shortest, and recording the grid point position in a first mapping table D1Performing the following steps;
s23: reading the monitoring data of all the air quality monitoring stations, and supplementing the default monitoring data of the current moment into the average value of the monitoring data of the previous moment and the next moment of the monitoring station or the average value of the monitoring data of all the moments of the monitoring station for the same grid point;
s24: according to the coordinates and the longitude and latitude distances of the weather monitoring station, traversing and calculating a second distance from the weather monitoring station to each grid point in the grid point matrix, determining the grid point position when the second distance is shortest, and recording the grid point position in a second mapping table D2Performing the following steps;
s25: reading the meteorological monitoring data of all meteorological monitoring sites, and supplementing the default monitoring data of the current moment into the average value of the monitoring data of the previous and subsequent moments of the monitoring sites or the average value of the monitoring data of all moments of the monitoring sites for the same grid point;
s26: at the current moment t, the air quality concentration data of each air quality monitoring station is mapped according to a first mapping table D1Mapping to matrix VtAt the corresponding position, the humidity, temperature, wind direction and wind speed data of each meteorological monitoring station are mapped according to a second mapping table D2Mapping to matrix Ht、Tt
Figure FDA0003573090610000021
At the corresponding position;
wherein, VtAir mass concentration data matrix, H, representing time tt、Tt
Figure FDA0003573090610000022
And sequentially obtaining a humidity, temperature, wind direction and wind speed data matrix at the time t, wherein t belongs to (0, L), and L is the time sequence length of the monitoring data.
4. The grid point processing method of environment data based on graph convolution network according to claim 1, characterized in that: the step S3 includes the following steps:
s31: using a wind direction data matrix
Figure FDA0003573090610000023
And wind speed data matrix
Figure FDA0003573090610000024
Generating a dynamic wind farm map matrix DWt,
Figure FDA0003573090610000025
Wherein, DWt ijRepresenting the distance theta from the ith grid point to the jth grid point in the grid point matrixijRepresenting the wind direction angle from the ith grid point to the jth grid point,
Figure FDA0003573090610000026
representing the wind speed from the ith grid point to the jth grid point in the wind speed data matrix at the time t, wherein t belongs to (0, L), and L is the time sequence length of the monitoring data;
s32: according to the dynamic wind field diagram matrix DWtCalculating the shortest path a from the ith lattice point to the jth lattice point in the lattice point matrix by using Dijkstra algorithmijRespectively calculating the shortest path for each lattice point in the lattice point matrix, and forming all the shortest paths into a wind field adjacency matrix At
5. The grid point processing method of environment data based on graph convolution network as claimed in claim 1, wherein: the step S4 includes the following steps:
s41: initializing the first mask matrix mask at time ttAnd a second mask matrix mask2tValue V of the air mass concentration data matrix at time tt ijWhen the air is not empty, the air conditioner is not empty,
Figure FDA0003573090610000031
otherwise
Figure FDA0003573090610000032
Order to
Figure FDA0003573090610000033
And from mask2tRandomly selecting K grid points with the numerical value of 1 to reset to 0, and updating a second mask matrix;
s42: v, W for each of the truncated time t and the first F timess、WdH, T, A, mask and mask2 matrixes, and constructing feature vector base D of F moments before the moment tfIs represented by Df=[x1,x2,x3,x4,x5,x6,x7];
S43: constructing a characteristic vector set Z according to the length of the monitoring data time sequence and the characteristic vector base number, so that Z is [ D ═ D1,D2,…,Dn,…,DJ];
Wherein x is1,x2,x3,x4,x5,x6,x7Sequentially represents the air pollution concentration matrix V mask2 and the wind speed matrix W of the first F time nodes at the time tsWind direction matrix WdHumidity matrix H, temperature matrix T, wind field adjacency matrix A and mask matrix; f, t, F are integers, F belongs to [1, J ]],t∈(0,L),F<L; j represents a feature vector set dimension; (n, J) epsilon (0, L-F), and L is the time sequence length of the monitoring data.
6. The grid point processing method of environment data based on graph convolution network as claimed in claim 1, wherein: the step S5 includes the following steps:
s51: generating a target matrix at time t, denoted as Y, from the second mask matrix and the air mass concentration data matrixt=Vt·mask2t
S52: will be differentIntegrating the target matrixes at the moment to construct a target matrix set Y ═ Y1,Y2,…,Yn,…,YJ];
Wherein, VtMask2 showing the mapping of the air mass concentration data at time t to the grid point position of the target regiontA second mask matrix representing time t; t belongs to (0, L), and J represents a characteristic vector set dimension; (n, J) epsilon (0, L-F), and L is the time sequence length of the monitoring data.
7. The grid point processing method of environment data based on graph convolution network as claimed in claim 1, wherein: the graph convolution neural network model in the step S6 comprises a time convolution network layer, a graph convolution network layer based on a spectrum, a graph convolution network layer based on a space and a convolution neural network output layer which are sequentially arranged; the time convolution network layer extracts time sequence characteristics of an air pollution concentration matrix, a wind speed matrix, a wind direction matrix, a humidity matrix and a temperature matrix, the spectrum-based graph convolution network layer carries out noise disturbance denoising on a wind field adjacent matrix, the space-based graph convolution network layer maps the interaction relation between the time sequence characteristics of the time convolution network layer and the wind field characteristics of the spectrum-based graph convolution network layer, and the convolution neural network output layer processes the interaction relation and the mask matrix of the space-based graph convolution network layer and restores implicit characteristics into a target matrix.
8. The grid point processing method of environment data based on graph convolution network as claimed in claim 1 or 7, wherein: the training process of the graph convolution neural network model comprises the steps of inputting the characteristic vector set Z as a model to obtain an estimation matrix P of a target matrix, carrying out counterpoint multiplication on the estimation matrix P of the target matrix and a mask matrix, comparing a counterpoint multiplication result with the target matrix Y, and optimizing the parameters of the graph convolution neural network model according to the comparison result until the error is within a set range.
CN202210325115.2A 2022-03-30 2022-03-30 Grid point processing method of environmental data based on graph convolution network Pending CN114647819A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210325115.2A CN114647819A (en) 2022-03-30 2022-03-30 Grid point processing method of environmental data based on graph convolution network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210325115.2A CN114647819A (en) 2022-03-30 2022-03-30 Grid point processing method of environmental data based on graph convolution network

Publications (1)

Publication Number Publication Date
CN114647819A true CN114647819A (en) 2022-06-21

Family

ID=81996236

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210325115.2A Pending CN114647819A (en) 2022-03-30 2022-03-30 Grid point processing method of environmental data based on graph convolution network

Country Status (1)

Country Link
CN (1) CN114647819A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115840881A (en) * 2023-02-15 2023-03-24 湖南红普创新科技发展有限公司 Air data processing method and device and related equipment
CN116502539A (en) * 2023-05-18 2023-07-28 淮阴工学院 VOCs gas concentration prediction method and system
CN116610921A (en) * 2023-06-14 2023-08-18 深圳市顶尖传诚科技有限公司 Intelligent park information management system and method based on big data
CN116842073A (en) * 2023-07-10 2023-10-03 四川警察学院 Graph data mining method and device and electronic equipment
CN117745977A (en) * 2024-02-20 2024-03-22 四川开澜科技有限公司 Mountain natural wind field analysis method, system and terminal based on 3DGIS

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115840881A (en) * 2023-02-15 2023-03-24 湖南红普创新科技发展有限公司 Air data processing method and device and related equipment
CN116502539A (en) * 2023-05-18 2023-07-28 淮阴工学院 VOCs gas concentration prediction method and system
CN116502539B (en) * 2023-05-18 2024-02-27 淮阴工学院 VOCs gas concentration prediction method and system
CN116610921A (en) * 2023-06-14 2023-08-18 深圳市顶尖传诚科技有限公司 Intelligent park information management system and method based on big data
CN116610921B (en) * 2023-06-14 2024-02-09 深圳市顶尖传诚科技有限公司 Intelligent park information management system and method based on big data
CN116842073A (en) * 2023-07-10 2023-10-03 四川警察学院 Graph data mining method and device and electronic equipment
CN116842073B (en) * 2023-07-10 2024-04-09 四川警察学院 Graph data mining method and device and electronic equipment
CN117745977A (en) * 2024-02-20 2024-03-22 四川开澜科技有限公司 Mountain natural wind field analysis method, system and terminal based on 3DGIS

Similar Documents

Publication Publication Date Title
CN114647819A (en) Grid point processing method of environmental data based on graph convolution network
CN104102845B (en) The interpolation method of dimension self-adaption and the interplotation system of dimension self-adaption
CN110650436B (en) WiFi data-based position fingerprint database establishing and fitting method
CN110335355B (en) Automatic calculation method for water level height of large shallow lake
CN112905560A (en) Air pollution prediction method based on multi-source time-space big data deep fusion
CN110418283B (en) WiFi data-based location fingerprint database creation method
CN109978275B (en) Extreme strong wind speed prediction method and system based on mixed CFD and deep learning
CN115079307B (en) Wind farm weather forecast method and system based on multimode optimal integration
CN111695440A (en) GA-SVR lake level measurement and prediction method based on radar altimeter
CN114154702A (en) Pollutant concentration prediction method and device based on multi-granularity graph space-time neural network
CN113344149A (en) PM2.5 hourly prediction method based on neural network
CN116223395A (en) Near-surface trace gas concentration inversion model and inversion method
CN114760587A (en) Positioning method based on UWB sensor base station grouping strategy
CN113095586A (en) Short-term multi-meteorological-element forecasting method based on deep neural network
CN112954623A (en) Resident occupancy rate estimation method based on mobile phone signaling big data
CN115032720A (en) Application of multi-mode integrated forecast based on random forest in ground air temperature forecast
CN113297904B (en) Method and system for estimating alpine grassland biomass based on satellite driving model
CN116304391B (en) Method and system for generating recommended selection area after map area frame selection
CN117251989A (en) Meteorological site carbon water flux information determination method, system and electronic equipment
CN111811465A (en) Method for predicting sea wave effective wave height based on multi-sine function decomposition neural network
CN115840908B (en) Method for constructing PM2.5 three-dimensional dynamic monitoring field based on microwave link of LSTM model
CN107590346B (en) Downscaling correction model based on spatial multi-correlation solution set algorithm
CN115936238A (en) Method, system, equipment and medium for predicting middle and long term output of global wind power
CN112990609B (en) Air quality prediction method based on space-time bandwidth self-adaptive geographical weighted regression
CN115859789A (en) Method for improving inversion accuracy of polar atmosphere temperature profile

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination