CN116400433A - Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution - Google Patents

Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution Download PDF

Info

Publication number
CN116400433A
CN116400433A CN202310069184.6A CN202310069184A CN116400433A CN 116400433 A CN116400433 A CN 116400433A CN 202310069184 A CN202310069184 A CN 202310069184A CN 116400433 A CN116400433 A CN 116400433A
Authority
CN
China
Prior art keywords
time
sequence
convolution
data
station
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310069184.6A
Other languages
Chinese (zh)
Inventor
向黎
曹增辉
张滢
杨道勇
聂于棚
袁阅
吕苗
刘宸钊
甘思旧
敬文慧
张伟
许磊
张瑞林
任燕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
63796 FORCES PLA
Original Assignee
63796 FORCES PLA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 63796 FORCES PLA filed Critical 63796 FORCES PLA
Priority to CN202310069184.6A priority Critical patent/CN116400433A/en
Publication of CN116400433A publication Critical patent/CN116400433A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01WMETEOROLOGY
    • G01W1/00Meteorology
    • G01W1/10Devices for predicting weather conditions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Environmental & Geological Engineering (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Ecology (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Data Mining & Analysis (AREA)
  • Environmental Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Atmospheric Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution, and belongs to the field of weather forecasting. The method selects important single-station meteorological observation elements to obtain a single-station historical observation sequence; extracting multi-element forecast data to obtain a numerical forecast space-time sequence; normalizing the data; constructing a three-dimensional convolution module, inputting the normalized numerical forecasting space-time sequence, and finally producing a time sequence containing space features; constructing an interactive learning model ICM, and processing a single-station historical observation sequence and a three-dimensional convolution extracted time sequence to generate time characteristic information of two data; constructing an MCNN network model, fusing the two branch time characteristic information obtained in the S3, and forming a prediction result through interactive learning and full connection again; air temperature prediction is performed using the model. The time sequence prediction model built by the method improves stability and prediction accuracy, and the method is suitable for time-by-time air temperature prediction of 72-120 hours of a single station.

Description

Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution
Technical Field
The invention belongs to the field of weather forecast, and particularly relates to a single-station time-by-time air temperature forecast method based on data fusion and mixed convolution.
Background
The data of a meteorological site, representing historical or current meteorological element changes, can be used to predict the atmospheric conditions of the site, but it is difficult to obtain the characteristics of the atmospheric nonlinear change curve. Numerical weather forecast aims to simulate the atmospheric forecast process and provide future weather conditions. However, the numerical prediction is based on discrete numerical calculation, the result is often grid data with errors, accurate local weather prediction is difficult to obtain, and the mountain local weather conditions are complex and changeable, so that the prediction is challenged. Therefore, the numerical forecasting data and the historical data can be used for fusion, the advantages of the numerical forecasting data and the historical data are fully exerted, and site atmosphere forecasting is carried out.
Disclosure of Invention
First, the technical problem to be solved
The technical problem to be solved by the invention is how to provide a single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution, so as to solve the problems that the numerical forecasting is based on discrete numerical calculation, the result is often grid data with errors, accurate local weather forecasting is difficult to obtain, and the local weather conditions of mountain areas are complex and changeable, so that challenges are brought to forecasting.
(II) technical scheme
In order to solve the technical problems, the invention provides a single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution, which comprises the following steps:
s1, data preprocessing:
s11, selecting important single-station meteorological observation elements to obtain a single-station historical observation sequence;
s12, extracting multi-element prediction data to obtain a numerical prediction space-time sequence;
s13, data normalization;
s2, three-dimensional convolution feature extraction:
constructing a three-dimensional convolution module, inputting the normalized numerical forecasting space-time sequence, and finally producing a time sequence containing space features;
s3, an interactive learning model ICM:
constructing an interactive learning model ICM, extracting local correlation of the time sequence by using a one-dimensional convolution filter cnn1d for the single-station historical observation sequence and the three-dimensional convolution extracted time sequence, establishing a hierarchical structure to extract sequences with different time scales, and generating time characteristic information of two data;
s4, a mixed convolution network model MCNN:
constructing an MCNN network model, fusing the two branch time characteristic information obtained in the S3, and forming a prediction result through interactive learning and full connection again; training the MCNN network model;
s5, predicting results:
and according to the trained MCNN network model, using latest observation data of the site and numerical mode forecast data, preprocessing, importing the model, and outputting 72-120 hours of time-by-time air temperature forecast data.
(III) beneficial effects
The invention provides a single-station time-by-time air temperature prediction method based on data fusion and mixed convolution, and provides a method MCNN (Mixed convolution neural network) based on data fusion and mixed convolution, wherein a time sequence prediction model is built, so that single-station historical data and numerical prediction data can be effectively utilized to perform time-by-time air temperature prediction, and stability and prediction accuracy are improved. The method is suitable for the time-by-time air temperature prediction of a single station for 72-120 hours.
Drawings
FIG. 1 is a structure (CNN) of a three-dimensional convolution model of the present disclosure;
FIG. 2 is a diagram of the structure of an Interactive Convolution Module (ICM) of the present invention;
FIG. 3 is a structure of a one-dimensional convolution operation (cnn 1 d) of the present invention;
FIG. 4 is a general structure of a space-time convolutional network of the present invention;
FIG. 5 is an example of test set prediction results;
FIG. 6 is a comparison of prediction results;
FIG. 7 is a comparison of two prediction results;
FIG. 8 is a comparison of the predicted results three.
Detailed Description
To make the objects, contents and advantages of the present invention more apparent, the following detailed description of the present invention will be given with reference to the accompanying drawings and examples.
The invention relates to the technical field of weather time-by-time air temperature prediction, in particular to a single-station medium-term time-by-time air temperature prediction method based on data fusion and mixed convolution.
The technical problem to be solved by the invention is to design a model architecture based on a deep neural network, integrate information characteristics of site data and forecast data, and realize single-site time-by-time air temperature prediction for 72-120 hours.
The data of a meteorological site, representing historical or current meteorological element changes, can be used to predict the atmospheric conditions of the site, but it is difficult to obtain the characteristics of the atmospheric nonlinear change curve. Numerical weather forecast aims to simulate the atmospheric forecast process and provide future weather conditions. However, the numerical prediction is based on discrete numerical calculation, the result is often grid data with errors, accurate local weather prediction is difficult to obtain, and the mountain local weather conditions are complex and changeable, so that the prediction is challenged. Therefore, the numerical forecasting data and the historical data can be used for fusion, the advantages of the numerical forecasting data and the historical data are fully exerted, and site atmosphere forecasting is carried out. The invention provides a method MCNN (Mixedconvolutionneuralnetwork) based on data fusion and mixed convolution, which is used for constructing a time sequence prediction model, so that single-station historical data and numerical prediction data can be effectively utilized to predict air temperature time by time, and the stability and the prediction accuracy are improved. The method is suitable for the time-by-time air temperature prediction of a single station for 72-120 hours.
The invention designs a deep neural network MCNN based on mixed convolution, builds a spatial convolution module and a time sequence module, realizes the time-by-time air temperature prediction in a single station 120h through feature fusion, and gives a prediction result at one time.
The technical scheme adopted by the invention is as follows: the mountain single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution comprises the following steps:
s1, data preprocessing:
s11, selecting important single-station meteorological observation elements to obtain a single-station historical observation sequence;
s12, extracting multi-element prediction data to obtain a numerical prediction space-time sequence;
s13, data normalization.
S2, three-dimensional convolution feature extraction (CNN):
a three-dimensional convolution module is constructed, which comprises a series of spatial convolution, downsampling, activation functions, full connection and the like, and numerical forecast space-time sequences after normalization are input, so that a time sequence containing spatial features is finally produced.
S3, interactive learning (ICM):
and respectively extracting local correlations of the time sequences by using a one-dimensional convolution filter for the single-station historical observation sequence and the three-dimensional convolution extracted time sequences, establishing a hierarchical structure to extract sequences with different time scales, and generating time characteristic information of the two data.
S4, hybrid convolutional network (MCNN):
and (3) fusing the two branch time characteristic information obtained in the step (S3), and forming a prediction result through interactive learning and full connection again.
S5, predicting results:
and according to the trained model, using latest detection data of the site and numerical mode forecast data, preprocessing, importing the model, and outputting temperature forecast data from time to time for 72-120 hours.
Example 1:
the mountain single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution mainly comprises the following steps: data preprocessing, three-dimensional convolution feature extraction, interactive learning, space-time convolution network and prediction result output.
S1, data preprocessing:
s11, selecting important single-station meteorological observation elements
The selected meteorological observation elements comprise: temperature, humidity, air pressure, wind speed, wind direction, total cloud amount and the like, and obtaining a single-station historical observation sequence S:
Figure SMS_1
where L is the time series length and N is the number of elements. Taking as input observation data of the site over a period of time (e.g., l=240 hours), and before that, in order to reduce the complexity of the model, importance screening is required, a random forest method is adopted to determine the importance of a temperature element by adding noise to other elements, and the greater the loss, the more critical the element to which noise is added. The results are shown in Table 1, targeting air temperature.
Table 1 importance score for variables
Figure SMS_2
S12, extracting multi-element forecast data
And carrying out numerical mode forecast data extraction on European fine grid data by taking the station as the center to obtain space data represented by time sequence, wherein the space range of the data is 5 degrees, the precision is 0.125 degree, and the time interval is 3 or 6 hours. The MCNN model adopts 2 m air temperature, ground air pressure, ground temperature, total cloud quantity and topographic data as selection elements to form a forecast data set, and the record of the data set is expressed as a numerical forecast space-time sequence P:
Figure SMS_3
where L 'represents the length of the time series, N' represents the number of meteorological elements, and W H represents the size of the spatial grid.
S13, data normalization
And normalizing the two types of data in S11 and S12 by adopting a zero mean normalization method.
S'=(S-S m )/S s (3)
P'=(P-P m )/P s (4)
Where S 'and P' are values after normalization, S and P are values before normalization, S m Is the mean value of S s Is the standard deviation of S, P m Is the average value of P, P s Is the standard deviation of P.
S2, three-dimensional convolution feature extraction:
convolutional neural networks have played a vital role in computer vision. The convolution kernel has a local receptive field that captures local features and performs spatial downsampling to continually extract more detailed features. For the forecast grid data, convolution operation can be performed, and meteorological element information can be extracted from the forecast data. According to the invention, a module is constructed by utilizing local receptive field and space downsampling of a convolution kernel to extract a time sequence of characteristic information of prediction data near a site and match the time sequence with historical data.
FIG. 1 shows the structure of the three-dimensional convolution model, and a three-dimensional convolution module (CNN) is constructed, comprising a series of spatial convolutions, downsampling, activation functions, full connection, etc., to finally form a time sequence containing spatial features, the convolution module is input as a normalized numerical prediction spatiotemporal sequence P', and output as a time sequence P containing spatial feature information 1
For the present station, 81×81 grids are selected herein, and european fine grid data aged for 0-120 hours is used as the space-time information around the station, i.e., P in step S12 is normalized P'. Features are extracted from the grid data over a time sequence using a multi-layer convolution operation, activated using a LeakyRelu after downsampling, and finally the output is expanded to one-dimensional information through a fully connected layer.
The three-dimensional convolution module CNN network adopted by the invention comprises the following components: 1. a three-dimensional space convolution layer Conv3d (5,32,3) for improving the channel number to 32; 2. a three-dimensional space convolution layer Conv3d (32,64,3) for raising the channel number to 64; 3. the first space downsampling layer is MaxPool3d (3), and the length and the width of the grid are reduced by 3 times; 4. a three-dimensional space convolution layer Conv3d (64,32,3) for reducing the channel number to 32; 5. a three-dimensional space convolution layer Conv3d (32,16,3) for reducing the channel number to 16; 6. the second space downsampling layer is MaxPool3d (3), and the length and width of the grid are reduced by 3 times again; 7. a three-dimensional space convolution layer Conv3d (16,4,3) for reducing the channel number to 4; 8. a three-dimensional space convolution layer Conv3d (4,1,3) for reducing the channel number to 1; 9. a first activation function layer LeakyRelu (); 10. a first full link layer Linear (); 11. a second activation function layer LeakyRelu (); 12. and a second full link layer Linear ().
S3, an interactive learning model (ICM):
an interactive learning model ICM is constructed, and for a single-station historical observation sequence and a three-dimensional convolution extracted time sequence, local correlation of the time sequence is extracted by using a one-dimensional convolution operation (cnn 1 d) respectively, and a hierarchical structure is built to extract sequences of different time scales.
The detailed structure of the ICM model is shown in FIG. 2. The input time sequence is layer 1, the input time sequence is divided into a sequence pair according to parity to be layer 2, each sequence of layer 2 is divided into a sequence pair according to parity, and so on. The process for each small sequence pair is denoted as an Interactive Convolution Block (ICB), and interactive learning is performed. And splicing the outputs of all ICBs of the last layer, and obtaining the output of the ICM model through the full-connection layer.
Each small sequence is split and downsampled to generate 1 subsequence pair, and the process is as follows:
X odd =X 2i-1 ,i=1,2,…,L/2 (5)
X even =X 2i ,i=1,2,…,L/2 (6)
wherein X represents the time sequence P of which the initial sequence is a single-station historical observation sequence S' or the numerical forecast is output through the S2 step 1 Layer-by-layer Split sequences (Split). X is X odd 、X even The downsampled pairs of subsequences are represented as an even term sequence and an odd term sequence, respectively. Downsampling the original sequence in the time dimension enables the module to study dynamic information of different time resolutions.
The Interactive Convolution Block (ICB) processing steps for each sub-sequence pair are: 1) Sub-sequence X odd And X even Respectively using one-dimensional convolution extraction network (cnn 1 d), converting into exp form, respectively performing dot product multiplication with the atomic sequence of each other, and obtaining the final product
Figure SMS_4
And->
Figure SMS_5
2) Extracting network by repeated one-dimensional convolution to obtain ∈Di-L>
Figure SMS_6
And->
Figure SMS_7
Projection to hidden state, added to +.>
Figure SMS_8
And->
Figure SMS_9
The output X 'of the ICB module is obtained as in equation (8)' odd And X' even
Figure SMS_10
Figure SMS_11
Wherein sigma,
Figure SMS_12
ρ, τ are both one-dimensional convolution extraction networks (cnn 1 d), as shown in fig. 3, comprising: one-dimensional filled replicarpad 1d, one-dimensional convolution Conv1d, leakyRelu activation, dropout layer, one-dimensional convolution Conv1d, tanh activation. Where T represents the sequence length, C represents the number of channels, and K represents the convolution kernel.
Finally, through repeated downsampling, updating time sequences X 'with different resolutions through a one-dimensional convolution extraction network cnn1d and an interactive convolution block ICB' odd And X' even The sub-sequences are then rearranged by an inverted parity splitting operation and concatenated into a new sequence, which is added to the original sequence by residual concatenation. In addition, a final output result is generated by the connection layer. The specific operation is as follows:
X'=Revese(X' odd ,X' even ) (9)
X out =Linear(X+X') (10)
wherein X' represents a new sequence representation, X out Denote the final output, and Linear denotes the full link layer.
4. Hybrid convolutional network Model (MCNN):
MCNN is the overall deep neural network of the present invention, as shown in fig. 4, and specifically includes the following steps: 1) Using a random forest to take S 'after normalization of a meteorological element S with high contribution as input one, and taking P' after normalization of a European fine grid numerical forecast plus terrain data sequence P as input two; 2) Input two P' uses three-dimensional convolution module (CNN) to extract space sequence feature P 1 The method comprises the steps of carrying out a first treatment on the surface of the 3) Input S' and feature P using an interactive learning model (ICM) 1 Respectively performing interactive learning to obtain Y 1 And Y 2 In this case, the intermediate loss function L can be obtained 1 And L 2 The method comprises the steps of carrying out a first treatment on the surface of the 4) Time sequence Y 1 And Y 2 Projecting (one-dimensional convolution Conv1 d) to obtain two equal-length sequence splicing (cat) to obtain Y 3 I.e. (fusion block); 5) Output Y from the previous step 3 And then obtaining Y through an interactive convolutional network (ICM) 4 ;6)Y 4 Forming a predicted result Y through full connection (Linear), and obtaining a loss function L 3 . The main operation is as follows:
Y 1 =ICM(S') (11)
Y 2 =ICM(CNN(P')) (12)
Y 3 =CAT(α(Y 1 ),β(Y 2 )) (13)
Y=Linear(ICM(Y 3 )) (14)
wherein S 'represents input one, P' represents input two, CNN represents a three-dimensional convolution module, ICM represents an interactive learning model, alpha and beta represent one-dimensional convolution (Conv 1 d), the function is to adjust the sequence length, CAT represents the combination, the function is to combine the characteristics of two branches, and Linear represents full connection.
In addition, two intermediate generation results based on intermediate supervision construction loss functions, supervision single-station historical data and numerical forecast data are used in the whole MCNN network model training. The total loss function comprises the loss function before the fusion of the two branch information, and is specifically expressed as follows:
L=aL 1 +bL 2 +cL 3 (15)
wherein L is 1 、L 2 And L 3 Representing Loss function of Loss1 for Loss1, tables a, b and c, respectivelyThe three weight parameters of the loss function are shown, and the sum of the weight parameters is 1, for example, can be set to 0.5, 0.1 and 0.4.
5. And (3) predicting results:
and according to the trained model, using new detection data and numerical forecast data, preprocessing, importing the model, and predicting a temperature sequence.
Effect verification
Fig. 5 is a test set test example in which a blue line represents a single-station measured temperature, a red line represents a predicted temperature, an abscissa represents a time of prediction, and an ordinate represents a temperature value. Through inspection statistics of 145 records, the average absolute error of the air temperature of the model prediction time by time is 1.2 degrees under the condition of 120 hours, and the accuracy rate is 93 percent under the condition that the error is less than 3 degrees.
Fig. 6-8 are updated results of time-by-time air temperature prediction applications, wherein blue lines represent measured temperatures for a single station, red lines represent predicted temperatures, abscissas represent predicted times, ordinates represent temperature values, bars represent predicted differences, red represents absolute differences greater than 3, mes is the mean absolute error, ACC is the accuracy. The results in conclusion show that the invention achieves good effects.
The time-by-time air temperature forecasting method MCNN (Mixed convolution neural network) based on data fusion and mixed convolution can effectively solve the problem that the forecasting accuracy of a product is low by simply using site data or mode numerical forecasting, improves the time-by-time air temperature forecasting accuracy, and can enable the forecasting accuracy to reach 92% under the condition of 3 degrees of error.
The foregoing is merely a preferred embodiment of the present invention, and it should be noted that modifications and variations could be made by those skilled in the art without departing from the technical principles of the present invention, and such modifications and variations should also be regarded as being within the scope of the invention.

Claims (10)

1. A single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution is characterized by comprising the following steps:
s1, data preprocessing:
s11, selecting important single-station meteorological observation elements to obtain a single-station historical observation sequence;
s12, extracting multi-element prediction data to obtain a numerical prediction space-time sequence;
s13, data normalization;
s2, three-dimensional convolution feature extraction:
constructing a three-dimensional convolution module, inputting the normalized numerical forecasting space-time sequence, and finally producing a time sequence containing space features;
s3, an interactive learning model ICM:
constructing an interactive learning model ICM, extracting local correlation of the time sequence by using a one-dimensional convolution filter cnn1d for the single-station historical observation sequence and the three-dimensional convolution extracted time sequence, establishing a hierarchical structure to extract sequences with different time scales, and generating time characteristic information of two data;
s4, a mixed convolution network model MCNN:
constructing an MCNN network model, fusing the two branch time characteristic information obtained in the S3, and forming a prediction result through interactive learning and full connection again; training the MCNN network model;
s5, predicting results:
and according to the trained MCNN network model, using latest observation data of the site and numerical mode forecast data, preprocessing, importing the model, and outputting 72-120 hours of time-by-time air temperature forecast data.
2. The single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution as claimed in claim 1, wherein said step S11 specifically comprises: selecting meteorological observation elements to obtain a single-station historical observation sequence S:
Figure FDA0004063605030000011
where L is the time series length, N is the element number, and the importance of the element is determined by adding noise to each other element for the temperature element by using a random forest method, and the larger the loss, the more critical the element to which noise is added.
3. The single-station time-by-time air temperature prediction method based on data fusion and mixed convolution as defined in claim 2, wherein the step S12 specifically includes: carrying out numerical mode forecast data extraction on European fine grid data by taking the station as the center to obtain space data represented by time sequence, wherein the space range of the data is 5 degrees, the precision is 0.125 degrees, the time interval is 3 or 6 hours, meteorological elements are selected to form a forecast data set, and the record of the data set is represented as a numerical forecast space-time sequence P:
Figure FDA0004063605030000021
where L 'represents the length of the time series, N' represents the number of meteorological elements, and W H represents the size of the spatial grid.
4. The single-station time-by-time air temperature prediction method based on data fusion and mixed convolution as defined in claim 3, wherein said step S13 specifically includes: adopting a zero mean value normalization method to normalize two types of data in S11 and S12,
S′=(S-S m )/S s (3)
P′=(P-P m )/P s (4)
where S 'and P' are values after normalization, S and P are values before normalization, S m Is the mean value of S s Is the standard deviation of S, P m Is the average value of P, P s Is the standard deviation of P.
5. The single-station time-by-time air temperature prediction method based on data fusion and mixed convolution as defined in claim 4, wherein said step S21 specifically comprises: a three-dimensional convolution module CNN is constructed, comprising: spatial convolution, downsampling, activation function and full concatenation, the convolution module input being the numerical forecast after normalizationThe empty sequence P' uses multi-layer convolution operation to extract the characteristics from the grid data on the time sequence, uses LeakyRelu to activate after downsampling, finally expands the output into one-dimensional information through the full connection layer, and outputs the time sequence P containing the space characteristic information 1
6. The single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution as claimed in claim 5, wherein the three-dimensional convolution module CNN network selects 81×81 grids, comprising: a three-dimensional space convolution layer Conv3d (5,32,3) for improving the channel number to 32; a three-dimensional space convolution layer Conv3d (32,64,3) for raising the channel number to 64; the first space downsampling layer is MaxPool3d (3), and the length and the width of the grid are reduced by 3 times; a three-dimensional space convolution layer Conv3d (64,32,3) for reducing the channel number to 32; a three-dimensional space convolution layer Conv3d (32,16,3) for reducing the channel number to 16; the second space downsampling layer is MaxPool3d (3), and the length and width of the grid are reduced by 3 times again; a three-dimensional space convolution layer Conv3d (16,4,3) for reducing the channel number to 4; a three-dimensional space convolution layer Conv3d (4,1,3) for reducing the channel number to 1; a first activation function layer LeakyRelu (); a first full link layer Linear (); a second activation function layer LeakyRelu (); and a second full link layer Linear ().
7. The single-station time-by-time air temperature prediction method based on data fusion and mixed convolution according to claim 5 or 6, wherein in the step S3, the ICM model includes: the input time sequence is layer 1, the input time sequence is split into a sequence pair according to parity to be layer 2, each sequence of layer 2 is split into a sequence pair according to parity, and so on; and (3) recording the processing of each small sequence pair as an interactive convolution block ICB, performing interactive learning, splicing the outputs of all ICBs of the last layer, and obtaining the output of the ICM model through a full connection layer.
8. The single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution as claimed in claim 7, wherein each small sequence is split and downsampled to generate 1 subsequence pair, and the process is as follows:
X odd =X 2i-1 ,i=1,2,...,L/2 (5)
X even =X 2i ,i=1,2,...,L/2 (6)
wherein X represents the time sequence P of which the initial sequence is a single-station historical observation sequence S' or the numerical forecast is output through the S2 step 1 Is a layer-by-layer split sequence of (a); x is X odd 、X even Representing the sub-sequence pairs after downsampling, which are respectively an even term sequence and an odd term sequence;
the interactive convolution block ICB processing steps for each sub-sequence pair are: 1) Sub-sequence X odd And X even Extracting the network cnn1d by using one-dimensional convolution respectively, converting the network cnn1d into an exp form, and multiplying the obtained product by dot products of the atomic sequences of the other party respectively, wherein the formula (7) is obtained
Figure FDA0004063605030000031
And->
Figure FDA0004063605030000032
2) Extracting network by repeated one-dimensional convolution to obtain ∈Di-L>
Figure FDA0004063605030000033
And->
Figure FDA0004063605030000034
Projection to hidden state, added to +.>
Figure FDA0004063605030000035
And->
Figure FDA0004063605030000036
See equation (8) to obtain the output X 'of the ICB module' odd And X' even
Figure FDA0004063605030000037
Figure FDA0004063605030000038
Wherein sigma,
Figure FDA0004063605030000039
ρ, τ are both one-dimensional convolution extraction networks (cnn 1 d) comprising: one-dimensional filled replicarpad 1d, one-dimensional convolution Conv1d, leakyRelu activation, dropout layer, one-dimensional convolution Conv1d, tanh activation; wherein T represents the sequence length;
finally, through repeated downsampling, updating time sequences X 'with different resolutions through a one-dimensional convolution extraction network cnn1d and an interactive convolution block ICB' odd And X' even The sub-sequences are then rearranged by an inverted parity splitting operation and concatenated into a new sequence, which is added to the original sequence by residual concatenation; in addition, a final output result is generated through the connection layer; the specific operation is as follows:
X′=Revese(X′ odd ,X′ even ) (9)
X out =Linear(X+X′) (10)
wherein X' represents a new sequence representation, X out Denote the final output, and Linear denotes the full link layer.
9. The single-station time-by-time air temperature prediction method based on data fusion and mixed convolution according to claim 8, wherein in the step S4, the MCNN network model includes: 1) Using a random forest to take S 'after normalization of a meteorological element S with high contribution as input one, and taking P' after normalization of a European fine grid numerical forecast plus terrain data sequence P as input two; 2) The input two P' uses a three-dimensional convolution module CNN to extract the spatial sequence feature P 1 The method comprises the steps of carrying out a first treatment on the surface of the 3) Input S' and feature P using an interactive learning model ICM 1 Respectively performing interactive learning to obtain Y 1 And Y 2 In this case, too, an intermediate can be obtainedLoss function L 1 And L 2 The method comprises the steps of carrying out a first treatment on the surface of the 4) Time sequence Y 1 And Y 2 Performing one-dimensional convolution Conv1d projection to obtain two equal-length sequence spliced CAT to obtain Y 3 The method comprises the steps of carrying out a first treatment on the surface of the 5) Output Y from the previous step 3 And obtaining Y through an interactive learning model ICM 4 ;6)Y 4 Forming a predicted result Y through the full-connection layer, and obtaining a loss function L at the moment 3 The method comprises the steps of carrying out a first treatment on the surface of the The method comprises the following steps:
Y 1 =ICM(S′) (11)
Y 2 =ICM(CNN(P′)) (12)
Y 3 =CAT(α(Y 1 ),β(Y 2 )) (13)
Y=Linear(ICM(Y 3 )) (14)
s 'represents input one, P' represents input two, CNN represents a three-dimensional convolution module, ICM represents an interactive learning model, alpha and beta represent one-dimensional convolution Conv1d, the function is to adjust the sequence length, CAT represents the combination, the function is to combine the characteristics of two branches, and Linear represents full connection.
10. The single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution as claimed in claim 9, wherein a loss function is built based on intermediate supervision in the whole MCNN network model training, two intermediate generation results of single-station historical data and numerical forecasting data are supervised, and the total loss function comprises loss functions before two branch information fusion, and specifically comprises the following steps:
L=aL 1 +bL 2 +cL 3 (15)
wherein L is 1 、L 2 And L 3 The Loss function of Loss1 is represented, a, b and c respectively represent weight parameters of the three Loss functions, and the sum of the weight parameters is 1.
CN202310069184.6A 2023-01-16 2023-01-16 Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution Pending CN116400433A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310069184.6A CN116400433A (en) 2023-01-16 2023-01-16 Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310069184.6A CN116400433A (en) 2023-01-16 2023-01-16 Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution

Publications (1)

Publication Number Publication Date
CN116400433A true CN116400433A (en) 2023-07-07

Family

ID=87009272

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310069184.6A Pending CN116400433A (en) 2023-01-16 2023-01-16 Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution

Country Status (1)

Country Link
CN (1) CN116400433A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117034780A (en) * 2023-08-31 2023-11-10 江苏省气候中心 Multi-scale sub-season precipitation prediction method based on deep learning
CN117973636A (en) * 2024-03-28 2024-05-03 南京信息工程大学 General traffic weather short-term forecasting method, device and storage medium
CN118735082A (en) * 2024-09-03 2024-10-01 南京信息工程大学 3D-TimesNet-based sub-season air temperature prediction correction method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117034780A (en) * 2023-08-31 2023-11-10 江苏省气候中心 Multi-scale sub-season precipitation prediction method based on deep learning
CN117034780B (en) * 2023-08-31 2024-04-19 江苏省气候中心 Multi-scale sub-season precipitation prediction method based on deep learning
CN117973636A (en) * 2024-03-28 2024-05-03 南京信息工程大学 General traffic weather short-term forecasting method, device and storage medium
CN117973636B (en) * 2024-03-28 2024-05-31 南京信息工程大学 General traffic weather short-term forecasting method, device and storage medium
CN118735082A (en) * 2024-09-03 2024-10-01 南京信息工程大学 3D-TimesNet-based sub-season air temperature prediction correction method

Similar Documents

Publication Publication Date Title
CN116400433A (en) Single-station time-by-time air temperature forecasting method based on data fusion and mixed convolution
CN112381097A (en) Scene semantic segmentation method based on deep learning
CN112766283B (en) Two-phase flow pattern identification method based on multi-scale convolution network
CN114692509B (en) Strong noise single photon three-dimensional reconstruction method based on multi-stage degeneration neural network
CN109785279B (en) Image fusion reconstruction method based on deep learning
CN113298131B (en) Attention mechanism-based time sequence data missing value interpolation method
CN114091615B (en) Electric energy metering data complement method and system based on generation countermeasure network
CN112183886A (en) Short-time approaching rainfall prediction method based on convolutional network and attention mechanism
CN112529415B (en) Article scoring method based on combined multiple receptive field graph neural network
US20220372891A1 (en) Method for stability analysis of combustion chamber of gas turbine engine based on image sequence analysis
CN116819615B (en) Seismic data reconstruction method
CN110991317A (en) Crowd counting method based on multi-scale perspective sensing type network
CN114037143A (en) Short-term wind power combination prediction method
CN110648055A (en) Electric power accident event and cause relation construction method based on convolutional neural network
CN112100486A (en) Deep learning recommendation system and method based on graph model
Ye et al. SPLNet: A sequence-to-one learning network with time-variant structure for regional wind speed prediction
CN115457307A (en) Chemical process fault diagnosis method based on improved residual error network
CN115032720A (en) Application of multi-mode integrated forecast based on random forest in ground air temperature forecast
CN115170874A (en) Self-distillation implementation method based on decoupling distillation loss
JP2001344590A (en) Neural network and method for learning the same and method for analyzing the same and method for judging abnormality
CN116992607A (en) Structural topology optimization method, system and device
CN116706907A (en) Photovoltaic power generation prediction method based on fuzzy reasoning and related equipment
CN113537573B (en) Wind power operation trend prediction method based on double space-time feature extraction
CN109325585A (en) The shot and long term memory network partially connected method decomposed based on tensor ring
CN113392139A (en) Multi-element time sequence completion method and system based on association fusion

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination