CN112532439B

CN112532439B - Network flow prediction method based on attention multi-component space-time cross-domain neural network model

Info

Publication number: CN112532439B
Application number: CN202011326835.8A
Authority: CN
Inventors: 陈赓; 曾庆田; 孙强; 段华; 邵睿; 徐先杰; 张旭
Original assignee: Shandong University of Science and Technology
Current assignee: Shandong University of Science and Technology
Priority date: 2020-11-24
Filing date: 2020-11-24
Publication date: 2022-08-23
Anticipated expiration: 2040-11-24
Also published as: CN112532439A

Abstract

The invention discloses a network flow prediction method based on an attention multi-component space-time cross-domain neural network model, belongs to the technical field of intelligent communication, and solves the problem of prediction of wireless cellular network flow. Firstly, dividing wireless cellular traffic data into neighboring data, daily cycle data and weekly cycle data according to the periodic characteristics of the wireless cellular traffic data; then modeling the neighboring data, the daily cycle data and the weekly cycle data through a conv-LSTM structure or a conv-GRU structure; different weights are adaptively distributed to the three kinds of characteristic data through an attention layer, so that the characteristic extraction capability of the three kinds of characteristic data is improved, and the characteristic information which interferes with the prediction moment is inhibited; and finally, combining the embedding of the timestamp characteristics, fusing various cross-domain data and carrying out flow prediction by using a common auxiliary model. The model can effectively utilize the periodic characteristics of wireless cellular flow data, saves model training time, greatly reduces workload, and further improves the prediction performance of network flow.

Description

Network flow prediction method based on attention multi-component space-time cross-domain neural network model

Technical Field

The invention belongs to the technical field of intelligent communication, and particularly relates to a network flow prediction method based on an attention multi-component space-time cross-domain neural network model.

Background

With the advent of the age of 5G/B5G, the number of mobile devices and internet of things has exponentially increased around the world, and the demand for wireless mobile data has rapidly increased. How to scientifically and reasonably allocate and optimize the existing cellular network resources, improve the utilization rate of the resources and reduce the energy consumption of the cellular base station is a problem to be considered and solved in the communication industry.

The method and the device can accurately predict the wireless cellular flow, and are beneficial to the development of base station site selection, urban area planning, regional flow prediction and other works. However, accurate prediction of wireless traffic flow is a very challenging problem, mainly for the following 3 reasons. First, the generation source of wireless communication network traffic is a user having mobility, and the mobility of the wireless user causes spatial dependency of traffic among a plurality of areas. In particular, the advent of new types of traffic has enabled people to reach from one end of a city to the other in a short period of time. This makes the spatial dependence of wireless traffic not only local, but rather has a large-scale global dependence. On the other hand, the wireless traffic flow also has a dependency in the time dimension, and the traffic value at a certain time has a high correlation with the traffic values at its neighboring time (short-term dependency) and the corresponding time of a certain day (periodicity). Second, the spatial constraint problem that multi-source cross-domain data creates for wireless traffic. The reasons for affecting the generation of wireless traffic in a certain area are diverse. When traffic prediction is performed, not only is a regular pattern implied by wireless service traffic mined only from the perspective of historical data, but also spatial constraint factors generated by other cross-domain and cross-source data on the traffic should be considered. Factors such as base station data of a certain area, point of interest information, social activity level of an area, etc. all affect the change of traffic. Therefore, how to efficiently fuse the multi-source cross-domain data which seems not to have a direct relation with the wireless service flow is a difficult problem to be solved at present. Third, how to achieve high accuracy of wireless cellular traffic prediction in consideration of space-time factors and in combination with cross-domain data is also a difficult problem.

Currently, the main model methods for wireless cellular traffic prediction are: (1) an integrated moving average autoregressive model (ARIMA); (2) an exponential smoothing method (ES); (3) linear regression method (LR); (4) support vector machine regression method (SVR); (5) multi-layer perceptron Method (MLP); (6) based on long-short time memory network method (LSTM); (7) convolutional neural network based approach (CNN); (7) an LSTM model; (8) a space-time convolutional network STDenseNet model; (9) an STNet model; (10) a STMNet model; (11) STCNet model, etc. These model methods for solving the wireless cellular network traffic prediction are considered from the aspects of space factors, time factors, space-time factors and the like.

Disclosure of Invention

The invention provides a multi-component space-time cross-domain neural network model method based on an attention mechanism, which is used for efficiently fusing multi-source cross-domain data, fully utilizing the time-space characteristics of the multi-source cross-domain data, introducing the attention mechanism and further improving the accuracy of wireless cellular flow prediction.

In order to achieve the purpose, the invention adopts the following technical scheme:

a network flow prediction method based on attention multi-component space-time cross-domain neural network model comprises the following steps:

(1) performing Pearson correlation analysis and matrixing processing on three service data, namely short message service data, telephone service data and Internet service data;

(2) gridding and dividing different areas, and clustering and classifying the areas;

(3) wireless cellular flow of the three services is divided into neighbor data, day cycle data and week cycle data according to the characteristics of the neighbor, day cycle and week cycle;

(4) performing correlation analysis and matrixing processing on the cross-domain data, and fusing;

(5) extracting the characteristics of the timestamp of the wireless cellular traffic;

(6) and fusing and inputting various cross-domain data and service data into the attention multi-component space-time cross-domain neural network model, and finally outputting a wireless cellular network flow prediction result.

Preferably, the step (1) specifically comprises the following steps:

(1-1) analyzing the correlation among three service data, namely short messages, telephone and the Internet, and analyzing the periodicity and difference of different service data and the difference of different regional data;

(1-2) processing three service data of short message, telephone and Internet into three 100 x 100 matrixes with the same size, wherein each element in the matrixes represents a flow data value of a certain service.

Preferably, the step (2) specifically comprises the following steps:

(2-1) dividing the predicted area into 100 x 100 grid areas, each grid corresponding to a data value of wireless cellular traffic of a certain service;

(2-2) according to the similarity and the difference of wireless cellular traffic of different areas, the similar areas are gathered together to obtain three different classes, and then model training is carried out on the different classes.

Preferably, the step (3) specifically comprises the following steps:

(3-1) extracting neighbor data of the wireless cellular traffic data, the neighbor data being a cellular traffic data sequence segment of a history time period directly adjacent to the predicted time t;

(3-2) extracting daily cycle data of the wireless cellular traffic data, the daily cycle data being a cellular traffic data sequence segment which is the same as the predicted target time in n days before the predicted t time;

and (3-3) extracting cycle-per-cycle data of the wireless cellular traffic data, wherein the cycle-per-cycle data is a cellular traffic data sequence segment which has the same attribute and the same time as the n previous weeks of the predicted t time and the predicted target week.

Preferably, the step (4) specifically includes the following steps:

and (4-1) performing Pearson correlation coefficient analysis on the acquired three cross-domain data sets of Social information Social, base station distribution BS and interest point distribution POI to obtain the correlation, similarity and correlation characteristics of cross-domain data and different service data.

(4-2) processing the three kinds of cross-domain data into three matrices of 100 × 100;

and (4-3) packaging the three kinds of cross-domain data subjected to the matrixing processing into a tensor.

Preferably, the step (5) specifically comprises the following steps:

(5-1) extracting four characteristic attributes of week, hour, workday and weekend from the timestamp of the wireless cellular traffic and processing the four characteristic attributes into a vector;

(5-2) transforming the processed vector into a 100 x 100 matrix.

Preferably, the step (6) specifically comprises the following steps:

(6-1) cellular traffic D with close proximity _t ^h Day-periodic cellular traffic D _t ^d And periodic cellular traffic D _t ^w Three parts of characteristic input data are led into a conv-GRU structure with two layers, pass through an attention layer and distribute different weights to the flow data with the adjacency, the daily periodicity and the weekly periodicity through the attention layer;

(6-2) putting the timestamp feature matrix into a two-layer fully-connected neural network for embedded learning, and performing timestamp feature modeling;

(6-3) fusing three kinds of cross-domain data including base station distribution BS, Social information Social and POI (point of interest) of the prediction area into a set D _cross Introducing the two-layer convolutional neural network into a two-layer convolutional neural network for spatial correlation modeling;

and (6-4) splicing the initial characteristic outputs of the steps into a new tensor according to the specified dimensionality, inputting the new tensor into a dense connection convolutional network DenseNet, and finally outputting a flow prediction result.

Preferably, in the step (6-1), the attention model is divided into an input layer, a hidden layer and an attention layer, and the specific process is as follows:

1) firstly, a conv-GRU structure is utilized to obtain a hidden layer state (h) ₁ ，h ₂ ，…，h _t )；

2) Calculating the influence e of each current input position on the current position i _t The formula is as follows:

wherein v is _a 、W _a 、U _a Is the weight value of the attention network, relu (-) is the activation function, T is the total number of time intervals, S is the current input shapeState;

3) to e _t Performing softmax normalization to obtain an attention weight distribution, wherein the formula is as follows:

wherein exp (·) is an exponential function with a natural constant e as the base;

4) using alpha _t Weighted summation is carried out to obtain a vector c _t The formula is as follows:

preferably, in the step (6-1), the conv-GRU structure may be replaced with a conv-LSTM structure.

Preferably, in said step (6-4), the DenseNet network comprises L layers, each layer implementing a complex function transformation, the complex function comprising the batch regularization BN, the activation function Relu, and the convolution operation Conv.

The invention has the following beneficial technical effects:

the method of the invention starts from the characteristics of time and space factors, carries out fine grained division on wireless cellular traffic data, fully utilizes the time-space characteristics thereof, introduces an attention mechanism layer to adaptively distribute weight to the divided wireless cellular traffic, improves the characteristic extraction capability thereof, inhibits the characteristic information which generates interference to the prediction moment, and further improves the prediction efficiency and accuracy of the model to the wireless cellular traffic.

Drawings

FIG. 1 is a flow chart of data preprocessing according to an embodiment of the present invention;

fig. 2(a), (b), and (c) are dynamic characteristic graphs of three different services of Sms, Call, and Internet in different areas in the time dimension, respectively, according to an embodiment of the present invention;

FIG. 3 is a diagram illustrating Pearson correlation between wireless traffic and a cross-domain data set according to an embodiment of the present invention;

FIG. 4 is a diagram illustrating a tensor T according to an embodiment of the present invention;

FIG. 5 is a schematic diagram of a multi-component spatiotemporal cross-domain neural network model (att-MCSTCNet) based on an attention mechanism according to an embodiment of the present invention;

FIG. 6 is a diagram of a wireless cellular traffic time series segment as input by an embodiment of the present invention;

FIG. 7 is a schematic diagram of an LSTM structure according to an embodiment of the present invention;

FIG. 8 is a schematic structural diagram of a GRU according to an embodiment of the present invention;

fig. 9 is a schematic structural diagram of an attention according to an embodiment of the invention.

Detailed Description

The invention is described in further detail below with reference to the following figures and detailed description:

the method comprises the following specific steps:

(1) performing Pearson correlation analysis and matrixing processing on three data including short messages, telephone calls and the Internet: analyzing the correlation among three service data, namely short messages, telephone and the Internet, and analyzing the periodicity and the difference of different service data and the difference of different regional data; processing three kinds of service data of short messages, telephones and the Internet into three matrixes with the same size, namely 100 multiplied by 100; wherein each element in the matrix represents a traffic data value for a certain service.

(2) Gridding and dividing the Milan city, and clustering and classifying the divided different areas: dividing a predicted region (Milan city) into 100 x 100 grid regions, enabling each grid to correspond to a data value of wireless cellular traffic of a certain service of the matrix, gathering similar regions together according to similarity and difference of the wireless cellular traffic of different regions to obtain three different classes, and then performing model training on the different classes.

(3) Firstly, extracting neighbor data of wireless cellular traffic data, namely cellular traffic data sequence segments of a period of historical time directly adjacent to the predicted t moment; then, extracting daily cycle data of the wireless cellular traffic data, namely cellular traffic data sequence segments which are the same as the predicted target time in n days before the predicted t time; and finally, extracting cycle data of the wireless cellular traffic data, namely cellular traffic data sequence segments which have the same attribute and the same time as the target week of the previous n weeks of the predicted time t.

(4) Performing correlation analysis and matrixing processing on cross-domain data, and fusing: performing Pearson correlation coefficient analysis on the acquired three cross-domain data sets of Social information (Sociai), base station distribution (BS) and point of interest (POI) to obtain the characteristics of correlation, similarity, correlation degree and the like of cross-domain data and different service data; then, the three kinds of cross-domain data are also processed into three matrixes of 100 multiplied by 100, and the sizes of the matrixes are kept consistent with those of the other services; and encapsulating the three types of cross-domain data subjected to matrixing into a tensor.

(5) Performing feature extraction on the time stamp of the wireless cellular traffic: extracting four characteristic attributes of week, hour, working day and weekend from the timestamp of the wireless cellular flow and processing the four characteristic attributes into a vector; the processed vector is transformed into a 100 x 100 matrix, which is guaranteed to be the same size as the matrix.

(6) First, for neighbor cellular traffic D _t ^h Day-cycle cellular traffic D _t ^d Periodic cellular traffic D _t ^w Modeling, wherein three part characteristic inputs are led into a conv-LSTM or conv-GRU structure of two layers, and then the weight of historical cellular traffic information which is more critical to the target moment is increased through an attention layer, so that the weight of other information is reduced, the purpose of filtering irrelevant information is achieved, and the efficiency and the accuracy of wireless cellular traffic prediction are further improved. Secondly, performing feature modeling on the timestamp, putting a timestamp feature matrix into a two-layer fully-connected neural network for embedded learning, and then performing spatial correlation modeling on cross-domain data; wherein D _cross Three types of base station distribution BS, Social information Social and interest point distribution POI in the area are fused into a set of three types of cross-domain data, and a fused cross-domain data set D _cross A two-layer convolutional neural network is introduced to process the data and is used for assisting the prediction of wireless cellular traffic. Finally, the preliminary characteristic outputs are spliced and combined into a new one according to the specified dimensionalityTensor, and input to a dense connected convolutional network (DenseNet) which contains L layers in total, each layer implementing a complex function transformation, which is the same as the operation in cross-domain data feature learning, and contains batch regularization (BN), activation function (Relu), and convolution operation (Conv).

Further details are provided below.

(1) Data matrixing and correlation analysis of wireless cellular traffic.

Fig. 1 is a flow chart of data processing. The first step is data cleaning. In the experiment, wireless cellular flow data of short messages, telephones and the Internet are extracted, and for the missing traffic data in a certain time period in a certain area, the average traffic value of the surrounding area or the time period is used for filling. And step two, screening data. Since the recording time interval of the original data is 10 minutes, the recorded data value is mostly 0, which results in sparseness of data values. The data is divided by the number of hours, normalized by min-max, to speed up the training process. And thirdly, aligning data. In order to make the following data, the cleaned wireless cellular traffic data, cross-domain data and the milan city are divided into 100 × 100 grid areas for one-to-one correspondence. The wireless traffic data type is represented as k, where k belongs to { Sms, Call, Internet }, and taking Internet as an example, the wireless traffic of a certain city can be represented as a tensor of T dimension according to the wireless traffic data timestamp, where T is the total number of time intervals, T is {1,2, …, T }, X and Y respectively represent a coordinate point of the city, and a city regional traffic matrix D of the T-th time slot _k,t Can be expressed as:

fig. 2 is a graph of wireless cellular traffic data after being preprocessed and cleaned, and (a), (b) and (c) specifically show the dynamic characteristics of three services in the time dimension under three different areas. The three differently shaped curves in fig. 2 represent three different locations, respectively. From fig. 2, we can conclude that:

the data is periodic. Wireless cellular traffic for different services exhibits the same periodicity, for example: fig. 2(a), (b), and (c) show that the flow curves of three different services in the area of bocconi university have the same change rule. In addition, similar periodicity also exists for wireless cellular traffic in different regions, such as: in fig. 1(a), under the Sms service, the change law of the traffic curve representing three different areas shows similarity.

And difference of regional data. There is a large difference in the amount of data of wireless cellular traffic in different areas, for example: the data volume of wireless cells in this area is not very different in one week because the navelli area is a night living area of milan, while the area of bocconi university is a suburban area of milan city, so the data volume of wireless cells is relatively small.

And differences of the business data. The amount of data in wireless cellular traffic varies between different services, such as the Internet, and the duration of the traffic peaks is shorter than in the other two services.

(2) And performing matrixing processing and correlation analysis on cross-domain data.

The invention considers three data sets which have larger influence on wireless service flow, namely Social information (Social), base station distribution (BS) and point of interest (POI). Since the three data types have small variation on the time axis, the invention treats them as static data sets, and after processing, maps the data to specific areas according to the coordinate information. Referring to equation (1), equation (2), a cross-domain data set D, can be obtained _cross Is represented as follows:

for analyzing the correlation between different service flows and cross-domain data sets, a Pearson correlation coefficient is calculated, as shown in formula (3):

where cov (·) denotes covariance, and σ denotes standard deviation.

Fig. 3 is a correlation thermodynamic diagram between different services and cross-domain data sets (BS, POI, Social) obtained after Pearson correlation coefficient calculation, and the following conclusions can be drawn from fig. 3:

data correlation. The correlation of three different service data of Sms, Call and Internet is high, which shows that the model can be trained by introducing transfer learning between different service flow data.

Data similarity. The similarity between cross-domain data and wireless service flow is higher, so that the similarity can be regarded as a spatial characteristic constraint condition of the wireless service flow, and the service flow can be predicted more accurately.

And III, data correlation degree. The POI and the BS are relatively high in correlation in the cross-domain data set relative to three services (Sms, Call and Internet), and the Social is the second correlation, which shows that the influence of the POI and the BS cross-domain data set on the accurate prediction of the service flow is relatively larger than that of the Social.

Finally, we get several matrixes D with consistent size and corresponding information of each element _t 、D _meta 、D _cross Synthesizing a multidimensional tensor T, wherein the data form is shown in fig. 4, each element (such as a black square in fig. 4) in the tensor represents the traffic information of the area coordinate corresponding to a certain moment of a certain service, the timestamp information and the information of the cross-domain data set of the area, and is convenient for the following model to use, and the cross-domain data set is added as

(3) And (5) performing time stamp feature analysis processing.

In order to fully utilize the characteristics of the timestamp to perform auxiliary prediction, 4 characteristics are extracted from the timestamp, the 4 characteristics are processed into a vector m, and the vector m is processed into a tensor T with the same size as the wireless cellular traffic data set and the cross-domain data set through a full connection layer. The extracted 4 features are shown in table 1:

table 14 characteristics of time stamps

Feature(s)	Name(s)	Value taking
			1	Week	0,1,2…6
2	Hour(s)	0,1，…23
			3	Working day	0,1
4	Weekend	0,1

For example: the four feature values extracted from 12, 14, 15 in 2013 are respectively: week is 5, hour is 14, workday is 0, weekend is 1.

(4) A multi-component spatio-temporal cross-domain neural network model based on an attention mechanism.

The invention provides a multi-component space-time cross-domain neural network model based on an attention mechanism, which comprises the following 4 parts as shown in figure 5:

the first part is neighbor cellular traffic D _t ^h Day-cycle cellular traffic D _t ^d Periodic cellular traffic DD _t ^w And (6) modeling. The time sequence segment of the input wireless cellular traffic is shown as a graph. The three characteristic inputs are introduced into a conv-LSTM or conv-GRU structure with two layers, the weight of historical cellular traffic information which is more critical to the target moment is increased through an attention layer, the weight of other information is reduced, the purpose of filtering irrelevant information is achieved, and the efficiency and the accuracy of wireless cellular traffic prediction are further improved.

The second part models the display time characteristics, the input is a matrix D characterized by time stamps _meta And putting the feature matrix into a two-layer fully-connected neural network for embedded learning.

The third part is modeling cross-domain data, and the input is a cross-domain data set D _cross In which D is _cross Distributing a set of three kinds of cross-domain data including BS, Social information Social and POI for a base station; finally, the cross-domain data set D is processed _cross A two-layer convolutional neural network is introduced to process such data.

The fourth part is a feature fusion layer, the input is a new tensor formed by splicing the initial feature outputs according to the specified dimensionality, and the new tensor is input into a dense connection convolution network (Densenet), the network totally comprises L layers, each layer realizes a compound function transformation, and the compound functions are the same as the operations in cross-domain data feature learning and comprise batch regularization (BN), activation functions (Relu) and convolution operations (Conv).

The specific training process is as follows: as shown in fig. 6, the neighbor cellular traffic D associated with the predicted target time t _t ^h Day-cycle cellular traffic D _t ^d Periodic cellular traffic D _t ^w The traffic matrix (2) can be seen as a plurality of single-channel pictures input to a single conv-LSTM or conv-GRU network with two layers, so that not only can a time sequence relation be obtained, but also the characteristics can be extracted like a convolution layer to extract spatial characteristics. As shown in FIG. 7, each element of the conv-LSTM network layer has a storage unit C for storing status information, and the unit C controls the deletion and addition of data information by three gates, respectively, the input gate i _g Forgetting door f _g And an output gate o _g . Wherein, the input gate i _g A forgetting gate f for selectively storing required data information _g The redundant information is selectively 'forgotten', and the final hidden state is output from the output gate o _g And controlling and deciding important data information required by output. The key operation of conv-LSTM is as follows:

wherein σ (·) is an activation function, a convolution operation, an Hadamard product operation, W _(·) For the weight of training, H _(·) Is an output gate o _g Hidden state of (b) _(·) For the bias of training, tanh (-) is a hyperbolic tangent function,

c _τ 、

H _τ are all three-dimensional tensors, the output obtained through conv-LSTM network layer

H is the number of feature maps.

The Gate Recovery Unit (GRU) is a kind of Recurrent Neural Network (RNN), and is also a variant of LSTM, compared with LSTM, the use of GRU can achieve a comparable effect, and compared with LSTM, training is easier, and training efficiency can be improved to a great extent, so the invention adopts the conv-GRU structure. As shown in FIG. 8, r _t A reset gate for controlling the reset, called reset gate for short, the reset gate being used to control the extent to which status information of a previous moment is ignored, z _t For controlling the update gate (update gate), referred to as update gate for short, the update gate is used to control the degree to which the state information of the previous time is brought into the current state, compared to the three gates of LSTM, the parameters are reduced, but the final performance is matched, relativelyLess parameters save more resources and have faster convergence speed.

Equation (5) includes resetting gate r _t And an update gate z _t The calculation process of (2). Wherein the content of the first and second substances,

mainly comprising the current input x _t Data, is pertinently directed to

Adding to the current hidden state is equivalent to memorizing the state at the current moment. (1-z) _t )⊙h _t-1 Indicating selective "forgetting" of an originally hidden state. 1-z _t Can be regarded as a forgetting gate (forget gate), forgetting h _t-1 Some unimportant information in the dimension.

Indicating that the pair contains current node information

To perform selective memory, which can be regarded as a pair

Some information in the dimensions is selected. Therefore, one gate z _t The two operations of forgetting and selecting memory can be simultaneously carried out, which is also the advantage of the GRU structure.

Wherein h is _t-1 Is the hidden state of the previous node, which contains the relevant information of the previous node. x is the number of _t Is the current input, σ (-) is the sigmoid activation function, W _(·) For the trained weight, tanh (-), which is a hyperbolic tangent function, is a Hadamard product operation.

The attention mechanism is a solution proposed to imitate human attention, that is, a mechanism of aligning internal experience and external feeling to increase the fineness of observation of a partial region. For example, when a picture is processed by human vision, a target area needing important attention, namely an attention focus, is obtained by rapidly scanning a global image. More attention is then devoted to this area to get more detailed information about the objects that need attention and to suppress other unwanted information. For the wireless cellular traffic time sequence of the invention, for the output y at a certain moment, different attention is allocated on the hidden layer h corresponding to the input x, i.e. different weights are given to the characteristics with different importance degrees, and the characteristics are associated with the output, thereby achieving the purpose of information screening. The attention model structure is expanded as shown in fig. 9. The attention model is roughly divided into three layers: input layer, hidden layer, attention layer.

1) Firstly, a conv-GRU structure is utilized to obtain a hidden layer state (h) ₁ ，h ₂ ，...，h _t )；

wherein v is _a ，W _a ，U _a Is the weight value of the attention network, relu (·) is the activation function, T is the total number of time intervals, S is ·;

wherein exp (·) is ·;

D _meta preliminary characteristic of (A) O _meta The treatment process comprises the following steps:

o _meta ＝Reshape(o _meta ) (10)

where σ (-) is the activation function,

and

is the parameter to be learned. After being processed by a full-connection layer with two layers,

reshape has the function of converting O _meta Matrix transformation into and _t a uniform size tensor.

D _cross Preliminary characteristics of (1) _cross Can be expressed as:

o _cross ＝f(W _cross *D _cross ) (11)

wherein the content of the first and second substances,

is a splicing operation, W _cross For the parameter to be learned, f (-) is a complex function, including batch regularization, Relu activation function and convolution operation, for D _cross Performing convolution and nonlinear transformation to obtain a product _t 、O _meta Are consistent in size so as to facilitate the next splicing process.

The resulting Frobenius norm calculation of the final output:

where θ is the set of all parameters of STC-N.

The following table 2 gives the algorithm for the att-MCSTCNet model training process. A training instance is first constructed from the original sequence (lines 1-5) and then trained by back-propagation and Adam (lines 6-11).

Table 2 att-MCSTCNet model training process algorithm

It is to be understood that the above description is not intended to limit the present invention, and the present invention is not limited to the above examples, and those skilled in the art may make modifications, alterations, additions or substitutions within the spirit and scope of the present invention.

Claims

1. A network flow prediction method based on attention multi-component space-time cross-domain neural network model is characterized by comprising the following steps:

(2) carrying out gridding division on different areas, and carrying out cluster classification on the areas;

(4) performing correlation analysis and matrixing processing on the cross-domain data, and fusing; the method specifically comprises the following steps:

(4-1) performing Pearson correlation coefficient analysis on the three acquired cross-domain data sets of Social information Social, base station distribution BS and POI (point of interest) to obtain correlation, similarity and correlation characteristics of cross-domain data and different service data;

(4-3) packaging the three kinds of cross-domain data subjected to matrixing into a tensor;

(6) fusing and inputting various cross-domain data and service data into the attention multi-component space-time cross-domain neural network model, and finally outputting a wireless cellular network flow prediction result; the method specifically comprises the following steps:

(6-1) cellular traffic D with close proximity _t ^h Day-periodic cellular traffic D _t ^d And periodic cellular traffic D _t ^w Three parts of characteristic input data are led into a conv-GRU structure with two layers, and different weights are distributed to flow data with the adjacency, day periodicity and week periodicity through an attention layer;

in the step (6-1), the attention model is divided into an input layer, a hidden layer and an attention layer, and the specific process is as follows:

wherein v is _a 、W _a 、U _a Is the weight value of the attention network, relu (·) is the activation function, T is the total number of time intervals, S is the current input state;

3) to e _t Performing softmax normalization to obtain attention weight distribution, wherein the formula is as follows:

(6-3) fusing three kinds of cross-domain data including base station distribution BS, Social information Social and POI (point of interest) of the prediction area into a set D _cross Introducing the two layers of convolution neural networks into a two-layer convolution neural network for spatial correlation modeling;

2. The method for predicting network traffic based on the attention multi-component spatiotemporal cross-domain neural network model according to claim 1, wherein the step (1) comprises the following steps:

3. The method for predicting network traffic based on the attention multi-component spatiotemporal cross-domain neural network model according to claim 2, wherein the step (2) comprises the following steps:

(2-2) according to the similarity and difference of wireless cellular traffic of different regions, the similar regions are gathered together to obtain three different classes, and then model training is carried out on the different classes.

4. The method for predicting network traffic based on the attention multi-component spatiotemporal cross-domain neural network model according to claim 1, wherein the step (3) comprises the following steps:

(3-2) extracting day cycle data of the wireless cellular traffic data, wherein the day cycle data is a cellular traffic data sequence segment which is the same as the predicted target time in n days before the predicted t time;

5. The method for predicting network traffic based on the attention multi-component spatiotemporal cross-domain neural network model according to claim 1, wherein the step (5) comprises the following steps:

(5-2) converting the processed vector into a 100 x 100 matrix.

6. The method for predicting network traffic based on attention multi-component spatiotemporal cross-domain neural network model according to claim 1, wherein in the step (6-1), the conv-GRU structure can be replaced by a conv-LSTM structure.

7. The method for predicting network traffic based on the attention multi-component space-time cross-domain neural network model according to claim 1, wherein in the step (6-4), the DenseNet network comprises L layers, each layer implements a complex function transformation, and the complex function comprises batch regularization BN, activation function Relu and convolution operation Conv.