CN111798066A

CN111798066A - Multi-dimensional prediction method and system for cell flow under urban scale

Info

Publication number: CN111798066A
Application number: CN202010693644.9A
Authority: CN
Inventors: 王红; 岳秀明; 张文华; 陈梅
Original assignee: Shandong Xiehe University
Current assignee: Shandong Xiehe University
Priority date: 2020-07-17
Filing date: 2020-07-17
Publication date: 2020-10-20

Abstract

The utility model provides a multi-dimensional prediction method and system for regional flow under urban scale, which collects flow data in the region and preprocesses the collected data; different prediction models are constructed in advance, the prediction models are trained, and the prediction period and the moving time of training data of each prediction model are different; and (4) carrying out statistics by taking the error rate and the filling rate as indexes, evaluating the prediction accuracy and performance of different models in different dimensions, and selecting an optimized model to perform flow prediction to obtain a prediction result. The artificial participation degree is reduced, the operation efficiency is improved, and the convergence of the prediction model is better when the MAPE value is verified to be lower, so that the prediction accuracy is improved, an independent prediction model is trained by more than 5 ten thousand network elements, and the accurate prediction of each network element is realized.

Description

Multi-dimensional prediction method and system for cell flow under urban scale

Technical Field

The disclosure belongs to the technical field of pattern recognition, and relates to a multi-dimensional prediction method and a multi-dimensional prediction system for cell flow under an urban scale.

Background

The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.

With the continuous development of human society, cities below 5G will bear more and more intelligent terminal devices. The research of community traffic has important research significance on all social levels, such as the field of intelligent cities, the deployment and distribution of 5G network regional network resources and the number of base stations from the global perspective are realized through traffic prediction, such as planning geographic positions and the like, on the premise of ensuring the traffic demand of a community, and whether more or fewer base stations are opened is determined according to the predicted service load. The prediction technology can also be applied to the aspects of network capacity prediction, network performance early warning, user perception guarantee and the like.

With the continuous development of mobile communication technology, mobile cellular networks are applied more and more widely. The traffic of the cellular network generates a large amount of data and consumes a large amount of power. In order to satisfy the competitive design tradeoff between data service bandwidth and hardware energy cost, it is important to analyze and predict service data of a cellular network base station with emphasis while satisfying the quality of service (QoS) requirements, and to reduce the energy consumption of the base station. In particular, a base station is generally composed of three cells. When the upcoming data traffic is predicted to be low, the whole base station or a specific cell of the base station is closed in advance to reduce energy consumption; when the increase of the traffic is predicted, the base station is opened in advance to ensure the data QoS. Traffic prediction is not only a problem that an operator knows in advance to consider service perception of a user, but also a problem that the operator models overall distribution of data, groups multidimensional space and time, and predicts effectiveness of a plurality of scenes to predict application conditions of the user in the scenes.

"5 +5+ 4" multidimensional prediction. Wherein the time dimension is from year, month, week, day and hour from large to small; the spatial dimension is five levels from big to small, namely, a full network level, a district level, a county level, a micro-area level, a grid level and a cell level; the independent dimension is four types including a manufacturer, a coverage scene, a coverage type and a frequency band, and all the dimensions except the dimension of the manufacturer can be drilled below a cell level. There are a total of 5 tens of thousands of network elements that need to be predicted.

Various network capacity and flow prediction methods are provided aiming at different problems, and the methods comprise smoothing, trend fitting, a combined model, an AR model, an MA model, an ARMA model and an ARIMA model. Generally, the above method is to find the most suitable predicted trend line according to the daily traffic history data of the whole network. The accuracy of these methods is limited by factors such as historical data integrity, size of the data volume, etc. The prediction result often has an over-fitting or under-fitting phenomenon, and the deviation of the prediction result and actual data is often not obvious. With the gradual maturity of machine learning-based methods, the deep learning field is continuously developed and has been applied to traffic prediction in different scenes.

The inventor finds that an exponential smoothing method and a trend fitting method are common methods for early time series prediction, but the exponential smoothing method and the trend fitting method are not suitable for frequently changing data sets, cannot process nonlinear time series, and can only fit results of complex change rules by one method; the prediction of the AR/MA/ARMA/ARIMA model requires a plurality of manual adjustment parameters, and has low efficiency and inaccurate prediction.

Disclosure of Invention

The method and the system can reduce the manual workload, and are high in efficiency and high in prediction accuracy.

According to some embodiments, the following technical scheme is adopted in the disclosure:

a multi-dimensional prediction method for regional flow under an urban scale comprises the following steps:

collecting flow data in the area, and preprocessing the collected data;

different prediction models are constructed in advance, the prediction models are trained, and the prediction period and the moving time of training data of each prediction model are different;

and (4) carrying out statistics by taking the error rate and the filling rate as indexes, evaluating the prediction accuracy and performance of different models in different dimensions, and selecting an optimized model to perform flow prediction to obtain a prediction result.

As an alternative embodiment, the predictive model comprises at least seven of:

model 1: one week after each prediction, the training data is shifted one day each time;

model 2: one week after each prediction, the training data is shifted one week each time;

model 3: two weeks after each prediction, the training data was shifted one day each time;

model 4: two weeks after each prediction, the training data was shifted one week each time;

model 5: four weeks after each prediction, the training data is shifted one day each time;

model 6: four weeks after each prediction, the training data is shifted one week each time;

model 7: one day after each prediction, the training data was shifted one day each.

As an alternative, when the prediction accuracy and performance of different models in different dimensions are evaluated, the selection of the optimized model is performed before each network element predicts.

As an alternative embodiment, before prediction, the regional traffic data is collected and normalized, and the regional traffic in the whole network is mapped into the range of [0,1 ].

As an alternative embodiment, the collected traffic data is divided into a training set and a test set.

As an alternative embodiment, the data pre-processing process includes backfilling, deduplication and fragmentation of the acquired data.

As an alternative embodiment, after the prediction is completed, the predicted value is restored to the traffic data with the time stamp by date before the data normalization.

As an alternative embodiment, the prediction granularity is composed of three dimensions of time, space and independence, and each dimension has a plurality of layers.

As an alternative embodiment, the optimization model is an optimal prediction model combining temporal granularity and spatial granularity.

A multi-dimensional prediction system for regional traffic under a city scale comprises:

the data acquisition and processing module is configured to acquire flow data in the region and preprocess the acquired data;

the model training module is configured to train different pre-constructed prediction models, and the prediction period and the moving time of training data of each prediction model are different;

and the model evaluation and selection module is configured to evaluate the prediction accuracy and performance of different models in different dimensions by taking the error rate and the filling rate as index statistics, and select the optimized model to perform flow prediction to obtain a prediction result.

A computer readable storage medium having stored therein a plurality of instructions adapted to be loaded by a processor of a terminal device and to execute the method for multi-dimensional prediction of regional traffic at a city scale.

A terminal device comprising a processor and a computer readable storage medium, the processor being configured to implement instructions; the computer readable storage medium is used for storing a plurality of instructions, and the instructions are suitable for being loaded by a processor and executing the regional flow multidimensional prediction method under the urban scale.

Compared with the prior art, the beneficial effect of this disclosure is:

according to the method, the time characteristics and the behavior rules of the model data are deeply researched through statistical analysis of the urban scale real traffic data with the time span of more months. Considering that the time sequence data show different change rules under different time dimensions, space dimensions and independent dimensions, a multi-dimensional recurrent neural network (MDRNN) prediction model is established, and the future cell telephone traffic under different time dimensions and space dimensions is predicted.

The method fully utilizes training and inspection of actual data, the granularity of the provided prediction model can be deep to a unit level, compared with the traditional trend fitting method, the Mean Absolute Percentage Error (MAPE) of the model is reduced by 6.56%, and guidance is provided for energy efficiency optimization and power consumption reduction of a base station in different space-time dimensions. The manual workload is reduced, the efficiency is high, and the prediction accuracy is high.

The method and the device do not need to manually appoint the cycle of the time sequence or parameters such as difference, the manual participation degree is reduced, the operation efficiency is improved, the convergence is better, the prediction accuracy is improved, the independent prediction model is trained by more than 5 ten thousand network elements, and the accurate prediction of each network element is realized.

Drawings

The accompanying drawings, which are included to provide a further understanding of the disclosure, illustrate embodiments of the disclosure and together with the description serve to explain the disclosure and are not to limit the disclosure.

FIG. 1 is a flow diagram of a flow prediction for a multi-dimensional prediction model;

fig. 2 is an overall framework diagram of the automatic flow prediction system.

The specific implementation mode is as follows:

the present disclosure is further described with reference to the following drawings and examples.

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present disclosure. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.

First, the term explanation is made:

(1) data normalization: the goal is to map the full network cell traffic into the range of [0,1 ]. Normalization refers to scaling the data to fall within a small specific interval. Since RNN is sensitive to the size of the input data, it is necessary to map the data uniformly into the range of [0,1 ]. After normalization, the convergence rate of the model and the precision of the model are improved.

(2) And (3) data shape conversion: 2D: [ samples, features ] to 3D: [ samples, timepieces, features ] conversion. Since the RNN network expects the incoming data to be 3D, Timesteps is set to 1 when converting from 2-dimensional to 3-dimensional.

(3) Data set partitioning: the data set is divided into a training set (train _ data) and a test set (test _ data). Training data: the test data is about 7.5: 1.

(4) and (3) a time sequence prediction method: according to the time sequence of historical statistical data, the future change trend is subjected to predictive analysis. In general, a time series consists of four variation components, such as long-term trend variations, seasonal variations, periodic variations, and random fluctuations. Some simple prediction models can be used to predict the three trend changes, such as an exponential smoothing model, a moving average model, and the like. The random variation component is unpredictable, and is a kind of "noise" mixed into the time series, and must be filtered out in an attempt to avoid affecting the accuracy of the prediction result.

(5) Rnn (current Neural networks): the recurrent neural network is a neural network with memory and can remember the things at the previous time point. It can be treated as a black box that can fit any function, as long as the training data is sufficient to obtain the desired output given a particular input.

(6) LSTM (Long Short-Term Memory): is a specific RNN that remembers longer memory relative to RNNs, but is still short-term in nature. The LSTM model is more complex than the RNN model, and training data needs to be more time consuming than traditional RNNs.

The traditional method is limited by factors such as the integrity of historical data, the size of a data gauge and the like, the prediction result is often over-fit or under-fit, the prediction result is compared with real data, the deviation is often large, and the prediction cannot be carried out to the cell level.

In order to solve the above problems, a first aspect of the present invention provides a prediction method using a multidimensional prediction model, which reduces the workload of workers, and has high efficiency and high prediction accuracy. As shown in fig. 1, the method for constructing a multidimensional prediction model based on RNN, Python, and Oracle in this embodiment includes:

(1) saving models after each iteration

In the training process, the training is carried out according to the cell data files under the catalog, so the trained model of the cell is saved according to the cell name each time, the cell name is stored by ci and saved by using cover.

(2) Selecting an optimization model

The output _ mse is a dictionary type, the stored content is ({ (step length: corresponding loss value), () }), and the key { (output _ mse. get) obtains the step length corresponding to the minimum loss value.

(3) Saving optimal models

According to the iteration step length corresponding to the obtained optimal model, the optimal model is saved by a code "save _ path [ min _ ep ] ═ save (sess, save _ path _1+ '/model/' + str (ci) +". ckpt ")" so as to be directly called and used later, and thus, multiple times of training in the same cell can be reduced.

(4) Loading the saved optimal model

And carrying out data loading on the optimal model stored in the last step.

(5) Predicting data

Outputs refers to the predicted output values and feed _ fact refers to the input data.

(6) Restoring the predicted value to the state before normalization

And restoring the predicted value into the flow data of the hour level with the time-of-day as the time stamp before the data normalization.

The different models formed include: model 1: one week after each prediction, the training data was shifted one day each, model 2: one week after each prediction, one week for each shift of training data, model 3: two weeks after each prediction, the training data was shifted one day each, model 4: two weeks after each prediction, the training data was shifted one week each, model 5: four weeks after each prediction, the training data was shifted one day each time, model 6: four weeks after each prediction, the training data moved one week each time, model 7: one day after each prediction, the training data was shifted one day each. Each network element needs to make an evaluation and automatic selection of 7 models.

In this embodiment, the prediction granularity is composed of three dimensions of time, space and independence, and each dimension has a plurality of levels. The spatial dimension is five levels from big to small, namely, a full network level, a district level, a county level, a micro-area level, a grid level and a cell level; the independent dimensionalities comprise four types, namely a manufacturer, a coverage scene, a coverage type and a frequency band, and all the dimensionalities except the manufacturer dimensionality can be drilled down to a cell level; the time dimension is five levels from large to small, namely year, month, week, day and hour.

Due to the use of RNN for prediction, enough data is needed to represent a better advantage. The change rule of 24-hour traffic of each cell in one day can be better reflected by selecting hour-level data for prediction, and the time period in which the cell is not busy can be predicted. However, some cells have deletions in the data aggregation process, so that the daily data fluctuation is large, and the trend of the daily overall traffic is influenced. Therefore, in the process of predicting by using the hour data, the hour data is preprocessed, for example, to reflect the change rule of one week, so when the missing value of the hour data is filled, a weighted average is used to fill the missing value of the hour data by using the data at the same time point in the previous week and the next week.

The result of the embodiment is an optimal model, the optimal prediction model combining the time granularity and the space granularity is given by the model, and the prediction accuracy of the model is verified to be high by testing on an independent dimension.

In this embodiment, for a plurality of network elements of 5 ten thousand, a model after each iteration of each model can be automatically stored, and the prediction accuracy and performance of the model in different dimensions are evaluated by taking the error rate and the fill factor as indexes from the automatically stored models. The filling rate is low, the error rate is the minimum optimal model, the system automatically selects the optimal model during training, the optimal model during training is directly used during prediction, and the flow prediction is carried out after the optimal model is loaded.

Fig. 2 shows the system implementation process of data acquisition → data preprocessing → model auto-training → data prediction → data output and storage Shell.

(1) Data acquisition: an Oracle database is used for design, and a design database storage process is developed and used for timing automatic acquisition of flow data. This phase is expected to take 10 hours.

(2) Data preprocessing: the method comprises the steps of adopting Java language, exporting collected data to a server in a text mode, designing and developing a data preprocessing module, and carrying out 3 preprocessing operations of backfilling, removing duplicate and fragmenting on the collected data (as the RNN needs continuous data and cannot have abnormal values when carrying out model training, a program is needed to automatically preprocess the data). Data pretreatment for this period was expected to be 10 minutes.

(3) Automatic training of a model: and respectively carrying out model training on the 5+5+4 dimensionality fragment original data by adopting a Python language to realize a special model for each network element, and storing model parameters into a model result set.

(4) And (3) data prediction: and adopting Python language, selecting an optimal model from the model result set, respectively predicting each network element for more than 5 ten thousand times in total, and storing the prediction result into a text. Data training and prediction two phases are expected to take 3 hours.

(5) Data output and storage: and (3) outputting and storing the predicted data: and developing a shell script, and warehousing the prediction result into an Oracle database, so that result verification and prediction result query are facilitated. One round of model training and prediction is expected to be completed within 24 hours.

In order to reduce the manual workload and improve the efficiency, the embodiment utilizes the model to automatically train and predict, and puts the prediction result into the Oracle database, thereby reducing the manual participation, reducing the manual workload and improving the efficiency.

As will be appreciated by one skilled in the art, embodiments of the present disclosure may be provided as a method, system, or computer program product. Accordingly, the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present disclosure may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and so forth) having computer-usable program code embodied therein.

The present disclosure is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

The above description is only a preferred embodiment of the present disclosure and is not intended to limit the present disclosure, and various modifications and changes may be made to the present disclosure by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present disclosure should be included in the protection scope of the present disclosure.

Although the present disclosure has been described with reference to specific embodiments, it should be understood that the scope of the present disclosure is not limited thereto, and those skilled in the art will appreciate that various modifications and changes can be made without departing from the spirit and scope of the present disclosure.

Claims

1. A multi-dimensional prediction method for regional flow under an urban scale is characterized by comprising the following steps: the method comprises the following steps:

collecting flow data in the area, and preprocessing the collected data;

2. The method for predicting the regional flow under the urban scale in multiple dimensions as claimed in claim 1, wherein the method comprises the following steps: the predictive model includes at least seven types:

3. The method for predicting the regional flow under the urban scale in multiple dimensions as claimed in claim 1, wherein the method comprises the following steps:

before prediction, the regional flow data is collected and normalized, and the flow in the whole network region is mapped into the range of [0,1 ].

4. The method for predicting the regional flow under the urban scale in multiple dimensions as claimed in claim 1, wherein the method comprises the following steps: dividing the collected flow data into a training set and a testing set;

the data preprocessing process comprises backfilling, duplicate removal and fragmentation of the collected data.

5. The method for predicting the regional flow under the urban scale in multiple dimensions as claimed in claim 1, wherein the method comprises the following steps: and after the prediction is finished, restoring the predicted value into the flow data with the time stamp by date and time before the data normalization.

6. The method for predicting the regional flow under the urban scale in multiple dimensions as claimed in claim 1, wherein the method comprises the following steps: the prediction granularity is composed of three dimensions of time, space and independence, and each dimension has a plurality of layers.

7. The method for predicting the regional flow under the urban scale in multiple dimensions as claimed in claim 6, wherein the method comprises the following steps: the optimization model is an optimal prediction model combining time granularity and space granularity.

8. A multi-dimensional prediction system for regional flow under urban scale is characterized in that: the method comprises the following steps:

9. A computer-readable storage medium characterized by: a plurality of instructions are stored, the instructions are suitable for being loaded by a processor of a terminal device and executing the regional flow multidimensional prediction method under the urban scale according to any one of claims 1-7.

10. A terminal device is characterized in that: the system comprises a processor and a computer readable storage medium, wherein the processor is used for realizing instructions; the computer-readable storage medium is used for storing a plurality of instructions, and the instructions are suitable for being loaded by a processor and executing the regional flow multidimensional prediction method under the urban scale according to any one of claims 1 to 7.