CN111612254B

CN111612254B - Road motor vehicle exhaust emission prediction method based on improved attention bidirectional long-short term memory network

Info

Publication number: CN111612254B
Application number: CN202010443547.4A
Authority: CN
Inventors: 张玉钧; 谢皓; 何莹; 尤坤; 李潇毅; 范博强; 余冬琪; 李梦琪; 雷博恩; 刘建国; 刘文清
Original assignee: Hefei Institutes of Physical Science of CAS
Current assignee: Hefei Institutes of Physical Science of CAS
Priority date: 2020-05-22
Filing date: 2020-05-22
Publication date: 2022-12-23
Anticipated expiration: 2040-05-22
Also published as: CN111612254A

Abstract

The invention discloses a road motor vehicle exhaust emission prediction method for improving an attention bidirectional long-short term memory network, which comprises the following steps: 1. acquiring the exhaust emission data of the motor vehicle by using the PEMS and the OBD detection equipment together; 2. carrying out missing data compensation and normalization pretreatment on the tail gas emission data set; 3. establishing an improved Attention-Bi-LSTM Attention bidirectional long-short term memory network model; 4. determining the hyper-parameters of the model by adopting a pre-experiment; 5. and optimizing model parameters by adopting a self-adaptive learning rate algorithm to finish the training of the prediction model. The invention can fully consider all characteristic factors influencing the tail gas emission of the road motor vehicle, improve the prediction precision of the tail gas emission and have a larger application range, thereby effectively shortening the PEMS tail gas emission test time and reducing the consumption of manpower, resources and time cost.

Description

Road motor vehicle exhaust emission prediction method based on improved attention bidirectional long-short term memory network

Technical Field

The invention relates to the technical field of road motor vehicle exhaust emission prediction algorithms, in particular to an actual road pollutant emission prediction method based on an improved bidirectional long-short term memory network.

Background

In recent years, the number of motor vehicles in China is rapidly increased, so that the tail gas emission of road motor vehicles becomes one of the main factors polluting urban environment, and an effective road motor vehicle tail gas emission monitoring means is adopted, so that the method has important significance for improving the urban air quality. At present, the common methods for monitoring the exhaust emission of road motor vehicles mainly comprise: a chassis power measuring method, a tunnel testing method, a laser remote measuring method, a smoke plume tracing measuring method and a Portable Emission System (PEMS) measuring method. The experimental result of the chassis power measurement method cannot reflect the actual road emission condition of the motor vehicle, the tunnel test method is limited by special geographical environment conditions, the laser telemetry method is easily interfered by external environment, the measurement accuracy is not high, the smoke plume chasing measurement method requires the experimental vehicle to carry test equipment to track and chase the vehicle to be tested, the measurement mode is easy to enforce law, but the accuracy is not as good as that of the vehicle-mounted tail gas detection equipment measurement method. PEMS is used as the most accurate measuring mode in the detection of motor vehicle tail gas roads, has been written into the motor vehicle pollutant emission standard of the sixth stage by the national ministry of environmental protection and the national quality control administration, and is used as one of the necessary inspection links before a novel vehicle goes on the road.

However, during the actual use of PEMS, there are several problems:

(1) Before the test of the tail gas emission of the road motor vehicle begins, a heating pipeline of a Flame Ionization Detection (FID) module and an optical detection air chamber of a Fuel Economy Metering (FEM) module in the PEMS need to be preheated for half an hour and about one hour respectively.

(2) After the PEMS instrument is preheated, zero marking and calibration work of the gas to be measured is required, and the work requires mixed gas and NO ₂ And N ₂ Three-type standard sample gas cylinder and FID ignition aid H ₂ The gas cylinder needs trained professional technicians to operate, so that the detection precondition is harsh, and certain labor, resource and time costs need to be consumed.

(3) In the process of testing actual road pollutant emission of PEMS, FEM, NOx and FID modules often have equipment faults, are in communication interruption with an upper computer and the like, and need related professionals to accompany and follow a vehicle in the whole process, so that waste of human resources and time and certain potential safety hazards are caused.

(4) After the PEMS is continuously monitored for about two hours, the baseline drift phenomenon of the tail gas emission test data is more obvious along with the lapse of the measurement time, and the detection precision is reduced.

Disclosure of Invention

The invention aims to overcome the defects of the prior art, and provides a road motor vehicle exhaust emission prediction method based on an improved deep learning network, so that all characteristic factors influencing the road motor vehicle exhaust emission can be fully considered, the exhaust emission prediction precision is improved, and the method has a wide application range, so that the PEMS exhaust emission test time can be effectively shortened, the consumption of manpower, resources and time cost is reduced, the problems of drift and loss of exhaust emission monitoring data caused by long-time work, faults and the like of detection equipment in the actual PEMS test process can be solved, and the effect of repairing a key exhaust emission data segment is achieved.

In order to achieve the purpose, the invention adopts the following technical scheme:

the invention relates to a road motor vehicle exhaust emission prediction method based on an improved attention bidirectional long-short term memory network, which is characterized by comprising the following steps:

step 1, collecting exhaust emission data of a road motor vehicle in p days by using a PEMS detection device and an OBD vehicle-mounted diagnosis system together, collecting data of q working conditions every day, wherein the collecting time of each working condition is T, thereby obtaining n = p × q × T exhaust emission data sets containing m characteristics, and recording the data as D _origin ＝(d _ij ) _n×m Wherein d is _ij Representing the jth characteristic value at the ith acquisition time; i is more than or equal to 1 and less than or equal to n; j is more than or equal to 1 and less than or equal to m;

step 2, exhaust emission data set D _origin ＝(d _ij ) _n×m Carrying out missing data compensation and normalization pretreatment to obtain a tail gas emission characteristic matrix marked as D _scaled ＝(d′ _ij ) _n×m (ii) a Wherein, d' _ij Representing the j characteristic value at the ith pre-processed acquisition time; the normalized data set feature matrix D is processed _scaled Division into training sets D _train And a verification set D _verify Wherein, training set D _train Has a feature dimension of m-1, a verification set D _verify Has a feature dimension of 1, and a verification set D _verify Predicting a true value of the exhaust emission data for the model;

step 3, establishing an improved Attention-Bi-LSTM Attention bidirectional long-short term memory network model composed of an input layer, a hidden layer, an Attention layer, a full connection layer and an output layer, initializing parameters of the model, and defining a time step as lambda and a prediction time as t;

let the data structure of the input layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model be D _train ＝{d _(t-λ)j ,...,d _tj ,...,d _(t+λ)j },j＝1,2,...,m-1；d _tj Represents the j-th characteristic value at the predicted time t;

enabling the hidden layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model to comprise a forward LSTM network and a backward LSTM network;

the input of the forward LSTM network is d _t-λ ,...,d _t ...,d _t+λ ；d _t M-1 characteristic values at the predicted time t are represented;

the forward state output of the forward LSTM network is

Representing the hidden layer state output of the forward LSTM network at the predicted time t;

the input of the backward LSTM network is d _t+λ ,...,d _t ...,d _t-λ ；

The backward state output of the backward LSTM network is

Representing the hidden layer state output of the backward LSTM network at the predicted time t;

let the output of the hidden layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model be output from the forward state as

And the backward state output is

Is composed of, and is denoted as

h _t A state output indicating the hidden layer at the prediction time t;

obtaining a matching scoring function F (h) in an Attention layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (1) _i ,H _k )：

F(h _i ,H _k )＝V ^T tanh(W ₁ h _i +W ₂ H _k ) (1)

In the formula (1), h _i Hidden state output, H, representing the ith hidden layer _k A hidden state output representing a kth output layer; tanh () represents a hyperbolic tangent function, matrix V, W ₁ 、W ₂ Is Attention model parameter and is obtained by network training, and the dimensions are d respectively ₃ ×1、d ₃ ×d ₁ And d ₃ ×d ₂ Wherein d is ₁ 、d ₂ 、d ₃ Are respectively h _i 、H _k Dimension of V, V ^T Represents the transpose of the parameter V;

obtaining a weight vector a between the ith hidden layer and the kth output layer in the full-connection layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (2) _ik ：

In formula (2), softmax () represents a logistic regression function;

obtaining the kth output vector c in the full connection layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (3) _k Comprises the following steps:

obtaining the state output H of the output layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model at the prediction time t by using the formula (4) _t ：

H _t ＝Bi-LSTM(H _t-1 ,y _t-1 ,H _t+1 ,y _t+1 ,c _t ) (4)

In the formula (4), bi-LSTM () represents a bidirectional LSTM network;

step 4, based on the training set D _train Determining hyper-parameters of the model using pre-experiments, comprising: the number of hidden layer units, time step, training batch, training iteration times, training period and learning rate;

adjusting the hyper-parameters according to a single variable principle within a certain range, and using the verification set D _verify Determining the optimal hyper-parameter when the average absolute error of the MAE changes from descending to increasing as a reference index;

step 5, the training set D _train Inputting the Attention-Bi-LSTM Attention bidirectional long-short term memory network model with the set hyper-parameters for training, and adopting an adaptive learning rate algorithm Adam as the self-parameters of the gradient descent algorithm optimization model in the training process to obtain a road motor vehicle exhaust emission prediction model so as to realize the prediction of future exhaust emission data.

Compared with the prior art, the invention has the following beneficial effects:

1. the invention uses the improved Attention-Bi-LSTM Attention bidirectional long-short term memory network prediction model, can predict the motor vehicle exhaust emission data in a longer time period in the future, shortens the PEMS vehicle-mounted experimental test time, reduces the waste of time, resources and labor cost, and reduces the potential safety hazard which is possibly caused to professional technicians for long-time vehicle-mounted PEMS road test.

2. The invention uses the characteristic factors of tail gas emission data related to time before and after, inputs the trained Attention-Bi-LSTM Attention bidirectional long-short term memory network model, and repairs the loss of key tail gas emission data segments caused by PEMS equipment failure, upper computer communication interruption and the like in the process of monitoring the tail gas emission of the PEMS motor vehicle.

3. According to the invention, an Attention mechanism is introduced into a neural network, and the mechanism is used for monitoring the PEMS by endowing correlation weights to time sequence data before and after a prediction node, so that the problem of baseline drift caused by long-time tail gas monitoring is solved, and the detection precision of the PEMS is improved.

4. The invention uses Bi-LSTM bidirectional network, can correlate the front and back time sequences of the prediction node, and improves the accuracy of prediction.

5. The method uses the adaptive learning rate algorithm Adam, avoids the risk that the model falls into local optimum due to random gradient reduction, and improves the generalization capability of the prediction model.

Drawings

FIG. 1 is a schematic flow chart of a method for predicting exhaust emission of a PEMS road motor vehicle according to the present invention;

FIG. 2 is a schematic diagram of the Attention-Bi-LSTM network of the present invention;

FIG. 3 is a schematic diagram of the Bi-LSTM structure of the present invention;

FIG. 4 is a schematic diagram of the structure of an LSTM cell;

FIG. 5 is a diagram of a model training loss function;

FIG. 6 is a diagram of a PEMS exhaust emission test under WLTC conditions;

FIG. 7 is a graph comparing CO predicted and actual emissions for WLTC operating conditions;

FIG. 8 is a graph comparing NO predicted versus actual emissions for WLTC conditions;

FIG. 9 shows NO in WLTC operating mode ₂ A comparison of predicted versus actual emissions map;

FIG. 10 is a graph comparing THC predicted versus actual emissions for WLTC operating conditions.

Detailed Description

In this embodiment, as shown in fig. 1, a method for predicting road motor vehicle exhaust emission based on an improved attention bidirectional long-short term memory network is performed as follows:

step 1, jointly acquiring exhaust emission data of a road motor vehicle in p days by using a PEMS detection device and an OBD vehicle-mounted diagnosis system, acquiring data of q working conditions every day, wherein the acquisition time of each working condition is T, thereby obtaining n = p × q × T exhaust emission data sets containing m characteristics, wherein the exhaust emission data sets contain PEMS and OBD, and the PEMS data contains real-time CO ₂ 、CO、NO、NO ₂ 、THC、O ₂ Concentration, environment humidity, temperature, sampling mass flow, sampling volume flow rate, sampling pipe temperature and the like, wherein the OBD data comprise vehicle instantaneous speed, engine instantaneous power, engine rotating speed, engine load and the like, and m test data, namely m characteristics, are recorded as D _origin ＝(d _ij ) _n×m Wherein d is _ij Representing the jth characteristic value at the ith acquisition time; i is more than or equal to 1 and less than or equal to n; j is more than or equal to 1 and less than or equal to m;

step 2, carrying out data set D on exhaust emission _origin ＝(d _ij ) _n×m Performing missing data compensationPre-processing of compensation and normalization, wherein data compensation adopts a method of averaging the first M data and the last M data of missing data, the general value of M is 10-20, namely the missing value is filled into the average value of the front and the back 2M effective data, and the characteristic matrix of the repaired data set is marked as D _fit The data set D after the repair is finished _fit Carrying out normalization processing to calculate the characteristic d of each time node _ij Normalized value d of _ij ', as shown in formula (1):

in the formula (1), d _(max)j And d _(min)j Respectively the maximum value and the minimum value of the same characteristic data in the data set before normalization, thereby obtaining an exhaust emission characteristic matrix which is marked as D _scaled ＝(d _ij ′) _n×m (ii) a Wherein, d _ij ' represents the j characteristic value at the i acquisition time after the pretreatment; the normalized data set feature matrix D _scaled Division into training sets D _train And a verification set D _verify Wherein, training set D _train Has a feature dimension of m-1, and a verification set D _verify The characteristic dimension of the model is 1, and the verification set is a true value of the model prediction exhaust emission data;

step 3, as shown in fig. 2, establishing an Attention-Bi-LSTM network model composed of an input layer, a hidden layer, an Attention layer, a full connection layer and an output layer, initializing parameters of the model, and defining a time step length as λ and a prediction time as t;

let the data structure of the input layer of Attention-Bi-LSTM Attention bidirectional long-short term memory network model be D _train ＝{d _(t-λ)j ,...,d _tj ,...,d _(t+λ)j },j＝1,2,...,m-1；d _tj Represents the j-th characteristic value at the predicted time t;

let the hidden layer of Attention-Bi-LSTM Attention two-way long-short term memory network model include forward LSTM network and backward LSTM network, as shown in FIG. 3;

the input to the Forward LSTM network is d _t-λ ,...,d _t ...,d _t+λ ；d _t M-1 eigenvalues at the predicted time t are represented;

the forward state output of the forward LSTM network is

the input to the LSTM network is d _t+λ ,...,d _t ...,d _t-λ ；

Backward state output to the LSTM network as

let the output of the hidden layer of Attention-Bi-LSTM bidirectional long-short term memory network model be outputted from the forward state as

And the backward state output is

Is composed of, and is marked as

h _t A state output indicating the hidden layer at the prediction time t;

obtaining a matching scoring function F (h) in an Attention layer of an Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (2) _i ,H _k )：

F(h _i ,H _k )＝V ^T tanh(W ₁ h _i +W ₂ H _k ) (2)

In the formula (2), h _i Hidden state output, H, representing the ith hidden layer _k A hidden state output representing a kth output layer; tanh () represents hyperbolic tangent function, V, W ₁ 、W ₂ The model is an Attention model parameter matrix, which is obtained by network training and has dimensions d ₃ ×1、d ₃ ×d ₁ And d ₃ ×d ₂ ，d ₁ 、d ₂ 、d ₃ Are respectively h _i 、H _k Dimension of V, which belongs to the hyper-parameter, V ^T Represents the transpose of V;

obtaining a weight vector a between the ith hidden layer and the kth output layer in the fully-connected layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (3) _ik ：

In the formula (3), softmax () represents a logistic regression function;

obtaining the kth output vector c in the full connection layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (4) _k Comprises the following steps:

obtaining state output H of output layer of Attention-Bi-LSTM Attention bidirectional long-short term memory network model at prediction time t by using formula (5) _t ：

H _t ＝Bi-LSTM(H _t-1 ,y _t-1 ,H _t+1 ,y _t+1 ,c _t ) (5)

A Bi-directional LSTM network represented by Bi-LSTM () in formula (5);

specifically, as shown in fig. 4, an LSTM (Long Short-term Memory) introduces a Memory neuron, which includes three determination conditions of an input gate, a forgetting gate and an output gate, and solves the problem of gradient disappearance in a back propagation process caused by an excessively Long time sequence of a cyclic convolution neural network (RNN). The input gate (input gate) indicates the proportion of the information allowed to be added to the memory unit; a forgetting gate (forget gate) represents a ratio of keeping the history information stored in the node of the current state; the output gate (output gate) represents the proportion of taking the information of the current state node as output, and the expressions of the input gate, the forgetting gate and the output gate are respectively as follows:

an input gate:

i _t ＝σ(W _i ·[h _t-1 ,x _t ]+b _i ) (6)

forget the door:

f _t ＝σ(W _f ·[h _t-1 ,x _t ]+b _f ) (9)

an output gate:

o _t ＝σ(W _o ·[h _t-1 ,x _t ]+b _o ) (10)

h _t ＝o _t *tanh(c _t ) (11)

here, W _i 、W _f 、W _c And W _o Weight matrices representing input gate, forgetting gate, output gate and cell activation vector, respectively, b _i 、b _f 、b _c And b _o Representing the bias of the input gate, the forget gate, the output gate, and the cell activation vector, respectively.

σ denotes the Sigmoid activation function:

tanh excitation function:

step 4, determining the hyper-parameters of the model by adopting a pre-experiment, comprising the following steps: the number of hidden layer units, time step, training batch, training iteration times, training period and learning rate; with training set

As input data, the number h of hidden layer units of the network, the time step lambda, the training batch, the training iteration number iteration, the Attention-Bi-LSTM Attention bidirectional long-short term memory network of the training period epoch are used for training, the values of h, lambda, batch, iteration and epoch are adjusted within a certain range, the model is trained, and the output result and the verification set are calculated by using the formula (14)

Mean Absolute Error of (MAE):

f (x) in the formula (14) _i ) And y _i The predicted value and the true value of the model are respectively.

Specifically, the h parameter is increased from 10, step 5; the lambda parameter is increased from 1 with a step size of 1; the batch parameter is increased from 24, step size 24; the iteration parameter is increased from 100, step 50; the epoch parameter is increased from 10 by a step size of 10;

adjusting the hyper-parameters within a certain range, wherein the hyper-parameter adjustment follows the principle of single variable, i.e. when adjusting a certain parameter, the rest hyper-parameters are kept unchanged, and the verification set D is used _verify For reference index, when the average absolute error of MAE changes from decreasing to increasing, the optimal hyper-parameter h is determined _best ,λ _best ,batch _(best) ,iteration _(best) And epoch _(best) 。

And 5, as shown in the figure 5, inputting the exhaust emission characteristic matrix into an Attention-Bi-LSTM Attention bidirectional long-short term memory network model with super parameters for training, and adopting an adaptive learning rate algorithm Adam as the self parameters of a gradient descent algorithm optimization model in the training process to obtain a road motor vehicle exhaust emission prediction model so as to realize prediction of future exhaust emission data, wherein a loss function in the model training process is shown in the figure 6. Adam, namely adaptive movements, designs independent adaptive learning rates for different parameters by calculating first moment estimation and second moment estimation of the gradient, and avoids the risk of convergence of a model to local optimum due to random gradient descent.

And 5.1, calculating the gradient at the t moment by using the formula (15):

in the formula (15), f (theta) is a random objective function;

step 5.2, updating the biased first moment estimation by using the formula (16):

s _t ＝p ₁ ·s _t-1 +(1-p ₁ )·g _t (16)

and 5.3, updating the biased second moment estimation by using the formula (17):

and 5.4, correcting the deviation of the first moment by using an equation (18):

and 5.5, correcting the deviation of the second moment by using an equation (19):

and 5.6, updating parameters by using the formula (20):

specifically, after the completed Attention-Bi-LSTM Attention bidirectional long-short term memory network model is trained, the emission test set data of the motor vehicle for the future u days is input

Predicting the motor vehicle exhaust emission trend through the model, outputting a prediction result and carrying out inverse normalization processing, wherein the formula is as follows:

y＝y _scaled ×(x _max -x _min )+x _min (21)

in the formula (21), y represents the model prediction result after inverse normalization, y _scaled Representing the normalized model prediction result, x _max And x _min The maximum and minimum values of the predicted feature data in the training set before normalization, respectively.

And finishing normalization processing, and finally outputting the prediction result of the model. And quantitatively comparing the prediction result with a verification set to verify the accuracy of model prediction. The prediction evaluation index adopts Root Mean Square Error (RMSE) and has the following formula:

in the formula (22), y _i The predicted value of the exhaust emission at the ith time node of the model,

and n is the length of the prediction sequence to verify the real value of the tail gas emission of the ith time node in the set. RMSE ranges from [0, + ∞) and is equal to 0 when the predicted value completely matches the true value, i.e. a perfect model; the larger the error, the larger the value.

TABLE 1 exhaust emission prediction error RMSE TABLE

The experimental results of fig. 7, fig. 8, fig. 9, fig. 10 and table 1 show that the present invention can effectively predict four key index gases in the exhaust emission of an automotive vehicle: the prediction trend of CO, NO2 and THC is basically consistent with the actual emission result, the RMSE is less than 50ppm on the whole, the long-time sequence data section loss in the tail gas emission detection of the PEMS road motor vehicle is effectively repaired, and the test time of a PEMS vehicle-mounted experiment is reduced, so that the time, the resource and the labor cost are reduced, and the potential safety hazard possibly generated to professional technicians by long-time vehicle-mounted road tests is reduced.

Claims

1. A road motor vehicle exhaust emission prediction method based on an improved attention bidirectional long-short term memory network is characterized by comprising the following steps:

step 1, jointly acquiring exhaust emission data of a road motor vehicle in p days by using a PEMS detection device and an OBD vehicle-mounted diagnosis system, acquiring data of q working conditions every day, wherein the acquisition time of each working condition is T, thereby obtaining n = p × q × T exhaust emission data sets containing m characteristics, and recording the data as D _origin ＝(d _ij ) _n×m Wherein d is _ij Representing the jth characteristic value at the ith acquisition time; i is more than or equal to 1 and less than or equal to n; j is more than or equal to 1 and less than or equal to m;

step 2, carrying out data set D on exhaust emission _origin ＝(d _ij ) _n×m Carrying out missing data compensation and normalization pretreatment to obtain a tail gas emission characteristic matrix marked as D _scaled ＝(d′ _ij ) _n×m (ii) a Wherein, d' _ij Representing a j characteristic value at the i acquisition time after the pretreatment; the normalized data set feature matrix D is processed _scaled Division into training sets D _train And a verification set D _verify Wherein, training set D _train Has a feature dimension of m-1, and a verification set D _verify Has a feature dimension of 1, and a verification set D _verify Predicting a true value of the exhaust emission data for the model;

step 3, establishing an Attention-Bi-LSTM Attention bidirectional long-short term memory network model composed of an input layer, a hidden layer, an Attention layer, a full connection layer and an output layer, initializing parameters of the model, and defining a time step length as lambda and a prediction time as t;

the forward state output of the forward LSTM network is

the input to the backward LSTM network is d _t+λ ,...,d _t ...,d _t-λ ；

The backward state output of the backward LSTM network is

And the backward state output is

Is composed of, and is denoted as

h _t Representing the hidden state output of the input layer at time t;

F(h _i ,H _k )＝V ^T tanh(W ₁ h _i +W ₂ H _k ) (1)

In the formula (1), h _i Indicating a hidden state output, H, of the input layer at time i _k Represents a hidden state output of the output layer at time k; tanh () represents a hyperbolic tangent function, matrix V, W ₁ 、W ₂ Is Attention model parameter and is obtained by network training, and the dimensions are d respectively ₃ ×1、d ₃ ×d ₁ And d ₃ ×d ₂ Wherein d is ₁ 、d ₂ 、d ₃ Are respectively h _i 、H _k Dimension of V, V ^T Represents the transpose of the parameter V;

obtaining a weight vector a between a hidden layer at the time i and an output layer at the time k in a full-connection layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (2) _ik ：

In the formula (2), softmax () represents a logistic regression function;

obtaining an output vector c at a time k in a full connection layer of the Attention-Bi-LSTM Attention bidirectional long-short term memory network model by using the formula (3) _k Comprises the following steps:

H _t ＝Bi-LSTM(H _t-1 ,y _t-1 ,H _t+1 ,y _t+1 ,c _t ) (4)

In the formula (4), bi-LSTM () represents a bidirectional LSTM network;

step 4, based on the training set D _train Determining hyper-parameters of the model using pre-experiments, comprising: the hidden layer unit number, the time step length, the training batch, the training iteration number, the training period and the learning rate;

adjusting the hyper-parameters within a certain range according to a single variable principle, and using the verification set D _verify Determining the optimal hyper-parameter when the average absolute error of the MAE changes from descending to increasing as a reference index;

step 5, training set D _train Inputting an Attention-Bi-LSTM Attention bidirectional long-short term memory network model with set hyper-parameters for training, and adopting an adaptive learning rate algorithm Adam as a self-parameter of a gradient descent algorithm optimization model in the training process to obtain a road motor vehicle exhaust emission prediction model so as to realize prediction of future exhaust emission data;

step 5.1, calculating the gradient at the t moment by using the formula (5):

g _t ＝▽ _θ f _t (θ _t-1 ) (5)

in the formula (5), f _t (θ) is a random objective function at time t;

step 5.2, updating the biased first moment estimate s by using the formula (16) _t-1 Obtaining an estimated biased first moment s at the time t _t ：

s _t ＝p ₁ ·s _t-1 +(1-p ₁ )·g _t (16)

Step 5.3, updating the biased second moment estimation by using the formula (17)Meter r _t-1 Obtaining an estimate r of biased second moment at time t _t ：

And 5.4, obtaining the corrected first moment deviation by utilizing the formula (18)

Step 5.5, obtaining the corrected deviation of the second moment by using the formula (19)

Step 5.6, updating parameter theta by using equation (20) _t-1 Obtaining the parameter theta at the time t _t ：

Inputting the emission test set data of the motor vehicle for the future u days after completing Attention-Bi-LSTM Attention bidirectional long-short term memory network model training

And predicting the motor vehicle exhaust emission trend through the model, outputting a prediction result, performing inverse normalization processing, and finally outputting the prediction result of the model.