CN110491146B - Deep learning-based traffic signal control scheme real-time recommendation method - Google Patents

Deep learning-based traffic signal control scheme real-time recommendation method Download PDF

Info

Publication number
CN110491146B
CN110491146B CN201910772945.8A CN201910772945A CN110491146B CN 110491146 B CN110491146 B CN 110491146B CN 201910772945 A CN201910772945 A CN 201910772945A CN 110491146 B CN110491146 B CN 110491146B
Authority
CN
China
Prior art keywords
data
time
control scheme
traffic
function
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910772945.8A
Other languages
Chinese (zh)
Other versions
CN110491146A (en
Inventor
郭海锋
李瑶
何德峰
金峻臣
孔桦桦
周浩敏
丁楚吟
谢竞成
杨宪赞
温晓岳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yinjiang Technology Co Ltd
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Enjoyor Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT, Enjoyor Co Ltd filed Critical Zhejiang University of Technology ZJUT
Priority to CN201910772945.8A priority Critical patent/CN110491146B/en
Publication of CN110491146A publication Critical patent/CN110491146A/en
Application granted granted Critical
Publication of CN110491146B publication Critical patent/CN110491146B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0637Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/07Controlling traffic signals
    • G08G1/08Controlling traffic signals according to detected number or speed of vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Human Resources & Organizations (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Strategic Management (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Tourism & Hospitality (AREA)
  • Biomedical Technology (AREA)
  • Educational Administration (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Economics (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Optimization (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Development Economics (AREA)
  • Pure & Applied Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Probability & Statistics with Applications (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Game Theory and Decision Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)

Abstract

A traffic signal control scheme real-time recommendation method based on deep learning comprises the following steps: preprocessing traffic data based on the acquired traffic state data, wherein the traffic data comprises cleaning error data, correcting abnormal data and repairing missing data; the method comprises the steps of constructing a training data set model of a time sequence, training a traffic signal control scheme real-time recommendation model of the intersection based on a deep learning method of a CNN-DA-RNN framework, recommending a traffic signal control scheme at the next moment, and realizing a problem intersection signal control scheme real-time recommendation function. The invention reduces the time for optimizing the intersection, improves the working efficiency of personnel, feeds back the recommended scheme in real time, and increases the reliability and reproducibility of the recommended scheme.

Description

Deep learning-based traffic signal control scheme real-time recommendation method
Technical Field
The invention relates to the field of intelligent traffic and urban traffic control, in particular to the field of traffic signal control scheme recommendation.
Background
With the increasing of the automobile holding amount in cities in China, the motorization level of roads is continuously improved, and the problem of urban traffic jam becomes a pain point and a difficulty point for urban management, thereby restricting the economic development to a certain extent. The urban traffic jam also causes great negative influence on the trip experience of citizens, and the most urgent task in the current urban management is to relieve the urban traffic jam problem, improve the urban traffic operation efficiency and enable the constructed signal system to exert the maximum efficiency.
The operation of urban traffic systems is complex and changeable, and the road traffic systems are continuously changed along with the change of time and space. Due to the openness, randomness and dynamics of the road traffic system, the traffic system is severely blocked or even paralyzed when traffic accidents, rainstorms, snowstorms or other emergencies occur, such as rush hour on duty or off duty. A series of factors affecting road traffic systems are uncertain and sudden, and thus a plurality of factors need to be integrated in the traffic optimization control process. However, in the face of traffic problems caused by excessively complex factors, the traditional traffic signal control method cannot meet the requirements of current traffic control optimization, and how to establish optimal control for seeking traffic signals is the current key.
In order to fully exert the function of the urban traffic control system, traffic signal optimization service has been developed in recent two years, the development of the service is driven by policies from management departments such as the ministry of public security, province and city on the one hand, and the requirement of urban traffic control business really exists on the other hand, especially in first-line cities. The traffic signal optimization service aims to enable the constructed signal system to exert the maximum efficiency and improve the urban traffic operation efficiency to the maximum extent.
In the traffic signal optimization service, under the condition that the change of traffic flow rules is large, such as the peak at morning and evening or holidays, various parameters of crossing monitoring and real-time signal timing adjustment of a signal control system need to be checked manually. The regulation and control mode has the defects of non-reproducibility, low efficiency, low reliability and the like, and a novel technology is urgently needed to be used as an auxiliary means to alleviate the problems.
Disclosure of Invention
The invention provides a signal control scheme recommendation method conforming to urban traffic control rules, aiming at overcoming the defects of low regulation and control efficiency, low reliability and non-replicability in the traffic signal control process in the prior art.
The invention trains a traffic signal control scheme real-time recommendation model based on an algorithm of an artificial intelligence deep learning neural network according to traffic state data, and outputs a signal control scheme suitable for the flow and the saturation of the current signalized intersection. The output scheme can be recommended to the front-line traffic control personnel in real time, the scheme can be issued in real time after the judgment is reasonable, the time for optimizing the intersection is reduced to a certain extent, the working efficiency of the personnel is improved, the recommended scheme is fed back in real time, and the reliability and the reproducibility of the recommended scheme are improved.
The invention achieves the aim through the following technical scheme: a traffic signal control scheme real-time recommendation method based on deep learning comprises the following steps:
1.1 collecting traffic data including traffic control data and traffic status data, including but not limited to: signal system cycle end time, cycle duration, split data, the traffic state data includes but is not limited to: flow, saturation, velocity;
1.2 preprocessing traffic data, including cleaning error data, correcting abnormal data and repairing missing data;
1.3 constructing a time series of data sets, the steps comprising:
1.3.1 extracting sample points of a data set, the sample points referring to cycle end times when control scheme data is satisfied, the control scheme data being a green to green ratio variation;
1.3.2 constructing sample point data, and extracting T groups of traffic data x corresponding to the sample pointsiT group control scheme data y target,i1 group of split data yhis,iWherein i represents the ith sample point;
1.3.3 constructing a time series data set satisfying the training requirements, including a traffic data set xtControl scheme data set ytargetAnd a split data set yhisTraining requirements include, but are not limited to: the number of sample points, the length of time for extracting traffic data for the sample points;
1.4, constructing a deep learning algorithm model based on CNN-DA-RNN, wherein the first layer CNN adopts an unsubbed convolutional layer neural network, and obtains output data with unchanged dimensionality by performing convolution calculation on input data, and the output data is used for extracting the short-time dependency relationship and the dependency relationship among variables in the time dimensionality of the input data; the second layer DA-RNN adopts a recurrent neural network based on a two-stage attention mechanism, and is used for performing attention allocation in space and time dimensions on input data, and encoding and decoding, and specifically includes:
1.4.1 performing spatial attention distribution on the traffic data set data output from the convolutional layer;
1.4.2, encoding the data after the space attention distribution;
1.4.3 time attention distribution is carried out on the coded data;
1.4.4 weighted calculation of the time attention assigned data, weighted data and control scheme data set ytargetDecoding is carried out;
1.5 training a traffic signal control scheme real-time recommendation model: taking the data set in the step 1.3 as a training data set, and performing learning training on the deep learning algorithm model in the step 1.4, wherein the method for training the model includes but is not limited to: the method comprises a random gradient descent optimizer method, an Adam optimizer method and an automatic parameter adjusting method, wherein the end condition of a training model is that the convergence degree of a loss function meets requirements, and the loss function is the mean square error of data of a prediction control scheme and data of an actual control scheme.
1.6 recommending a traffic signal control scheme at the next moment: and collecting real-time traffic state data, inputting a traffic signal control scheme real-time recommendation model, and obtaining prediction control scheme data output by the model.
Further, the period duration of the traffic control data in step 1.1 indicates the time required by the signal lamp to display various lamp colors for one week in turn, and the data dimension is 1 dimension; the split of the traffic control data is the split data of each phase at the intersection, and the data dimension is the phase number; the traffic state data refers to traffic state data of all lanes at the intersection, and the data dimension is the number of traffic state data types and the number of lanes.
Further, the error data is cleaned in the step 1.2, wherein the default value and the repeated value are deleted; correcting abnormal data, judging whether the data is an abnormal value by using a t test method in statistics, carrying out interpolation processing on the abnormal value by using a spline function method, and carrying out interpolation by using historical data; repairing missing data by adopting a multivariate linear regression model method, and comprising the following steps of: (1) making a scatter diagram on the existing data and performing multiple regression processing; (2) solving a multiple linear regression polynomial and a confidence interval; (3) making a residual error analysis graph, and verifying the fitting effect, wherein the smaller the residual error is, the better the coincidence degree of the regression polynomial and the source data is, and (4) supplementing the missing data by a polynomial equation with the minimum residual error.
Further, the specific content of the sample point data constructed in step 1.3.2 is as follows: t groups of traffic data, the traffic state data of the sample points and the first T-1 group of the sample points which are sorted according to the cycle end time are taken out and stored in an array form as x of the sample point dataiThe part is specifically as follows:
xi={Cyclei,Ai,Bi,...,Gi,VO1i,VO2i,...,VOki,DS1i,DS2i,...,DSki} (1)
wherein C isiIndicating the period duration, Cyclei=[Ci-T,...,Ci-1,Ci];Ai,Bi,...,GiGreen ratio data indicating the control phases A, B, …, G of the signal, Ai=[ai-T,...,ai-1,ai],Bi=[bi-T,...,bi-1,bi],…,Gi=[gi-T,...,gi-1,gi]; VO1i,VO2i,…,VOkiThe traffic data of the traffic lane is indicated,
Figure BDA0002174159430000031
Figure BDA0002174159430000032
DS1i,DS2i,…,DSkithe data of the saturation degree of the lane is indicated,
Figure BDA0002174159430000033
Figure BDA0002174159430000034
the number of the signal control phases is related to the intersections, and the number of the phases and the phase sequence of the operation of different intersections are different;
t sets of control scheme data, the control scheme data of the sample points and the first T-1 sets of sample points are taken out and stored in the form of an array as y of sample point datatargetIn part, in particularComprises the following steps:
ytarget,i={ΔAi,ΔBi,ΔCi,ΔDi,ΔEi,ΔFi,ΔGi} (2)
wherein Δ Ai,ΔBi,...,ΔGiMeans the amount of change Δ A of adjacent spliti=[Δai-T,...,Δai-1,Δai],ΔBi=[Δbi-T,...,Δbi-1,Δbi],…,ΔGi=[Δgi-T,...,Δgi-1,Δgi],ytarget,iThe dimensionality of the intersection is determined by the actual running phase quantity of the intersection;
a set of split plan data: the green ratio data of the next moment of the sample point is taken and stored in an array form as yhis,iThe part is specifically as follows:
yhis,i={A′i+1,B′i+1,C′i+1,D′i+1,E′i+1,F′i+1,G′i+1} (3)
wherein A'i+1,B′i+1,C′i+1,D′i+1,E′i+1,F′i+1,G′i+1Phase-referred green signal ratio value A'i+1=[ai+1],B′i+1=[bi+1], …,G′i+1=[gi+1],yhis,iThe dimension of (c) is determined by the number of phases actually operated at the intersection.
Further, the extraction method of the time dependency relationship and the dependency relationship between the variables of the unsuccessfully-pooled convolutional layer neural network for the input data, described in step 1.4, is as follows:
the input data of the convolutional layer is a time-series traffic data set xtThe convolutional layer is composed of a plurality of filters with width omega and height n, wherein the setting of width omega is the same as the green ratio of input data, the setting of height n is the same as the column dimension of input data variables, and the k filter scans the input matrix xiAnd generating:
hcnnk=RELU(Wcnnk*xi+bcnnk) (4)
meaning of formula: wherein denotes a convolution operation, hcnnkIs an output vector, RELU (x) linear modification unit neuron activation function, RELU function can accelerate gradient descent and backward propagation, and avoid the problems of gradient sharp rise and sharp decrease, Wcnnk,bcnnkThe convolution matrix and the offset to be learned are continuously corrected in the training process, and the range of k is the ratio of the length of input data to the size (omega n) of the filter;
to keep the convolution output hcnnkIs consistent with the dimension of the input data by aligning the input matrix xiThe method for increasing the dimension is realized, and the variable value of the dimension is increased to be 0; the method comprises the following implementation processes: x is the number ofiDimension i j, convolution matrix WkDimension 3 x 3, h for obtaining dimension i x jcnnkBy varying xiIs (i +1) × (j +1) and the variable value of the added dimension is 0.
Further, step 1.4.1 specifically includes: the spatial attention allocation is the first stage of the two-stage attention mechanism, the spatial attention is introduced as the input attention mechanism, the correlation is automatically extracted for the input data at each moment, and the input attention weight is calculated according to the previous hidden state of the encoder, and the method comprises the following steps:
for X at each time of input data XtUsing the attention mechanism, the formula is as follows:
Figure BDA0002174159430000051
Figure BDA0002174159430000052
wherein [ ht-1;st-1]Is the last hidden state ht-1And the last state st-1Of a cascade function of ve,We, UeRefers to the parameters of the high-dimensional matrix to be learned,
Figure BDA0002174159430000053
is the spatial attention weight assigned to the kth input feature at time t, the output after spatial attention assignment
Figure BDA0002174159430000054
Comprises the following steps:
Figure BDA0002174159430000055
the specific process 1.4.2 encodes the data after spatial attention allocation: the neural state of LSTM of the encoder LSTM unit is dynamically summed along with the time, the long-term dependence relationship is memorized, the problem of rapid gradient reduction is easily solved, the method is effective for processing the time sequence problem, and the LSTM method is used for inputting data
Figure BDA0002174159430000056
The encoding method is as follows:
first the encoder can learn from xtTo htMapping of (2):
Figure BDA0002174159430000057
wherein h istFor the hidden state of the encoder at time t, ht-1A hidden state on the finger, f1Is a non-linear activation function;
secondly, the encoding unit updates the state using the LSTM network as an activation function: the LSTM recurrent neural network comprises a forgetting gate ftInput door itOutput gate otEach LSTM cell has a state s at time ttMemory cell, state htThe updating method comprises the following steps:
ft=σ(Wf[ht-1;xt]+bf) (9)
it=σ(Wi[ht-1;xt]+bi) (10)
ot=σ(Wo[ht-1;xt]+bo) (11)
st=ft⊙st-1+it⊙tanh(Ws[ht-1;xt]+bs) (12)
ht=ot⊙tanh(st) (13)
wherein, [ h ]t-1;xt]Is the previous hidden state ht-1And the current input xtA cascade function of which Wf,Wi, Wo,Ws,bf,bi,bo,bsIs the parameter to be trained and learned, sigma and ⊙ are the logical function and element multiplication, respectively;
the specific process 1.4.3 performs time attention allocation on the encoded data: the time attention allocation is the second stage of the two-stage attention mechanism, the time attention mechanism is introduced to capture the long-term timing dependence information of the encoder, and the state data h input is subjected to based on the hidden state of the previous decodertThe time attention weight is calculated by the following method:
based on previous decoder hidden state dt-1And state s 'of the last LSTM cell't-1Calculating a time attention weight for each encoder hidden state at time t using an attention mechanism
Figure BDA0002174159430000061
The formula is as follows:
Figure BDA0002174159430000062
Figure BDA0002174159430000063
wherein [ d ]t-1;s′t-1]Is a cascade function of the hidden state of the previous decoder and the state of the last LSTM cell, vd,Wd,UdAre the high-dimensional matrix parameters that need to be learned,
Figure BDA0002174159430000064
is the temporal attention weight assigned to the ith set of features at time t;
step 1.4.4 said weighting and calculating the time attention distributed data, weighting data and ytargetAnd (3) decoding: the method specifically comprises the following steps:
1.4.4.1. calculating all hidden states hiWeighted sum vector c oft
Figure BDA0002174159430000065
Wherein, ctIs the decoder LSTM unit input.
1.4.4.2. Calculating updated target outputs
Figure 1
Figure BDA0002174159430000067
Wherein, [ y ]t-1;ct-1]Is the output state y of the last decodert-1And the last one of the weighted sums of all hidden states ct-1The function of the cascade of functions of (a),
Figure BDA0002174159430000071
and
Figure BDA0002174159430000072
are the parameters to be learned and trained.
1.4.4.3. Updating the hidden state d of the decoder at time tt: utilizing new target output
Figure BDA0002174159430000073
And previous hidden state:
Figure BDA0002174159430000074
wherein f is2For establishing time series for non-linear activation functionsLong term dependencies, choosing to use LSTM cells as f for updating hidden states2Function, then hidden state dtThe specific calculation is as follows:
Figure BDA0002174159430000075
Figure BDA0002174159430000076
Figure BDA0002174159430000077
Figure BDA0002174159430000078
dt=o′t⊙tanh(s′t) (23)
wherein the content of the first and second substances,
Figure BDA0002174159430000079
is the previous hidden state dt-1And of the preceding objective function
Figure 2
Of cascade function of, wherein W'f,W′i,W′o,W′s,b′f,b′i,b′o,b′sIs the parameter to be trained and learned, σ and ⊙ are the logistic function and the element multiplication, respectively.
1.4.4.4. Estimating the output of the current moment:
decoder LSTM cell output yDTA simulation function F is constructed through a DA-RNN structure, the function F can observe given input and previous output, and the output of the current moment is estimated
Figure BDA00021741594300000711
Figure BDA00021741594300000712
Wherein [ d ] isT;cT]Is a hidden state d of the decoding layerTSum vector cTOf the cascade function, parameter WyAnd bwIs a parameter to be learned and trained, the weight of a linear function
Figure BDA00021741594300000713
And bias bvIs the parameter to be learned, determines the final prediction result
Figure BDA00021741594300000714
Further, the loss function calculation and judgment in step 1.5: the training process of the model comprises the steps of grouping all data in small batches, and training the model by using a Stochastic Gradient Descent (SGD) optimizer and an Adam optimizer; designing a smooth and differentiable output result to ensure that the parameters can be obtained through standard reverse propagation learning; designing loss functions of the objective function, namely predicted control scheme data and actual control scheme data:
Figure BDA0002174159430000081
where N is the amount of samples of training,
Figure BDA0002174159430000082
is a predicted solution to be used in the future,
Figure BDA0002174159430000083
is a practical solution, the result of the training causes the loss function to converge rapidly to a very small value β, convention β<0.2%。
Further, the step 1.5 of correcting the predictive control scheme data output by the traffic signal control scheme real-time recommendation model specifically adopts the following method:
correction result yTIncluding predictive control scheme data yDTAnd the result y of the linear regression calculation of the mixed regression modelATTwo parts, the vector sum of the two:
yT=yDT+yAT(26)
wherein the model for the linear regression calculation is:
Figure BDA0002174159430000084
where q is the input matrix yt-kK denotes the kth filter, WaukAnd baukAre parameters that need to be learned.
Further, whether the recommended control scheme data of step 1.6 meets constraints, the constraints including:
(1) whether the actual flow and saturation conditions of the intersection are met or not; (2) whether the recommendation period is less than the maximum period time of the intersection or not is judged; (3) whether the recommended green signal ratio of each phase is greater than the minimum green light or not; (4) whether the safety time of the pedestrian phase is completely met; (5) whether the time setting of the particular phase is completed.
Further, the sample point in step 1.3 refers to the cycle end time when the control scheme data meets the requirement, and the control scheme data is the variation of the split ratio, specifically:
and judging the variation of the green signal ratio of any phase in the two adjacent groups of data, and when the variation exceeds 5% of the total cycle time, the control scheme data meets the requirement.
The invention has the beneficial effects that: and only starting from the traffic state data, calculating a signal control scheme suitable for the flow and the saturation of the current signalized intersection based on an artificial intelligence deep learning neural network algorithm, and recommending the traffic signal control scheme in real time. The output scheme can be recommended to the front-line traffic control personnel in real time, the scheme can be issued in real time after the judgment is reasonable, the time for optimizing the intersection is reduced to a certain extent, the working efficiency of the personnel is improved, the recommended scheme is fed back in real time, and the reliability and the reproducibility of the recommended scheme are improved.
Drawings
FIG. 1 is a sample point extraction flow diagram of the present invention;
FIG. 2 is a flow chart of the training data set construction of the present invention;
FIG. 3 is a diagram of a CNN-DA-RNN framework model of the present invention;
FIG. 4 is a diagram of a DA-RNN neural network model of the present invention;
FIG. 5 is a CNN-DA-RNN framework training loss function convergence curve of the present invention;
FIG. 6a is a graph comparing the average speed of the Yanan road in the 2019 month 1 and the 2018 month 12 weekdays;
fig. 6b is a graph comparing the average speed of the peace road in 2019 month 1 and 2018 month 12 off-weekdays.
Detailed Description
The invention will be further described with reference to specific examples, but the scope of the invention is not limited thereto.
An example is as follows: an important intersection of a main road Yanan road in Hangzhou city is selected: and (4) testing a real-time recommendation scheme generation method on the Qingchun road and the Yangan road, and verifying the effectiveness of the signal control scheme recommendation method designed by the invention. The method comprises the following steps:
firstly, collecting traffic data: collecting six-month traffic state data (data flow, saturation) and traffic control data (period and green letter ratio case data) of the intersection, storing the control data and the state data into the same data table of a database according to the period ending time, wherein the data also comprises the period ending time.
Secondly, preprocessing data: and performing statistics including a data amount, a null value amount, an abnormal null value and an abnormal value (for example, the sum of the split plan is not 0) on the data table al _ input _ scheme _ match _ state _ data, and saving the statistical result in the data quality table. Then, sequentially carrying out data preprocessing:
2.1 deleting default values and removing duplicate values;
2.2, correcting abnormal data, and performing interpolation of abnormal position data by adopting historical data;
and 2.3, repairing the missing data, performing residual error analysis by adopting a multiple linear regression model method, and supplementing the missing data by using a polynomial equation with the minimum residual error.
And after the data are preprocessed, counting abnormal data again and judging the data quality, and constructing a data set after the data quality meets the requirements. The data quality comprises a null value rate of less than 0.01, an abnormal value of less than 0.05 and the like.
And thirdly, constructing a time-series data set, and sequentially extracting sample points, sample point data and constructing a training data set according to the steps of constructing the data set.
3.1 sample points were taken as shown in FIG. 1. Segmenting the data of the trained original data, and determining sample points of the data in a certain time in sequence;
3.2 extract sample point data as shown in FIG. 2.
Extracting T groups of traffic data x corresponding to sample pointsiThe traffic control data of the sample point and the first T-1 group of time data of the time are taken.
3.2.1T group traffic data, taking out the traffic state data of the sample point and the first T-1 group of the sample point according to the cycle end time sequence, storing the traffic state data in an array form as x of the sample point dataiThe part is specifically as follows:
xi={Cyclei,Ai,Bi,...,Gi,VO1i,VO2i,...,VOki,DS1i,DS2i,...,DSki} (1)
wherein C isiIndicating the period duration, Cyclei=[Ci-T,...,Ci-1,Ci];Ai,Bi,...,GiGreen ratio data indicating the control phases A, B, …, G of the signal, Ai=[ai-T,...,ai-1,ai],Bi=[bi-T,...,bi-1,bi],…, Gi=[gi-T,...,gi-1,gi];VO1i,VO2i,…,VOkiNumber of traffic flow VO1 for traffic lanei=[vo1i-T,...,vo1i-1,vo1i],
Figure BDA0002174159430000101
DS1i,DS2i,…,DSkiLane-indicating saturation data
Figure BDA0002174159430000102
Figure BDA0002174159430000103
The number of the signal control phases is related to the intersections, and the number of the phases and the phase sequence of the operation of different intersections are different;
3.2.2T group control scheme data ytarget,iThat is, the control scheme data of the sample point and the first T-1 group of time data of the time, the T group of control scheme data, the control scheme data of the sample point and the first T-1 group of the sample point are taken and stored in the form of an array as the y of the sample point datatargetThe part is specifically as follows:
ytarget,i={ΔAi,ΔBi,ΔCi,ΔDi,ΔEi,ΔFi,ΔGi} (2)
wherein Δ Ai,ΔBi,...,ΔGiMeans the amount of change Δ A of adjacent spliti=[Δai-T,...,Δai-1,Δai],ΔBi=[Δbi-T,...,Δbi-1,Δbi],…,ΔGi=[Δgi-T,...,Δgi-1,Δgi],ytarget,iThe dimensionality of the intersection is determined by the actual running phase quantity of the intersection;
3.2.31 sets of split data yhis,iAnd taking a group of split data after the sample point. A set of split plan data: the green ratio data of the next moment of the sample point is taken and stored in an array form as yhis,iThe part is specifically as follows:
yhis,i={A′i+1,B′i+1,C′i+1,D′i+1,E′i+1,F′i+1,G′i+1} (3)
wherein A'i+1,B′i+1,C′i+1,D′i+1,E′i+1,F′i+1,G′i+1Phase-referred green signal ratio value A'i+1=[ai+1],B′i+1=[bi+1],…,G′i+1=[gi+1],yhis,iThe dimension of (c) is determined by the number of phases actually operated at the intersection.
3.3 construct a training data set as shown in FIG. 2. The training data set is a set of sample point data, including a traffic data set xtControl scheme data set ytargetAnd a split data set yhisAnd (4) three parts.
Fourthly, a deep learning algorithm model based on the CNN-DA-RNN is constructed, and the model architecture is shown in the attached figure 3. The first layer CNN adopts an unbooled convolutional layer neural network, and the second layer DA-RNN adopts a recurrent neural network based on a two-stage attention mechanism. The first layer is mainly used for extracting the dependency relationship of short time in the time dimension of the input data and the dependency relationship among variables. The second layer mainly functions to perform attention allocation in spatial and temporal dimensions on input data and perform encoding and decoding, and specifically includes:
4.1 extracting the time dependency of the input data and the dependency between the variables:
the convolutional layer is composed of a plurality of filters with width omega and height n, wherein the width omega is set to be the same as the green ratio of input data, the height n is set to be the same as the column dimension of input data variables, and the k filter scans the input matrix xiAnd generating:
hcnnk=RELU(Wcnnk*xi+bcnnk) (4)
to keep the convolution output hcnnkIs consistent with the dimension of the input data by aligning the input matrix xiThe method for increasing the dimension is realized, and the variable value of the dimension is increased to be 0; the method comprises the following implementation processes: x is the number ofiDimension i j, convolution matrix WkDimension 3 x 3, h for obtaining dimension i x jcnnkBy varying xiIs (i +1) × (j +1) and the variable value of the added dimension is 0.
4.2 performing spatial attention distribution on the traffic data set data output from the convolutional layer;
for each time of input data XX oftUsing the attention mechanism, the formula is as follows:
Figure BDA0002174159430000111
Figure BDA0002174159430000112
wherein [ ht-1;st-1]Is the last hidden state ht-1And the last state st-1Of a cascade function of ve,We, UeRefers to the parameters of the high-dimensional matrix to be learned,
Figure BDA0002174159430000113
is the spatial attention weight assigned to the kth input feature at time t, the output after spatial attention assignment
Figure BDA0002174159430000114
Comprises the following steps:
Figure BDA0002174159430000115
4.3 encoding the data after spatial attention allocation:
first, the encoder can learn from xtTo htMapping of (2):
Figure BDA0002174159430000121
second, the encoding unit updates the state h using the LSTM network as an activation functiont
ft=σ(Wf[ht-1;xt]+bf) (9)
it=σ(Wi[ht-1;xt]+bi) (10)
ot=σ(Wo[ht-1;xt]+bo) (11)
st=ft⊙st-1+it⊙tanh(Ws[ht-1;xt]+bs) (12)
ht=ot⊙tanh(st) (13)
4.4 time attention allocation is carried out on the encoded data:
based on previous decoder hidden state dt-1And state s 'of the last LSTM cell't-1Calculating a time attention weight for each encoder hidden state at time t using an attention mechanism
Figure BDA0002174159430000129
The formula is as follows:
Figure BDA0002174159430000122
Figure BDA0002174159430000123
4.5 weighted calculation of the time attention assigned data, weighted data and control scheme data set ytargetAnd (3) decoding:
4.5.1 calculate all hidden states hiWeighted sum vector c oft
Figure BDA0002174159430000124
4.5.2 calculating updated target output
Figure BDA0002174159430000125
Figure BDA0002174159430000126
4.5.3 updating the hidden state d at the decoder moment tt: utilizing new target output
Figure BDA0002174159430000127
And previous hidden state:
Figure BDA0002174159430000128
wherein f is2For long-term dependencies of the nonlinear activation function to build the time series, the choice is made to use the LSTM cell as f for updating the hidden state2Function, then hidden state dtThe specific calculation is as follows:
Figure BDA0002174159430000131
Figure BDA0002174159430000132
Figure BDA0002174159430000133
Figure BDA0002174159430000134
dt=o′t⊙tanh(s′t) (23)
4.5.4 estimate the output y at the current timeDT
Decoder LSTM cell output yDTAnd constructing a simulation function F through a DA-RNN structure, wherein the function F can observe given input and previous output and estimate the output at the current moment:
Figure BDA0002174159430000135
the output of the current time determines the final prediction result
Figure 3
Fifthly, training a traffic signal control scheme real-time recommendation model: and (3) taking the data set in the step (1.3) as a training data set, carrying out learning training on the deep learning algorithm model in the step (1.4), and adopting a random gradient descent optimizer method, an Adam optimizer method and an automatic parameter adjusting method to finish the training when the loss function converges to 0.02%.
The training set is used for training the model, the training set is divided into small batches, the data set is divided into N batches according to s being 128, training of the model is carried out according to the divided batch data, and batch grouping has the advantage that training of the training model can be accelerated. The training set is from a training data set, and 80% of the training data set is taken as the training set.
The test set is used for judging the model, test data is input into the model, and a loss function of the model is calculated as follows:
Figure BDA0002174159430000137
the loss function converges rapidly and to 0.02%, and the model under the parameter is judged to be available. The test set is from the training data set, taking 80% of the training data set as the test set.
In the experiment, the size of the small batch of packets is set to be S, S is 128, the learning rate is alpha, alpha is 0.01%, the iteration number is M, M is 5000, and the loss function of the experiment result is converged rapidly, as shown in the figure five. Storing the traffic signal control scheme real-time recommendation model under the parameter;
and sixthly, recommending a traffic signal control scheme at the next moment: collecting real-time traffic state data, inputting a traffic signal control scheme real-time recommendation model, obtaining prediction control scheme data output by the model, and correcting a prediction result by a mixed regression model linear regression method:
yT=yDT+yAT(26)
Figure BDA0002174159430000141
the intersection recommends a sample case of the split plan and the actual split plan as follows:
Figure BDA0002174159430000142
Figure BDA0002174159430000151
the intersection carries out real-time scheme recommendation model deployment and application in 2019 and 1 month, and fig. 6a and 6b are average speed comparison graphs of the Yanan road in 2019 and 1 month, and working days and non-working days in 2018 and 12 months respectively: the speed equalizing speed in 1 month is obviously improved compared with the speed equalizing speed in 12 months, which shows that the recommended scheme plays a role in optimizing the traffic state of the intersection and improving the speed equalizing speed of the road.
The embodiments described in this specification are merely illustrative of implementations of the inventive concept and the scope of the present invention should not be considered limited to the specific forms set forth in the embodiments but rather by the equivalents thereof as may occur to those skilled in the art upon consideration of the present inventive concept.

Claims (9)

1. A traffic signal control scheme real-time recommendation method based on deep learning comprises the following steps:
1.1 collecting traffic data including traffic control data and traffic status data, including but not limited to: signal system cycle start time, cycle end time, cycle duration, split data, belonging to traffic state data including but not limited to: flow, saturation, velocity;
1.2 preprocessing traffic data, including cleaning error data, correcting abnormal data and repairing missing data;
1.3 constructing a time series of data sets, the steps comprising:
1.3.1 extracting sample points of a data set, the sample points referring to cycle end times when control scheme data is satisfied, the control scheme data being a green to green ratio variation;
1.3.2 constructing sample point data, and extracting T groups of traffic data corresponding to the sample points
Figure 55630DEST_PATH_IMAGE002
T group control scheme data
Figure 628563DEST_PATH_IMAGE004
1 group of split data
Figure 939458DEST_PATH_IMAGE006
Wherein i represents the ith sample point; the method specifically comprises the following steps:
t groups of traffic data, the traffic state data of the sample point and the first T-1 group of the sample point which are sorted according to the cycle end time are taken out and stored in an array form as the sample point data
Figure 6771DEST_PATH_IMAGE002
The part is specifically as follows:
Figure DEST_PATH_IMAGE008
(1)
wherein
Figure DEST_PATH_IMAGE010
The duration of the cycle is referred to as the period,
Figure DEST_PATH_IMAGE012
the green ratio data of signal control phases A, B, …, G,
Figure DEST_PATH_IMAGE014
Figure DEST_PATH_IMAGE016
the traffic data of the traffic lane is indicated,
Figure DEST_PATH_IMAGE018
Figure DEST_PATH_IMAGE020
,,…,
Figure DEST_PATH_IMAGE022
Figure DEST_PATH_IMAGE024
,…,
Figure DEST_PATH_IMAGE026
the data of the saturation degree of the lane is indicated,
Figure DEST_PATH_IMAGE028
,
Figure DEST_PATH_IMAGE030
,..,
Figure DEST_PATH_IMAGE032
the number of the signal control phases is related to the intersections, and the number of the phases and the phase sequence of different intersections are different;
t sets of control scheme data, the control scheme data of the sample points and the first T-1 sets of sample points are taken out and stored in the form of an array as the sample point data
Figure DEST_PATH_IMAGE034
The part is specifically as follows:
Figure DEST_PATH_IMAGE036
(2)
wherein
Figure DEST_PATH_IMAGE038
Means amount of change in adjacent green ratio of A phase
Figure DEST_PATH_IMAGE040
Figure DEST_PATH_IMAGE042
Figure DEST_PATH_IMAGE044
The dimensionality of the intersection is determined by the actual running phase quantity of the intersection;
a set of split plan data: the green ratio data of the next moment of the sample point is taken and stored in an array form as
Figure DEST_PATH_IMAGE046
The part is specifically as follows:
Figure DEST_PATH_IMAGE048
(3)
wherein
Figure DEST_PATH_IMAGE050
Green ratio of finger phase
Figure DEST_PATH_IMAGE052
Figure DEST_PATH_IMAGE054
The dimensionality of the intersection is determined by the actual running phase quantity of the intersection;
1.3.3 constructing time series data sets that meet training requirements, including traffic data sets
Figure DEST_PATH_IMAGE056
Control scheme data set
Figure DEST_PATH_IMAGE058
Sum-to-Lu ratio data set
Figure DEST_PATH_IMAGE060
Training requirements include, but are not limited to: the number of sample points, the length of time for extracting traffic data for the sample points;
1.4, constructing a deep learning algorithm model based on CNN-DA-RNN, wherein the first layer CNN adopts an unsubbed convolutional layer neural network, and obtains output data with unchanged dimensionality by performing convolution calculation on input data, and the output data is used for extracting the short-time dependency relationship and the dependency relationship among variables in the time dimensionality of the input data; the second layer DA-RNN adopts a recurrent neural network based on a two-stage attention mechanism, and is used for performing attention allocation in space and time dimensions on input data, and encoding and decoding, and specifically includes:
1.4.1 performing spatial attention distribution on the traffic data set data output from the convolutional layer;
1.4.2, encoding the data after the space attention distribution;
1.4.3 time attention distribution is carried out on the coded data;
1.4.4 weighted calculation of the time attention assigned data, weighted data and control scheme data set
Figure 683128DEST_PATH_IMAGE062
Decoding is carried out;
1.5 training a traffic signal control scheme real-time recommendation model: taking the data set in the step 1.3 as a training data set, and performing learning training on the deep learning algorithm model in the step 1.4, wherein the method for training the model includes but is not limited to: the method comprises a random gradient descent optimizer method, an Adam optimizer method and an automatic parameter adjusting method, wherein the end condition of a training model is that the convergence degree of a loss function meets requirements, and the loss function is the mean square error of data of a prediction control scheme and data of an actual control scheme.
1.6 recommending a traffic signal control scheme at the next moment: and collecting real-time traffic state data, inputting a traffic signal control scheme real-time recommendation model, and obtaining prediction control scheme data output by the model.
2. The method according to claim 1, wherein the concept and data dimension of the specific content of the traffic control data in step 1.1 are as follows:
the period duration refers to the time required by the signal lamp for displaying various lamp colors for one week in turn, and the data dimension is 1 dimension;
the split is green ratio data of each phase of the road junction, and the data dimension is the phase number;
the traffic state data refers to traffic state data of all lanes at the intersection, and the data dimension is the number of traffic state data types and the number of lanes.
3. The method of claim 1, wherein the step 1.2 of cleaning the error data comprises deleting default values and duplicate values;
correcting abnormal data, judging whether the data is an abnormal value by using a t test method in statistics, carrying out interpolation processing on the abnormal value by using a spline function method, and carrying out interpolation by using historical data; repairing missing data by adopting a multivariate linear regression model method, and comprising the following steps of: (1) making a scatter diagram on the existing data and performing multiple regression processing; (2) solving a multiple linear regression polynomial and a confidence interval; (3) making a residual error analysis graph, and verifying the fitting effect, wherein the smaller the residual error is, the better the coincidence degree of the regression polynomial and the source data is, and (4) supplementing the missing data by a polynomial equation with the minimum residual error.
4. The method for recommending traffic signal control schemes based on deep learning of claim 1, wherein the extraction method of the time dependency relationship and the dependency relationship between variables of the unsuccessfully-pooled convolutional neural network for the input data is as follows in step 1.4:
the input data of the convolutional layer is a time-series traffic data set
Figure 36749DEST_PATH_IMAGE064
The convolutional layer is composed of a plurality of filters with width omega and height n, wherein the setting of width omega is the same as the green ratio of input data, the setting of height n is the same as the column dimension of input data variable, and the k filter scans the input matrix
Figure 974223DEST_PATH_IMAGE066
And generating:
Figure 704281DEST_PATH_IMAGE068
(4)
meaning of formula: wherein x represents the operation of convolution,
Figure 779685DEST_PATH_IMAGE070
is the output vector of the output,
Figure DEST_PATH_IMAGE072
linearly modify the activation function of the unit neurons,
Figure DEST_PATH_IMAGE074
the function can accelerate gradient descent and backward propagation, avoid the problems of sharp gradient rise and sharp gradient decrease,
Figure DEST_PATH_IMAGE076
the convolution matrix and the offset to be learned are continuously corrected in the training process, and the range of k is the ratio of the length of input data to the size (omega n) of the filter;
to preserve the output after convolution
Figure DEST_PATH_IMAGE078
Is consistent with the dimension of the input data by aligning the input matrix
Figure DEST_PATH_IMAGE080
The method for increasing the dimension is realized, and the variable value of the dimension is increased to be 0; the method comprises the following implementation processes:
Figure 350212DEST_PATH_IMAGE082
of dimension i x j, convolution matrix
Figure 776645DEST_PATH_IMAGE084
Dimension 3 x 3, for obtaining dimension i x j
Figure 44816DEST_PATH_IMAGE086
By variation of
Figure 223993DEST_PATH_IMAGE088
Is (i +1) × (j +1) and the variable value of the added dimension is 0.
5. The deep learning-based traffic signal control scheme real-time recommendation method according to claim 1, wherein the method is characterized in that
The specific process 1.4.1: the spatial attention allocation is the first stage of the two-stage attention mechanism, the spatial attention is introduced as the input attention mechanism, the correlation is automatically extracted for the input data at each moment, and the input attention weight is calculated according to the previous hidden state of the encoder, and the method comprises the following steps:
for input data
Figure 184996DEST_PATH_IMAGE090
At each moment of time of
Figure 833146DEST_PATH_IMAGE092
Using the attention mechanism, the formula is as follows:
Figure 32571DEST_PATH_IMAGE094
(5)
Figure DEST_PATH_IMAGE096
(6)
wherein
Figure 613725DEST_PATH_IMAGE098
Is the last hidden state
Figure DEST_PATH_IMAGE100
And the last state
Figure 135842DEST_PATH_IMAGE102
The function of the cascade of functions of (a),
Figure 395922DEST_PATH_IMAGE104
refers to the parameters of the high-dimensional matrix to be learned,
Figure 146840DEST_PATH_IMAGE106
is the spatial attention weight assigned to the kth input feature at time t, the output after spatial attention assignment
Figure 175976DEST_PATH_IMAGE108
Comprises the following steps:
Figure 134574DEST_PATH_IMAGE110
(7)
the specific process 1.4.2 encodes the data after spatial attention allocation: the neural state of LSTM of the encoder LSTM unit is dynamically summed along with the time, the long-term dependence relationship is memorized, the problem of rapid gradient reduction is easily solved, the method is effective for processing the time sequence problem, and the LSTM method is used for inputting data
Figure 491737DEST_PATH_IMAGE112
The encoding method is as follows:
first the encoder can learn from
Figure 905401DEST_PATH_IMAGE114
Mapping of (2):
Figure 913677DEST_PATH_IMAGE116
(8)
wherein
Figure 528329DEST_PATH_IMAGE118
For the hidden state of the encoder at time t,
Figure 497422DEST_PATH_IMAGE120
the last one of the hidden states is referred to,
Figure 576761DEST_PATH_IMAGE122
is a non-linear activation function;
secondly, the encoding unit updates the state using the LSTM network as an activation function: the LSTM recurrent neural network comprises a forgetting gate
Figure 580489DEST_PATH_IMAGE124
Output door
Figure 100463DEST_PATH_IMAGE126
Each LSTM cell has a state at time t
Figure DEST_PATH_IMAGE128
Memory cell, state
Figure DEST_PATH_IMAGE130
The updating method comprises the following steps:
Figure DEST_PATH_IMAGE132
(9)
Figure DEST_PATH_IMAGE134
(10)
Figure 947065DEST_PATH_IMAGE136
(11)
Figure 827166DEST_PATH_IMAGE138
(12)
Figure 419821DEST_PATH_IMAGE140
(13)
wherein the content of the first and second substances,
Figure 376276DEST_PATH_IMAGE142
is the previous hidden state
Figure 444595DEST_PATH_IMAGE144
And current input
Figure 3752DEST_PATH_IMAGE146
A cascade function of which
Figure DEST_PATH_IMAGE148
Figure DEST_PATH_IMAGE150
Figure DEST_PATH_IMAGE152
Is a parameter to be trained and learned,
Figure DEST_PATH_IMAGE154
respectively logical function and element multiplication;
the specific process 1.4.3 performs time attention allocation on the encoded data: the time attention allocation is the second stage of the two-stage attention mechanism, and the time attention mechanism is introduced to capture the long-term timing dependence information of the encoder and apply the hidden state of the previous decoder to the input state data
Figure DEST_PATH_IMAGE156
The time attention weight is calculated by the following method:
based on previous decoder hidden states
Figure DEST_PATH_IMAGE158
And the state of the last LSTM cell
Figure DEST_PATH_IMAGE160
Calculate each compilation using an attention mechanismTime attention weight of decoder hidden state at time t
Figure DEST_PATH_IMAGE162
The formula is as follows:
Figure DEST_PATH_IMAGE164
(14)
Figure DEST_PATH_IMAGE166
(15)
wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE168
is a cascaded function of the hidden state of the previous decoder and the state of the last LSTM unit,
Figure DEST_PATH_IMAGE170
are the high-dimensional matrix parameters that need to be learned,
Figure DEST_PATH_IMAGE172
is allocated at time t
Figure DEST_PATH_IMAGE174
Temporal attention weight of group features;
the specific process 1.4.4) weights and calculates the data after the time attention distribution, and weights the data sum
Figure DEST_PATH_IMAGE176
And (3) decoding: the method comprises the following steps:
1.4.4.1. computing all hidden states
Figure DEST_PATH_IMAGE178
Weighted sum vector of
Figure DEST_PATH_IMAGE180
Figure DEST_PATH_IMAGE182
(16)
Wherein the content of the first and second substances,
Figure DEST_PATH_IMAGE184
is the decoder LSTM unit input.
1.4.4.2. Calculating updated target outputs
Figure 396120DEST_PATH_IMAGE186
Figure 116951DEST_PATH_IMAGE188
(17)
Wherein the content of the first and second substances,
Figure 423298DEST_PATH_IMAGE190
is the output state of the last decoder
Figure 520567DEST_PATH_IMAGE192
And the last one of the weighted sums of all hidden states
Figure 478028DEST_PATH_IMAGE194
The function of the cascade of functions of (a),
Figure 635340DEST_PATH_IMAGE196
are the parameters to be learned and trained.
1.4.4.3. Updating the hidden state of the decoder at time t
Figure 428984DEST_PATH_IMAGE198
: utilizing new target output
Figure 329943DEST_PATH_IMAGE200
And previous hidden state:
Figure 876331DEST_PATH_IMAGE202
(18)
wherein
Figure DEST_PATH_IMAGE204
For long-term dependencies of the nonlinear activation function for establishing the time series, the choice is made to use LSTM cells as the update of the hidden state
Figure 876648DEST_PATH_IMAGE204
Function, then hidden state
Figure DEST_PATH_IMAGE206
The specific calculation is as follows:
Figure DEST_PATH_IMAGE208
(19)
Figure DEST_PATH_IMAGE210
(20)
Figure DEST_PATH_IMAGE212
(21)
Figure DEST_PATH_IMAGE214
(22)
Figure 65578DEST_PATH_IMAGE216
(23)
wherein the content of the first and second substances,
Figure 645595DEST_PATH_IMAGE218
is the previous hidden state
Figure 187434DEST_PATH_IMAGE220
And of the preceding objective function
Figure 76762DEST_PATH_IMAGE222
Of a cascade function of, wherein
Figure 579418DEST_PATH_IMAGE224
Is a parameter to be trained and learned,
Figure 822181DEST_PATH_IMAGE226
respectively logical functions and element multiplications.
1.4.4.4. Estimating the output of the current time
Figure 343161DEST_PATH_IMAGE228
:
Decoder LSTM cell output
Figure 278756DEST_PATH_IMAGE228
And constructing a simulation function F through a DA-RNN structure, wherein the function F can observe given input and previous output and estimate the output at the current moment:
Figure 268709DEST_PATH_IMAGE230
(24)
wherein
Figure DEST_PATH_IMAGE232
Is a hidden state of the decoding layer
Figure DEST_PATH_IMAGE234
Sum vector
Figure DEST_PATH_IMAGE236
Cascade function of, parameter
Figure 911567DEST_PATH_IMAGE238
Is a parameter to be learned and trained, the weight of a linear function
Figure 552633DEST_PATH_IMAGE240
And bias
Figure 393550DEST_PATH_IMAGE242
Is the parameter to be learned, determines the final prediction result
Figure 870799DEST_PATH_IMAGE244
6. The method for recommending traffic signal control scheme based on deep learning of claim 1, wherein the loss function of step 1.5 is calculated and judged by:
the training process of the model comprises the steps of grouping all data in small batches, and training the model by using a Stochastic Gradient Descent (SGD) optimizer and an Adam optimizer; designing a smooth and differentiable output result to ensure that the parameters can be obtained through standard reverse propagation learning; designing loss functions of the objective function, namely predicted control scheme data and actual control scheme data:
Figure 455364DEST_PATH_IMAGE246
(25)
where N is the amount of samples of training,
Figure 950936DEST_PATH_IMAGE248
is a predicted solution to be used in the future,
Figure 228334DEST_PATH_IMAGE250
is a practical solution, the result of the training is that the loss function converges rapidly to a very small value
Figure 927300DEST_PATH_IMAGE252
Contract to
Figure DEST_PATH_IMAGE254
7. The method for recommending traffic signal control scheme based on deep learning of claim 1, wherein step 1.6 further comprises: correcting the data of the predictive control scheme output by the real-time traffic signal control scheme recommendation model, and specifically adopting the following steps:
corrected result
Figure DEST_PATH_IMAGE256
Including predictive control scheme data
Figure DEST_PATH_IMAGE258
And the result of the linear regression calculation of the mixed regression model
Figure DEST_PATH_IMAGE260
Two parts, the vector sum of the two:
Figure 833332DEST_PATH_IMAGE262
(26)
wherein the model for the linear regression calculation is:
Figure 58777DEST_PATH_IMAGE264
(27)
where q is the input matrix
Figure 116863DEST_PATH_IMAGE266
K refers to the kth filter,
Figure 958917DEST_PATH_IMAGE268
are parameters that need to be learned.
8. The method for recommending traffic signal control scheme based on deep learning of claim 1, wherein step 1.6 further comprises: judging whether the output control scheme data meets constraint conditions, wherein the constraint conditions comprise:
(1) whether the actual flow and saturation conditions of the intersection are met or not; (2) whether the recommendation period is less than the maximum period time of the intersection or not is judged; (3) whether the recommended green signal ratio of each phase is greater than the minimum green light or not; (4) whether the safety time of the pedestrian phase is completely met; (5) whether the time setting of the particular phase is completed.
9. The deep learning-based traffic signal control scheme real-time recommendation method according to claim 1, wherein: the sample point described in step 1.3 refers to the cycle end time when the control scheme data meets the requirement, and the control scheme data is the variation of the split ratio, specifically:
and judging the variation of the green signal ratio of any phase in the two adjacent groups of data, and when the variation exceeds 5% of the total cycle time, the control scheme data meets the requirement.
CN201910772945.8A 2019-08-21 2019-08-21 Deep learning-based traffic signal control scheme real-time recommendation method Active CN110491146B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910772945.8A CN110491146B (en) 2019-08-21 2019-08-21 Deep learning-based traffic signal control scheme real-time recommendation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910772945.8A CN110491146B (en) 2019-08-21 2019-08-21 Deep learning-based traffic signal control scheme real-time recommendation method

Publications (2)

Publication Number Publication Date
CN110491146A CN110491146A (en) 2019-11-22
CN110491146B true CN110491146B (en) 2020-08-21

Family

ID=68551595

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910772945.8A Active CN110491146B (en) 2019-08-21 2019-08-21 Deep learning-based traffic signal control scheme real-time recommendation method

Country Status (1)

Country Link
CN (1) CN110491146B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111127887B (en) * 2019-12-19 2021-03-16 银江股份有限公司 Signal control optimization system and method based on real-time traffic state driving
CN111127892A (en) * 2019-12-27 2020-05-08 北京易华录信息技术股份有限公司 Intersection timing parameter optimization model construction and intersection signal optimization method
CN111260919B (en) * 2020-01-15 2021-06-22 厦门大学 Traffic flow prediction method
CN111260937B (en) * 2020-02-24 2021-09-14 武汉大学深圳研究院 Cross traffic signal lamp control method based on reinforcement learning
CN111445694B (en) * 2020-03-04 2022-02-01 青岛海信网络科技股份有限公司 Festival and holiday traffic scheduling method and device based on traffic flow prediction
CN111554111B (en) * 2020-04-21 2021-04-20 河北万方中天科技有限公司 Signal timing optimization method and device based on multi-source data fusion and terminal
CN111341109B (en) * 2020-05-19 2020-09-11 银江股份有限公司 City-level signal recommendation system based on space-time similarity
CN111785014B (en) * 2020-05-26 2021-10-29 浙江工业大学 Road network traffic data restoration method based on DTW-RGCN
CN111899510B (en) * 2020-07-28 2021-08-20 南京工程学院 Intelligent traffic system flow short-term prediction method and system based on divergent convolution and GAT
CN112183453B (en) * 2020-10-15 2021-05-11 哈尔滨市科佳通用机电股份有限公司 Deep learning-based water injection port cover plate unlocking-in-place fault detection method and system
CN113257008B (en) * 2021-05-12 2022-06-21 兰州交通大学 Pedestrian flow dynamic control system and method based on deep learning
CN113449905A (en) * 2021-05-21 2021-09-28 浙江工业大学 Traffic jam early warning method based on gated cyclic unit neural network
CN114038212B (en) * 2021-10-19 2023-03-28 南京航空航天大学 Signal lamp control method based on two-stage attention mechanism and deep reinforcement learning
CN115083160A (en) * 2022-06-15 2022-09-20 国汽智控(北京)科技有限公司 Signal lamp control method and device and signal lamp controller
CN115146842B (en) * 2022-06-24 2023-07-18 沈阳建筑大学 Multi-element time sequence trend prediction method and system based on deep learning
CN116522007B (en) * 2023-07-05 2023-10-20 中国科学技术大学 Recommendation system model-oriented data forgetting learning method, device and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107230351A (en) * 2017-07-18 2017-10-03 福州大学 A kind of Short-time Traffic Flow Forecasting Methods based on deep learning
CN108664687A (en) * 2018-03-22 2018-10-16 浙江工业大学 A kind of industrial control system space-time data prediction technique based on deep learning
CN109472984A (en) * 2018-12-27 2019-03-15 苏州科技大学 Signalized control method, system and storage medium based on deeply study
CN109887284A (en) * 2019-03-13 2019-06-14 银江股份有限公司 A kind of smart city traffic signalization recommended method, system and device

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4082312B2 (en) * 2003-08-28 2008-04-30 オムロン株式会社 Signal control device
US8825350B1 (en) * 2011-11-22 2014-09-02 Kurt B. Robinson Systems and methods involving features of adaptive and/or autonomous traffic control
KR20150072471A (en) * 2013-12-19 2015-06-30 건국대학교 산학협력단 Traffic flow prediction system using spatiotemporal stochastic model
KR101797818B1 (en) * 2015-10-26 2017-11-15 현대오토에버 주식회사 Apparatus and method for providing road information based on deep learnig
CN106023605B (en) * 2016-07-15 2018-10-12 汤一平 A kind of method for controlling traffic signal lights based on depth convolutional neural networks
CN106846834A (en) * 2017-01-21 2017-06-13 亚信蓝涛(江苏)数据科技有限公司 A kind of traffic control optimization method based on deep learning
CN106710253B (en) * 2017-03-10 2019-12-10 东南大学 High-reliability intelligent intersection traffic control system and control method
CN106971563B (en) * 2017-04-01 2020-05-19 中国科学院深圳先进技术研究院 Intelligent traffic signal lamp control method and system
JP6922425B2 (en) * 2017-05-29 2021-08-18 株式会社デンソー Vehicle display control device and display control program
CN107016861A (en) * 2017-05-31 2017-08-04 电子科技大学 Traffic lights intelligent control system based on deep learning and intelligent road-lamp
CN109697867B (en) * 2019-01-28 2020-10-20 深圳市欧德克科技有限公司 Deep learning-based traffic control method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107230351A (en) * 2017-07-18 2017-10-03 福州大学 A kind of Short-time Traffic Flow Forecasting Methods based on deep learning
CN108664687A (en) * 2018-03-22 2018-10-16 浙江工业大学 A kind of industrial control system space-time data prediction technique based on deep learning
CN109472984A (en) * 2018-12-27 2019-03-15 苏州科技大学 Signalized control method, system and storage medium based on deeply study
CN109887284A (en) * 2019-03-13 2019-06-14 银江股份有限公司 A kind of smart city traffic signalization recommended method, system and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于深度神经网络的时间序列预测技术研究;吴双双;《中国优秀硕士学位论文全文数据库(信息科技辑)》;20190115;全文 *
基于神经网络的城市交通信号灯的智能控制系统设计;吴夏旭;《中国优秀硕士学位论文全文数据库(信息科技辑)》;20150515;全文 *

Also Published As

Publication number Publication date
CN110491146A (en) 2019-11-22

Similar Documents

Publication Publication Date Title
CN110491146B (en) Deep learning-based traffic signal control scheme real-time recommendation method
CN111161535B (en) Attention mechanism-based graph neural network traffic flow prediction method and system
CN111210633B (en) Short-term traffic flow prediction method based on deep learning
CN110619430B (en) Space-time attention mechanism method for traffic prediction
CN103177570B (en) Method for predicting traffic jam indexes for rush hours in morning and evening
CN109886387B (en) Traffic time sequence prediction method based on gating network and gradient lifting regression
CN110060489B (en) Signal lamp timing scheme recommendation method based on neural network
Massaoudi et al. Performance evaluation of deep recurrent neural networks architectures: Application to PV power forecasting
CN111553535B (en) Navigation reference method based on AE-LSTM-BO traffic flow prediction
CN111862592B (en) Traffic flow prediction method based on RGCN
CN111160650B (en) Adaboost algorithm-based traffic flow characteristic analysis and prediction method
CN111860787A (en) Short-term prediction method and device for coupling directed graph structure flow data containing missing data
CN112508305A (en) Public place entrance pedestrian flow prediction method based on LSTM
CN110570035A (en) people flow prediction system for simultaneously modeling space-time dependency and daily flow dependency
CN112614346B (en) Short-term traffic flow prediction method based on singular spectrum analysis and echo state network
CN111341109B (en) City-level signal recommendation system based on space-time similarity
CN115410372A (en) Reliable prediction method for highway traffic flow based on Bayesian LSTM
CN112784479A (en) Flood flow prediction method
CN115206092A (en) Traffic prediction method of BiLSTM and LightGBM model based on attention mechanism
CN116865251A (en) Short-term load probability prediction method and system
CN113192325B (en) Traffic incident correlation analysis method and system
CN111815075B (en) Prediction method for transportation travel demand under major public health incident
Guo et al. A multi-step traffic speed forecasting model based on graph convolutional LSTM
Zaki et al. Time aware hybrid hidden Markov models for traffic Congestion prediction
CN113096378B (en) Highway OD prediction method based on depth set empirical mode decomposition

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 310014 No. 18 Chao Wang Road, Xiacheng District, Zhejiang, Hangzhou

Patentee after: ZHEJIANG University OF TECHNOLOGY

Patentee after: Yinjiang Technology Co., Ltd

Address before: 310014 No. 18 Chao Wang Road, Xiacheng District, Zhejiang, Hangzhou

Patentee before: ZHEJIANG University OF TECHNOLOGY

Patentee before: Yinjiang Co., Ltd