CN112270355A - Active safety prediction method based on big data technology and SAE-GRU - Google Patents

Active safety prediction method based on big data technology and SAE-GRU Download PDF

Info

Publication number
CN112270355A
CN112270355A CN202011172029.XA CN202011172029A CN112270355A CN 112270355 A CN112270355 A CN 112270355A CN 202011172029 A CN202011172029 A CN 202011172029A CN 112270355 A CN112270355 A CN 112270355A
Authority
CN
China
Prior art keywords
data set
traffic
running state
risk
representing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011172029.XA
Other languages
Chinese (zh)
Other versions
CN112270355B (en
Inventor
郝威
吴其育
戎栋磊
张兆磊
易可夫
伍文广
吴伟
李永福
王正武
谷健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changsha University of Science and Technology
Original Assignee
Changsha University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changsha University of Science and Technology filed Critical Changsha University of Science and Technology
Priority to CN202011172029.XA priority Critical patent/CN112270355B/en
Publication of CN112270355A publication Critical patent/CN112270355A/en
Application granted granted Critical
Publication of CN112270355B publication Critical patent/CN112270355B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
    • G06F18/24133Distances to prototypes
    • G06F18/24137Distances to cluster centroïds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0133Traffic data processing for classifying traffic situation
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Analytical Chemistry (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Chemical & Material Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Game Theory and Decision Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses an active safety prediction method based on big data technology and SAE-GRU, firstly, obtaining an original data set and preprocessing the original data set to form a training data set; carrying out dynamic traffic running state identification based on cluster analysis by utilizing a training data set to obtain a sample data set with a traffic running state label; taking a sample data set with a traffic running state label as prior knowledge of classification analysis to generate a traffic running state classifier; adopting a training data set to construct a data set for risk operation state judgment, carrying out risk operation state judgment according to different traffic operation states to obtain a training data set with a risk operation state label, training an SAE-GRU model by using the training data set, and simultaneously obtaining an optimal SAE-GRU active safety prediction model by parameter adjustment; and carrying out active safety prediction by using an SAE-GRU active safety prediction model. The application range is wide, and the requirements of high-precision and high-efficiency prediction are met.

Description

Active safety prediction method based on big data technology and SAE-GRU
Technical Field
The invention belongs to the technical field of traffic state identification, and relates to a main road active safety prediction method based on a traffic big data technology and SAE-GRU.
Background
With the rapid rise of automobile holding capacity and the advance of urbanization, traffic jam, frequent accidents and management delay become main problems which hinder traffic development. The traffic decision based on big data and deep learning is more intelligent, and support is provided for relieving traffic jam, optimizing road resources and improving safety indexes. Therefore, the construction of a perfect active safety management method by utilizing a big data technology to mine characteristic parameters related to traffic safety and applying deep learning to establish a safety model has become a research hotspot of an Intelligent Transportation System (ITS).
In the aspect of traffic running state identification, when the sample size is large, the calculation is complex, and the phenomenon of failure in judgment is often caused; in the aspect of risk operation state identification, the method mainly focuses on principal component analysis and multiple regression analysis, and the reliability, effectiveness and universality of identification are not evaluated. Therefore, it is necessary to construct a state recognition model with migratable learning.
In recent years, traffic data explosion has led to an increase in the difficulty of analysis and data value. For better analysis, mining and modeling, various state label based prediction methods are proposed. Although the existing research method promotes the accuracy of traffic state identification and safety prediction to a certain extent, the following defects still exist:
1. the relevance between the traffic running state and the active safety prediction method is poor, so that the information is disjointed, and the risk running state cannot be accurately predicted;
2. a model with universal applicability in a certain field cannot be researched, so that a state identification method is narrow;
3. the traffic data volume is rapidly increased, the existing method cannot meet the prediction requirements of high precision and high efficiency, and the applicability is reduced.
Disclosure of Invention
The embodiment of the invention aims to provide an active safety prediction method based on a big data technology and SAE-GRU (system architecture analysis-general rule Unit), and aims to solve the problems that the relevance of a traffic running state and the active safety prediction method is poor, information is disconnected, and a risk running state cannot be accurately predicted, and the problems that the application range of the existing prediction method based on a state label is narrow, and the high-precision and high-efficiency prediction requirements cannot be met.
The technical scheme adopted by the embodiment of the invention is that the active safety prediction method based on the big data technology and SAE-GRU is carried out according to the following steps:
step S1, obtaining an original data set, wherein the original data set comprises a plurality of single data sets, each single data set comprises 5 types of feature data including average vehicle speed, average acceleration, average occupancy, average queuing time and average travel time in a delta t time interval and 1 type of conventional data including average standard time, and the original data set is preprocessed to form a training data set;
s2, performing dynamic traffic running state recognition based on cluster analysis by using the training data set obtained in the step S1 to obtain a sample data set with a traffic running state label;
s3, using the sample data set with the traffic running state label as the prior knowledge of classification analysis to generate a traffic running state classifier;
s4, constructing a data set for risk running state judgment by adopting the training data set formed in the step S1, and judging the risk running state of the constructed data set for risk running state judgment according to different traffic running states based on a fuzzy comprehensive evaluation method to obtain a training data set with a risk running state label;
step S5, an SAE-GRU model is established, a training process is carried out on the established SAE-GRU model by adopting the training data set with the risk running state label obtained in the step S4, and an optimal SAE-GRU active safety prediction model is obtained through parameter adjustment;
and step S6, performing active safety prediction on the main trunk by using the obtained optimal SAE-GRU active safety prediction model, and predicting to obtain the risk running state in the next stage, namely the next delta t time interval.
The embodiment of the invention has the beneficial effects that:
1) after different traffic running state classification results are obtained based on a dynamic traffic running state classifier, fuzzy comprehensive judgment is carried out on traffic flow attributes and vehicle running attributes in all traffic running states to obtain a risk running state, SAE-GRU is finally used for carrying out active safety prediction based on a risk running state label, the traffic running state is tightly connected with the active safety prediction, and the problems that information is disconnected and the risk running state cannot be accurately predicted due to the fact that the traffic running state and the active safety prediction method are poor in relevance in the existing prediction method based on the state label are solved.
2) The SAE-GRU active safety prediction model provided by the embodiment of the invention can be applied to different scenes (main roads, intersections, expressways and the like), so that the model is favorable for improving the universality and the mobility of prediction, effectively expands the application range and solves the problem of narrow application range of the conventional prediction method based on the state label.
3) The embodiment of the invention firstly obtains the data characteristics based on an SAE model, and then realizes high-efficiency and high-precision safety prediction based on a GRU model. Therefore, the combination of SAE and GRU effectively excavates data characteristics, optimizes the prediction process on the premise of ensuring prediction accuracy, improves training efficiency, and solves the problems that the existing prediction method based on the state label cannot meet the prediction requirements of high accuracy and high efficiency and the applicability is reduced.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a schematic structural diagram of a dynamic state recognition model according to an embodiment of the present invention.
FIG. 2 is a graph of the variation of the coefficient of variation of velocity according to an embodiment of the present invention.
FIG. 3 is a graph showing the change in occupancy coefficient of variation according to an embodiment of the present invention.
FIG. 4 is a graph of variation of the travel time coefficient of variation according to an embodiment of the present invention.
FIG. 5(a) is a velocity coefficient of variation distribution plot according to an embodiment of the present invention.
FIG. 5(b) is a distribution diagram of occupancy coefficient of variation in accordance with an embodiment of the present invention.
FIG. 5(c) is a graph of the travel time coefficient of variation distribution according to an embodiment of the present invention.
FIG. 6 is a diagram illustrating controlling reset/update gating according to an embodiment of the present invention.
FIG. 7 is a diagram illustrating h' update according to an embodiment of the present invention.
Fig. 8 is a schematic diagram of the GRU neural network operation according to an embodiment of the present invention.
FIG. 9(a) is a graph comparing the prediction accuracy and RMSE for groups 1-5 of an embodiment of the present invention.
FIG. 9(b) is a graph comparing the prediction accuracy and RMSE for groups 5-10 of an embodiment of the present invention.
FIG. 9(c) is a graph comparing the prediction accuracy and RMSE for sets 10-15 of an embodiment of the present invention.
Fig. 10 is a graph of clustering results and status division of trunk traffic flow data.
FIG. 11 is a graph of the predicted temporal comparisons of SAE-GRU, LSTM, CNN-LSTM.
FIG. 12 is a graph comparing MAE and RMSE for SAE-GRU, LSTM, CNN-LSTM.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The embodiment of the invention provides an active safety prediction method based on big data technology and SAE-GRU, which is carried out according to the following steps:
step S1, obtaining an original data set, where the original data set includes a plurality of single data sets, each single data set includes 5 types of feature data, i.e., average vehicle speed, average acceleration, average occupancy, average queuing time, and average travel time, and 1 type of conventional data, i.e., average standard time, in a Δ t time interval, and the original data set is preprocessed to form a training data set:
the specific analysis was performed using the sunset avenue, california as an example, with the following steps:
firstly, acquiring an original data set, collecting traffic flow data according to 30s by taking a working day from 1 month and 1 day in 2020 to 28 days in 2020 and 2 months and 28 days in 2020 as a research time period, wherein the data are fitted at intervals of 5min to form 9805 data sets, and each data set comprises 5 types of characteristic data of vehicle speed, acceleration, occupancy, queuing time and travel time and 1 type of standard time conventional data.
Then, the raw data set collected in step S11 is preprocessed to form a training data set:
step 1, data cleaning and filling: calculating the missing rate of each single data set in the sample data set, wherein the missing rate of the single data set is the missing number of the data of the single data set/the total number of the data of the single data set, and deleting the single data set when the missing rate of the single data set is more than or equal to 80%; when the single data set loss rate is less than 80%, filling non-characteristic data (such as time) in the single data set by adopting statistics, filling characteristic data (such as speed, acceleration and the like) in the single data set by a Lagrange interpolation method of formulas (1) to (2), establishing a polynomial function for associating a plurality of data points, and fully utilizing the time sequence of the data:
Figure RE-GDA0002822952400000041
Figure RE-GDA0002822952400000042
wherein L (x) represents a missing value to be found,
Figure RE-GDA0002822952400000044
as interpolation basis functions, xjDenotes the j-th position point, xiDenotes dividing by xjOther position points than the above, x representing the position point of the missing value to be found, yjDenotes xjThe value of the position point, k represents the number of the given value points;
step 2, feature transformation: in the embodiment of the invention, a maximum-minimum normalization method is adopted to perform characteristic transformation on each numerical value of each type of characteristic data in a sample data set after data cleaning and filling are completed, so that the numerical value of each type of data is in a range of [0,1], and the formula of the maximum-minimum normalization method is shown as a formula (3):
Figure RE-GDA0002822952400000043
wherein, ynewRepresenting the value after feature transformation, y representing the value before feature transformation, yminRepresenting each class of features in a sample data setMinimum of the characterization data, ymaxAnd representing the maximum value of each type of feature data in a sample data set, wherein the sample data set refers to an original data set.
Step S2, performing dynamic traffic state recognition based on clustering analysis by using the training data set obtained in step S1, and obtaining a sample data set with a traffic state label, where in the embodiment of the present invention, the dynamic traffic state recognition is performed based on a fuzzy C-means clustering algorithm, as shown in fig. 1, and the specific steps are as follows:
step S21, constructing a traffic flow data set X ═ X1,x2,…,xi,…xn]TAnd making each sample item X in the traffic flow data set Xi=(xi1,xi2,xi3,xi4,xi5) Each sample item xiEach composed of traffic flow attribute parameters and vehicle operating parameters, wherein xi1、xi2、xi3、xi4、xi5Respectively and correspondingly representing the average speed, the average occupancy, the average acceleration, the average delay time and the average travel time in a delta t time interval in a one-to-one manner, wherein the average speed and the average occupancy in the delta t time interval are traffic flow attribute parameters, and the average acceleration, the average delay time and the average travel time in the delta t time interval are vehicle operation parameters; dividing all sample items in a traffic flow data set X into k classes C ═ C1,c2,…,ck-each category represents a traffic state; and constructing a clustering loss function according to equation (4):
Figure RE-GDA0002822952400000051
wherein J (U, X, C) represents a clustering loss function, XiRepresenting the ith sample item in the traffic flow data set X, wherein n is the total number of samples in the traffic flow data set X; k is the number of clusters, CjRepresents the jth class cjU represents a membership matrix, U ═ Uij},uijRepresents a sample xiFor class cjDegree of membership of, m is fuzzyThe weighting index, ξ, represents the cluster loss function constraint space, which is defined in equation (5):
Figure RE-GDA0002822952400000052
step S22, initializing a membership degree matrix U, and calculating and updating the jth category c according to formulas (6) to (7)jFuzzy clustering center C ofjAnd membership matrix U:
Figure RE-GDA0002822952400000053
Figure RE-GDA0002822952400000061
in the formula (d)ijRepresenting a sample item xiAnd the jth class cjFuzzy clustering center C ofjEuclidean distance of dihRepresenting a sample item xiAnd h category chFuzzy clustering center C ofhThe Euclidean distance of (c);
step S23, comparing the membership degree matrix U of the lambda +1 iterationλ+1And a membership matrix U of the lambda iterationλIf Uλ+1-Uλ| ≦ epsilon or λ ═ λmaxStopping iteration and outputting the current fuzzy clustering center CjAnd each sample item X in the traffic flow data set XiFor the current fuzzy clustering center CjJ is more than or equal to 1 and less than or equal to 3; otherwise, returning to the step S22, and continuing the iteration, wherein lambda is the iteration number, lambdamaxAnd epsilon is the maximum iteration number and the iteration termination threshold. The embodiment of the invention sets the clustering number k to be 3, respectively represents three traffic states of smooth traffic, congestion traffic and congestion traffic, the fuzzy weighting index m is 2, and the maximum iteration number lambda ismax1000, the iteration end threshold epsilon is 10-5(ii) a The clustering loss function constructed by the formulas (6) and (7) is replaced to the minimum, and when the condition of | | | U is metλ+1-Uλ| ≦ epsilon or λ ═ λmaxAnd (4) when the target function is optimal, the loss is minimum.
Step S24, fuzzy clustering center C output according to step S23jAnd each sample item X in the traffic flow data set XiFor the current fuzzy clustering center CjJ is more than or equal to 1 and less than or equal to 3, and a traffic running state class label set Y corresponding to the traffic flow data set X is determined by the maximum membership method, wherein Y is { Y | Y ═ Y%iτ; i is 1,2, …, n, τ is 1,2,3, yiFor each sample item X in the traffic flow data set XiThe traffic running state class labels 1,2 and 3 are the fuzzy clustering centers C1~C3Clustering labels in one-to-one correspondence;
step S25, according to the traffic running state class label set Y obtained in step S24, the traffic flow data set X is visualized on the three-dimensional space of the average speed, the occupancy and the travel time, for example, using "+" to represent Y in the clustering resultiWhen 1 is true, y is represented by ajWhen the sample term is 2, y is represented by "●kThe sample term corresponding to 3, where i, j, k ∈ [1, n ]]Obtaining a clustering analysis chart; determining the traffic running states corresponding to the clustering labels 1,2 and 3 in the category label set Y according to the clustering analysis chart to obtain each sample item X in the traffic flow data set XiTraffic running state category label yiThen, obtaining a sample data set gamma { (X, Y) | (X) with a traffic operation state label1,y1),(x2,y2),…,(xi,yi),…,(xn,yn) }; meanwhile, in order to eliminate the interference of different dimensions of the characteristic parameters on the classifier, a formula (3) is adopted to carry out normalization processing on the sample data set gamma with the traffic operation state label.
And step S3, using the sample data with the traffic running state label as the prior knowledge of classification analysis to generate a traffic running state classifier.
In the embodiment of the invention, a Support Vector Machine (SVM) is selected as a classifier for recognizing the traffic state, a nonlinear SVM is specifically adopted as the classifier for recognizing the traffic state, a sample data set gamma with a traffic operation state label is taken as a data set of the nonlinear SVM, then the traffic state represented by traffic flow parameters is learned by performing offline classification evaluation model training, and finally the classifier for the traffic operation state is obtained, and real-time or future traffic flow state is subjected to online classification prediction, and the specific implementation process is as follows:
step S31, constructing and solving a convex quadratic programming problem, which is shown in formula (8):
Figure RE-GDA0002822952400000071
wherein Q (alpha) is an optimal Lagrange multiplier objective function, K (x)i,xz) Representing a kernel function, n is the total number of samples of a sample data set gamma with a traffic running state label, and alphaizAll are lagrange multipliers, and C is a penalty coefficient. In the embodiment of the present invention, a penalty coefficient C is set to be 1.1, and a kernel function is a Radial Basis (RBF) function, see formula (9):
Figure RE-GDA0002822952400000072
where γ denotes a coefficient of a kernel function, and is set to 0.2 which is the reciprocal of the number of features of the input traffic flow data set X, and g denotes a kernel function width.
Solving to obtain the optimal Lagrange multiplier solution
Figure RE-GDA0002822952400000073
Step S32, calculating the optimal offset value b*
Selecting the optimal Lagrangian multiplier solution alpha in step S31*A component element of
Figure RE-GDA0002822952400000074
Satisfies the conditions
Figure RE-GDA0002822952400000075
According to
Figure RE-GDA0002822952400000076
The subscript l selects x from a sample data set gamma with a traffic running state labellAnd ylThen b is calculated according to the formula (10)*
Figure RE-GDA0002822952400000077
Step S33, solving classification decision function f (x)i):
Figure RE-GDA0002822952400000078
In the formula, f (x)i) Namely, the generated traffic operation state classifier represents the traffic operation state classification result of the ith sample item in the sample data set gamma with the traffic operation state label, and sgn () represents a sign function. And returning the signs of the parameters, namely completing the one-time two-classification problem. The traffic running state recognition system constructed in the embodiment of the invention is a three-classification problem, a support vector machine model needs to be expanded, a plurality of support vector machine classifiers are established, and the construction of a multi-class classifier is realized by combining a plurality of two classifiers.
Step S4, constructing a data set for risk running state judgment by adopting the training data set formed in the step S1, and judging the risk running state of the constructed data set for risk running state judgment based on a fuzzy comprehensive evaluation method according to different traffic running states to obtain the training data set with a risk running state label, wherein the specific implementation process is as follows:
step S41, using the training data set of step S1 to establish an evaluation target factor set P aggregated at time intervals of Δ t, where P is [ S ]V,SO,ST]In which S isVRepresenting the coefficient of variation of vehicle speed, SV=σVV represents the average vehicle speed within the time interval of delta t, sigmaVRepresenting the standard deviation of the vehicle speed in the time interval of delta t; sODenotes the coefficient of occupancy variation, SO=σOO, O represents the average occupancy in the time interval Δ t, σORepresents the standard deviation of occupancy within the Δ t time interval; sTRepresenting the coefficient of variation of the time of flight, ST=σTT, T denotes the mean travel time within the time interval Δ T, σTRepresents the standard deviation of travel time within the Δ t time interval; in the examples of the present invention,. DELTA.t was 5 minutes.
Step S42, establishing a judgment comment set F, wherein F is [ F ]1,f2,f3],f1Indicating that the critical operating state is a low-risk operating state, f2Indicating that the risky operating state is in a medium risky operating state, f3Indicating that the risk operation state is in a high risk operation state;
step S43, establishing a fuzzy relation matrix R, wherein R is a fuzzy mapping R epsilon (P → F) from the evaluation object factor set P to the evaluation criterion set F, and R is (R (S → F)V),R(SO),R(ST))TThereby inducing each judgment factor in the judgment object factor set P, namely SV、SO、STFuzzy relation matrix R (S) of membership degree of evaluation comment set FV)、R(SO)、R(ST) Wherein R (S)V)=(r11,r12,r13),r11Representing the degree of membership of the coefficient of variation of velocity to low risk, r12Representing the degree of membership of the coefficient of variation of velocity to the intermediate risk, r13Representing the degree of membership of the coefficient of variation of velocity to a high risk; r (S)O)=(r21,r22,r23),r21Representing the degree of membership of the occupancy coefficient of variation to the low risk, r22Representing the degree of membership of the occupancy coefficient of variation to the intermediate risk, r23Representing the degree of membership of the occupancy coefficient of variation to a high risk; r (S)T)=(r31,r32,r33),r31Indicating that the coefficient of travel time variation is relatively lowDegree of membership of the risk, r32Representing the degree of membership of the coefficient of travel time variation to the intermediate risk, r33Representing the degree of membership of the travel time coefficient of variation to a high risk. As shown in formula (12), trapezoidal membership functions are adopted in the embodiments of the present invention:
Figure RE-GDA0002822952400000091
wherein the content of the first and second substances,
Figure RE-GDA0002822952400000092
rμθsee the equations (19) to (27).
S44, carrying out traffic running state classification on sample items corresponding to the judgment object factor set P in the traffic flow data set X by using the traffic running state classifier generated in the step S3, and then obtaining a judgment object factor set P with a traffic running state label based on the traffic running state classification result; and then establishing a fuzzy weight matrix S (S) by utilizing the evaluation object factor set P with the traffic running state label according to different traffic running states1,s2,s3),s1Representing the influence degree of the speed variation coefficient in the fuzzy relation matrix R on the risk running state under different traffic running states, s2Representing the influence degree of occupancy rate variation coefficient in the fuzzy relation matrix R on the risk running state under different traffic running states, s3The method represents the influence degree of the travel time variation coefficient in the fuzzy relation matrix R in different traffic running states on the risk running state, and the specific implementation process is as follows:
step S441, the traffic operation state classifier generated in step S3 is used for classifying the traffic flow data set X in step S2 in the traffic operation state, and the generated traffic operation state labels are in one-to-one correspondence with the judgment object factor sets P according to the standard time characteristics to obtain the judgment object factor sets P with the traffic operation state labels; then, the entropy weight method is applied to the evaluation object factor set P with the traffic running state label to determine the evaluation object under different traffic running statesThe weights of all factors in the factor set P are normalized, and a weight matrix A is obtained by normalizing the weights of all factors in the evaluation object factor set P (A)f,Ac,Aj)TWherein A isfA weight vector representing a speed variation coefficient, an occupancy variation coefficient, and a travel time variation coefficient in a smooth flow state, AcA weight vector representing a speed variation coefficient, an occupancy variation coefficient, and a travel time variation coefficient in a congested flow state, AjA weight vector representing a velocity variation coefficient, an occupancy variation coefficient, and a travel time variation coefficient in a choked flow state; in the embodiment of the invention, A is obtained by solvingfThe weight vector a of the speed variation coefficient, occupancy variation coefficient, and travel time variation coefficient in the congested flow state is obtained in the same manner as (0.30,0.29,0.41)c(0.25,0.35,0.40), weight vector a of velocity coefficient of variation, occupancy coefficient of variation, and travel time coefficient of variation in choked flow statej=(0.2,0.28,0.52)。
Step S442, the clustering result shows that clustering unclear phenomena exist in part of data points, so that the embodiment of the invention innovatively provides the fuzzy influence vector wi=(wi1,wi2,wi3) Blurring the weight matrix A to eliminate the clustering ambiguity problem, wherein wi1Representing the ith sample item in the sample data set gamma with the traffic operation state label to the smooth flow clustering center
Figure RE-GDA0002822952400000101
Degree of membership, wi2Representing the ith sample item in the sample data set gamma with the traffic running state label to the crowd stream clustering center
Figure RE-GDA0002822952400000102
Degree of membership, wi3Representing the ith sample item in the sample data set gamma with the traffic running state label to the clustering center of the choked flow
Figure RE-GDA0002822952400000103
Is solved according to the formula (1)3):
Figure RE-GDA0002822952400000104
In the formula IijRepresenting the clustering centers of the ith sample item and different traffic states in the data set gamma
Figure RE-GDA0002822952400000105
The Euclidean distance of (c);
step S443, obtaining the fuzzy weight matrix S ═ wi×A。
Step S45, calculating a fuzzy composite value matrix B,
Figure RE-GDA0002822952400000106
wherein
Figure RE-GDA0002822952400000107
Representing generalized fuzzy operators, common fuzzy operators include M (V, V), M (V, etc.),
Figure RE-GDA0002822952400000108
The embodiment of the invention adopts
Figure RE-GDA0002822952400000109
The operator gives consideration to the action of the evaluation set in a balanced manner according to all evaluation factors, and is suitable for evaluating the integral indexes. Finally obtaining B ═ B1,b2,b3) The fuzzy composite value matrix B represents the fuzzy comprehensive evaluation result of the evaluation object, wherein B1Representing the low-risk running state F of the fuzzy comprehensive evaluation result relative to the evaluation comment set F1Degree of membership of b2Representing the running state F of the risk in the fuzzy comprehensive evaluation result relative judgment comment set F2Degree of membership of b3High-risk running state F representing relative judgment comment set F of fuzzy comprehensive evaluation result3Degree of membership of bθθ is 1,2,3, as calculated in equation (14):
Figure RE-GDA00028229524000001010
step S46, determining a fuzzy comprehensive evaluation result z, namely the risk running state category, according to the fuzzy synthetic value matrix B:
according to the embodiment of the invention, a maximum membership method is adopted to determine a fuzzy comprehensive evaluation result, namely, an element (F) of an evaluation set F corresponding to the maximum value in a fuzzy synthetic value matrix B is selected1、f2Or f3) The indicated critical operating state is used as the final result of the evaluation, i.e. z ═ fθ=max bθ;θ=1,2,3;
Step S47, based on the establishing method of the judgment object factor set P in the step S41, the training data set formed in the step S1 is adopted to construct a data set for risk operation state judgment
Figure RE-GDA00028229524000001011
Wherein, PiThe method comprises the steps that (1) the ith sample item of an ith evaluation object factor set, namely a data set omega for evaluating the risk running state is evaluated, and n is the total amount of the samples of the data set omega for evaluating the risk running state; and calculating each sample item in the data set omega for risk running state judgment according to the steps S43-S46 to obtain a risk running state label data set Z corresponding to the data set omega for risk running state judgment, wherein the risk running state label data set Z is a fuzzy comprehensive judgment result set of all sample items of the data set omega for risk running state judgment, and the risk running state label data set Z is normalized by adopting a formula (3) to finally obtain a training data set with risk running state labels
Figure RE-GDA0002822952400000111
Wherein z isiAnd (3) representing the fuzzy comprehensive judgment result of the ith sample item of the data set omega for the risk running state judgment.
In order to facilitate the use of the data as tag data for subsequent active traffic safety prediction, i.e. as data set for active traffic safety prediction, the embodiments of the present invention respectively enable low riskOperating state f 11, medium risk operating state f 22, high risk operating state f3And 3, obtaining a risk operation state label data set Z ═ { Z | Z ═ Zi=fθ=maxbθ;i=1,2,…,n,θ=1,2,3}。
Step S5, an SAE-GRU model is established, a training process is executed on the established SAE-GRU model by adopting the training data set with the risk operating state label (the training data set psi with the risk operating state label) obtained in step S4, and an optimal SAE-GRU active safety prediction model is obtained by adjusting parameters, and the specific implementation process is as follows:
and S51, building an SAE model, inputting the training data set with the risk running state label obtained in the step S4 into the SAE model to train the SAE model, processing the training data set with the risk running state label through the SAE model, extracting abstract features, namely data features in a high-risk running state layer by layer, obtaining a feature data set output by the SAE model and an output loss value after training is finished, and judging to obtain the optimal SAE model according to the output loss value.
The self-encoder (AE) encodes the input x to obtain a new feature y, and it is desired that the original input x can be reconstructed from the new feature y, as follows:
y=f(Wx+b); (15)
after the coding is linear combination, a nonlinear activation function f is added, and by using a new characteristic y, the input x can be reconstructed, namely, the decoding process:
x'=f(W'x+b'); (16)
the finally reconstructed x' and x are as consistent as possible, and a loss function minimizing negative log-likelihood can be adopted to train the model:
L=-logP(x|x'); (17)
in equations (15) to (17), W and b denote a weight and an offset at the time of encoding, and W 'and b' denote a weight and an offset at the time of decoding.
The stacked self-encoder is formed by cascading a plurality of self-encoders to complete the task of feature extraction layer by layer, and finally obtained features are representative. The training process is that n AEs are trained in sequence, after the 1 st AE training is finished, the output of the encoder is used as the input of the 2 nd AE, the finally obtained features are used as the input of a classifier, and the final classification training is finished.
The embodiment of the invention processes the traffic data set based on SAE, extracts the data characteristics under high risk layer by using the stacked self-encoder, reduces the data dimension and provides a low-dimensional and high-value data set for the safety prediction of a GRU model at the next stage. The SAE consists of a multilayer sparse self-encoder, a Softmax classifier and a multilayer AE, and the training steps and the process are as follows:
step S511, environment definition: define the auto-encoder, define the output layer using the Softmax classifier, define the loss and optimizer. Then, performing parameter initialization and creating a coordinator, and traversing two self-encoders to be applied by using a Softmax classifier in a fine tuning step;
step S512, taking the data set after the traffic state recognition as the input of SAE, executing the training of the first self-encoder, and taking the result after the training as the characteristic output of the first self-encoder;
step S513, taking the characteristic output of the upper network as the input of the lower network, and repeating the training according to the step S411; circulating all batches, performing adaptation training by using batch processing data, and calculating average loss;
step S514, using the characteristic output in the step S413 as an input of a Softmax classifier, and training the Softmax classifier by combining an initial data set;
step S515, repeating the steps S412 to S414, calculating the cost value of each Epoch, and storing the cost values;
and step S516, obtaining the characteristic output of the data set after all the data sets are trained, and finishing SAE training.
The SAE designed by the embodiment of the invention consists of a main function (SAE test) and three side functions (load data, init and Autoencoder). SAE test is used for training process and extracting characteristic data, load data is used for loading a data set based on traffic state identification, init is used for loading initial parameters, Autoencoder is used for loading self-encoder, and a main function is usedThe number calls 3 side functions. In order to debug the SAEs used for active security prediction, the parameters of the existing SAEs need to be finely adjusted. According to the embodiment of the invention, the Batch size and the Learning rate are selected as the parameter adjusting objects, the Batch size influences the SAE training degree by adjusting the number of data sets captured by each training, and the value of the Batch size is generally 2nN is 5,6,7, 8; the Learning rate affects the Learning rate of each iteration and provides valid information for the next iteration. Therefore, in the embodiment of the present invention, the parameter value taking table shown in table 1 is set for testing, and the test precision before each fine adjustment is used as the judgment index of the training quality.
TABLE 1 Batch size and Learning rate parameter evaluation and training results Table
Figure RE-GDA0002822952400000131
In the first stage test, the embodiment of the present invention controls the Batch size to be 128, and sets the spare rate to be in a gradient distribution with the value of 0.001/0.003/0.005/0.007/0.009, so as to obtain the first stage test result in table 1, and the accuracy is highest when the Batch size is 128 and the spare rate is 0.007 in the five experiments, and reaches 0.9149. Based on the first stage test results, the embodiment of the present invention controls the Learning rate to be 0.007. The Batch size is set to be distributed in a gradient mode, and the value is 32/64/128/256, so that the second stage test result of the table 1 is obtained, and the precision is the highest when the Learning rate is 0.007 and the Batch size is 64 in the four groups of experiments, and the precision reaches 0.9487. Therefore, based on the above two-stage test results, SAE with Learning rate of 0.007 and Batch size of 64 was selected as the experimental model.
Step S52, building a GRU model based on TensorFlow, designing a loss function and an optimizer of a GRU neural network, dividing a characteristic data set output by the optimal SAE model into a training set and a prediction set, executing a training process on the GRU model by using the training set, testing the GRU model by using the prediction set after each polling for a certain training number of times, comparing a prediction result with an actual result to calculate loss, adjusting a hyper-parameter according to the loss to obtain the optimal GRU model, and obtaining risk operation state output in the next stage, namely the next delta t time interval after the training is finished, so that the optimal SAE-GRU active safety prediction model is obtained.
Firstly, the high-risk characteristic state h transmitted at the last momentt-1And the input of the current node, namely the characteristic data x with the risk operation state labeltTo obtain two gating states. As shown in fig. 6, where r is reset gate, z is update gate, and σ is Sigmoid function to serve as the gate signal. After receiving the gating signal, the reset gating r is used to obtain the reset state ht-1'=ht-1r, followed by ht-1'And input xtSplicing, zooming the data to an interval (-1,1) through an activation function tanh (), judging and obtaining a data set with a risk operation state label according to a traffic operation state as shown in the principle of fig. 7, then extracting a high-risk characteristic data set of the data set with the risk operation state label through an SAE model, and finally predicting the risk operation state through the GRU model by utilizing the characteristic data set with the risk operation state label and realizing visualization through images. Finally, the GRU executes an update memory phase, in which two steps of forgetting and memorizing are performed simultaneously, based on the previous update gate z (update gate), the update formula is as follows:
ht=(1-z)ht-1+zht; (18)
as can be seen from the formula (18), the range of the value of the update gate z is 0-1, the closer the value of the update gate z is to 1, the more data is stored in memory, and the closer the value of the update gate z is to 0, the more data is forgotten; h istRepresenting the high risk characteristic state at the present moment. Thus, based on the above interpretation of the GRU mechanism, a final GRU run graph is obtained, as shown in fig. 8. Then, the optimal prediction effect of active safety is realized through parameter adjustment: train hours influence the effect of model training by controlling the proportion of a training set and a test set; epochs and Batch size represent the cadence of the entire training process. The settings of the three parameters are shown in table 2.
The analysis process is divided into two stages:
step S521: controlling the influence of different combinations of Epochs and Batch size on the prediction result under the same Train hours;
step S522: after determining the combination of the optimal Epochs and Batch size, researching the prediction results under different Train hours, and finally determining the optimal GRU model.
In step S521, comparison is performed according to groups 1 to 5,6 to 10, and 11 to 15, and the prediction accuracy and the RMSE are used as evaluation bases, so as to obtain FIGS. 9(a) to 9(c), respectively. First, the prediction accuracy is relatively small in the same Train hours and different combinations of Epochs and Batch sizes, and the prediction accuracy is greatly different in the different Train hours. Wherein the prediction accuracy of the set 1/6/11 is optimal; secondly, RMSE gradually shows the advantages and disadvantages of different groups along with the deepening of the training degree. Of these groups 1/6/11, all were the best combination and were clearly superior to the control group. Therefore, based on the analysis of fig. 9, it can be seen that the SAE-GRU model proposed in the embodiment of the present invention has higher prediction accuracy, where the group 1/6/11 predicts the best effect, i.e. the model predicts the best effect in the cases of Epochs 1960 and Batch size 5.
In step S522, the accuracy of prediction, the prediction time, and the RMSE of the group 1/6/11 are compared after the fact that the epoch is 1960 and the Batch size is 5 are determined in step S521. Thus, Table 3 was obtained. As can be seen from table 3, the best value of the set 11 is found in the prediction accuracy, prediction time and RMSE, which proves the efficiency and accuracy of the set 11. Therefore, based on the above analysis, it can be determined that group 11 is the best choice among all groups, i.e., when Train hours is 6384, Epochs is 1960, and Batch size is 5, the prediction accuracy and stability are the best.
TABLE 2 parameter settings for Train hours/Epochs/Batch size
Figure RE-GDA0002822952400000141
TABLE 3 prediction accuracy, prediction time and RMSE for the experimental group 1/6/11
Group of Prediction accuracy (%) Time(s) RMSE
1 76.204 38054 0.031284
6 89.788 36466 0.014847
11 95.157 32265 0.005689
And (3) state recognition result and analysis:
traffic flow data used for the experiments were derived from the data set of the U.S. california department of transportation (PeMS). The study road segment is a city backbone located in the county of los angeles, and the study period is a weekday of 1 month and 1 day to 2 months and 28 days in 2020. According to the embodiment of the invention, after removing holiday data in a research time period, experimental analysis is carried out, firstly, preprocessing is carried out by utilizing the steps S1 and S2, then, sample data is fitted by taking 5min as time granularity, and meanwhile, data of 0-5 points in the morning every day is removed, and finally, 9804 pieces of experimental data in the total of 43 working days in the research time period can be obtained, as shown in Table 4.
TABLE 4 partial traffic flow parameter data for urban arterial road
Figure RE-GDA0002822952400000151
(1) Traffic state identification
And (3) performing cluster analysis on the historical traffic data of the target road section by using the steps 11 to 14, determining the category of each sample data according to the membership degree of each cluster center, automatically dividing the sample data into the category with the maximum membership degree, dividing 9804 samples into 3 categories according to the clustering result, respectively representing the 3 categories by using different symbols, and performing visual display by taking data of a certain day as an example, as shown in fig. 10. As can be determined from fig. 10, the data of the + symbol part has a lower travel time and a higher vehicle speed as a whole, and can be determined as the clear state "1", the data of the a-solidup symbol part is determined as the crowded state "2", the travel time of the ● part is the highest as a whole, and is determined as the blocked state "3", and finally, the traffic running state corresponding to the historical data for training the support vector machine classifier is obtained. On the basis of an experiment of fuzzy C-means clustering state identification, the steps S21-S23 are utilized to carry out off-line training on historical traffic flow data to obtain an on-line real-time traffic state classifier. The data of 30 days before is used as a training set, the data of 13 days after is used as a test set, common machine learning algorithms such as a Decision Tree (DT), a Gradient Boosting Decision Tree (GBDT), a Random Forest (RF), K Nearest Neighbor (KNN), Logistic Regression (LR) and naive Bayes (GNB) are selected to perform prediction performance comparison, and finally, the result analysis of the model real-time traffic state recognition is obtained, as shown in Table 5.
TABLE 5 comparison of traffic state prediction performance by different methods
Algorithm SVM LR RF DT GBDT KNN GNB
Accuracy (%) 98.92 98.04 96.06 92.93 95.87 94.63 91.33
As can be seen from Table 5, the prediction accuracy of the SVM classifier is as high as 98.92%, which is superior to other 6 machine learning classifiers. Therefore, the method can be used for real-time traffic state classification prediction, the experimental result meets the actual traffic condition, and the online traffic state identification of the urban main road can be realized.
(2) Risk running state discriminant analysis
In the embodiment of the invention, the traffic parameter data detected by two detectors at the same section position of 1 month and 1 day on the major street of los Angeles sunset are taken as an example for statistical analysis, and the sigma is respectively calculatedVTO,SV,ST,SOThe variation relationship between them is shown in FIGS. 2 to 4. The coefficient of variation is the coefficient for measuring the difference of the change range of the index and is formed by the standard deviationThe average value is determined, and the dispersion degree of the index can be objectively reflected. The smaller the variation coefficient is, the smaller the discrete degree is, and the smaller the risk is; conversely, the greater the risk. It can be seen from fig. 2 that, under the condition that the average speed fluctuates sharply, the coefficient of variation is at a higher value, and fig. 3 and 4 also show that the occupancy and the travel time have the same phenomenon. In order to establish the relationship between each coefficient of variation and the road risk operation state, the cumulative frequency distribution maps of the speed coefficient of variation, the occupancy coefficient of variation, and the travel time coefficient of variation in different intervals are further generated, as shown in fig. 5(a) to 5 (c). According to the change curve rule graphs of the indexes in the graphs 2-4 and the coefficient distribution graphs in the graphs 5(a) -5 (c), a risk degree grading table of various kinds of variation coefficients can be finally constructed, so that the membership degrees of various kinds of variation coefficients respectively corresponding to three traffic jam states of smooth, crowded and blocked are determined, and the table 6 is referred.
Table 6 grading table for different risk grades corresponding to each variation coefficient index
Rating of evaluation Low risk Low-medium critical Middle risk Medium-high critical High risk
Coefficient of variation of velocity [0,0.36] (0.36,0.4) [0.4,0.48] (0.48,0.52) ≥0.52
Coefficient of occupancy variation [0,0.55] (0.55,0.6) [0.6,0.7] (0.7,0.75) ≥0.75
Coefficient of travel time variation [0,0.366] (0.366,0.4) [0.4,0.5] (0.5,0.566) ≥0.566
On the basis of the identification of the traffic running state, membership function formulas of the variation coefficients in low-risk, medium-risk and high-risk running states are respectively constructed on the basis of a risk grade classification table, and see formulas (19) to (27).
Figure RE-GDA0002822952400000161
Figure RE-GDA0002822952400000171
Figure RE-GDA0002822952400000172
Figure RE-GDA0002822952400000173
Figure RE-GDA0002822952400000174
Figure RE-GDA0002822952400000175
Figure RE-GDA0002822952400000176
Figure RE-GDA0002822952400000177
Figure RE-GDA0002822952400000178
Wherein, the expressions (19) to (21) respectively represent membership functions of the velocity variation coefficient relative to low risk, medium risk and high risk; expressions (22) to (24) represent membership functions of the occupancy coefficient of variation with respect to low risk, medium risk, and high risk, respectively; equations (25) to (27) represent membership functions of the travel time variation coefficient with respect to low risk, medium risk, and high risk, respectively. The training data set for active traffic safety prediction can be obtained through the steps S41-S47.
(3) Active safety prediction method analysis
To evaluate the superiority of SAE-GRU for active safety prediction, the prediction accuracy and prediction efficiency will be analyzed based on the same traffic data and prediction environment. In the prediction accuracy analysis, the accuracy rates of GRU, LSTM, CNN-LSTM, SVM, KNN and random forest are obtained through experimental reproduction and compared with SAE-GRU. Wherein GRU is used as the same attribute comparison object, and the other methods are used as different attribute comparison objects; in the analysis of the prediction efficiency, the prediction time of SAE-GRU, LSTM, CNN-LSTM, MAE, RMSE were analyzed as comparison parameters.
And (6) analyzing the prediction precision. The existing prediction method gradually conforms to the time series characteristics of traffic data, so that the prediction result is more objective. The comparison prediction method selected by the embodiment of the invention has certain representativeness and makes certain research progress in the field of traffic prediction. Therefore, it would be instructive to compare SAE-GRU with the above-described method. The predicted results are detailed in table 7.
TABLE 7 active safety prediction accuracy under different methods
SAE-GRU GRU LSTM CNN-LSTM SVM KNN Random forest
Precision ratio (%) 95.157 89.584 86.756 90.757 87.066 89.772 91.340
As can be seen from Table 7, SAE-GRU achieved 95.157% accuracy and the lowest LSTM was only 86.756% under the same conditions and predicted by different methods. Meanwhile, compared with the GRU with the same attribute, the SAE-GRU precision is obviously improved. The prediction precision of SAE-GRU is better than that of LSTM with complex structure and optimization model CNN-LSTM thereof. Based on the comparative analysis, the active safety prediction method provided by the embodiment of the invention can meet the safety prediction requirement and is superior to other representative methods.
And (3) analyzing the prediction efficiency: in order to accurately evaluate the application effects of various prediction methods, the embodiment of the invention analyzes the efficiency of SAE-GRU, LSTM and CNN-LSTM. The time consumption is used as a core efficiency evaluation index, and the MAE and the RMSE are used as result evaluation indexes. Based on the evaluation requirements, the comprehensive efficiency evaluation of each prediction method can be realized.
As can be seen from FIG. 11, the SAE-GRU prediction time is significantly lower than the GRU prediction time under the same environment, and the reduction reaches 10.45%. Comparing SAE-GRU with LSTM and CNN-LSTM, the predicted time is only 85.25% and 88.81%. Thus, FIG. 11 illustrates the superior prediction capabilities of SAE-GRU. In FIG. 12, MAE represents the mean of absolute error, RMSE represents the square root of the mean of the squared differences between predicted and actual observations, and both MAE and RMSE for SAE-GRU are the lowest values among all methods, and its MAE is significantly lower than for the other methods. Namely, the efficiency of SAE-GRU is confirmed.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (10)

1. The active safety prediction method based on the big data technology and SAE-GRU is characterized by comprising the following steps:
step S1, obtaining an original data set, wherein the original data set comprises a plurality of single data sets, each single data set comprises 5 types of feature data including average vehicle speed, average acceleration, average occupancy, average queuing time and average travel time in a delta t time interval and 1 type of conventional data including average standard time, and the original data set is preprocessed to form a training data set;
s2, performing dynamic traffic running state recognition based on cluster analysis by using the training data set obtained in the step S1 to obtain a sample data set with a traffic running state label;
s3, using the sample data set with the traffic running state label as the prior knowledge of classification analysis to generate a traffic running state classifier;
s4, constructing a data set for risk running state judgment by adopting the training data set formed in the step S1, and judging the risk running state of the constructed data set for risk running state judgment according to different traffic running states based on a fuzzy comprehensive evaluation method to obtain a training data set with a risk running state label;
step S5, an SAE-GRU model is established, a training process is carried out on the established SAE-GRU model by adopting the training data set with the risk running state label obtained in the step S4, and an optimal SAE-GRU active safety prediction model is obtained through parameter adjustment;
and step S6, performing active safety prediction on the main trunk by using the obtained optimal SAE-GRU active safety prediction model, and predicting to obtain the risk running state in the next stage, namely the next delta t time interval.
2. The active security prediction method based on big data technology and SAE-GRU as claimed in claim 1, wherein the step S6 is implemented as follows:
firstly, preprocessing a real-time traffic flow data set of a predicted place, then classifying the traffic running state of the preprocessed real-time traffic flow data set of the predicted place by using a generated traffic running state classifier, judging the risk running state by using a step S4 based on the classification result of the traffic running state to obtain the real-time traffic flow data set with a risk running state label, then taking the real-time traffic flow data set with the risk running state label as input data of an SAE-GRU active safety prediction model to perform active safety prediction of a trunk road, and predicting to obtain the risk running state in the next stage, namely the next delta t time interval.
3. The active safety prediction method based on big data technology and SAE-GRU as claimed in claim 2, wherein the step S1 is implemented by preprocessing the original data set, and the step S6 is implemented by preprocessing the real-time traffic flow data set at the predicted location as follows:
step 1, data cleaning and filling: calculating the missing rate of each single data set in the sample data set, wherein the missing rate of the single data set is the missing number of the data of the single data set/the total number of the data of the single data set, and deleting the single data set when the missing rate of the single data set is more than or equal to 80%; when the single data set loss rate is less than 80%, filling the non-characteristic data in the single data set by adopting statistics, and filling the characteristic data in the single data set by a Lagrange interpolation method of formulas (1) to (2):
Figure FDA0002747590180000021
Figure FDA0002747590180000022
wherein L (x) represents a missing value to be determined, lj(x) As interpolation basis functions, xjDenotes the j-th position point, xiDenotes dividing by xjOther position points than the above, x representing the position point of the missing value to be found, yjDenotes xjThe value of the position point, k represents the number of the given value points;
step 2, feature transformation: performing feature transformation on each numerical value of each type of feature data in the sample data set after data cleaning and filling are completed by adopting a maximum-minimum normalization method shown in formula (3):
Figure FDA0002747590180000023
wherein, ynewRepresenting the value after feature transformation, y representing the value before feature transformation, yminRepresents the minimum value, y, of each type of feature data in the sample data setmaxAnd representing the maximum value of each type of feature data in a sample data set, wherein the sample data set refers to an original data set or a real-time traffic flow data set of a forecast place.
4. The active safety prediction method based on big data technology and SAE-GRU according to claim 1, wherein the step S2 is to identify the dynamic traffic operation state based on a fuzzy C-means clustering algorithm, and obtain a sample data set with a traffic operation state label, and the specific implementation process is as follows:
step S21, constructing a traffic flow data set X ═ X1,x2,…,xi,…xn]TAnd making each sample item X in the traffic flow data set Xi=(xi1,xi2,xi3,xi4,xi5) Each sample item xiEach composed of traffic flow attribute parameters and vehicle operating parameters, wherein xi1、xi2、xi3、xi4、xi5Respectively and correspondingly representing the average speed, the average occupancy, the average acceleration, the average delay time and the average travel time in a delta t time interval in a one-to-one manner, wherein the average speed and the average occupancy in the delta t time interval are traffic flow attribute parameters, and the average acceleration, the average delay time and the average travel time in the delta t time interval are vehicle operation parameters; dividing all sample items in a traffic flow data set X into k classes C ═ C1,c2,…,ckEach category represents a traffic state, k is 3, and the three categories are divided into three categoriesRespectively representing three traffic states of smooth, crowded and blocked; and constructing a clustering loss function according to equation (4):
Figure FDA0002747590180000031
wherein J (U, X, C) represents a clustering loss function, XiRepresenting the ith sample item in the traffic flow data set X, wherein n is the total number of samples in the traffic flow data set X; k is the number of clusters, CjRepresents the jth class cjThe fuzzy clustering center of (1); u represents a membership matrix, and U is { U ═ U }ij},uijRepresents a sample xiFor class cjDegree of membership of; m is a fuzzy weighting index; ξ represents the cluster loss function constraint space, which is defined as equation (5):
Figure FDA0002747590180000032
step S22, initializing a membership degree matrix U, and calculating and updating the jth category c according to formulas (6) to (7)jFuzzy clustering center C ofjAnd membership matrix U:
Figure FDA0002747590180000033
Figure FDA0002747590180000034
in the formula (d)ijRepresenting a sample item xiAnd the jth class cjFuzzy clustering center C ofjEuclidean distance of dihRepresenting a sample item xiAnd h category chFuzzy clustering center C ofhThe Euclidean distance of (c);
step S23, comparing the membership degree matrix U of the lambda +1 iterationλ+1And a membership matrix U of the lambda iterationλIf Uλ+1-Uλ| ≦ epsilon or λ ═ λmaxStopping iteration and outputting the current fuzzy clustering center CjAnd each sample item X in the traffic flow data set XiFor the current fuzzy clustering center CjJ is more than or equal to 1 and less than or equal to 3; otherwise, returning to the step S22, and continuing the iteration, wherein lambda is the iteration number, lambdamaxIs the maximum iteration number, and epsilon is an iteration termination threshold;
step S24, fuzzy clustering center C output according to step S23jAnd each sample item X in the traffic flow data set XiFor the current fuzzy clustering center CjJ is more than or equal to 1 and less than or equal to 3, and a traffic running state class label set Y corresponding to the traffic flow data set X is determined by the maximum membership method, wherein Y is { Y | Y ═ Y%iτ; i is 1,2, …, n, τ is 1,2,3, yiFor each sample item X in the traffic flow data set XiThe traffic running state class labels 1,2 and 3 are the fuzzy clustering centers C1~C3Clustering labels in one-to-one correspondence;
s25, visualizing the traffic flow data set X on a three-dimensional space of average vehicle speed, occupancy and travel time according to the traffic running state category label set Y obtained in the step S24 to obtain a cluster analysis chart; determining the traffic running states corresponding to the clustering labels 1,2 and 3 in the category label set Y according to the clustering analysis chart to obtain each sample item X in the traffic flow data set XiTraffic running state category label yiThen, obtaining a sample data set gamma { (X, Y) | (X) with a traffic operation state label1,y1),(x2,y2),…,(xi,yi),…,(xn,yn) }; meanwhile, normalization processing is carried out on the sample data set gamma with the traffic running state label.
5. The active security prediction method based on big data technology and SAE-GRU as claimed in claim 4, wherein the step S3 is implemented as follows:
step S31, constructing and solving a convex quadratic programming problem, which is shown in formula (8):
Figure FDA0002747590180000041
in the formula, Q (alpha) is an optimal Lagrange multiplier objective function, n is the total number of samples of a sample data set gamma with a traffic operation state label, and alphaizAll are lagrange multipliers, and C is a penalty coefficient; k (x)i,xz) Representing a kernel function;
solving to obtain the optimal Lagrange multiplier solution
Figure FDA0002747590180000042
Step S32, calculating the optimal offset value b*
Selecting the optimal Lagrangian multiplier solution alpha in step S31*A component element of
Figure FDA0002747590180000043
Satisfies the conditions
Figure FDA0002747590180000044
According to
Figure FDA0002747590180000045
The subscript l selects x from a sample data set gamma with a traffic running state labellAnd ylThen b is calculated according to the formula (10)*
Figure FDA0002747590180000046
Step S33, solving classification decision function f (x)i):
Figure FDA0002747590180000051
In the formula, f (x)i) Namely, the generated traffic operation state classifier represents the traffic operation state classification result of the ith sample item in the sample data set gamma with the traffic operation state label, and sgn () represents a sign function.
6. The active security prediction method based on big data technology and SAE-GRU according to any of claims 1-5, wherein the step S4 is implemented as follows:
step S41, using the training data set of step S1 to establish an evaluation target factor set P aggregated at time intervals of Δ t, where P is [ S ]V,SO,ST]In which S isVRepresenting the coefficient of variation of vehicle speed, SV=σVV represents the average vehicle speed within the time interval of delta t, sigmaVRepresenting the standard deviation of the vehicle speed in the time interval of delta t; sODenotes the coefficient of occupancy variation, SO=σOO, O represents the average occupancy in the time interval Δ t, σORepresents the standard deviation of occupancy within the Δ t time interval; sTRepresenting the coefficient of variation of the time of flight, ST=σTT, T denotes the mean travel time within the time interval Δ T, σTRepresents the standard deviation of travel time within the Δ t time interval;
step S42, establishing a judgment comment set F, wherein F is [ F ]1,f2,f3],f1Indicating a low risk operating condition, f2Indicating an impending operational state of stroke, f3Indicating a high risk operating condition;
step S43, establishing a fuzzy relation matrix R, as shown in formula (12):
Figure FDA0002747590180000052
wherein R (S)V) Represents the coefficient of variation S of vehicle speedVFuzzy relation matrix of membership degree of evaluation item set F, R (S)O) Represents the occupancy coefficient of variation SOFuzzy relation matrix of membership degree of evaluation item set F, R (S)T) Representing the coefficient of variation S of the travel timeTA fuzzy relation matrix of the membership degree of the evaluation comment set F; r is11Representing the coefficient of variation of speed with respect to a low risk operating condition f1Degree of membership of r12Representing the coefficient of variation of velocity versus the medium risk operating condition f2Degree of membership of r13Representing the coefficient of variation of velocity with respect to a high risk operating condition f3Degree of membership of; r is21Representing the occupancy coefficient of variation versus a low risk operating condition f1Degree of membership of r22Representing occupancy coefficient of variation versus an at-risk operating condition f2Degree of membership of r23Representing the occupancy coefficient of variation versus the high-risk operating condition f3Degree of membership of; r is31Representing the coefficient of travel time variation with respect to a low risk operating condition f1Degree of membership of r32Representing the travel time coefficient of variation versus the medium risk operating condition f2Degree of membership of r33Representing the coefficient of travel time variation with respect to a high risk operating condition f3Degree of membership of;
Figure FDA0002747590180000061
s44, carrying out traffic running state classification on sample items corresponding to the judgment object factor set P in the traffic flow data set X by using the traffic running state classifier generated in the step S3, and then obtaining a judgment object factor set P with a traffic running state label based on the traffic running state classification result; and then establishing a fuzzy weight matrix S (S) by utilizing the evaluation object factor set P with the traffic running state label according to different traffic running states1,s2,s3),s1Representing the influence degree of the speed variation coefficient in the fuzzy relation matrix R on the risk running state under different traffic running states, s2Representing the influence degree of occupancy rate variation coefficient in the fuzzy relation matrix R on the risk running state under different traffic running states, s3Representing the influence degree of the travel time variation coefficient in the fuzzy relation matrix R on the risk running state under different traffic running states;
step S45, calculating a fuzzy composite value matrix B,
Figure FDA0002747590180000062
step S46, determining a fuzzy comprehensive evaluation result z, namely a risk running state category, by using a maximum membership method according to a fuzzy synthetic value matrix B;
step S47, based on the establishing method of the judgment object factor set P in the step S41, the training data set formed in the step S1 is adopted to construct a data set for risk operation state judgment
Figure FDA0002747590180000063
Wherein, PiThe method comprises the steps that (1) the ith sample item of an ith evaluation object factor set, namely a data set omega for evaluating the risk running state is evaluated, and n is the total amount of the samples of the data set omega for evaluating the risk running state; and calculating each sample item in the data set omega for risk running state judgment according to the steps S43-S46 to obtain a risk running state label data set Z corresponding to the data set omega for risk running state judgment, wherein the risk running state label data set Z is a fuzzy comprehensive judgment result set of all sample items of the data set omega for risk running state judgment, and the risk running state label data set Z is subjected to normalization processing to finally obtain a training data set with risk running state labels
Figure FDA0002747590180000064
Wherein z isiAnd (3) representing the fuzzy comprehensive judgment result of the ith sample item of the data set omega for the risk running state judgment.
7. The active security prediction method based on big data technology and SAE-GRU as claimed in claim 6, wherein R in the fuzzy relation matrix RμθCalculated according to equations (19) to (27):
Figure FDA0002747590180000065
Figure FDA0002747590180000071
Figure FDA0002747590180000072
Figure FDA0002747590180000073
Figure FDA0002747590180000074
Figure FDA0002747590180000075
Figure FDA0002747590180000076
Figure FDA0002747590180000077
Figure FDA0002747590180000078
8. the active security prediction method based on big data technology and SAE-GRU according to claim 6, wherein the step S44 is implemented as follows:
step S441, the pair of traffic operation state classifiers generated in step S3Classifying traffic running states of the traffic flow data set X in the step S2, and corresponding the generated traffic running state labels and the judgment object factor sets P one by one according to standard time characteristics to obtain judgment object factor sets P with the traffic running state labels; then, the weight of each factor in the evaluation object factor set P under different traffic running states is determined by applying an entropy weight method to the evaluation object factor set P with the traffic running state label, and the weight of each factor in the evaluation object factor set P is normalized to obtain a weight matrix A ═ A (A ═ A-f,Ac,Aj)TWherein A isfA weight vector representing a speed variation coefficient, an occupancy variation coefficient, and a travel time variation coefficient in a smooth flow state, AcA weight vector representing a speed variation coefficient, an occupancy variation coefficient, and a travel time variation coefficient in a congested flow state, AjA weight vector representing a velocity variation coefficient, an occupancy variation coefficient, and a travel time variation coefficient in a choked flow state;
step S442 of obtaining a fuzzy influence vector wi,wi=(wi1,wi2,wi3) Wherein w isi1Representing the ith sample item in the sample data set gamma with the traffic operation state label to the smooth flow clustering center
Figure FDA0002747590180000081
Degree of membership, wi2Representing the ith sample item in the sample data set gamma with the traffic running state label to the crowd stream clustering center
Figure FDA0002747590180000082
Degree of membership, wi3Representing the ith sample item in the sample data set gamma with the traffic running state label to the clustering center of the choked flow
Figure FDA0002747590180000083
Is solved by the following formula (13):
Figure FDA0002747590180000084
in the formula IijRepresenting the clustering centers of the ith sample item and different traffic states in the data set gamma
Figure FDA0002747590180000085
The Euclidean distance of (c);
step S443, obtaining the fuzzy weight matrix S ═ wi×A。
9. The active safety prediction method based on big data technology and SAE-GRU as claimed in claim 6, wherein the fuzzy synthetic value matrix B ═ B (B)1,b2,b3) Wherein, b1Representing the low-risk running state F of the fuzzy comprehensive evaluation result relative to the evaluation comment set F1Degree of membership of b2Representing the running state F of the risk in the fuzzy comprehensive evaluation result relative judgment comment set F2Degree of membership of b3High-risk running state F representing relative judgment comment set F of fuzzy comprehensive evaluation result3Degree of membership of, degree of membership bθThe calculation of θ is given in formula (14):
Figure FDA0002747590180000086
the critical operating state tag dataset Z ═ { Z | Zi=fθ=max bθ;i=1,2,…,n,θ=1,2,3}。
10. The active security prediction method based on big data technology and SAE-GRU according to any one of claims 1 to 5 or 7 to 9, wherein the step S5 is implemented as follows:
step S51, building an SAE model, inputting the training data set with the risk running state label obtained in the step S4 into the SAE model to train the SAE model, processing the training data set with the risk running state label through the SAE model, extracting abstract features, namely data features in a high-risk running state layer by layer, obtaining a feature data set output by the SAE model and an output loss value after training is finished, and judging to obtain an optimal SAE model according to the output loss value;
step S52, building a GRU model based on TensorFlow, designing a loss function and an optimizer of a GRU neural network, dividing a characteristic data set output by the optimal SAE model into a training set and a prediction set, executing a training process on the GRU model by using the training set, testing the GRU model by using the prediction set after each polling for a certain training number of times, comparing a prediction result with an actual result to calculate loss, adjusting a hyper-parameter according to the loss to obtain the optimal GRU model, and obtaining risk operation state output in the next stage, namely the next delta t time interval after the training is finished, so that the optimal SAE-GRU active safety prediction model is obtained.
CN202011172029.XA 2020-10-28 2020-10-28 Active safety prediction method based on big data technology and SAE-GRU Active CN112270355B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011172029.XA CN112270355B (en) 2020-10-28 2020-10-28 Active safety prediction method based on big data technology and SAE-GRU

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011172029.XA CN112270355B (en) 2020-10-28 2020-10-28 Active safety prediction method based on big data technology and SAE-GRU

Publications (2)

Publication Number Publication Date
CN112270355A true CN112270355A (en) 2021-01-26
CN112270355B CN112270355B (en) 2023-12-05

Family

ID=74344417

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011172029.XA Active CN112270355B (en) 2020-10-28 2020-10-28 Active safety prediction method based on big data technology and SAE-GRU

Country Status (1)

Country Link
CN (1) CN112270355B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112561215A (en) * 2021-02-24 2021-03-26 江苏铨铨信息科技有限公司 Atmospheric haze prediction method based on SAE-LSTM model
CN113011440A (en) * 2021-03-19 2021-06-22 中联煤层气有限责任公司 Coal bed gas well field monitoring heavy identification technology
CN113256075A (en) * 2021-04-29 2021-08-13 浙江非线数联科技股份有限公司 Enterprise risk level evaluation method based on hierarchical analysis and fuzzy comprehensive evaluation method
CN113313417A (en) * 2021-06-23 2021-08-27 北京鼎泰智源科技有限公司 Complaint risk signal grading method and device based on decision tree model
CN113569950A (en) * 2021-07-28 2021-10-29 大唐环境产业集团股份有限公司 Power station equipment fault monitoring model generation method, system and device
CN115116212A (en) * 2022-05-06 2022-09-27 浙江科技学院 Traffic prediction method for road network, computer device, storage medium and program product
CN116993167A (en) * 2023-09-27 2023-11-03 交通运输部水运科学研究所 Real-time risk judging method and system in production process

Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100057773A1 (en) * 2008-08-29 2010-03-04 Prodip Hore Fuzzy tagging method and apparatus
CN103761876A (en) * 2014-01-10 2014-04-30 山东大学 Vehicle-road cooperation-based method for traffic information collection and status evaluation
US20150294246A1 (en) * 2014-04-10 2015-10-15 International Business Machines Corporation Selecting optimal training data set for service contract prediction
US20160145994A1 (en) * 2014-11-20 2016-05-26 Petrochina Company Limited Evaluation Method and Evaluation Device for Water Breakthrough Risk of Production Wells in Aquifer Drive Gas Reservoirs
CN106056308A (en) * 2016-06-13 2016-10-26 宁波工程学院 Highway tunnel operation environment safety risk automatic judgment method
CN106228499A (en) * 2016-07-06 2016-12-14 东南大学 A kind of cargo security evaluation model based on people's bus or train route goods multi-risk System source
CN107330458A (en) * 2017-06-27 2017-11-07 常州信息职业技术学院 A kind of fuzzy C-means clustering method of minimum variance clustering of optimizing initial centers
CN109308803A (en) * 2018-07-31 2019-02-05 北京航空航天大学 Path forms time reliability analysis based on Stochastic Volatility Model
CN110070713A (en) * 2019-04-15 2019-07-30 浙江工业大学 A kind of traffic flow forecasting method based on two-way nested-grid ocean LSTM neural network
US20190258878A1 (en) * 2018-02-18 2019-08-22 Nvidia Corporation Object detection and detection confidence suitable for autonomous driving
US20190354810A1 (en) * 2018-05-21 2019-11-21 Astound Ai, Inc. Active learning to reduce noise in labels
CN110599766A (en) * 2019-08-22 2019-12-20 浙江工业大学 Road traffic jam propagation prediction method based on SAE-LSTM-SAD
CN110796859A (en) * 2019-10-28 2020-02-14 长安大学 Real-time traffic state identification and accident risk early warning method based on traffic flow
CN110851768A (en) * 2019-11-13 2020-02-28 四川大学 Multilayer fuzzy evaluation method for barrier lake
CN110910641A (en) * 2019-12-02 2020-03-24 北京航空航天大学 Neural network-based real-time risk assessment method for highway traffic operation
CN111063194A (en) * 2020-01-13 2020-04-24 兰州理工大学 Traffic flow prediction method
CN111210633A (en) * 2020-02-09 2020-05-29 北京工业大学 Short-term traffic flow prediction method based on deep learning
CN111310787A (en) * 2020-01-15 2020-06-19 江苏大学 Brain function network multi-core fuzzy clustering method based on stacked encoder

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100057773A1 (en) * 2008-08-29 2010-03-04 Prodip Hore Fuzzy tagging method and apparatus
CN103761876A (en) * 2014-01-10 2014-04-30 山东大学 Vehicle-road cooperation-based method for traffic information collection and status evaluation
US20150294246A1 (en) * 2014-04-10 2015-10-15 International Business Machines Corporation Selecting optimal training data set for service contract prediction
US20160145994A1 (en) * 2014-11-20 2016-05-26 Petrochina Company Limited Evaluation Method and Evaluation Device for Water Breakthrough Risk of Production Wells in Aquifer Drive Gas Reservoirs
CN106056308A (en) * 2016-06-13 2016-10-26 宁波工程学院 Highway tunnel operation environment safety risk automatic judgment method
CN106228499A (en) * 2016-07-06 2016-12-14 东南大学 A kind of cargo security evaluation model based on people's bus or train route goods multi-risk System source
CN107330458A (en) * 2017-06-27 2017-11-07 常州信息职业技术学院 A kind of fuzzy C-means clustering method of minimum variance clustering of optimizing initial centers
US20190258878A1 (en) * 2018-02-18 2019-08-22 Nvidia Corporation Object detection and detection confidence suitable for autonomous driving
US20190354810A1 (en) * 2018-05-21 2019-11-21 Astound Ai, Inc. Active learning to reduce noise in labels
CN109308803A (en) * 2018-07-31 2019-02-05 北京航空航天大学 Path forms time reliability analysis based on Stochastic Volatility Model
CN110070713A (en) * 2019-04-15 2019-07-30 浙江工业大学 A kind of traffic flow forecasting method based on two-way nested-grid ocean LSTM neural network
CN110599766A (en) * 2019-08-22 2019-12-20 浙江工业大学 Road traffic jam propagation prediction method based on SAE-LSTM-SAD
CN110796859A (en) * 2019-10-28 2020-02-14 长安大学 Real-time traffic state identification and accident risk early warning method based on traffic flow
CN110851768A (en) * 2019-11-13 2020-02-28 四川大学 Multilayer fuzzy evaluation method for barrier lake
CN110910641A (en) * 2019-12-02 2020-03-24 北京航空航天大学 Neural network-based real-time risk assessment method for highway traffic operation
CN111063194A (en) * 2020-01-13 2020-04-24 兰州理工大学 Traffic flow prediction method
CN111310787A (en) * 2020-01-15 2020-06-19 江苏大学 Brain function network multi-core fuzzy clustering method based on stacked encoder
CN111210633A (en) * 2020-02-09 2020-05-29 北京工业大学 Short-term traffic flow prediction method based on deep learning

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
H.YEO等: ""Impact of traffic states on freeway crash involvement rates"", 《ACCIDENT ANAL.PREVENTION》, vol. 50, pages 713 - 723 *
Z.Y.CUI等: ""Stacked bidirectional and unidirectional LSTM recurrent neural network for forecasting network-wide traffic state with missing value"", 《TRANSP.RES.C, EMERG TECHNOL》, vol. 118, pages 1 - 14 *
李佳文等: ""基于深度学习的短时交通流预测"", 《山东交通科技》, no. 5, pages 87 - 90 *
马昌喜等: ""突发公共卫生事件下应急定制公交线路优化"", 《交通运输工程学报》, vol. 20, no. 3, pages 89 - 99 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112561215A (en) * 2021-02-24 2021-03-26 江苏铨铨信息科技有限公司 Atmospheric haze prediction method based on SAE-LSTM model
CN113011440A (en) * 2021-03-19 2021-06-22 中联煤层气有限责任公司 Coal bed gas well field monitoring heavy identification technology
CN113011440B (en) * 2021-03-19 2023-11-28 中联煤层气有限责任公司 Coal-bed gas well site monitoring and re-identification technology
CN113256075A (en) * 2021-04-29 2021-08-13 浙江非线数联科技股份有限公司 Enterprise risk level evaluation method based on hierarchical analysis and fuzzy comprehensive evaluation method
CN113313417A (en) * 2021-06-23 2021-08-27 北京鼎泰智源科技有限公司 Complaint risk signal grading method and device based on decision tree model
CN113313417B (en) * 2021-06-23 2024-01-26 北京鼎泰智源科技有限公司 Method and device for classifying complaint risk signals based on decision tree model
CN113569950A (en) * 2021-07-28 2021-10-29 大唐环境产业集团股份有限公司 Power station equipment fault monitoring model generation method, system and device
CN113569950B (en) * 2021-07-28 2024-05-28 大唐环境产业集团股份有限公司 Power station equipment fault monitoring model generation method, system and device
CN115116212A (en) * 2022-05-06 2022-09-27 浙江科技学院 Traffic prediction method for road network, computer device, storage medium and program product
CN116993167A (en) * 2023-09-27 2023-11-03 交通运输部水运科学研究所 Real-time risk judging method and system in production process
CN116993167B (en) * 2023-09-27 2023-12-15 交通运输部水运科学研究所 Real-time risk judging method and system in production process

Also Published As

Publication number Publication date
CN112270355B (en) 2023-12-05

Similar Documents

Publication Publication Date Title
CN112270355A (en) Active safety prediction method based on big data technology and SAE-GRU
CN109697852B (en) Urban road congestion degree prediction method based on time sequence traffic events
Qin et al. A novel combined prediction scheme based on CNN and LSTM for urban PM 2.5 concentration
CN111612243B (en) Traffic speed prediction method, system and storage medium
Yao et al. A support vector machine with the tabu search algorithm for freeway incident detection
Yan et al. Spatial-temporal chebyshev graph neural network for traffic flow prediction in iot-based its
CN110555989B (en) Xgboost algorithm-based traffic prediction method
Li et al. Graph CNNs for urban traffic passenger flows prediction
CN113159364A (en) Passenger flow prediction method and system for large-scale traffic station
Alizadeh et al. Network traffic forecasting based on fixed telecommunication data using deep learning
CN112330158B (en) Method for identifying traffic index time sequence based on autoregressive differential moving average-convolution neural network
CN113283665A (en) Urban traffic accident risk prediction method based on road network
Tang et al. Short-term travel speed prediction for urban expressways: Hybrid convolutional neural network models
Li et al. Autonomous vehicle trajectory combined prediction model based on CC-LSTM
Rasaizadi et al. Short‐Term Prediction of Traffic State for a Rural Road Applying Ensemble Learning Process
CN112927507A (en) Traffic flow prediction method based on LSTM-Attention
Shrivastava et al. Selection of efficient and accurate prediction algorithm for employing real time 5G data load prediction
Wang et al. Dynamic traffic prediction based on traffic flow mining
Tan et al. A noise-immune and attention-based multi-modal framework for short-term traffic flow forecasting
CN114463978B (en) Data monitoring method based on track traffic information processing terminal
Lin et al. Network-wide multi-step traffic volume prediction using graph convolutional gated recurrent neural network
Singh et al. Implication of Mathematics in Data Science Technology Disciplines
Jain et al. An improved traffic flow forecasting based control logic using parametrical doped learning and truncated dual flow optimization model
Baruah et al. Clustering as a tool for self-generation of intelligent systems: a survey
Prathilothamai et al. Traffic prediction system using IoT cluster based evolutionary under sampling approach

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant