WO2018122801A1 - Method for detecting traffic anomaly of urban road - Google Patents

Method for detecting traffic anomaly of urban road Download PDF

Info

Publication number
WO2018122801A1
WO2018122801A1 PCT/IB2017/058531 IB2017058531W WO2018122801A1 WO 2018122801 A1 WO2018122801 A1 WO 2018122801A1 IB 2017058531 W IB2017058531 W IB 2017058531W WO 2018122801 A1 WO2018122801 A1 WO 2018122801A1
Authority
WO
WIPO (PCT)
Prior art keywords
time
traffic
data
sub
segment
Prior art date
Application number
PCT/IB2017/058531
Other languages
French (fr)
Chinese (zh)
Inventor
杜豫川
邓富文
Original Assignee
同济大学
杜豫川
许军
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 同济大学, 杜豫川, 许军 filed Critical 同济大学
Priority to GB1909405.1A priority Critical patent/GB2572717B/en
Priority to CN201780050755.4A priority patent/CN109643485B/en
Publication of WO2018122801A1 publication Critical patent/WO2018122801A1/en

Links

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0108Measuring and analyzing of parameters relative to traffic conditions based on the source of data
    • G08G1/0112Measuring and analyzing of parameters relative to traffic conditions based on the source of data from the vehicle, e.g. floating car data [FCD]
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0133Traffic data processing for classifying traffic situation
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications
    • G08G1/0141Measuring and analyzing of parameters relative to traffic conditions for specific applications for traffic information dissemination

Definitions

  • the invention belongs to the technical field of traffic detection.
  • the present invention relates to a method for real-time detection of urban road traffic anomalies.
  • the spatial position information of different time points can be obtained.
  • map matching and data fusion After data preprocessing, map matching and data fusion, the travel speed probability distribution of the specific time and space range can be obtained; according to the change of the speed distribution, the position can be effectively identified.
  • Urban road traffic anomalies Background technique
  • Traffic anomaly detection is an important part of urban traffic management and one of the core functions of intelligent transportation systems. Traffic anomalies mainly include traffic accidents, vehicle dumping, falling objects, damage or malfunction of road traffic facilities, and other special events that cause traffic flow disturbances. Such incidents are prone to traffic congestion, reduced road capacity, and severely affect the normal operation of the entire road traffic system. Through traffic anomaly detection, traffic managers can timely understand traffic anomaly information and take appropriate inducement and control measures to reduce the adverse effects of traffic anomalies.
  • Traffic anomaly detection can be divided into manual mode and automatic mode.
  • Manual methods include patrol cars, emergency telephone reporting, and video surveillance. Due to the human and material resources and poor real-time performance, traffic management needs cannot be met.
  • the automatic method relies on the automatic event detection (AID, Automated Incidence Detection) algorithm.
  • AID Automated Incidence Detection
  • the basic principle is to identify traffic anomalies by detecting changes in road traffic at different locations.
  • AID algorithms include pattern recognition algorithms (such as Califorma algorithm, Monica algorithm), statistical prediction algorithms (such as exponential smoothing, Kalman filtering), traffic flow model algorithms (such as McMaster algorithm), and intelligent recognition algorithms. (such as artificial neural networks, fuzzy logic algorithms).
  • the invention utilizes the trajectory data returned by the taxi and the bus GNSS positioning device to establish a historical traffic state database and a real-time traffic state database, and analyzes the traffic anomaly events by analyzing the difference of the traffic flow characteristics reflected by the two.
  • the method has the characteristics of good real-time performance, parallel processing, high recognition rate and low requirements for detection facilities, and is suitable for detecting urban road traffic anomalies in a data environment with real-time floating vehicle positioning data.
  • a US patent application, US 20160148512 discloses a composition principle and implementation method of a traffic anomaly detection and reporting system.
  • the system consists of a sensor, a communication module, a mobile processing module, and a user interaction module.
  • the sensor is used to collect relevant data around the vehicle;
  • the communication module is used for transmitting the vehicle data and receiving data of the surrounding vehicle;
  • the mobile processing module is for processing and analyzing the data of the relevant vehicle in a certain area and generating a traffic event report; user interaction
  • the module is able to provide traffic incident reports like a user.
  • the scheme is a traffic anomaly detection technology based on the vehicle and vehicle communication network, which can use various types of information collected by sensors to identify abnormal events.
  • the sensor and the communication unit need to be separately installed and debugged, the implementation is difficult; the processing capacity of the mobile processing unit is limited; and the mobile and fixed message receiving end is required, and the system itself has a failure probability and the reliability is not good.
  • a Chinese patent application, CN 104809878 A discloses a method for detecting abnormal state of urban road traffic using bus GPS data.
  • the scheme obtains the link delay time index according to the GPS historical data, obtains the instantaneous speed, the cycle average speed, the weighted moving average speed and the multi-vehicle average speed according to the current GPS data, and uses the gauge variable analysis algorithm to detect the abnormality.
  • This program does not need new Increase inspection facilities and facilitate implementation.
  • the characterization of the traffic situation is too simplistic, and it is impossible to analyze the characteristics and causes of traffic anomalies. There is no basis for the division of traffic scenarios, and the influence of weather and other factors on traffic situation changes cannot be considered. Summary of the invention
  • Floating car Also known as the probe car. Refers to buses and taxis that have on-board positioning devices and are driving on city roads.
  • GNSS Global Navigation Satellite System. Including GPS, GLONASS, GALILEO and Beidou satellite navigation systems.
  • Space-time sub-zone A zone divided by two dimensions, time and space, reflected in a certain space within a certain period of time. Divide the day into several time segments, for example 0:00-0: 10, 0: 10-0:20..., each time segment is called a time sub-zone; divide the implementation area of urban road traffic anomaly detection into A number of spatial segments, such as the longitude 121.58° E-121.59 0 E, the latitude 31.16° N-31.17° N, each spatial segment is called a spatial sub-region; any one time sub-region and any one spatial sub-region The space-time segment formed by the intersection is called the spatiotemporal sub-region, for example, the space-time segment of the region between 0:00-0:10 in the region between the longitude 121.58° E-121.59 0 E and the latitude 31.16° N-31.17 0 N.
  • Historical trajectory data is trajectory data accumulated over a long period of time and stored in a database. Historical trajectory data is dynamically changing data that needs to be updated in a timely manner and periodically reprocessed and analyzed to ensure the accuracy of historical traffic feature extraction. The data for each time-space sub-area can be processed in parallel to increase efficiency. In the present invention, it may be simply referred to as historical data.
  • Real-time trajectory data is a trajectory data set within a time zone that is closest to the current time. In the present invention, it may be simply referred to as real-time data.
  • Traffic situation A general term for the comprehensive situation of traffic operations within a certain period of time and within a certain space.
  • Traffic anomalies traffic flow disturbances caused by traffic accidents, vehicle dumping, truck falling, road traffic facilities damage or malfunctions.
  • Abnormal traffic severity The severity of traffic flow disorder is the difference in traffic flow characteristics after traffic flow and traffic anomalies in normal conditions.
  • Traffic Anomaly Index A measure of the severity of traffic. The range is 0 ⁇ 10. The larger the value, the more serious the traffic anomaly.
  • Traffic environment The sum of all external influences and forces acting on road traffic participants. This includes road conditions, transportation facilities, landforms, meteorological conditions, and traffic activities of other transportation participants.
  • Map Matching The process of associating geographic coordinates with a city road network.
  • Peak hourly traffic The maximum hourly traffic flow in a city's road section.
  • Finite Mixing Model A mathematical method of simulating complex density with simple density.
  • Response variable A variable that changes according to an independent variable, also called a dependent variable.
  • Bayesian information criterion It is an evaluation index of the reliability of the result of correcting the probability of occurrence by using the Bayesian formula for subjective probability estimation under partially incomplete information. Its calculation method is:
  • Likelihood function is a function of the parameters of the statistical model. Given the output X, the likelihood function (
  • Parameter Estimation A method of estimating unknown parameters contained in the overall distribution based on samples taken from the population.
  • the Expectation Maximization Algorithm is an iterative algorithm for the maximum likelihood estimation or the maximum posterior probability estimation of a probability parameter model with implicit variables.
  • Kullback-Leibler divergence A measure of the difference between two probability distributions, P and Q.
  • Jensen-Shannon divergence is a symmetrized form of Kullback-Leibler divergence.
  • K-Medoids algorithm A clustering algorithm that selects such a point from the current category for each iteration. It has the smallest sum of distances to all other points (in the current category) as the new center point.
  • the object of the present invention is to establish a scheme based on a floating vehicle trajectory recording system, using historical GNSS positioning data and real-time GNSS positioning data, combined with traffic environment information to identify road traffic anomalies.
  • the present invention provides the following technical solutions:
  • the premise of the present invention is: a floating car (a taxi, a bus, etc.) equipped with a GNSS track recorder; a data center having a large-scale storage, calculation, and real-time task processing capability.
  • the scope of application of the present invention is: Urban roads (including ground roads and elevated roads) through which the above-mentioned floating vehicles pass.
  • the implementation steps of the present invention include:
  • the time range can be set to all day, that is, 0:00-24:00; it can also be set to a specific time period. For example, to detect the traffic abnormal time during the period from 17:00 to 20:00, the time will be detected.
  • the range is set from 17:00 to 20:00.
  • the spatial scope can be set as a certain city area according to the administrative division, such as Beijing, Shanghai, Huangpu District, etc. It can also be set as a certain urban functional area according to the urban spatial structure, such as a central business district and industrial area of a certain city.
  • the establishment of the spatiotemporal sub-area refers to dividing the detected time range into a number of smaller time segments, and dividing the detected spatial range, that is, the implementation area of the urban road traffic anomaly detection, into a plurality of smaller spatial segments.
  • a variety of empirical division methods can be used, including equidistant space-time division method and non-equidistant space-time division method.
  • GNSS Global Navigation Satellite System Positioning System
  • GPS Global Positioning System
  • GLONASS Global Navigation Satellite System
  • GALILEO Global Navigation Satellite System
  • Beidou satellite navigation system It also includes QZSS in Japan and IRNSS in India.
  • QZSS Global Positioning System
  • IRNSS International Radio Network Service Set
  • Such as regional navigation and positioning systems, as well as satellite positioning enhancement systems such as WASS in the United States and MSAS in Japan.
  • GNSS positioning equipment such as taxis, buses, freight cars, private cars, etc.
  • urban taxis are often used as floating vehicles as data sources for traffic anomaly detection systems.
  • the collected GNSS positioning information contains some unreasonable information.
  • anomalous data include: data that falls outside the time and space of detection, and spatial position jumps that are clearly out of reasonable range. The so-called "space position jump beyond the reasonable range” is illustrated below.
  • the positioning point uploaded by a floating vehicle positioning device is recorded as A at 10:30:00 on a certain day
  • the positioning point uploaded by the floating vehicle positioning device at time 10:30:30 is recorded as B, the distance between position A and position B. It is 1500 meters, then the speed of the floating car is calculated to be at least 180km/h, which is beyond the common sense, so it is an abnormal spatial position jump, which should be eliminated in data processing.
  • GNSS positioning data After pre-processed GNSS positioning data, it is necessary to combine the urban road network data, map the GNSS positioning points to the city map through the map matching algorithm, establish the matching relationship between the positioning points and the road segments, and correct the error caused by the positioning drift.
  • the electronic maps of various geographical regions are relatively detailed.
  • Such electronic maps can be derived from the city's geographic information system, and of course can also be derived from other ways and means.
  • These electronic maps detail the urban road information, and several sections can be obtained by dividing. By matching the anchor points to the road segments by means of distance, angle, etc., the positioning information is matched to the actual geographical environment.
  • the path of the vehicle may not be unique given a set of starting and ending points.
  • the complex urban traffic network consists of several sections, which are numbered, for example, Ll, L2, etc. Roads may have two different directions of travel. In this case, two different directions of travel should be represented as two different sections, given different sections.
  • the intersection of the road segments in the urban road network can usually be used. Knowing the path of a floating car, it is now necessary to select the same path as the floating car path from the path information that has been sent by other floating cars, so as to obtain the same path group between the starting point and the ending point.
  • the positioning data of the floating car includes information such as position coordinates, instantaneous vehicle speed, and recording time.
  • data sampling refers to screening part of the data from all floating car data for subsequent analysis and processing, and the screening is based on the computing power of the data center and Pre-proposed accuracy requirements were made. Different data sampling methods can be used based on different calculation capabilities and accuracy requirements.
  • the computing power of the data center when the computing power of the data center is strong and the accuracy of the detection is high, all the floating vehicle positioning data can be treated as a processing object, and comprehensive processing analysis is performed; and when the computing power of the data center is limited, it is assumed
  • the current data center can process 500 data for each spatial sub-area within 1 minute, but the actual situation is that in 2000, each floating sub-area can generate 2000 floating-vehicle positioning data, so it can be from 2000 data. Randomly extract 500 data for analysis, so as to obtain processing results with limited accuracy within the computing power of the data center.
  • the urban road traffic anomaly detection method based on floating car data proposed in this patent uses the travel speed as the basis for urban road traffic anomaly detection. Therefore, data sampling refers to sampling the speed of the journey.
  • the so-called historical trajectory data refers to the floating vehicle trajectory data accumulated in long-term urban road traffic operations.
  • an urban road traffic feature model can be established to reflect the general characteristics of urban traffic operations.
  • the urban road traffic feature model mentioned here can refer to certain specific indicators, such as average speed, weighted average speed, etc.; it can also refer to various statistical models, such as the probability distribution of travel speed.
  • a single indicator was used to represent the traffic characteristics of a certain section or area (such as the historical average speed).
  • this method is simple, the accuracy is not high, the sensitivity is poor, and often it is not in traffic abnormalities. Good results in state detection. Therefore, this patent proposes to describe the traffic characteristics by using the probability distribution of traffic characteristic variables for each spatiotemporal sub-area, establish a traffic feature model and perform parameter estimation.
  • the traffic characteristic variables that can be collected including the travel speed and travel time, etc., the probability distribution of the traffic characteristic variables described in this patent refers to the probability distribution of the travel speed.
  • the so-called real-time trajectory data refers to the trajectory data of the floating car in traffic operation in a period of time not far from the current time.
  • real-time floating car trajectory data you can grasp the dynamics of traffic characteristics and reflect the current characteristics of current traffic operations.
  • This patent uses the travel speed of the current space-time sub-zone to describe the current traffic characteristics.
  • Dennrng's model is independent of any particular system, application environment, system vulnerability, and fault type, and is therefore a general anomaly detection model.
  • the model consists of five parts: subject, object, audit record, outline, exception record and activity rule.
  • a contour is a normal behavior of a subject relative to an object, represented by metrics and statistical models.
  • Dennrng's model defines three metrics, namely event counter, interval timer, resource measurer, and proposes five statistical models, namely, operational model, mean and standard deviation models, multivariate models, Markov process models, and Time series model.
  • the model proposed by Denning establishes the statistically-based normal behavioral feature profile of the system subject through the analysis of the system audit data.
  • the audit data in the system is compared with the normal behavioral feature profile of the established subject.
  • Exceeding a certain threshold is considered an abnormal event.
  • This model lays the foundation for anomaly detection, and many anomaly detection methods and systems developed in the future are developed on the basis of it.
  • user behavior data is divided into two categories according to certain statistical criteria: abnormal behavior and normal behavior.
  • the statistical-based method has certain difficulties in extracting and abstracting the audit instance, it may cause large errors, and must rely on some probability distribution hypotheses.
  • the artificial neural network is introduced. Clustering method.
  • the artificial neural network has the self-learning self-adaptive ability to train the neural network with sample points representing the normal user behavior. Through repeated learning, the neural network can extract the normal user or system activity patterns from the data and encode them into the network structure.
  • the audit data can be judged whether the system is normal by learning a good neural network. Because the anomaly evaluation criterion has certain ambiguity, the fuzzy evidence theory is introduced into the anomaly. For example, an intrusion detection framework model based on fuzzy expert system is established, which can better reduce the false alarm rate and false alarm rate.
  • This patent proposes an anomaly detection scheme based on statistical features.
  • the basic idea is to measure the difference between historical traffic characteristics and real-time traffic characteristics by Jensen-Shannon divergence to achieve the detection of abnormal traffic conditions.
  • the scheme has the advantages of good interpretability and little computational burden. It not only overcomes the inaccurate and untimely weakness of single statistic detection, but also avoids the defects of artificial neural network and other methods to calculate the negative load and high hardware requirements.
  • the severity of traffic anomalies should be released to the public in a clear and concise manner to avoid possible congestion areas and improve the efficiency of urban traffic.
  • the severity of the abnormal condition is characterized by the traffic anomaly index, ranging from 0-10, where 0 means no abnormality, 10
  • the height is abnormal.
  • the location of the anomaly is projected onto the electronic map and published publicly through the smart mobile device APP or the like.
  • the evaluation of system performance refers to the evaluation of the accuracy of traffic abnormal state detection, and its evaluation indicators include false positive rate and false negative rate. The lower the false positive rate and the false negative rate, the better the performance of the system.
  • the division of the spatiotemporal sub-area may specifically adopt the following method:
  • the time segment span is a fixed value, usually taking 30mm as a time segment;
  • determining the segment scale of the spatial dimension the spatial segment span is a fixed value, usually taking a spatial grid of 200m ⁇ 200m as a spatial segment;
  • Non-equidistant space-time division method based on road network density Based on road network density as a judgment index, when the road network density is greater than or equal to 2km/km 2 , take 30min time segment and 200m X 200m space segment; When the density is less than 2km/km 2 , take a 30 min time segment and a 400 m X 400 m spatial segment;
  • Non-equidistant space-time division method based on peak hour flow based on peak hour flow as a judgment indicator, when the peak hour flow rate is greater than or equal to 1000 vehicles/hour, take 30 min time segment and 200 m X 200 m space segment; When the flow rate is less than 1000 vehicles/hour, take a time segment of 30 minutes and a space segment of 400m X 400m.
  • the step 3) specifically includes the following steps:
  • Each grid area contains several road segments, and the set of these road segments is represented as R S , and each road segment in the set of the road segments is represented as ij. And assign a number to each road segment;
  • the matching scheme includes:
  • mapping matching process After the map matching process, combining the timestamp data of the coordinates of the positioning point, matching the positioning point to the space-time sub-region (the step 5) may specifically adopt one of the following methods:
  • the total travel speed data of each sub-floating vehicle in a time and space sub-region constitutes the whole.
  • the implementation method is to calculate the travel speed of each vehicle in the space-time sub-region: v ⁇ W ..”"- 1 '", where 4 2 ... 4 - ln is the first and second in the space-time sub-area ⁇
  • the distance between the GNSS anchor points, ..., the distance between the -1 and the GNSS anchor points, -t n is the first in the space-time sub-region, ..., the first "Timestamp of a GNSS anchor point; the data in each spatio-temporal sub-area is not filtered to form a set ⁇ for subsequent processing.
  • Time-smooth sampling plan for speed information Specify the length of the time segment, set the upper limit of the number of segments of the same time; search for the velocity data in each time segment of a time-space sub-region, if the number of velocity data in the time segment exceeds the upper limit, the data of the upper limit is taken
  • the implementation method is to calculate the travel speed of each vehicle in the space-time sub-region: v f W ..”"- 1 '", where 2 ...
  • step 6) is the first and second GNSS positioning points in the space-time sub-area f Distance, ..., the distance between -1 and the nth GNSS anchor point, ⁇ ... ⁇ is the space-time sub-area ⁇ , ising, the first GNSS Time stamp of the anchor point; specify the upper limit of the number of segments of the time segment at the same time; ⁇ ⁇ ; search for the velocity data in the time segment of the time and space sub-region, if the number of velocity data in the time segment exceeds the upper limit; ⁇ « , randomly fetching; ⁇ «bar data is added to ⁇ and used for subsequent processing.
  • the step 6) may specifically adopt one of the following methods:
  • This scheme uses a mixed Gaussian model with a fixed component quantity to describe the probability distribution of vehicle speed.
  • the mixed model with the smallest ⁇ /C is selected, and the parameter vectors ⁇ ⁇ and ⁇ are recorded, where ⁇ is the proportional vector occupied by each sub-component in the historical traffic feature model, and ⁇ is the historical traffic feature model.
  • the mean vector of each sub-component, ⁇ is the standard deviation vector of each sub-component in the historical traffic feature model, as a feature record of the present-time sub-region.
  • the density curve morphology of the hybrid model is shown in Figure 6.
  • This scheme uses the same model-based evaluation method as 612), but the distribution of the sub-components and the number of components are variable.
  • the method is as follows:
  • the probability distribution model is chosen as the distribution type of the sub-components, including but not limited to: normal distribution, gamma distribution, Weibull distribution.
  • normal distribution gamma distribution
  • Weibull distribution a normal distribution
  • the sub-distribution function takes: : exp
  • the sub-distribution function When using a gamma distribution, the sub-distribution function uses: strip: v "" - l e ⁇ ", where ⁇ ( ⁇ ): When using the Weibull distribution, the sub-distribution function uses:
  • Historical trajectory data classification method by context. According to the temperature, precipitation, visibility and traffic control measures, the historical data without traffic anomalies are divided into different categories, and models and parameter estimates are established.
  • the implementation method is as follows:
  • the traffic environment is divided into 5 ⁇ 8 categories.
  • the historical data corresponds to the different traffic environments, and the historical data is classified into the above categories.
  • the processing as described in 5) is performed separately, thereby establishing a mapping relationship R ( ⁇ , for the traffic environment, and for the traffic situation.
  • Historical data clustering method For the historical data, the difference between the time and space sub-regions is obtained, and the difference quantitative representation of different space-time regions is obtained, and the quantized differences are used for clustering. Using temperature, precipitation, visibility and traffic control measures as characteristic factors, a number of Lo gl t regressions were performed to establish a mapping relationship between traffic environment and categories. See Figure 4 for the implementation process. The implementation steps are as follows: 631) According to the method described in 5), a traffic feature model is established, and parameter estimation is performed.
  • a ( v f ) ⁇ '(; In ⁇ , the number of subcomponents representing the speed distribution of the stroke, // represents the proportion of a subcomponent in the travel speed distribution, and represents the mean of a subcomponent in the travel speed distribution, ⁇ Indicates the standard deviation of a subcomponent in the travel speed distribution.
  • Z) MC represents the Kullback-Leibler divergence approximated by Monte Carlo sampling, and / and g represent any two distribution functions.
  • the traffic environment data (including temperature, precipitation, visibility, etc.) is used as an independent variable, and multiple Logit regression is performed to obtain the mapping relationship R between the traffic environment E and the traffic situation category T (£ ⁇ .
  • the step 637) Aggregate the same type of data, and re-establish the hybrid model with the new data set after aggregation, and perform parameter estimation to obtain the final historical traffic characteristic data set.
  • the step 7) may specifically adopt the following methods:
  • the step 8) specifically includes the following steps:
  • step 72) When step 72) is adopted, according to the current traffic situation category ⁇ , the historical traffic characteristic data under the category is located, otherwise it will not be processed;
  • is the proportional vector of each subcomponent in the real-time traffic feature model
  • is the mean vector of each subcomponent in the real-time traffic feature model
  • J rt is the standard for each subcomponent in the real-time traffic feature model Difference vector
  • is the proportional vector of each sub-component in the historical traffic feature model
  • is the mean vector of each sub-component in the historical traffic feature model
  • is the standard deviation vector of each sub-component in the historical traffic feature model.
  • the step 9) specifically includes the following step:
  • the step 10) specifically includes the following steps:
  • GNSS trajectory data detect historical traffic state changes through historical traffic feature extraction and real-time traffic situation analysis, and realize real-time, low-cost, intelligent urban road traffic anomaly events Detection
  • the urban road traffic anomaly detection technology based on floating car data proposed by the present invention can realize the detection of abnormal events with high accuracy, the detection rate exceeds 90%, and the false negative rate is less than 15%.
  • the false alarm rate is lower than 20%, and it has achieved good detection results, and can be applied to urban traffic intelligent management and service.
  • FIG. 1 shows a schematic diagram of the components and basic principles of the present invention
  • Figure 2 is a schematic view showing the overall flow of the present invention in the implementation process
  • FIG. 3 is a schematic diagram showing an implementation manner of a fast map matching algorithm of the present invention.
  • FIG. 4 is a schematic flow chart showing a historical traffic feature extraction scheme implemented by the present invention.
  • FIG. 5 is a schematic flow chart showing a real-time traffic feature extraction scheme implemented by the present invention.
  • Figure 6 shows a schematic diagram of the morphology of the Gaussian mixture model probability distribution
  • Figure 7 shows a measurement of the difference in the comparison between historical traffic characteristics and real-time traffic characteristics.
  • the overall system architecture of the present invention includes: an onboard GNSS track recorder mounted on a floating vehicle, a data center, a GNSS satellite, and a communication system.
  • the GNSS here includes GPS, GLONASS, GALILEO, Beidou, IRNSS, QZSS and any similar navigation satellite positioning system.
  • GNSS track recorders equipped with floating cars, buses, etc. record the position information of the vehicle at various points in time at a certain sampling frequency / (general requirements of 0.1 Hz), and pass the GPRS mobile communication network (also can be used) Wireless network communication technologies such as WCDMA and TD-LTE, but the cost will be increased accordingly)
  • the location information will be sent to the data center in real time.
  • the data center establishes a historical road traffic characteristic database through data preprocessing, data fusion, and through a specific algorithm; establishes a real-time traffic feature database for the recently received real-time data; and determines whether the current traffic feature is abnormal through the mapping relationship between the historical database and the real-time database And visualize the display through the processing terminal and generate a traffic anomaly event report.
  • the overall process of the scheme is shown in Figure 2, including the acquisition and storage of GNSS trajectory data, the establishment of spatiotemporal sub-areas, historical traffic feature extraction, real-time traffic feature extraction, and anomaly identification.
  • Collecting and storing GNSS trajectory data is the data foundation of the whole scheme. Due to the huge amount of data, a distributed storage scheme should be adopted.
  • the basic assumption of establishing a spatiotemporal sub-area is that it has the same traffic characteristics in a certain area and a specific time period. This assumption is generally applicable after long-term observation.
  • Historical traffic feature extraction the principle is to use the GNSS trajectory data to calculate the travel speed, benefit Using a large number of travel speed data in the same space-time sub-area, the probability distribution model of the vehicle speed is established, and the parameters are estimated, and the traffic characteristics are characterized by a small number of parameters.
  • Real-time traffic feature extraction the principle is to process and analyze the speed data in the current time period, and also establish the current vehicle speed probability distribution model.
  • the abnormality identification is to use the difference measurement index to judge the degree of change of the real-time feature compared with the historical feature, and determine whether a traffic anomaly event occurs according to whether it reaches the threshold.
  • Embodiment 1 According to the combination of the embodiments of the invention, the implementation is given below. Embodiment 1
  • Step 11 Using the equidistant space-time division method, determining the segment scale of the time dimension, the time segment span is a fixed value, usually taking 30 mm as a time segment; determining the segment scale of the spatial dimension, the spatial segment span is a fixed value, usually taking 200m ⁇ 200m
  • the spatial grid acts as a spatial fragment.
  • Step 12 Perform data preprocessing to perform data cleaning, data integration, data conversion, and data reduction on the GNSS positioning data to improve the structural degree of the data.
  • Step 13 Divide the space area to be processed into a grid of a certain size, and the range of each grid area can be expressed as
  • the anchor point is matched to the spatio-temporal sub-area in combination with the timestamp data of the coordinates of the positioning point.
  • Step 14 The total travel speed data of each of the secondary floating cars in a time and space sub-zone constitutes an overall. Calculate the travel speed of each vehicle in the time-space sub-region: where 2 ... is the distance between the first and second GNSS anchor points in the space-time sub-region f, ..., "-1 The distance from the nth GNSS anchor point is the first time in the space-time sub-region, ..., the time stamp of the GNSS anchor point; the data in each spatio-temporal sub-region is not filtered. , constitute a collection V ⁇ for subsequent processing.
  • Step 15 the historical data under the condition of no traffic anomaly is used to establish a traffic feature model and estimate the parameters.
  • Step 16 Perform real-time traffic data model establishment and parameter estimation to obtain a characteristic function of the current traffic condition.
  • the method is the same as step one, and the parameter vector ⁇ , ⁇ , a is recorded.
  • Step 17 Calculate the difference between the two velocity distributions according to the description parameters ⁇ , ⁇ , ⁇ of the current traffic characteristics and the description parameters ⁇ , ⁇ , ⁇ of the historical traffic characteristics: JSD(/) ri
  • Step 18 Normalize the difference in velocity distribution of each space-time sub-region to a normalized value a d of 0 ⁇ 1:
  • Step 21 Using the equidistant space-time division method, determining the segment scale of the time dimension, the time segment span is a fixed value, usually taking 30 mm as a time segment; determining the segment scale of the spatial dimension, the spatial segment span is a fixed value, usually taking 200 m X
  • the 200m spatial grid acts as a spatial fragment.
  • Step 22 Perform data preprocessing to perform data cleaning, data integration, data conversion, and data reduction on the GNSS positioning data to improve the structural degree of the data.
  • Step 24 Calculate the travel speed of each vehicle in the space-time sub-region: , where ⁇ ... 1 nowadays is a time-space sub-
  • the distance between the first and second GNSS anchor points in the zone, ..., the distance between the -1 and the nth GNSS anchor points, ⁇ ... is the space-time sub-zone 1 , ..., the first time stamp of a GNSS anchor point; specify the maximum length of the clip data at the same time segment length; ⁇ «; search for the speed data in the time segment of the time and space sub-region If the number of speed data in the time segment exceeds the upper limit p, randomly f data is added to ⁇ . .
  • Step 25 Perform historical traffic data without traffic anomalies as a whole, and perform traffic feature model establishment and parameter estimation.
  • the mixed model with the smallest ⁇ /C is selected, and its parameter vectors ⁇ ⁇ and ⁇ are recorded as the feature records of the local time and space sub-region.
  • the probability density function p, (x) of the travel speed distribution corresponding to the spatio-temporal sub-region on different dates is written:
  • the distance matrix is used as the input of the K-Medoids algorithm to obtain clustering results and index the categories.
  • the traffic environment data (including temperature, precipitation, visibility, etc.) is used as an independent variable, and multiple Logit regressions are performed to obtain the mapping relationship between the traffic environment E and the traffic situation category T (E.
  • the same type of data is aggregated, and the mixed model is re-established with the new data set after aggregation, and the parameter estimation is performed to obtain the final historical traffic characteristic data set.
  • Step 26 Obtain a characteristic function of the traffic condition, and obtain information such as current temperature, precipitation, visibility, traffic control measures, and the type of the current traffic condition.
  • Step 28 Normalize the difference in velocity distribution of each spatiotemporal sub-region to a normalized value of 0 to 1 .
  • Step 31 Using a non-equidistant space-time division method, for a central area of the city where the road network density is greater than 2 km/km 2 or the peak hour flow is greater than 1000 vehicles/hour, a 30 min time segment and a 200 m X 200 m space segment are taken. For urban suburbs with a density of less than 2km/km 2 or a peak hour flow of less than 1000 vehicles per hour, a 30-minute time segment and a 400 m X 400 m space segment are taken.
  • Step 32 Perform data preprocessing, perform data cleaning, data integration, data conversion, and data reduction on the GNSS positioning data to improve the structural degree of the data.
  • Step 34 Calculate the travel speed of each vehicle in the space-time sub-region: where 2 ... 4 - 1, « is a time slot
  • the distance between the first and second GNSS anchor points in the zone ..., the distance between the -1 and the nth GNSS anchor point, ⁇ ... is the space-time sub-zone 1 , ..., the first time stamp of a GNSS anchor point; specify the maximum length of the clip data at the same time segment length; ⁇ «; search for the speed data in the time segment of the time and space sub-region If the number of speed data in the time segment exceeds the upper limit p, randomly f data is added to ⁇ . .
  • Step 35 Perform historical data of no traffic anomaly as a whole, and establish traffic feature model and parameter estimation.
  • the distance matrix is used as the input of the K-Medoids algorithm to obtain clustering results and index the categories.
  • the traffic environment data (including temperature, precipitation, visibility, etc.) is used as an independent variable to perform multiple logit regression to obtain the mapping relationship between the traffic environment E and the traffic situation category T (E.
  • the same category will be used.
  • the data is aggregated, and the hybrid model is re-established with the new data set after aggregation, and the parameter estimation is performed to obtain the final historical traffic characteristic data set.
  • Step 36 Obtain a characteristic function of the traffic condition, and obtain current information such as temperature, precipitation, visibility, traffic control measures, and the type of the current traffic condition.
  • Step 38 Normalize the difference in velocity distribution of each space-time sub-region to a normalized value of 0 ⁇ 1 .

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Traffic Control Systems (AREA)

Abstract

A method for detecting a traffic anomaly of an urban road. By using a vehicle-mounted GNSS positioning device of a floating vehicle, spatial position information on different time points is obtained; and by means of the analysis and excavation of a great quantity of track information of the floating vehicle, a traffic anomaly event of an urban road can be smartly detected. In the detection method, a traffic state is indicated by using the probability distribution of speeds in travels, a traffic state difference is reflected by using probability distribution difference measurement indictors, the principle is clear, the implementation is simple and convenient, and the detection rate is high.

Description

一种城市道路交通异常检测方法 Urban road traffic anomaly detection method
技术领域 Technical field
本发明属于交通检测技术领域。 特别地, 本发明涉及一种城市道路交通异常实时检测方法。 通过 浮动车的车载 GNSS定位装置, 可获取其不同时刻的空间位置信息, 经过数据预处理、 地图匹配和数 据融合, 获得特定时空范围的行程车速概率分布; 根据速度分布的变化情况, 可有效识别城市道路交 通异常事件。 背景技术  The invention belongs to the technical field of traffic detection. In particular, the present invention relates to a method for real-time detection of urban road traffic anomalies. Through the on-board GNSS positioning device of the floating car, the spatial position information of different time points can be obtained. After data preprocessing, map matching and data fusion, the travel speed probability distribution of the specific time and space range can be obtained; according to the change of the speed distribution, the position can be effectively identified. Urban road traffic anomalies. Background technique
交通异常事件检测是城市交通管理的重要组成部分, 也是智能交通系统的核心功能之一。 交通异 常事件主要包括交通事故、 车辆抛铺、 货车落物、 道路交通设施损坏或故障以及其他造成交通流紊乱 的特殊事件。 该类事件容易造成交通拥堵、 路段通行能力降低, 严重时影响整个道路交通系统的正常 运行。 通过交通异常事件检测, 可使交通管理者及时了解交通异常信息, 并采取适当的诱导和控制措 施, 降低交通异常事件的不良影响。  Traffic anomaly detection is an important part of urban traffic management and one of the core functions of intelligent transportation systems. Traffic anomalies mainly include traffic accidents, vehicle dumping, falling objects, damage or malfunction of road traffic facilities, and other special events that cause traffic flow disturbances. Such incidents are prone to traffic congestion, reduced road capacity, and severely affect the normal operation of the entire road traffic system. Through traffic anomaly detection, traffic managers can timely understand traffic anomaly information and take appropriate inducement and control measures to reduce the adverse effects of traffic anomalies.
交通异常事件检测可分为人工方式和自动方式。 人工方式包括巡逻车、 紧急电话上报和视频监控 等, 由于消耗人力物力且实时性差, 无法满足交通管理的需要。 自动方式依靠自动事件检测 (AID, Automated Incidence Detection)算法实现, 基本原理是通过检测不同位置道路交通流的变化来识别交通 异常事件。 目前常用的 AID算法包括模式识别类算法 (如 Califorma算法、 莫妮卡算法)、 统计预测类 算法(如指数平滑法、 卡尔曼滤波算法)、 交通流模型算法(如 McMaster算法) 以及智能识别算法(如 人工神经网络、 模糊逻辑算法)。  Traffic anomaly detection can be divided into manual mode and automatic mode. Manual methods include patrol cars, emergency telephone reporting, and video surveillance. Due to the human and material resources and poor real-time performance, traffic management needs cannot be met. The automatic method relies on the automatic event detection (AID, Automated Incidence Detection) algorithm. The basic principle is to identify traffic anomalies by detecting changes in road traffic at different locations. Currently used AID algorithms include pattern recognition algorithms (such as Califorma algorithm, Monica algorithm), statistical prediction algorithms (such as exponential smoothing, Kalman filtering), traffic flow model algorithms (such as McMaster algorithm), and intelligent recognition algorithms. (such as artificial neural networks, fuzzy logic algorithms).
但是目前的检测方法存在对设施的要求高、 计算复杂度高、 无法对异常状况的态势做进一步判断 等缺点。 本发明利用出租车、 公交车车载 GNSS定位装置回传的轨迹数据, 建立历史交通状态数据库 和实时交通状态数据库, 通过分析两者反映的交通流特征差异, 识别交通异常事件。 该方法具有实时 性好、 可并行处理、 识别率高以及对检测设施要求低等特点, 适用于有实时浮动车定位数据的数据环 境下城市道路交通异常事件的检测。  However, the current detection methods have disadvantages such as high requirements on facilities, high computational complexity, and inability to make further judgments on the situation of abnormal conditions. The invention utilizes the trajectory data returned by the taxi and the bus GNSS positioning device to establish a historical traffic state database and a real-time traffic state database, and analyzes the traffic anomaly events by analyzing the difference of the traffic flow characteristics reflected by the two. The method has the characteristics of good real-time performance, parallel processing, high recognition rate and low requirements for detection facilities, and is suitable for detecting urban road traffic anomalies in a data environment with real-time floating vehicle positioning data.
目前, 针对交通异常事件监测, 有以下代表性技术:  At present, for the monitoring of traffic anomalies, the following representative technologies are available:
一件美国专利申请, US 20160148512, 披露了一种交通异常事件检测和上报系统的组成原理和实 施方法。 该系统由传感器、 通信模块、 移动处理模块和用户交互模块组成。 传感器用于采集车辆周边 的相关数据; 通信模块用于发送本车辆数据和接收周边车辆的数据; 移动处理模块用于处理和分析相 关车辆在某一区域内的数据并生成交通事件报告; 用户交互模块能够像用户提供交通事件报告。 该方 案是一种基于车车和车路通讯网络的交通异常事件检测技术, 能够利用传感器采集的各类信息, 判别 异常事件。 然而, 由于传感器、 通信单元需要单独安装调试, 实施难度较大; 移动处理单元处理能力 受限; 同时需要移动和固定的讯息接收端, 系统本身存在故障概率, 可靠性不佳。  A US patent application, US 20160148512, discloses a composition principle and implementation method of a traffic anomaly detection and reporting system. The system consists of a sensor, a communication module, a mobile processing module, and a user interaction module. The sensor is used to collect relevant data around the vehicle; the communication module is used for transmitting the vehicle data and receiving data of the surrounding vehicle; the mobile processing module is for processing and analyzing the data of the relevant vehicle in a certain area and generating a traffic event report; user interaction The module is able to provide traffic incident reports like a user. The scheme is a traffic anomaly detection technology based on the vehicle and vehicle communication network, which can use various types of information collected by sensors to identify abnormal events. However, since the sensor and the communication unit need to be separately installed and debugged, the implementation is difficult; the processing capacity of the mobile processing unit is limited; and the mobile and fixed message receiving end is required, and the system itself has a failure probability and the reliability is not good.
一件中国专利申请, CN 104809878 A, 披露了一种利用公交车 GPS数据检测城市道路交通异常状 态的方法。 该方案根据 GPS历史数据获得路段延误时间指数, 根据 GPS当前数据获得瞬时速度、 周期 平均速度、 加权滑动平均速度和多车平均速度, 利用规范变量分析算法检测异常。 这一方案不需要新 增检测设施, 实施便利。 但是对于交通态势的表征过于简化, 无法分析交通异常状况的特点和成因; 对交通场景的划分缺乏依据, 未能考虑天气等因素对交通态势变化的影响。 发明内容 A Chinese patent application, CN 104809878 A, discloses a method for detecting abnormal state of urban road traffic using bus GPS data. The scheme obtains the link delay time index according to the GPS historical data, obtains the instantaneous speed, the cycle average speed, the weighted moving average speed and the multi-vehicle average speed according to the current GPS data, and uses the gauge variable analysis algorithm to detect the abnormality. This program does not need new Increase inspection facilities and facilitate implementation. However, the characterization of the traffic situation is too simplistic, and it is impossible to analyze the characteristics and causes of traffic anomalies. There is no basis for the division of traffic scenarios, and the influence of weather and other factors on traffic situation changes cannot be considered. Summary of the invention
为了更清晰地阐述本发明的内容, 首先将涉及到的专业术语解释如下:  In order to explain the contents of the present invention more clearly, the technical terms involved will first be explained as follows:
浮动车: 也称探测车。 指安装了车载定位装置并行驶在城市道路上的公交汽车和出租车。  Floating car: Also known as the probe car. Refers to buses and taxis that have on-board positioning devices and are driving on city roads.
GNSS: 全球导航卫星系统(Global Navigation Satellite System )。包括 GPS、 GLONASS、 GALILEO 以及北斗卫星导航系统等。  GNSS: Global Navigation Satellite System. Including GPS, GLONASS, GALILEO and Beidou satellite navigation systems.
时空子区:按照时间和空间两个维度划分的片区,反映在一段时间内, 一定的空间范围内的情况。 将一天划分为若干时间片段, 例如 0:00-0: 10, 0: 10-0:20……, 每个时间片段称之为一个时间子区; 将 城市道路交通异常检测的实施区域划分为若干空间片段,例如经度 121.58° E-121.590 E,纬度 31.16° N-31.17° N之间的区域, 每个空间片段称为一个空间子区; 任意一个时间子区和任意一个空间子区的 交集形成的时空片段, 称为时空子区, 例如经度 121.58° E-121.590 E, 纬度 31.16° N-31.170 N之间 的区域在 0:00-0: 10的时空片段。 Space-time sub-zone: A zone divided by two dimensions, time and space, reflected in a certain space within a certain period of time. Divide the day into several time segments, for example 0:00-0: 10, 0: 10-0:20..., each time segment is called a time sub-zone; divide the implementation area of urban road traffic anomaly detection into A number of spatial segments, such as the longitude 121.58° E-121.59 0 E, the latitude 31.16° N-31.17° N, each spatial segment is called a spatial sub-region; any one time sub-region and any one spatial sub-region The space-time segment formed by the intersection is called the spatiotemporal sub-region, for example, the space-time segment of the region between 0:00-0:10 in the region between the longitude 121.58° E-121.59 0 E and the latitude 31.16° N-31.17 0 N.
历史轨迹数据: 历史轨迹数据是长时间积累并存储在数据库中的轨迹数据。 历史轨迹数据是动态 变化的数据, 需要及时进行更新, 并定期做重新处理和分析, 以保证历史交通特征提取的准确性。 每 个时空子区的数据可以并行处理以提高效率。 本发明中可简称为历史数据。  Historical trajectory data: Historical trajectory data is trajectory data accumulated over a long period of time and stored in a database. Historical trajectory data is dynamically changing data that needs to be updated in a timely manner and periodically reprocessed and analyzed to ensure the accuracy of historical traffic feature extraction. The data for each time-space sub-area can be processed in parallel to increase efficiency. In the present invention, it may be simply referred to as historical data.
实时轨迹数据: 实时轨迹数据是距离当前时刻最近的一个时间区段内的轨迹数据集合。 本发明中 可简称为实时数据。  Real-time trajectory data: The real-time trajectory data is a trajectory data set within a time zone that is closest to the current time. In the present invention, it may be simply referred to as real-time data.
交通态势: 一定时间、 一定空间内交通运行的综合情况的总称。  Traffic situation: A general term for the comprehensive situation of traffic operations within a certain period of time and within a certain space.
交通异常: 交通事故、 车辆抛铺、 货车落物、 道路交通设施损坏或故障等事件引发的交通流紊乱 的情况。  Traffic anomalies: traffic flow disturbances caused by traffic accidents, vehicle dumping, truck falling, road traffic facilities damage or malfunctions.
交通异常严重性: 即交通流紊乱的严重性, 是正常状态下交通流与交通异常发生后交通流特征的 差异。  Abnormal traffic severity: The severity of traffic flow disorder is the difference in traffic flow characteristics after traffic flow and traffic anomalies in normal conditions.
交通异常指数: 交通异常严重性的量度。 范围为 0~10, 数值越大, 交通异常越严重。  Traffic Anomaly Index: A measure of the severity of traffic. The range is 0~10. The larger the value, the more serious the traffic anomaly.
交通环境: 作用于道路交通参与者的所有外界影响与力量的总和。 包括道路状况、 交通设施、 地 物地貌、 气象条件, 以及其他交通参与者的交通活动。  Traffic environment: The sum of all external influences and forces acting on road traffic participants. This includes road conditions, transportation facilities, landforms, meteorological conditions, and traffic activities of other transportation participants.
地图匹配: 将地理坐标与城市路网关联的过程。  Map Matching: The process of associating geographic coordinates with a city road network.
髙峰小时流量: 某城市道路断面一日内小时交通流量的最大值。  Peak hourly traffic: The maximum hourly traffic flow in a city's road section.
有限混合模型: 一种用简单密度模拟复杂密度的数学方法。 变量集合为 y、 成分数量为 K的有限 混合模型可表示为: p(y) = J lpl(y) 响应变量: 根据自变量发生改变的变量, 也称因变量。 Finite Mixing Model: A mathematical method of simulating complex density with simple density. A finite mixed model with a set of variables y and a component number K can be expressed as: p(y) = J l p l (y) Response variable: A variable that changes according to an independent variable, also called a dependent variable.
贝叶斯信息准则: 是在不完全情报下, 对部分未知的状态用主观概率估计, 然后用贝叶斯公式对 发生概率进行修正的结果可靠性的评价指标。 其计算方法为:  Bayesian information criterion: It is an evaluation index of the reliability of the result of correcting the probability of occurrence by using the Bayesian formula for subjective probability estimation under partially incomplete information. Its calculation method is:
BIC = -2lnL + k - lnn  BIC = -2lnL + k - lnn
式中, 为似然函数的最大值, ^为未知参数的个数, 《为样本量。 似然函数:似然函数是一种关于统计模型参数的函数。给定输出 X时,关于参数 Θ的似然函数 ( |x) (在数值上) 等于给定参数 后变量 的概率: Z( |x)=P( =x| )。 Where, is the maximum value of the likelihood function, ^ is the number of unknown parameters, "for the sample size. Likelihood function: The likelihood function is a function of the parameters of the statistical model. Given the output X, the likelihood function ( |x) on the parameter ( (in numerical value) is equal to the probability of the variable after the given parameter: Z( |x)=P( =x| ).
参数估计: 根据从总体中抽取的样本估计总体分布中包含的未知参数的方法。  Parameter Estimation: A method of estimating unknown parameters contained in the overall distribution based on samples taken from the population.
EM算法: 最大期望算法 (Expectation Maximization Algorithm) , 是一种迭代算法, 用于含有隐变 量的概率参数模型的最大似然估计或极大后验概率估计。  EM algorithm: The Expectation Maximization Algorithm is an iterative algorithm for the maximum likelihood estimation or the maximum posterior probability estimation of a probability parameter model with implicit variables.
Kullback- Leibler散度: 两个概率分布 P和 Q差异的一种量度。  Kullback-Leibler divergence: A measure of the difference between two probability distributions, P and Q.
Jensen-Shannon散度: 是 Kullback- Leibler散度的一种对称化形式。  Jensen-Shannon divergence: is a symmetrized form of Kullback-Leibler divergence.
K-Medoids算法:一种聚类算法,每次迭代都从当前类别中选取这样一个点 它到其他所有(当 前类别中的) 点的距离之和最小 作为新的中心点。 本发明的目的是建立一套基于浮动车轨迹记录系统, 利用历史 GNSS定位数据和实时 GNSS定位 数据, 结合交通环境信息识别道路交通异常事件的方案。 为了达到上述目的, 本发明提供了如下技术 方案:  K-Medoids algorithm: A clustering algorithm that selects such a point from the current category for each iteration. It has the smallest sum of distances to all other points (in the current category) as the new center point. The object of the present invention is to establish a scheme based on a floating vehicle trajectory recording system, using historical GNSS positioning data and real-time GNSS positioning data, combined with traffic environment information to identify road traffic anomalies. In order to achieve the above object, the present invention provides the following technical solutions:
本发明的实施前提是: 搭载 GNSS轨迹记录仪的浮动车 (出租车、 公交车等); 具有大规模存储、 计算、 实时任务处理能力的数据中心。  The premise of the present invention is: a floating car (a taxi, a bus, etc.) equipped with a GNSS track recorder; a data center having a large-scale storage, calculation, and real-time task processing capability.
本发明的适用范围是: 有上述浮动车经过的城市道路 (包括地面道路和高架道路)。  The scope of application of the present invention is: Urban roads (including ground roads and elevated roads) through which the above-mentioned floating vehicles pass.
本发明的实施步骤包括:  The implementation steps of the present invention include:
1) 确定检测的时空范围和建立时空子区。  1) Determine the spatio-temporal range of the test and establish the spatio-temporal sub-area.
基于实际的应用需求, 确定需要进行交通异常事件检测的时间范围和空间范围。 时间范围可以设 定为全天, 即 0:00-24:00; 也可以设定为某一特定的时段, 例如要检测 17:00-20:00这个时段的交通异 常时间, 则将检测时间范围设定为 17:00-20:00, 这里只是列举一个特殊实例, 还有很多其他情况, 此 处不再一一说明。 空间范围可以按照行政区划设置为某个市域, 例如北京市、 上海市、 黄浦区等; 也 可以按照城市空间结构设置为某个城市功能区, 例如某市中央商务区、 工业区等。  Based on actual application requirements, determine the time range and spatial extent for which traffic anomaly events need to be detected. The time range can be set to all day, that is, 0:00-24:00; it can also be set to a specific time period. For example, to detect the traffic abnormal time during the period from 17:00 to 20:00, the time will be detected. The range is set from 17:00 to 20:00. Here is just a special example. There are many other situations, which are not explained here. The spatial scope can be set as a certain city area according to the administrative division, such as Beijing, Shanghai, Huangpu District, etc. It can also be set as a certain urban functional area according to the urban spatial structure, such as a central business district and industrial area of a certain city.
时空子区的建立是指, 将检测的时间范围划分为若干个更小的时间片段, 将检测的空间范围, 即 城市道路交通异常检测的实施区域, 划分为若干个更小的空间片段。 时空子区的建立, 可以采用多种 经验划分方法, 包括等距时空划分法和非等距时空划分法。  The establishment of the spatiotemporal sub-area refers to dividing the detected time range into a number of smaller time segments, and dividing the detected spatial range, that is, the implementation area of the urban road traffic anomaly detection, into a plurality of smaller spatial segments. For the establishment of spatiotemporal sub-areas, a variety of empirical division methods can be used, including equidistant space-time division method and non-equidistant space-time division method.
2) 数据预处理。  2) Data preprocessing.
将 GNSS定位数据进行数据清洗、数据集成、数据转换、数据归约,提高数据的结构化程度。 GNSS , 即全球导航卫星系统定位系统, 是能在地球表面或近地空间的任何地点为提供全天候三维坐标和速度 以及时间信息的空基无线电导航定位系统。 它主要包括美国的 GPS ( Global Positioning System) , 俄罗 斯的 GLONASS ( Global Navigation Satellite System)、 欧盟的 GALILEO和中国的北斗卫星导航系统四 大全球性导航定位系统, 同时还包括日本的 QZSS、 印度的 IRNSS 等区域导航定位系统以及美国的 WASS、 日本的 MSAS 等卫星定位增强系统。 为了在不同的导航定位系统设备中建立统一的数据分发 标准,美国国家海洋电子协会制定了统一的 NEMA (National Marine Electronics Association)通讯协议, 以规范 GNSS的数据广播。 因此, GNSS中的各个成员系统, 例如 GPS、 GLONASS等, 虽然分别由 不同国家和机构建立和维护, 但是拥有一致的数据分发格式, 因此不需要对数据格式进行变换。  The GNSS positioning data is used for data cleaning, data integration, data conversion, and data reduction to improve the structure of the data. GNSS, the Global Navigation Satellite System Positioning System, is a space-based radio navigation and positioning system that provides all-weather three-dimensional coordinates and speed and time information anywhere on the Earth's surface or near-Earth space. It mainly includes GPS (Global Positioning System) in the United States, GLONASS (Global Navigation Satellite System) in Russia, GALILEO in the European Union and China's Beidou satellite navigation system. It also includes QZSS in Japan and IRNSS in India. Such as regional navigation and positioning systems, as well as satellite positioning enhancement systems such as WASS in the United States and MSAS in Japan. In order to establish a unified data distribution standard among different navigation and positioning system devices, the National Ocean Electronics Association of the United States has developed a unified NEMA (National Marine Electronics Association) communication protocol to regulate GNSS data broadcasting. Therefore, each member system in GNSS, such as GPS, GLONASS, etc., although established and maintained by different countries and organizations, has a consistent data distribution format, so there is no need to transform the data format.
选定的空间范围内, 有许多安装 GNSS 定位设备的车辆, 常见的有出租车、 公交车、 货运汽车、 私家车等。 基于当前城市交通数据应用现状, 在实际应用当中, 通常选用城市出租车为浮动车作为交 通异常检测系统的数据来源。 采集的 GNSS定位信息中包含一些不合理的信息,为了保证交通异常状态检测判别结果的准确性, 首先需要进行甄别以提出异常的数据, 保证数据的可靠性。 这些异常数据包括: 落在检测时空范围之 外的数据、 明显超出合理范围的空间位置跳跃。 所谓 "明显超出合理范围的空间位置跳跃", 下面举例 说明之。 若某日 10:30:00时刻某辆浮动车定位设备上传的定位点记为 A, 当日 10:30:30时刻该浮动车 定位设备上传的定位点记为 B, 位置 A与位置 B的距离为 1500米, 那么据此计算得到该浮动车的行 驶速度至少为 180km/h, 超出了一般常识, 因此是一种异常的空间位置跳跃, 数据处理中应当予以剔 除。 Within the selected space, there are many vehicles equipped with GNSS positioning equipment, such as taxis, buses, freight cars, private cars, etc. Based on the current application status of urban traffic data, in practical applications, urban taxis are often used as floating vehicles as data sources for traffic anomaly detection systems. The collected GNSS positioning information contains some unreasonable information. In order to ensure the accuracy of the traffic abnormal state detection and discrimination results, it is first necessary to identify the abnormal data and ensure the reliability of the data. These anomalous data include: data that falls outside the time and space of detection, and spatial position jumps that are clearly out of reasonable range. The so-called "space position jump beyond the reasonable range" is illustrated below. If the positioning point uploaded by a floating vehicle positioning device is recorded as A at 10:30:00 on a certain day, the positioning point uploaded by the floating vehicle positioning device at time 10:30:30 is recorded as B, the distance between position A and position B. It is 1500 meters, then the speed of the floating car is calculated to be at least 180km/h, which is beyond the common sense, so it is an abnormal spatial position jump, which should be eliminated in data processing.
3) 快速地图匹配。  3) Quick map matching.
经过预处理后的 GNSS定位数据, 需要结合城市路网数据, 通过地图匹配算法, 将 GNSS定位点 投影到城市地图, 建立定位点与路段的匹配关系, 并修正定位漂移带来的误差。  After pre-processed GNSS positioning data, it is necessary to combine the urban road network data, map the GNSS positioning points to the city map through the map matching algorithm, establish the matching relationship between the positioning points and the road segments, and correct the error caused by the positioning drift.
目前各个地理区域的电子地图都已较为详实, 这种电子地图可以来源于城市的地理信息系统, 当 然也可以来源自其他方式和途经。 这些电子地图对城市道路信息进行了详细刻画, 通过划分可以得到 若干路段。 通过借助距离、 角度等信息, 将定位点匹配到路段上, 这样就实现了将定位信息匹配到实 际的地理环境中。  At present, the electronic maps of various geographical regions are relatively detailed. Such electronic maps can be derived from the city's geographic information system, and of course can also be derived from other ways and means. These electronic maps detail the urban road information, and several sections can be obtained by dividing. By matching the anchor points to the road segments by means of distance, angle, etc., the positioning information is matched to the actual geographical environment.
4) 浮动车路径的表示和不同车辆路径的匹配。  4) The representation of the floating vehicle path and the matching of different vehicle paths.
在给定一组起终点的前提下,车辆的路径可能不是唯一的。复杂的城市交通路网包含了若干路段, 将这些不同的路段进行编号, 例如, 将路段表示为 Ll, L2等。 道路可能有两个不同的行驶方向, 在这 种情况下, 应该将两个不同的行驶方向表示为两个不同的路段, 给予不同的路段编号。  The path of the vehicle may not be unique given a set of starting and ending points. The complex urban traffic network consists of several sections, which are numbered, for example, Ll, L2, etc. Roads may have two different directions of travel. In this case, two different directions of travel should be represented as two different sections, given different sections.
给定的起点和终点, 通常可采用城市路网中路段的交点。 已知某浮动车行驶的路径, 现需要从其 他浮动车已经发送的路径信息中, 选择与该浮动车路径相同的路径, 从而获得起点和终点间的同路径 组。  For a given starting point and ending point, the intersection of the road segments in the urban road network can usually be used. Knowing the path of a floating car, it is now necessary to select the same path as the floating car path from the path information that has been sent by other floating cars, so as to obtain the same path group between the starting point and the ending point.
5) 数据抽样。  5) Data sampling.
浮动车的定位数据中, 包含位置坐标、 瞬时车速、 记录时间等信息。 在本专利提出的基于浮动车 数据的城市道路交通异常检测方法中, 数据抽样是指从全部的浮动车数据中筛选出部分数据进行后续 的分析处理, 这种筛选是基于数据中心的计算能力以及预先提出的精度要求而进行的。 基于不同的计 算能力和精度要求, 可采用不同的数据抽样方法。 例如, 当数据中心的计算能力较强, 且对检测的精 度要求较高时, 可以将全部的浮动车定位数据作为处理对象, 进行全面的处理分析; 而当数据中心的 计算能力有限时, 假定当前的数据中心能够在 1分钟内, 对每个空间子区处理 500条数据, 而实际情 况是在 1分钟每个空间子区能产生了 2000条浮动车定位数据,那么可以从 2000条数据中随机抽取 500 条数据进行分析, 从而在数据中心的计算能力范围内, 获得精度受限的处理结果。  The positioning data of the floating car includes information such as position coordinates, instantaneous vehicle speed, and recording time. In the urban road traffic anomaly detection method based on floating car data proposed in this patent, data sampling refers to screening part of the data from all floating car data for subsequent analysis and processing, and the screening is based on the computing power of the data center and Pre-proposed accuracy requirements were made. Different data sampling methods can be used based on different calculation capabilities and accuracy requirements. For example, when the computing power of the data center is strong and the accuracy of the detection is high, all the floating vehicle positioning data can be treated as a processing object, and comprehensive processing analysis is performed; and when the computing power of the data center is limited, it is assumed The current data center can process 500 data for each spatial sub-area within 1 minute, but the actual situation is that in 2000, each floating sub-area can generate 2000 floating-vehicle positioning data, so it can be from 2000 data. Randomly extract 500 data for analysis, so as to obtain processing results with limited accuracy within the computing power of the data center.
根据对浮动车数据利用方式的不同, 可以针对浮动车数据的不同属性进行采样, 例如行程车速和 行程时间等。 本专利中提出的基于浮动车数据的城市道路交通异常检测方法, 采用行程车速作为基础 进行城市道路交通异常检测。 因此, 数据抽样是指对行程车速进行抽样。  Depending on how the floating car data is used, it is possible to sample different properties of the floating car data, such as the travel speed and travel time. The urban road traffic anomaly detection method based on floating car data proposed in this patent uses the travel speed as the basis for urban road traffic anomaly detection. Therefore, data sampling refers to sampling the speed of the journey.
6) 历史轨迹数据分析和特征提取。  6) Historical trajectory data analysis and feature extraction.
所谓历史轨迹数据, 是指在长期的城市道路交通运行中积累下来的浮动车轨迹数据。 利用历史浮 动车轨迹数据, 可以建立城市道路交通特征模型, 用来反映城市交通运行的一般特性。 这里所说的城 市道路交通特征模型, 可以指某些特定的指标, 例如平均速度、 加权平均速度等; 也可以指各种某种 统计模型, 例如行程速度的概率分布。 以往的很多模型, 采用单一的指标来表示某个路段或区域的交 通特征 (如历史平均车速), 这种方式虽然应用简便, 但是精度不高, 敏感性差, 往往不能在交通异常 状态检测中发挥良好的效果。 因此, 本专利提出对于每个时空子区, 用交通特征变量的概率分布描述 交通特征, 建立交通特征模型并进行参数估计。 The so-called historical trajectory data refers to the floating vehicle trajectory data accumulated in long-term urban road traffic operations. Using historical floating vehicle trajectory data, an urban road traffic feature model can be established to reflect the general characteristics of urban traffic operations. The urban road traffic feature model mentioned here can refer to certain specific indicators, such as average speed, weighted average speed, etc.; it can also refer to various statistical models, such as the probability distribution of travel speed. In many previous models, a single indicator was used to represent the traffic characteristics of a certain section or area (such as the historical average speed). Although this method is simple, the accuracy is not high, the sensitivity is poor, and often it is not in traffic abnormalities. Good results in state detection. Therefore, this patent proposes to describe the traffic characteristics by using the probability distribution of traffic characteristic variables for each spatiotemporal sub-area, establish a traffic feature model and perform parameter estimation.
可采集的交通特征变量,包括行程车速和行程时间等,本专利中所述的交通特征变量的概率分布, 指行程车速的概率分布。  The traffic characteristic variables that can be collected, including the travel speed and travel time, etc., the probability distribution of the traffic characteristic variables described in this patent refers to the probability distribution of the travel speed.
7) 实时轨迹数据分析和特征提取。  7) Real-time trajectory data analysis and feature extraction.
所谓实时轨迹数据, 是指距离当前时刻不远的一段时间内的交通运行中浮动车的轨迹数据。 利用 实时浮动车轨迹数据, 可以掌握交通特征的变化动态, 用来反映当前交通运行的即时特性。 本专利采 用当前时空子区的行程速度描述当前交通特征。  The so-called real-time trajectory data refers to the trajectory data of the floating car in traffic operation in a period of time not far from the current time. Using real-time floating car trajectory data, you can grasp the dynamics of traffic characteristics and reflect the current characteristics of current traffic operations. This patent uses the travel speed of the current space-time sub-zone to describe the current traffic characteristics.
8) 异常检测。  8) Anomaly detection.
系统状态异常检测的思想最早由 Dennrng提出,即通过监视系统审计记录上系统使用的异常情况, 可以检测出违反安全、 可能引发系统异常的事件。 Dennrng建立的这种模型独立于任何特定的系统、 应 用环境、 系统弱点、 故障类型, 因而是一种普遍意义上的异常检测模型。 该模型包括主体、 客体、 审 计记录、 轮廓、 异常记录和活动规则 5个部分。 轮廓是用度量和统计模型来表示的主体相对于客体的 正常行为。 Dennrng 的模型定义了 3 种度量, 即事件计数器、 间隔定时器、 资源测量器, 并提出了 5 种统计模型, 即可操作模型、 均值和标准差模型、 多变量模型、 马尔可夫过程模型和时间序列模型。 Denning提出的模型通过对系统审计数据的分析, 建立起系统主体的基于统计的正常行为特征轮廓, 检 测时, 系统中的审计数据与已建立的主体的正常行为特征轮廓相比较, 若相异部分超过某个阈值, 就 认为是一个异常事件。 该模型奠定了异常检测的基础, 以后发展的许多异常检测方法和系统都是以它 为基础而发展起来的。  The idea of system state anomaly detection was first proposed by Dennrng, that is, by monitoring the abnormality of the system used in the system audit record, it is possible to detect an event that violates security and may cause a system abnormality. Dennrng's model is independent of any particular system, application environment, system vulnerability, and fault type, and is therefore a general anomaly detection model. The model consists of five parts: subject, object, audit record, outline, exception record and activity rule. A contour is a normal behavior of a subject relative to an object, represented by metrics and statistical models. Dennrng's model defines three metrics, namely event counter, interval timer, resource measurer, and proposes five statistical models, namely, operational model, mean and standard deviation models, multivariate models, Markov process models, and Time series model. The model proposed by Denning establishes the statistically-based normal behavioral feature profile of the system subject through the analysis of the system audit data. When the detection, the audit data in the system is compared with the normal behavioral feature profile of the established subject. Exceeding a certain threshold is considered an abnormal event. This model lays the foundation for anomaly detection, and many anomaly detection methods and systems developed in the future are developed on the basis of it.
近几年在异常检测技术的发展过程中, 引入了更多人工智能的方法, 以提高异常检测的性能。 这 些人工智能的方法主要包括数据挖掘、 人工神经网络、 模糊证据理论等。 数据挖掘的方法用来确定在 大量的数据集合中什么特征是最重要的。 该技术用于异常检测中主要是寻求一种正常模式更简洁的定 义, 而不是像传统的异常检测方法那样简单列举出所有的正常模式。 数据挖掘方法的引入使得检测系 统能仅通过识别正常模式中的主要特征, 就能够概括性地包括训练数据中所未包括的正常模式。 人工 神经网络异常检测问题可被看作是一个一般的数据分类问题.在前面谈到的统计异常检测中, 用户行为 数据按照某种统计准则被分为两类: 即异常行为和正常行为。 由于基于统计的方法在提取、 抽象审计 实例时存在一定困难, 可能造成较大误差, 必须依赖于一些概率分布假设, 一般需要凭经验和感觉来 刻画用户行为的度量, 所以引入了人工神经网络的聚类方法。 人工神经网络具有自学习自适应能力, 用代表正常用户行为的样本点来训练神经网络, 通过反复多次学习, 神经网络能从数据中提取正常的 用户或系统活动的模式, 并编码到网络结构中, 检测时, 将审计数据通过学习好的神经网络, 即可判 定系统是否正常。 由于异常的评判标准具有一定的模糊性, 所以模糊证据理论被引入到异常中, 如建 立一种基于模糊专家系统的入侵检测框架模型, 能较好地降低漏警率和虚警率。  In recent years, in the development of anomaly detection technology, more artificial intelligence methods have been introduced to improve the performance of anomaly detection. These methods of artificial intelligence mainly include data mining, artificial neural networks, and fuzzy evidence theory. Data mining methods are used to determine what features are most important in a large data set. This technique is used in anomaly detection mainly to seek a more concise definition of the normal mode, rather than simply enumerating all normal modes as in the conventional anomaly detection method. The introduction of data mining methods enables the detection system to generally include normal patterns not included in the training data simply by identifying the main features in the normal mode. The artificial neural network anomaly detection problem can be regarded as a general data classification problem. In the statistical anomaly detection mentioned above, user behavior data is divided into two categories according to certain statistical criteria: abnormal behavior and normal behavior. Because the statistical-based method has certain difficulties in extracting and abstracting the audit instance, it may cause large errors, and must rely on some probability distribution hypotheses. Generally, it is necessary to describe the measure of user behavior by experience and feeling, so the artificial neural network is introduced. Clustering method. The artificial neural network has the self-learning self-adaptive ability to train the neural network with sample points representing the normal user behavior. Through repeated learning, the neural network can extract the normal user or system activity patterns from the data and encode them into the network structure. In the detection, the audit data can be judged whether the system is normal by learning a good neural network. Because the anomaly evaluation criterion has certain ambiguity, the fuzzy evidence theory is introduced into the anomaly. For example, an intrusion detection framework model based on fuzzy expert system is established, which can better reduce the false alarm rate and false alarm rate.
本专利提出一种基于统计特征的异常检测方案, 基本思想是通过 Jensen-Shannon散度衡量历史交 通特征与实时交通特征的差异, 以实现异常交通状况的检测。 该方案具有可解释性好, 计算负担不大 的优点, 既克服了采用单一统计量检测不准确、 不及时的弱点, 又避免了人工神经网络等方法计算负 担重, 硬件要求高的缺陷。  This patent proposes an anomaly detection scheme based on statistical features. The basic idea is to measure the difference between historical traffic characteristics and real-time traffic characteristics by Jensen-Shannon divergence to achieve the detection of abnormal traffic conditions. The scheme has the advantages of good interpretability and little computational burden. It not only overcomes the inaccurate and untimely weakness of single statistic detection, but also avoids the defects of artificial neural network and other methods to calculate the negative load and high hardware requirements.
9) 异常严重性量化表征及异常信息发布。  9) Quantitative characterization of abnormal severity and release of abnormal information.
交通异常状况的严重性应该通过简洁明了的方式向公众发布, 以避开可能的拥堵区域, 提高城市 交通的运行效率。 异常状况的严重程度用交通异常指数表征, 范围为 0-10, 其中 0表示无异常, 10表 示高度异常。 The severity of traffic anomalies should be released to the public in a clear and concise manner to avoid possible congestion areas and improve the efficiency of urban traffic. The severity of the abnormal condition is characterized by the traffic anomaly index, ranging from 0-10, where 0 means no abnormality, 10 The height is abnormal.
异常的发生位置投影到电子地图上, 并通过智能移动设备 APP等形式公开发布。  The location of the anomaly is projected onto the electronic map and published publicly through the smart mobile device APP or the like.
10) 系统性能评价。  10) System performance evaluation.
系统性能的评价是指评价交通异常状态检测的准确性, 其评价指标包括误报率和漏报率。 误报率 和漏报率越低表明系统的性能越好。 所述步骤 1)中, 时空子区的划分具体可以采用以下方法:  The evaluation of system performance refers to the evaluation of the accuracy of traffic abnormal state detection, and its evaluation indicators include false positive rate and false negative rate. The lower the false positive rate and the false negative rate, the better the performance of the system. In the step 1), the division of the spatiotemporal sub-area may specifically adopt the following method:
11) 等距时空划分法。确定时间维度的片段尺度, 时间片段跨度为固定值, 通常取 30mm作为一个 时间片段; 确定空间维度的片段尺度, 空间片段跨度为固定值, 通常取 200mX 200m的空间网格作为 一个空间片段;  11) Isometric space-time division method. Determining the segment scale of the time dimension, the time segment span is a fixed value, usually taking 30mm as a time segment; determining the segment scale of the spatial dimension, the spatial segment span is a fixed value, usually taking a spatial grid of 200m×200m as a spatial segment;
12) 基于路网密度的非等距时空划分法: 基于路网密度作为判断指标, 当路网密度大于等于 2km/km2时, 取 30min的时间片段和 200m X 200m的空间片段; 当路网密度小于 2km/km2时, 取 30min 的时间片段和 400m X 400m的空间片段; 12) Non-equidistant space-time division method based on road network density: Based on road network density as a judgment index, when the road network density is greater than or equal to 2km/km 2 , take 30min time segment and 200m X 200m space segment; When the density is less than 2km/km 2 , take a 30 min time segment and a 400 m X 400 m spatial segment;
13) 基于高峰小时流量的非等距时空划分法:基于高峰小时流量作为判断指标, 当高峰小时流量大 于等于 1000辆 /小时时, 取 30min的时间片段和 200m X 200m的空间片段; 当高峰小时流量小于 1000 辆 /小时时, 取 30min的时间片段和 400m X 400m的空间片段。 所述步骤 3)具体包含以下步骤:  13) Non-equidistant space-time division method based on peak hour flow: based on peak hour flow as a judgment indicator, when the peak hour flow rate is greater than or equal to 1000 vehicles/hour, take 30 min time segment and 200 m X 200 m space segment; When the flow rate is less than 1000 vehicles/hour, take a time segment of 30 minutes and a space segment of 400m X 400m. The step 3) specifically includes the following steps:
31) 将所需处理的空间区域划分为一定大小的格网, 每个格网区域的范围可表示为 31) Divide the space area to be processed into a grid of a certain size, and the range of each grid area can be expressed as
4 = {(xs,ys) \ xs ΓΛ 5 每个格网区域包含若干个路段, 把这些路段的集合表示为 RS, 所述路段的集合 中的每条路段表示为 ij, 并为每个路段赋予编号; 4 = {(x s , y s ) \ x s Γ Λ 5 Each grid area contains several road segments, and the set of these road segments is represented as R S , and each road segment in the set of the road segments is represented as ij. And assign a number to each road segment;
32) 判定定位点所在的格网区域, 并利用距离和方位角, 在路段的集合 中搜索某定位点 A所在 的路段^ 匹配方案包括:  32) Determine the grid area where the anchor point is located, and use the distance and azimuth angle to search for the section where the anchor point A is located in the set of road segments. ^ The matching scheme includes:
321) 单点匹配方案:  321) Single point matching scheme:
搜索距离点 A最近的路段, 当满足点 A的行驶方向角与路段 ij的方向角的差值小于阈值时, 即满 足 | < , 完成匹配, 所述的阈值 可取 2.5 ° , 5。 , 10° 等; 若不满足 | < , 在搜索空 间中删除路段 并继续搜索其他路段, 直至满足条件。 匹配方法如图 3所示。  Searching for the nearest road segment from the point A, when the difference between the traveling direction angle of the point A and the direction angle of the road segment ij is less than the threshold value, that is, the matching is completed, and the threshold value may be 2.5 °, 5. , 10°, etc.; if not satisfied, < , delete the segment in the search space and continue to search for other segments until the condition is met. The matching method is shown in Figure 3.
322) 点序列匹配方案:  322) Point sequence matching scheme:
本方案适用于高频浮动车数据。 将浮动车 GNSS数据采集频率表示为 f0= \l , 将时间上与 A相邻 的点 POHO) , Pfc+i。;)定义为 Α的 1-邻近点, P04-2iQ;), P 4+2iQ)定义为 A的 2-邻近点, 以此类推, 则 P(tA-kk) , Pfc+ 定义为 A的 /t-邻近点。 在/ Q<lHz时, 取 /t=l或 2。 取距离 A及 A的 /t-邻近点距离最 小的路段^并计算 A及 A的^邻近点行驶方向角的均值^ 4,,若满足 | . - | < ,完成匹配;否则, 搜索其他路段, 直至满足 | 一 | < 。 This program is suitable for high frequency floating car data. The floating vehicle GNSS data acquisition frequency is expressed as f 0 = \l , and the point POHO), Pfc+i, which is adjacent to A in time. ;) is defined as the 1-adjacent point of Α, P04-2i Q ;), P 4+2i Q ) is defined as the 2-adjacent point of A, and so on, then P(t A -kk) , Pfc+ is defined as A /t-adjacent point. When / Q <lHz, take /t=l or 2. Take the distance between the distances A and A/t-adjacent points and calculate the mean value of the driving direction angles of the neighboring points of A and A^, and if they meet | . - | < , complete the matching; otherwise, search for other road segments. Until the satisfaction of | a | <.
33) 利用路段的直线方程 (若为曲线路段则近似拆分为直线), 计算 GNSS定位点在路段上的投影 坐标, 减小因 GNSS定位漂移带来的误差。 具体方法采用 GNSS定位点直线投影法为:  33) Use the straight line equation of the road segment (if it is a curved road segment, it is roughly split into straight lines), calculate the projection coordinates of the GNSS positioning point on the road segment, and reduce the error caused by the GNSS positioning drift. The specific method uses the GNSS positioning point linear projection method as:
确定路段 ^的直线方程 (若路段为曲线, 则划分为若干直线路段): y, - y, Determine the straight line equation of the section ^ (if the section is a curve, divide it into several straight sections): y, - y,
其中斜率为: 投影直线方程为: y-yA =- x- kyA -kyt +k2xt +xA Where the slope is: The projection line equation is: yy A =- x - ky A -ky t +k 2 x t +x A
解出投影坐标 p为:  Solve the projected coordinates p is:
k2+l k 2 +l
k2yA + yt +kxA - xj k 2 y A + y t +kx A - x j
yP y P
在地图匹配过程后, 结合定位点坐标的时间戳数据, 将定位点匹配到时空子区 ( 所述步骤 5)具体可以采用以下方法之一: After the map matching process, combining the timestamp data of the coordinates of the positioning point, matching the positioning point to the space-time sub-region ( the step 5) may specifically adopt one of the following methods:
51) 速度信息的全样本方案。 由一个时空子区内各辆次浮动车的全部行程车速数据, 构成总体。 实 施方法是计算时空子区 内每辆车的行程车速: v^W ..""-1'", 其中 42...4— ln为时空子区 ^ 内的第 1个和第 2个 GNSS定位点间的距离, ......, 第《- 1个与第《个 GNSS定位点间的距离, —tn 为时空子区 内第 1个, ......, 第《个GNSS定位点的时间戳; 将每个时空子区内的数据不做筛选, 构成一个集合 ^, 用于后续处理。 51) Full sample plan for speed information. The total travel speed data of each sub-floating vehicle in a time and space sub-region constitutes the whole. The implementation method is to calculate the travel speed of each vehicle in the space-time sub-region: v^W ..""- 1 '", where 4 2 ... 4 - ln is the first and second in the space-time sub-area ^ The distance between the GNSS anchor points, ..., the distance between the -1 and the GNSS anchor points, -t n is the first in the space-time sub-region, ..., the first "Timestamp of a GNSS anchor point; the data in each spatio-temporal sub-area is not filtered to form a set ^ for subsequent processing.
52) 速度信息的时间平滑抽样方案。指定时间片段长度, 设定同一时间片段数据条数上限; 搜索一 个时空子区内时间各时间片段内的速度数据, 若时间片段内速度数据条数超过上限, 随机取上限条数 的数据用于后续处理。 实施方法是计算时空子区 内每辆车的行程车速: vf W ..""-1'", 其 中 2... 为时空子区 f 内的第 1个和第 2个 GNSS定位点间的距离, ......,第《-1个与第 n个 GNSS 定位点间的距离, ^…^为时空子区^内第丄个, ......, 第《个 GNSS定位点的时间戳; 指定时间片段 长度 同一时间片段数据条数上限;^∞; 搜索一个时空子区内时间第 各时间片段内的速度数据, 若 时间片段内速度数据条数超过上限; ^«, 随机取;^ «条数据加入 ^并用于后续处理。 所述步骤 6)具体可以采用以下方法之一: 52) Time-smooth sampling plan for speed information. Specify the length of the time segment, set the upper limit of the number of segments of the same time; search for the velocity data in each time segment of a time-space sub-region, if the number of velocity data in the time segment exceeds the upper limit, the data of the upper limit is taken Follow-up processing. The implementation method is to calculate the travel speed of each vehicle in the space-time sub-region: v f W ..""- 1 '", where 2 ... is the first and second GNSS positioning points in the space-time sub-area f Distance, ..., the distance between -1 and the nth GNSS anchor point, ^...^ is the space-time sub-area ^, ......, the first GNSS Time stamp of the anchor point; specify the upper limit of the number of segments of the time segment at the same time; ^ ∞; search for the velocity data in the time segment of the time and space sub-region, if the number of velocity data in the time segment exceeds the upper limit; ^« , randomly fetching; ^ «bar data is added to ^ and used for subsequent processing. The step 6) may specifically adopt one of the following methods:
61) 简单历史轨迹数据融合法。将无交通异常状况下的历史数据, 作为一个整体, 进行交通特征模 型建立和参数估计。 该方法利用有限混合模型, 建立交通特征模型, 并进行参数估计。 具体可采用以 下三种方案之一:  61) Simple historical trajectory data fusion method. The historical data under the condition of no traffic anomalies, as a whole, the traffic characteristic model establishment and parameter estimation. The method uses a finite mixing model to establish a traffic feature model and perform parameter estimation. Specifically, one of the following three options can be used:
611) 固定成分的混合高斯模型  611) Mixed Gaussian model of fixed composition
本方案采用固定成分数量 的混合高斯模型描述车速的概率分布。 成分数量根据车速在典型情况 下的分布模式人工指定。 为了保证概率分布的可靠性, 成分数量 不能过小。 一般可取 =4~6。  This scheme uses a mixed Gaussian model with a fixed component quantity to describe the probability distribution of vehicle speed. The number of components is manually specified according to the distribution pattern of the vehicle speed in a typical case. In order to ensure the reliability of the probability distribution, the number of components cannot be too small. Generally available = 4~6.
612) 成分数量可变的混合高斯模型  612) Mixed Gaussian model with variable composition
本方案采用一种基于模型评价的方法来选择合适的成分数量, 方法如下: 确定可能的最大成分数量 K, 并分别对《=1,2,…^个成分的混合高斯模型进行参数估计; 对于 K 个模型, 通过贝叶斯信息准则 (β/C) 确定最佳模型。 最大成分的数量一般按精度需求选取, 但必须注 意成分数量越多, 期望最大化算法收敛越慢。 这里选择的最大成分数量为 =5, 即需要计算: f»tA n {\, 2, ..., 5) 共 5种混合模型。 同时, 计算 5种模型的 其定义为: This program uses a model-based evaluation method to select the appropriate number of components, as follows: Determine the maximum number of possible components K, and estimate the parameters of the mixed Gaussian model of "1, 2, ... ^ components separately. For K models, determine the best model by Bayesian information criterion (β/C). The maximum number of components is generally selected according to the accuracy requirement, but it must be noted that the more the number of components, the slower the expectation maximization algorithm converges. The maximum number of components selected here is =5, which requires calculation: f»tA n {\, 2, ..., 5) A total of 5 mixed models. At the same time, the definition of the five models is defined as:
BIC = -2\nL + k - \nn  BIC = -2\nL + k - \nn
式中, 为最大似然函数值, 为模型中参数的个数, 《为数据总量。  Where, is the maximum likelihood function value, which is the number of parameters in the model, "for the total amount of data.
之后, 选取 β/C最小的的混合模型, 记录其参数向量 ^ μ、 σ, 其中, η是所述历史交通特征模 型中各个子成分所占的比例向量, μ是所述历史交通特征模型中各个子成分的均值向量, σ是所述历史 交通特征模型中各个子成分的标准差向量, 作为本时空子区的特征记录。 混合模型的密度曲线形态在 图 6中示出。  After that, the mixed model with the smallest β/C is selected, and the parameter vectors ^ μ and σ are recorded, where η is the proportional vector occupied by each sub-component in the historical traffic feature model, and μ is the historical traffic feature model. The mean vector of each sub-component, σ is the standard deviation vector of each sub-component in the historical traffic feature model, as a feature record of the present-time sub-region. The density curve morphology of the hybrid model is shown in Figure 6.
613) 成分数量、 分布类型均可变的有限混合模型  613) Finite mixed model with variable composition and distribution type
本方案采用与 612)相同的基于模型评价的方法, 但子成分的分布形态和成分的数量均可变, 方法 如下:  This scheme uses the same model-based evaluation method as 612), but the distribution of the sub-components and the number of components are variable. The method is as follows:
选取 Μ种概率分布模型作为子成分的分布类型, 包括但不限于: 正态分布、 伽马分布、 威布尔分 布。 使用正态分布时, 子分布函数采用: : exp  The probability distribution model is chosen as the distribution type of the sub-components, including but not limited to: normal distribution, gamma distribution, Weibull distribution. When using a normal distribution, the sub-distribution function takes: : exp
1πσ1πσ νν
使用伽马分布时, 子分布函数采用: 剥: v "" -le β", 其中 Γ(^ ) : 使用威布尔分布时, 子分布函数采用: 脚:When using a gamma distribution, the sub-distribution function uses: strip: v "" - l e β ", where Γ(^): When using the Weibull distribution, the sub-distribution function uses:
Figure imgf000010_0001
Figure imgf000010_0001
假定混合模型所有子成分的分布类型相同, 确定可能的最大成分数量 ^。 对于 M种子成分分布类 型、 种成分数量的选择, 共形成 M 种组合, 分别计算 δ/C值, 并取 δ/C最小的模型为最佳模型。  Assuming that the distribution types of all sub-components of the hybrid model are the same, determine the maximum number of possible components ^. For the selection of M seed component distribution types and the number of species components, M combinations are formed, and the δ/C values are calculated separately, and the model with the smallest δ/C is taken as the best model.
62) 分情境的历史轨迹数据分类法。 依据气温、 降水量、 能见度和交通管制措施, 将无交通异常状 况下的历史数据划分成不同的类别, 分别建立模型和进行参数估计。 实施方法如下:  62) Historical trajectory data classification method by context. According to the temperature, precipitation, visibility and traffic control measures, the historical data without traffic anomalies are divided into different categories, and models and parameter estimates are established. The implementation method is as follows:
根据气温、 降水量、 能见度和交通管制措施的不同, 将交通环境分为 5~8个类别, 由历史数据对 应的交通环境的不同, 将历史数据归入以上各个类别中。 对每个类别, 分别进行如同 5)所述的处理, 从而建立了映射关系 R( ^, 为交通环境, Γ为交通态势。  According to the temperature, precipitation, visibility and traffic control measures, the traffic environment is divided into 5~8 categories. The historical data corresponds to the different traffic environments, and the historical data is classified into the above categories. For each category, the processing as described in 5) is performed separately, thereby establishing a mapping relationship R (^, for the traffic environment, and for the traffic situation.
63) 历史数据聚类法。对于历史数据, 通过时空子区两两之间的比较, 获得不同时空区域的差异量 化表征, 并利用量化后的差异进行聚类。 将气温、 降水量、 能见度和交通管制措施作为特征因子, 进 行多项 Loglt回归, 建立交通环境与类别的映射关系。 实施流程参见附图 4。 实施步骤如下: 631) 根据 5)所述的方法, 建立交通特征模型, 并进行参数估计。 63) Historical data clustering method. For the historical data, the difference between the time and space sub-regions is obtained, and the difference quantitative representation of different space-time regions is obtained, and the quantized differences are used for clustering. Using temperature, precipitation, visibility and traffic control measures as characteristic factors, a number of Lo gl t regressions were performed to establish a mapping relationship between traffic environment and categories. See Figure 4 for the implementation process. The implementation steps are as follows: 631) According to the method described in 5), a traffic feature model is established, and parameter estimation is performed.
632) 根据之前的有限混合模型参数估计结果, 写出时空子区在不同日期对应的行程车速分布的概 率密度函数 p x) , 其参数以混合高斯模型为例:  632) Based on the previous finite-mix model parameter estimation results, write the probability density function p x) of the travel speed distribution corresponding to the spatio-temporal sub-region on different dates. The parameters are based on the mixed Gaussian model:
κ  κ
A (vf ) =∑ ' ( ; σ 式中, 表示行程车速分布的子成分数量, //表示行程车速分布中某个子成分所占的比例, 表示 行程车速分布中某个子成分的均值, σ表示行程车速分布中某个子成分的标准差。 A ( v f ) =∑ '(; In σ , the number of subcomponents representing the speed distribution of the stroke, // represents the proportion of a subcomponent in the travel speed distribution, and represents the mean of a subcomponent in the travel speed distribution, σ Indicates the standard deviation of a subcomponent in the travel speed distribution.
633) 计算各分布两两之间的 Jensen-Shanno d,J : 633) Calculate Jensen-Shanno d, J between the two distributions :
d'j = JSD(P \\ Q) = \\ M) D'j = JSD(P \\ Q) = \\ M)
Figure imgf000011_0001
Figure imgf000011_0001
式中, P、 ρ为两个不同的概率分布, Μ = θΡ + ρ;), /?为 !^^-!^!^!"散度:  Where P and ρ are two different probability distributions, Μ = θΡ + ρ;), /? For ! ^^-! ^!^!"Diversity:
D (P \\ Q) = ^ P(xk )log 在采用有限混合模型的情况下, 其值无法显式表示, 但可采用蒙特卡罗抽样方法近似计算, 其计 算方法是: D (P \\ Q) = ^ P(x k )log In the case of finite mixed model, the value cannot be explicitly expressed, but Monte Carlo sampling method can be used to approximate the calculation. The calculation method is:
^ D(f \\ g) 式中, Z)MC表示采用蒙特卡罗抽样方法近似计算得到的 Kullback-Leibler散度, /和 g表示任意两 个分布函数。 ^ D(f \\ g) where Z) MC represents the Kullback-Leibler divergence approximated by Monte Carlo sampling, and / and g represent any two distribution functions.
634) 将分布两两间的散度表示成距离矩阵:  634) Express the divergence between the two distributions as a distance matrix:
D - d„、 . .. d 该矩阵满足 4=4,, d,r0(i=j)。 D - d„, . . . d This matrix satisfies 4=4,, d, r 0(i=j).
635) 将距离矩阵作为 K-Medoids算法的输入, 得到聚类结果, 并对类别建立索引。  635) Using the distance matrix as input to the K-Medoids algorithm, the clustering results are obtained, and the categories are indexed.
636) 以类别索引为响应变量, 将交通环境数据 (包括气温、 降水量、 能见度等) 作为自变量, 进 行多项 Logit回归, 获取交通环境 E与交通态势类别 T的映射关系 R(£^。  636) Using the category index as the response variable, the traffic environment data (including temperature, precipitation, visibility, etc.) is used as an independent variable, and multiple Logit regression is performed to obtain the mapping relationship R between the traffic environment E and the traffic situation category T (£^.
637) 将相同类别的数据进行聚合,并利用聚合后新的数据集重新建立混合模型,并进行参数估计, 得到最终的历史交通特征数据集。 所述步骤 7)具体可以采用以下方法: 637) Aggregate the same type of data, and re-establish the hybrid model with the new data set after aggregation, and perform parameter estimation to obtain the final historical traffic characteristic data set. The step 7) may specifically adopt the following methods:
71) 简单实时数据处理法。 该方法与 61)同时实施。 将实时交通数据进行模型建立和参数估计, 获 取当前交通状况的特征函数。 该方法的实施步骤, 与 61)完全相同, 只是采用的数据是实时交通数据。  71) Simple real-time data processing. This method is implemented simultaneously with 61). The real-time traffic data is modeled and parameter estimated to obtain the characteristic function of the current traffic condition. The implementation steps of the method are exactly the same as 61), except that the data used is real-time traffic data.
72) 分类处理法。该方法与 62)或 63)同时实施。获取交通状况的特征函数, 同时获取当前的气温、 降水量、 能见度、 交通管制措施等信息, 并判断当前交通状况的类别。 实施流程参见附图 5。 实施步骤 如下: 72) Classification processing. This method is carried out simultaneously with 62) or 63). Obtain the characteristic function of the traffic condition, and obtain the current information such as temperature, precipitation, visibility, traffic control measures, etc., and judge the current traffic condition category. See Figure 5 for the implementation process. Implementation steps as follows:
721) 计算时空子区内的行程车速, 构成实时行程车速总体  721) Calculate the travel speed in the time-space sub-zone, which constitutes the overall real-time travel speed
722) 建立行程车速概率分布模型 (ν^) = | - ί^ ,- μ^σ^, 并进行参数估计; 722) Establish a travel speed probability distribution model (ν^) = | - ί^ , - μ^σ^, and perform parameter estimation;
723) 将当前交通环境数据 (包括气温、 降水量、 能见度等) 作为输入参数, 利用映射关系 R(£^) 获得当前交通态势的所述类别 T。 所述步骤 8)具体包含以下步骤: 723) Using the current traffic environment data (including temperature, precipitation, visibility, etc.) as an input parameter, use the mapping relationship R(£^) to obtain the category T of the current traffic situation. The step 8) specifically includes the following steps:
81) 当采用步骤 72)时, 根据当前交通态势所属类别 Τ, 定位该类别下历史交通特征数据, 否则不 进行处理;  81) When step 72) is adopted, according to the current traffic situation category Τ, the historical traffic characteristic data under the category is located, otherwise it will not be processed;
82) 根据当前交通特征的描述参数 τ 、 μ,^ 和历史交通特征的描述参数 η、 μ、 σ计算两个速度 分布间的差异:
Figure imgf000012_0001
ΙΙ Ρ) » 其中, ^是实时交通特征模型中各个子成分所 占的比例向量, ^是实时交通特征模型中各个子成分的均值向量, (Jrt是实时交通特征模型中各个子成 分的标准差向量; η是历史交通特征模型中各个子成分所占的比例向量, μ是历史交通特征模型中各个 子成分的均值向量, σ是历史交通特征模型中各个子成分的标准差向量。当历史交通特征与实时交通特 征 (即历史行程车速分布与实时行程车速分布) 相近时, 将得到较小的 Jensen-Shannon散度值, 即两 者之间的差异较小; 当历史交通特征与实时交通特征差别较大时, 将得到较大的 Jensen-Shannon散度 值, 即两者之间的差异较大, 亦即存在异常的概率较大, 参见附图 7。 所述步骤 9)具体包含以下步骤:
82) Calculate the difference between the two velocity distributions according to the description parameters τ, μ, ^ of the current traffic characteristics and the description parameters η, μ, σ of the historical traffic characteristics:
Figure imgf000012_0001
ΙΙ Ρ) » where ^ is the proportional vector of each subcomponent in the real-time traffic feature model, ^ is the mean vector of each subcomponent in the real-time traffic feature model, (J rt is the standard for each subcomponent in the real-time traffic feature model Difference vector; η is the proportional vector of each sub-component in the historical traffic feature model, μ is the mean vector of each sub-component in the historical traffic feature model, and σ is the standard deviation vector of each sub-component in the historical traffic feature model. When the traffic characteristics and real-time traffic characteristics (that is, the historical travel speed distribution and the real-time travel speed distribution) are similar, a smaller Jensen-Shannon divergence value will be obtained, that is, the difference between the two is small; when historical traffic characteristics and real-time traffic When the difference in characteristics is large, a larger Jensen-Shannon divergence value is obtained, that is, the difference between the two is large, that is, the probability of existence of an abnormality is large, see FIG. 7. The step 9) specifically includes the following step:
91) 将各个时空子区的速度分布差异标准化为 0~1的规范化数值《ί: 91) Normalize the difference in velocity distribution of each space-time sub-region to a normalized value of 0~1 .
diff^ - min(diff)  Diff^ - min(diff)
ξ' max、diff、 - min、diff、  ξ' max, diff, - min, diff,
92) 计算各个时空子区的交通异常指数
Figure imgf000012_0002
10;
92) Calculate the traffic anomaly index for each time and space sub-region
Figure imgf000012_0002
10;
93) 将异常指数高于 5的区域位置投影到电子地图上, 并智能移动设备 APP等形式向社会公开发 布, 以使司机避开潜在拥堵点, 提高城市道路交通的通行效率。 所述步骤 10)具体包含以下步骤:  93) Project the location of the region with an anomaly index higher than 5 onto the electronic map, and publish it to the public in the form of an intelligent mobile device APP, so that the driver can avoid potential congestion points and improve the traffic efficiency of urban road traffic. The step 10) specifically includes the following steps:
101) 计算交通异常状态的漏报率:  101) Calculate the false negative rate of traffic anomaly:
= -^ χ 100% = -^ χ 100%
na n a
102) 计算交通异常状态的误报率: α2 = ^ χ 100% 以上两式中, 为单位时间内漏报事件总数, 为单位时间内误报事件总数, 《。为单位时间内实际 发生的异常时间总数。 本发明相较于同一领域的相似技术, 具有以下优点: 102) Calculate the false alarm rate of traffic abnormal state: α 2 = ^ χ 100% In the above two formulas, the total number of missed events per unit time, which is the total number of false positive events per unit time. Actual time per unit time The total number of abnormal times that occurred. The present invention has the following advantages over similar technologies in the same field:
(1) 充分利用现有的浮动车运营数据 (GNSS 轨迹数据), 通过历史交通特征提取和实时交通态势 分析, 检测交通状态发生的变化, 可以实现城市道路交通异常事件实时性、 低成本、 智能化检测; (1) Make full use of existing floating vehicle operation data (GNSS trajectory data), detect historical traffic state changes through historical traffic feature extraction and real-time traffic situation analysis, and realize real-time, low-cost, intelligent urban road traffic anomaly events Detection
(2) 将交通特征参数的概率分布作为交通特征的描述, 反映的特征更加全面, 避免了利用单一指数 表征交通特征的片面性、 不稳定性, 检测的可靠性更高; (2) Taking the probability distribution of traffic characteristic parameters as the description of traffic characteristics, the characteristics reflected are more comprehensive, avoiding the one-sidedness and instability of traffic characteristics using a single index, and the reliability of detection is higher;
(3) 针对交通特征受到交通环境 (如天气状况) 影响的特点, 引入了聚类一多项 Loglt回归联合算 法, 建立了交通环境特征与交通态势类别的映射关系; (3) In view of the characteristics of traffic characteristics affected by traffic environment (such as weather conditions), a clustering multiple Lo gl t regression algorithm was introduced to establish the mapping relationship between traffic environment characteristics and traffic situation categories.
(4) 经实际数据的检验, 本发明提出的基于浮动车数据的城市道路交通异常检测技术, 能够实现准 确度较高的异常事件检测, 检测率超过 90%, 漏报率低于 15%, 误报率低于 20%, 取得了良好的检测 效果, 可以应用于城市交通智能化管理、 服务。 附图说明  (4) According to the test of actual data, the urban road traffic anomaly detection technology based on floating car data proposed by the present invention can realize the detection of abnormal events with high accuracy, the detection rate exceeds 90%, and the false negative rate is less than 15%. The false alarm rate is lower than 20%, and it has achieved good detection results, and can be applied to urban traffic intelligent management and service. DRAWINGS
本发明的具体内容及优点结合以下附图将变得明晰和易于理解, 其中:  The details and advantages of the present invention will become apparent and readily understood in conjunction with the following drawings in which:
图 1示出了本发明的组成要素和基本原理示意图;  Figure 1 shows a schematic diagram of the components and basic principles of the present invention;
图 2示出了本发明在实施过程中的总体流程示意图;  Figure 2 is a schematic view showing the overall flow of the present invention in the implementation process;
图 3示出了本发明快速地图匹配算法实施方式示意图;  3 is a schematic diagram showing an implementation manner of a fast map matching algorithm of the present invention;
图 4示出了本发明实施历史交通特征提取方案的流程示意图;  4 is a schematic flow chart showing a historical traffic feature extraction scheme implemented by the present invention;
图 5示出了本发明实施实时交通特征提取方案的流程示意图;  FIG. 5 is a schematic flow chart showing a real-time traffic feature extraction scheme implemented by the present invention; FIG.
图 6示出了高斯混合模型概率分布的形态示意图;  Figure 6 shows a schematic diagram of the morphology of the Gaussian mixture model probability distribution;
图 7示出了历史交通特征与实时交通特征比较过程中差异的衡量示意图。 具体实施方案  Figure 7 shows a measurement of the difference in the comparison between historical traffic characteristics and real-time traffic characteristics. Specific implementation
为了更加清晰明确地表述本发明的目的、 技术方案和优势, 下面对本发明的具体实施方案进行详 细描述。  In order to more clearly and clearly clarify the objects, technical solutions and advantages of the present invention, the specific embodiments of the present invention are described in detail below.
如附图 1所示, 本发明的整体系统构架包括: 浮动车搭载的车载 GNSS轨迹记录仪、 数据中心、 GNSS卫星以及通信系统。 此处的 GNSS包括 GPS、 GLONASS、 GALILEO、 北斗、 IRNSS、 QZSS等 任何类似的导航卫星定位系统。 出租车、 公交车等浮动车搭载的 GNSS轨迹记录仪, 以一定的采样频 率/ (一般要求戶 0.1Hz) 记录车辆在行驶中各时点的位置信息, 并通过 GPRS移动通信网络 (亦可采 用 WCDMA、 TD-LTE 等无线网络通信技术, 但成本将相应提高) 将位置信息实时发送至数据中心。 数据中心通过数据预处理、 数据融合, 并通过特定算法建立历史道路交通特征数据库; 对于最近接收 的实时数据, 建立实时交通特征数据库; 通过历史数据库和实时数据库的映射关系, 判别当前交通特 征是否异常, 并通过处理终端进行可视化展示并生成交通异常事件报告。  As shown in Fig. 1, the overall system architecture of the present invention includes: an onboard GNSS track recorder mounted on a floating vehicle, a data center, a GNSS satellite, and a communication system. The GNSS here includes GPS, GLONASS, GALILEO, Beidou, IRNSS, QZSS and any similar navigation satellite positioning system. GNSS track recorders equipped with floating cars, buses, etc., record the position information of the vehicle at various points in time at a certain sampling frequency / (general requirements of 0.1 Hz), and pass the GPRS mobile communication network (also can be used) Wireless network communication technologies such as WCDMA and TD-LTE, but the cost will be increased accordingly) The location information will be sent to the data center in real time. The data center establishes a historical road traffic characteristic database through data preprocessing, data fusion, and through a specific algorithm; establishes a real-time traffic feature database for the recently received real-time data; and determines whether the current traffic feature is abnormal through the mapping relationship between the historical database and the real-time database And visualize the display through the processing terminal and generate a traffic anomaly event report.
方案的总体流程参见图 2, 包括采集和存储 GNSS轨迹数据, 建立时空子区, 历史交通特征提取, 实时交通特征提取, 异常识别等步骤。 采集和存储 GNSS轨迹数据, 是整个方案的数据基础, 由于数 据量级巨大, 应采用分布式存储方案, 对于分布式存储目前已有成熟的技术, 不是本发明的内容。 建 立时空子区, 其基本假设是在某一特定区域、 特定时段内, 有着相同的交通特征, 这一假设, 经过长 期观测, 是普遍适用的。 历史交通特征提取, 其原理是利用 GNSS轨迹数据, 计算得到行程车速, 利 用同一时空子区大量的行程车速数据, 建立车速的概率分布模型, 并进行参数估计, 用少量参数表征 交通特征。 实时交通特征提取, 其原理是将当前时间段内的速度数据进行处理分析, 同样建立当前的 车速概率分布模型。 异常识别是采用差异衡量指标, 判断实时特征相较于历史特征的变化程度, 根据 其是否达到阈值, 确定是否出现交通异常事件。 The overall process of the scheme is shown in Figure 2, including the acquisition and storage of GNSS trajectory data, the establishment of spatiotemporal sub-areas, historical traffic feature extraction, real-time traffic feature extraction, and anomaly identification. Collecting and storing GNSS trajectory data is the data foundation of the whole scheme. Due to the huge amount of data, a distributed storage scheme should be adopted. For distributed storage, there are mature technologies, which are not the content of the present invention. The basic assumption of establishing a spatiotemporal sub-area is that it has the same traffic characteristics in a certain area and a specific time period. This assumption is generally applicable after long-term observation. Historical traffic feature extraction, the principle is to use the GNSS trajectory data to calculate the travel speed, benefit Using a large number of travel speed data in the same space-time sub-area, the probability distribution model of the vehicle speed is established, and the parameters are estimated, and the traffic characteristics are characterized by a small number of parameters. Real-time traffic feature extraction, the principle is to process and analyze the speed data in the current time period, and also establish the current vehicle speed probability distribution model. The abnormality identification is to use the difference measurement index to judge the degree of change of the real-time feature compared with the historical feature, and determine whether a traffic anomaly event occurs according to whether it reaches the threshold.
根据发明内容所述实施方法的组合, 给出实施例如下。 实施例一  According to the combination of the embodiments of the invention, the implementation is given below. Embodiment 1
步骤 11、采用等距时空划分法, 确定时间维度的片段尺度, 时间片段跨度为固定值, 通常取 30mm 作为一个时间片段; 确定空间维度的片段尺度, 空间片段跨度为固定值, 通常取 200mX200m的空间 网格作为一个空间片段。  Step 11. Using the equidistant space-time division method, determining the segment scale of the time dimension, the time segment span is a fixed value, usually taking 30 mm as a time segment; determining the segment scale of the spatial dimension, the spatial segment span is a fixed value, usually taking 200m×200m The spatial grid acts as a spatial fragment.
步骤 12、 进行数据预处理, 将 GNSS定位数据进行数据清洗、 数据集成、 数据转换、 数据归约, 提高数据的结构化程度。  Step 12. Perform data preprocessing to perform data cleaning, data integration, data conversion, and data reduction on the GNSS positioning data to improve the structural degree of the data.
步骤 13、 将所需处理的空间区域划分为一定大小的格网, 每个格网区域的范围可表示为 Step 13. Divide the space area to be processed into a grid of a certain size, and the range of each grid area can be expressed as
4
Figure imgf000014_0001
- 判定定位点所在的格网区域, 并利用距离和方位角, 搜索定 位点所在的路段; 搜索距离点 Α最近的路段, 取阈值 ^=2.5° , 当满足点 A的行驶方向角与路段 ^的 方向角的差值小于阈值 时, 即满足 |< , 完成匹配; 若不满足 | - |< , 在搜索空间中删 除路段 并继续搜索其他路段, 直至满足条件; 利用路段的直线方程 (若为曲线路段则近似拆分为 直线), 计算 GNSS定位点在路段上的投影坐标, 减小因 GNSS定位漂移带来的误差, 具体方法为: 确定路段 ^的直线方程 (若路段为曲线, 则划分为若干直线路段):
4
Figure imgf000014_0001
- Determine the grid area where the anchor point is located, and use the distance and azimuth to search for the section where the anchor point is located; Search for the nearest section of the point, take the threshold ^=2.5°, and satisfy the travel direction angle and section of point A^ When the difference of the direction angle is less than the threshold, that is, |< is satisfied; the match is completed; if the - -<< is not satisfied, the link is deleted in the search space and the other sections are continued to be searched until the condition is satisfied; the straight line equation of the link is used (if The curved section is approximately split into straight lines. The projection coordinates of the GNSS positioning point on the road segment are calculated to reduce the error caused by the GNSS positioning drift. The specific method is: Determine the straight line equation of the road segment ^ (if the road segment is a curve, then divide For several straight segments):
y, - y, y, - y,
其中斜率为: 投影直线方程为: y-yA =- x- kyA -ky +k2x +χΛ Where the slope is: The projection line equation is: yy A =- x - ky A -ky +k 2 x +χ Λ
解出投影坐标 p为:  Solve the projected coordinates p is:
k2+\ k 2 +\
k2yA + yt +kxA - xj k 2 y A + y t +kx A - x j
yP y P
k2 +l k 2 +l
在地图匹配过程后, 结合定位点坐标的时间戳数据, 将定位点匹配到时空子区。  After the map matching process, the anchor point is matched to the spatio-temporal sub-area in combination with the timestamp data of the coordinates of the positioning point.
步骤 14、 由一个时空子区内各辆次浮动车的全部行程车速数据, 构成总体。 计算时空子区 内每 辆车的行程车速: 其中 2... — 为时空子区 f 内的第 1个和第 2个 GNSS定 位点间的距离, ......,第《-1个与第 n个 GNSS定位点间的距离, 为时空子区 内第 1个, ......, 第《个 GNSS定位点的时间戳;将每个时空子区内的数据不做筛选,构成一个集合 V{,用于后续处理。 Step 14. The total travel speed data of each of the secondary floating cars in a time and space sub-zone constitutes an overall. Calculate the travel speed of each vehicle in the time-space sub-region: where 2 ... is the distance between the first and second GNSS anchor points in the space-time sub-region f, ..., "-1 The distance from the nth GNSS anchor point is the first time in the space-time sub-region, ..., the time stamp of the GNSS anchor point; the data in each spatio-temporal sub-region is not filtered. , constitute a collection V { for subsequent processing.
步骤 15、 将无交通异常状况下的历史数据, 作为一个整体, 进行交通特征模型建立和参数估计。 该方法利用有限混合模型, 建立交通特征模型, 并进行参数估计。 取最大成分数量 K=5, 并分别对 «=1,2,···, 个成分的混合高斯模型进行参数估计; 对于 个模型, 通过贝叶斯信息准则 ΒΙΟ 确定最 佳模型。 计算: = ΪΆ n {\,2,...,5} 共 5种混合模型。 同时, 计算 5种模型的 Step 15. As a whole, the historical data under the condition of no traffic anomaly is used to establish a traffic feature model and estimate the parameters. The method uses a finite mixing model to establish a traffic feature model and perform parameter estimation. Take the maximum component quantity K=5, and estimate the parameters of the mixed Gaussian model of «=1,2,···, respectively. For each model, determine the best model by Bayesian information criterion. Calculation: = ΪΆ n {\,2,...,5} A total of 5 mixed models. At the same time, calculate the five models
BIC = -2\nL + k - \an  BIC = -2\nL + k - \an
式中, 为最大似然函数值, 为模型中参数的个数, 《为数据总量。  Where, is the maximum likelihood function value, which is the number of parameters in the model, "for the total amount of data.
之后, 选取 β/C最小的的混合模型, 记录其参数向量^ μ、 σ, 作为本时空子区的特征记录。 步骤 16、 将实时交通数据进行模型建立和参数估计, 获取当前交通状况的特征函数, 方法同步骤 一五, 记录参数向量 τ 、 μ、 a  After that, the mixed model with the smallest β/C is selected, and its parameter vectors ^ μ and σ are recorded as the feature records of the present time-space sub-region. Step 16. Perform real-time traffic data model establishment and parameter estimation to obtain a characteristic function of the current traffic condition. The method is the same as step one, and the parameter vector τ, μ, a is recorded.
步骤 17、 根据当前交通特征的描述参数 τ 、 μ,^ 和历史交通特征的描述参数 η、 μ、 σ计算两个 速度分布间的差异:
Figure imgf000015_0001
JSD(/)ri || P)。
Step 17. Calculate the difference between the two velocity distributions according to the description parameters τ, μ, ^ of the current traffic characteristics and the description parameters η, μ, σ of the historical traffic characteristics:
Figure imgf000015_0001
JSD(/) ri || P).
步骤 18、 将各个时空子区的速度分布差异标准化为 0~1的规范化数值 ad: Step 18: Normalize the difference in velocity distribution of each space-time sub-region to a normalized value a d of 0~1:
diffi} - min(diff) Diff i} - min(diff)
ξ' max、diff、 - min、diff、  ξ' max, diff, - min, diff,
计算各个时空子区的交通异常指数
Figure imgf000015_0002
10。 实施例二
Calculate the traffic anomaly index of each time and space sub-region
Figure imgf000015_0002
10. Embodiment 2
步骤 21、采用等距时空划分法, 确定时间维度的片段尺度, 时间片段跨度为固定值, 通常取 30mm 作为一个时间片段; 确定空间维度的片段尺度, 空间片段跨度为固定值, 通常取 200m X 200m的空间 网格作为一个空间片段。  Step 21: Using the equidistant space-time division method, determining the segment scale of the time dimension, the time segment span is a fixed value, usually taking 30 mm as a time segment; determining the segment scale of the spatial dimension, the spatial segment span is a fixed value, usually taking 200 m X The 200m spatial grid acts as a spatial fragment.
步骤 22、 进行数据预处理, 将 GNSS定位数据进行数据清洗、 数据集成、 数据转换、 数据归约, 提高数据的结构化程度。  Step 22. Perform data preprocessing to perform data cleaning, data integration, data conversion, and data reduction on the GNSS positioning data to improve the structural degree of the data.
的空间区域划分为一定大小的格网, 每个格网区域的范围可表示为
Figure imgf000015_0003
- 判定定位点所在的格网区域, 并利用距离和方位角, 搜索定 位点所在的路段; 搜索距离点 Α最近的路段, 取阈值 ^=2.5 ° , 当满足点 A的行驶方向角与路段 ^的 方向角的差值小于阈值 时, 即满足 | - | < , 完成匹配; 若不满足 | - | < , 在搜索空间中删 除路段 并继续搜索其他路段, 直至满足条件; 利用路段的直线方程 (若为曲线路段则近似拆分为 直线), 计算 GNSS定位点在路段上的投影坐标, 减小因 GNSS定位漂移带来的误差, 具体方法为: 确定路段 ^的直线方程 (若路段为曲线, 则划分为若干直线路段):
The space area is divided into grids of a certain size, and the range of each grid area can be expressed as
Figure imgf000015_0003
- Determine the grid area where the anchor point is located, and use the distance and azimuth to search for the section where the anchor point is located; Search for the nearest section of the point, take the threshold ^=2.5 °, when the travel direction angle and the section of the point A are satisfied ^ When the difference of the direction angle is less than the threshold, that is, | - | < is satisfied, and the match is completed; if the | - | < is not satisfied, the link is deleted in the search space and the other sections are continued to be searched until the condition is satisfied; If it is a curved road segment, it is roughly split into straight lines. Calculate the projection coordinates of the GNSS positioning point on the road segment, and reduce the error caused by the GNSS positioning drift. The specific method is: Determine the straight line equation of the road segment ^ (if the road segment is a curve, It is divided into several straight line segments):
y, - y, y, - y,
其中斜率为: k : 投影直线方程为:Where the slope is: k : The projection line equation is:
Figure imgf000016_0001
Figure imgf000016_0001
kyA -kyt +k2xt +xA Ky A -ky t +k 2 x t +x A
解出投影坐标 P为:
Figure imgf000016_0002
Solve the projected coordinate P as:
Figure imgf000016_0002
在地图匹配过程后, 结合定位点坐标的时间戳数据, 将定位点匹配到时空子区。 步骤 24、计算时空子区 内每辆车的行程车速: ,其中^ ... 1?!为时空子
Figure imgf000016_0003
After the map matching process, the anchor point is matched to the spatio-temporal sub-area in combination with the timestamp data of the coordinates of the positioning point. Step 24: Calculate the travel speed of each vehicle in the space-time sub-region: , where ^ ... 1?! is a time-space sub-
Figure imgf000016_0003
区 内的第 1个和第 2个 GNSS定位点间的距离, ......,第《-1个与第 η个 GNSS定位点间的距离, ^... 为时空子区 内第 1个, ......, 第《个 GNSS定位点的时间戳; 指定时间片段长度 同一时间片段 数据条数上限;^ «; 搜索一个时空子区内时间第 各时间片段内的速度数据, 若时间片段内速度数据条 数超过上限 p 随机取 p 条数据加入 ^。。 The distance between the first and second GNSS anchor points in the zone, ..., the distance between the -1 and the nth GNSS anchor points, ^... is the space-time sub-zone 1 , ..., the first time stamp of a GNSS anchor point; specify the maximum length of the clip data at the same time segment length; ^ «; search for the speed data in the time segment of the time and space sub-region If the number of speed data in the time segment exceeds the upper limit p, randomly f data is added to ^. .
步骤 25、 将无交通异常状况下的历史数据, 作为一个整体, 进行交通特征模型建立和参数估计。 该方法利用有限混合模型, 建立交通特征模型, 并进行参数估计。 取最大成分数量 K=5, 并分别对 «=1,2,···, 个成分的混合高斯模型进行参数估计; 对于 个模型, 通过贝叶斯信息准则 ΒΙΟ 确定最 佳模型。 计算:
Figure imgf000016_0004
Step 25. Perform historical traffic data without traffic anomalies as a whole, and perform traffic feature model establishment and parameter estimation. The method uses a finite mixing model to establish a traffic feature model and perform parameter estimation. Take the maximum component quantity K=5, and estimate the parameters of the mixed Gaussian model of «=1,2,···, respectively. For each model, determine the best model by Bayesian information criterion. Calculation:
Figure imgf000016_0004
共 5种混合模型。 同时, 计算 5种模型的  There are 5 mixed models in total. At the same time, calculate the five models
BIC = -2\nL + k-\an  BIC = -2\nL + k-\an
式中, 为最大似然函数值, 为模型中参数的个数, 《为数据总量。  Where, is the maximum likelihood function value, which is the number of parameters in the model, "for the total amount of data.
之后, 选取 β/C最小的的混合模型, 记录其参数向量^ μ、 σ, 作为本时空子区的特征记录。 根据参数估计结果, 写出时空子区在不同日期对应的行程车速分布的概率密度函数 p,(x):
Figure imgf000016_0005
After that, the mixed model with the smallest β/C is selected, and its parameter vectors ^ μ and σ are recorded as the feature records of the local time and space sub-region. According to the parameter estimation result, the probability density function p, (x) of the travel speed distribution corresponding to the spatio-temporal sub-region on different dates is written:
Figure imgf000016_0005
计算各分布两两之间的 Jensen-Shannon散度 d1J: Calculate the Jensen-Shannon divergence d 1J between each of the two distributions :
d^JSDiPWQ) ---D(P\\M) + -D(Q\\M) 式中, P、 β为两个不同的概率分布, Μ -(P + Q), D为 Kullback— Leibler散度:
Figure imgf000016_0006
d^JSDiPWQ) ---D(P\\M) + -D(Q\\M) where P and β are two different probability distributions, Μ -(P + Q), D is Kullback- Leibler Divergence:
Figure imgf000016_0006
在采用有限混合模型的情况下, 采用蒙特卡罗抽样方法近似计算, 其计算方法是:  In the case of a finite mixing model, Monte Carlo sampling is used to approximate the calculation. The calculation method is:
»MC( II g) = II g)»MC( II g) = II g)
Figure imgf000016_0007
将分布两两间的散度表示成距离矩阵:
Figure imgf000016_0007
The divergence between the distributions is expressed as a distance matrix:
D = D =
d„、 . .. d 该矩阵满足 4=4,, d,r0(i=j)。 d„, . . . d The matrix satisfies 4=4,, d, r 0(i=j).
将距离矩阵作为 K-Medoids算法的输入, 得到聚类结果, 并对类别建立索引。  The distance matrix is used as the input of the K-Medoids algorithm to obtain clustering results and index the categories.
以类别索引为响应变量, 将交通环境数据 (包括气温、 降水量、 能见度等) 作为自变量, 进行多 项 Logit回归, 获取交通环境 E与交通态势类别 T的映射关系 R(E 。  Using the category index as the response variable, the traffic environment data (including temperature, precipitation, visibility, etc.) is used as an independent variable, and multiple Logit regressions are performed to obtain the mapping relationship between the traffic environment E and the traffic situation category T (E.
将相同类别的数据进行聚合, 并利用聚合后新的数据集重新建立混合模型, 并进行参数估计, 得 到最终的历史交通特征数据集。  The same type of data is aggregated, and the mixed model is re-established with the new data set after aggregation, and the parameter estimation is performed to obtain the final historical traffic characteristic data set.
步骤 26、 获取交通状况的特征函数, 同时获取当前的气温、 降水量、 能见度、 交通管制措施等信 息, 并判断当前交通状况的类别。  Step 26: Obtain a characteristic function of the traffic condition, and obtain information such as current temperature, precipitation, visibility, traffic control measures, and the type of the current traffic condition.
计算时空子区内的行程车速, 构成实时行程车速总体 Virt', 建立行程车速概率分布模型 ρ ξ = ^η; , 并进行参数估计; 将当前交通环境数据 (包括气温、 降水量、 能见度等) 作为输入参数, 利用映射关系 R(£^)获得 当前交通态势的所述类别 Γ。 Calculate the travel speed in the time-space sub-zone, form the real-time travel speed overall V irt ', establish the travel speed probability distribution model ρ ξ = ^η ; and make parameter estimation; the current traffic environment data (including temperature, precipitation, visibility, etc.) As an input parameter, the category Γ of the current traffic situation is obtained using the mapping relationship R(£^).
步骤 27、 根据当前交通态势所属类别 Τ, 定位该类别下历史交通特征数据; 根据当前交通特征的 描述参数 τ 、 μ„、 (Jrt和历史交通特征的描述参数 η、 μ、 σ 计算两个速度分布间的差异: diff [(η, , μΓί , σΓί ) , (η, μ, σ)] = JSD(Prt 11 Ρ)。 Step 27. According to the category of the current traffic situation, locate the historical traffic characteristic data of the category; calculate two parameters according to the description parameters τ, μ„, (J rt and the description parameters η, μ, σ of the historical traffic feature) The difference between the velocity distributions: diff [(η, , μ Γί , σ Γί ) , (η, μ, σ)] = JSD(P rt 11 Ρ).
步骤 28、 将各个时空子区的速度分布差异标准化为 0~1的规范化数值《ί: Step 28 : Normalize the difference in velocity distribution of each spatiotemporal sub-region to a normalized value of 0 to 1 .
diff} - min(diff) Diff } - min(diff)
ξ' max、diff、 - min、diff、  ξ' max, diff, - min, diff,
计算各个时空子区的交通异常指数
Figure imgf000017_0001
10。 实施例三
Calculate the traffic anomaly index of each time and space sub-region
Figure imgf000017_0001
10. Embodiment 3
步骤 31、 采用非等距时空划分法, 对于路网密度大于 2km/km2或高峰小时流量大于 1000辆 /小时 的城市中心区, 取 30min的时间片段和 200m X 200m的空间片段, 对于路网密度小于 2km/km2或高峰 小时流量小于 1000辆 /小时的城市郊区, 取 30min的时间片段和 400m X 400m的空间片段。 Step 31: Using a non-equidistant space-time division method, for a central area of the city where the road network density is greater than 2 km/km 2 or the peak hour flow is greater than 1000 vehicles/hour, a 30 min time segment and a 200 m X 200 m space segment are taken. For urban suburbs with a density of less than 2km/km 2 or a peak hour flow of less than 1000 vehicles per hour, a 30-minute time segment and a 400 m X 400 m space segment are taken.
步骤 32、 进行数据预处理, 将 GNSS定位数据进行数据清洗、 数据集成、 数据转换、 数据归约, 提高数据的结构化程度。  Step 32: Perform data preprocessing, perform data cleaning, data integration, data conversion, and data reduction on the GNSS positioning data to improve the structural degree of the data.
步骤 33、 将所需处理的空间区域划分为一定大小的格网, 每个格网区域的范围可表示为 4 = {(xs ^s ) \ xs Step 33: Divide the space area to be processed into a grid of a certain size, and the range of each grid area can be expressed as 4 = {( x s ^s ) \ x s
将浮动车 GNSS数据采集频率表示为
Figure imgf000017_0002
, Pfc+io)定义为 A的 1-邻近点, ^04-2¾), ^04+2¾)定义为 A的 2-邻近点, 以此类推, 则 Ρθ4-/¾;), 定义为 Α的 /- 邻近点。 在/ Q<lHz时, 取^ =1或 2。 取距离 A及 A的 邻近点距离最小的路段^ 并计算 A及 A的
Express the floating vehicle GNSS data acquisition frequency as
Figure imgf000017_0002
, Pfc+io) is defined as A 1-adjacent points, ^04-23⁄4), ^04+23⁄4) are defined as 2-adjacent points of A, and so on, then Ρθ4-/3⁄4;), defined as // adjacent points of Α. When / Q <lHz, take ^ =1 or 2. Take the distance between the neighboring points of the distances A and A and calculate the A and A
^邻近点行驶方向角的均值 ^4·, 取阈值 ^=5° , 若满足 |^.- |< , 完成匹配; 否则, 搜索其他路 段, 直至满足条件。 ^The mean value of the driving direction angle of the adjacent point is ^4·, and the threshold is ^=5°. If |^.- |< is satisfied, the matching is completed; otherwise, other sections are searched until the condition is met.
利用路段的直线方程 (若为曲线路段则近似拆分为直线),计算 GNSS定位点在路段上的投影坐标, 减小因 GNSS定位漂移带来的误差。 具体方法为:  Use the straight line equation of the road segment (if it is a curved road segment, it is roughly split into straight lines), calculate the projection coordinates of the GNSS positioning point on the road segment, and reduce the error caused by the GNSS positioning drift. The specific method is:
确定路段 ^的直线方程 (若路段为曲线, 则划分为若干直线路段): y-y^kix-x,) y, - y,  Determine the straight line equation of the road segment ^ (If the road segment is a curve, divide it into several straight line segments): y-y^kix-x,) y, - y,
其中斜率为: 投影直线方程为: The slope is: The projection line equation is:
Figure imgf000018_0001
Figure imgf000018_0001
kyA -kyt +k2xt +xA Ky A -ky t +k 2 x t +x A
解出投影坐标 p为:  Solve the projected coordinates p is:
k2+l k 2 +l
-h , -kx: -h , -kx :
yP y P
在地图匹配过程后, 结合定位点坐标的时间戳数据, 将定位点匹配到时空子区。 步骤 34、计算时空子区 内每辆车的行程车速: 其中 2...4— 1,«为时空子
Figure imgf000018_0002
After the map matching process, the anchor point is matched to the spatio-temporal sub-area in combination with the timestamp data of the coordinates of the positioning point. Step 34: Calculate the travel speed of each vehicle in the space-time sub-region: where 2 ... 4 - 1, « is a time slot
Figure imgf000018_0002
区 内的第 1个和第 2个 GNSS定位点间的距离, ......,第《-1个与第 n个 GNSS定位点间的距离, ^... 为时空子区 内第 1个, ......, 第《个 GNSS定位点的时间戳; 指定时间片段长度 同一时间片段 数据条数上限;^ «; 搜索一个时空子区内时间第 各时间片段内的速度数据, 若时间片段内速度数据条 数超过上限 p 随机取 p 条数据加入 ^。。 The distance between the first and second GNSS anchor points in the zone, ..., the distance between the -1 and the nth GNSS anchor point, ^... is the space-time sub-zone 1 , ..., the first time stamp of a GNSS anchor point; specify the maximum length of the clip data at the same time segment length; ^ «; search for the speed data in the time segment of the time and space sub-region If the number of speed data in the time segment exceeds the upper limit p, randomly f data is added to ^. .
步骤 35、 将无交通异常状况下的历史数据, 作为一个整体, 进行交通特征模型建立和参数估计。 该方法利用有限混合模型, 建立交通特征模型, 并进行参数估计。 取最大成分数量 K=5, 并分别对 «=1,2,···, 个成分的混合高斯模型进行参数估计; 对于 个模型, 通过贝叶斯信息准则 ΒΙΟ 确定最 佳模型。 计算: "e{l,2 ..,5} 共 5种混合模型。 同时, 计算 5种模型的  Step 35: Perform historical data of no traffic anomaly as a whole, and establish traffic feature model and parameter estimation. The method uses a finite mixing model to establish a traffic feature model and perform parameter estimation. Take the maximum component quantity K=5, and estimate the parameters of the mixed Gaussian model of «=1,2,···, respectively; for each model, determine the best model by Bayesian information criterion ΒΙΟ. Calculation: "e{l,2 ..,5} A total of 5 hybrid models. At the same time, calculate 5 models
BIC = -2\nL + k-\nn  BIC = -2\nL + k-\nn
式中, 为最大似然函数值, 为模型中参数的个数, 《为数据总量。  Where, is the maximum likelihood function value, which is the number of parameters in the model, "for the total amount of data.
之后, 选取 β/C最小的的混合模型, 记录其参数向量^ μ、 σ, 作为本时空子区的特征记录 ε 根据参数估计结果, 写出时空子区在不同日期对应的行程车速分布的概率密度函数 p,(x): 计算各分布两两之间的 Jense -Shannon散度 dy:
Figure imgf000019_0001
After that, the mixed model with the smallest β/C is selected, and the parameter vectors ^ μ and σ are recorded. As the characteristic record ε of the present space-time sub-region, according to the parameter estimation result, the probability of the distribution of the speed of the space-time sub-region corresponding to the different speeds is written. Density function p, (x): Calculate the Jense-Shannon divergence d y between each of the two distributions:
Figure imgf000019_0001
式中, P、 ρ为两个不同的概率分布, Μ=^0Ρ + ρ;), /?为 !^^-!^!^!"散度:  Where P and ρ are two different probability distributions, Μ=^0Ρ + ρ;), /? For ! ^^-! ^!^!"Diversity:
D(P\\Q) = ^P(xk)log 在采用有限混合模型的情况下, 采用蒙特卡罗抽样方法近似计算, 其计算方法是:
Figure imgf000019_0002
D(P\\Q) = ^P(x k )log In the case of a finite-mixing model, Monte Carlo sampling is used to approximate the calculation. The calculation method is:
Figure imgf000019_0002
将分布两两间的散度表示成距离矩阵:  The divergence between the distributions is expressed as a distance matrix:
D- d„、 . .. d 该矩阵满足 4=4,, d,r0(i=j)。 D- d„, . . . d This matrix satisfies 4=4,, d, r 0(i=j).
将距离矩阵作为 K-Medoids算法的输入, 得到聚类结果, 并对类别建立索引。  The distance matrix is used as the input of the K-Medoids algorithm to obtain clustering results and index the categories.
以类别索引为响应变量, 将交通环境数据 (包括气温、 降水量、 能见度等) 作为自变量, 进行多 项 Logit回归, 获取交通环境 E与交通态势类别 T的映射关系 R(E 。 将相同类别的数据进行聚合, 并利用聚合后新的数据集重新建立混合模型, 并进行参数估计, 得 到最终的历史交通特征数据集。  Using the category index as the response variable, the traffic environment data (including temperature, precipitation, visibility, etc.) is used as an independent variable to perform multiple logit regression to obtain the mapping relationship between the traffic environment E and the traffic situation category T (E. The same category will be used. The data is aggregated, and the hybrid model is re-established with the new data set after aggregation, and the parameter estimation is performed to obtain the final historical traffic characteristic data set.
步骤 36、 获取交通状况的特征函数, 同时获取当前的气温、 降水量、 能见度、 交通管制措施等信 息, 并判断当前交通状况的类别。  Step 36: Obtain a characteristic function of the traffic condition, and obtain current information such as temperature, precipitation, visibility, traffic control measures, and the type of the current traffic condition.
计算时空子区内的行程车速, 构成实时行程车速总体 VIRT', 建立行程车速概率分布模型 pjv rt) = f lj -/;( ^^^;), 并进行参数估计; 将当前交通环境数据 (包括气温、 降水量、 能见度等) 作为输入参数, 利用映射关系 R(E 得 当前交通态势的所述类别 Γ。 Calculate the travel speed in the time-space sub-zone, form the overall V IRT ' of the real-time travel speed, establish the travel speed probability distribution model pj v rt ) = f lj - / ; ( ^^^ ; ), and make parameter estimation; The data (including temperature, precipitation, visibility, etc.) is used as an input parameter, and the mapping relationship R (E is used to obtain the category 当前 of the current traffic situation.
步骤 37、 根据当前交通态势所属类别 Τ, 定位该类别下历史交通特征数据; 根据当前交通特征的 描述参数 τ 、 μ„、 (Jrt和历史交通特征的描述参数 η、 μ、 σ 计算两个速度分布间的差异: diff [(η, , μΓί , σΓί ) , (η, μ, σ)] = JSD(PRT 11 Ρ)。 Step 37: Locating the historical traffic characteristic data of the category according to the current traffic situation category ;; calculating two parameters according to the description parameters τ , μ „, (J rt and the description parameters η, μ, σ of the historical traffic feature) The difference between the velocity distributions: diff [(η, , μ Γί , σ Γί ) , (η, μ, σ)] = JSD(P RT 11 Ρ).
步骤 38、 将各个时空子区的速度分布差异标准化为 0~1的规范化数值《ί: Step 38 : Normalize the difference in velocity distribution of each space-time sub-region to a normalized value of 0~1 .
diff^ - in(diff)  Diff^ - in(diff)
max [diff、 - min [diff、  Max [diff, - min [diff,
计算各个时空子区的交通异常指数 ^f xli^  Calculate the traffic anomaly index of each spatiotemporal sub-area ^f xli^

Claims

权利要求书 Claim
1. 一种城市道路交通异常检测方法, 包括如下步骤:  1. An urban road traffic anomaly detection method, comprising the following steps:
1) 建立时空子区: 将一天划分为若干时间片段, 每个时间片段称为一个时间子区; 将城市道路 交通异常检测的实施区域划分为若干空间片段, 每个空间片段称为一个空间子区; 任意一个 时间子区和任意一个空间子区的交集称为时空子区;  1) Establish space-time sub-area: divide the day into several time segments, each time segment is called a time sub-region; divide the implementation area of urban road traffic anomaly detection into several spatial segments, each spatial segment is called a space sub- Area; the intersection of any one of the time sub-areas and any one of the spatial sub-areas is called a spatio-temporal sub-area;
2) 历史轨迹数据的预处理: 将浮动车 GNSS定位历史数据处理为历史轨迹的抽样车速数据; 实时轨迹数据的预处理: 将浮动车 GNSS定位实时数据处理为实时轨迹的抽样车速数据; 2) Preprocessing of historical trajectory data: processing the floating vehicle GNSS positioning historical data as the sampled vehicle speed data of the historical trajectory; Preprocessing of the real-time trajectory data: processing the floating vehicle GNSS positioning real-time data into the sampled vehicle speed data of the real-time trajectory;
3) 历史轨迹数据分析和特征提取: 利用所述历史轨迹的抽样车速数据, 建立历史行程速度概率 分布, 得到历史交通特征模型 PA, 实施方法是: 将无交通异常状况下的历史数据, 作为一个 整体, 采用有限混合模型方法建立交通特征模型, 并进行参数估计; 3) Historical trajectory data analysis and feature extraction: Using the sampled vehicle speed data of the historical trajectory, establishing a historical travel speed probability distribution, and obtaining a historical traffic characteristic model P A , the implementation method is: using historical data under the condition of no traffic abnormality as A whole, a finite mixed model method is used to establish a traffic feature model, and parameter estimation is performed;
实时轨迹数据分析和特征提取: 利用所述实时轨迹的抽样车速数据, 建立实时行程速度概率 分布, 得到实时交通特征模型  Real-time trajectory data analysis and feature extraction: Using the sampled vehicle speed data of the real-time trajectory, establishing a real-time travel speed probability distribution, and obtaining a real-time traffic feature model
4) 异常检测: 通过 Jensen-Shannon散度衡量所述历史交通特征模型与所述实时交通特征模型的 差异, 通过利用历史交通特征模型计算得到的 Jensen-Shannon散度与利用实时交通特征模型 计算得到的 Jensen-Shannon散度, 计算得到历史与实时交通特征差异值;  4) Anomaly detection: The Jensen-Shannon divergence is used to measure the difference between the historical traffic feature model and the real-time traffic feature model. The Jensen-Shannon divergence calculated by using the historical traffic feature model is calculated by using the real-time traffic feature model. Jensen-Shannon divergence, calculated historical and real-time traffic characteristics difference values;
5) 异常严重性量化表征: 利用所述历史与实时交通特征差异值, 计算交通状况异常指数; 5) Quantitative characterization of abnormal severity: Calculate the abnormality index of traffic conditions by using the difference between the historical and real-time traffic characteristics;
6) 系统性能评价: 评价交通异常状态检测的准确性, 衡量系统运行的稳定程度。 6) System performance evaluation: Evaluate the accuracy of traffic abnormality detection and measure the stability of system operation.
2. 如权利要求 1所述的城市道路交通异常检测方法, 其特征在于, 步骤 1)采用下述方法之一: 2. The urban road traffic anomaly detecting method according to claim 1, wherein: step 1) adopting one of the following methods:
la) 等距时空划分法: 确定时间维度的片段尺度, 时间片段跨度为固定值, 取 30mm作为一个时间 片段; 确定空间维度的片段尺度, 空间片段跨度为固定值, 取 200mX 200m 的空间网格作为 一个空间片段;  La) Isometric space-time division method: Determine the segment scale of the time dimension, the time segment span is a fixed value, take 30mm as a time segment; determine the segment scale of the spatial dimension, the spatial segment span is a fixed value, take a spatial grid of 200mX 200m As a space segment;
lb) 基于路网密度的非等距时空划分法: 基于路网密度作为判断指标, 当路网密度大于等于 2km/km2时,取 30min的时间片段和 200m X 200m的空间片段;当路网密度小于 2km/km2时, 取 30min的时间片段和 400m X 400m的空间片段; lb) based network density of non-equidistant temporal division method: based network density as a determination indicator, when the road network density greater than or equal / km 2 when 2km, time segments takes 30min and 200m X 200m spatial segment; when the road network When the density is less than 2km/km 2 , take a 30 min time segment and a 400 m X 400 m spatial segment;
lc) 基于高峰小时流量的非等距时空划分法: 基于高峰小时流量作为判断指标, 当高峰小时流量大 于等于 1000辆 /小时时,取 30mm的时间片段和 200mX 200m的空间片段; 当高峰小时流量小 于 1000辆 /小时时, 取 30min的时间片段和 400mX 400m的空间片段。  Lc) Non-equidistant space-time division method based on peak hour flow: Based on peak hourly flow rate as a judgment indicator, when the peak hour flow rate is greater than or equal to 1000 vehicles/hour, take 30mm time segment and 200mX 200m space segment; When less than 1000 vehicles/hour, take a 30-minute time segment and a 400-mX 400-meter space segment.
3. 如权利要求 1所述的城市道路交通异常检测方法, 其特征在于, 步骤 2)所述的历史轨迹数据的预处 理包括: 3. The urban road traffic anomaly detecting method according to claim 1, wherein the preprocessing of the historical trajectory data in step 2) comprises:
2a) 数据结构化: 将浮动车 GNSS定位历史数据进行数据清洗、 数据集成、 数据转换、 数据归约, 得到结构化 GNSS定位历史数据;  2a) Data structuring: Data cleaning, data integration, data conversion, and data reduction of floating vehicle GNSS positioning history data to obtain structured GNSS positioning history data;
2b) 快速地图匹配: 结合城市路网数据, 通过地图匹配算法, 将结构化 GNSS定位历史数据投影到 城市路网, 建立所述结构化 GNSS 定位历史数据中的定位点与路段的匹配关系, 得到所述结 构化 GNSS定位历史数据中的定位点与所述路段的匹配关系表,并修正定位漂移带来的误差; 2c) 历史轨迹的车速计算和抽样: 根据所述结构化 GNSS定位历史数据计算交通运行特征参数, 得 到历史轨迹的车速数据, 并对所述历史轨迹的车速数据进行数据抽样, 得到历史轨迹的抽样 车速数据。 2b) Fast map matching: Combine the urban road network data, map the structured GNSS positioning historical data to the urban road network through the map matching algorithm, and establish the matching relationship between the positioning points and the road segments in the structured GNSS positioning historical data, Forming a matching relationship between the positioning point in the structured GNSS positioning history data and the road segment, and correcting the error caused by the positioning drift; 2c) calculating and sampling the vehicle speed of the historical trajectory: calculating according to the structured GNSS positioning historical data The traffic operation characteristic parameter obtains the vehicle speed data of the historical trajectory, and performs data sampling on the vehicle speed data of the historical trajectory to obtain a sampling of the historical trajectory. Speed data.
4. 如权利要求 3所述的城市道路交通异常检测方法,其特征在于,步骤 2b)所述的快速地图匹配包括:4. The method of detecting urban road traffic anomaly according to claim 3, wherein the fast map matching according to step 2b) comprises:
2b 1) 将所需处理的空间区域划分为一定大小的格网, 每个格网区域的范围可表示为2b 1) Divide the space area to be processed into a grid of a certain size, and the range of each grid area can be expressed as
4 ={(xs,ys)\xs G[xf,xf+1),^ [yr,yr+1)}^ 每个格网区域包含若干个路段, 把这些路段的集合表 示为 Rs, 所述路段的集合 Rs中的每条路段表示为 并为每个路段赋予编号; 4 ={(x s ,y s )\x s G[x f ,x f+1 ),^ [y r ,y r+1 )}^ Each grid area contains several sections, which are The set is represented as R s , and each of the sets R s of the road segments is represented as and assigned a number to each road segment;
2b2) 判定定位点所在的格网区域, 并利用距离和方位角, 在路段的集合 中搜索某定位点 A所在 的路段  2b2) Determine the grid area where the anchor point is located, and use the distance and azimuth angle to search for the section where the anchor point A is located in the set of road segments.
2b3) 利用 GNSS定位点直线投影法, 计算 GNSS定位点在路段上的投影坐标。  2b3) Calculate the projection coordinates of the GNSS anchor point on the road segment using the GNSS anchor point linear projection method.
5. 如权利要求 4所述的城市道路交通异常检测方法, 其特征在于, 步骤 2b2)采用下述方法之一:The urban road traffic anomaly detecting method according to claim 4, wherein the step 2b2) adopts one of the following methods:
2b21) 单点匹配方法: 搜索距离某定位点 A最近的路段, 实施方法是: 对于路段的集合 中的某 一路段 ij, 当满足点 A的行驶方向角 与路段 ij的方向角 的差值小于阈值 时, 即满足2b21) Single point matching method: Search for the nearest road segment to a certain positioning point A. The implementation method is: For a certain road segment ij in the set of road segments, the difference between the traveling direction angle satisfying the point A and the direction angle of the road segment ij is smaller than When the threshold is satisfied
| - |< 时, 完成匹配; 若不满足 -6 < , 继续搜索路段集合 R 中的其他路段, 直至 满足 | - |< ; | - |< When the match is completed; if -6 < is not satisfied, continue to search for other sections in the link set R until | - |<;
2b22) 点序列匹配方法:本方案适用于高频浮动车数据;将每两条相邻时间的浮动车 GNSS数据时 间间隔表示为 ί。, 将浮动车 GNSS数据采集频率表示为 /Q=l/iQ, 将某定位点 A的时间记录表 示为 将时间上与所述定位点 A相邻的点 P(tA-t0), Pfc+if 定义为 A的 1-邻近点, P(tA-2h), P04+2iQ;)定义为某定位点 A的 2-邻近点, 以此类推, 则 Pi -kt^h 定义为某定位点 A 的 ^邻近点; 在/ Q<lHz时, 取^ =1或 2; 取距离某定位点 A及该定位点的 ^邻近点距离最小 的一个路段^ 并计算该定位点及其 邻近点行驶方向角的均值 ^4i, 若满足 | 一 |< ,完 成匹配; 否则, 搜索其他路段, 直至满足 | 一 |< 。 2b22) Point sequence matching method: This scheme is applicable to high frequency floating car data; the time interval of floating car GNSS data of each two adjacent time is expressed as ί. , the floating vehicle GNSS data acquisition frequency is expressed as / Q = l / i Q , and the time record of a certain positioning point A is represented as a point P (t A - t 0 ) that is temporally adjacent to the positioning point A, Pfc+if is defined as 1-adjacent point of A, P(t A -2h), P04+2i Q ;) is defined as 2-adjacent point of a certain anchor point A, and so on, then Pi -kt^h is defined as a neighboring point of a certain positioning point A; when / Q <lHz, take ^ =1 or 2; take a section of the distance from a certain positioning point A and the adjacent point of the positioning point ^ and calculate the positioning point and The mean value of the driving direction angle of the adjacent point ^4i, if |_|< is satisfied, the matching is completed; otherwise, the other road segments are searched until the |1|< is satisfied.
6. 如权利要求 3所述的城市道路交通异常检测方法, 其特征在于, 步骤 2c)所述的历史轨迹的车速计算 和抽样采用下述方法之一: 6. The urban road traffic anomaly detecting method according to claim 3, wherein the vehicle speed calculation and sampling of the historical trajectory of step 2c) adopt one of the following methods:
2c 1) 全样本方法: 由一个时空子区 ξ内各辆次浮动车的全部行程车速数据, 构成总体, 实施方法 是计算时空子区 内每辆车的行程车速: 其中 42...4— ln为时空子区
Figure imgf000021_0001
2c 1) Full sample method: The overall speed data of each sub-floating vehicle in a time-space sub-region is composed of the total speed of the vehicle. The implementation method is to calculate the travel speed of each vehicle in the space-time sub-region: 4 2 ... 4 — ln is a spatiotemporal subregion
Figure imgf000021_0001
内的第 1个和第 2个 GNSS定位点间的距离, ......, 第《-1个与第 n个 GNSS定位点间的距 离, iL.A为时空子区 内第 1个, ......, 第《个 GNSS定位点的时间戳; 将每个时空子区内 的行程车速数据不做筛选, 构成一个集合 ^, 用于后续处理;  The distance between the first and second GNSS anchor points, ..., the distance between the -1 and the nth GNSS anchor point, iL.A is the first in the space-time sub-region , ..., the first time stamp of the GNSS anchor points; the travel speed data of each time and space sub-zone is not filtered, and constitutes a set ^ for subsequent processing;
2c2) 时间平滑的抽样方法: 指定时间片段长度, 设定同一时间片段数据条数上限; 搜索一个时空 子区内各时间片段内的速度数据, 若时间片段内速度数据条数超过上限, 随机取上限条数的 数据用于后续处理,实施方法是计算时空子区 内每辆车的行程车速: , 其中 2... 为时空子区 内的第 1个和第 2个 GNSS定位点间的距离, ... ..., 第《- 1个与 第《个GNSS定位点间的距离, ^… 为时空子区^内第丄个, ... ..., 第《个GNSS定位点的 时间戳; 指定时间片段长度 同一时间片段数据条数上限;^ «; 搜索一个时空子区内时间第 i各时间片段内的速度数据, 若时间片段内速度数据条数超过上限 p 随机取 条数据加 入 ^并用于后续处理。 2c2) Time-smooth sampling method: Specify the length of the time segment, set the upper limit of the number of segments of the same time; search for the velocity data in each time segment in a time-space sub-region, if the number of velocity data in the time segment exceeds the upper limit, randomly take Upper limit The data is used for subsequent processing. The implementation method is to calculate the travel speed of each vehicle in the space-time sub-region: , where 2 ... is the distance between the first and second GNSS ... ..., the first "- and the distance between the GNSS anchor points, ^... is the first time in the space-time sub-area ^, ..., the time stamp of the GNSS anchor point; time slice a maximum length of the number of data pieces in the same time segment; ^ «; speed data within the search a temporal sub-area at the time of the i each time segment, if the time slice speed data number of pieces exceeds the upper limit p randomly of data added ^ and dried For subsequent processing.
7. 如权利要求 1至 6之一所述的城市道路交通异常检测方法,其特征在于,步骤 3)采用下述方法之一:The urban road traffic anomaly detecting method according to any one of claims 1 to 6, wherein the step 3) adopts one of the following methods:
3a) 固定成分混合高斯模型法: 采用固定成分数量 的混合高斯模型描述车速的概率分布, 成分 数量根据车速在典型情况下的分布模式人工指定, 成分数量 取 4~6; 将实时交通数据进行 模型建立和参数估计, 获取当前交通状况的特征函数; 3a) Fixed-component mixed Gaussian model method: The mixed Gaussian model with a fixed number of components is used to describe the probability distribution of the vehicle speed. The number of components is manually specified according to the distribution pattern of the vehicle speed under typical conditions. The number of components is 4~6. The real-time traffic data is modeled. Establishing and parameter estimation to obtain a characteristic function of the current traffic condition;
3b) 可变混合高斯模型法: 采用可变的成分数量, 或者可变的成分数量和可变的分布类型; 采用分 类处理法, 获取交通状况的特征函数, 同时获取当前的气温、 降水量、 能见度、 交通管制措 施等信息, 并判断当前交通状况的类别。  3b) Variable mixed Gaussian model method: using variable component quantities, or variable component quantities and variable distribution types; using classification processing method to obtain the characteristic function of traffic conditions, and obtaining current temperature and precipitation, Information such as visibility, traffic control measures, and the type of current traffic conditions.
8. 如权利要求 7所述的交通异常检测方法, 其特征在于, 步骤 3b)所述的可变混合高斯模型法包括下 述方法之一: 8. The traffic anomaly detection method according to claim 7, wherein the variable hybrid Gaussian model method according to step 3b) comprises one of the following methods:
3b l) 成分数量可变的混合高斯模型法: 采用基于模型评价的方法来选择合适的成分数量, 方法如 下:确定可能的最大成分数量 K, 并分别对《= 1,2,…^个成分的混合高斯模型进行参数估计; 对于 个模型, 通过贝叶斯信息准则 (β/C) 确定最佳模型;  3b l) Mixed Gaussian model method with variable number of components: The method based on model evaluation is used to select the appropriate number of components. The method is as follows: Determine the maximum number of possible components K, and respectively for the components of = 1, 2, ... a mixed Gaussian model for parameter estimation; for each model, the best model is determined by Bayesian information criterion (β/C);
3b2) 成分数量、 分布类型均可变的有限混合模型法: 子成分的分布形态和成分的数量均可变; 步骤 3b)所述的分类处理法包括:  3b2) The finite mixed model method in which the number of components and the type of distribution are variable: the distribution pattern and the number of components of the subcomponent are variable; the classification processing described in step 3b) includes:
3b3) 计算时空子区内的行程车速, 构成实时行程车速总体  3b3) Calculate the travel speed in the time-space sub-zone, which constitutes the overall real-time travel speed
3b4) 建立行程车速概率分布模型^ ^二 ! ^ / ^ ^^,^.), 式中, 表示所述实时交通特征的 子成分数量, //表示所述实时交通特征的某个子成分所占的比例, 表示所述实时交通特征模 型的某个子成分的均值, σ表示所述实时交通特征模型的某个子成分的标准差, 并进行参数估 计, 得到当前实时交通特征的描述参数 T!rt、 art, 其中, ,是所述实时交通特征模型中各 个子成分所占的比例向量, ^是所述实时交通特征模型中各个子成分的均值向量, (Trt是所述 实时交通特征模型中各个子成分的标准差向量; 3b4) Establish a travel speed probability distribution model ^ ^ 2 ! ^ / ^ ^^, ^.), where the number of sub-components of the real-time traffic feature is represented, // represents a sub-component of the real-time traffic feature The ratio represents the mean value of a certain sub-component of the real-time traffic feature model, σ represents the standard deviation of a certain sub-component of the real-time traffic feature model, and performs parameter estimation to obtain a description parameter T! rt of the current real-time traffic feature, a rt , where , is the proportional vector of each sub-component in the real-time traffic feature model, ^ is the mean vector of each sub-component in the real-time traffic feature model, (T rt is in the real-time traffic feature model The standard deviation vector of each subcomponent;
3b5) 将当前交通环境数据 (包括气温、 降水量、 能见度等) 作为输入参数, 利用映射关系 R(£^) 获得当前交通态势的所述类别 T, 其中 £表示所述交通环境数据。  3b5) Taking the current traffic environment data (including temperature, precipitation, visibility, etc.) as an input parameter, using the mapping relationship R(£^) to obtain the category T of the current traffic situation, where £ represents the traffic environment data.
9. 如权利要求 1至 6之一所述的城市道路交通异常检测方法, 其特征在于, 步骤 4)异常检测包括:The urban road traffic anomaly detecting method according to any one of claims 1 to 6, wherein the step 4) the abnormality detecting comprises:
4a) 根据当前交通态势所属类别 Γ, 定位该类别下历史交通特征数据, 如无类别划分, 则不必区分 类别; 4b) 根据当前交通特征的描述参数 τ 、 μ,^ 和历史交通特征的描述参数 η、 μ、 σ计算两个速度 分布间的差异: ^[(τ^,μ ,σ Μη,μ^^^/ ^ ΙΙΑ), 其中, 是所述实时交通特征模型 中各个子成分所占的比例向量, ^是所述实时交通特征模型中各个子成分的均值向量, (^是 所述实时交通特征模型中各个子成分的标准差向量; η是所述历史交通特征模型中各个子成分 所占的比例向量, μ是所述历史交通特征模型中各个子成分的均值向量, σ是所述历史交通特 征模型中各个子成分的标准差向量。 4a) According to the category of the current traffic situation, locate the historical traffic characteristic data under the category. If there is no category division, it is not necessary to distinguish the category; 4b) Calculate the difference between the two velocity distributions according to the description parameters τ, μ, ^ of the current traffic characteristics and the description parameters η, μ, σ of the historical traffic characteristics: ^[(τ^, μ , σ Μη, μ^^^ / ^ ΙΙΑ), where is the proportional vector of each sub-component in the real-time traffic feature model, ^ is the mean vector of each sub-component in the real-time traffic feature model, (^ is the real-time traffic feature model The standard deviation vector of each sub-component; η is the proportional vector of each sub-component in the historical traffic feature model, μ is the mean vector of each sub-component in the historical traffic feature model, and σ is the historical traffic feature model The standard deviation vector of each subcomponent.
10. 如权利要求 1至 6之一所述的城市道路交通异常检测方法, 其特征在于, 步骤 5)异常严重性量化 表征包括: The urban road traffic anomaly detecting method according to any one of claims 1 to 6, wherein the step 5) the abnormal severity quantification characterization comprises:
5a) 将各个时空子区的
Figure imgf000023_0001
5a) will be in each time and space sub-area
Figure imgf000023_0001
5b) 计算各个时空子区的交通异常指数^ 10。  5b) Calculate the traffic anomaly index ^ 10 for each time-space subzone.
PCT/IB2017/058531 2016-12-30 2017-12-30 Method for detecting traffic anomaly of urban road WO2018122801A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB1909405.1A GB2572717B (en) 2016-12-30 2017-12-30 A Method for Urban Traffic Incident Detecting
CN201780050755.4A CN109643485B (en) 2016-12-30 2017-12-30 Urban road traffic abnormity detection method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IBPCT/IB2016/058105 2016-12-30
PCT/IB2016/058105 WO2018122585A1 (en) 2016-12-30 2016-12-30 Method for urban road traffic incident detecting based on floating-car data

Publications (1)

Publication Number Publication Date
WO2018122801A1 true WO2018122801A1 (en) 2018-07-05

Family

ID=59713568

Family Applications (7)

Application Number Title Priority Date Filing Date
PCT/IB2016/058105 WO2018122585A1 (en) 2016-12-30 2016-12-30 Method for urban road traffic incident detecting based on floating-car data
PCT/IB2017/058536 WO2018122806A1 (en) 2016-12-30 2017-12-30 Travel time distribtuion-based multimodal traffic anomaly detection method
PCT/IB2017/058533 WO2018122803A1 (en) 2016-12-30 2017-12-30 Smart road traffic anomaly detection method
PCT/IB2017/058531 WO2018122801A1 (en) 2016-12-30 2017-12-30 Method for detecting traffic anomaly of urban road
PCT/IB2017/058534 WO2018122804A1 (en) 2016-12-30 2017-12-30 Road traffic anomaly detection method using non-isometric time/space division
PCT/IB2017/058532 WO2018122802A1 (en) 2016-12-30 2017-12-30 Multimodal road traffic anomaly detection method
PCT/IB2017/058535 WO2018122805A1 (en) 2016-12-30 2017-12-30 Traffic anomaly detection method based on time distribution of travels

Family Applications Before (3)

Application Number Title Priority Date Filing Date
PCT/IB2016/058105 WO2018122585A1 (en) 2016-12-30 2016-12-30 Method for urban road traffic incident detecting based on floating-car data
PCT/IB2017/058536 WO2018122806A1 (en) 2016-12-30 2017-12-30 Travel time distribtuion-based multimodal traffic anomaly detection method
PCT/IB2017/058533 WO2018122803A1 (en) 2016-12-30 2017-12-30 Smart road traffic anomaly detection method

Family Applications After (3)

Application Number Title Priority Date Filing Date
PCT/IB2017/058534 WO2018122804A1 (en) 2016-12-30 2017-12-30 Road traffic anomaly detection method using non-isometric time/space division
PCT/IB2017/058532 WO2018122802A1 (en) 2016-12-30 2017-12-30 Multimodal road traffic anomaly detection method
PCT/IB2017/058535 WO2018122805A1 (en) 2016-12-30 2017-12-30 Traffic anomaly detection method based on time distribution of travels

Country Status (3)

Country Link
CN (7) CN109923595B (en)
GB (11) GB201711408D0 (en)
WO (7) WO2018122585A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111192456A (en) * 2020-01-14 2020-05-22 泉州市益典信息科技有限公司 Road traffic operation situation multi-time scale prediction method
CN111540194A (en) * 2020-03-23 2020-08-14 深圳奇迹智慧网络有限公司 Vehicle monitoring data processing method and device, computer equipment and storage medium
CN112863185A (en) * 2021-01-15 2021-05-28 林安齐 Intelligent management system and method for road traffic equipment
CN115086910A (en) * 2022-04-27 2022-09-20 同济大学 Real-time road segment division method based on dynamic and static characteristics under V2X environment
CN115086910B (en) * 2022-04-27 2024-05-31 同济大学 Real-time road section dividing method based on dynamic and static characteristics in V2X environment

Families Citing this family (106)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108388540B (en) * 2018-03-01 2022-09-23 兰州交通大学 Road network selection method based on fuzzy comprehensive evaluation
CN110766936A (en) * 2018-07-25 2020-02-07 高德软件有限公司 Traffic running state sensing method and system based on multi-source data fusion
WO2020078899A1 (en) * 2018-10-15 2020-04-23 Starship Technologies Oü Method and system for operating a mobile robot
DE102018130457B4 (en) * 2018-11-30 2021-03-25 Bayerische Motoren Werke Aktiengesellschaft System and process for map matching
US11100793B2 (en) * 2019-01-15 2021-08-24 Waycare Technologies Ltd. System and method for detection and quantification of irregular traffic congestion
CN109871876B (en) * 2019-01-22 2023-08-08 东南大学 Expressway road condition identification and prediction method based on floating car data
CN111613047B (en) * 2019-02-26 2022-07-08 阿里巴巴集团控股有限公司 Information processing method and device
CN109948701B (en) * 2019-03-19 2022-08-16 太原科技大学 Data clustering method based on space-time correlation among tracks
CN109918368A (en) * 2019-03-27 2019-06-21 成都市公安科学技术研究所 A kind of system and method that vehicle driver is identified by Track association degree
CN110197293A (en) * 2019-04-15 2019-09-03 同济大学 Timesharing rent-a-car Travel Demand Forecasting method and system based on floating car data
CN110322054B (en) * 2019-06-14 2023-04-28 中交第一公路勘察设计研究院有限公司 Optimized layout method of road section traffic monitor
CN110298500B (en) * 2019-06-19 2022-11-08 大连理工大学 Urban traffic track data set generation method based on taxi data and urban road network
CN112135239B (en) * 2019-06-25 2023-03-17 杭州萤石软件有限公司 Position monitoring method and device
US11587448B2 (en) 2019-07-26 2023-02-21 General Electric Company Systems and methods for manifolds learning of airline network data
CN110634288B (en) * 2019-08-30 2022-06-21 上海电科智能系统股份有限公司 Multi-dimensional urban traffic abnormal event identification method based on ternary Gaussian mixture model
CN110533249B (en) * 2019-09-02 2021-09-14 合肥工业大学 Metallurgical enterprise energy consumption prediction method based on integrated long-term and short-term memory network
CN110716925B (en) * 2019-09-05 2023-08-04 中电科大数据研究院有限公司 Cross-border behavior recognition method based on track analysis
CN110674858B (en) * 2019-09-16 2023-06-09 长沙理工大学 Traffic public opinion detection method based on space-time correlation and big data mining
CN110795467B (en) * 2019-09-26 2024-02-27 腾讯大地通途(北京)科技有限公司 Traffic rule data processing method, device, storage medium and computer equipment
CN110895598B (en) * 2019-10-23 2021-09-14 山东九州信泰信息科技股份有限公司 Real-time anomaly detection parallelization method based on multi-source prediction
US11587433B2 (en) 2019-10-31 2023-02-21 Here Global B.V. Method, apparatus, and system for probe anomaly detection
JP7226248B2 (en) * 2019-10-31 2023-02-21 トヨタ自動車株式会社 Communication device and abnormality determination device
CN110956803A (en) * 2019-11-14 2020-04-03 深圳尚桥信息技术有限公司 Multi-mode-based vehicle detection method and system
CN110941278B (en) * 2019-12-20 2023-05-23 交控科技股份有限公司 Dynamic safety analysis method in station
CN111192454B (en) * 2020-01-07 2021-06-01 中山大学 Traffic abnormity identification method and system based on travel time evolution and storage medium
CN111311910B (en) * 2020-02-19 2020-12-22 中南大学 Abnormal track detection method for multi-level road-level floating vehicle
CN111310833B (en) * 2020-02-19 2022-11-15 清华大学 Travel mode identification method based on Bayesian neural network
CN112164223B (en) * 2020-02-27 2022-04-29 浙江恒隆智慧科技集团有限公司 Intelligent traffic information processing method and device based on cloud platform
CN111353828B (en) * 2020-03-30 2023-09-12 中国工商银行股份有限公司 Method and device for predicting number of people coming to store at website
CN111475774B (en) * 2020-03-31 2022-03-18 清华大学 Method and device for detecting abnormal state of photovoltaic power station equipment
CN111583642B (en) * 2020-05-06 2021-03-23 北京航空航天大学 Traffic track streaming type big data real-time processing method
CN111580500B (en) * 2020-05-11 2022-04-12 吉林大学 Evaluation method for safety of automatic driving automobile
CN111722252B (en) * 2020-05-14 2023-04-11 江苏北斗卫星应用产业研究院有限公司 High-precision operation surface identification method based on track data
CN111739289B (en) * 2020-06-02 2024-02-20 腾讯科技(深圳)有限公司 Method and device for processing vehicle early warning information
CN111667015B (en) * 2020-06-11 2023-06-27 深圳市兴海物联科技有限公司 Method and device for detecting state of equipment of Internet of things and detection equipment
CN111709378B (en) * 2020-06-18 2022-07-12 湖南大学 Novel road condition state assessment method based on JS divergence and fuzzy evidence theory
CN111723997A (en) * 2020-06-23 2020-09-29 上海电科智能系统股份有限公司 Automatic generation method of urban major traffic accident data sample based on GAN
CN111932873B (en) * 2020-07-21 2022-10-04 重庆交通大学 Real-time traffic early warning management and control method and system for mountain city hot spot area
CN112085949B (en) * 2020-08-13 2023-06-27 浙江工业大学 Road network vulnerability identification, analysis and coping method based on traffic running condition abnormality
CN112000653B (en) * 2020-08-21 2024-03-29 睿驰达新能源汽车科技(北京)有限公司 Space and time-based regional gridding driving behavior data preprocessing method
CN111986487B (en) * 2020-09-11 2022-02-25 腾讯科技(深圳)有限公司 Road condition information management method and related device
CN112053561B (en) * 2020-09-11 2021-11-23 深兰人工智能芯片研究院(江苏)有限公司 Method, system and device for judging and positioning traffic accident on non-monitored road section
CN112100243B (en) * 2020-09-15 2024-02-20 山东理工大学 Abnormal aggregation detection method based on massive space-time data analysis
CN112215261A (en) * 2020-09-18 2021-01-12 武汉理工大学 Vehicle OD point clustering method, system, device and storage medium based on meta-learning
CN112308116B (en) * 2020-09-28 2023-04-07 济南大学 Self-optimization multi-channel fusion method and system for old-person-assistant accompanying robot
CN112489419B (en) * 2020-10-28 2022-04-26 华为技术有限公司 Method and device for determining road capacity and storage medium
CN112419722B (en) * 2020-11-18 2022-08-30 百度(中国)有限公司 Traffic abnormal event detection method, traffic control method, device and medium
WO2022104790A1 (en) * 2020-11-23 2022-05-27 深圳元戎启行科技有限公司 Travel trajectory determining method and apparatus, computer device, and storage medium
CN112652170B (en) * 2020-12-24 2022-04-08 航天科工智能运筹与信息安全研究院(武汉)有限公司 Method and device for positioning traffic jam hot spot
CN112927497B (en) * 2021-01-14 2023-01-17 阿里巴巴集团控股有限公司 Floating car identification method, related method and device
CN112907949B (en) * 2021-01-20 2022-11-22 北京百度网讯科技有限公司 Traffic anomaly detection method, model training method and device
CN112784963B (en) * 2021-01-22 2022-07-01 重庆邮电大学 Indoor and outdoor seamless positioning method based on simulated annealing optimization BP neural network
CN113221677B (en) * 2021-04-26 2024-04-16 阿波罗智联(北京)科技有限公司 Track abnormality detection method and device, road side equipment and cloud control platform
GB202106070D0 (en) * 2021-04-28 2021-06-09 Tomtom Navigation Bv Methods and systems for determining estimated travel times through a navigable network
CN113240265B (en) * 2021-05-11 2023-10-27 西北工业大学 Urban space division method based on multi-mode traffic data
CN113470347B (en) * 2021-05-20 2022-07-26 上海天壤智能科技有限公司 Congestion identification method and system combining bayonet vehicle passing record and floating vehicle GPS data
CN113312722B (en) * 2021-05-28 2023-05-05 广西大学 Reliability prediction optimization method for key parts of urban rail train
CN113380028B (en) * 2021-06-01 2022-09-06 公安部交通管理科学研究所 Intelligent travel traffic data fusion method and device
CN113313317B (en) * 2021-06-11 2024-04-12 哈尔滨工业大学 INLA algorithm-based shared bicycle use demand prediction method and prediction system
CN113378301B (en) * 2021-06-22 2022-05-24 北京航空航天大学 Importance sampling-based key test case generation method for unmanned vehicle overtaking scene
CN113535510B (en) * 2021-06-24 2024-01-26 北京理工大学 Self-adaptive sampling model optimization method for data acquisition of large-scale data center
CN113436433B (en) * 2021-06-24 2022-06-21 福建师范大学 Efficient urban traffic outlier detection method
CN113343905B (en) * 2021-06-28 2022-06-14 山东理工大学 Method and system for training road abnormity intelligent recognition model and recognizing road abnormity
CN113554869B (en) * 2021-07-01 2022-04-05 华东师范大学 Road closure detection method based on multi-feature fusion
CN113569759B (en) * 2021-07-29 2022-06-10 沭阳新辰公路仪器有限公司 Road falling object identification method and system based on artificial intelligence
US11828860B2 (en) 2021-08-27 2023-11-28 International Business Machines Corporation Low-sampling rate GPS trajectory learning
CN113779169B (en) * 2021-08-31 2023-09-05 西南电子技术研究所(中国电子科技集团公司第十研究所) Space-time data stream model self-enhancement method
CN113920728B (en) * 2021-10-11 2022-08-12 南京微达电子科技有限公司 Detection and early warning method and system for obstacles thrown on highway
CN113920739B (en) * 2021-10-29 2023-05-23 复旦大学 Traffic data driving frame based on information physical fusion system and construction method
CN114093164B (en) * 2021-11-15 2022-08-19 上海市城乡建设和交通发展研究院 Dynamic traffic flow identification and correction method, device and equipment based on vehicle track
CN114239929A (en) * 2021-11-30 2022-03-25 东南大学 Taxi traffic demand characteristic prediction method based on random forest
CN113888877B (en) * 2021-12-08 2022-03-08 南方科技大学 Traffic state detection method, device, equipment and storage medium
CN114495498B (en) * 2022-01-20 2023-01-10 青岛海信网络科技股份有限公司 Traffic data distribution effectiveness judging method and device
CN114626682A (en) * 2022-02-17 2022-06-14 华录智达科技股份有限公司 Urban public transport network planning method considering aggregation area
CN114217333A (en) * 2022-02-21 2022-03-22 北京交研智慧科技有限公司 Road network topological abnormal position positioning method and device and related equipment
CN114822066B (en) * 2022-04-14 2023-06-13 北京百度网讯科技有限公司 Vehicle positioning method, device, electronic equipment and storage medium
CN114492544B (en) * 2022-04-15 2022-07-26 腾讯科技(深圳)有限公司 Model training method and device and traffic incident occurrence probability evaluation method and device
CN115019507B (en) * 2022-06-06 2023-12-01 上海旷途科技有限公司 Urban road network travel time reliability real-time estimation method
CN114996251A (en) * 2022-06-07 2022-09-02 武汉众智数字技术有限公司 Safety service management method and system based on radio station GPS track data
CN115171372B (en) * 2022-06-20 2023-10-24 青岛海信网络科技股份有限公司 Traffic abnormality detection method, equipment and device
CN115311846B (en) * 2022-06-24 2023-08-11 华东师范大学 Factory road congestion prediction method and prediction system combining truck task states
CN114863685B (en) * 2022-07-06 2022-09-27 北京理工大学 Traffic participant trajectory prediction method and system based on risk acceptance degree
CN115185780B (en) * 2022-07-21 2023-10-24 北京国联视讯信息技术股份有限公司 Data acquisition method and system based on industrial Internet
CN115440044B (en) * 2022-07-29 2023-10-13 深圳高速公路集团股份有限公司 Highway multisource event data fusion method, device, storage medium and terminal
CN115691111B (en) * 2022-09-22 2024-01-23 连云港杰瑞电子有限公司 Network-connected vehicle minimum permeability calculation method suitable for traffic flow data acquisition
CN115661672B (en) * 2022-10-24 2023-03-14 中国人民解放军海军工程大学 PolSAR image CFAR detection method and system based on GMM
CN115798198B (en) * 2022-11-03 2024-04-05 公安部交通管理科学研究所 Urban road network travel time distribution estimation method based on data fusion
CN115985088B (en) * 2022-11-30 2024-01-26 东南大学 Traffic flow stability improving method based on vehicle collision time feedback
CN116029736B (en) * 2023-01-05 2023-09-29 浙江警察学院 Real-time detection and safety early warning method and system for abnormal track of network vehicle
CN116165274B (en) * 2023-02-17 2023-11-14 哈尔滨工业大学 Urban orbit damage identification method based on Bayesian global sparse probability principal component analysis
CN115984077B (en) * 2023-02-24 2023-06-13 南方科技大学 Traffic abnormal flow causal detection method and equipment
CN116245362B (en) * 2023-03-07 2023-12-12 北京磁浮有限公司 Urban rail contact network risk assessment method and related device
CN117115759B (en) * 2023-04-12 2024-04-09 盐城工学院 Road side traffic target detection system and method based on category guidance
CN116703004B (en) * 2023-07-19 2023-09-29 共享数据(福建)科技有限公司 Water system river basin intelligent patrol method and device based on pre-training model
CN116777120B (en) * 2023-08-16 2023-10-27 合肥综合性国家科学中心人工智能研究院(安徽省人工智能实验室) Urban road mobile source carbon emission calculation method based on road network OD pairs
CN116820014B (en) * 2023-08-24 2023-11-14 山西交通科学研究院集团有限公司 Intelligent monitoring and early warning method and system for traffic electromechanical equipment
CN116860840B (en) * 2023-09-05 2023-11-07 陕西交通电子工程科技有限公司 Rapid retrieval method for highway pavement information
CN117419732A (en) * 2023-10-10 2024-01-19 中国船舶集团有限公司第七〇九研究所 Road network-based perception target positioning deviation rectifying method, equipment and storage medium
CN117372969B (en) * 2023-12-08 2024-05-10 暗物智能科技(广州)有限公司 Monitoring scene-oriented abnormal event detection method
CN117456482B (en) * 2023-12-25 2024-05-10 暗物智能科技(广州)有限公司 Abnormal event identification method and system for traffic monitoring scene
CN117517596B (en) * 2024-01-08 2024-03-15 辽宁中消安全设备有限公司 Method and system for monitoring combustible and toxic harmful gases in real time based on Internet of things
CN117572470B (en) * 2024-01-15 2024-04-19 广东邦盛北斗科技股份公司 Beidou system positioning updating method and system applied to artificial intelligence
CN117575546B (en) * 2024-01-17 2024-04-05 北京白龙马云行科技有限公司 Background management system for network vehicle-restraining platform
CN117706478B (en) * 2024-02-02 2024-05-03 腾讯科技(深圳)有限公司 Positioning drift identification method, device, equipment and storage medium
CN117688505B (en) * 2024-02-04 2024-04-19 河海大学 Prediction method and system for vegetation large-range regional negative abnormality
CN117894181B (en) * 2024-03-14 2024-05-07 北京动视元科技有限公司 Global traffic abnormal condition integrated monitoring method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101976505A (en) * 2010-10-25 2011-02-16 中国科学院深圳先进技术研究院 Traffic evaluation method and system
CN103971521A (en) * 2014-05-19 2014-08-06 清华大学 Method and device for detecting road traffic abnormal events in real time
CN104573116A (en) * 2015-02-05 2015-04-29 哈尔滨工业大学 Taxi GPS data mining based traffic abnormality recognition method
CN104657746A (en) * 2015-01-29 2015-05-27 电子科技大学 Anomaly detection method based on vehicle trajectory similarity
KR101598343B1 (en) * 2014-09-23 2016-02-29 목원대학교 산학협력단 System for automatically Identifying spatio-temporal congestion patterns and method thereof
US20160267789A1 (en) * 2014-12-01 2016-09-15 Here Global B.V. Traffic Classification Based on Spatial Neighbor Model

Family Cites Families (65)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05250598A (en) * 1992-03-06 1993-09-28 Matsushita Electric Ind Co Ltd Traffic information providing device
JPH07220193A (en) * 1994-01-28 1995-08-18 Nagoya Denki Kogyo Kk Traffic condition discriminating device
JP4200747B2 (en) * 2001-11-27 2008-12-24 富士ゼロックス株式会社 Movement information classification device, movement information classification method, and movement information classification program
US7187800B2 (en) * 2002-08-02 2007-03-06 Computerized Medical Systems, Inc. Method and apparatus for image segmentation using Jensen-Shannon divergence and Jensen-Renyi divergence
DE102005002760B4 (en) * 2004-01-20 2018-05-24 Volkswagen Ag Device and method for accident prevention in motor vehicles
US7522995B2 (en) * 2004-02-05 2009-04-21 Nortrup Edward H Method and system for providing travel time information
CN102289935B (en) * 2006-03-03 2015-12-16 因瑞克斯有限公司 Use the data estimation road traffic condition from Mobile data source
US7460948B2 (en) * 2006-03-10 2008-12-02 Gm Global Technology Operations, Inc. Traffic notification system for reporting traffic anomalies based on historical probe vehicle data
JP4591395B2 (en) * 2006-03-31 2010-12-01 アイシン・エィ・ダブリュ株式会社 Navigation system
CN100492434C (en) * 2006-11-30 2009-05-27 上海交通大学 Traffic flow state analysis required detection vehicle sampling quantity obtaining method
JP4539666B2 (en) * 2007-03-19 2010-09-08 アイシン・エィ・ダブリュ株式会社 Traffic condition calculation system
CN101373559B (en) * 2007-08-24 2010-08-18 同济大学 Method for evaluating city road net traffic state based on floating vehicle data
DE102007050154A1 (en) * 2007-10-19 2009-04-23 Siemens Ag Prediction system for predicting driving time, vehicle-based route planning system, traffic information system and method
CN101286269A (en) * 2008-05-26 2008-10-15 北京捷讯畅达科技发展有限公司 Traffic flow forecasting system combining dynamic real time traffic data
CN101620781B (en) * 2008-06-30 2012-08-29 株式会社查纳位资讯情报 System and method for forecasting passenger information and searching the same
CN201262784Y (en) * 2008-09-28 2009-06-24 华南理工大学 Detection and evaluation system for urban signal control road junction traffic status based on data characteristics
CN101727747A (en) * 2009-12-16 2010-06-09 南京信息工程大学 Abnormal road jam alarming method based on flow detection
CN101794510A (en) * 2009-12-30 2010-08-04 北京世纪高通科技有限公司 Data processing method and device of floating cars
CN101950477B (en) * 2010-08-23 2012-05-23 北京世纪高通科技有限公司 Method and device for processing traffic information
CN201946111U (en) * 2011-01-13 2011-08-24 西安邮电学院 Data acquisition and releasing node for early warning system of traffic abnormal events in express way
CN201927174U (en) * 2011-01-13 2011-08-10 西安邮电学院 Early warning system for traffic abnormal events of expressway
JP5716935B2 (en) * 2011-04-20 2015-05-13 日本電気株式会社 Traffic condition monitoring system, method, and program
CN202075864U (en) * 2011-04-28 2011-12-14 北京市劳动保护科学研究所 Abnormal traffic state automatic detection system
CN102231235B (en) * 2011-04-29 2016-02-24 陈伟 A kind of traffic flow outlier detection localization method
US8775059B2 (en) * 2011-10-26 2014-07-08 Right There Ware LLC Method and system for fleet navigation, dispatching and multi-vehicle, multi-destination routing
US20130166188A1 (en) * 2011-12-21 2013-06-27 Microsoft Corporation Determine Spatiotemporal Causal Interactions In Data
CN102637357B (en) * 2012-03-27 2013-11-06 山东大学 Regional traffic state assessment method
US9141874B2 (en) * 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
CN102855638B (en) * 2012-08-13 2015-02-11 苏州大学 Detection method for abnormal behavior of vehicle based on spectrum clustering
CN103050005B (en) * 2012-11-16 2015-06-03 北京交通大学 Method and system for space and time analysis of urban road traffic states
CN103065466B (en) * 2012-11-19 2015-01-21 北京世纪高通科技有限公司 Detection method of and detection device for abnormal traffic conditions
CN103065468A (en) * 2012-12-14 2013-04-24 中国航天系统工程有限公司 Assessment method of traffic information and device
CN103903433B (en) * 2012-12-27 2016-09-14 南京中兴新软件有限责任公司 The Real-time and Dynamic method of discrimination of a kind of road traffic state and device
CN103903436A (en) * 2012-12-28 2014-07-02 上海优途信息科技有限公司 Expressway traffic jam detecting method and system based on floating car
CN103258427B (en) * 2013-04-24 2015-03-11 北京工业大学 Urban expressway traffic real-time monitoring system and method based on information physical network
CN103247177B (en) * 2013-05-21 2016-01-20 清华大学 Large-scale road network traffic flow real-time dynamic prediction system
CN103309964A (en) * 2013-06-03 2013-09-18 广州市香港科大霍英东研究院 High-efficiency visible monitoring analysis system for large-scale traffic data
CN103354030B (en) * 2013-07-29 2015-06-24 吉林大学 Method for determining road traffic situation by utilizing floating bus CAN bus information
CN103514743B (en) * 2013-09-28 2016-01-06 上海电科智能系统股份有限公司 A kind of abnormal traffic state characteristic recognition method of real-time index-matched memory range
US9582999B2 (en) * 2013-10-31 2017-02-28 Here Global B.V. Traffic volume estimation
CN103632546B (en) * 2013-11-27 2016-01-20 中国航天系统工程有限公司 A kind of Urban Road Traffic Accidents impact prediction method based on floating car data
US9240123B2 (en) * 2013-12-13 2016-01-19 Here Global B.V. Systems and methods for detecting road congestion and incidents in real time
CN104282151A (en) * 2014-09-30 2015-01-14 北京交通发展研究中心 Real-time floating vehicle path matching method based on high-frequency satellite positioning data
CN104408958B (en) * 2014-11-11 2016-09-28 河海大学 A kind of city dynamic route Travel Time Estimation Method
CN104408924B (en) * 2014-12-04 2016-06-01 深圳北航新兴产业技术研究院 A kind of urban road abnormal traffic stream detection method based on coupled hidden markov model
CN104537833B (en) * 2014-12-19 2017-03-29 深圳大学 A kind of accident detection method and system
CN104504901B (en) * 2014-12-29 2016-06-08 浙江银江研究院有限公司 A kind of traffic abnormity point detecting method based on multidimensional data
CN104778834B (en) * 2015-01-23 2017-02-22 哈尔滨工业大学 Urban road traffic jam judging method based on vehicle GPS data
KR101728219B1 (en) * 2015-02-23 2017-04-19 전북대학교산학협력단 Traffic Dispersion Method and System with Space Time Control based on Duplex Communication
US11482100B2 (en) * 2015-03-28 2022-10-25 Intel Corporation Technologies for detection of anomalies in vehicle traffic patterns
CN104778837B (en) * 2015-04-14 2017-12-05 吉林大学 A kind of road traffic operation situation Multiple Time Scales Forecasting Methodology
CN104809787B (en) * 2015-04-23 2017-11-17 中电科安(北京)科技股份有限公司 A kind of intelligent passenger volume statistic device based on camera
US9576481B2 (en) * 2015-04-30 2017-02-21 Here Global B.V. Method and system for intelligent traffic jam detection
CN104809878B (en) * 2015-05-14 2017-03-22 重庆大学 Method for detecting abnormal condition of urban road traffic by utilizing GPS (Global Positioning System) data of public buses
CN105005760B (en) * 2015-06-11 2018-04-24 华中科技大学 A kind of recognition methods again of the pedestrian based on Finite mixture model
CN105261212B (en) * 2015-09-06 2018-06-19 中山大学 A kind of trip space-time analysis method based on GPS data from taxi map match
CN105404890B (en) * 2015-10-13 2018-10-16 广西师范学院 A kind of criminal gang's method of discrimination for taking track space and time order into account
CN105513350A (en) * 2015-11-30 2016-04-20 华南理工大学 Time-phased multi-parameter short-term traffic flow prediction method based on time-space characteristics
CN105489008B (en) * 2015-12-28 2018-10-19 北京握奇智能科技有限公司 Urban road congestion computational methods and system based on Floating Car satellite location data
CN105608895B (en) * 2016-03-04 2017-11-10 大连理工大学 A kind of urban traffic blocking section detecting method based on local outlier factor
CN105761488B (en) * 2016-03-30 2018-11-23 湖南大学 Real-time extreme learning machine Short-time Traffic Flow Forecasting Methods based on fusion
CN106067248B (en) * 2016-05-30 2018-08-24 重庆大学 A kind of traffic status of express way method of estimation considering speed dispersion characteristic
CN106023592A (en) * 2016-07-11 2016-10-12 南京邮电大学 Traffic jam detection method based on GPS data
CN106228808B (en) * 2016-08-05 2019-04-30 北京航空航天大学 City expressway travel time prediction method based on Floating Car space-time grid data
CN106781468B (en) * 2016-12-09 2018-06-15 大连理工大学 Link Travel Time Estimation method based on built environment and low frequency floating car data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101976505A (en) * 2010-10-25 2011-02-16 中国科学院深圳先进技术研究院 Traffic evaluation method and system
CN103971521A (en) * 2014-05-19 2014-08-06 清华大学 Method and device for detecting road traffic abnormal events in real time
KR101598343B1 (en) * 2014-09-23 2016-02-29 목원대학교 산학협력단 System for automatically Identifying spatio-temporal congestion patterns and method thereof
US20160267789A1 (en) * 2014-12-01 2016-09-15 Here Global B.V. Traffic Classification Based on Spatial Neighbor Model
CN104657746A (en) * 2015-01-29 2015-05-27 电子科技大学 Anomaly detection method based on vehicle trajectory similarity
CN104573116A (en) * 2015-02-05 2015-04-29 哈尔滨工业大学 Taxi GPS data mining based traffic abnormality recognition method

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111192456A (en) * 2020-01-14 2020-05-22 泉州市益典信息科技有限公司 Road traffic operation situation multi-time scale prediction method
CN111540194A (en) * 2020-03-23 2020-08-14 深圳奇迹智慧网络有限公司 Vehicle monitoring data processing method and device, computer equipment and storage medium
CN111540194B (en) * 2020-03-23 2021-08-10 深圳奇迹智慧网络有限公司 Vehicle monitoring data processing method and device, computer equipment and storage medium
CN112863185A (en) * 2021-01-15 2021-05-28 林安齐 Intelligent management system and method for road traffic equipment
CN115086910A (en) * 2022-04-27 2022-09-20 同济大学 Real-time road segment division method based on dynamic and static characteristics under V2X environment
CN115086910B (en) * 2022-04-27 2024-05-31 同济大学 Real-time road section dividing method based on dynamic and static characteristics in V2X environment

Also Published As

Publication number Publication date
GB2587588A (en) 2021-03-31
GB2582532A (en) 2020-09-23
CN109716414A (en) 2019-05-03
GB201909409D0 (en) 2019-08-14
WO2018122585A1 (en) 2018-07-05
CN110168520A (en) 2019-08-23
GB2572717B (en) 2020-08-05
GB2572717A (en) 2019-10-09
GB2569924A (en) 2019-07-03
CN109844832B (en) 2021-06-15
WO2018122806A1 (en) 2018-07-05
GB201905907D0 (en) 2019-06-12
GB2582532B (en) 2021-02-24
WO2018122805A1 (en) 2018-07-05
CN109923595B (en) 2021-07-13
WO2018122802A1 (en) 2018-07-05
CN109923595A (en) 2019-06-21
CN109791729B (en) 2021-10-15
CN109844832A (en) 2019-06-04
GB202100341D0 (en) 2021-02-24
GB2588556B (en) 2021-10-27
GB201711408D0 (en) 2017-08-30
GB2587588B (en) 2021-10-27
GB2588556A (en) 2021-04-28
GB202009833D0 (en) 2020-08-12
WO2018122803A1 (en) 2018-07-05
CN109791729A (en) 2019-05-21
GB2569924B (en) 2021-02-24
CN109997179A (en) 2019-07-09
GB2582531B (en) 2021-02-24
GB201909405D0 (en) 2019-08-14
GB201909408D0 (en) 2019-08-14
WO2018122804A1 (en) 2018-07-05
CN109643485A (en) 2019-04-16
GB201909406D0 (en) 2019-08-14
GB2582531A (en) 2020-09-23
GB202009834D0 (en) 2020-08-12
GB201909407D0 (en) 2019-08-14
GB202100340D0 (en) 2021-02-24
GB2569924A8 (en) 2021-01-27
CN109643485B (en) 2021-04-30
CN109716414B (en) 2021-10-15

Similar Documents

Publication Publication Date Title
CN109791729B (en) Traffic anomaly detection method based on travel time distribution
Aslam et al. City-scale traffic estimation from a roving sensor network
CN110176139A (en) A kind of congestion in road identification method for visualizing based on DBSCAN+
US20180233035A1 (en) Method and filter for floating car data sources
CN110838232A (en) Single vehicle OD (origin-destination) acquisition method based on vehicle-passing electric alarm data
Carli et al. Automated evaluation of urban traffic congestion using bus as a probe
Robinson The development and application of an urban link travel time model using data derived from inductive loop detectors
Peng et al. Evaluation of roadway spatial-temporal travel speed estimation using mapped low-frequency AVL probe data
Snowdon et al. Spatiotemporal traffic volume estimation model based on GPS samples
Gong et al. Estimating link travel time with sparse GPS data on highway corridors
Qin et al. Spatiotemporal K-Nearest Neighbors Algorithm and Bayesian Approach for Estimating Urban Link Travel Time Distribution From Sparse GPS Trajectories
Wang Calibration of big traffic data for a transport smart city
Finogeev et al. Proactive big data analysis for traffic accident prediction
Graser et al. Fcd in the real world–system capabilities and applications
Syrjärinne Urban Traffic Analysis with Bus Location Data
Gündüz et al. Vehicle travel time estimation using sequence prediction
Chidlovskii et al. Traffic delay detection by mining ticket validation transactions
Xiang et al. Network-wide performance assessment of urban traffic based on probe vehicle data
Babu et al. Smart Transport System for Passenger Comfort using IoT
Liu et al. Overall Architecture Design of Digital Smart Highway Based on DL Algorithm
PUGLIESE et al. USING GPS DATA FROM A SAMPLE OF PRIVATE CARS FOR MODELLING THE URBAN TRAFFIC
Wang Link Travel Speed Estimation Using Transit GPS Data
Li Improving Estimation Accuracy of GPS-based Arterial Travel Time using k-Nearest Neighbors Algorithm by Zheng Li
Shi Bus Arrival Time Reliability Analyses and Dynamic Prediction Model Based on Multi-source Data
Xian et al. The probe allocation problem

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17887282

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 201909405

Country of ref document: GB

Kind code of ref document: A

Free format text: PCT FILING DATE = 20171230

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17887282

Country of ref document: EP

Kind code of ref document: A1