CN114613124A - Traffic information processing method, device, terminal and computer readable storage medium - Google Patents
Traffic information processing method, device, terminal and computer readable storage medium Download PDFInfo
- Publication number
- CN114613124A CN114613124A CN202011395101.5A CN202011395101A CN114613124A CN 114613124 A CN114613124 A CN 114613124A CN 202011395101 A CN202011395101 A CN 202011395101A CN 114613124 A CN114613124 A CN 114613124A
- Authority
- CN
- China
- Prior art keywords
- traffic
- information
- points
- track
- adjacency
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000010365 information processing Effects 0.000 title abstract description 27
- 238000003672 processing method Methods 0.000 title abstract description 17
- 238000000034 method Methods 0.000 claims abstract description 52
- 239000011159 matrix material Substances 0.000 claims abstract description 48
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 14
- 230000003595 spectral effect Effects 0.000 claims abstract description 14
- 238000004590 computer program Methods 0.000 claims description 21
- 238000004364 calculation method Methods 0.000 claims description 11
- 238000012545 processing Methods 0.000 claims description 7
- 101100481876 Danio rerio pbk gene Proteins 0.000 claims description 5
- 101100481878 Mus musculus Pbk gene Proteins 0.000 claims description 5
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 230000004044 response Effects 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 2
- 238000012163 sequencing technique Methods 0.000 claims 1
- 238000004891 communication Methods 0.000 abstract description 5
- 238000004458 analytical method Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 241000233805 Phoenix Species 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000029305 taxis Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0125—Traffic data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2323—Non-hierarchical techniques based on graph theory, e.g. minimum spanning trees [MST] or graph cuts
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/01—Detecting movement of traffic to be counted or controlled
- G08G1/0104—Measuring and analyzing of parameters relative to traffic conditions
- G08G1/0125—Traffic data processing
- G08G1/0133—Traffic data processing for classifying traffic situation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Remote Sensing (AREA)
- Discrete Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention is suitable for the field of intelligent traffic, and provides a traffic information processing method, a device, a terminal and a computer readable storage medium, wherein the method comprises the following steps: acquiring information of traffic points; converting the information of the traffic points into traffic track information; determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information; obtaining a high-dimensional map characteristic matrix of the traffic network by using a spectral clustering method based on the information of the traffic points, the adjacency relation of any two adjacent traffic points and the passing times; extracting the category information of the traffic points from the high-dimensional representation; and classifying and storing the traffic points based on the class information to form a database. In the technical scheme, the similarity between the traffic points is measured by using the track communication times, and the relation between the spatio-temporal data is represented from the angle of a graph network by using a spectral clustering algorithm, so that the clustering result of the traffic points is more consistent with the actual traffic network condition.
Description
Technical Field
The invention belongs to the field of intelligent transportation, and particularly relates to a traffic information processing method, a traffic information processing device, traffic information processing equipment and a computer-readable storage medium.
Background
In the fields of urban calculation and intelligent transportation, a plane clustering method is widely applied, such as traffic passenger flow trend analysis and prediction, public transportation area division and the like. By using the clustering method, the planes of the city can be divided into different classes, each class may represent different roads, cells and the like, and business circles, residences, congested road sections and the like can be further mined from the classes.
In recent years, with the development of urban big data and intelligent transportation technology, more and more information can be mined from the data, and support is further provided for different applications, such as business district mining, accurate advertisement, traffic capacity analysis, passenger flow prediction and the like. For convenience of analysis, the applications all need a basic premise, that is, a city plan is discretized, that is, the plane of the city is divided into different small blocks, so that each small block can be labeled with different labels to support further analysis.
Generally, the collected data are all GPS data reported by traffic participants such as buses, taxi appointments, taxis, subways and private cars, and the GPS data represent different roads which can pass through in a city, and how to divide the GPS points into different clusters is a fundamental and important problem.
For spatial plane clustering, there are currently many methods, such as K-Means, DBSCAN, and a large number of variants based on these two methods, and GPS points can be classified into different classes by calculating euclidean distances between GPS points, that is, the spatial planes are classified into different blocks by combining these two methods.
The main problem with the above prior art is that they use euclidean distance as a measure, i.e. the spatial distance relationship. In practical scenarios, however, objects (such as vehicles or people) typically move in a spatial network, i.e., in a network-structured manner, rather than in euclidean planar space. For example, vehicles are often moving along urban roads, and although two roads may be closely spaced, it is virtually impossible for a vehicle to move across a barrier from one road to the other. In K-Means or DBSCAN with euclidean distance measurement, the GPS reported by the vehicles on the two roads is considered as a category, but obviously, the category does not conform to the actual situation. That is, clustering using spatial distance will classify neighboring points that are unlikely to intersect at all into the same class.
Disclosure of Invention
In view of this, embodiments of the present invention provide a traffic information processing method, an apparatus, a terminal, and a computer-readable storage medium, so as to solve the problem that a complex urban traffic network cannot be represented due to the use of euclidean distance in a conventional clustering algorithm when traffic information processing is performed.
A first aspect of an embodiment of the present invention provides a method for processing traffic information, including:
acquiring information of traffic points;
converting the information of the traffic points into traffic track information;
determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information;
based on the information of the traffic points, the adjacency relation between any two adjacent traffic points and the passing times, obtaining a high-dimensional representation of a traffic network formed by the traffic points by using a spectral clustering method;
extracting the category information of the traffic points from the high-dimensional representation;
and classifying and storing the information of the traffic points based on the class information to form a database.
A second aspect of an embodiment of the present invention provides a traffic information processing apparatus, including:
an acquisition unit configured to acquire information of a traffic point;
the conversion unit is used for converting the information of the traffic points into traffic track information;
the determining unit is used for determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information;
the calculation unit is used for obtaining a high-dimensional representation of a traffic network formed by the traffic points by using a spectral clustering method based on the information of the traffic points, the adjacency relation of any two adjacent traffic points and the passing times;
the extraction unit is used for extracting the category information of the traffic points from the high-dimensional representation;
and the storage unit is used for classifying and storing the information of the traffic points based on the class information to form a database.
A third aspect of the embodiments of the present invention provides a terminal, including a memory, a processor, and a computer program stored in the memory and operable on the processor, where the processor implements the steps of the method when executing the computer program.
A fourth aspect of embodiments of the present invention provides a computer-readable storage medium, which stores a computer program, characterized in that the computer program, when executed by a processor, implements the steps of the method as described above.
Compared with the prior art, the embodiment of the invention has the following beneficial effects:
in the technical scheme, the similarity between the traffic points is measured by using the track communication times, and the relation between the space-time data is represented from the angle of a graph network by using a spectral clustering algorithm, so that the clustering result of the traffic points is more accordant with the actual traffic network condition.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the embodiments or the prior art descriptions will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a flowchart of a first embodiment of a traffic information processing method of the present invention;
fig. 2 is a detailed flowchart of S13 in the first embodiment of the traffic information processing method of the present invention;
fig. 3 is a detailed flowchart of S14 in the first embodiment of the traffic information processing method of the present invention;
fig. 4 is a schematic structural diagram of a distributed database system in the first embodiment of the traffic information processing method of the present invention;
FIG. 5 is a graph of the effects of different clustering algorithms;
FIG. 6 is a flow chart of a second embodiment of a traffic information processing method of the present invention;
FIG. 7 is a data query latency test chart;
fig. 8 is a schematic configuration diagram of a first embodiment of a traffic information processing apparatus of the present invention;
fig. 9 is a schematic structural diagram of a first embodiment of the terminal of the present invention.
Detailed Description
In the following description, for purposes of explanation and not limitation, specific details are set forth, such as particular system structures, techniques, etc. in order to provide a thorough understanding of the embodiments of the invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.
In order to illustrate the technical means of the present invention, the following description is given by way of specific examples.
In the embodiment of the traffic information processing method, the information of the traffic points is converted into the information of the traffic tracks by acquiring the information of the traffic points, the adjacency relation and the track passing times of any two adjacent traffic points in the traffic tracks are determined according to the information of the traffic tracks, and the high-dimensional map feature matrix of the traffic network formed by the traffic points is obtained by using a spectral clustering method based on the information of the traffic points, the adjacency relation and the passing times of any two adjacent traffic points. And classifying and storing the traffic points based on the class information of the traffic points extracted from the high-dimensional graph feature matrix to form a database.
In the embodiment of the invention, the similarity between the traffic points is measured by using the track communication times, and the relation between the spatio-temporal data is represented from the angle of a graph network by using a spectral clustering algorithm, so that the clustering result of the traffic points is more consistent with the actual traffic network condition.
Fig. 1 is a flowchart of a first embodiment of a traffic information processing method of the present invention, and as shown in fig. 1, the traffic information processing method includes the steps of:
and S11, acquiring the information of the traffic points.
The information of the traffic points includes GPS data reported by traffic participants (such as buses, taxi appointments, taxis, subways, private cars, and the like). Each piece of GPS data may be represented as (participant ID, reporting time, longitude longtitude, latitude). The acquisition mode can be direct reception, or can be reading historical GPS data stored in a local or other storage medium.
And S12, converting the information of the traffic points into traffic track information.
In this embodiment, the traffic track information includes a traffic track sequence, and specifically, the GPS data reported by the same participant is divided into a group according to the participant ID of the GPS data, and the group is sorted according to the reporting time of the GPS data, so as to obtain the traffic track sequence of each traffic participant.
In other embodiments of the method of the present invention, the GPS data may be classified according to other parameters in the GPS data to obtain corresponding traffic track information.
And S13, determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information.
In this implementation, as shown in fig. 2, step S13 specifically includes the following sub-steps:
s131, splitting the traffic track information into adjacent traffic point pairs;
s132, determining the adjacency relation of any two adjacent traffic points according to the adjacent traffic point pairs;
and S133, determining the track passing times of any two adjacent traffic points according to the adjacent traffic point pairs.
Specifically, in sub-step S131, the longitude and latitude of the GPS data may be encoded into a number lx by using a geohash method, i.e., lx represents a specific traffic point, i.e., a pair of longitude and latitude. Assuming that x1, x2, x3 and x4 are four adjacent traffic points one by one, the track sequence can be represented as (i, lx1, t1), (i, lx2, t2), (i, lx3, t3), (i, lx4, t4), wherein i is the ID of a traffic participant reporting corresponding to GPS data, lx1-lx4 are numbers obtained by geohash coding corresponding to the longitude and latitude of the traffic points x1-x4, and t1-t4 is the reporting time of the corresponding GPS data. The respective split adjacent traffic point pairs are denoted as (lx1, lx2), (lx2, lx3) and (lx3, lx 4). By the above method, each sequence of tracks is split into a set of pairs of adjacent traffic points (la, lb), where a and b represent any two adjacent traffic points.
In sub-step S132, each GPS point is considered as a vertex, and an adjacent edge exists between vertices la and lb, i.e., the adjacent relationship e (a, b).
In the sub-step S133, since it is an undirected graph, counting the number of (la, lb) and (lb, la) appearing in the set of adjacent traffic point pairs, the number of passing times w (a, b) of the track of any two adjacent traffic points can be obtained, where w (a, b) is expressed as:
w(a,b)=w′(a,b)+w′(b,a)
and S14, obtaining a high-dimensional map feature matrix of the traffic network formed by the traffic points by using a spectral clustering method based on the information of the traffic points, the adjacency relation of any two adjacent traffic points and the passing times.
In this embodiment, as shown in fig. 3, step S14 specifically includes the following sub-steps:
s141, determining an adjacency matrix by taking the track passing times of any two adjacent traffic points as the weight values of the adjacency relation of the corresponding two adjacent traffic points;
s142, obtaining a weighted undirected adjacency graph of the traffic network formed by the traffic points according to the information of the traffic points, the adjacency relation and the adjacency matrix of any two adjacent traffic points;
and S143, calculating a high-dimensional map feature matrix of the traffic network formed by the traffic points according to the undirected adjacency graph with the weight.
Specifically, in the sub-step S141, the total weight of the adjacency e (a, b) is set to W (a, b), that is, the similarity between any two adjacent traffic points, so that an adjacency matrix W can be obtained.
In sub-step S142, a weighted undirected adjacency graph G (V, E, W) of the traffic network formed by any GPS points is obtained according to the adjacency relation E between any two adjacent traffic points and the adjacency matrix W (the adjacency matrix indicates whether or not there is a connection between all vertices, and if there is a connection, the weight of the connecting edge is the value on the corresponding matrix).
In sub-step S143, based on the weighted undirected adjacency graph G (V, E, W), the degree (degree) di corresponding to each traffic point can be calculated:
the degree corresponding to all the traffic points is calculated, and a degree matrix Deg is obtained.
Based on the degree matrix Deg and the adjacency matrix W, the non-regularized laplacian matrix Lap of the undirected adjacency graph G with weights is calculated by the following formula:
Lap=Deg-W
then, the eigenvectors and eigenvalues of the laplace matrix Lap are solved. To reduce the amount of computation while avoiding the occurrence of a large number of meaningless clusters, the eigenvectors Λ will be addressednUsing the TopK algorithm:
Λn←solve|Lap-λI|=0extract the eigenvalues(λ1,λ2,…,λn);
Λk←TopK(Λn);
specifically, the placian matrix Lap is n × n, and n eigenvalues (n is the number of vertices) are obtained by taking the eigenvalues in the foregoing, and each eigenvalue corresponds to an eigenvector with a length of n. Then, topk (the largest k eigenvalues) is taken according to the eigenvalues, and eigenvectors corresponding to the first k eigenvalues are arranged in sequence to obtain a new k × n matrix, namely the high-dimensional map eigenvalue matrix of the traffic network formed by the GPS points.
And S15, extracting the type information of the traffic points from the high-dimensional map feature matrix.
In this embodiment, by clustering on the high-dimensional map feature matrix, the category information of different traffic points can be extracted, and the extracted category information of the traffic points includes a classification table (longitude, latitude, category number) of GPS points.
And S16, classifying and storing the information of the traffic points based on the class information to form a database.
In this embodiment, after the classification table of the GPS point is obtained, the original GPS data is stored in the database according to the classification table, and then the corresponding GPS data can be searched by using the class number clusterID for further analysis.
In particular, a distributed database may be used to convert and store the raw GPS data. For example, data is stored in a manner of HBase + Phoenix (Phoenix is a plug-in of HBase, and can provide a query interface similar to SQL for HBase, through which SQL can be directly used to query data of HBase), and the implementation of the clustering algorithm is based on Spark-SQL framework, so that the architecture of the whole system is as shown in fig. 4.
The classification is realized by the GPS data RawData reported by the traffic participants and the algorithm operated in the Analysis Engine. The Analysis Engine is a big data framework and a program for program operation, and the algorithm can be realized by Spark-SQL. The Master is a server node used by spark to manage the cluster, and the program is submitted through the Master. The server node worker is a server node of a distributed operation algorithm, a program submitted to the master is divided and then respectively sent to different workers to operate, and a result is sent back to the master after the operation is finished.
The client Query can run on a client terminal device, such as a client computer, and the data stored in the HBASE can be queried by installing the SQL client.
The Spark-SQL realizes the clustering algorithm, stores the GPS data into the HBase according to the generated classification table, and the HBase uses a plurality of servers (namely, Region servers in the figure) as storage media, namely, the GPS data are dispersed on different servers. Specifically, different types of GPS data are placed in different servers by placing the ClusterID at the beginning of a row key Rowkey (HBase queries data through Rowkey), then placing the data with the ClusterID of c1 (the beginning of Rowkey is c1) in a first Server Region Server1, placing the data with the ClusterID of c2 in a second Server Region Server2, and so on.
In addition, in order to optimize the query efficiency, the RowKey of HBase can be designed by using an inverted index:
RowKey=ClusterID+MaxValue-Timestamp+ID
the ClusterID is the classification ID of the cluster, the MaxValue is the maximum value of time, and the reverse index is realized by utilizing the time difference between the ClusterID and the maximum value of the time, namely, the latest data is placed on the upper layer of the database, so that the query efficiency is higher. In general, the data is not uniformly distributed in space, so that the hot spot problem can occur inevitably by using the ClusterID as the prefix of the RowKey. To solve this problem and to make the computations proceed in parallel, the RowKey is scattered as widely as possible. For example, a function sold Tables of Phoenix can be used to specify how many Servers (Region Servers) it will transparently add a Hash symbol in front of the Rowkey, and then put the Rowkey data starting with different symbols on different Servers according to the above, and put the data on different Servers uniformly. Since the second string of rowkey is ClusterID, it is possible to put data of the same ClusterID on a server.
For the taxi track of beijing, the GPS data is classified by using (a) the KMeans method, (b) the dbss method and (c) the method according to the first embodiment of the present invention (i.e. the spectrum in fig. 5), and the classification result is shown in fig. 5, it can be seen that clustering performed by using (a) the KMeans method and (b) the dbss method generally divides the space into more regular blocks, and these and actual situations do not always meet. And (c) the method of the first embodiment of the present invention is used for clustering, the space is represented in the form of graph network, and the divided different categories are more consistent with the distribution of real tracks or roads.
In the first embodiment of the traffic information processing method, the adjacency relation between different traffic points is represented in a graph mode, and then the adjacency graph is used for carrying out cluster analysis to divide an urban traffic space plane. Aiming at the problem of measuring the correlation between different traffic points, the track passing times between two adjacent traffic points are analyzed by utilizing the statistics of historical traffic data to represent.
Fig. 6 is a flowchart of a second embodiment of the traffic information processing method of the present invention, in this embodiment, the method of the present invention includes:
s61, acquiring information of traffic points;
s62, converting the information of the traffic points into traffic track information;
s63, determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information;
s64, based on the information of the traffic points and the adjacency relation and the passing times of any two adjacent traffic points, obtaining a high-dimensional map feature matrix of the traffic network formed by the traffic points by using a spectral clustering method;
s65, extracting the type information of the traffic points from the high-dimensional map feature matrix;
s66, classifying and storing the information of the traffic points based on the category information to form a database;
s67, receiving a request for acquiring information of the traffic points of the specific category;
and S68, acquiring the information of the traffic points of the corresponding category from the database based on the category information in the request for response.
In this embodiment, steps S61-S66 correspond to steps S11-S16 of the first embodiment of the method, and are not repeated herein.
Referring to fig. 4, in step S67, a request to acquire information of a traffic point is received from a client Query, the request including category information of the traffic point. In step S68, based on the category information, information of the traffic point of the corresponding category is acquired from the database to respond. For example, the information of the corresponding traffic point is directly sent to the corresponding client or sent to the corresponding client after being subjected to preset processing.
Due to the fact that the data are stored in the HBase + Phoenix mode, the Phoenix can provide a query interface similar to SQL for the HBase, the data of the HBase can be directly queried through the Phoenix, SQL sentences can be analyzed into parallel sentences by the Phoenix during query, the data are queried in parallel on a plurality of storage media, efficiency is improved, and the method can be applied to millisecond-level real-time query.
As shown in FIG. 7, for the test on the 43 hundred million GPS data sets, the framework of the invention increases the query delay in an approximately linear manner with the increase of the input scale of the query, and the delay within 4s can be achieved in most cases, so as to meet the requirement of real-time query.
It should be understood that, the sequence numbers of the steps in the foregoing embodiments do not imply an execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present invention.
In the embodiment of the present invention, a traffic information processing apparatus is further provided, where the traffic information processing apparatus includes units for executing the steps in the embodiment corresponding to fig. 1. Please refer to fig. 1-5 for related descriptions of embodiments. Fig. 8 shows a schematic configuration diagram of a first embodiment of a traffic information processing apparatus 800 of the present invention, including:
an acquisition unit 81 for acquiring information of a traffic point;
a conversion unit 82 for converting information of the traffic points into traffic track information;
the determining unit 83 is configured to determine an adjacency relation between any two adjacent traffic points in the traffic track and the track passing frequency according to the traffic track information;
a calculating unit 84, configured to obtain a high-dimensional map feature matrix of a traffic network formed by traffic points by using a spectral clustering method based on information of the traffic points, an adjacency relation between any two adjacent traffic points, and a passing frequency;
the extracting unit 85 is used for extracting the category information of the traffic points from the high-dimensional representation;
the storage unit 86 is configured to store the traffic point information in a classified manner based on the category information to form a database.
Wherein the determining unit 83 includes:
a first determining module 831, configured to split the traffic track information into adjacent traffic point pairs;
a second determining module 832, configured to determine an adjacency relation between any two adjacent traffic points according to the adjacent traffic point pairs;
and a third determining module 833, configured to determine the number of track passing times of any two adjacent traffic points according to the adjacent traffic point pairs.
Among them, the calculation unit 84 includes:
the adjacency matrix calculation module 841 is used for determining an adjacency matrix by taking the track passing times of any two adjacent traffic points as the weight values of the adjacency relation of the corresponding two adjacent traffic points;
an adjacency graph calculation module 842, configured to obtain a weighted undirected adjacency graph of a traffic network formed by traffic points according to information of the traffic points, an adjacency relation between any two adjacent traffic points, and the adjacency matrix;
and the high-dimensional representation calculation module 843 is used for calculating a high-dimensional map feature matrix of the traffic network formed by the traffic points according to the weighted undirected adjacency graph.
Further, the high-dimensional characterization calculation module 843 includes the following sub-modules (not shown in the figure):
the first sub-module is used for calculating the corresponding degrees of all traffic points according to the undirected adjacency graphs with weights so as to obtain a degree matrix;
the second submodule is used for calculating a Laplace matrix of the undirected adjacency graph with weights based on the degree matrix and the adjacency matrix;
the third submodule is used for solving eigenvectors and eigenvalues of the Laplace matrix;
and the fourth submodule is used for obtaining a high-dimensional map feature matrix of the traffic network formed by the traffic points by using a TopK algorithm according to the feature vector and the feature value of the Laplace matrix.
In a second embodiment of the traffic information processing apparatus according to the present invention, based on fig. 8, the apparatus further includes a receiving unit and a responding unit, configured to execute corresponding steps in the embodiment corresponding to fig. 6, and refer to the related description in the embodiment corresponding to fig. 6 specifically.
The receiving unit is used for receiving a request for acquiring information of traffic points in a specific category.
The response unit is used for acquiring the information of the traffic points of the corresponding category from the database based on the category information in the request so as to respond.
The present invention also provides a terminal, as shown in fig. 9, the terminal 100 includes: a processor 101, a memory 102, and a computer program 103 stored in the memory 102 and operable on the processor 101. The steps in the embodiments of the traffic information processing method described above are implemented when the processor 101 executes the computer program 103. Alternatively, the processor 101, when executing the computer program 103, implements the functions of each unit/module/submodule in each device embodiment described above.
Illustratively, the computer program 103 may be partitioned into one or more units/modules/sub-modules that are stored in the memory 102 and executed by the processor 101 to carry out the invention. The one or more units/modules/sub-modules may be a series of computer program instruction segments capable of performing specific functions, which are used to describe the execution process of the computer program 103 in the traffic information processing apparatus/terminal 100. For example, the computer program 62 may be divided into an acquisition module, an execution module, and a generation module (module in a virtual device), and the specific functions of each module are as follows:
acquiring information of traffic points; converting the information of the traffic points into traffic track information; determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information; based on the information of the traffic points, the adjacency relation between any two adjacent traffic points and the passing times, a high-dimensional map feature matrix of the traffic network formed by the traffic points is obtained by using a spectral clustering method; extracting the category information of the traffic points from the high-dimensional map feature matrix; and classifying and storing the traffic points based on the class information to form a database.
The terminal 100 may be a computing device such as a desktop computer, a notebook, a palm top computer, and a cloud server. The terminal 100 can include, but is not limited to, a processor 101, a memory 102. Those skilled in the art will appreciate that fig. 9 is merely an example of the terminal 100 and does not constitute a limitation of the terminal 100 and may include more or less components than those shown, or combine certain components, or different components, e.g., the terminal 100 may further include input-output devices, network access devices, buses, etc.
The Processor 101 may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The storage 102 may be an internal storage unit of the terminal 100, such as a hard disk or a memory of the terminal 100. The memory 102 may also be an external storage device of the terminal 100, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), etc. provided on the terminal 100. Further, the memory 102 may also include both internal storage units of the terminal 100 and external storage devices. The memory 102 is used for storing the computer program 103 and other programs and data required by the terminal 100. The memory 102 may also be used to temporarily store data that has been output or is to be output.
The invention also provides a computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of any of the embodiments of the traffic information processing method, for example.
It will be apparent to those skilled in the art that, for convenience and brevity of description, only the above-mentioned division of the functional units and modules is illustrated, and in practical applications, the above-mentioned function distribution may be performed by different functional units and modules according to needs, that is, the internal structure of the apparatus is divided into different functional units or modules to perform all or part of the above-mentioned functions. Each functional unit and module in the embodiments may be integrated in one processing unit, or each unit may exist alone physically, or two or more units are integrated in one unit, and the integrated unit may be implemented in a form of hardware, or in a form of software functional unit. In addition, specific names of the functional units and modules are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present application. The specific working processes of the units and modules in the system may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and reference may be made to the related descriptions of other embodiments for parts that are not described or illustrated in a certain embodiment.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
In the embodiments provided in the present invention, it should be understood that the disclosed apparatus/terminal device and method may be implemented in other ways. For example, the above-described embodiments of the apparatus/terminal device are merely illustrative, and for example, the division of the modules or units is only one logical division, and there may be other divisions when actually implemented, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. . Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution medium, and the like. It should be noted that the computer readable medium may contain content that is subject to appropriate increase or decrease as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media does not include electrical carrier signals and telecommunications signals as is required by legislation and patent practice.
The above-mentioned embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications and substitutions do not substantially depart from the spirit and scope of the embodiments of the present invention, and are intended to be included within the scope of the present invention.
Claims (10)
1. A method for processing traffic information, the method comprising:
acquiring information of traffic points;
converting the information of the traffic points into traffic track information;
determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information;
based on the information of the traffic points, the adjacency relation between any two adjacent traffic points and the passing times, obtaining a high-dimensional representation of a traffic network formed by the traffic points by using a spectral clustering method;
extracting the category information of the traffic points from the high-dimensional representation;
and classifying and storing the information of the traffic points based on the class information to form a database.
2. The method of claim 1, wherein the information about the traffic point comprises GPS data, and wherein converting the information about the traffic point into traffic track information comprises:
and sequencing the GPS data reported by the same traffic participant according to the reporting time to obtain the traffic track information of the traffic participant.
3. The method of claim 1, wherein the determining the adjacency relation and the passing times of any two adjacent traffic points in the traffic track according to the traffic track information comprises:
splitting the traffic track information into adjacent traffic point pairs;
determining the adjacency relation of any two adjacent traffic points according to the adjacent traffic point pairs;
and determining the track passing times of any two adjacent traffic points according to the adjacent traffic point pairs.
4. The method of claim 1, wherein the obtaining of the high-dimensional representation of the traffic network formed by the traffic points by using a spectral clustering method based on the information of the traffic points, the adjacency relation between any two adjacent traffic points and the track passing times comprises:
determining an adjacency matrix by taking the track passing times of any two adjacent traffic points as the weight values of the adjacency relation of the corresponding two adjacent traffic points;
obtaining a weighted undirected adjacency graph of a traffic network formed by the traffic points according to the information of the traffic points, the adjacency relation of any two adjacent traffic points and the adjacency matrix;
and calculating a high-dimensional representation of a traffic network formed by the traffic points according to the weighted undirected adjacency graph.
5. The method of claim 4, wherein computing a high-dimensional representation of a traffic network of the traffic points from the weighted undirected adjacency graph comprises:
calculating the corresponding degrees of all traffic points according to the weighted undirected adjacency graphs to obtain a degree matrix;
calculating a Laplace matrix of the weighted undirected adjacency graph based on the degree matrix and the adjacency matrix;
solving eigenvectors and eigenvalues of the Laplace matrix;
and according to the eigenvectors and the eigenvalues of the Laplace matrix, using a TopK algorithm to obtain a high-dimensional representation of the traffic network formed by the traffic points.
6. The method of claim 1, further comprising:
receiving a request for acquiring information of traffic points of a specific category;
and acquiring information of the traffic points of the corresponding category from the database based on the category information in the request for response.
7. An apparatus for processing traffic information, the apparatus comprising:
an acquisition unit configured to acquire information of a traffic point;
the conversion unit is used for converting the information of the traffic points into traffic track information;
the determining unit is used for determining the adjacency relation of any two adjacent traffic points in the traffic track and the track passing times according to the traffic track information;
the calculation unit is used for obtaining a high-dimensional representation of a traffic network formed by the traffic points by using a spectral clustering method based on the information of the traffic points, the adjacency relation of any two adjacent traffic points and the passing times;
the extraction unit is used for extracting the category information of the traffic points from the high-dimensional representation;
and the storage unit is used for classifying and storing the information of the traffic points based on the class information to form a database.
8. The apparatus of claim 7, wherein the computing unit comprises:
the adjacency matrix calculation module is used for determining an adjacency matrix by taking the track passing times of any two adjacent traffic points as the weight values of the adjacency relation of the corresponding two adjacent traffic points;
the adjacency graph calculation module is used for obtaining a weighted undirected adjacency graph of a traffic network formed by the traffic points according to the information of the traffic points, the adjacency relation of any two adjacent traffic points and the adjacency matrix;
and the high-dimensional representation calculation module is used for calculating the high-dimensional representation of the traffic network formed by the traffic points according to the undirected adjacency graph with the weight.
9. A terminal comprising a memory, a processor and a computer program stored in the memory and executable on the processor, characterized in that the processor implements the steps of the method according to any of claims 1 to 6 when executing the computer program.
10. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 6.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011395101.5A CN114613124B (en) | 2020-12-03 | 2020-12-03 | Traffic information processing method, device, terminal and computer readable storage medium |
PCT/CN2020/140134 WO2022116326A1 (en) | 2020-12-03 | 2020-12-28 | Transportation information processing method, device, terminal, and computer-readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011395101.5A CN114613124B (en) | 2020-12-03 | 2020-12-03 | Traffic information processing method, device, terminal and computer readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114613124A true CN114613124A (en) | 2022-06-10 |
CN114613124B CN114613124B (en) | 2023-02-10 |
Family
ID=81852892
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011395101.5A Active CN114613124B (en) | 2020-12-03 | 2020-12-03 | Traffic information processing method, device, terminal and computer readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114613124B (en) |
WO (1) | WO2022116326A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118278454A (en) * | 2024-05-31 | 2024-07-02 | 湖南工商大学 | Traffic passenger flow short-term prediction method considering dynamic and static multiple factors |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114021580B (en) * | 2021-10-14 | 2023-06-27 | 华南师范大学 | Classroom dialogue processing method, system and storage medium based on sequence pattern mining |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110246537A1 (en) * | 2010-03-31 | 2011-10-06 | International Business Machines Corporation | Matrix re-ordering and visualization in the presence of data hierarchies |
CN102855638A (en) * | 2012-08-13 | 2013-01-02 | 苏州大学 | Detection method for abnormal behavior of vehicle based on spectrum clustering |
US20140278052A1 (en) * | 2013-03-15 | 2014-09-18 | Caliper Corporation | Lane-level vehicle navigation for vehicle routing and traffic management |
US20150332122A1 (en) * | 2014-05-19 | 2015-11-19 | Jinling Institute Of Technology | Method and apparatus for image processing |
CN105787472A (en) * | 2016-03-28 | 2016-07-20 | 电子科技大学 | Abnormal behavior detection method based on time-space Laplacian Eigenmaps learning |
CN107272405A (en) * | 2017-05-26 | 2017-10-20 | 广州汽车集团股份有限公司 | The vehicle distributed director gain acquiring method and device of homogeneous vehicle platoon |
CN109448370A (en) * | 2018-10-29 | 2019-03-08 | 东南大学 | A kind of traffic control sub-area division method based on track of vehicle data |
CN109859054A (en) * | 2018-12-13 | 2019-06-07 | 平安科技(深圳)有限公司 | Network community method for digging, device, computer equipment and storage medium |
US20190180612A1 (en) * | 2016-08-15 | 2019-06-13 | University Of Southern California | Latent space model for road networks to predict time-varying traffic |
CN110222589A (en) * | 2019-05-16 | 2019-09-10 | 五邑大学 | A kind of pedestrian recognition methods and its system, device, storage medium again |
CN110929962A (en) * | 2019-12-13 | 2020-03-27 | 中国科学院深圳先进技术研究院 | Traffic flow prediction method and device based on deep learning |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1991933A1 (en) * | 2006-02-27 | 2008-11-19 | Robert Bosch GmbH | Trajectory retrieval system, method and software for trajectory data retrieval |
CN110598747B (en) * | 2019-08-13 | 2023-05-02 | 广东工业大学 | Road classification method based on self-adaptive K-means clustering algorithm |
CN110992683B (en) * | 2019-10-29 | 2021-07-27 | 山东科技大学 | Dynamic image perception-based intersection blind area early warning method and system |
-
2020
- 2020-12-03 CN CN202011395101.5A patent/CN114613124B/en active Active
- 2020-12-28 WO PCT/CN2020/140134 patent/WO2022116326A1/en active Application Filing
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110246537A1 (en) * | 2010-03-31 | 2011-10-06 | International Business Machines Corporation | Matrix re-ordering and visualization in the presence of data hierarchies |
CN102855638A (en) * | 2012-08-13 | 2013-01-02 | 苏州大学 | Detection method for abnormal behavior of vehicle based on spectrum clustering |
US20140278052A1 (en) * | 2013-03-15 | 2014-09-18 | Caliper Corporation | Lane-level vehicle navigation for vehicle routing and traffic management |
US20150332122A1 (en) * | 2014-05-19 | 2015-11-19 | Jinling Institute Of Technology | Method and apparatus for image processing |
CN105787472A (en) * | 2016-03-28 | 2016-07-20 | 电子科技大学 | Abnormal behavior detection method based on time-space Laplacian Eigenmaps learning |
US20190180612A1 (en) * | 2016-08-15 | 2019-06-13 | University Of Southern California | Latent space model for road networks to predict time-varying traffic |
CN107272405A (en) * | 2017-05-26 | 2017-10-20 | 广州汽车集团股份有限公司 | The vehicle distributed director gain acquiring method and device of homogeneous vehicle platoon |
CN109448370A (en) * | 2018-10-29 | 2019-03-08 | 东南大学 | A kind of traffic control sub-area division method based on track of vehicle data |
CN109859054A (en) * | 2018-12-13 | 2019-06-07 | 平安科技(深圳)有限公司 | Network community method for digging, device, computer equipment and storage medium |
CN110222589A (en) * | 2019-05-16 | 2019-09-10 | 五邑大学 | A kind of pedestrian recognition methods and its system, device, storage medium again |
CN110929962A (en) * | 2019-12-13 | 2020-03-27 | 中国科学院深圳先进技术研究院 | Traffic flow prediction method and device based on deep learning |
Non-Patent Citations (3)
Title |
---|
MOHAMMADREZA SAEEDMANESH: "Dynamic clustering and propagation of congestion in heterogeneously congested urban traffic networks", 《TRANSPORTATION RESEARCH PARTB》 * |
熊文等: "基于时空相似性的大规模轨迹数据融合技术", 《集成技术》 * |
陈晓光: "基于网格的密度峰值聚类算法研究及其应用", 《中国优秀硕士学位论文全文数据库》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118278454A (en) * | 2024-05-31 | 2024-07-02 | 湖南工商大学 | Traffic passenger flow short-term prediction method considering dynamic and static multiple factors |
CN118278454B (en) * | 2024-05-31 | 2024-08-20 | 湖南工商大学 | Traffic passenger flow short-term prediction method considering dynamic and static multiple factors |
Also Published As
Publication number | Publication date |
---|---|
WO2022116326A1 (en) | 2022-06-09 |
CN114613124B (en) | 2023-02-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhan et al. | Inferring urban land use using large-scale social media check-in data | |
WO2022095351A1 (en) | Target area division method and apparatus, and electronic device and storage medium | |
CN109241846B (en) | Method and device for estimating space-time change of remote sensing image and storage medium | |
Saha et al. | Pattern recognition using clustering analysis to support transportation system management, operations, and modeling | |
Yang et al. | Analysis of traffic state variation patterns for urban road network based on spectral clustering | |
CN107679558A (en) | A kind of user trajectory method for measuring similarity based on metric learning | |
Bhanu et al. | Embedding traffic network characteristics using tensor for improved traffic prediction | |
CN114613124B (en) | Traffic information processing method, device, terminal and computer readable storage medium | |
US11255678B2 (en) | Classifying entities in digital maps using discrete non-trace positioning data | |
Meratnia et al. | Aggregation and comparison of trajectories | |
Ma et al. | Modeling and analysis of daily driving patterns of taxis in reshuffled ride-hailing service market | |
CN114416900A (en) | Method and device for analyzing track stop point | |
CN110298687B (en) | Regional attraction assessment method and device | |
CN110704652A (en) | Vehicle image fine-grained retrieval method and device based on multiple attention mechanism | |
Ding et al. | Efficient vanishing point detection method in complex urban road environments | |
CN114077912A (en) | Data prediction method and data prediction device | |
CN106547862A (en) | Traffic big data dimension-reduction treatment method based on manifold learning | |
Zhang et al. | Hunting image: Taxi search strategy recognition using sparse subspace clustering | |
CN116662839A (en) | Associated big data cluster analysis method and device based on multidimensional intelligent acquisition | |
Tang et al. | On discovering motifs and frequent patterns in spatial trajectories with discrete Fréchet distance | |
Lian et al. | Mining regional mobility patterns for urban dynamic analytics | |
Salinas et al. | Cityhub: A library for urban data integration | |
He et al. | Spatio-temporal trajectory anomaly detection based on common sub-sequence | |
CN114972552B (en) | Graph data compression method based on two-dimensional graph fractional Fourier transform | |
AT&T |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |