CN113538940B

CN113538940B - Real-time optimal lane selection method suitable for vehicle-road cooperative environment

Info

Publication number: CN113538940B
Application number: CN202110770608.2A
Authority: CN
Inventors: 钱志奇; 戚国华; 吕钰新; 李赟鹏; 罗文奇
Original assignee: Xiwan Wisdom Guangdong Information Technology Co ltd
Current assignee: Xiwan Wisdom Guangdong Information Technology Co ltd
Priority date: 2021-07-08
Filing date: 2021-07-08
Publication date: 2022-05-17
Anticipated expiration: 2041-07-08
Also published as: CN113538940A

Abstract

The invention discloses a real-time optimal lane selection method suitable for a vehicle-road cooperative environment, which specifically comprises the following steps: s1, establishing a traffic model under the vehicle-road cooperative environment as a network evolution game model with state-dependent time lag by using a mixed value logic function, and deriving a strategy evolution equation by using a matrix method; s2, acquiring vehicle data of each lane within limited time by using road data collecting equipment; s3, taking the vehicle speed as income, taking the selection of lanes as a strategy and taking the decision made by all the current vehicles as situation; when information time lag exists, analyzing the influence of different situations on the speed of the vehicle, finding out a strategy for realizing the maximum benefit of the individual vehicle, and further making a strategy updating rule of optimal response; s4, constructing a state transition matrix H by using a matrix method; and analyzing the characteristics, judging whether the traffic system is stable, if so, solving a mean cross point strategy, and finding out a criterion for convergence of the traffic system, namely the optimal path planning.

Description

Real-time optimal lane selection method suitable for vehicle-road cooperative environment

Technical Field

The invention relates to the field of intelligent traffic systems, in particular to a real-time optimal lane selection method suitable for a vehicle-road cooperative environment.

Background

In the traditional traffic system, the random lane change of vehicles is an important reason for traffic jam and accidents. Most drivers can only rely on self experience and visual judgment to achieve the purpose of lane changing, unpredictable risks exist behind the decision, especially when traffic is congested, blind lane changing further aggravates the traffic jam, and even accidents are caused. With the development of perception and communication technology, a vehicle-road cooperative system provides an effective way for solving the problems, and the design of a reliable lane decision method is the key for the effective operation of the vehicle-road cooperative system.

The game theory provides a research method and an analysis means for solving the problem of competition tendency. Lane changes between vehicles can be considered a non-cooperative gambling activity. The driver in the lane selection process is regarded as a game player in the game, the win-win is realized through the game process, and the participants can make optimal decision by combining with the expected income of the participants under the mutual constraint of the external conditions. The method not only accords with the driving psychology of the driver pursuing the maximum benefit, but also enables the lane changing decision to be more reasonable, so that the acceptability of the driver to the game cooperation strategy is high, the cooperation will is strong, and the method is very suitable for solving the problem of the optimal lane selection of the vehicle by adopting a game theory method. In an actual traffic network, the topological relation among vehicles has an important influence on the evolution of the situation, and especially the influence of the lane selection of the vehicles in the neighborhood on each other is more direct; meanwhile, when the vehicle receives the information of the data acquisition equipment, time lag occurs, so that the judgment of the vehicle on the lane selection of the neighbor vehicle is influenced, the time lag has an inseparable relationship with the situation, and the influence of state-dependent time lag on the optimal lane selection has to be considered.

Disclosure of Invention

The invention aims to provide an optimal lane selection method under a vehicle-road cooperative environment, which is used for guiding a driver to make optimal lane selection, so that the road smoothness is improved and traffic jam is relieved on the premise of ensuring traffic safety, and the method has very important social value and engineering significance.

In order to solve the technical problems, the invention adopts the technical scheme that: the real-time optimal lane selection method suitable for the vehicle-road cooperative environment specifically comprises the following steps:

s1: establishing a traffic model under the vehicle-road cooperative environment as a network evolution game model with state-dependent time lag by using a mixed value logic function, and deriving a strategy evolution equation by using a matrix method;

s2: acquiring historical vehicle data of each lane within limited time by using road data collection equipment;

s3: taking the vehicle speed as the income, taking the selection of lanes as a strategy, and taking the decision made by all the vehicles at present as the situation; when information time lag exists, analyzing the influence of different situations on the speed of the vehicle, further finding out a strategy for enabling the individual vehicle to realize the maximum benefit, and further making a strategy updating rule of optimal response;

s4: constructing a state transition matrix H by using a matrix method; and analyzing the characteristics of the state transition matrix H, judging whether the traffic system is stable, if so, solving a mean transverse point strategy, finding out a criterion for convergence of the traffic system, and further obtaining a stable point, namely the sought optimal path plan.

By adopting the technical scheme, mathematical modeling is firstly carried out according to the real-time traffic network condition; then, according to the income parameter selection of the vehicle owner, the acquisition work of model data is carried out by means of the vehicle-road cooperation technology; then, according to the income parameter selection of the vehicle owner, a strategy updating rule is made; further judging whether the system is stable, namely whether the game can converge to the equilibrium point, and if the system is stable, directly solving an equilibrium point strategy, namely an optimal strategy; if the system is unstable, the participant strategy is intervened through a system controller to realize system stability, then the equilibrium point strategy is solved, and finally an optimal lane selection suggestion is given; the method for modeling the traffic model in the vehicle-road cooperative environment by using the mathematical formula obtains a network evolution game model with state-dependent time lag to simulate the vehicle-road cooperative traffic model so as to solve the problem of optimal lane selection of individual vehicles, and on one hand, helps a vehicle owner to select a scientific optimal lane and maximize the benefit of the vehicle owner; on the other hand, the existence of the global optimal solution is guaranteed, so that the pressure of the whole traffic jam is relieved, and the core of the method is to solve the game problem of the individual vehicles and the adjacent vehicles by utilizing the thought of a game theory, find out the global optimal solution and realize the optimal control.

As a preferred embodiment of the present invention, the method further includes step S5: and if the traffic system is unstable, namely convergence cannot be realized, designing a state feedback controller to perform route optimization control on the individual vehicle so as to ensure that the traffic system is converged.

As a preferred embodiment of the present invention, the step S1 specifically includes:

s11: establishing a traffic model under a vehicle-road cooperative environment into a network evolution game model G with state-dependent time lag by using a mixed value logic network_d；

Assume that there are N vehicles forming a set N ═ {1, 2.., N }, where the set of lanes in which an individual vehicle i may be allowed to travel is S_i＝{k₁，k₂，...，k_i}，i＝1，2，...，n，S＝S₁×S₂×…×S_nIs a set of situations, c_i: s → a set of real numbers, where i ═ 1, 2., n is the revenue function of individual vehicle i; the topology between vehicles is represented by a connection graph (N, E), where N is the set of points representing the vehicle and E is the set of edges; each side is connected with two vehicles with information interaction relation, and because the influence of adjacent vehicles on each other is the largest, the influence of one-step neighbor vehicles on each other is only considered in lane selection; n is a radical of_iRepresenting a neighborhood of individual vehicle i that contains all neighboring vehicles of vehicle i; game G of vehicle i and vehicle j in its neighborhood_ij(ii) a Using mixed-value logic function f_iA policy update rule representing an individual vehicle i, where i ═ 1, 2_i：S→{0，1，...，λ_iDenotes the time lag, τ, of the individual vehicle i at the time of receiving the information_iIs a mixed value logic function, the networked vehicle G_dThe dynamic model of (2) can be built into the following patterns:

x_i(t+1)＝f_i(x_j(t-τ_i(x(t)))|j∈N_i，t＝0，1，...，)；

wherein x_i(t) is the lane selection for individual vehicle i at time t, x (t) e S is the situation at time t, τ_i(x (t)) is the state-dependent time lag of the individual vehicle i at time t, c_i(t) is the average profit of the neighbor vehicle of the individual vehicle i when playing the game, namely the speed of the vehicle i at the time t, and the formula is as follows:

as a preferred embodiment of the present invention, the road data collection device in step S2 includes a vehicle speed sensor, a roadside sensor and a high-precision map; the step S2 specifically includes: based on the vehicle running speed acquired by the vehicle speed sensor, taking the vehicle running speed as a profit quantification index of the individual vehicle; the lane position of the individual vehicle is positioned based on a positioning system, a road side sensor and a high-precision map of the intelligent vehicle; and (3) providing data support for the optimal decision of the individual vehicle by combining the historical lane selection condition of each individual vehicle in the t steps stored by the intelligent vehicle-road cooperative system and the information time lag of the individual vehicle in the data receiving process.

As a preferred embodiment of the present invention, the step S3 specifically includes:

s31: according to the revenue function c of the individual vehicle i_i(t) searching for the time t, selecting the lane with the maximum profit for the neighbor vehicle j, and forming a set P_i(t)：

Wherein i represents a variable of the player, whose value range is {1, 2.., n }, S_iIs a set of policies for player i, x_i(t) is a strategy variable selected by player i at time t;

s32: suppose P_i(t)＝{s₁，s₂，...，s_r}∈S_iIf the lane selection for the vehicle is already the best lane, then the next time the vehicle will remain traveling in that lane; if the lane of the vehicle is not the best lane for the vehicles in its neighborhood, then the set P is selected at the next time_i(t) designing a lane with the minimum subscript, namely designing a short potential optimal response strategy updating rule as follows:

as a preferred embodiment of the present invention, the step S4 specifically includes:

s41: identifying the real number set as vector set, converting it into algebraic form z (t +1) ═ Hz (t) and λ ═ max { λ } by using matrix half tensor product method₁，λ₂，...，λ_n}; the situation of the lambda +1 step is fused into z (t) by a dimension expansion method, so that the above formula becomes a standard logic dynamic system; the state transition matrix H absorbs the original time delay information, reflects the reachability of the situation transition, reveals the property of the game, calculates and observes and calculates the state transition matrix

S42: judging whether the traffic system is stable or not, wherein the judgment formula is as follows:

wherein Row_s(H) Represents the s-th Row, of the matrix H_-s(H) Denotes the rows of the matrix H except the s-th row, k ═ k₁×k₂×...×k_n(ii) a If the integer s is existed, the traffic system is stable, and if the integer s is not existed, the traffic system is unstable;

s43: if the traffic system is stable, the traffic game is solved to be converged to an equilibrium point

Will be provided with

Performing a unique decomposition yields:

i.e. when the individual vehicle i selects lane S_iAt the same time, its speedThe optimal degree can be realized, and the lane S is_iMost unobstructed and stable, with i 1, 2.

As a preferred technical solution of the present invention, the step S5 specifically includes: if the traffic system is unstable, namely the traffic game cannot be converged, intervention is applied to the selection of the individual paths through the system state feedback controller, and lane selection of individual vehicles is controlled, so that the traffic system reaches a stable state; and returning to the step S43 to obtain the optimal lane selection.

As a preferred embodiment of the present invention, the specific method of the system state feedback controller in step S5 to intervene in selecting the individual path includes the following three methods:

s5-1: restricting the vehicle's selection set of lanes, i.e. the set of lanes for which vehicle i can be allowed to be

Is a set S_iTo narrow the degree of freedom of the vehicle;

s5-2: the selection of the lane has priority, the original mode of uniformly and mixedly selecting the lane is broken through, and the lane different from the adjacent vehicle is preferentially selected, so that the system achieves dynamic balance;

s5-3: one of the vehicles is directly controlled and subjected to direct lane selection intervention, so that the lane selection intervention influences the behavior of the neighbor vehicles and further influences the selection of the global vehicle, namely:

where t represents a time node, N_iRepresents the neighbor set of player i, the value range of player i is {1, 2., n }, u (t) represents the inputtable external control variable, g₁Is a artificially designed, fully controllable mixed-value logic function, f_iIs the policy of player iThe rules are updated slightly.

As a preferable technical scheme of the invention, the roadside sensor comprises a camera and a millimeter wave radar.

The invention has the beneficial effects that: the optimal lane selection method under the vehicle-road cooperative environment is provided, and is used for guiding a driver to make scientific optimal lane selection, so that the road smoothness is improved on the premise of ensuring traffic safety, traffic jam is relieved, and the method has great social value and engineering significance.

Drawings

FIG. 1 is a schematic overall flow chart of a real-time optimal lane selection method for use in a vehicle-road cooperative environment according to the present invention;

FIG. 2 is a schematic diagram of lane simulation in the real-time optimal lane selection method for the vehicle-road cooperative environment according to the present invention;

FIG. 3 is a schematic diagram of a vehicle topological relation in an embodiment of a real-time optimal lane selection method for use in a vehicle-road coordination environment according to the present invention.

Detailed Description

The technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the drawings of the embodiments of the present invention.

Example (b): as shown in fig. 1, the real-time optimal lane selection method applied to the vehicle-road cooperative environment specifically includes the following steps:

the step S1 includes the following steps:

x_i(t+1)＝f_i(x_j(t-τ_i(x(t)))|j∈N_i，t＝0，1，...，)；

the road data collection device in the step S2 includes a vehicle speed sensor, a roadside sensor and a high-precision map; the step S2 specifically includes: based on the vehicle running speed acquired by the vehicle speed sensor, taking the vehicle running speed as a profit quantification index of the individual vehicle; positioning the lane position of the individual vehicle based on a positioning system, a road side sensor and a high-precision map of the intelligent vehicle; the road side sensor comprises a camera and a millimeter wave radar; the historical lane selection condition of each individual vehicle in the t steps stored by the intelligent vehicle-road coordination system is combined with the information time lag of the individual vehicle when receiving data, so that data support is provided for the optimal decision of the individual vehicle;

the step S3 specifically includes:

The step S4 specifically includes:

s41: identifying the real number set as vector set, converting it into algebraic form z (t +1) ═ Hz (t) and λ ═ max { λ } by using matrix half tensor product method₁，λ₂，...，λ_n}., respectively; the situation of the lambda +1 step is fused into z (t) by a dimension expansion method, so that the above formula becomes a standard logic dynamic system; the state transition matrix H absorbs the original time delay information, reflects the reachability of the situation transition, reveals the property of the game, calculates and observes and calculates the state transition matrix

Judging whether the traffic system is stable or not, wherein the judgment formula is as follows:

Will be provided with

Performing a unique decomposition yields:

i.e. when the individual vehicle i selects lane s_iAt the time, the speed of the lane S can be optimized_iMost unobstructed and stable, with i 1, 2.

S5: if the traffic system is unstable, namely convergence cannot be achieved, designing a state feedback controller to perform route optimization control on the individual vehicle so that the traffic system is converged;

the step S5 specifically includes: if the traffic system is unstable, namely the traffic game cannot be converged, intervention is applied to the selection of the individual paths through the system state feedback controller, and lane selection of individual vehicles is controlled, so that the traffic system reaches a stable state; returning to the step S43 to obtain the optimal lane selection;

the specific method for the system state feedback controller in step S5 to intervene in the selection of the individual path includes the following three methods:

s5-1: restricting the vehicle's selection set of lanes, i.e. the set of lanes that vehicle i can be allowed to be

Is a set S_iTo narrow the degree of freedom of the vehicle;

s5-2: the selection of the lane has priority, the original lane mode of uniformly mixing and selecting is broken through, and the lane different from the adjacent vehicle is preferentially selected, so that the system achieves dynamic balance;

where u (t) represents the external control variables that can be entered，g₁Is a fully controllable mixed value logic function which is designed artificially; the matrix half tensor product method adopts the following documents of Yang Zheng, Changxi Li, and Jun-e Feng, Modeling and Dynamics of network evolution with Switched Time Delay, DOI: 10.1109/TCNS.2021.3084548.

FIG. 2 depicts a simple gaming diagram having a network architecture; by adopting the method, without loss of generality, only three vehicles are considered, and a fire truck p is assumed₁Private car p₂Bus p₃The three vehicles are driven with r₁，r₂，r₃On the highway with three lanes, the fire engine can run on any lane, and the private car only allows r to run₁，r₂Lane, bus allowed to travel r only₂，r₃Driveways, i.e. S₁＝{r₁，r₂，r₃}，S₂＝{r₁，r₂}，S₃＝{r₂，r₃The strategy situation is S ═ S₁×S₂×S₃By calculation, 12(3 × 2 × 2) different situations are found.

FIG. 3 depicts a topology between vehicles where two vehicle representations connected by a line segment have direct information interaction. Let p be₁And p₂Game notation G for road selection in driving process₁₂，p₁And p₃Game play G₁₃，p₂And p₃There is no direct gaming relationship. At this time G₁₂And G₁₃The earnings in the traffic game are quantified as shown in tables 1 and 2;

TABLE 1G₁₂The revenue matrix

c₁\c₂	r₁	r₂
			r₁	(3，0)	(1，1)
r₂	(1，1)	(2，0)
			r₃	(2，1)	(3，0)

TABLE 2G₁₃The revenue matrix

c₁\c₃	r₂	r₃
			r₁	(0，0)	(3，1)
r₂	(4，1)	(2，2)
			r₃	(3，2)	(1，0)

And selecting the lanes by the vehicle according to the short potential optimal response rule. Traffic congestion can result in information skew. Different time lags can be generated in different situations, and the greater the traffic jam degree is, the greater the time lag is. This situation-dependent time lag is described as follows:

wherein R is¹＝{(r₁，r₂，r₃)，(r₂，r₁，r₃)，(r₃，r₁，r₂)}；R³＝{(r₂，r₂，r₂)}；R²＝S\(R¹∪R³) (ii) a The evolution equation of the traffic game can be constructed as follows:

by calculation, we can find t 10, when H¹¹＝H¹⁰(ii) a And Row₄₇₂(H¹⁰)+Row₁₂₅₇(H¹⁰)＝1₁₇₂₈(ii) a The result shows that the traffic game can be converged; namely from (r)₁，r₁，r₂，r₁，r₁，r₂，r₁，r₁，r₂) The departing vehicle is finally stabilized in the situation (r)₁，r₂，r₃) (ii) a From (r)₂，r₁，r₃，r₂，r₁，r₃，r₂，r₁，r₃) The obtained vehicle is finally stabilized on the lane (r)₂，r₁，r₃) At the moment, the traffic is smooth.

The above description is only exemplary of the present invention and should not be taken as limiting the invention, as any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A real-time optimal lane selection method suitable for a vehicle-road cooperative environment is characterized by specifically comprising the following steps:

s4: constructing a state transition matrix H by using a matrix method; analyzing the characteristics of the state transition matrix H, judging whether the traffic system is stable, if so, solving a mean transverse point strategy, finding out a criterion of convergence of the traffic system, and further obtaining a stable point, namely the sought optimal path plan;

the step S3 specifically includes:

Wherein i represents a variable of the player, whose value range is {1, 2.., n }, S_iIs a playeri policy set, x_i(t) is a strategy variable selected by player i at time t;

s32: suppose P_i(t)＝{s₁，s₂，...，s_r}∈S_iIf the lane selection for the vehicle is already the best lane, then the next time the vehicle will remain traveling in that lane; if the lane of the vehicle is not the best lane for the vehicles in its neighborhood, then the set P is selected at the next time_iAnd (t) designing a lane with the minimum subscript, namely designing a short potential optimal response strategy updating rule as follows:

the step S4 specifically includes:

s41: identifying a real number set as a vector set, and converting the real number set into an algebraic form z (t +1) ═ Hz (t) by using a matrix half tensor product method; the state transition matrix H absorbs the original time delay information, reflects the reachability of situation transition and reveals the property of the game; since the dimension of one situation is k, the dimension of (lambda +1) situations is k^λ+1Thus computing and observing the computation state transition matrix

s43: if the traffic systemThe system is stable, and the traffic game is solved to obtain a convergence equilibrium point

Will be provided with

Performing a unique decomposition yields:

i.e. when the individual vehicle i selects lane s_iAt the time, the speed can be optimized, at which time the lane s_iMost unobstructed and stable, with i 1, 2.

2. The method for selecting the optimal lane in real time under the cooperative vehicle and road environment as claimed in claim 1, further comprising step S5: and if the traffic system is unstable, namely convergence cannot be realized, designing a state feedback controller to perform route optimization control on the individual vehicle so as to ensure that the traffic system is converged.

3. The method for selecting the optimal lane in real time under the cooperative vehicle and road environment as claimed in claim 2, wherein the step S1 includes the following steps:

Assume that there are N vehicles forming a set N ═ {1, 2.., N }, where the set of lanes in which an individual vehicle i may be allowed to travel is S_i＝{k₁，k₂，...，k_i}，i＝1，2，...，n，S＝S₁×S₂×…×S_nIs a set of situations, c_i: s → a set of real numbers, where i ═ 1, 2., n is the revenue function of individual vehicle i; the topology between vehicles is represented by a connection graph (N, E), where N is the set of points representing the vehicle and E is the set of edges; n is a radical of_iRepresenting individual vehicles iA neighborhood of (c); using mixed-value logic function f_iA policy update rule representing an individual vehicle i, where i ═ 1, 2_i：S→{0，1，...，λ_iRepresents the time lag of the individual vehicle i when receiving the information, and the networked vehicle G_dThe dynamic model of (2) can be built into the following patterns:

x_i(t+1)＝f_i(x_j(t-τ_i(x(t)))|j∈N_i，t＝0，1，...，)；

wherein x_i(t) is the lane selection for individual vehicle i at time t, x (t) e S is the situation at time t, τ_i(x (t)) is the state-dependent time lag of the individual vehicle i at time t, c_i(t) is the average income of the neighbor vehicle of the individual vehicle i when the game is played, and the formula is as follows:

4. the method for selecting the optimal lane in real time under the cooperative environment of the vehicle and road according to claim 1, wherein the road data collecting device in the step S2 comprises a vehicle speed sensor, a road side sensor and a high precision map; the step S2 specifically includes: based on the vehicle running speed acquired by the vehicle speed sensor, taking the vehicle running speed as a profit quantification index of the individual vehicle; positioning the lane position of the individual vehicle based on a positioning system, a road side sensor and a high-precision map of the intelligent vehicle; and (3) providing data support for the optimal decision of the individual vehicle by combining the historical lane selection condition of each individual vehicle in the t steps stored by the intelligent vehicle-road cooperative system and the information time lag of the individual vehicle in the data receiving process.

5. The method for selecting the optimal lane in real time according to claim 1, wherein the step S5 specifically includes: if the traffic system is unstable, namely the traffic game cannot be converged, intervention is applied to the selection of the individual paths through the system state feedback controller, and lane selection of individual vehicles is controlled, so that the traffic system reaches a stable state; and returning to the step S43 to obtain the optimal lane selection.

6. The method for selecting the optimal lane in real time under the cooperative vehicle and road environment as claimed in claim 5, wherein the specific method of the system status feedback controller in step S5 to intervene in selecting the individual path includes the following three methods:

Is a set S_iTo narrow the degree of freedom of the vehicle;

s5-3: one of the vehicles is directly controlled and subjected to direct lane selection intervention, so that the lane selection intervention influences the behaviors of the neighbor vehicles and further influences the selection of the global vehicle, namely:

where t represents a time node, N_iRepresents the neighbor set of player i, the value range of player i is {1, 2., n }, u (t) represents the inputtable external control variable, g₁Is a artificially designed, fully controllable mixed-value logic function, f_iIs the policy update rule for player i.

7. The method for selecting the optimal lane in real time under the cooperative environment of the vehicle and the road according to claim 4, wherein the road side sensor comprises a camera and a millimeter wave radar.