CN112133086B

CN112133086B - Regional traffic signal data driving control method based on multi-agent network

Info

Publication number: CN112133086B
Application number: CN202010798063.1A
Authority: CN
Inventors: 张海波; 王力; 吉鸿海; 潘彦斌; 李丹阳
Original assignee: North China University of Technology
Current assignee: North China University of Technology
Priority date: 2020-08-10
Filing date: 2020-08-10
Publication date: 2022-01-18
Anticipated expiration: 2040-08-10
Also published as: CN112133086A

Abstract

The invention relates toA regional traffic signal data driving control method based on a multi-agent network is characterized in that a storage forwarding model is established, constraint conditions for green light time of an intersection are adjusted, a time-varying control signal period C (k) is introduced, and a brand-new green light time constraint condition is obtained, so that green light time waste is avoided, the constraint conditions for breaking a fixed period are considered, and the space occupancy rate is guaranteed to meet x all the time_iAnd (k +1) is not less than 0, and the established regional traffic signal control model can describe three traffic state forms of undersaturation, critical saturation and supersaturation at the intersection at the same time, so that the applicability is stronger. The control algorithm is a dynamic feedback control algorithm, can respond to the change of periodic traffic flow, estimates and dynamically adjusts a periodic timing scheme based on the number of vehicles in the current period, and realizes the balance of space occupancy in each direction. The method has the advantages that the saturation degree in all directions is balanced, so that the possibility of supersaturation in certain directions is reduced, the traffic fairness is ensured, and the overall traffic operation efficiency is improved.

Description

Regional traffic signal data driving control method based on multi-agent network

Technical Field

The invention relates to the technical field of intelligent traffic signal control, in particular to a regional traffic signal data driving control method based on a multi-agent network.

Background

The method comprises the steps of obtaining intersection flow parameters through an estimation method on the basis of floating car data (typical mobile detection data sources), constructing a periodic vehicle number estimation model on the basis of a storage-forwarding modeling method, and constructing a regional traffic signal data driving control model which aims at space occupancy balance on the basis of a multi-agent network.

In the traditional analysis of a store-and-forward model, generally, only the traffic signal timing problem in the oversaturated traffic state is considered, more green light time is allocated to a certain direction and a certain period of an undersaturated intersection, and the problem of green light time waste, namely the idle discharge phenomenon, exists; in fact, at this time, the traffic capacity can be guaranteed only by allocating less green light time. However, most traffic signal controls are periodic control, and the constraint of the maximum and minimum green time of the phase is also required to be met, so the adjustable range of the green time in the traditional signal control is limited, and the control problems of three traffic states of undersaturation, critical saturation and oversaturation cannot be perfectly compatible.

In order to solve the problem, in the establishment of a store-and-forward model, constraint conditions for green light time of an intersection are adjusted, a time-varying control signal period C (k) is introduced, and a brand-new green light time constraint condition is obtained, so that green light time waste is avoided, the constraint conditions for breaking a fixed period are considered, and the space occupancy rate is guaranteed to meet x all the time_iAnd (k +1) is not less than 0, and the established regional traffic signal control model can describe three traffic state forms of undersaturation, critical saturation and supersaturation at the intersection at the same time, so that the applicability is stronger.

The invention content is as follows:

the method is based on the practical problems of urban regional traffic detection and signal control, high-precision low-occupancy-ratio floating car data are used as data sources, intersection flow parameters are obtained through an estimation method, a multi-intersection multi-directional vehicle conservation equation is established by combining the networking property, the expansibility and the reproducibility of distributed single-intersection multi-directional space occupancy balance control with a positive system model, and a regional traffic signal data driving control method based on a multi-agent network and aiming at space occupancy balance is provided.

With the rapid development of intelligent vehicles and internet traffic and communication technologies, the scale, quality, accuracy, instantaneity and the like of mobile detection data are greatly improved. The signal control method of the invention is based on floating car data (typical mobile detection data source), does not depend on any model information and fixed detector input, and can provide a better theoretical and application basis for the optimization design of a novel traffic control system under the future road environment. The invention adopts the following technical scheme:

a regional traffic signal data driving control method based on a multi-agent network comprises the following steps:

(1) creating a component-positive system model

Wherein x is_m(k) N represents the state of the component system element m at the time k, wherein m is equal to or more than 0, and m is equal to 1; a is_mnM ≠ n denotes the state quantity transfer proportionality coefficient of the component system element m to the component system element n in the sampling period T > 0; i is_m(k) Not less than 0 and O_m(k) ≧ 0 represents input and output of Compartment System element m within the sampling period, respectively, and a_mmThe proportional coefficient is output for the state quantity of more than or equal to 0;

describing the component system model into a vector form:

x(k+1)＝Ax(k)+I(k)

in the formula: x (k) ═ x₁(k)，...，x_N(k)]^T∈R^NIs a system state vector;

I(k)＝[I₁(k)，...，I_N(k)]^T∈R^Ninputting for the outside of the system; a is an element of R^N×NIs a system state matrix and has

Assuming that the current state quantity of any component is greater than or equal to the total quantity of state transitions in the sampling period, diagonal elements of the state matrix A are all non-negative, and the column sum satisfies:

(2) establishing a traffic area multi-intersection variable-period multi-direction space occupancy model with green light time constraint

The global balanced dynamic model of its space occupancy can be described as:

wherein the global space occupancy state vector is x ═ x₁，...，x_M]^T∈R^M，x_i∈R^NThe global nonlinear dynamic vector is f ═ f₁，...，f_M]^T∈R^M，f_i∈R^NControlling the input green time to

u_i∈R^NM represents the number of urban regional intersections, and N represents the number of each direction of a single intersection;

the distributed consistency coordination error for defining the space occupancy rate of the ith direction of the mth intersection is as follows:

wherein the content of the first and second substances,

representing a connection matrix of a multi-agent system formed by a single cross port and multiple directions,

connection matrix representing multi-intersection constituting multi-agent system, b_iThe connection coefficient between the space occupancy of the ith direction of the mth intersection and the mean value of the expected space occupancy is represented;

the distributed global consistency coordination error of the space occupancy of each direction of the single intersection is described as follows:

wherein e ═ e₁，...，e_N]^T∈R^N，

B＝diag(b_i)∈R^N×NRepresenting a diagonal matrix;

according to the distributed consistency coordination error, a consistency coordination control item v in the following form is selected_m，i(k)，

Or

Wherein c > 0 represents an error learning gain,

distributed green light time coordination control strategy for each direction of mth intersection

The design is as follows,

or

Where μ > 0 denotes the coordinated control gain, p_m，i> 0, and the parameter learning rate is designed as follows:

or

Wherein, F_i＝Π_iIs greater than 0; κ > 0 represents the parameter learning gain;

the control and learning gains satisfy the following conditions:

wherein P ═ P^T∈R^N×N＞0，Q＝Q^T∈R^N×N> 0 is a positive definite matrix;

wherein the global consistency coordination error vector e (k) e R^NThe consistency converges to a bounded neighborhood near the zero value, and the neighborhood upper bound of the consistency coordination error is reduced by increasing the error learning gain c.

Drawings

FIG. 1 is a schematic diagram of the component model.

Detailed Description

A positive system with n ≧ 2 components is shown in FIG. 1, in which x_m(k) N represents the state of the component system element m at the time k, wherein m is equal to or more than 0, and m is equal to 1; a is_mnM ≠ n denotes the state quantity transfer proportionality coefficient of the component system element m to the component system element n in the sampling period T > 0; i is_m(k) Not less than 0 and O_m(k) ≧ 0 represents input and output of Compartment System element m within the sampling period, respectively, and a_mmAnd more than or equal to 0 is a state quantity output proportional coefficient. Thus, the state of the component system element m satisfies the following conservation equation:

further, equation (4-93) is written in vector form:

x(k+1)＝Ax(k)+I(k)

thus, the state matrix A is a non-negative matrix, so that the component system is a type of positive system, and the matrix A is referred to as the component matrix.

The traffic area multi-intersection variable-period multi-direction space occupancy model considering the green light time constraint is as follows:

the global balanced dynamic model of its space occupancy can be described as:

u_i∈R^NM represents the number of urban regional intersections, and N represents the number of each direction of a single intersection.

The control target of the space occupancy equilibrium control is to make the distribution consistency coordination error of the space occupancy of each intersection in each direction zero. The distributed consistency coordination error for defining the space occupancy rate of the ith direction of the mth intersection is as follows:

wherein the content of the first and second substances,

connection matrix representing multi-intersection constituting multi-agent system, b_iAnd the connection coefficient between the space occupancy of the ith direction of the mth intersection and the mean value of the expected space occupancy is shown.

The distributed global consistency coordination error of the space occupancy of each direction of the single intersection can be described as follows:

wherein e ═ e₁，...，e_N]^T∈R^N，

B＝diag(b_i)∈R^N×NA diagonal matrix is represented.

Considering a multi-agent multi-intersection multidirectional space occupancy network system, all direction communication directed graphs at each intersection are strongly connected, and at least one b exists_iNot equal to 0, and selecting a consistency coordination control item v in the following form according to the distributed consistency coordination error_m，i(k)，

Or

Wherein c > 0 represents an error learning gain,

The design is as follows,

or

or

Wherein, F_i＝Π_iIs greater than 0; κ > 0 represents the parameter learning gain.

The control and learning gains satisfy the following conditions:

then there is a global consistency coordination error vector e (k) e R^NThe method is characterized in that the method is uniformly converged in a bounded neighborhood near a zero value, the space occupancy of each direction of a single intersection is uniformly converged to a desired space occupancy, and the upper boundary of the neighborhood of the consistency coordination error can be reduced by increasing the error learning gain c.

Claims

1. A regional traffic signal data driving control method based on a multi-agent network is characterized by comprising the following steps:

(1) creating a component-positive system model

Wherein x is_m(k) N represents the state of the component system element m at the time k, wherein m is equal to or more than 0, and m is equal to 1; a is_mnAnd m ≠ n denotes Compertent System elements m through CompaThe state quantity of the moment system element n in the sampling period T is greater than 0 and is transferred to a proportionality coefficient; i is_m(k) Not less than 0 and O_m(k) More than or equal to 0 respectively represents the input and the output of the component system element m in the sampling period;

describing the component system model into a vector form:

x(k+1)＝Ax(k)+I(k)

The global balanced dynamic model of its space occupancy can be described as:

wherein the global space occupancy state vector is x ═ x₁，...，x_M]^T∈R^M，x_i∈R^NGlobal non-linear dynamicsThe vector is f ═ f₁，...，f_M]^T∈R^M，f_i∈R^NControlling the input green time to

wherein the content of the first and second substances,

connection matrix representing multi-intersection constituting multi-agent system, b_m，iThe connection coefficient between the space occupancy of the ith direction of the mth intersection and the mean value of the expected space occupancy is represented;