WO2021073080A1

WO2021073080A1 - Nash bargaining criterion-based man-car cooperative game control method

Info

Publication number: WO2021073080A1
Application number: PCT/CN2020/090243
Authority: WO
Inventors: 赵万忠; 张子俊; 周小川; 郭志强; 王安
Original assignee: 南京航空航天大学
Priority date: 2019-10-15
Filing date: 2020-05-14
Publication date: 2021-04-22
Also published as: CN110826192B; CN110826192A

Abstract

A nash bargaining criterion-based man-car cooperative smart steering game control method, comprising: establishing a sixth-order automobile dynamic model, a driver optimal preview model, and a driver neuromuscular model; identifying some important parameters in the driver neuromuscular model, and designing an active rear wheel steering controller by using a sliding mode variable structure algorithm; reasonably assuming manipulation strategies of the driver and the active rear wheel steering controller, to propose six strategy combinations of a man-car game; and calculating, by using a maximum-minimum criterion, benefits of the game parties under various policy combinations, and solving a nash bargaining solution by using a nash bargaining criterion. The man-car cooperative game control method can effectively solve a game problem and maintain a good balance between two game parties by enabling the two game parties to pre-negotiate a strategy combination.

Description

A Human-Vehicle Cooperative Game Control Method Based on Nash Negotiation Criterion

Technical field

The invention belongs to the technical field of human-vehicle game control, and specifically relates to a human-vehicle cooperative game control method based on Nash negotiation criteria.

Background technique

At present, a variety of control methods have emerged in the field of human-vehicle games, including active front-wheel steering technology that does not consider driver manipulation, active steering technology that neutralizes driver manipulation, model-based control technology, and game-based control technology.

Compared with the game-based control technology, the active steering control that does not consider the driver's manipulation only focuses on the control performance of the controller and ignores the driver's changeable manipulation, which causes frequent conflicts between the controller and the person, which is not very good. Solve the problem of the human-vehicle game effectively. The active steering technology developed in order to neutralize the driver's manipulation will stimulate the driver's response and enable the driver to improve his steering manipulation to achieve his own control purpose. Model-based control technology uses algorithms such as model predictive control and fuzzy control, but seldom pays attention to the conflict between human-vehicle co-driving. Game-based control technology uses a variety of methods such as Nash equilibrium solution and linear quadratic, but it has not fully studied the influence of the driver's driving experience and steering characteristics.

In order to solve the above problems, it is necessary to realize that the essence of the human-vehicle game is the contradiction between the driver's personal manipulation characteristics and the control rules of the controller. The two sides of the game compete for control due to different goals. In order to resolve this conflict, it is necessary to study the human-vehicle interaction process and incorporate the driver's driving habits into the design of game strategies.

Summary of the invention

In view of the above-mentioned shortcomings of the prior art, the purpose of the present invention is to provide a human-vehicle cooperative game control method based on the Nash negotiation criteria, which can fully study the human-vehicle interaction process, and control the driver's control The characteristics and driving habits are integrated into the human-vehicle cooperative game control framework to eliminate the manipulation conflict between the two sides of the game and maximize the benefits of the non-zero-sum game.

In order to achieve the above objectives, the technical solutions adopted by the present invention are as follows:

The human-vehicle cooperative game control method based on the Nash negotiation criterion of the present invention includes the following steps:

1) Establish a sixth-order vehicle dynamics model and driver model;

2) Design of active rear-wheel steering controller using sliding mode variable structure algorithm;

3) Identify the driver neuromuscular model parameters in the driver model;

4) Propose six combinations of human-car game strategy;

5) Use the maximum-minimum criterion to calculate the benefits of both parties in the game;

6) Use the Nash negotiation criterion to solve the Nash negotiation solution of the man-car game.

Preferably, the step 1) specifically includes:

The sixth-order vehicle dynamics model:

Where

Is the state vector; θ _sw is the steering wheel angle;

Is the steering wheel angular velocity; v is the lateral velocity of the vehicle; γ is the yaw angular velocity of the vehicle; Y is the lateral displacement of the vehicle; ψ is the yaw angle of the vehicle; T _d is the driver's torque input;

Is the input of the rotation angle of the rear-wheel steering motor; w is the steering resistance torque applied to the front-wheel steering system; y _c is the model output vector; the coefficient matrix is:

Where m is the mass of the car; u is the speed of the car; a and b are the distances from the center of mass of the car to the front and rear axles; J _s and B _s are the moment of inertia and steering damping of the steering system, respectively; C _f and C _r are respectively Front wheel cornering stiffness and rear wheel cornering stiffness _{of the car; I z} is the vertical moment of inertia of the car; i ₀ is the standard transmission ratio of the steering system; i _m is the reduction ratio of the rear-wheel steering motor;

The sixth-order vehicle dynamics model receives the steering wheel torque of the driver's neuromuscular model and the rear wheel angle command of the active rear-wheel steering controller, and outputs the vehicle state quantity;

According to the actual operating conditions of the driver, the sampling time is taken as T _s , and the driver model is established, including:

Pilot preview model:

In the formula, k represents the pilot's preview point number; x _d (k) is the state vector at the k-th preview point; is the driver torque input at the k-th preview point; y _d (k) The output vector for the model at the kth preview point; the coefficient matrix is:

By using a shift register, the update process of the pilot preview information can be expressed as:

In the formula, Y _pa (k) and ψ _pa (k) are respectively the lateral displacement and yaw angle of the car at the k-th preview point; the number of preview points n=T _p /T _s ; T _p is Preview time

Driver neuromuscular model:

In the formula, G _d (s) also represents the transfer function from the driver's preview input to the driver's torque output; s is the Laplacian operator; K _r and B _r are the reflection stiffness and reflection damping, respectively; τ _r Is the transmission delay; ω _r is the cutoff frequency.

The driver’s optimal preview model takes road preview information as input, and its output is the optimal steering wheel angle; the driver’s neuromuscular model takes the optimal steering wheel angle as input, and its output is the steering wheel torque. The steering wheel torque is transferred to the sixth-order vehicle dynamics model.

Preferably, the design process of the active rear-wheel steering controller in step 2) is specifically as follows:

In order to make the vehicle yaw rate accurately track the reference value, the sliding mode variable structure algorithm is used to design the active rear-wheel steering controller, and the error index is e=γ ^* -γ, and the switching function is

Control rate

Among them, γ ^* and γ are the expected yaw rate and actual yaw rate respectively; c is the switching coefficient, α is the error coefficient, and β is the error rate coefficient. The three parameters determine the control effect of the algorithm on the stability of the car; choose two Group parameter values (0.82, 0.47, 0.11), (0.63, 0.35, 0.08), the control intensity of the former is slightly higher than the latter, and the oscillation of the two is relatively small.

The rear wheel angle command output from the active rear wheel steering controller is transmitted to the sixth-order vehicle dynamics model.

Preferably, the process of identifying the parameters of the driver's neuromuscular model in step 3) is specifically:

Facing the same driving conditions, different drivers _{exhibit different steering characteristics by adjusting K r} , B _r and ω _r , that is, adopt different steering strategies; in fact, the transmission delay varies little between different drivers. It can be taken as τ _r =0.04s. In the present invention, the transmission delay is reflected in the data stream of MATLAB/Simulink, so it can be omitted in the following formula derivation; the relationship between the input and output of the driver's neuromuscular model is:

In the formula, T _d (s) is the driver’s output steering wheel torque,

The output of the driver's optimal preview model is the driver's expected steering wheel angle;

Incorporating the above driver neuromuscular model into the driver input-output relationship expression, we get:

In the formula, a _i (i=1, 2, 3) is the parameter to be identified.

The identified parameters of the driver's neuromuscular model are input to the Nash negotiation criteria, which are used to generate the driver's manipulation strategy set to form six combinations of human-vehicle game strategies.

Preferably, the six human-vehicle game strategy combinations in step 4) are specifically:

According to the identification result of the driver's neuromuscular model, the driver's manipulation strategy set can be expressed as

It contains three game strategies:

In the formula, → the parameter value on the left becomes the value on the right after the game starts. The change in the parameter value represents the different strategies adopted by the driver, K _r ⁰ , B _r ⁰ ,

They are the identified muscle stiffness, muscle damping and cut-off frequency;

At the same time, the active rear-wheel steering controller can choose a strong interference strategy

That is, the controller parameters take (c, α, β) = (0.82, 0.47, 0.11), or weak interference strategy

That is, the controller parameters take (c, α, β) = (0.63, 0.35, 0.08), and its strategy set is

Therefore, there are a total of six strategy combinations for both parties in the game

i=1, 2, 3, j=1, 2.

Preferably, the use of the maximum-minimum criterion in the step 5) to calculate the benefits of the two parties in the game is specifically as follows:

In the game, the goals of the two parties are different. The driver’s goal is to make the actual lateral displacement Y(k) of the car equal to the lateral displacement Y _pa (k) at the road, and to make the actual yaw angle ψ(k) of the car equal to the lateral displacement of the road. Swing angle ψ _pa (k); and the goal of the controller is to make the actual yaw rate γ (k) of the car equal to the desired yaw rate γ ^* (k), and to make the lateral acceleration u·γ (k) of the car as small as possible ; Therefore, the formula for calculating the benefits of both parties can be expressed as:

In the formula, P _ij and Q _ij respectively represent the strategy combination

Below, the gains of the driver and the active rear-wheel steering system; ω _l is the weight of the gain index, where l=1, 2, 3, 4, which aims to normalize the gain index for comparison; g is the local gravity acceleration ；

Let the driver and the active rear-wheel steering controller adopt a certain set of fixed strategies. After collecting the driving data of the car in the double shifting condition, the profit of both parties can be calculated according to the profit calculation formula; after test and measurement, in the strategy combination

Under the circumstance, the gains of both parties are P _ij =1.6341 and Q _ij =4.0049. In the strategy combination

Under the strategy combination, the gains of both parties are P _ij = 2.1679 and Q _ij = 1.9022.

Under the circumstance, the gains of both parties are P _ij =3.0004 and Q _ij =8.1775. In the strategy combination

Under the circumstance, the gains of both parties are P _ij =3.7883 and Q _ij =3.2357. In the strategy combination

Under the circumstance, the gains of both parties are P _ij =2.2804 and Q _ij =6.3381. In the strategy combination

In the next step, the income of both parties is P _ij =2.9147,Q _ij =2.5386;

Then use the maximum-minimum criterion to find the maximum-minimum value of both parties in the game:

Preferably, the Nash negotiation solution using the Nash negotiation criterion in the step 6) to solve the human-vehicle game is specifically as follows:

First draw the benefits of both parties in the game on a two-dimensional plane, where the horizontal axis is the driver's revenue, and the vertical axis is the revenue of the active rear-wheel steering system;

Then draw the maximum-minimum value of the two parties to determine the Nash negotiation set {(p,q)|q=-6.2721p+26.9964,3.0004≤p≤3.6657}, then the Nash negotiation solution (p ^* ,q ^* ) Must exist in the Nash negotiation center;

Then find the point that maximizes the overall income I _n =(pv _D )(qv _AD ) in the Nash negotiation center, and find the Nash negotiation solution (p ^* , q ^* ).

The beneficial effects of the present invention:

1. Compared with other game control methods, the present invention does not place the driver and the advanced driver assistance system in an opposing position, allowing the two to operate independently according to their respective strategies, but applies the concept of non-zero-sum game to humans. -In the car game, solve the problem of manipulation conflict.

2. In order to integrate the driver’s personal driving habits and manipulation characteristics into the design of the game control strategy, the present invention identifies some important parameters of the driver’s neuromuscular model, and designs three possible driver’s options based on the identification results. Kind of strategy.

3. By analyzing the feasible strategies of the driver and the active rear-wheel steering system, the present invention proposes six game strategy combinations as the basis for solving the game problem.

4. By analyzing the control targets of the driver and the active rear-wheel steering system, the present invention proposes a profit calculation method for both, which constitutes the profit of both.

5. The present invention uses the Nash negotiation criterion to solve the Nash negotiation solution of the game parties based on the profit. This negotiation solution will eliminate the human-vehicle manipulation conflict to the greatest extent and meet the goals of the game parties.

Description of the drawings

Figure 1 is a block diagram of a human-vehicle cooperative game control method;

Figure 2 is a schematic diagram of the pilot preview model;

Figure 3 is a schematic diagram of human-vehicle-road interaction;

Figure 4 is a schematic diagram of man-vehicle revenue and Nash negotiation set.

Detailed ways

In order to facilitate the understanding of those skilled in the art, the present invention will be further described below in conjunction with the embodiments and the drawings, and the content mentioned in the embodiments does not limit the present invention.

Referring to Figure 1, a human-vehicle cooperative game control method based on Nash negotiation criteria of the present invention is characterized in that it includes the following steps:

1) Establish a sixth-order vehicle dynamics model and driver model;

3) Identify the driver neuromuscular model parameters in the driver model;

4) Propose six combinations of human-car game strategy;

Wherein, the step 1) specifically includes:

The sixth-order vehicle dynamics model:

Where

Is the state vector; θ _sw is the steering wheel angle;

(1) Pilot preview model, as shown in Figure 2:

(2) Driver neuromuscular model:

In the formula, G _d (s) represents the transfer function from driver preview input to driver torque output; s is the Laplacian operator; K _r and B _r are reflection stiffness and reflection damping respectively; τ _r is Transmission delay; ω _r is the cut-off frequency.

The process of interaction between the driver and the car is shown in Figure 3. The driver uses vision to obtain road preview information, uses tactile sense to receive car state feedback information, and calculates the optimal steering wheel angle through the optimal preview model.

Then the optimal steering wheel angle command is executed through the neuromuscular model G _d (s), that is, the driver outputs the steering wheel torque T _d , and the steering wheel torque is transmitted to the steering system together with the torque fed back from the steering system, and finally produces the actual The steering wheel angle θ _sw also acts on the sixth-order dynamics model of the car.

Wherein, the step 2) specifically includes: using a sliding mode variable structure algorithm to design an active rear wheel steering controller, taking the error index as e = γ ^* -γ, and the switching function as

Control rate

The process of identifying the parameters of the driver's neuromuscular model in step 3) is specifically as follows:

In the formula, T _d (s) is the driver’s output steering wheel torque,

In the formula, a _i is the parameter to be identified, i=1, 2, 3.

The six human-vehicle game strategy combinations in the step 4) are specifically:

According to the identification results of the driver’s neuromuscular model, the driver’s manipulation strategy set is expressed as

It contains three game strategies:

They are the identified muscle stiffness, muscle damping and cut-off frequency;

There are a total of six strategy combinations for both sides of the game

i=1, 2, 3, j=1, 2.

The use of the maximum-minimum criterion in the step 5) to calculate the benefits of both parties in the game is specifically:

In the game, the goals of the two parties are different. The driver’s goal is to make the actual lateral displacement Y(k) of the car equal to the lateral displacement Y _pa (k) at the road, and to make the actual yaw angle ψ(k) of the car equal to the lateral displacement of the road. Swing angle ψ _pa (k); and the goal of the controller is to make the actual yaw rate γ (k) of the car equal to the desired yaw rate γ ^* (k), and to make the lateral acceleration u·γ (k) of the car as small as possible ; The income calculation formula of both parties is expressed as:

In the formula, P _ij and Q _ij respectively represent the strategy combination

Let the driver and the active rear-wheel steering controller adopt a certain set of fixed strategies, collect the driving data of the car under the double-line shifting condition, and calculate the revenue of both parties according to the revenue calculation formula; after testing and measurement, in the strategy combination

In the next step, the income of both parties is P _ij =2.9147,Q _ij =2.5386;

Referring to Fig. 4, the Nash negotiation solution using the Nash negotiation criterion to solve the human-car game in the step 6) is specifically:

Then find the point that maximizes the overall income I _n = (pv _D )(qv _AD ) in the Nash negotiation center, and find the Nash negotiation solution (p ^* , q ^* ), (p ^* , q ^* ) = (3.3330, 6.0912) ).

There are many specific applications of the present invention. The above are only the preferred embodiments of the present invention. It should be pointed out that for those of ordinary skill in the art, without departing from the principle of the present invention, several improvements can be made. These Improvements should also be regarded as the protection scope of the present invention.

Claims

A human-vehicle cooperative game control method based on Nash negotiation criteria, which is characterized in that it includes the following steps:

1) Establish a sixth-order vehicle dynamics model and driver model;

2) Design of active rear-wheel steering controller using sliding mode variable structure algorithm;

3) Identify the driver neuromuscular model parameters in the driver model;

4) Propose six combinations of human-car game strategy;

5) Use the maximum-minimum criterion to calculate the benefits of both parties in the game;

6) Use the Nash negotiation criterion to solve the Nash negotiation solution of the man-car game.
The human-vehicle cooperative game control method based on Nash negotiation criteria according to claim 1, wherein said step 1) specifically comprises:

The sixth-order vehicle dynamics model:

Where
Is the state vector; θ sw is the steering wheel angle;
Is the steering wheel angular velocity; v is the lateral velocity of the vehicle; γ is the yaw angular velocity of the vehicle; Y is the lateral displacement of the vehicle; ψ is the yaw angle of the vehicle; T d is the driver's torque input;
Is the input of the rotation angle of the rear-wheel steering motor; w is the steering resistance torque applied to the front-wheel steering system; y c is the model output vector; the coefficient matrix is:

Where m is the mass of the car; u is the speed of the car; a and b are the distances from the center of mass of the car to the front and rear axles; J s and B s are the moment of inertia and steering damping of the steering system, respectively; C f and C r are respectively Front wheel cornering stiffness and rear wheel cornering stiffness of the car; I z is the vertical moment of inertia of the car; i 0 is the standard transmission ratio of the steering system; i m is the reduction ratio of the rear-wheel steering motor;

According to the actual operating conditions of the driver, the sampling time is taken as T s , and the driver model is established, including:

(1) Pilot preview model:

In the formula, k represents the pilot's preview point number; x d (k) is the state vector at the k-th preview point; is the driver torque input at the k-th preview point; y d (k) The output vector for the model at the kth preview point; the coefficient matrix is:

By using a shift register, the update process of the pilot preview information can be expressed as:

In the formula, Y pa (k) and ψ pa (k) are respectively the lateral displacement and yaw angle of the car at the k-th preview point; the number of preview points n=T p /T s ; T p is Preview time

(2) Driver neuromuscular model:

In the formula, G d (s) represents the transfer function from driver preview input to driver torque output; s is the Laplacian operator; K r and B r are reflection stiffness and reflection damping respectively; τ r is Transmission delay; ω r is the cut-off frequency.
The human-vehicle cooperative game control method based on Nash negotiation criteria according to claim 2, wherein the design process of the active rear-wheel steering controller in step 2) is specifically:

Use the sliding mode variable structure algorithm to design the active rear wheel steering controller, take the error index as e = γ * -γ, and the switch function as
Control rate
Among them, γ * and γ are the expected yaw rate and actual yaw rate respectively; c is the switching coefficient, α is the error coefficient, and β is the error rate coefficient. The three parameters determine the control effect of the algorithm on the stability of the car; choose two Group parameter values (0.82, 0.47, 0.11), (0.63, 0.35, 0.08), the control intensity of the former is slightly higher than the latter, and the oscillation of the two is relatively small.
The human-vehicle cooperative game control method based on Nash negotiation criteria according to claim 3, wherein the process of identifying the parameters of the driver's neuromuscular model in step 3) is specifically:

Facing the same driving conditions, different drivers exhibit different steering characteristics by adjusting K r , B r and ω r , that is, adopt different steering strategies; the relationship between the input and output of the driver's neuromuscular model is :

In the formula, T d (s) is the driver’s output steering wheel torque,
The output of the driver's optimal preview model is the driver's expected steering wheel angle;

Incorporating the above driver neuromuscular model into the driver input-output relationship expression, we get:

In the formula, a i is the parameter to be identified, i=1, 2, 3.
The human-vehicle cooperative game control method based on Nash negotiation criteria according to claim 4, wherein the six human-vehicle game strategy combinations in step 4) are specifically:

According to the identification results of the driver’s neuromuscular model, the driver’s manipulation strategy set is expressed as
It contains three game strategies:

In the formula, → the parameter value on the left becomes the value on the right after the game starts. The change in the parameter value represents the different strategies adopted by the driver, K r 0 , B r 0 ,
They are the identified muscle stiffness, muscle damping and cut-off frequency;

At the same time, the active rear-wheel steering controller can choose a strong interference strategy
That is, the controller parameters take (c, α, β) = (0.82, 0.47, 0.11), or weak interference strategy
That is, the controller parameters take (c, α, β) = (0.63, 0.35, 0.08), and its strategy set is

There are a total of six strategy combinations for both sides of the game
The human-vehicle cooperative game control method based on the Nash negotiation criterion according to claim 5, characterized in that, in the step 5), using the maximum-minimum criterion to calculate the benefits of the two parties in the game is specifically:

In the game, the goals of the two parties are different. The driver’s goal is to make the actual lateral displacement Y(k) of the car equal to the lateral displacement Y pa (k) at the road, and to make the actual yaw angle ψ(k) of the car equal to the lateral displacement of the road. Swing angle ψ pa (k); and the goal of the controller is to make the actual yaw rate γ (k) of the car equal to the desired yaw rate γ * (k), and to make the lateral acceleration u·γ (k) of the car as small as possible ; The income calculation formula of both parties is expressed as:

In the formula, P ij and Q ij respectively represent strategy combination
Below, the gains of the driver and the active rear-wheel steering system; ω l is the weight of the gain index, where l=1, 2, 3, 4, which aims to normalize the gain index for comparison; g is the local gravity acceleration ；

Let the driver and the active rear-wheel steering controller adopt a certain set of fixed strategies, collect the driving data of the car under the double-line shifting condition, and calculate the revenue of both parties according to the revenue calculation formula; after testing and measurement, in the strategy combination
Under the circumstance, the gains of both parties are P ij =1.6341 and Q ij =4.0049. In the strategy combination
Under the strategy combination, the gains of both parties are P ij = 2.1679 and Q ij = 1.9022.
Under the circumstance, the gains of both parties are P ij =3.0004 and Q ij =8.1775. In the strategy combination
Under the circumstance, the gains of both parties are P ij =3.7883 and Q ij =3.2357. In the strategy combination
Under the circumstance, the gains of both parties are P ij =2.2804 and Q ij =6.3381. In the strategy combination
In the next step, the income of both parties is P ij =2.9147,Q ij =2.5386;

Then use the maximum-minimum criterion to find the maximum-minimum value of both parties in the game:
The human-vehicle cooperative game control method based on the Nash negotiation criterion according to claim 6, wherein the Nash negotiation solution of the human-vehicle game using the Nash negotiation criterion in the step 6) is specifically:

First draw the benefits of both parties in the game on a two-dimensional plane, where the horizontal axis is the driver's revenue, and the vertical axis is the revenue of the active rear-wheel steering system;

Then draw the maximum-minimum value of the two parties to determine the Nash negotiation set {(p,q)|q=-6.2721p+26.9964,3.0004≤p≤3.6657}, then the Nash negotiation solution (p * ,q * ) Must exist in the Nash negotiation center;

Then find the point that maximizes the overall income I n =(pv D )(qv AD ) in the Nash negotiation center, and find the Nash negotiation solution (p * , q * ).