CN112558476B

CN112558476B - Non-linear multi-wisdom system leader-free consistent control method based on attack compensation

Info

Publication number: CN112558476B
Application number: CN202011403180.XA
Authority: CN
Inventors: 周琪; 陈广登; 鲁仁全; 李鸿一; 姚得银
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2020-12-04
Filing date: 2020-12-04
Publication date: 2022-06-14
Anticipated expiration: 2040-12-04
Also published as: CN112558476A

Abstract

The invention provides a non-leader consistent control method of a non-linear multi-agent system based on attack compensation, which solves the problem that the prior method can not realize the consistent control of a continuous time non-linear system under malicious attack, firstly establishes a continuous time non-linear multi-agent system model which is subjected to double attacks of a sensor and an executor, introduces a directed graph to describe the communication relation of the non-leader intelligent system, establishes an auxiliary variable and a consistent error based on the directed graph to realize the state consistency of agents, designs an attack compensation variable, establishes a new coordinate transformation auxiliary variable based on the attacked sensor data and the attack compensation variable, designs a virtual controller and an actual controller, designs a neural network weight adaptive law for regulating the weight of a radial basis neural network at the moment, and ensures the realization of good consistent control effect of the controllers, under the condition of considering double attacks, the states of the agents are enabled to be consistent and can be kept bounded.

Description

Non-linear multi-wisdom system leaderless consistency control method based on attack compensation

Technical Field

The invention relates to the technical field of information physical system safety and nonlinear multi-intelligence system control, in particular to a non-leader consistent control method of a nonlinear multi-intelligence system based on attack compensation.

Background

In recent decades, with the rapid development of electronic information technology, computer technology and network technology, the scale of many control systems becomes increasingly large and the internal association becomes increasingly complex, and if the traditional centralized control method is still used, the adverse consequences of large calculation amount, difficult expansion and the like are brought. Therefore, a multi-agent system based on group behaviors arises, and the multi-agent system means that a certain number of individuals (nodes) show certain behaviors and orders on a group level through self-organization and mutual cooperation.

In addition, with the rapid development of communication technology, Cyber-Physical Systems (CPS) have been developed. The CPS is mainly divided into a sensing execution layer, a network layer and an information layer. The perception execution layer mainly comprises various physical sensors and the like and is a source of information in the whole physical information system. In order to adapt to a variable environment, the network information physical system nodes are mostly arranged in an unsupervised environment and are easy to attack by attackers. Common attack approaches to the perceptual execution layer are: 1) data destruction: an attacker is unauthorized, and tampering, addition and deletion or damage and the like are carried out on the information acquired by the perception execution layer; 2) information interception: an attacker acquires information by wiring or by utilizing electromagnetic leakage in a transmission process, so that problems such as data privacy leakage and the like are caused; 3) and (3) node capture: an attacker controls part of network nodes, which may cause key leakage and endanger the communication security of the whole system. For the control aspect, whether the sensor can accurately acquire data and whether the actuator can accurately execute the control instruction has a great decisive role in whether the control performance of the system is good, and the traditional control method cannot realize good control performance when the data is damaged. Due to the characteristics of malicious attacks, existing research on security problems is basically directed at linear systems or discrete systems, and is difficult to conduct research on continuous time nonlinear systems.

The consistency control is used as a basic problem of multi-agent system research, has important theoretical research and practical application values, and the multi-agent system in real life has a plurality of applications, such as spacecrafts, unmanned planes, underwater vehicles, sensor networks, multi-mechanical arm cooperative equipment and the like. The consistency control is divided into leader consistency control and leaderless consistency control. 11/13/2018, a chinese patent (CN108803349A) discloses an optimal consistency control method and system for a nonlinear multi-agent system, which adopts a leader-follower control mode to form a multi-agent system composed of individual reference behavior models, and can efficiently solve the consistency problem of a complex multi-agent system while ensuring optimal control performance, and has practical application value and high expandability, however, in many cases, there is no leader or information of the leader cannot be normally received, for example, in a severe working environment, there is no reliable leader in the multi-agent system or the multi-agent system can communicate with the outside, and the agents can only rely on mutual information transfer to achieve consistency. Therefore, the problem of leaderless consistency is a problem of practical significance.

Disclosure of Invention

The invention provides a non-linear multi-wisdom system leaderless consistency control method based on attack compensation, which aims to solve the problem that the existing system consistency control method research cannot realize consistency control of a continuous time non-linear system under malicious attack, and does not consider the double attack condition of sensor attack and actuator attack at the same time.

In order to achieve the technical effects, the technical scheme of the invention is as follows:

a non-linear multi-intelligence system leaderless consistency control method based on attack compensation at least comprises the following steps:

s1, establishing a continuous time nonlinear multi-agent system model suffering from dual attacks of a sensor and an actuator;

s2, approximating a composite uncertainty nonlinear item in a nonlinear multi-agent system model based on a radial basis function neural network;

s3, confirming a communication topological relation among the agents in the leader-free multi-agent system by utilizing a directed graph;

s4, constructing an auxiliary variable without considering sensor attack by adopting a dynamic surface technology, and defining a consistent error;

s5, determining an attack compensation variable when the sensor is attacked, and constructing a new coordinate transformation auxiliary variable based on the attacked sensor data and the attack compensation variable;

and S6, designing a virtual controller, an actual controller and a neural network weight value self-adaptation law based on the new coordinate transformation auxiliary variable.

In the technical scheme, firstly, a continuous time nonlinear multi-agent system model suffering from double attacks of a sensor and an actuator is established, then a directed graph is introduced to describe a communication relation of a leader-free intelligent system, an auxiliary variable and a consistent error are established based on the directed graph, considering that an actual system state cannot be used for control design due to the fact that the sensor of the continuous time nonlinear multi-agent system is attacked, an attack compensation variable is designed, a new coordinate transformation auxiliary variable is established based on attacked sensor data and the attack compensation variable, a virtual controller and an actual controller are designed, at the moment, in order to adjust the weight of a radial basis neural network, a neural network weight adaptive law is designed, and the realization of a good consistent control effect of the controller is further ensured.

Preferably, the continuous-time nonlinear multi-agent system model building process of step S1 is:

defining parameters: k is 1,2,. n; q-1, 2,. n-1; p ═ 2,3,. n; r-3, 4.,. n-1, and obtaining a model expression of the continuous-time nonlinear multi-agent system as follows:

wherein x is_i＝[x_i,1,...,x_i,n]^TIs a state vector of a continuous-time nonlinear multi-agent system;

represents a complex non-linear term of uncertainty, an

u_iA controller representing a continuous time nonlinear multi-agent system to be designed, to achieve consistency control;

representing an actuator attack vector;

representing a sensor attack vector;

and

are functions of state vector and time; x is the number of_a,i＝[x_a,i,1,...,x_a,i,n]^TSensor data representing a continuous-time non-linear multi-agent system model after an attack. Here, x_a,iCan be used to control the design rather than the actual system state x_i。

Preferably, the uncertainty nonlinear term is compounded in step S2

Is uniformly set tof (x), using radial basis function neural network to compact

To any degree of accuracy

The process of approximating f (x) satisfies:

wherein x represents the input of the radial basis function neural network, and q is the input dimension of the radial basis function neural network; epsilon (x) is an approximation error, and satisfies the following conditions:

φ(x)＝[φ₁(x),φ₂(x),…,φ_l(x)]a vector of basis functions is represented as,

represents an ideal weight, defined as: for all that

The value of θ that minimizes the approximation error | ε (x) is expressed as:

in this case, the problem that it is difficult to accurately model the intelligent system, that is, the system model

The specific expression of the method cannot be accurately known, which makes the design of the controller of the continuous-time nonlinear multi-agent system to be designed difficult, namely, the method adopts neural network technology to approximate the unknown nonlinear item of the system. In fact, the disturbance related to the system state and the actuator attack function can be solved by using the neural network technology.

Preferably, the basis function vector phi_i(x) Selecting a Gaussian function, wherein the expression is as follows:

wherein l is the number of nodes of the neural network, ζ_i＝[ζ_i,1,…,ζ_iq]The acceptance domain of the basis function is represented,

representing the width of the gaussian function.

Preferably, the process of applying the directed graph to confirm the communication topology relationship between the agents in the leaderless multi-agent system in step S3 is:

providing a directed graph G ═ { V, E }, wherein V ═ 1, 2.., N } represents the set of nodes in the directed graph G, and "1, 2.., N" represents the number of the multi-agent system;

represents the set of directed edges in the directed graph, and (j, i) means that the multi-agent system i can receive information from the multi-agent system j, the multi-agent system j is a neighbor of the multi-agent system i, and the expression of the set of neighbors of the multi-agent system i is as follows: n is a radical of_i＝{j∈V(i,j)∈E,i≠j}；

Definition a ═ a_ij]∈R^n×nWeight matrix as directed graph G, a_ijIs the weight matrix element of the directed graph G; if the multi-agent system i is able to receive information from the multi-agent system j, then a_ij1(i ≠ j), otherwise, a_ij＝0。

Preferably, in step S4, the dynamic surface technique is used to construct the auxiliary variable and the consistent error without considering sensor attack, and the expression of the auxiliary variable is:

wherein, beta_iAuxiliary variables are represented to achieve leaderless consistent control.

The consistent error expression is:

e_i,1＝x_i,1-β_i

in fact, when e_i,1X → 0 time_i,1→β_iThe output of the agent i in the multi-agent system is consistent with the output of the neighbor; the error table expression is:

wherein the content of the first and second substances,

representing a virtual controller alpha_i,p-1Output after passing through a first order filter to avoid the need for repeating pairs of alpha in controller design_i,p-1The differentiation causes a problem of an increase in the amount of calculation;

the expression is satisfied:

wherein, tau_i,p-1Denotes the time constant, τ_i,p-1And if the filtering error is more than 0, the filtering error is obtained as follows:

wherein eta is_i,p-1Representing the filtering error.

Preferably, the attack compensation variable when the sensor attack is suffered in step S5 is set as

Attack compensation variable

Satisfies the following conditions:

wherein, b_i,k,c_i,k,d_i,k,ρ_i,kAre all design parameters, b_i,k,c_i,k,d_i,k,ρ_i,kAre all greater than 0.

Preferably, the attacked sensor data in step S5 is x_a,iBased on the attacked sensor data as x_a,iAnd attack compensation variables

The expression for constructing the new coordinate transformation is as follows:

wherein z is_i,pRepresenting sensor data based on attacks as x_a,iAnd attack compensation variables

And constructing a new auxiliary variable after coordinate transformation. Due to unknown data injection attacks

The actual state vector x_iCan not be accurately measured, error variable e_i,kCannot be used for subsequent design of controllers or adaptation lawsThus, the auxiliary variable z is constructed_i,pAnd a subsequent virtual controller can be designed by combining the neural network technology.

Preferably, the expression of the designed virtual controller in step S6 is:

actual controller u based on virtual controller relates to_iComprises the following steps:

wherein phi is_i,kIs a Gaussian function of the neural network,

the self-adaptive parameters are the weights of the radial basis function neural network.

In order to avoid the problem of excessive calculation caused by repeatedly deriving the virtual controller, a dynamic surface technology is adopted, and the virtual control signal is made to pass through a first-order filter to obtain the virtual control signal and an approximate value of a derivative of the virtual control signal.

Preferably, the adaptive parameters of the weights of the radial basis function neural network

The self-adaptive rate of (2) satisfies:

wherein, gamma is_i,kRepresenting the adaptive law gain matrix, σ_i,kTo modify the parameter matrix.

The weight of the adaptive law on-line adjusting neural network is designed based on the Lyapunov stability theorem, and a good control effect is achieved by combining an actual controller.

Compared with the prior art, the technical scheme of the invention has the beneficial effects that:

the invention provides a leader-free consistent control method of a nonlinear multi-agent system based on attack compensation, which comprises the steps of firstly establishing a continuous time nonlinear multi-agent system model suffering from dual attacks of a sensor and an actuator, introducing a directed graph to describe a communication relation of the leader-free intelligent system, establishing an auxiliary variable and a consistent error based on the directed graph to realize the state consistency of the intelligent system, considering that the actual system state cannot be used for control design due to the fact that the sensor of the continuous time nonlinear multi-agent system is attacked, designing an attack compensation variable, establishing a new coordinate transformation auxiliary variable based on attacked sensor data and the attack compensation variable, designing a virtual controller and an actual controller, designing a neural network weight self-adaption law for adjusting the weight of a radial basis neural network, and further ensuring the realization of good consistent control effect of the controller, under the condition of considering double attacks, the states of the intelligent agents are consistent, the bounded state can be kept, and the consistency control of the nonlinear multi-intelligent-agent system without a leader is realized.

Drawings

Fig. 1 shows a flow chart of a non-linear multi-intelligence system leaderless coherent control method based on attack compensation proposed in an embodiment of the present invention;

FIG. 2 is a topological diagram of communication relationships between multi-agent systems in accordance with an embodiment of the present invention;

FIG. 3 shows state x of the multi-agent system proposed in the embodiment of the present invention_i,1A graph of the relationship over time t;

FIG. 4 shows state x of the multi-agent system proposed in the embodiment of the present invention_i,2A graph of the relationship over time t;

FIG. 5 shows state x of the multi-agent system proposed in the embodiment of the present invention_i,1，x_i,2A relation curve chart of the time t and the time t;

FIG. 6 shows a controller signal u according to an embodiment of the present invention_iA graph of the relationship over time t;

FIG. 7 is a representation of a leader-less uniform auxiliary variable signal β as set forth in an embodiment of the present invention_iA graph of the relationship over time t;

FIG. 8 shows a first attack compensation signal proposed in an embodiment of the present invention

A graph of the relationship over time t;

FIG. 9 shows a second shot compensation signal proposed in an embodiment of the present invention

Graph of the relationship over time t.

Detailed Description

The drawings are for illustrative purposes only and are not to be construed as limiting the patent;

for better illustration of the present embodiment, certain parts of the drawings may be omitted, enlarged or reduced, and do not represent actual dimensions;

it will be understood by those skilled in the art that certain well-known descriptions of the figures may be omitted.

The technical solution of the present invention is further described below with reference to the accompanying drawings and examples.

Example 1

The positional relationships depicted in the drawings are for illustrative purposes only and are not to be construed as limiting the present patent;

a flow chart of a method for leaderless cooperative control of a nonlinear multi-wisdom system based on attack compensation as shown in fig. 1, the steps of the method comprising:

s1, establishing a continuous time nonlinear multi-agent system model suffering from double attacks of a sensor and an actuator;

and S6, designing a virtual controller, an actual controller and a neural network weight self-adaptation law based on the new coordinate transformation auxiliary variable.

The process of establishing the continuous-time nonlinear multi-agent system model in step S1 is as follows:

represents a complex non-linear term of uncertainty, an

representing an actuator attack vector;

representing a sensor attack vector;

and

In the present embodiment, the uncertainty nonlinear term is compounded in step S2

Uniformly set as f (x), and tightly gather by using radial basis function neural network

To any degree of accuracy

The process of approximating f (x) satisfies:

wherein x represents the input of the radial basis function neural network, and q is the input dimension of the radial basis function neural network; epsilon (x) is the approximation error, and satisfies:

φ(x)＝[φ₁(x),φ₂(x),...,φ_l(x)]a vector of basis functions is represented as,

represents an ideal weight, defined as: for all

The value of θ that minimizes the approximation error | ε (x), expressed as:

the problem that the intelligent system is difficult to accurately model is solved, namely in the system model

The specific expression of the method cannot be accurately known, which makes the design of the controller of the continuous-time nonlinear multi-agent system to be designed difficult, namely, the method adopts neural network technology to approximate the unknown nonlinear item of the system. In fact, the disturbance related to the system state and the actuator attack function can be solved by using the neural network technology. Vector of basis functions phi_i(x) Selecting a Gaussian function, wherein the expression is as follows:

wherein l is the number of nodes of the neural network, ζ_i＝[ζ_i,1,...,ζ_iq]The acceptance domain of the basis function is represented,

representing the width of the gaussian function.

In this embodiment, the process of applying the directed graph to confirm the communication topology relationship between the agents in the leaderless multi-agent system in step S3 is as follows:

represents the set of directed edges in the directed graph, and (j, i) means that the multi-agent system i can receive information from the multi-agent system j, the multi-agent system j is a neighbor of the multi-agent system i, and the neighbor set expression of the multi-agent system i is as follows: n is a radical of_i＝{j∈V(i,j)∈E,i≠j}；

Step S4, constructing the auxiliary variable and the consistent error without considering the sensor attack by using the dynamic surface technique, where the expression of the auxiliary variable is:

The consistent error expression is:

wherein, the first and the second end of the pipe are connected with each other,

representing a virtual controller alpha_i,p-1Output after passing through a first order filter to avoid the need for repeating pairs of alpha in controller design_i,p-1Differentiation causes a problem of an increase in the amount of calculation;

the expression is satisfied:

wherein eta is_i,p-1Representing the filtering error.

In the present embodiment, the attack compensation variable when the sensor attack is applied in step S5 is set as

Attack compensation variable

Satisfy the requirement of：

In this embodiment, the attacked sensor data in step S5 is x_a,iBased on the attacked sensor data as x_a,iAnd attack compensation variables

The actual state vector x_iCan not be accurately measured, error variable e_i,kCannot be used for subsequent design of the controller or adaptation law, and therefore the auxiliary variable z is constructed_i,pAfter being designed by combining with neural network technologyAnd continuing to virtualize the controller.

The expression of the designed virtual controller is as follows:

wherein phi_i,kIs a Gaussian function of the neural network,

the self-adaptive parameters are the weights of the radial basis function neural network. In order to avoid the problem of overlarge calculated amount caused by repeatedly carrying out derivation on the virtual controller, a dynamic surface technology is adopted, and the virtual control signal is made to pass through a first-order filter to obtain the virtual control signal and an approximate value of a derivative of the virtual control signal.

Adaptive parameters of radial basis function neural network weight

The self-adaptive rate of (2) satisfies:

In order to further verify the effectiveness of the method provided by the present invention, the following is further described in combination with an actual simulation, in the present simulation, the control objective is to make the states of the multi-agent systems consistent from the unordered initial state under the attack of the sensors and the actuators, and the ith multi-agent system in the simulation is:

wherein the sensor attack signal is

The actuator attack signal is

To avoid loss of generality, the initial values for the randomly chosen agent states are:

x₁(0)＝[-1；-0.1]；x₂(0)＝[1.2；-0.5]；x₃(0)＝[0.8；1](ii) a Other initial states such as the initial states of the adaptive law and the attack compensation signalIs 0; the design parameters of the controller are as follows:

b_i,1＝b_i,2＝2；c_i,1＝c_i,2＝2；d_i,1＝d_i,2＝1；τ_i＝0.1；σ_i,1＝σ_i,2＝0.1；ρ_i＝1；Γ_i,1＝Γ_i,10.1; a communication relationship topology diagram of a multi-agent system is shown in FIG. 2, FIG. 2 is a directed graph represented by 4 nodes (1,2,3,4), wherein arrows represent information transmission directions, FIG. 3 represents a first agent system state x_i,1Graph with time t; FIG. 4 shows a second agent system state x_i,2Graph with time t; in fig. 3 and 4, the abscissa represents time, and the ordinate represents the state value of the smart system, where i is 1,2,3, 4; for a clearer presentation, FIG. 5 shows state x of the multi-agent system from a three-dimensional perspective_i,1，x_i,2And the relation curve chart of the time t, as can be seen from fig. 3 to 5, after about 4s of adjustment time, the different initial states of the agent are consistent, that is, the states are all converged to 0. FIG. 6 shows a controller signal u according to an embodiment of the present invention_iA graph of the relationship over time t; FIG. 7 is a leader-less uniform auxiliary variable signal β_iThe control signal also tends to 0 after the system is consistent with the curve chart of the relation changing along with the time t, and the control signal also tends to be 0 at beta_iAfter going to 0, the agent has reached full agreement. FIGS. 8-9 are graphs of attack compensation variable signals, with FIG. 8 showing a first attack compensation signal

A graph of the relationship over time t; FIG. 9 shows a second attack compensation signal

Graph of the relationship over time t. Since the attack signal is related to the system state, when the state goes to 0, the attack signal also gradually goes to 0, so that attack compensation is not needed, and the attack compensation signal also gradually goes to 0.

it should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.

Claims

1. A non-linear multi-intelligence system leaderless consistency control method based on attack compensation is characterized by at least comprising the following steps:

let the attack compensation variable when the sensor attack is applied in step S5 be

Attack compensation variable

Satisfies the following conditions:

wherein, b_i,k,c_i,k,d_i,k,ρ_i,kAre all design parameters, b_i,k,c_i,k,d_i,k,ρ_i,kAre all larger than 0;

the attacked sensor data in step S5 is x_a,iBased on the attacked sensor data as x_a,iAnd attack compensation variables

wherein z is_i,1And z_i,pRepresenting sensor data based on attacks as x_a,iAnd attack compensation variables

Constructing a new auxiliary variable after coordinate transformation;

s6, designing a virtual controller, an actual controller and a neural network weight value self-adaptation law based on the new coordinate transformation auxiliary variable,

the expression of the designed virtual controller in step S6 is:

wherein phi is_i,kIs a Gaussian function of the neural network,

self-adaptive parameters of the radial basis function neural network weight; adaptive parameters of radial basis function neural network weight

The self-adaptive rate of (2) satisfies:

2. The non-leader consensus control method for nonlinear multi-agent system based on attack compensation according to claim 1, wherein the process of continuous-time nonlinear multi-agent system model building of step S1 is:

represents a complex non-linear term of uncertainty, an

representing an actuator attack vector;

representing a sensor attack vector;

and

are functions of state vector and time; x is the number of_a,i＝[x_a,i,1,...,x_a,i,n]^TSensor data representing a continuous-time non-linear multi-agent system model after an attack.

3. The non-linear multi-intelligence system leaderless consistency control method based on attack compensation according to claim 2, wherein the uncertainty non-linear term is compounded in step S2

To any degree of accuracy

The process of approximating f (x) satisfies:

wherein x represents a radial basis function neural networkH is the input dimension of the radial basis function neural network; epsilon (x) is the approximation error, and satisfies:

represents an ideal weight, defined as: for all

The value of θ that minimizes the approximation error | ε (x) |, expressed as:

4. the non-linear multi-intelligence system leader-less consensus control method based on attack compensation according to claim 3, wherein the basis function vector φ_i(x) I 1, 2., l selects a gaussian function, the expression:

representing the width of the gaussian function.

5. The attack compensation-based nonlinear multi-agent system leaderless consistency control method as claimed in claim 4, wherein the process of applying the directed graph to confirm the communication topology relationship between the agents in the leaderless multi-agent system in step S3 is as follows:

providing a graph G ═ { V, E }, wherein V ═ 1, 2., N } represents a set of nodes in the directed graph G, wherein 1,2, …, N represents the number of the multi-agent system;

represents the set of directed edges in the directed graph, and (j, i) means that the multi-agent system i can receive information from the multi-agent system j, the multi-agent system j is a neighbor of the multi-agent system i, and the neighbor set expression of the multi-agent system i is as follows: n is a radical of_i＝{j∈V|(i,j)∈E,i≠j}；

Definition A ═ a_ij]∈R^n×nWeight matrix as directed graph G, a_ijIs the weight matrix element of the directed graph G; if the multi-agent system i is able to receive information from the multi-agent system j, then a_ij1, i ≠ j, otherwise, a_ij＝0。

6. The non-linear multi-wisdom system leaderless consistency control method based on attack compensation as claimed in claim 5, wherein the step S4 adopts dynamic surface technique to construct auxiliary variables and consistency errors without considering sensor attack, and the expression of the auxiliary variables is as follows:

wherein beta is_iRepresenting an auxiliary variable;

the consistent error expression is:

e_i,1＝x_i,1-β_i

wherein the content of the first and second substances,

satisfying the expression:

wherein, tau_i,p-1Denotes the time constant, τ_i,p-1And if the filtering error is more than 0, obtaining the filtering error as:

wherein eta is_i,p-1Representing the filtering error.