CN112637120B - Multi-agent system consistency control method, terminal and storage medium - Google Patents
Multi-agent system consistency control method, terminal and storage medium Download PDFInfo
- Publication number
- CN112637120B CN112637120B CN202011277400.9A CN202011277400A CN112637120B CN 112637120 B CN112637120 B CN 112637120B CN 202011277400 A CN202011277400 A CN 202011277400A CN 112637120 B CN112637120 B CN 112637120B
- Authority
- CN
- China
- Prior art keywords
- agent system
- agent
- consistency control
- consistency
- control method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/26—Special purpose or proprietary protocols or architectures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/12—Discovery or management of network topologies
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Computing Systems (AREA)
- Computer Security & Cryptography (AREA)
- Computer And Data Communications (AREA)
- Feedback Control In General (AREA)
Abstract
The invention discloses a multi-agent system consistency control method, a terminal and a storage medium, wherein an interactive topology of the multi-agent system is constructed according to communication relations among agents in the multi-agent system, a consistency control model of the multi-agent system is further constructed according to an interactive topological graph, constraints of the model are set to comprise consistency condition constraints and non-smooth inequality constraints, an optimal decoupling terminal of the consistency control model is designed, and a control protocol of the multi-agent system is determined according to the optimal decoupling terminal, so that the multi-agent system can simultaneously meet the consistency condition constraints and the non-smooth inequality constraints.
Description
Technical Field
The invention relates to the technical field of multi-agent system control, in particular to a multi-agent system consistency control method, a terminal and a storage medium.
Background
The multi-agent system can be regarded as a self-organizing intelligent unmanned platform group, numerous group tasks can be completed through interaction of information and actions, the intelligent degree of the whole group is greatly improved through cooperative behaviors among agents in the multi-agent system, complex tasks which cannot be completed by single bodies can be met, and the multi-agent system is widely applied to the fields of sensor group deployment, multi-unmanned aerial vehicle formation control, multi-mechanical arm cooperative carrying and the like.
In distributed control of a multi-agent system, the states of the multi-agent system are generally required to be consistent when the optimal solution of an optimization problem is achieved, and in addition, constraint conditions are required to be processed, for example, the motion range constraint of the multi-agent system is described as a constraint function of an indication function as the optimization problem.
Thus, there is a need for improvements and enhancements in the art.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a multi-agent system consistency control method, a terminal and a storage medium, and aims to solve the problem that in the prior art, a consistency control protocol with consistency constraint and non-smooth inequality constraint conditions does not exist in a multi-agent system consistency control scheme.
In order to solve the technical problems, the technical scheme adopted by the invention is as follows:
in a first aspect of the present invention, there is provided a multi-agent system consistency control method, the method comprising:
acquiring communication relations among all agents in a multi-agent system, and constructing an interactive topology of the multi-agent system according to the communication relations;
constructing a consistency control model of the multi-agent system according to the interaction topology, wherein constraints of the consistency control model comprise consistency condition constraints and non-smooth inequality constraints;
designing a decoupling operator of an optimal solution of the consistency control model;
determining a control protocol of the multi-agent system according to a decoupling operator of the optimal solution, such that the individual agents in the multi-agent system control their own motion according to the control protocol.
The multi-agent system consistency control method, wherein the constructing an interactive topology of the multi-agent system according to the communication relation, comprises:
the laplacian matrix of the interaction topology is: l isnD-a, wherein the matrixAssociated with agent iIs a diagonal matrix of diagonal elements, i belongs to {1, 2., n }, n is the number of agents in the multi-agent system, and an agent j and an agent i are connected through an information interaction edge eijE belongs to E connection and information interaction edge EijE represents that the agent i and the agent j can exchange information in two directions, if the information exchange is carried out, EijE, then aij=aji> 0, otherwise aij=0,Is a weighted adjacency matrix.
The multi-agent system consistency control method, wherein the building of the consistency control model of the multi-agent system according to the interaction topology includes:
obtaining cost functions and non-smooth inequality constraint information of each agent in the multi-agent system, and constructing a consistency control model of the multi-agent system according to the interactive topology and the kinematics model as follows:
wherein the content of the first and second substances,i belongs to {1, 2.,. n }, n is the number of agents in the multi-agent system,for the two local cost functions of agent i,is a smooth convex function of the image to be displayed,non-smooth convex function, stateIs a position state vector of agent i, is a q-dimensional vector, and0nqis a zero vector of the nq-dimension,is a matrix LnAnd IqKronecker product of LnLaplace matrix, I, which is the topology of the information interactionqIs a q-dimensional unit matrix and is,is a non-smooth inequality constraint for agent i,is a non-smooth convex function that can be approximated,andare respectively-Liphoxitz in continuous with-Liphoz continuous.
The multi-agent system consistency control method, wherein the designing a decoupling operator of an optimal solution of the consistency control model comprises:
the decoupling operator for designing the optimal solution of the consistency control model is as follows:
wherein, the lambda and the mu are Lagrange multipliers,as an auxiliary variable, the number of variables,for introducing parameters, alpha and beta are preset parameters, for anyi ∈ {1, …, n }, each having:z*satisfies the following conditions:and is
The multi-agent system consistency control method comprises the following control protocols of the multi-agent system:
the multi-agent system consistency control method, wherein the designing an optimal solution of the consistency control model, comprises:
the decoupling operator for designing the optimal solution of the consistency control model is as follows:
wherein, the lambda and the mu are Lagrange multipliers,as an auxiliary variable, the number of variables,for introducing parameters, alpha and beta are preset parameters, for anyi ∈ {1, …, n }, each having:z*satisfies the following conditions:and is
The multi-agent system consistency control method comprises the following control protocols of the multi-agent system:
wherein beta is more than 0 and less than 1/Lambdamax(L),0<γ<1-βΛmax(L), 0<α<mink=1,2,3[οk]C is a constant, t is a time variable, Λmax(L) is the maximum eigenvalue of L,
the multi-agent system consistency control method is characterized in that the kinematic model of the agents in the multi-agent system is as follows:wherein u isi(t) is the speed control quantity of agent i at time t,is the location state of agent i, which is a q-dimensional vector.
In a second aspect of the present invention, there is provided a terminal comprising a processor, a storage medium communicatively connected to the processor, the storage medium adapted to store a plurality of instructions, and the processor adapted to invoke the instructions in the storage medium to perform the steps of implementing the multi-agent system consistency control method according to any one of the above.
In a third aspect of the present invention, there is provided a storage medium storing one or more programs, the one or more programs being executable by one or more processors to implement the steps of the multi-agent system consistency control method of any of the above.
Compared with the prior art, the multi-agent system consistency control method, the terminal and the storage medium are provided by the invention, the multi-agent system consistency control method constructs the interactive topology of the multi-agent system according to the communication relation among the agents in the multi-agent system, further constructs the consistency control model of the multi-agent system according to the interactive topology, sets the constraints of the model to comprise consistency condition constraints and non-smooth inequality constraints, designs the optimal decoupling terminal of the consistency control model, and determines the control protocol of the multi-agent system according to the optimal decoupling terminal, so that the multi-agent system can simultaneously meet the consistency condition constraints and the non-smooth inequality constraints.
Drawings
FIG. 1 is a flow chart of an embodiment of a multi-agent system consistency control method provided by the present invention;
FIG. 2 is a schematic diagram of an interaction topology of a multi-agent system in an embodiment of a multi-agent system consistency control method provided by the present invention;
FIG. 3 is a first schematic diagram illustrating the validity verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 4 is a schematic diagram illustrating a second validity verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 5 is a third schematic diagram illustrating the validity verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 6 is a fourth schematic diagram illustrating the validity verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 7 is a schematic diagram showing the validity verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 8 is a diagram illustrating a sixth verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 9 is a seventh schematic diagram illustrating the validity verification result of the consistency control method of the multi-agent system according to the present invention;
FIG. 10 is a diagram illustrating the validity verification result of the consistency control method of the multi-agent system according to the present invention;
fig. 11 is a schematic diagram illustrating an embodiment of a terminal according to the present invention.
Detailed Description
In order to make the objects, technical solutions and effects of the present invention clearer and clearer, the present invention is further described in detail below with reference to the accompanying drawings and examples. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Example one
The multi-agent system consistency control method provided by the invention can be applied to a terminal, and the terminal can determine the control protocol of the multi-agent system through the multi-agent system consistency control method provided by the invention, so that each agent in the multi-agent system controls the self-movement according to the control protocol, and the multi-agent system consistency is realized.
In this specification, the euclidean norm is denoted by |, andrepresenting a set of real numbers.Representing a set of all non-negative real n-dimensional vectors. Matrix arrayRepresenting diagonal matrices, wherein the ith matrix diagonal element is corresponding to i e {1, …, n }(Vector)Representing an n-dimensional vector with all zero elements. (.)TRepresenting a transpose operation of the matrix.
As shown in fig. 1, in one embodiment of the multi-agent system consistency control method, the method comprises the steps of:
s100, obtaining the communication relation among all agents in the multi-agent system, and constructing the interactive topology of the multi-agent system according to the communication relation.
As shown in fig. 2, in the multi-agent system, a plurality of agents are included, and the information interaction relationship between the agents can be represented by a weighted undirected topology G (V, E, a), where V ═ 1, …, n is a set of information nodes identified by the agents in the topology, E is a set of information interactions formed between nodes in the topology,is a weighted adjacency matrix, and n is the number of agent nodes. An information interaction edge eijE means that agent i and agent j can interact information in both directions. If the information interaction is edge eijE, then aij=aji> 0, otherwise aijA is 0 andiii ∈ I, I is the set of agent numbers. Agent j ∈ NiIndicating that agent j is a neighbor of agent i, i.e. agent j and agent i interact via information edge eijIs E connected to NiA set of agent i neighbors. Both the in-degree and out-degree of agent iThe topology may be represented by a Laplace matrix of the topology, and the Laplace matrix of the interaction topology of the multi-agent system may be represented by a Laplace matrix of the interaction topology of the multi-agent systemIs LnWherein, L is ═ D-anRepresenting the subject matrix as an n-dimensional matrix, the matrixAssociated with agent iIs a diagonal matrix of diagonal elements.
After determining the interaction topology of the multi-agent system, the multi-agent system consistency control method provided by the embodiment further includes the steps of:
s200, constructing a consistency control model of the multi-agent system according to the interactive topology.
Constraints in the consistency control model include consistency condition constraints and non-smooth inequality constraints, and particularly, the agents in the multi-agent system need to consider the problems of shortest execution time, minimum energy consumption, shortest path and the like in task execution, so that the completion of the tasks of the multi-agent system involves optimizing one or more cost functions, that is, the control of the agents in the multi-agent system needs to make the cost function values as small as possible. Based on this, a consistency control model of the multi-agent system can be constructed as follows:
wherein the content of the first and second substances,i belongs to {1, 2.,. n }, n is the number of agents in the multi-agent system,for the two local cost functions of agent i,is a smooth convex function of the image to be displayed,non-smooth convex function, stateIs a position state vector of agent i, is a q-dimensional vector, and0nqis a zero vector of the nq-dimension,is a matrix LnAnd IqKronecker product of LnLaplace matrix, I, which is the topology of the information interactionqIs a q-dimensional unit matrix and is,is a non-smooth inequality constraint for agent i,is a non-smooth convex function that can be approximated,andare respectively-Liphoxitz in continuous with-Liphoz continuous.
The consistency constraint is defined by Lx being 0nqRepresenting, non-smooth inequality constraints byThat is, in this embodiment, the cost function of the multi-agent system needs to be optimized based on the consistency condition and the non-smooth inequality constraint condition.
In this embodiment, the agent in the multi-agent system is a first-order integrator model, i.e. the kinematic model is:wherein u isi(t) is the speed control quantity of agent i at time t,is the location state of agent i, which is a q-dimensional vector. That is, in the consistency control model, the position state is a variable,is the goal of consistency control.
S300, designing a decoupling operator of the optimal solution of the consistency control model.
After the consistency control model is established, designing a decoupling operator of an optimal solution of the consistency control model, and further designing a control protocol of the multi-agent system according to the decoupling operator of the optimal solution, so that the consistency control model can realize the optimal solution.
Specifically, the decoupling operator for designing the optimal solution of the consistency control model is as follows:
wherein, the lambda and the mu are Lagrange multipliers,as an auxiliary variable, the number of variables,for introducing parameters, alpha and beta are preset parameters, for anyi ∈ {1, …, n }, each having:z*satisfies the following conditions:and is
Specifically, in this embodiment, the design premise of the decoupling operator of the optimal solution includes:
1: for agent i e {1, …, n },is a quadratic continuous differentiable function and is strongly convex, i.e. there is a constant c > 0 such that there is a contribution to agent iWherein Without loss of generality, c > 1 can be assumed.
2: for all i e {1, …, n }, eachAndare (non-smooth) semi-continuous, tight, suitably convex functions at the bottom, and are approximatable.
3: the weighted topology G is connectionless.
4: in the inequality constraintAndare respectively-Liphoxitz in continuous with-Liphoxitz continuous, wherein for j ═ 1,2 there are
5: there is always at least one feasible solution to the consistency control model.
For two non-smooth functions carried in the consistency control modelAndand (3) designing a decoupling double-near-end operator by a near-end technology and a segmentation method, and designing a smooth control protocol for the next step. First, the definition and correlation properties of the near-end operator are introduced. For a next semicontinuous convex function f (x), whereWhich is atNear-end operator ofWhereinRepresents the sub-gradient of f (x). f (x) being convex meansMonotonous, i.e. for allAndare all provided withx=proxf[y]Is equivalent toThe specific form of the augmented Lagrange equation corresponding to the consistency control model is as follows:
K(x,λ,μ)=F(x)+λT[g1(x)+g2(x)]+μTLx+xTLx
under the above 5 premises, there is a feasible pointIs a feasible solution to the consistency control model if and only if there is oneFor the location state to which the respective agent states eventually converge consistently,andso that
g1(x*)+g2(x*)≤0n,(λ*)T[g1(x*)+g2(x*)]=0,Lx*=0nqWherein
Due to F1(x)+λTg2(x) The method is not possible to realize proximity, is inspired by a segmentation method, provides a decoupling operator based on a near-end technology, and needs to introduce a group of auxiliary variablesAnd parametersTo estimateSo that there is a set of z*Satisfy the requirement ofAnd isDesigning an optimal solution x of the consistency control problem (1) according to the properties of a near-end operator*The decoupled double near-end operator of (1) is:
Referring to fig. 1 again, after designing the decoupling operator of the optimal solution, the method for controlling consistency of a multi-agent system provided in this embodiment further includes the steps of:
s400, determining a control protocol of the multi-agent system according to the decoupling operator of the optimal solution, so that each agent in the multi-agent system controls the agent to move according to the control protocol.
Specifically, determining the control protocol of the multi-agent system according to the decoupling operator of the optimal solution is as follows:
wherein beta is more than 0 and less than 1/Lambdamax(L),0<γ<1-βΛmax(L), 0<α<mink=1,2,3[οk]C is a constant, t is a time variable, Λmax(L) is the maximum eigenvalue of L,
each agent in the multi-agent system controls its own motion according to the control protocol so that the state of the multi-agent system reaches the optimal solution, and as described above, the kinematic model of the agent in the multi-agent system is:wherein u isi(t) is the speed control quantity of agent i at time t,the position state of the agent i is a q-dimensional vector, so that the speed control quantity of the agent at different moments can be obtained according to the position state x (t), and the agent controls the speed according to the speed control quantity to realize the consistency control of the system.
According to the control protocol, for a first-order integrator model multi-agent system under the control of the control protocol, a system state x, a Lagrange multiplier lambda corresponding to inequality constraint and respective optimal point value x*And λ*There is the following relationship between:
let (x)*,z*,λ*,μ*) Is a balance point of the control protocol. Firstly, a Lyapunov alternative function V (x, z, lambda, mu) is given1(x,z,λ)+V2(x,μ)+V3(x, λ) wherein each term is represented by
For a multi-agent system containing n first-order integrator model agents, the following conclusions exist:
driven by the control protocol, x (t) converges over time andis an optimal solution of the consistency control model, i.e., the state x (t) of the multi-agent system will eventually agree on the location of the optimal solution of the consistency control model and satisfy all local non-smooth inequality constraints. That is, according to the control protocol, it is possible to achieve consistency control of the multi-agent system, to achieve minimization of the total cost function of the system, and to satisfy consistency constraints and non-smooth inequality constraints.
This conclusion is demonstrated below: due to the fact thatThe convexity of i e {1, …, n }, so that a conclusion can be drawnWhere G is a diagonal matrix whose diagonal elements areAndnote that 0 < beta < 1/Λmax(L) and xTLx≤Λmax(L)‖x‖2Thus, therefore, it is
From this, V is known2(x) Is more than or equal to 0. While noting λ*>0nThus, therefore, it is
And is
Wherein h is1=1-βΛmax(L)>0。
From the above, it is clear that V (x, z, λ, μ) is non-negative and radially unbounded, V (x, z, λ, μ) ≧ 0, and V (x, z, λ, μ) ≧ 0 and only if (x, z, λ, μ) ═ x*,z*,λ*,μ*)。
And is provided with
From this, the Lyapunov alternative function V (x, y, V) can be known1,v2) Trajectory of derivative satisfies
Thus, it can be seen that
Further, since V (x, z, λ, μ) is positive, radially unbounded, and has a lower bound, (x) is known*,z*,λ*,μ*) Is Lyapunov stable and bounded by trajectories (x (t), z (t), λ (t), μ (t)). According to the principle of the Lassel invariant set, and the control protocol balance point is the optimal solution of the optimization problem in the consistency control model, it can be known that x (t) converges over time andis an optimal solution of the consistency control model, i.e., the state x (t) of the multi-agent system will eventually agree on the location of the optimal solution of the consistency control model and satisfy all local non-smooth inequality constraints.
Due to a non-smooth cost function F in the consistency control model1(x) Representing a certain practical physical meaning, such as an indication function of a local area constraint that is satisfied when the location of an agent is required, while the above-mentioned control protocol fails to characterize the processing priorities of two non-smooth functions in the consistency control model, so that, in one possible implementation, a non-smooth cost function F is considered1(x) Is a high priority, i.e. a non-smooth cost function F1(x) Is/are as followsThe priority is higher than that of a non-smooth inequality constraint function, and a decoupling operator for designing the optimal solution of the consistency control model is as follows:
wherein, the lambda and the mu are Lagrange multipliers,as an auxiliary variable, the number of variables,for introducing parameters, alpha and beta are preset parameters, for anyi ∈ {1, …, n }, each having:z*satisfies the following conditions:and is
The corresponding control protocol is:
wherein beta is more than 0 and less than 1/Lambdamax(L),0<γ<1-βΛmax(L), 0<α<mink=1,2,3[οk]C is a constant, t is a time variable, Λmax(L) is the maximum eigenvalue of L,
likewise, the system state x (t) converges over time and, by means of the control protocol, takes into account the priority andis an optimal solution of the consistency control model, i.e., the state x (t) of the multi-agent system will eventually agree on the location of the optimal solution of the consistency control model and satisfy all local non-smooth inequality constraints. The certification process may be generalized simply from the foregoing conclusion certification and will not be discussed in detail herein.
The effectiveness of the multi-agent system consistency control method provided by this embodiment is verified through the corresponding simulation of the multi-agent system consistency control task with the non-smooth cost function and the non-smooth inequality constraint under the control protocol given in the multi-agent system consistency control method provided by this embodiment. Suppose the system consists of four agents modeled as first-order integrators, moving over a flat ground surface. The specific form of the multi-agent system consistency control model aiming at the simulation and having the non-smooth inequality constraint optimization problem is
minf(x)=f0(x)+f1(x)
=‖x-q‖2+Ω(x)
WhereinIn the unsmooth inequality constraintAndlocal cost function f of agent ii(xi) Consisting of the following functions:
whereinAndrespectively represent a smooth cost function and a local limit set xi∈ΩiIs used to indicate the function. Cost functionThe derivative of (a) of (b),andrespectively of the near-end operator
The Laplace matrix of the undirected information interaction topology G of the multi-agent system is
Initial position of agent is x1(0)=[-5m,4.3m]T,x2(0)=[5.6m,4.7m]T,x3(0)=[4.2m,-3.5m]TAnd x4(0)=[-4.7m,-5.2m]T. Each agent has an inequality constraint parameter A1=[0.3,0.9]T,A2=[1.1,0.7]T,A3=[0.2,0.4]T,A4=[1.1,0.5]T,b=[b1,b2,b3,b4]T=[1,6.5,6,5]T. Let the Lagrange multiplier λ, μ and the auxiliary variable z have zero initial values. Simulation step length of tp0.1s, 8000 steps and 319.1 s.
The track change of the multi-agent system along with the motion of time under the driving of the control protocol obtained by the multi-agent system consistency control method provided by the invention is shown in fig. 3. System stateAndthe track change of (2) is shown in fig. 4 and 5. Fig. 6 shows the trajectory variation of the global cost function f (x). FIG. 7 illustrates the non-smooth inequality constraints imposed on individual agentsThe trajectory of i ∈ {1,2,3,4} over time. As can be seen from fig. 3 to 7, the state of the multi-agent system eventually converges to the optimal point satisfying the non-smooth inequality constraint, and the agreement is reached. Lagrange multiplierλiThe trajectory of i e {1,2,3,4} over time is shown in FIGS. 8-10, while demonstrating the boundedness and convergence of the overall state of the multi-agent system.
In summary, the embodiment provides a method for controlling consistency of a multi-agent system, which constructs an interactive topology of the multi-agent system according to a communication relationship between agents in the multi-agent system, further constructs a consistency control model of the multi-agent system according to an interactive topology, sets constraints of the model including consistency condition constraints and non-smooth inequality constraints, designs an optimal decoupling terminal of the consistency control model, and determines a control protocol of the multi-agent system according to the optimal decoupling terminal, so that the multi-agent system can simultaneously satisfy the consistency condition constraints and the non-smooth inequality constraints.
It should be understood that, although the steps in the flowcharts shown in the figures of the present specification are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least a portion of the steps in the flowchart may include multiple sub-steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, and the order of performing the sub-steps or stages is not necessarily sequential, but may be performed alternately or alternately with other steps or at least a portion of the sub-steps or stages of other steps.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, databases, or other media used in embodiments provided herein may include non-volatile and/or volatile memory. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
Example two
Based on the above embodiments, the present invention further provides a terminal, as shown in fig. 11, where the terminal includes a processor 10 and a memory 20. Fig. 11 shows only some of the components of the terminal, but it is to be understood that not all of the shown components are required to be implemented, and that more or fewer components may be implemented instead.
The memory 20 may in some embodiments be an internal storage unit of the terminal, such as a hard disk or a memory of the terminal. The memory 20 may also be an external storage device of the terminal in other embodiments, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like provided on the terminal. Further, the memory 20 may also include both an internal storage unit and an external storage device of the terminal. The memory 20 is used for storing application software installed in the terminal and various data. The memory 20 may also be used to temporarily store data that has been output or is to be output. In one embodiment, the memory 20 stores a multi-agent system consistency control program 30, and the multi-agent system consistency control program 30 can be executed by the processor 10, so as to implement the multi-agent system consistency control method of the present application.
The processor 10 may be, in some embodiments, a Central Processing Unit (CPU), microprocessor or other chip for running program codes stored in the memory 20 or Processing data, such as executing the multi-agent system consistency control method.
EXAMPLE III
The present invention also provides a storage medium having one or more programs stored thereon that are executable by one or more processors to implement the steps of the multi-agent system consistency control method as described above.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (8)
1. A multi-agent system consistency control method, characterized in that the method comprises:
acquiring communication relations among all agents in a multi-agent system, and constructing an interactive topology of the multi-agent system according to the communication relations;
constructing a consistency control model of the multi-agent system according to the interaction topology, wherein constraints of the consistency control model comprise consistency condition constraints and non-smooth inequality constraints;
designing a decoupling operator of an optimal solution of the consistency control model;
determining a control protocol of the multi-agent system according to a decoupling operator of the optimal solution, so that each agent in the multi-agent system controls its own motion according to the control protocol;
the kinematic model of the agent in the multi-agent system is:wherein u isi(t) is the speed control quantity of agent i at time t,is the position state of agent i, which is a q-dimensional vector;
the building a consistency control model of the multi-agent system according to the interaction topology includes:
obtaining cost functions and non-smooth inequality constraint information of each agent in the multi-agent system, and constructing a consistency control model of the multi-agent system according to the interactive topology and the kinematics model as follows:
wherein the content of the first and second substances,n is the number of agents in the multi-agent system, fi(xi)=fi 0(xi)+fi 1(xi),fi 0、fi 1Two local cost functions, f, for agent ii 0Is a smooth convex function, fi 1Is a non-smooth convex function of the image,
and is0nqIs a zero vector of the nq-dimension,is a matrix LnAnd IqKronecker product of LnIs a Laplace matrix of the interaction topology, IqIs a q-dimensional unit matrix and is,is a non-smooth inequality constraint for agent i,is a non-smooth convex function that can be approximated,andare respectively-Liphoxitz in continuous with-Liphoz continuous.
2. The multi-agent system consistency control method as recited in claim 1, wherein said constructing an interaction topology of the multi-agent system according to the communication relationships comprises:
the laplacian matrix of the interaction topology is: l isnD-a, wherein the matrixAssociated with agent iAgent j and agent i via information interaction edge e as diagonal matrix of diagonal elementsijE belongs to E connection and information interaction edge EijE represents that the agent i and the agent j can exchange information in two directions, if the information exchange is carried out, EijE, then aij=aji> 0, otherwise aij=0,Is a weighted adjacency matrix.
3. The multi-agent system consistency control method of claim 1, wherein said designing a decoupling operator of an optimal solution for said consistency control model comprises:
the decoupling operator for designing the optimal solution of the consistency control model is as follows:
4. A multi-agent system consistency control method according to claim 3, characterized in that the control protocol of the multi-agent system is:
5. the multi-agent system consistency control method of claim 1, wherein said designing an optimal solution for the consistency control model comprises:
the decoupling operator for designing the optimal solution of the consistency control model is as follows:
6. The multi-agent system consistency control method according to claim 5, wherein the control protocol of the multi-agent system is:
7. a terminal, characterized in that the terminal comprises: a processor, a storage medium communicatively connected to the processor, the storage medium adapted to store a plurality of instructions, the processor adapted to invoke the instructions in the storage medium to perform the steps of implementing the multi-agent system consistency control method of any of the preceding claims 1-6.
8. A storage medium storing one or more programs, the one or more programs being executable by one or more processors to implement the steps of the multi-agent system consistency control method as claimed in any one of claims 1 to 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011277400.9A CN112637120B (en) | 2020-11-16 | 2020-11-16 | Multi-agent system consistency control method, terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011277400.9A CN112637120B (en) | 2020-11-16 | 2020-11-16 | Multi-agent system consistency control method, terminal and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112637120A CN112637120A (en) | 2021-04-09 |
CN112637120B true CN112637120B (en) | 2022-04-19 |
Family
ID=75303272
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011277400.9A Active CN112637120B (en) | 2020-11-16 | 2020-11-16 | Multi-agent system consistency control method, terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112637120B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117669271B (en) * | 2024-01-31 | 2024-04-26 | 北京理工大学 | Near-end operator-based non-smooth optimization method for multi-agent system and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104865960A (en) * | 2015-04-29 | 2015-08-26 | 山东师范大学 | Multi-intelligent-body formation control method based on plane |
CN110308659A (en) * | 2019-08-05 | 2019-10-08 | 沈阳航空航天大学 | Uncertain multi-agent system mixing with time delay and switching topology triggers consistent control method |
CN111259327A (en) * | 2020-01-15 | 2020-06-09 | 桂林电子科技大学 | Subgraph processing-based optimization method for consistency problem of multi-agent system |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6088689A (en) * | 1995-11-29 | 2000-07-11 | Hynomics Corporation | Multiple-agent hybrid control architecture for intelligent real-time control of distributed nonlinear processes |
US6027112A (en) * | 1998-03-02 | 2000-02-22 | Xerox Corporation | Adaptive multiagent control system for controlling object motion with smart matter |
CN109079780B (en) * | 2018-08-08 | 2020-11-10 | 北京理工大学 | Distributed mobile mechanical arm task layered optimization control method based on generalized coordinates |
CN110442022B (en) * | 2019-07-19 | 2022-10-11 | 中国工程物理研究院计算机应用研究所 | Grouping consistency control method with time-lag heterogeneous multi-agent system |
-
2020
- 2020-11-16 CN CN202011277400.9A patent/CN112637120B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104865960A (en) * | 2015-04-29 | 2015-08-26 | 山东师范大学 | Multi-intelligent-body formation control method based on plane |
CN110308659A (en) * | 2019-08-05 | 2019-10-08 | 沈阳航空航天大学 | Uncertain multi-agent system mixing with time delay and switching topology triggers consistent control method |
CN111259327A (en) * | 2020-01-15 | 2020-06-09 | 桂林电子科技大学 | Subgraph processing-based optimization method for consistency problem of multi-agent system |
Also Published As
Publication number | Publication date |
---|---|
CN112637120A (en) | 2021-04-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6811894B2 (en) | Neural network structure generation methods and devices, electronic devices, and storage media | |
US20130325774A1 (en) | Learning stochastic apparatus and methods | |
Liu et al. | Feature selection and feature learning for high-dimensional batch reinforcement learning: A survey | |
Zhao et al. | Distributed optimal coordination control for nonlinear multi-agent systems using event-triggered adaptive dynamic programming method | |
CN113778619B (en) | Multi-agent state control method, device and terminal for multi-cluster game | |
Dimitrov et al. | On the implementation of model predictive control for on-line walking pattern generation | |
Van Hoof et al. | Learning of non-parametric control policies with high-dimensional state features | |
CN109726676B (en) | Planning method for automatic driving system | |
US11281232B2 (en) | Systems and methods for multi-agent system control using consensus and saturation constraints | |
Wang et al. | Model-free event-triggered optimal consensus control of multiple Euler-Lagrange systems via reinforcement learning | |
Xia et al. | Optimal synchronization control of heterogeneous asymmetric input-constrained unknown nonlinear MASs via reinforcement learning | |
CN112637120B (en) | Multi-agent system consistency control method, terminal and storage medium | |
CN111931901A (en) | Neural network construction method and device | |
CN112734033A (en) | Model training method, device, equipment and storage medium | |
Fu et al. | A cross‐coupling control approach for coordinated formation of surface vessels with uncertain disturbances | |
CN113419424B (en) | Modeling reinforcement learning robot control method and system for reducing overestimation | |
CN111046955B (en) | Multi-agent confrontation strategy intelligent prediction method and device based on graph network | |
US20230342626A1 (en) | Model processing method and related apparatus | |
CN112818986A (en) | Text detection method, device, system and medium based on depth relation reasoning | |
Jang et al. | AVAST: Attentive variational state tracker in a reinforced navigator | |
CN114779792B (en) | Medical robot autonomous obstacle avoidance method and system based on simulation and reinforcement learning | |
CN115903901A (en) | Output synchronization optimization control method for unmanned cluster system with unknown internal state | |
CN115903842A (en) | Multi-agent formation control method and system and electronic equipment | |
US20140006321A1 (en) | Method for improving an autocorrector using auto-differentiation | |
CN111880571A (en) | Rigid formation switching method and device for unmanned aerial vehicle |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |