CN116699989A - Non-linear multi-agent gradient-free optimization method based on edge event triggering - Google Patents
Non-linear multi-agent gradient-free optimization method based on edge event triggering Download PDFInfo
- Publication number
- CN116699989A CN116699989A CN202310647617.1A CN202310647617A CN116699989A CN 116699989 A CN116699989 A CN 116699989A CN 202310647617 A CN202310647617 A CN 202310647617A CN 116699989 A CN116699989 A CN 116699989A
- Authority
- CN
- China
- Prior art keywords
- agent
- gradient
- edge
- event triggering
- edge event
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000005457 optimization Methods 0.000 title claims abstract description 28
- 238000000034 method Methods 0.000 title claims abstract description 24
- 238000004891 communication Methods 0.000 claims abstract description 27
- 238000013528 artificial neural network Methods 0.000 claims abstract description 11
- 230000001960 triggered effect Effects 0.000 claims description 7
- 230000003044 adaptive effect Effects 0.000 claims description 5
- 230000007246 mechanism Effects 0.000 claims description 3
- 230000000694 effects Effects 0.000 abstract description 2
- 239000011159 matrix material Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000005265 energy consumption Methods 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 238000004451 qualitative analysis Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Automation & Control Theory (AREA)
- Multi Processors (AREA)
Abstract
The invention discloses a non-linear multi-agent gradient-free optimal control method based on edge event triggering, which comprises the following steps: the method comprises the steps of carrying out online identification on unknown gradient items in a system by adopting a radial basis function neural network, introducing an edge event triggering communication strategy on the basis of the online identification to reduce communication resource loss, and then combining information of an intelligent agent node, reference model information and information of neighbor intelligent agent nodes thereof to design a distributed controller based on the neural network so as to enable multiple intelligent agents to achieve the optimal effect under the objective function and constraint. The invention realizes the online identification and constraint target optimization control of the nonlinear multi-agent optimization problem with unknown gradient, thereby solving the problem of multi-agent gradient-free optimization.
Description
Technical Field
The invention relates to the technical field of multi-agent control, in particular to a non-gradient optimization method for a nonlinear multi-agent based on edge event triggering.
Background
In recent years, the multi-agent system has wide application potential in the fields of micro-grids, unmanned aerial vehicles, sensors and the like, and the coordination control problem of the multi-agent system is widely studied. The problem of optimizing control of multiple intelligent agents is also receiving more and more attention, and the method is widely applied to the control fields of formation control, cluster control, gathered beehives and the like.
First, a definition of an agent is given, which is a device or machine that is autonomous, reactive, proactive, social, and evolutionary. The multi-agent system is a network composed of a plurality of agents, and the agents communicate through the topological rule of the network, so that the complex tasks are solved by cooperation.
With the deep research, the multi-agent system is also applied to more practical engineering, the problem to be solved is also more complex, and different from the simple consistency problem, the energy-saving factor is considered in practical application, and the multi-agent is often required to achieve a specific target, namely, multi-agent optimization. Aiming at the multi-agent optimization problem, not only the cooperation among the multi-agents but also whether the target requirement reaches the optimal or not are needed to be considered, so that the multi-agent optimization problem is more complex than the consistency problem, and the problem is more and more needed to be solved.
Multi-agent distributed optimization has been receiving much attention from communities, unlike centralized optimization, which solves some engineering problems in a distributed manner, i.e., each agent can only access local information about itself and neighbors, but not global information. Thus, in multi-agent systems, the goal of distributed optimization is typically to minimize the global goal that can only be achieved through cooperation between a single agent and its neighboring agents.
In a long period of time, the cost function of the multi-agent distributed optimization problem is obtained by assuming that the function gradient is available, but in the actual engineering background, there are many situations that the cost function gradient of the optimization target cannot be obtained normally or the gradient calculation is expensive. Therefore, the multi-agent distributed optimization problem with unknown research gradient has practical significance.
In general, communications between multiple agents are divided into continuous communications and discrete communications, where both communications may exist in which the states of the agents are not changed much over a period of time, and communications between agents are still implemented, resulting in wasteful use of communications resources, so that many students introduce event-triggered control, i.e., determining whether or not an agent communicates with all connected neighbors through events, where general event-triggered control determines whether or not an agent communicates with all connected neighbors through events related to the states of the agents, and edge event-triggered control determines whether or not two agents corresponding to an edge communicate by determining events related to the states of the edge, thereby using fewer communications resources.
Disclosure of Invention
The invention aims to provide a non-linear multi-agent gradient-free optimization method based on edge event triggering, so as to solve the problems in the background technology.
In order to achieve the above purpose, the present invention adopts the following technical scheme:
a non-linear multi-agent gradient-free optimization method based on edge event triggering comprises the following steps:
s1: the problem dynamics model and the optimization target are clearly researched;
s2: aiming at the gradient unknown item in the optimization target, an adaptive algorithm based on an RBF artificial neural network is adopted to identify the gradient unknown item on line so as to meet the requirements of system instantaneity and rapidity;
s3: introducing an edge event triggering control communication strategy, and determining whether two intelligent agents corresponding to an edge communicate or not by judging an event related to the edge state, so that fewer communication resources are used;
s4: combining S2 and S3, utilizing the information of the intelligent agent node and the information of the neighbor intelligent agent nodes, and designing a distributed controller under the Lyapunov stability theorem to ensure that non-linear multi-intelligent agent gradient-free optimization triggered based on side events is realized;
s5: and filling the unknown gradient identification strategy, the control algorithm and the edge event triggering communication mechanism into each multi-agent through programming.
Compared with the prior art, the technical bright point of the invention is as follows: the self-adaptive algorithm based on the RBF artificial neural network can identify the unknown gradient of the system on line, so that the real-time performance of the system is met; meanwhile, the communication strategy of side event trigger control is considered, so that the communication resources among multiple agents in an actual system can be reduced better. In conclusion, the problem of gradient-free optimization of nonlinear multi-agent can be solved.
Drawings
Fig. 1 is a multi-agent communication topology diagram in the present embodiment.
Fig. 2 is a state response diagram of the multi-agent according to the method proposed herein in this embodiment.
Fig. 3 is an event-triggered timing diagram of the multi-agent in the side-event-triggered communication manner in this embodiment.
Detailed Description
The present invention will be described in further detail with reference to the drawings and embodiments, so that the objects, technical solutions and advantages of the present invention can be more clearly understood.
A non-linear multi-agent gradient-free optimization method based on edge event triggering comprises the following steps:
s1: a problem dynamics model is clearly researched;
the multi-agent dynamics model of the model comprises N agents, and the ith agent dynamics model is as follows:
wherein ,xi (t)∈R n ,u i (t)∈R n ,f i (x i ,t)∈R n Representing the position state, control inputs and nonlinear functions of the agent, respectively.
The optimization problem is considered to satisfy the following form:
in S1, the communication network topology of the agent is shown in fig. 1.
Assuming that the undirected connected graph G has m edges, defining a certain edge e j Connecting agent x i And x l ,Is the correlation matrix of graph G. Definition:
then there is l=dwd T 。Is an edge weight vector.
S2: optimizing item g for (1) i (x i (t)) and adopting an adaptive algorithm based on RBF artificial neural network to identify the gradient unknown condition on line so as toThe requirements of system instantaneity and rapidity are met;
order theIdeal case-> Ideal weight matrix for representing optimal RBF artificial neural network, G i (. Cndot.) represents RBF artificial neural network activation function, ε i And (t) represents an approximation error of the RBF artificial neural network.
The optimal weight matrix can be obtained according to the following formula:
wherein ,Gi (. Cndot.) is a Gaussian function.
Formula (3) is generally used for qualitative analysis, and is practically used
Approximation of an unknown function.
The update rate of the weight matrix is further given by combining an adaptive algorithm:
wherein ,ei (t)=x i (t)-x * ,x * Representing a global optimum state, delta i > 0 is the designed control parameter Γ i =Γ i T ∈R ρ×ρ 。
S3: the communication strategy among the intelligent agents is designed based on the side event triggering mechanism, so that the purposes of saving energy consumption and improving operation time are achieved;
first defining the communication state of the first edge:
e l (t)=x i (t)-x j (t) (6)
the edge event triggering conditions of the intelligent agent are designed as follows:
wherein ,at t k Agent x before the moment i And x j The latest edge event trigger time of the corresponding edge is 0 < { lambda ] p }<1,{μ p P=1, 2, } > m is a constant sequence satisfying the condition. When the edge state violates the trigger condition (7), agent x i And x j Control is performed.
S4: in combination with the above method, the controller of agent i is designed to:
weight updatingAs shown in the above formula (5), and each control parameter is designed to satisfy the following formula:
η>0,b>0
s5: and filling the unknown model identification strategy and the control algorithm into each multi-agent through programming.
In the present embodiment, preferably, x i (t)∈R,u i (t)∈R,f i (x i (t))∈R,g i (x i (t))∈R。
The optimal problem is considered as follows:
it is easy to know that the globally optimal solution is x * =30。
The laplace corresponding to the multi-agent communication topology in this embodiment is as follows:
preferably, the control parameters of the system are designed as follows:
η=2.4,b=0.35,λ i =0.85,μ i =1.2, i=1, …, m, the remaining corresponding parameters can be calculated from the known information. The initial value of each agent is set as x= [ -2,4,12, -8,8]。
According to fig. 2, it can be seen that, within the allowable error, the state of each agent can reach the optimal state, that is, the algorithm can realize the optimal control of the multi-intelligent system with unknown gradient, and according to fig. 3, the respective event trigger time and times of each side can be seen, that is, the distributed event trigger control is realized, the energy consumption of the system is reduced, and the running duration of the system is prolonged.
The above examples are only preferred embodiments of the present invention, and it should be noted that it will be apparent to those skilled in the art that several variations and modifications can be made without departing from the spirit of the invention, and these should also be regarded as the scope of the invention, which does not affect the effect of the implementation of the invention and the practical applicability of the patent.
Claims (7)
1. The non-linear multi-agent gradient-free optimization method based on edge event triggering has the following characteristics:
s1: the problem dynamics model and the optimization target are clearly researched;
s2: aiming at the gradient unknown item in the optimization target, an adaptive algorithm based on an RBF artificial neural network is adopted to identify the gradient unknown item on line so as to meet the requirements of system instantaneity and rapidity;
s3: introducing an edge event triggering control communication strategy, and determining whether two intelligent agents corresponding to an edge communicate or not by judging an event related to the edge state, so that fewer communication resources are used;
s4: combining S2 and S3, utilizing the information of the intelligent agent node and the information of the neighbor intelligent agent nodes, and designing a distributed controller under the Lyapunov stability theorem to ensure that non-linear multi-intelligent agent gradient-free optimization triggered based on side events is realized;
s5: and filling the unknown gradient identification strategy, the control algorithm and the edge event triggering communication mechanism into each multi-agent through programming.
2. The non-linear multi-agent gradient-free optimal control method based on edge event triggering as claimed in claim 1, wherein the method is characterized by comprising the following steps: in S1:
the multi-agent dynamics model for defining a model comprises N agents, and the ith agent dynamics model is as follows:
wherein ,xi (t)∈R n ,u i (t)∈R n ,f i (x i ,t)∈R n Representing the position state, control inputs and nonlinear functions of the agent, respectively.
The optimization problem is considered to satisfy the following form:
s.t.x i =x j
3. the non-linear multi-agent gradient-free optimal control method based on edge event triggering as claimed in claim 1, wherein the method is characterized by comprising the following steps: the communication network of the intelligent agent is in an undirected communication topology, and a feasible input for realizing the control target exists in the system.
4. The non-linear multi-agent gradient-free optimal control method based on edge event triggering as claimed in claim 1, wherein the method is characterized by comprising the following steps: in S2: for unknown itemsAn RBF artificial neural network-based adaptive algorithm is adopted to identify the RBF artificial neural network on line, which comprises the following steps:
defining a tracking error:
e i (t)=x i (t)-x ★ ,i=1,…,N
approximation of unknown functions:
wherein ,δi > 0 is the control parameter of the design.
5. The non-linear multi-agent gradient-free optimal control method based on edge event triggering as claimed in claim 1, wherein the method is characterized by comprising the following steps: in S3: introducing an edge event trigger control communication policy comprising:
first defining the communication state of the first edge:
e l (t)=x i (t)-x j (t)
the edge event triggering conditions of the intelligent agent are designed as follows:
wherein ,at t k Agent x before the moment i And x j The latest edge event trigger time of the corresponding edge is 0 < { lambda ] p }<1,{μ p P=1, 2, …, m is a constant sequence satisfying the condition. When the edge state violates the trigger condition, agent x i And x j Control is performed.
6. The non-linear multi-agent gradient-free optimal control method based on edge event triggering as claimed in claim 1, wherein the method is characterized by comprising the following steps: in S4:
the controller of the design agent i is designed as follows:
weight updatingAs shown in claim 4, and the control parameters are designed to satisfy the following equation:
η>0,b>0
7. the non-linear multi-agent gradient-free optimal control method based on edge event triggering as claimed in claim 1, wherein the method is characterized by comprising the following steps: in S5:
and filling the unknown model identification strategy and the control algorithm into each multi-agent through programming.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310647617.1A CN116699989A (en) | 2023-06-02 | 2023-06-02 | Non-linear multi-agent gradient-free optimization method based on edge event triggering |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310647617.1A CN116699989A (en) | 2023-06-02 | 2023-06-02 | Non-linear multi-agent gradient-free optimization method based on edge event triggering |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116699989A true CN116699989A (en) | 2023-09-05 |
Family
ID=87840416
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310647617.1A Pending CN116699989A (en) | 2023-06-02 | 2023-06-02 | Non-linear multi-agent gradient-free optimization method based on edge event triggering |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116699989A (en) |
-
2023
- 2023-06-02 CN CN202310647617.1A patent/CN116699989A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110597109B (en) | Multi-agent consistency control method based on event triggering | |
CN112558470A (en) | Optimal consistency control method and device for actuator saturated multi-agent system | |
Xu et al. | Online event-triggered optimal control for multi-agent systems using simplified ADP and experience replay technique | |
Yan et al. | Fuzzy finite-time consensus control for uncertain nonlinear multi-agent systems with input delay | |
Yousefi et al. | A hybrid neuro-fuzzy approach for greenhouse climate modeling | |
CN112363392B (en) | Multi-agent grouping consistency control method with unknown first-class model | |
CN116699989A (en) | Non-linear multi-agent gradient-free optimization method based on edge event triggering | |
van der Sar et al. | Multi-Agent Reinforcement Learning for Power Grid Topology Optimization | |
CN116794987A (en) | Heterogeneous multi-agent system consistency control method based on event trigger control | |
CN115469548A (en) | Unknown nonlinear multi-agent finite time clustering consistency control method with input saturation | |
CN116088317A (en) | Multi-agent consistency control method based on dynamic event triggering | |
CN116974185A (en) | Multi-agent binary consistency control method, device, equipment and storage medium | |
Zhang et al. | Output-constrained fixed-time coordinated control for multi-agent systems with event-triggered and delayed communication | |
CN114861435B (en) | Multi-agent system consistency sliding mode control algorithm under finite time observer | |
CN109993271A (en) | Grey neural network forecasting based on theory of games | |
Masoumzadeh et al. | Deep blue: A fuzzy q-learning enhanced active queue management scheme | |
CN113341729A (en) | Fixed time consistency control method for multi-agent system | |
Namerikawa et al. | Event-triggered and self-triggered formation control of a multi-agent system | |
Fravolini et al. | Comparison of different growing radial basis functions algorithms for control systems applications | |
Zhang et al. | UAV Swarm Task Assignment Method Based on Artificial Gorilla Troops Optimizer | |
CN114114904B (en) | Finite time and fixed time distributed event triggering consistency method | |
Qiming et al. | Residual Sarsa algorithm with function approximation | |
CN115857539A (en) | Event-driven strategy-based distributed formation control method for unmanned aerial vehicles | |
Cao et al. | Towards energy-efficient communication management in the distributed control of networked cyber-physical systems | |
Niu et al. | Robust Consensus Tracking Strategy of Heterogeneous Nonlinear Multi-Agent Systems With Time-Varying Input Delays |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |