CN114488802A - Nash equilibrium designated time searching method for multi-group game with consistent decision in group - Google Patents
Nash equilibrium designated time searching method for multi-group game with consistent decision in group Download PDFInfo
- Publication number
- CN114488802A CN114488802A CN202210056868.8A CN202210056868A CN114488802A CN 114488802 A CN114488802 A CN 114488802A CN 202210056868 A CN202210056868 A CN 202210056868A CN 114488802 A CN114488802 A CN 114488802A
- Authority
- CN
- China
- Prior art keywords
- cluster
- agent
- group
- matrix
- game
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 50
- 238000004891 communication Methods 0.000 claims abstract description 34
- 239000011159 matrix material Substances 0.000 claims description 44
- 238000005070 sampling Methods 0.000 claims description 7
- 238000010586 diagram Methods 0.000 claims description 5
- 238000010187 selection method Methods 0.000 claims description 3
- 102000002274 Matrix Metalloproteinases Human genes 0.000 claims 1
- 108010000684 Matrix Metalloproteinases Proteins 0.000 claims 1
- ONUFESLQCSAYKA-UHFFFAOYSA-N iprodione Chemical compound O=C1N(C(=O)NC(C)C)CC(=O)N1C1=CC(Cl)=CC(Cl)=C1 ONUFESLQCSAYKA-UHFFFAOYSA-N 0.000 claims 1
- 238000013461 design Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 4
- 238000010845 search algorithm Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
Abstract
The invention provides a method for searching the convergence Nash equilibrium of appointed time of a multi-group game under the constraint of decision consistency in a group, which comprises the following steps: a multi-cluster game problem model is built according to a multi-agent system, a communication topological structure meeting conditions is built, a time planning method is introduced, a continuous time distributed Nash equilibrium searching method is designed aiming at each agent, and relevant parameters enabling the Nash equilibrium searching method to achieve specified time convergence are designed. The method can realize the convergence Nash equilibrium search in the appointed time aiming at the multi-group game problem under the constraint of the decision consistency in the group, and provides a basis for the decision of the situation that the multi-unmanned cluster system collaborates in the cluster in a cooperative way and competes for the game among the clusters.
Description
Technical Field
The invention belongs to the communication technology, relates to a multi-agent game decision technology, and particularly relates to a designated time convergence Nash equilibrium search method for multi-group games under the constraint of intra-group decision consistency.
Background
With the development of artificial intelligence, the unmanned and intelligent technologies become powerful power for promoting a new round of military change after mechanization and informatization, and impact and even subversive influence are generated on the fighting forms. In future military operations, a large-scale unmanned cluster confronts the unmanned cluster in a scene. However, limited by communication latency, the communication topology hierarchy of the unmanned cluster system cannot be extended indefinitely, thereby limiting the cluster size. The solution available is to study multiple unmanned cluster systems, where each small-scale unmanned cluster system is treated as a cluster, and a plurality of such cluster systems are jointly deployed for combat. Due to the complexity of the whole large system dynamics caused by the diversity of tasks and the cluster number of a plurality of unmanned cluster systems, the task conflict often exists in practice. At this time, since there is no global unified director, there is a cooperation and competition relationship at a task level between each unmanned cluster, and there is also a cooperation and competition relationship between the individuals within the cluster. Therefore, modeling of the multi-task decision problem of the complex multi-cluster system is urgently needed, and through a dynamics analysis evolution mechanism, on one hand, theoretical explanation is provided for a multi-task decision result of the actual multi-unmanned cluster system, and on the other hand, the modeling is used for guiding the design and optimization of a cluster framework in the multi-unmanned cluster system. In the existing multi-cluster game continuous time nash equilibrium search algorithm, for a consistency-constrained multi-cluster game problem, in a literature (x.zeng, s.liang, and y.hong.distributed variable equilibrium search of multi-correlation gain a variable equilibrium. IFAC-paperon line, 20th IFAC World consistency, 50(1): 940-. The limitation of this scheme is that the designed balanced search algorithm requires clusters with the same number of agents and the same topology. On the basis, the document (X.Zeng, J.Chen, S.Liang, and Y.Hong. generalized Nash equilibrium for distributed non-smooth multi-cluster machine. Automatica,2019,103: 20-26.) further generalizes the method to the situation of different topological structure cluster maps, proposes a distributed non-smooth algorithm using projection differential inclusion, and analyzes the convergence of the algorithm. However, the implementation of this scheme relies on an undirected topology and the convergence rate of the algorithm is not specifically analyzed in the literature. In the literature (x.nian, f.niu and z.yang.distributed away computing games Switching Communication on Systems, Man, and Cybernetics: Systems, doi: 10.1109/tsmc.2021.30515.) Under a joint strongly connected directed Switching Communication topology, a new Nash Equilibrium search algorithm based on a coherence protocol and a gradient Game rule is proposed, and all the actions of the agent in the cluster are estimated using a leader-follower coherence protocol, so that a more general multiple cluster Game Nash Equilibrium search algorithm suitable for the agent to know only part of the decision information is designed, and results of local convergence and non-local convergence are given for the two algorithms respectively. However, this approach does not take into account decision consistency constraints inside the cluster.
Disclosure of Invention
In order to solve the problems, the invention provides a continuous time Nash equilibrium searching method aiming at a multi-group game under the constraint of intra-group decision consistency, and the convergence of the designated time is realized by utilizing a time planning method. The method considers the complex scene of multi-task combined operation of a plurality of unmanned cluster systems, comprehensively considers cooperative tasks in each unmanned cluster system and operation tasks of each unmanned cluster system, provides a rapid solution method of appointed time for researching balance of multi-task cluster game by establishing cluster game models among the unmanned cluster systems and in the unmanned cluster systems, and provides a solution thought for multi-task decision and control problems of the unmanned cluster systems from the perspective of cluster game.
In order to achieve the purpose, the invention provides the following technical scheme:
a method for carrying out convergence Nash equilibrium search at specified time aiming at a multi-group game under the constraint of decision consistency in a group comprises the following steps:
step 1: aiming at the situations that a multi-unmanned cluster system collaborates in a cluster and competes for games among the clusters, a multi-cluster game model which obeys a consistency constraint set is constructed for a multi-agent system;
step 2: constructing a communication topology structure aiming at the multi-agent system;
and step 3: based on a time planning method, a balanced fast and accurate searching method of a multi-task cluster game with specified time convergence is designed for each agent.
And 4, step 4: designing a Nash equilibrium search method to realize the parameter condition of appointed time convergence.
Further, the step 1 specifically includes the following sub-steps:
step 1-1: aiming at the situation that the conflict exists between the internal cooperative task and each unmanned cluster system task in the multi-unmanned cluster system, a multi-cluster game model which obeys a consistency constraint set is constructed as follows:
wherein N is the number of clusters participating in the game, and the cluster i comprises NiThe number of the individual agents is set to be,for the state of cluster i, the number ij indicates the jth agent in cluster i,in order for the state of the agent ij, representing all clusters togetherThe State of the Complex State, Cluster, is obeyed with a set of consistency constraints ofQuadratic continuous slightly convex function fij(x) Representing a cost function, function f, of agent j in cluster iij(x) With a Lipschitz continuous gradient: i.e. for anySatisfy the requirement of wherein lijAnd 0 is Lipschitz constant. Function fi(x) Cost function for cluster i:
further, the step 2 specifically includes the following sub-steps:
step 2-1: the communication topology of a multi-agent system is described as follows:
modeling communication topology among all agents as directed graphNode set isThe edges are collected asAnd N is the number of the clusters participating in the game. The inside of the cluster and different clusters can carry out directed communication on topological connection edges. In particular, cluster i contains niIndividual agents, agent sets represented asInduced subgraph for communication topology within cluster iIt is shown that,the number ij indicates the jth agent in the cluster i for which agentDefine its set of in-neighbors in the network asDefine the set of in-neighbors in its cluster asOut-neighbor set within its clusterDefinition mapIs a contiguous matrix of wherein Is the first of matrix ALine ofColumn element, if (pq, ij) ∈ ε, pq ≠ ijOtherwiseDefinition mapIs a contiguous matrix of wherein Is a matrix AiIf (il, ij) e epsiloniJ is not equal to l, thenOtherwiseObviously, A1,...,ANDiagonal blocks of the matrix a.Definition ofIs shown as a drawingOf laplacian matrix of, whereinIs the second of the matrix LLine ofColumn element, if ij ═ pq, thenOtherwise
Step 2-2: the communication topology requirements of a multi-agent system are as follows:
Further, the step 3 specifically includes the following sub-steps:
step 3-1: and (3) estimating global state information for the intelligent agent based on a leader-following consistency idea by combining a time planning method:
wherein ,representing the estimate, normal, of the global state x by agent ijSatisfy the requirement ofdijRepresents the in degree of agent ij:Tk=tk+1-tkis a sampling interval, a time sequence of sampling intervalsIs designed as
Step 3-2: the state iteration law of the intelligent agent and the auxiliary variable updating law for gradient information estimation are designed into
wherein ,xij(t) represents the state of agent ij at time t,for estimating items of gradient information, initialisingIs composed ofα is a positive constant to be designed. Matrix arrayIn the case of a random row, the data is transmitted,for elements in the jth row and m columns, letMatrix arrayIn the case of a column that is random,for the elements in row j and column m,each agent ij selects two sets of positive parameter setsAndthe following conditions are satisfied:
these two sets of parameters serve as weights for information received from inner neighbors within the cluster and information sent to outer neighbors within the cluster, respectively.Is defined as a matrix RiLeft eigenvector corresponding to eigenvalue 1, i.e. satisfyviIs defined as a matrix CiThe right eigenvector corresponding to eigenvalue 1, i.e. satisfiedA simpler selection method is:
Further, the step 4 specifically includes the following sub-steps:
step 4-1: requiring a pseudo-gradientIs strongly monotonic, i.e. there is a constant l > 0 such that
wherein Can be regarded as an objective function of the cluster i, y ═ y1,y2,...yN]TThe state of N virtual participants.
Step 4-2: the step length parameter requirements for realizing the specified time convergence by the Nash equilibrium search method of the multi-group game under the constraint of decision consistency in a design group are as follows:
wherein ,
σ=maxi{b1i}+γ2maxi{b2i}+γ3maxi{b3i},
is a symmetric positive definite matrix and satisfies Is a symmetric positive definite matrix and satisfies Is a symmetric positive definite matrix and satisfies Is shown as a drawingThe laplacian matrix of.
Compared with the prior art, the invention has the following advantages and beneficial effects:
1. aiming at the situation that competition relations of task levels exist among all unmanned clusters and cooperation relations exist among individuals in the clusters, the invention considers the Nash equilibrium search method of multi-cluster game under the constraint of intra-cluster decision consistency, and provides a solution for multi-task decision and control problems of a multi-unmanned cluster system.
2. In the constructed rapid solution method of Nash equilibrium, a design method based on time planning introduces an infinite series design sampling interval of convergence, which plays an important role in realizing the Nash equilibrium solution in specified time and greatly reduces the communication cost.
3. Compared with the correlation result of finite time and fixed time, the proposed Nash equilibrium fast solving method can achieve the convergence of the designated time, and the convergence time of the method does not depend on the initial action and the method parameters, thereby being convenient to determine the convergence time in advance according to the actual requirement.
Drawings
FIG. 1 is a schematic diagram of the steps of the time-specific convergent Nash equilibrium search method for multi-group games under intra-group decision consistency constraint of the present invention;
FIG. 2 is a specific flowchart of the time-specific convergence Nash equilibrium search method for multi-group game under the constraint of intra-group decision consistency according to the present invention;
FIG. 3 is a diagram of a communication topology of a multi-agent system provided by an example of the present invention;
fig. 4 is a graph showing the evolution of the state convergence of agents to equilibrium at a given time of 1 second, according to an embodiment of the present invention.
Detailed Description
The technical solutions provided by the present invention will be described in detail below with reference to specific examples, and it should be understood that the following specific embodiments are only illustrative of the present invention and are not intended to limit the scope of the present invention.
The invention provides a specified time convergence Nash equilibrium search method for a multi-group game under the constraint of decision consistency in a group, which comprises the following specific steps and flows as shown in figures 1 and 2:
step 1: aiming at the situations that a multi-unmanned cluster system collaborates in a cluster and competes for games among the clusters, a multi-cluster game model which obeys a consistency constraint set is constructed for a multi-agent system, and the method specifically comprises the following sub-steps:
step 1-1: aiming at the situations that a multi-unmanned cluster system collaborates in a cluster and competes for games among clusters, the following multi-cluster game model obeying a consistency constraint set is constructed:
wherein N is the number of clusters participating in the game, and the cluster i comprises NiThe number of the individual agents is set to be,for the state of cluster i, the number ij indicates the jth agent in cluster i,in order for the state of the agent ij, representing the joint state of all clusters, the state of the clusters obeying a set of consistency constraints ofQuadratic continuous slightly convex function fij(x) Representing a cost function, function f, of agent j in cluster iij(x) With a Lipschitz continuous gradient: i.e. for anySatisfy the requirements of wherein lijAnd 0 is Lipschitz constant. Function fi(x) Cost function for cluster i:
step 2: on the basis of the step 1, the communication topology of the multi-agent system can realize nash equilibrium search only if corresponding conditions are met. The method for constructing the communication topology structure of the multi-agent system meeting the conditions specifically comprises the following substeps:
step 2-1: the communication topology of a multi-agent system is described as follows:
modeling communication topology among all agents as directed graphNode set isThe edges are collected asAnd N is the number of the clusters participating in the game. The inside of the cluster and different clusters can carry out directed communication on topological connection edges. In particular, cluster i contains niIndividual agents, agent sets represented asInduced subgraph for communication topology within cluster iIt is shown that,(i ═ 1, 2.., N). The number ij indicates the jth agent in the cluster i for which agentDefine its set of in-neighbors in the network asDefine the set of in-neighbors in its cluster asOut-neighbor set within its clusterDefinition mapIs a contiguous matrix of wherein Is the first of matrix ALine ofColumn element, if (pq, ij) ∈ ε, pq ≠ ijOtherwiseDefinition mapIs a contiguous matrix of wherein Is a matrix AiIf (il, ij) e epsiloniJ is not equal to l, thenOtherwiseObviously, A1,...,ANDiagonal blocks of the matrix a.Definition ofIs shown as a drawingOf a laplacian matrix of, whereinIs the second of the matrix LLine ofColumn element, if ij ═ pq, thenOtherwise
Step 2-2: the communication topology requirements for a multi-agent system are as follows:
And step 3: and designing a balanced rapid and accurate searching method of the multi-task cluster game with specified time convergence for each agent by combining a time planning method.
Step 3-1: a time planning method is introduced, global state information is estimated for an agent based on a leader-follower consistency idea:
wherein ,representing the estimate, normal, of the global state x by agent ijSatisfy the requirement ofdijRepresents the in degree of agent ij:Tk=tk+1-tkis a sampling interval, a time sequence of sampling intervalsIs designed as
Step 3-2: the state iteration law of the intelligent agent and the auxiliary variable updating law for gradient information estimation are designed into
wherein ,xij(t) represents the state of agent ij at time t,for estimating items of gradient information, initialisingIs composed ofα is a positive constant to be designed. Matrix arrayIn the case of a random row, the data is,for elements in j row and m column, letMatrix arrayIn the case of a column that is random,for the elements in row j and column m,each agent ij selects two sets of positive parameter setsAndthe following conditions are satisfied:
wherein These two sets of parameters serve as weights for information received by ij from the inner neighbors within the cluster and for information sent to the outer neighbors within the cluster, respectively.Is defined as a matrix RiLeft eigenvector corresponding to eigenvalue 1, i.e. satisfyviIs defined as a matrix CiThe right eigenvector corresponding to eigenvalue 1, i.e. satisfyA simpler selection method is:
And 4, step 4: on the basis of step 3, the designed Nash equilibrium search method can specify time convergence when the design parameters meet certain conditions. The parameter condition for realizing the specified time convergence by the Nash equilibrium solving method is provided, and the method specifically comprises the following substeps:
step 4-1: requiring a pseudo-gradientIs strongly monotonic, i.e. there is a constant l > 0 such that
wherein Can be regarded as an objective function of the cluster i, y ═ y1,y2,…yN]TThe state of N virtual participants.
Step 4-2: the step length parameter requirements for realizing the specified time convergence by the Nash equilibrium search method of the multi-group game under the constraint of decision consistency in a design group are as follows:
wherein ,
σ=maxi{b1i}+γ2maxi{b2i}+γ3maxi{b3i},
is a symmetric positive definite matrix and satisfies Is a symmetric positive definite matrix and satisfies Is a symmetric positive definite matrix and satisfies Is shown as a drawingThe laplacian matrix of.
Example 1
Step 1: consider the game problem of 3 clusters, each containing N1=3,n2=4,n 33 agents. The cost function of agent ij isThe correlation coefficient is set as follows: m is11=3,m12=11,m13=22,m21=m22=2,m23=64,m24=8,m31=60,m32=m33=4,s11=s12=s13=10,s21=s22=s23=50,s31=s32=s33=20,h11=0.35,h12=0.25,h13=0.15,h21=0.2,h22=0.1,h23=0.05,h24=0.25,h31=0.02,h32=0.08,h33=0.2.
Step 2: the communication topology of the multi-agent system is shown in fig. 3.
And step 3: aiming at the designed multi-group designated time convergence Nash equilibrium search method obeying the consistency constraint, the state of an initialization node is selected as x (t)0)=[0,10,20,0,10,20,30,0,10,20]。
And 4, step 4: the parameter of the design method is alpha is 0.02. The state of each agent is set to converge to equilibrium at a specified time of 1 second, with the state trace shown in fig. 4. The simulation results show that the state of each agent converges to Nash equilibrium at 1 second, and
x*=[6.837,6.837,6.837,26.026,26.026,26.026,26.026,10.412,10.412,10.412]T,
y*=[6.837,26.026,10.412]T.
it is to be noted that, in the attached drawings or in the description, the implementation modes not shown or described are all the modes known by the ordinary skilled person in the field of technology, and are not described in detail. Further, the above definitions of the various elements and methods are not limited to the various specific structures, shapes or arrangements of parts mentioned in the examples, which may be easily modified or substituted by those of ordinary skill in the art.
The technical means disclosed in the invention scheme are not limited to the technical means disclosed in the above embodiments, but also include the technical scheme formed by any combination of the above technical features. It should be noted that those skilled in the art can make various improvements and modifications without departing from the principle of the present invention, and such improvements and modifications are also considered to be within the scope of the present invention.
Claims (5)
1. A Nash equilibrium designated time search method for multi-group game with consistent intra-group decision is characterized by comprising the following steps:
step 1: aiming at the situations that a multi-unmanned cluster system collaborates in a cluster and competes for games among the clusters, a multi-cluster game model which obeys a consistency constraint set is constructed for a multi-agent system;
step 2: constructing a communication topology structure aiming at the multi-agent system;
and step 3: designing a multi-task cluster game equilibrium rapid and accurate searching method with appointed time convergence for each agent by combining a time planning method;
and 4, step 4: and providing a parameter condition for realizing specified time convergence by a Nash equilibrium search method.
2. The nash equilibrium designated time search method for multi-group game with consistent decision in group as claimed in claim 1, wherein the step 1 specifically comprises the following sub-steps:
step 1-1: aiming at the situations that a multi-unmanned cluster system collaborates in a cluster and competes for games among clusters, the following multi-cluster game model which obeys a consistency constraint set is constructed:
wherein N is the number of clusters participating in the game, and the cluster i comprises NiThe number of the individual agents is set to be,for the state of cluster i, the number ij indicates the jth agent in cluster i,in order for the state of the agent ij, representing the joint state of all clusters, the state of the clusters obeying a set of consistency constraints ofQuadratic continuous slightly convex function fij(x) Representing a cost function, function f, of agent j in cluster iij(x) With a Lipschitz continuous gradient: i.e. for anySatisfy the requirement of wherein lij0 is Lipschitz constant, function fi(x) Cost function for cluster i:
3. the nash equilibrium designated time search method for multi-group game with consistent decision in group as claimed in claim 1, wherein the step 2 specifically comprises the following sub-steps:
step 2-1: the communication topology of a multi-agent system is described as follows:
modeling communication topology among all agents as directed graphNode set isThe edges are collected asN is the number of clusters participating in the game, directional communication can be carried out on topological connection sides inside the clusters and among different clusters, and specifically, the cluster i comprises NiIndividual agents, agent sets represented asInduced subgraph for communication topology within cluster iIt is shown that,the number ij indicates the jth agent in the cluster i for which agentDefine its set of in-neighbors in the network asDefine the set of in-neighbors in its cluster asOut-neighbor set within its clusterDefinition mapIs a contiguous matrix of wherein Is the first of matrix ALine ofColumn element, if (pq, ij) ∈ ε, pq ≠ ijOtherwiseDefinition mapIs a contiguous matrix of wherein Is a matrix AiIf (il, ij) e epsiloniJ is not equal to l, thenOtherwiseObviously, A1,...,ANFor the diagonal blocks of the matrix a,definition ofIs shown as a drawingOf laplacian matrix of, whereinIs the second of the matrix LLine ofColumn element, if ij ═ pq, thenOtherwise
Step 2-2: the communication topology requirements for a multi-agent system are as follows:
4. The nash equilibrium designated time search method for multi-group game with consistent decision in group as claimed in claim 1, wherein the step 3 specifically comprises the following sub-steps:
step 3-1: and (3) estimating global state information for the intelligent agent based on a leader-following consistency idea by combining a time planning method:
wherein ,representing the estimate, normal, of the global state x by agent ijSatisfy the requirement ofdijRepresents the in degree of agent ij:is a sampling interval, a time sequence of sampling intervalsIs designed as
Step 3-2: the state iteration law of the agent and the auxiliary variable updating law for gradient information estimation are designed into the following forms:
wherein ,xij(t) represents the state of agent ij at time t,for estimating items of gradient information, initialisingIs composed ofα is a positive constant to be designed, the matrixIn the case of a random row, the data is,for elements in j row and m column, letMatrix arrayIs a random number of columns,for the elements in row j and column m,each agent ij selects two sets of positive parameter setsAndthe following conditions are satisfied:
wherein These two sets of parameters serve as weights for information received from inner neighbors within the cluster and information sent to outer neighbors within the cluster,is defined as a matrix RiLeft eigenvector corresponding to eigenvalue 1, i.e. satisfyviIs defined as a matrix CiThe right eigenvector corresponding to eigenvalue 1, i.e. satisfyA simpler selection method is:
5. The nash equilibrium designated time search method for multi-group game with consistent decision in group as claimed in claim 1, wherein the step 4 comprises the following sub-steps:
step 4-1: requiring a pseudo-gradientIs strongly monotonic, i.e. there is a constant l > 0 such that
wherein Can be regarded as an objective function of the cluster i, y ═ y1,y2,...yN]TStates for N virtual participants;
step 4-2: the designed Nash equilibrium search method for the multi-group game under the constraint of the intra-group decision consistency realizes the following requirements on the parameters of the specified time convergence step length:
wherein ,
σ=maxi{b1i}+γ2maxi{b2i}+γ3maxi{b3i},
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210056868.8A CN114488802B (en) | 2022-01-18 | 2022-01-18 | Nash equilibrium appointed time searching method for intra-group decision-making consistent multi-group game |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210056868.8A CN114488802B (en) | 2022-01-18 | 2022-01-18 | Nash equilibrium appointed time searching method for intra-group decision-making consistent multi-group game |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114488802A true CN114488802A (en) | 2022-05-13 |
CN114488802B CN114488802B (en) | 2023-11-03 |
Family
ID=81472207
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210056868.8A Active CN114488802B (en) | 2022-01-18 | 2022-01-18 | Nash equilibrium appointed time searching method for intra-group decision-making consistent multi-group game |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114488802B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150295407A1 (en) * | 2014-04-10 | 2015-10-15 | Nec Laboratories America, Inc. | Decentralized Energy Management Platform |
US20190347371A1 (en) * | 2018-05-09 | 2019-11-14 | Volvo Car Corporation | Method and system for orchestrating multi-party services using semi-cooperative nash equilibrium based on artificial intelligence, neural network models,reinforcement learning and finite-state automata |
US20200160168A1 (en) * | 2018-11-16 | 2020-05-21 | Honda Motor Co., Ltd. | Cooperative multi-goal, multi-agent, multi-stage reinforcement learning |
CN113534660A (en) * | 2021-05-27 | 2021-10-22 | 山东大学 | Multi-agent system cooperative control method and system based on reinforcement learning algorithm |
CN113778619A (en) * | 2021-08-12 | 2021-12-10 | 鹏城实验室 | Multi-agent state control method, device and terminal for multi-cluster game |
-
2022
- 2022-01-18 CN CN202210056868.8A patent/CN114488802B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150295407A1 (en) * | 2014-04-10 | 2015-10-15 | Nec Laboratories America, Inc. | Decentralized Energy Management Platform |
US20190347371A1 (en) * | 2018-05-09 | 2019-11-14 | Volvo Car Corporation | Method and system for orchestrating multi-party services using semi-cooperative nash equilibrium based on artificial intelligence, neural network models,reinforcement learning and finite-state automata |
US20200160168A1 (en) * | 2018-11-16 | 2020-05-21 | Honda Motor Co., Ltd. | Cooperative multi-goal, multi-agent, multi-stage reinforcement learning |
CN113534660A (en) * | 2021-05-27 | 2021-10-22 | 山东大学 | Multi-agent system cooperative control method and system based on reinforcement learning algorithm |
CN113778619A (en) * | 2021-08-12 | 2021-12-10 | 鹏城实验室 | Multi-agent state control method, device and terminal for multi-cluster game |
Also Published As
Publication number | Publication date |
---|---|
CN114488802B (en) | 2023-11-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Jiang et al. | Deep-learning-based joint resource scheduling algorithms for hybrid MEC networks | |
CN110929394B (en) | Combined combat system modeling method based on super network theory and storage medium | |
Xu et al. | Learning to explore via meta-policy gradient | |
CN112650239B (en) | Multi-underwater robot formation obstacle avoidance method and system based on improved artificial potential field method | |
CN107562066B (en) | Multi-target heuristic sequencing task planning method for spacecraft | |
CN114741886B (en) | Unmanned aerial vehicle cluster multi-task training method and system based on contribution degree evaluation | |
US11281232B2 (en) | Systems and methods for multi-agent system control using consensus and saturation constraints | |
CN111191728A (en) | Deep reinforcement learning distributed training method and system based on asynchronization or synchronization | |
CN109657794B (en) | Instruction queue-based distributed deep neural network performance modeling method | |
Bedi et al. | Asynchronous online learning in multi-agent systems with proximity constraints | |
Shi et al. | LANA: an ADMM-like Nash equilibrium seeking algorithm in decentralized environment | |
CN111414006A (en) | Unmanned aerial vehicle cluster reconnaissance task planning method based on distributed sequential distribution | |
CN113487029A (en) | Transplantable neural network distributed parallel strategy searching method | |
Xu et al. | Decentralized machine learning through experience-driven method in edge networks | |
Zhang et al. | Multi-agent system application in accordance with game theory in bi-directional coordination network model | |
Wang et al. | Oracle-guided deep reinforcement learning for large-scale multi-UAVs flocking and navigation | |
Li et al. | Multi-AUV distributed task allocation based on the differential evolution quantum bee colony optimization algorithm | |
Sarma et al. | Fog computing: an enhanced performance analysis emulation framework for IoT with load balancing smart gateway architecture | |
AlSuwaidan et al. | Swarm intelligence algorithms for optimal scheduling for cloud-based fuzzy systems | |
Zhang et al. | A parallel strategy for convolutional neural network based on heterogeneous cluster for mobile information system | |
Pruekprasert et al. | Quantitative supervisory control game for discrete event systems | |
CN114488802A (en) | Nash equilibrium designated time searching method for multi-group game with consistent decision in group | |
CN116362327A (en) | Model training method and system and electronic equipment | |
Hu et al. | Enhanced multi-strategy bottlenose dolphin optimizer for UAVs path planning | |
Xia et al. | A Collaborative Neurodynamic Optimization Approach to Distributed Nash-Equilibrium Seeking in Multicluster Games With Nonconvex Functions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |