CN111882101A - Control method based on supply chain system consistency problem under switching topology - Google Patents

Control method based on supply chain system consistency problem under switching topology Download PDF

Info

Publication number
CN111882101A
CN111882101A CN202010446158.7A CN202010446158A CN111882101A CN 111882101 A CN111882101 A CN 111882101A CN 202010446158 A CN202010446158 A CN 202010446158A CN 111882101 A CN111882101 A CN 111882101A
Authority
CN
China
Prior art keywords
supply chain
consistency
chain system
switching topology
strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010446158.7A
Other languages
Chinese (zh)
Inventor
李庆奎
弓镇宇
易军凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Information Science and Technology University
Original Assignee
Beijing Information Science and Technology University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Information Science and Technology University filed Critical Beijing Information Science and Technology University
Priority to CN202010446158.7A priority Critical patent/CN111882101A/en
Publication of CN111882101A publication Critical patent/CN111882101A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/042Backward inferencing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06315Needs-based resource requirements planning or analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0637Strategic management or analysis, e.g. setting a goal or target of an organisation; Planning actions based on goals; Analysis or evaluation of effectiveness of goals

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention relates to a control method for the consistency problem of a supply chain system based on switching topology, which comprises the following specific steps: s1: firstly, a supply chain system is modeled as a multi-agent system and a product matching problem under uncertain market demands is summarized as an H-infinity consistency problem; s2: secondly, the H infinity consistency problem of discrete time is researched by using a game theory under the switching topology, and the process of inhibiting the ox penis effect can be regarded as a game process; s3: then, a double-loop strategy iterative algorithm is given to solve the decoupled HJI equation; s4: finally, a simulation example is given to prove the effectiveness of the method. The invention utilizes the distributed optimal control and the zero sum game theory to research the H infinity consistent problem of a multi-agent supply chain system based on a switching topology, the productivity and the market demand are considered as game participants, and the optimal productivity aiming at the worst disturbance situation is obtained by solving the HJI equation according to the zero sum game theory.

Description

Control method based on supply chain system consistency problem under switching topology
Technical Field
The invention relates to the technical field of supply chain systems, in particular to a control method based on a supply chain system consistency problem under a switching topology.
Background
A supply chain system is a network system that integrates a series of devices and business entities and includes a plurality of sub-supply chains. From a practical perspective, the new generation of technology of internet of things enables devices in a supply chain system to be equipped with computing capabilities, and thus such a system can be considered a multi-agent system. In recent years, with the development of information technology, a multi-agent based supply chain system has received more and more attention due to its wide application in research directions such as power systems, intelligent manufacturing systems, and intelligent transportation systems. In an enterprise, raw material processing, intelligent factories, logistics and customers are integrated into one cyber-physical system, and the control manner is converted from centralized control to distributed control, which improves production quality and efficiency. On the other hand, from a theoretical point of view, a multi-agent based supply chain system has self-organization, scalability, and coordination. Therefore, the supply chain system based on the intelligent agent has stronger robustness in the face of uncertain market demands, has better reliability in the face of information flow logistics interruption or production faults, and has high flexibility due to coordination design when node enterprises are subjected to increase and decrease changes, and a scientific and rigorous control method for the consistency problem of the supply chain system does not exist, so that a control method based on the consistency problem of the supply chain system under switching topology is provided.
Disclosure of Invention
The invention aims to solve the defects in the prior art, and provides a control method based on the consistency problem of a supply chain system under a switching topology.
In order to achieve the purpose, the invention provides the following technical scheme:
a control method based on a supply chain system consistency problem under switching topology comprises the following specific steps:
s1: first, the supply chain system is modeled as a multi-agent system and the product matching problem under uncertain market demand is resolved to the H ∞ consistency problem;
s2: secondly, the H infinity consistency problem of discrete time is researched by game theory under the switching topology, the process of inhibiting the ox penis effect can be regarded as the game process, the productivity and the market demand are respectively participants of the game, and the solution of the coupled HJI equation is needed to be obtained when the game is solved, singular terms can appear in the solution due to the characteristic of average consistency and the situation of appearance of isolated points is considered, so that a decoupling method is designed to indirectly obtain the solution of the HJI equation, and the optimal control strategy is proved to be at a Nash equilibrium point under certain specific conditions;
s3: next, a double-loop strategy iterative algorithm is given to solve the decoupled HJI equation;
s4: finally, a simulation example is given to prove the effectiveness of the proposed method.
Preferably, preliminary knowledge and problem modeling is required in step S1, first modeling the interaction between agents through graph theory, and then modeling based on the multi-agent supply chain system.
Preferably, in the step S2, the distributed zero sum game problem of the supply chain system, the H ∞ problem of the supply chain system is classified as solving a two-person zero sum game problem, the decoupling method is designed to simplify the calculation of the global HJI equation, and the proof that the system obtains H ∞ consistency at nash equilibrium under specific conditions is given, the decoupled HJI equation is solved by using a strategy iteration algorithm, and an optimal production strategy for uncertain demand and switching topology is obtained.
Preferably, the strategy iteration and decoupling HJI equation in step S3 further calculates the optimal productivity, and it is obvious to solve the lei-poonfov HJI equation by using the strategy iteration algorithm, where the strategy iteration algorithm includes an inner loop and an outer loop, and the inner loop performs strategy evaluation, where the production control and consistency protocol is fixed, the market demand gain is continuously updated until convergence, and the outer loop performs strategy update, and the production control and consistency protocol is updated.
Compared with the prior art, the invention has the beneficial effects that: the invention utilizes distributed optimal control and zero-sum game theory to research a class of H infinity consistent problem based on a multi-agent supply chain system with switching topology, productivity and market demand are considered as game participants, according to the zero-sum game theory, the optimal productivity aiming at worst-case disturbance is obtained by solving an HJI equation, and a decoupling method and a strategy iteration algorithm are further given to implement specific solving steps, and finally, a simulation example shows that the optimal production strategy designed according to the proposed method has good control performance and can realize the inhibition of the bullwhip effect.
Drawings
FIG. 1 is a schematic view of the production process of the ith subchain containing n devices according to the present invention;
FIG. 2 is a schematic diagram of the topology between agents and the internal structure of an ith agent in accordance with the present invention;
FIG. 3 is a schematic diagram of a switching signal according to the present invention;
FIG. 4 shows inventory level x under the switching topology zero sum gaming method of the present inventioni1A schematic diagram of variations;
FIG. 5 shows inventory level x under the switching topology zero sum gaming method of the present inventioni2A schematic diagram of variations;
FIG. 6 shows the error inventory level under the switching topology zero sum gaming method of the present inventioni1A schematic diagram of variations;
FIG. 7 shows the error inventory level under the switching topology zero sum gaming method of the present inventioni2A schematic diagram of variations;
FIG. 8 shows inventory level x under the switching topology H ∞ control method of the present inventioni1A schematic diagram of variations;
FIG. 9 shows inventory level x under the switching topology H ∞ control method of the present inventioni2A schematic diagram of variations;
FIG. 10 is a diagram illustrating the error inventory level under the switching topology H ∞ control method of the present inventioni1A schematic diagram of variations;
FIG. 11 shows the error inventory level under the switching topology H ∞ control method of the present inventioni2Schematic diagram of the variation.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1-11, the present invention provides a technical solution:
a control method based on a supply chain system consistency problem under switching topology comprises the following specific steps:
s1: first, the supply chain system is modeled as a multi-agent system and the product matching problem under uncertain market demand is resolved to the H ∞ consistency problem;
s2: secondly, the H infinity consistency problem of discrete time is researched by game theory under the switching topology, the process of inhibiting the ox penis effect can be regarded as the game process, the productivity and the market demand are respectively participants of the game, and the solution of the coupled HJI equation is needed to be obtained when the game is solved, singular terms can appear in the solution due to the characteristic of average consistency and the situation of appearance of isolated points is considered, so that a decoupling method is designed to indirectly obtain the solution of the HJI equation, and the optimal control strategy is proved to be at a Nash equilibrium point under certain specific conditions;
s3: next, a double-loop strategy iterative algorithm is given to solve the decoupled HJI equation;
s4: finally, a simulation example is given to prove the effectiveness of the proposed method.
Preliminary knowledge and problem modeling:
A. theory of the drawings
Modeling interactions between agents with a graph G { V, E, a }, where V {1, …, N } is a finite, nonempty set and each agent can be viewed as a node, E ═ V × V is an edge set if node i can send information to node j, then (i, j) ∈ E, a ═ Vij]An adjacency matrix representing a graph, for the ith and jth nodes in the graph, a when (i, j) ∈ E ij1, otherwise aij0; neighbor set for agent i is NiJ | j ∈ V, (i, j) ∈ E }. An orphan point is that the node has no neighbors and is also notIf there is at least one node in the graph that can reach any other node through a directed path, the graph is called to contain a directed spanning tree, and D is set to be diag (D)i) Representing an in-degree matrix of a graph, wherein
Figure RE-GDA0002666746100000041
Representing the degree of entry of node i, the laplacian matrix of the graph may be defined as L ═ D-a, the element L of the laplacian matrixijThe definition is as follows:
Figure RE-GDA0002666746100000042
the laplace matrix of a symmetric graph is a symmetric matrix, the graph considered herein is a symmetric graph and does not contain self-loops;
we consider the case where there is and only one isolated point present at some time, and he can restore the connection with other child chains; the isolated point is not fixed, it changes over time, so the topology is time-varying; let the switching signal be σ: [0, ∞) → N ═ 0, 1., N } and σ (k) ═ h, which means that the h-th agent is an isolated point and no isolated point appears when σ (k) → 0; the time-varying topology may be denoted as Gσ(k)Drawing union { Gσ(k)K ∈ [0, ∞) } is GuChanging the isolated points can change the neighbor sets of some sub-chains and the Laplace matrix of the graph, and can also affect the interaction among the sub-chains; time-varying neighbor set
Figure RE-GDA0002666746100000043
Laplace matrix is LhAnd L is0L; the laplacian matrix elements when h ≠ 0 are:
Figure RE-GDA0002666746100000051
for further discussion, the following assumptions are first introduced:
assume that 1: union of graphs { Gσ(k)K ∈ [0, ∞) } contains the spanning tree;
B. modeling based on a multi-agent supply chain system:
assuming that a supply chain system based on multi-agent modeling includes N sub-chains, which can be regarded as agents and also nodes in a topological graph, first, we discuss the internal operation process of the ith sub-chain, as shown in fig. 1, where there are N devices,
wherein the production rate, market demand and desired inventory level of the ith subchain may be expressed as:
Figure RE-GDA0002666746100000052
Figure RE-GDA0002666746100000053
Figure RE-GDA0002666746100000054
Figure RE-GDA0002666746100000055
from a control point of view, xij(k),uij(k) Can be regarded as the state variable and the control input of the jth device in the ith sub-chain respectively, considering that each sub-chain can be regarded as a cascade system, wherein the control input is not only the input of the jth device but also the demand of the last device, in the real production activity, the production rate uij(k) Is non-negatively bounded, meaning 0 ≦ uij(k)≤um. Wherein u ismIs the maximum production rate. The last device is directly connected to the market and customer needs directly affect it, and in order to measure and handle the market needs the following assumptions need to be given.
Assume 2: the demand consists of a known invariant demand and an unknown time variant demand, then the market demand can be expressed as:
Figure RE-GDA0002666746100000056
in which the demand ω is not determinedin(k)∈l2[0,∞]Can be considered as an external disturbance.
Assume that 3: in a supply chain system, a reasonable production plan can meet market demands.
In practice, the stock capacity is at an upper bound and the stored products in the warehouse are spoiled for various reasons, respectively by sijAnd ρjRepresenting the upper bound and the corruption rate of the jth warehouse store of the ith subcohain. The following assumptions can therefore be given.
Assume 4: the stock level satisfies 0 ≤ xij(k)≤sijAnd the product has a spoilage rate rhojThe rate of deterioration occurs.
Here we can model the ith sub-chain according to assumptions 2-4:
xi1(k+1)=(1-ρ1)xi1(k)+ui1(k)-ui2(k)
xi2(k+1)=(1-ρ2)xi2(k)+ui2(k)-ui3(k)
Figure RE-GDA0002666746100000061
xin(k+1)=(1-ρn)xin(k)+uin(k)-din(k)
therefore, the model of the ith subchain can be given by
xL(k+1)=(I-ρ)xi(k)+Bui(k)-di(k) (4)
In the formula:
ρ=diag{ρ1,ρ2,...,ρn}
Figure RE-GDA0002666746100000062
and integrating the sub-chains into a global dynamic equation:
Figure RE-GDA0002666746100000063
a commodity may be formed by combining a plurality of component parts, different sub-chains of a supply chain system may be responsible for different part processing, and reasonably planning the stock levels of different products is beneficial to reducing the stock cost while ensuring the quantity of output commodities, which indicates that the research for achieving consistency or matching the product stock quantity is indispensable, and in one sub-chain, the stock level of each warehouse device is directly or indirectly influenced by other devices, so that a desired storage level needs to be designed for each warehouse device according to market demands, and the desired storage level of the jth device of the ith sub-chain is recorded as cijIt is clear that the size of this variable is within the warehouse's allowable storage range, so the product quantity matching problem implies a state xij(k) To achieve the desired value, it is necessary to introduce the following definitions before further discussion.
Definition 1: for a multi-agent system containing N agents, achieving consistency means that the individual agent states in the system satisfy at any initial state:
Figure RE-GDA0002666746100000071
the storage level reaching the desired value is actually a solution to the state tracking problem, which can be seen as an extension of the consistency problem.
To achieve consistency in the supply chain system, the production rate of each child chain should be appropriately adjusted according to the inventory levels of its neighbors, which indicates that a reasonable consistency protocol needs to be designed, we assume that the production rate contains two parts,
Figure RE-GDA0002666746100000072
is responsible for the control of the production,
Figure RE-GDA0002666746100000073
is in charge of the consistency protocol and is,since the topology changes depending on the switching signal σ (k) ═ h, the controller can be designed to:
Figure RE-GDA0002666746100000074
Figure RE-GDA0002666746100000075
wherein
Figure RE-GDA0002666746100000076
And is
Figure RE-GDA0002666746100000077
Respectively represent a production control gain and a uniformity gain, and
Figure RE-GDA0002666746100000078
ci=[ci1,ci2,…,cin]T
we can derive the global control law as:
Figure RE-GDA0002666746100000079
Figure RE-GDA00026667461000000710
Figure RE-GDA00026667461000000711
wherein
Figure RE-GDA00026667461000000712
Figure RE-GDA00026667461000000713
Figure RE-GDA00026667461000000714
Note 1: the coherence protocol is distributed and topology dependent, and the state information received from other agents is incomplete when soliton nodes are present.
Recording the average state as
Figure RE-GDA0002666746100000081
Error vector formed by
Figure RE-GDA0002666746100000082
Given, we can therefore derive:
Figure RE-GDA0002666746100000083
then, the global error dynamics of the supply chain system can be normalized as:
Figure RE-GDA0002666746100000084
wherein:
Figure RE-GDA0002666746100000085
Figure RE-GDA0002666746100000086
Figure DEST_PATH_BDA0002505874380000084
note 3: known constant demand can be directly eliminated by production control, so we only need to consider the effect of unknown demand on the supply chain system, and time-varying demand will cause the bullwhip effect, but can suppress its effect by solving the H ∞ consistency problem.
Note 4: symmetrical drawing of laplaThe gaussian matrix is also symmetric, so in combination with the laplace matrix property one can derive: l ishM=MLh=Lh
Order to
Figure RE-GDA0002666746100000091
Figure RE-GDA0002666746100000092
The error system can therefore be rewritten as follows:
written as follows:
Figure RE-GDA0002666746100000093
wherein z (k) is the system output, and
Figure RE-GDA0002666746100000094
and
Figure RE-GDA0002666746100000095
can be represented by the following formula:
Figure RE-GDA0002666746100000096
Figure RE-GDA0002666746100000097
solving the H-infinity consistency problem requires designing the consistency protocol and production control so that the error system is in
Figure RE-GDA0002666746100000098
The time is gradually stabilized and the following suppression conditions are satisfied.
Definition 2: let gamma be*Is the lower bound of the disturbance suppression level, for non-zero
Figure RE-GDA0002666746100000099
And a bounded function beta, the system (10) is l2The gain is bounded and is less than or equal to a given positive scalar γ, where γ ≧ γ*
Figure RE-GDA00026667461000000910
Distributed zero-sum gaming problems for supply chain systems:
in this section, the H ∞ problem of the supply chain system is summarized as solving the two-person zero-sum game problem, and in order to simplify the computation of the global HJI equation, this section designs a decoupling method and gives proof that the system achieves H ∞ consistency at nash equilibrium under specific conditions, and furthermore, we adopt a strategy iteration algorithm to solve the decoupled HJI equation and obtain an optimal production strategy for uncertain demand and switching topology.
For convenience of writing, let x (k) be xkUnder the switching topology, an infinite interval performance function is defined for the supply chain system as follows:
Figure RE-GDA0002666746100000101
of formula (II) Q'h≥0,
Figure RE-GDA0002666746100000102
T>0 is an adaptive symmetric matrix, and the switching signal is σ (k) ═ h. Matrix Q'hDepending on the topology of the graph, it also means that it is associated with the Laplace matrix LhCorrelation, the corresponding value function can be defined as:
Figure RE-GDA0002666746100000103
the H ∞ consistency problem can be reduced to the zero and differential gaming problems, the production control and consistency protocol minimizes the performance function, and the uncertain market demand maximizes the performance function, so the gaming process can be expressed as:
Figure RE-GDA0002666746100000104
if a saddle point exists in the game
Figure RE-GDA0002666746100000105
It has only one solution
Figure RE-GDA0002666746100000106
Figure RE-GDA0002666746100000107
It is equivalent to nash equilibrium conditions:
Figure RE-GDA0002666746100000108
from the bellman optimality principle and equation (14), the bellman equation can be derived:
Figure RE-GDA0002666746100000109
consider a quadratic form of the value function:
Figure RE-GDA00026667461000001010
substituting (19) into (18) yields:
Figure RE-GDA0002666746100000111
the Hamiltonian is then:
Figure RE-GDA0002666746100000112
through the stable conditions:
Figure RE-GDA0002666746100000113
the optimal production control, optimal consistency protocol and worst case disturbances are available, hence:
Figure RE-GDA0002666746100000114
Figure RE-GDA0002666746100000115
in the formula
Figure RE-GDA0002666746100000116
Wherein the matrix pair
Figure RE-GDA0002666746100000117
Is capable of being calmed down and has the advantages of,
Figure RE-GDA0002666746100000118
is detectable and for an optimal coherence protocol there exists a matrix G such that:
Figure RE-GDA0002666746100000119
note 5: the matrix G provides greater freedom for uniform design, in this context, the Laplace matrix LhIs singular and therefore
Figure RE-GDA00026667461000001110
Is a singular gain matrix, whereas when G is 0, the unity gain matrix in (24) is non-singular, apparently with
Figure RE-GDA00026667461000001111
Contradictory, so for the sake of the following discussion, the addition of the non-zero matrix G is chosen here.
Moreover, substituting (20) production control (22), consistency protocol (23), and market demand (24) may result in a global HJI equation:
Figure RE-GDA00026667461000001112
Figure RE-GDA0002666746100000121
wherein:
Figure RE-GDA0002666746100000122
note 6: matrix Q'hAccording to the topology change, thus resulting in the solution P 'of the HJI equation'hAnd also varied, whereby the production rate at which isolated nodes occur can be designed.
The error state based feedback control law is shown in (11), so the optimal productivity under the switching topology is related to the switching topology, assuming that
Figure RE-GDA0002666746100000123
And the solution of the HJI equation satisfies
Figure RE-GDA0002666746100000124
Combining (22) to obtain:
Figure RE-GDA0002666746100000125
similarly, the optimal coherency protocol is:
Figure RE-GDA0002666746100000126
suppose that
Figure RE-GDA0002666746100000127
And the worst-case market demand has the form:
Figure RE-GDA0002666746100000128
in the formula
Figure RE-GDA0002666746100000129
Represents the worst case market demand gain, and therefore
Figure RE-GDA00026667461000001210
And 7, note: through the theory of zero-sum gaming, optimal productivity can suppress the bullwhip effect produced by worst-case disturbances. If the production rate of a supply chain system design can achieve consistency under worst case disturbances, agreement can also be reached when addressing other market needs.
To decouple the HJI equation, the following arguments apply.
Introduction 1: consider a multi-agent based supply chain error dynamic system (10), assuming
Figure RE-GDA0002666746100000131
G1=(Lh-IN)P1
Figure RE-GDA0002666746100000132
And Q'hSatisfies the following conditions:
Figure RE-GDA0002666746100000133
in the formula
Figure RE-GDA0002666746100000134
In relation to the switching signal, solving the HJI equation is equivalent to solving the following equation:
Figure RE-GDA0002666746100000135
in the formula:
Figure RE-GDA0002666746100000136
and (3) proving that: value function and matrix
Figure RE-GDA0002666746100000137
Is correlated and the weight matrix is in the form of
Figure RE-GDA0002666746100000138
Will be provided with
Figure RE-GDA0002666746100000139
Substituting (26) and (29), respectively, can obtain:
Figure RE-GDA0002666746100000141
Figure RE-GDA0002666746100000142
order to
Figure RE-GDA0002666746100000143
Combining (27) to obtain:
Figure RE-GDA0002666746100000144
thus, HJI equation (25) is equivalent to:
Figure RE-GDA0002666746100000145
wherein:
Figure RE-GDA0002666746100000146
selecting
Figure RE-GDA0002666746100000147
The following can be obtained:
Figure RE-GDA0002666746100000148
where the matrix is symmetric, thus:
Figure RE-GDA0002666746100000149
Figure RE-GDA0002666746100000151
if weight matrix Q'hSatisfies the following conditions:
Figure RE-GDA0002666746100000152
in the formula
Figure RE-GDA0002666746100000153
The global HJI equation can be decoupled as:
Figure RE-GDA0002666746100000154
in the formula
Figure RE-GDA0002666746100000155
And has:
Figure RE-GDA0002666746100000156
thus, P 'can be obtained by solving (31) and further'h
Note 8: matrix QhAssociated with the switching signal, so that different solutions P can be derived from different topologieshTopology information is hidden, and the optimal state feedback gain under the switching topology is further obtained.
Assume that 5: for some matrices S, the weight matrix Qh=STLhAnd S. And S may be a pair of matrices
Figure RE-GDA0002666746100000157
Can be detected.
B.l2Gain boundedAnd Nash equilibrium
In this subsection, we demonstrate that a set production rate can suppress the bullwhip effect at a level while achieving consistency of the supply chain system, and that the solution of the HJI equation satisfies the nash equilibrium condition, first introducing the following reasoning:
2, leading: suppose V*Is a positive solution of the HJI equation, then:
Figure RE-GDA0002666746100000161
theorem 1: assuming that the suppression level of the bullwhip effect satisfies gamma>γ*And the HJI equation has a smooth positive solution. The supply chain system will achieve H ∞ consistency under optimal production control and optimal consistency protocol, where market demand is satisfied
Figure RE-GDA0002666746100000162
And (3) proving that: substituting the solution of the HJI equation into the hamiltonian (21) yields:
Figure RE-GDA0002666746100000163
in the formula
Figure RE-GDA0002666746100000164
Order to
Figure RE-GDA0002666746100000165
We have:
Figure RE-GDA0002666746100000166
furthermore, there is a need for the market
Figure RE-GDA0002666746100000167
(36) can be formed as:
Figure RE-GDA0002666746100000168
thus, the equalization point of the error system (10) by the Lyapunov theorem is progressively stabilized. Taking into account the constraint condition (12) and theorem 2, adding up both sides of the inequality (36) yields:
Figure RE-GDA0002666746100000169
wherein
Figure RE-GDA0002666746100000171
If N → ∞, then there are:
Figure RE-GDA0002666746100000172
thus, in connection with definition 2, it is clear that the supply chain system is capable of suppressing the bullwhip effect at a level and achieving consistency.
Definition 2: assuming that the suppression level of the bullwhip effect satisfies gamma>γ*And the HJI equation has a smooth positive solution
Figure RE-GDA0002666746100000173
Figure RE-GDA0002666746100000174
When the supply chain system reaches consistency, when G4Satisfy the requirement of
Figure RE-GDA0002666746100000175
Then
Figure RE-GDA0002666746100000176
For Nash equilibrium and optimal game solution
Figure RE-GDA0002666746100000177
And (3) proving that: for the smoothing value function V: (k) The performance function (13) can be rewritten as:
Figure RE-GDA0002666746100000178
suppose V*(k) Can be derived by solving the HJI equation, and the optimal strategy is
Figure RE-GDA0002666746100000179
The preparation method can obtain:
Figure RE-GDA00026667461000001710
the supply chain error system progressively stabilizes at the equilibrium point, which indicates that the supply chain achieves H ∞ consistency.
Thus is provided with
Figure RE-GDA0002666746100000181
And is
Figure RE-GDA0002666746100000182
Is obviously provided with
Figure RE-GDA0002666746100000183
When in use
Figure RE-GDA0002666746100000184
The performance function (38) may be written as:
Figure RE-GDA0002666746100000185
suppose that
Figure RE-GDA0002666746100000186
Then there are:
Figure RE-GDA0002666746100000187
if G is4Satisfies the inequality (37), then
Figure RE-GDA0002666746100000188
Further satisfying the nash equalization condition (17), at the equalization point, one obtains:
Figure RE-GDA0002666746100000189
wherein the optimal game is solved as
Figure RE-GDA00026667461000001810
Strategy iteration and decoupling HJI equation:
to further calculate the optimal productivity, a strategic iterative algorithm is used to solve the Lyapunov-like HJI equation (31)
It is evident that the strategy iteration algorithm contains an inner loop and an outer loop. The inner loop performs strategic evaluation, wherein the production control and consistency protocols are fixed, the market demand gain is continuously updated through (33) until convergence, and the outer loop performs strategic update, updating the production control and consistency protocols through (32) and (34).
Figure RE-GDA0002666746100000191
Figure RE-GDA0002666746100000201
Note 9: the decoupled HJI equation is a non-linear equation and still difficult to directly solve, so a reinforcement learning algorithm is adopted to obtain a locally converged solution, and therefore, optimal production control, an optimal consistency protocol and worst market requirements can be achieved through the algorithm.
Assume that there are three sub-supply chains for a region, each supply chain including two devices that are responsible for production and sale, respectively. Assuming that the ith sub-chain is disconnected from other sub-chains within a certain period of time, i.e. the ith agent becomes an isolated node, fig. 2 is a possible topology and a corresponding laplace matrix;
Figure RE-GDA0002666746100000202
Figure RE-GDA0002666746100000211
assuming that the goods in the warehouses in the three sub-chains deteriorate at the same rate, the deterioration rate matrix is as follows:
Figure RE-GDA0002666746100000212
given the initial state of each child chain as x1(0)=[6,7]T,x2(0)=[3.5,2]T,x3(0)=[2.5,3]TThe expected state value is c1=[2,2]T,c2=[2.5,2.5]T,c3=[3,3]T. The appropriate weight matrix is selected and substituted into the HJI equation to obtain the optimal production control strategy, the optimal consistency protocol and the worst case requirements, and the switching signals are as shown in fig. 3.
It can be seen from fig. 4 and 5 that the status of each sub-chain approaches the respective expected status value, and the stock status fluctuates during the occurrence of the isolated point because the status information interaction is interrupted, and the status of the supply chain returns to the expected value when the system resumes the connection, and further, fig. 6 and 8 show that the average consistency error of the supply chain approaches zero, so that the three sub-supply chains can reach the expected stock level when the isolated point increases or decreases, which means that the system can achieve consistency in the switching topology, and the production rate can suppress the bullwhip effect caused by uncertain market demand, therefore, the supply chain system can achieve H ∞ consistency, and further, by continuously reducing the value of γ, the minimum suppression level γ can be searched*=0.346。
By contrast, a standard H ∞ control method is also implemented here, with simulation results as shown in FIGS. 8-11, and similarly suppliedThe chain system can achieve consistency and reduce the bullwhip effect, and the minimum inhibition level is gamma*0.534, a comprehensive consideration of the two methods can make the system realize H-infinity coincidence, but there are still significant differences, such as shown in fig. 1, that the suppression level of the zero-sum game method is lower than that of the H-infinity control method, and the drastic change of the state curve in the H-infinity control can be observed, so that the zero-sum game method has better uncertainty suppression capability, and in addition, the stabilization time of the dynamic response of the system under the zero-sum game method is less than that of the standard H-infinity control.
Although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that various changes in the embodiments and/or modifications of the invention can be made, and equivalents and modifications of some features of the invention can be made without departing from the spirit and scope of the invention.

Claims (4)

1. A control method based on supply chain system consistency problem under switching topology is characterized in that: the method comprises the following specific steps:
s1: first, the supply chain system is modeled as a multi-agent system and the product matching problem under uncertain market demand is resolved to the H ∞ consistency problem;
s2: secondly, the H infinity consistency problem of discrete time is researched by game theory under the switching topology, the process of inhibiting the ox penis effect can be regarded as the game process, the productivity and the market demand are respectively participants of the game, and the solution of the coupled HJI equation is needed to be obtained when the game is solved, singular terms can appear in the solution due to the characteristic of average consistency and the situation of appearance of isolated points is considered, so that a decoupling method is designed to indirectly obtain the solution of the HJI equation, and the optimal control strategy is proved to be at a Nash equilibrium point under certain conditions;
s3: next, a double-loop strategy iterative algorithm is given to solve the decoupled HJI equation;
s4: finally, a simulation example is given to prove the effectiveness of the proposed method.
2. The method for controlling supply chain system consistency problem based on switching topology as claimed in claim 1, wherein: preliminary knowledge and problem modeling is required in said step S1, first modeling the interaction between agents by graph theory and then modeling based on the multi-agent supply chain system.
3. The method for controlling supply chain system consistency problem based on switching topology as claimed in claim 1, wherein: in the step S2, the H ∞ problem of the supply chain system is summarized as solving the two-person zero sum game problem, the decoupling method is designed to simplify the calculation of the global HJI equation, and the proof that the system obtains H ∞ consistency at nash equilibrium under the condition is given, the decoupled HJI equation is solved by using a strategy iteration algorithm, and an optimal production strategy for uncertain demand and switching topology is obtained.
4. The method for controlling supply chain system consistency problem based on switching topology as claimed in claim 1, wherein: the strategy iteration and decoupling HJI equation in the step S3, calculating the optimal productivity, and solving the lei apunov-like HJI equation by using the strategy iteration algorithm obviously shows that the strategy iteration algorithm comprises an inner ring and an outer ring, the inner ring executes strategy evaluation, wherein the production control and consistency protocol is fixed, the market demand gain is continuously updated until convergence, the outer ring executes strategy update, and the production control and consistency protocol is updated.
CN202010446158.7A 2020-05-25 2020-05-25 Control method based on supply chain system consistency problem under switching topology Pending CN111882101A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010446158.7A CN111882101A (en) 2020-05-25 2020-05-25 Control method based on supply chain system consistency problem under switching topology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010446158.7A CN111882101A (en) 2020-05-25 2020-05-25 Control method based on supply chain system consistency problem under switching topology

Publications (1)

Publication Number Publication Date
CN111882101A true CN111882101A (en) 2020-11-03

Family

ID=73154013

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010446158.7A Pending CN111882101A (en) 2020-05-25 2020-05-25 Control method based on supply chain system consistency problem under switching topology

Country Status (1)

Country Link
CN (1) CN111882101A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114708043A (en) * 2022-05-25 2022-07-05 湖南工商大学 Method, system, equipment and storage medium for measuring bullwhip effect of supply chain
CN114760101A (en) * 2022-03-18 2022-07-15 北京信息科技大学 Product and supply chain cooperative evolution system compensation method and system under network attack
CN115311027A (en) * 2022-10-11 2022-11-08 工业云制造(四川)创新中心有限公司 Supply chain management method and system based on digital twin

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130273514A1 (en) * 2007-10-15 2013-10-17 University Of Southern California Optimal Strategies in Security Games
CN110083063A (en) * 2019-04-29 2019-08-02 辽宁石油化工大学 A kind of multiple body optimal control methods based on non-strategy Q study
CN110782011A (en) * 2019-10-21 2020-02-11 辽宁石油化工大学 Networked multi-agent system distributed optimization control method based on reinforcement learning

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130273514A1 (en) * 2007-10-15 2013-10-17 University Of Southern California Optimal Strategies in Security Games
CN110083063A (en) * 2019-04-29 2019-08-02 辽宁石油化工大学 A kind of multiple body optimal control methods based on non-strategy Q study
CN110782011A (en) * 2019-10-21 2020-02-11 辽宁石油化工大学 Networked multi-agent system distributed optimization control method based on reinforcement learning

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
唐毅等: "基于博弈论的供应链信息共享行为研究", 《内蒙古科技与经济》 *
弓镇宇等: "基于零和博弈方法的多智能体系统H_∞一致性", 《河南科学》 *
晏妮娜等: "电子市场环境下双源渠道模型及其牛鞭效应H∞控制", 《东北大学学报(自然科学版)》 *
覃茜等: "多智能体系统在切换拓扑和时变时延下的H_∞一致性", 《陕西科技大学学报(自然科学版)》 *
覃茜等: "多智能体系统在切换拓扑和时变时延下的H∞一致性", 《陕西科技大学学报(自然科学版)》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114760101A (en) * 2022-03-18 2022-07-15 北京信息科技大学 Product and supply chain cooperative evolution system compensation method and system under network attack
CN114708043A (en) * 2022-05-25 2022-07-05 湖南工商大学 Method, system, equipment and storage medium for measuring bullwhip effect of supply chain
CN114708043B (en) * 2022-05-25 2022-09-13 湖南工商大学 Method, system, equipment and storage medium for measuring bullwhip effect of supply chain
CN115311027A (en) * 2022-10-11 2022-11-08 工业云制造(四川)创新中心有限公司 Supply chain management method and system based on digital twin

Similar Documents

Publication Publication Date Title
CN111882101A (en) Control method based on supply chain system consistency problem under switching topology
Maestre et al. Distributed model predictive control based on agent negotiation
Maestre et al. Distributed model predictive control based on a cooperative game
Parise et al. A distributed algorithm for almost-Nash equilibria of average aggregative games with coupling constraints
Xue et al. Dissipativity-based filter design for Markov jump systems with packet loss compensation
Romano et al. Dynamic Nash equilibrium seeking for higher-order integrators in networks
Bianchi et al. A continuous-time distributed generalized Nash equilibrium seeking algorithm over networks for double-integrator agents
Islek et al. A Decision Support System for Demand Forecasting based on Classifier Ensemble.
Kambhampati et al. Inverse model control using recurrent networks
Zhao et al. Semi-global cooperative cluster output regulation for heterogeneous multi-agent systems with input saturation
Tao et al. Adaptive fuzzy fixed time control for pure-feedback stochastic nonlinear systems with full state constraints
Maestre et al. Distributed MPC: a supply chain case study
Xu et al. Active management strategy for supply chain system using nonlinear control synthesis
Parise et al. A distributed algorithm for average aggregative games with coupling constraints
Brock et al. Regular economies and conditions for uniqueness of steady states in optimal multi-sector economic models
CN111624882A (en) Zero and differential game processing method for supply chain system based on reverse-thrust design method
Yu et al. Distributed consensus-based estimation and control of large-scale systems under gossip communication protocol
Li et al. Adaptive resilient containment control for nonlower triangular multiagent systems with time-varying delay and sensor faults
Norouzi The effect of forecast evolution on production planning with resources subject to congestion
Sun et al. Dynamical investigation and distributed consensus tracking control of a variable-order fractional supply chain network using a multi-agent neural network-based control method
Farraa et al. Inventory control of a class of logistic networks
Nagurney et al. Spatial price equilibrium and food webs: The economics of predator-prey networks
Zhang et al. Simulation and Analysis of the Complex Dynamic Behavior of Supply Chain Inventory System from Different Decision Perspectives
Basterrech et al. A more powerful random neural network model in supervised learning applications
Wang et al. Parameterizing the Rank List Model with Optimal Pricing Decision

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20201103