CN116107276A - Logistics storage optimal coordination method based on distributed differential game - Google Patents

Logistics storage optimal coordination method based on distributed differential game Download PDF

Info

Publication number
CN116107276A
CN116107276A CN202211729139.0A CN202211729139A CN116107276A CN 116107276 A CN116107276 A CN 116107276A CN 202211729139 A CN202211729139 A CN 202211729139A CN 116107276 A CN116107276 A CN 116107276A
Authority
CN
China
Prior art keywords
guided vehicle
automated guided
guiding trolley
automatic
strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211729139.0A
Other languages
Chinese (zh)
Inventor
黄捷
薛文艳
林定慈
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou Zheyan Intelligent Technology Co ltd
Fuzhou University
Original Assignee
Fuzhou Zheyan Intelligent Technology Co ltd
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou Zheyan Intelligent Technology Co ltd, Fuzhou University filed Critical Fuzhou Zheyan Intelligent Technology Co ltd
Priority to CN202211729139.0A priority Critical patent/CN116107276A/en
Publication of CN116107276A publication Critical patent/CN116107276A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • G06Q10/087Inventory or stock management, e.g. order filling, procurement or balancing against orders
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/418Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM]
    • G05B19/41865Total factory control, i.e. centrally controlling a plurality of machines, e.g. direct or distributed numerical control [DNC], flexible manufacturing systems [FMS], integrated manufacturing systems [IMS], computer integrated manufacturing [CIM] characterised by job scheduling, process planning, material flow
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • G06F17/12Simultaneous equations, e.g. systems of linear equations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • G06F17/13Differential equations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06312Adjustment or analysis of established resource schedule, e.g. resource or task levelling, or dynamic rescheduling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/08Logistics, e.g. warehousing, loading or distribution; Inventory or stock management
    • G06Q10/083Shipping
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/30Nc systems
    • G05B2219/32Operator till task planning
    • G05B2219/32252Scheduling production, machining, job shop
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Abstract

The invention relates to an optimal logistics storage coordination method based on distributed differential game, which comprises the following steps: step S1, establishing a communication topological structure between guide trolleys in a multi-automation guide trolley system based on a warehouse logistics scene by utilizing graph theory, wherein the communication topological structure comprises a game participant model and an obstacle environment model; step S2, designing an operation cost function of the automatic guided vehicle in a game participant model by using an artificial potential field method, and step S3, regarding the logistics storage coordination problem of the multi-automatic guided vehicle system limited by perception and communication as a distributed differential game problem, and establishing a distributed differential game model; and S4, analyzing the existence and uniqueness of the local optimal coordination strategy by utilizing the Pontrisia minimum value principle, solving the expression form of the local optimal coordination strategy, and giving a convergence condition from local optimal coordination to global Nash equilibrium to obtain an optimal coordination scheme. The invention can reduce the time for the agent to reach the target point and promote the expandability of the automatic guiding trolley.

Description

Logistics storage optimal coordination method based on distributed differential game
Technical Field
The invention relates to the technical field of coordinated control of a multi-automatic guided trolley system, in particular to an optimal logistics storage coordination method based on distributed differential game.
Background
Along with the development of economy, the automatic guiding trolley becomes a core component part of modern warehouse logistics, so that the transportation time can be shortened, and the labor burden is lightened. In recent years, research on effective coordination strategies for multiple automated guided vehicle systems has attracted attention from many researchers. The coordination strategy can effectively realize local target requirements while considering global targets, maximize economic benefits of the warehouse logistics, and improve transportation efficiency of the warehouse logistics when the automatic guiding trolley executes respective given tasks in the warehouse logistics. Namely, the transportation task is efficiently completed while collision is avoided. Considering that the perception and communication of the automatic guiding trolley in the warehouse logistics are limited, it is very practical to analyze the coordination problem of the automatic guiding trolley under limited communication.
In order to improve the transportation efficiency of warehouse logistics, the automatic guiding trolley should ensure that a given transportation package task is completed safely and efficiently. Therefore, a good coordination strategy needs to optimize the track, which is a key to improve the transportation efficiency of the automated guided vehicles. Currently, two main types of coordination strategies, negotiation-based strategies and planning-based strategies, are classified. The negotiation-based policy is a greedy algorithm, and the final policy cannot achieve optimal coordination. Planning-based strategies are generally optimal, but with increasing number of automated guided vehicles, the computational effort increases and scalability is poor.
The game theory can optimize the behavior of each automatic guided vehicle from the perspective of group performance so as to realize Nash equilibrium and further improve the group performance. In particular, differential gaming can quantify the dynamic interactions and collision processes between multiple automated guided vehicles, often used to solve coordination problems, such as chase problems, formation problems. The literature (Mylvaganam T, sassano M, astolfi A. Introduction game approachto multi-agent collision avoidance [ J ]. IEEE Transactions onAutomatic Control,2017,62 (8): 4229-4235) solves the problem of coordinating collision avoidance strategies by first utilizing differential gaming methods. Including literature (LinW, qu Z, siman MA. Nash strategies forpursuit-evasion differential games involving limited observations [ J ]. IEEE Transactions onAerospace and Electronic Systems,2015,51 (2): 1347-1356.) proposes a method of constructing a feedback chase strategy that does not rely on global state information of an agent. Literature (de la Cruz N, jimenez-lizarra m. Finish time robust feedbackNash equilibrium for linear quadratic games J. IFAC-paperson line,2017,50 (1): 11794-11799.) creates a centralized differential game model with external disturbances that are considered virtual players that maximize cost functions, but without considering limited communication capabilities of agents, literature (FuY, chai t. On solution of-player zero-sum games for continuous-time nonlinear systems with completely unknown dynamics J. IEEE transactions on neural networks and learning systems,2015,27 (12): 2577-2587.) creates a distributed uncertainty and differential game that yields local Lu Bangna assortment, but without strict theoretical assurance. To achieve coordination of multi-agent global tasks, global convergence guarantees with local Lu Bangna assorted balances are required. Although the problem of collision avoidance of multiple intelligent agents can be successfully solved, the method can not be directly applied to storage logistics to solve the problem of coordination collision avoidance of the automatic guided vehicle. Mainly because the obtained local optimal solution cannot improve the working efficiency of the automatic guiding trolley; second, the communication assumption of the automated guided vehicle is perfect and is not limited by communication.
Therefore, in order to solve the problem of low task completion efficiency caused by the introduction of the differential game method into the collision avoidance problem, the artificial potential field method can be considered to be introduced to design the collision avoidance rule in combination with the distributed optimization method. A distributed differential game coordination collision avoidance strategy method is designed. The solution of differential game proposed by the current prior art mainly focuses on the situation that global information is known, and for the distributed differential game method, the solution is still rarely applied to the problem of collision prevention of an automatic guiding trolley in logistics storage, and a proper solution cannot be provided.
Disclosure of Invention
Therefore, the present invention aims to provide an optimal logistics storage coordination method based on distributed differential game, which aims to solve the above problems.
In order to achieve the above purpose, the invention adopts the following technical scheme:
a logistics storage optimal coordination method based on distributed differential game comprises the following steps:
step S1, establishing a communication topological structure between guide trolleys in a multi-automation guide trolley system based on a warehouse logistics scene by utilizing graph theory, wherein the communication topological structure comprises a game participant model and an obstacle environment model;
s2, designing an operation cost function of the automatic guiding trolley in a game participant model by using an artificial potential field method;
s3, regarding the logistics storage coordination problem of the multi-automation guiding trolley system limited by sensing and communication as a distributed differential game problem, and establishing a distributed differential game model;
and S4, analyzing the existence and uniqueness of the local optimal coordination strategy by utilizing the Pontrisia minimum value principle, solving the expression form of the local optimal coordination strategy, and giving a convergence condition from local optimal coordination to global Nash equilibrium to obtain an optimal coordination scheme.
Further, the warehouse logistics scene specifically includes
1) Task allocation: in the working environment, the staff assigns the command of transporting the package to the corresponding suitable automated guided vehicle;
2) And (3) parcel collection: after receiving the command, the automatic guiding trolley runs to a corresponding goods shelf from the charging station to pick up the package;
3) And (5) package transportation: after the automated guided vehicle picks up all the assigned packages, transporting the packages to the designated demand locations;
4) Returning to a charging station: after the automated guided vehicle performs the parcel transport task, it returns to the nearest charging station.
Further, the game participant model specifically comprises the following steps:
the specific form of the dynamic equation of the automatic guiding trolley is as follows:
Figure SMS_1
wherein p is i (t) A method of producing a solid-state image sensor
Figure SMS_2
Position and speed information of ith automated guided vehicle at time t, u i (t) is the control input of the ith automated guided vehicle at the moment t, B ii Is a corresponding constant matrix;
the automatic guiding trolley position error is specifically formed as follows:
Figure SMS_3
in the method, in the process of the invention,
Figure SMS_4
the position of the desired transport target point for the ith automated guided vehicle,/->
Figure SMS_5
At tMarking a position error between the current transportation position of the ith automatic guiding trolley and a desired transportation target point;
the dynamic equation of the multi-automation guiding trolley system is specifically formed as follows:
Figure SMS_6
in the method, in the process of the invention,
Figure SMS_7
for the state of the automated guided vehicle system at time t, < >>
Figure SMS_8
The state change rate of the automatic guiding trolley system at the time t is N, the number of the automatic guiding trolleys is B i Is a matrix of corresponding constants,
Figure SMS_9
establishing directed interaction topological graphs G (v, epsilon) of N automated guided vehicles; wherein v= { v 1 ,...,v N And represents a collection of automated guided vehicles,
Figure SMS_10
representing a collection of edges, e ij Indicating the connection relation between the automatic guiding trolley i and the automatic guiding trolley j, e ij Epsilon indicates that the automated guided vehicle i can receive the information of the automated guided vehicle j, and the neighbor set of the automated guided vehicle i is +.>
Figure SMS_11
Defining a neighbor information matrix of the automated guided vehicle i as follows:
Figure SMS_12
wherein I is 2 For a 2 x 2 matrix of units,
Figure SMS_13
is Cronecker product, and is a kind of->
Figure SMS_14
Information matrix for neighbor guided vehicles j other than automated guided vehicle i, wherein only matrix F i Row i and row j 1, the remainder being 0;
the local dynamic equation defining the automated guided vehicle i is:
Figure SMS_15
in the method, in the process of the invention,
Figure SMS_16
for the local status information of the automated guided vehicle i at time t,/>
Figure SMS_17
Is a corresponding constant matrix.
Further, the obstacle environment model specifically includes:
expanding the outline of various shaped obstacles in logistics storage into an ellipse to define a collision prevention area S ik The method comprises the following steps:
Figure SMS_18
wherein r is i For automating the safety radius of the guiding trolley i,
Figure SMS_19
is the radius of obstacle k, c k (t) is the centroid position of obstacle k at time t, E k =I 2 Is a unit matrix;
defining a sensing region D ik The method comprises the following steps:
Figure SMS_20
wherein R is i Is the induction range of the automated guided vehicle i;
definition of free region M ik The method comprises the following steps:
Figure SMS_21
further, the step S2 specifically includes:
the following preset conditions are given:
(1): the automatic guiding trolley does not slip during running;
(2): the directed interaction topology graph G (v, epsilon) between the automated guided vehicles is fixed and strongly connected;
design the anti-collision rule based on the artificial potential field method:
Figure SMS_22
in the formula, the constant χ i (0<χ i < 1) and the like
Figure SMS_23
Is constant (I)>
Figure SMS_24
The obstacle penalty function for the automated guided vehicle i at time t is defined as follows:
Figure SMS_25
Figure SMS_26
a distance penalty function for the automated guided vehicle i; for the problem of trajectory optimization of automated guided vehicles, the penalty function is defined as follows:
Figure SMS_27
wherein, gamma i For automatically guiding the deviation angle between the current track of the trolley i and the predefined reference track.
Further, the step S3 specifically includes:
the expression form of the cost function for establishing the distributed differential game is as follows:
Figure SMS_28
wherein t is f For the end time of the game of automated guided vehicle i, u -i (t) is the control strategy set of the neighbor automatic guiding trolley except the automatic guiding trolley i at the moment t,
Figure SMS_29
for the time t the operating cost function of the automated guided vehicle i is defined as +.>
Figure SMS_30
u ij (t) is the control strategy of the neighbor automatic guiding trolley j of the automatic guiding trolley i at the moment t, U (t) is the moment t control cost, which is defined as
Figure SMS_31
Wherein R is ii ,R ij The adjustable positive weight matrixes are respectively corresponding to the control strategies of the automatic guiding trolley i and the neighbor automatic guiding trolley j;
the objective of the distributed multi-automation guiding trolley system coordination control problem is to design an optimal coordination strategy for each automation guiding trolley, and the automation guiding trolley is required to safely reach a target point within a preset time, meanwhile, the automation guiding trolley i and the neighbor automation guiding trolley can be converged to local Nash equilibrium, namely a t moment strategy set
Figure SMS_32
The method meets the following conditions:
Figure SMS_33
in the method, in the process of the invention,
Figure SMS_34
and (5) an optimal cost function for the automated guided vehicle i at the moment t.
Further, the step S4 specifically includes:
let t be the time additional state:
Figure SMS_35
in the method, in the process of the invention,
Figure SMS_36
for the t moment phase cost function, +.>
Figure SMS_37
For the initial condition +.>
Figure SMS_38
Cost for end time;
let t time expansion state be:
Figure SMS_39
the corresponding dynamic equation is as follows:
Figure SMS_40
in the method, in the process of the invention,
Figure SMS_41
the distributed differential game problem is converted into an optimal control problem, and the expression form is as follows:
Figure SMS_42
according to the Pontrisia minimum principle, a Hamiltonian is defined as
Figure SMS_43
Wherein lambda is i (t) is the Lagrangian multiplier at time t;
the locally optimal coordination strategy satisfies the following differential equation set,
Figure SMS_44
and boundary conditions thereof:
p * (0)=p 0 ,
λ i (t f )=0 (20)
wherein p is 0 For automatically guiding initial position of trolley system lambda i (t f ) The Lagrangian multiplier is the last moment;
the only existing condition of the local optimal coordination strategy is set: aiming at the distributed differential game problem of the automatic guided vehicle in a given warehouse logistics, the existing local optimal coordination strategy
Figure SMS_45
The satisfied form of (c) is converted into the following matrix form: />
Figure SMS_46
In the method, in the process of the invention,
Figure SMS_47
for a given matrix, e.g.)>
Figure SMS_48
If the matrix is positive, a unique local coordination strategy exists;
analyzing the convergence of the local optimal coordination strategy to the global optimal strategy, wherein the method is defined as follows: for differential gaming of N participants, the set of t-moment policies { u }, if the following inequality holds 1 (t),...,u i (t),...,u N (t) } converges to a global Nash equilibrium solution,
Figure SMS_49
in the method, in the process of the invention,
Figure SMS_50
strategy for all automated guided vehicles remaining at time t except automated guided vehicle i, furthermore strategy +.>
Figure SMS_51
The following inequality is also satisfied:
Figure SMS_52
Figure SMS_53
is a locally optimal coordination strategy of the automatic guided vehicles relative to the neighbors, and if the topological graph among the automatic guided vehicles is strongly communicated, the method comprises the following steps: 1) For the automated guided vehicle i, each local optimum coordination strategy is equal, 2) the local optimum coordination of the automated guided vehicle can converge to global Nash equilibrium if and only if the communication topology of the automated guided vehicle is strongly connected.
Compared with the prior art, the invention has the following beneficial effects:
the invention aims at an automatic guiding trolley with a first-order linear model with limited perception and communication, and converts the coordination control problem of a multi-automatic guiding trolley system in a warehouse logistics into a distributed type differential game problem. Considering the existing differential game method only considering the obstacle avoidance target, introducing a track optimization target to punish the degree of the automatic guiding trolley deviating from the target point based on an artificial potential field method, weighing the distance between the automatic guiding trolley and the distance obstacle, and reducing the time for the automatic guiding trolley to reach the target point; based on the optimal control principle, giving out the expression form and the unique existence analysis of the local optimal strategy; under the assumption of a fixed strong communication topological graph, the global convergence of a local optimal strategy is ensured. The method can reduce the time for the automatic guiding trolley to reach the transportation point. In addition, a distributed architecture differential game model is introduced, compared with a centralized architecture, the distributed architecture differential game model has good expandability, and compared with the existing popular warehouse logistics coordination collision prevention method, the method reduces the task execution time by 16%.
Drawings
FIG. 1 is a block diagram of a method of an embodiment of the present invention;
FIG. 2 is a schematic view of a collision avoidance rule based on an artificial potential field method according to an embodiment of the present invention, wherein 1-automatic guided vehicle charging station, 2-shelf, 3-workbench, 4-obstacle, 5-automatic guided vehicle in operation, 6-package transportation point;
FIG. 3 is a graph of the positional deviation of the multi-homing car system in the abscissa axis in a simulation example of the practice of the present invention;
FIG. 4 is a graph of the positional deviation of the multiple automated guided vehicle system on the ordinate axis in a simulation example of the implementation of the present invention;
FIG. 5 is a communication topology of a multiple automated guided vehicle system employed in a simulation example of the present invention;
FIG. 6 is a diagram showing a transportation track of a multi-automated guided vehicle system in a simulation example of the implementation of the present invention;
FIG. 7 is a diagram showing algorithm transportation time versus time in a simulation example of an implementation of the present invention.
Detailed Description
The invention will be further described with reference to the accompanying drawings and examples.
Referring to fig. 1, the present invention provides an optimal logistics storage coordination method based on distributed differential game, comprising the following steps:
describing a storage logistics scene, utilizing graph theory, establishing a communication topological structure among guide trolleys in a multi-automation guide trolley system, wherein in order to cope with obstacles possibly occurring in the work of the guide trolleys, taking the guide trolleys and neighbors thereof as game participants, establishing a first-order linear integrator as a model of the guide trolleys, considering the complex working environment of the guide trolleys, inevitably encountering obstacles, designing a collision area, a sensing area and a free area for the working environment of the guide trolleys, and expanding various obstacles with different shapes in logistics storage into an ellipse;
designing a collision avoidance rule by using an artificial potential field method as an operation cost function of the automatic guiding trolley in the game model;
the logistics storage coordination problem of the multi-automation guiding trolley system limited by sensing and communication is regarded as a distributed differential game problem; establishing a distributed differential game model, wherein the model comprises an operation cost function and a control cost;
analyzing the existence and uniqueness of the local optimal coordination strategy by utilizing the Pontrian Jin Jixiao value principle, and solving the expression form of the local optimal coordination strategy; and analyzing the convergence of the global Nash equilibrium of the local optimal coordination.
In this embodiment, the transportation package task executed by the automated guided vehicle is a corresponding transportation target point of the given multi-automated guided vehicle system, so that the automated guided vehicle can reach the target point without collision under the condition of limited sensing and communication, and the completion time of the transportation task of the automated guided vehicle is reduced. The position deviation of each motorized guiding trolley tends to zero, and the transportation task is completed.
In this embodiment, the information acquired by the automated guided vehicle includes the following categories: the method comprises the steps of calculating local state information of the automatic guiding trolley and a first-order linear model of the automatic guiding trolley according to position information and strategy information of the neighbor automatic guiding trolley:
Figure SMS_54
wherein p is i (t) A method of producing a solid-state image sensor
Figure SMS_55
Position and speed information of ith automated guided vehicle at time t, u i (t) is the control input of the ith automated guided vehicle at the moment t, B ii Is a corresponding constant matrix;
the automatic guiding trolley position error is specifically formed as follows:
Figure SMS_56
in the method, in the process of the invention,
Figure SMS_57
for the desired transport position of the ith automated guided vehicle,/or->
Figure SMS_58
The position error between the current transportation position of the ith automatic guiding trolley at the t moment and the expected transportation target point is calculated;
the dynamic equation of the multi-automation guiding trolley system is specifically formed as follows:
Figure SMS_59
/>
in the method, in the process of the invention,
Figure SMS_60
for the state of the automated guided vehicle system at time t, < >>
Figure SMS_61
The state change rate of the automatic guiding trolley system at the time t is N, the number of the automatic guiding trolleys is B i Is a matrix of corresponding constants,
Figure SMS_62
establishing a directed interaction topological graph G (v, epsilon) of N automated guided vehicles, wherein v= { v 1 ,...,v N And represents a collection of automated guided vehicles,
Figure SMS_63
representing a collection of edges, e ij Indicating the connection relation between the automatic guiding trolley i and the automatic guiding trolley j, e ij Epsilon indicates that automated guided vehicle i can receive information of automated guided vehicle j, and neighbor set of automated guided vehicle i is +.>
Figure SMS_64
Defining a neighbor information matrix of the automated guided vehicle i as follows:
Figure SMS_65
wherein I is 2 For a 2 x 2 matrix of units,
Figure SMS_66
information matrix for neighboring guided vehicles j other than automated guided vehicle i, wherein only matrix F i Row i and row j 1, the remainder being 0;
the local dynamic equation defining the automated guided vehicle i is:
Figure SMS_67
in the method, in the process of the invention,
Figure SMS_68
for the local status information of the automated guided vehicle i at time t,/>
Figure SMS_69
Is a corresponding constant matrix;
establishing an obstacle environment model:
expanding obstacles with various shapes in warehouse logistics into an ellipse, and defining a collision prevention area S ik The method comprises the following steps:
Figure SMS_70
wherein r is i For automating the safety radius of the guiding trolley i,
Figure SMS_71
is the radius of obstacle k, c k (t) is the centroid position of obstacle k at time t, E k =I 2 Is a unit matrix;
defining a sensing regionDomain D ik The method comprises the following steps:
Figure SMS_72
wherein R is i Is the induction range of the automated guided vehicle i;
definition of free region M ik The method comprises the following steps:
Figure SMS_73
the following hypothetical conditions are given:
assume one: the automatic guiding trolley does not slip during running;
suppose two: the directed interaction topology graph G (v, ε) between automated guided vehicles is fixed and strongly connected;
design the anti-collision rule based on the artificial potential field method:
Figure SMS_74
in the formula, the constant χ i (0<χ i < 1) and the like
Figure SMS_75
Is constant (I)>
Figure SMS_76
Barrier penalty function for automated guided vehicle i at time t +.>
Figure SMS_77
A distance penalty function of the automated guided vehicle i at the moment t; />
The distance penalty function is represented as follows:
Figure SMS_78
in order to optimize the trajectory of the automated guided vehicle i, a distance penalty function is introduced, and the deviation of the penalty agent from the target point is expressed as follows:
Figure SMS_79
wherein, gamma i A deviation angle for the running of the automated guided vehicle i, which is the angle between the current trajectory of the automated guided vehicle i and the predefined reference trajectory;
the expression form of the cost function for establishing the distributed differential game is as follows:
Figure SMS_80
wherein t is f For the end time of the game of automated guided vehicle i, u -i (t) is the control strategy set of the neighbor automatic guiding trolley except the automatic guiding trolley i at the moment t,
Figure SMS_81
for the time t the operating cost function of the automated guided vehicle i is defined as +.>
Figure SMS_82
u ij (t) is the control strategy of the neighbor automatic guiding trolley j of the automatic guiding trolley i at the moment t, U (t) is the moment t control cost, which is defined as
Figure SMS_83
Wherein R is ii ,R ij The adjustable positive weight matrixes are respectively corresponding to the control strategies of the automatic guiding trolley i and the neighbor automatic guiding trolley j;
the objective of the distributed multi-automation guided vehicle system coordination control problem is to design an optimal coordination strategy for each automation guided vehicle, and the automation guided vehicle needs to safely reach a target point in as little time as possible, meanwhile, the automation guided vehicle i and the neighbor automation guided vehicles can converge to local Nash equilibrium, namely, a t-moment strategy set
Figure SMS_84
The method meets the following conditions:
Figure SMS_85
in the method, in the process of the invention,
Figure SMS_86
the optimal cost function of the automated guided vehicle i at the moment t;
the following relevant theorem and expression form are given for the locally optimal coordination strategy:
giving the additional state at time t:
Figure SMS_87
/>
in the method, in the process of the invention,
Figure SMS_88
for the t moment phase cost function, +.>
Figure SMS_89
For the initial state +.>
Figure SMS_90
For the end cost, the expansion state at time t is given as:
Figure SMS_91
the corresponding dynamic equation is as follows:
Figure SMS_92
in the method, in the process of the invention,
Figure SMS_93
the distributed differential game problem is converted into an optimal control problem, and the expression form is as follows:
Figure SMS_94
according to the Pontriere principle, a Hamiltonian is defined as
Figure SMS_95
Wherein lambda is i (t) is the Lagrangian multiplier at time t;
the locally optimal coordination strategy satisfies the following partial differential equation set:
Figure SMS_96
boundary conditions:
p * (0)=p 0 ,
λ i (t f )=0 (20)
wherein p is 0 For automatically guiding initial position of trolley system lambda i (t f ) The Lagrangian multiplier is the last moment;
giving the condition that the locally optimal coordination strategy exists only: aiming at the distributed differential game problem of the automatic guided vehicle in a given warehouse logistics, the existing local optimal coordination strategy
Figure SMS_97
The satisfied form of (c) is converted into the following matrix form:
giving the rule that the locally optimal coordination strategy exists only: for the distributed differential gaming problem of automated guided vehicles in a given warehouse logistics, the locally optimal coordination strategy can be converted into the following form
Figure SMS_98
In the method, in the process of the invention,
Figure SMS_99
for a given matrix, e.g.)>
Figure SMS_100
If the matrix is positive, a unique local coordination strategy exists;
in order to achieve global optimal coordination of transportation tasks of a multi-automation guided off-board system, an analysis bureau is required
The convergence of the partially optimal coordination policy to the globally optimal policy is defined as follows: for differential gaming of N participants, the set of t-moment policies { u }, if the following inequality holds 1 (t),...,u i (t),...,u N (t) } converges to a global Nash equilibrium solution.
Figure SMS_101
In the method, in the process of the invention,
Figure SMS_102
strategy for all automatic guided vehicles except for automatic guided vehicle i, furthermore strategy +.>
Figure SMS_103
So that the following formula is established;
Figure SMS_104
giving the proposition of global convergence of a local optimal coordination strategy: order the
Figure SMS_105
Is a locally optimal coordination strategy of the automatic guided vehicles relative to the neighbors, and if the topological graph among the automatic guided vehicles is strongly communicated, the method comprises the following steps: 1) For the automated guided vehicle i, each locally optimal coordination strategy is equal; 2) The local optimal coordination of the automated guided vehicles can converge to global nash equalization if and only if the communication topology of the automated guided vehicles is strongly connected.
In this embodiment, two specific examples are given to show the effectiveness and superiority of the proposed distributed differential gaming method in solving the problem of multi-homing dolly coordination and to verify the advantages of the method through experimentation.
In order to prove that the obtained local optimal coordination strategy can reduce the time for the automatic guided vehicle to complete the transportation task, the simulation experiment is carried out in the embodiment, and compared with the existing centralized differential game method only considering the obstacle punishment target. The specific model expression form of the multi-automatic guided vehicle system is given as follows:
Figure SMS_106
Figure SMS_107
wherein p is 1 (t) A method of producing a solid-state image sensor
Figure SMS_108
Position and speed information of the 1 st automatic guiding trolley at t moment respectively, p 2 (t) and->
Figure SMS_109
Position and speed information of the 2 nd automatic guiding trolley at the time t respectively, u 1 (t),u 2 (t) represents the control strategy of the automated guided vehicles 1,2, respectively;
the initial position and the transport target point position of each Automatic Guided Vehicle (AGV) are: p is p 1 (0)=[80,80] T ,p 2 (0)=[360,360] T
Figure SMS_110
The specific form of each automatic guiding trolley benefit function is as follows: />
Figure SMS_111
Figure SMS_112
As can be seen from fig. 3 to 4, the comparison method and the proposed method can both enable the deviation of the position of the agent to be 0, but the convergence time of the centralized differential game method introducing the trajectory optimization target is 105s and the convergence time of the centralized differential game method considering only the obstacle penalty is 125s, so the centralized differential game method introducing the trajectory optimization target can reduce the time for the agent to complete the task, and the transportation time is reduced by 16% compared with the comparison method.
To illustrate the distributed availability of the proposed method, this example provides a directed communication topology of 3 automated guided vehicles, as shown in fig. 5. The simulation experiment is carried out in the embodiment, and the specific model expression form of the multi-automatic guiding trolley is shown as follows:
Figure SMS_113
Figure SMS_114
Figure SMS_115
wherein p is 1 (t) A method of producing a solid-state image sensor
Figure SMS_116
Position and speed information of the 1 st automatic guiding trolley at t moment respectively, p 2 (t) and->
Figure SMS_117
Position and speed information of the 2 nd automatic guiding trolley at the time t respectively, p 3 (t) and->
Figure SMS_118
Position and speed information of the 3 rd automated guided vehicle at time t, u 1 (t),u 2 (t)u 3 (t) represents automated guidance respectivelyControl strategy for the trolley 1,2, 3.
The initial position and the transport target point position of each automatic guiding trolley are as follows: p is p 1 (0)=[30,30] T ,p 2 (0)=[370,370] T ,p 3 (0)=[370,30] T
Figure SMS_119
Figure SMS_120
p 3 (0)=[30,370]T. Induction radius R 1 =R 2 =48。
The specific form of each automatic guiding trolley benefit function is as follows:
Figure SMS_121
Figure SMS_122
Figure SMS_123
according to the method shown in fig. 6, the position deviation of each automatic guiding trolley can be made to be 0 by the proposed distributed differential game method, which indicates that the task is completed.
TO further illustrate the optimal coordination of the proposed distributed differential gaming method (DDG-TO), the negotiation-based method (FCFS) and the planning-based method (ETCEN-MCS) are compared TO the current more popular logistic warehousing methods.
Each automated guided vehicle was performed 10 times using the above three methods, respectively, and averaged. As can be seen from fig. 7, when the number of the automated guided vehicles is 4, the transportation time of the three methods is similar, the transportation time of the FCFS method is reduced and the transportation time of the ETCEN-MCS method is reduced as the number of the automated guided vehicles increases, and the transportation time of the proposed distributed differential game method (DDG-TO) method is the shortest, which is reduced by 15.7% and 9.2% compared with the former two methods, respectively. In summary, the distributed differential game method can enable each automatic guiding trolley to safely complete the transportation task, and can reduce the transportation time of each automatic guiding trolley.
The foregoing description is only of the preferred embodiments of the invention, and all changes and modifications that come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims (7)

1. The logistics storage optimal coordination method based on the distributed differential game is characterized by comprising the following steps of:
step S1, establishing a communication topological structure between guide trolleys in a multi-automation guide trolley system based on a warehouse logistics scene by utilizing graph theory, wherein the communication topological structure comprises a game participant model and an obstacle environment model;
s2, designing an operation cost function of the automatic guiding trolley in a game participant model by using an artificial potential field method;
s3, regarding the logistics storage coordination problem of the multi-automation guiding trolley system limited by sensing and communication as a distributed differential game problem, and establishing a distributed differential game model;
and S4, analyzing the existence and uniqueness of the local optimal coordination strategy by utilizing the Pontrisia minimum value principle, solving the expression form of the local optimal coordination strategy, and giving a convergence condition from local optimal coordination to global Nash equilibrium to obtain an optimal coordination scheme.
2. The distributed differential game-based logistics storage optimal coordination method as claimed in claim 1, wherein the storage logistics scene specifically comprises the following steps
1) Task allocation: in the working environment, the staff assigns the command of transporting the package to the corresponding suitable automated guided vehicle;
2) And (3) parcel collection: after receiving the command, the automatic guiding trolley runs to a corresponding goods shelf from the charging station to pick up the package;
3) And (5) package transportation: after the automated guided vehicle picks up all the assigned packages, transporting the packages to the designated demand locations;
4) Returning to a charging station: after the automated guided vehicle performs the parcel transport task, it returns to the nearest charging station.
3. The distributed differential game-based logistics storage optimal coordination method of claim 1, wherein the game participant model is specifically:
the specific form of the dynamic equation of the automatic guiding trolley is as follows:
Figure QLYQS_1
wherein p is i (t) A method of producing a solid-state image sensor
Figure QLYQS_2
Position and speed information of ith automated guided vehicle at time t, u i (t) is the control input of the ith automated guided vehicle at the moment t, B ii Is a corresponding constant matrix;
the automatic guiding trolley position error is specifically formed as follows:
Figure QLYQS_3
in the method, in the process of the invention,
Figure QLYQS_4
the position of the desired transport target point for the ith automated guided vehicle,/->
Figure QLYQS_5
The position error between the current transportation position of the ith automatic guiding trolley at the t moment and the expected transportation target point is calculated;
the dynamic equation of the multi-automation guiding trolley system is specifically formed as follows:
Figure QLYQS_6
in the method, in the process of the invention,
Figure QLYQS_7
for the state of the automated guided vehicle system at time t, < >>
Figure QLYQS_8
The state change rate of the automatic guiding trolley system at the time t is N, the number of the automatic guiding trolleys is B i Is a matrix of corresponding constants,
Figure QLYQS_9
establishing directed interaction topological graphs G (v, epsilon) of N automated guided vehicles; wherein v= { v 1 ,...,v N And represents a collection of automated guided vehicles,
Figure QLYQS_10
representing a collection of edges, e ij Indicating the connection relation between the automatic guiding trolley i and the automatic guiding trolley j, e ij Epsilon indicates that the automated guided vehicle i can receive the information of the automated guided vehicle j, and the neighbor set of the automated guided vehicle i is +.>
Figure QLYQS_11
Defining a neighbor information matrix of the automated guided vehicle i as follows:
Figure QLYQS_12
wherein I is 2 For a 2 x 2 matrix of units,
Figure QLYQS_13
is Cronecker product, and is a kind of->
Figure QLYQS_14
Letter for neighbor guided vehicle j other than automated guided vehicle iInformation matrix, wherein only matrix F i Row i and row j 1, the remainder being 0;
the local dynamic equation defining the automated guided vehicle i is:
Figure QLYQS_15
in the method, in the process of the invention,
Figure QLYQS_16
for the local status information of the automated guided vehicle i at time t,/>
Figure QLYQS_17
Is a corresponding constant matrix.
4. The logistic warehousing optimal coordination method based on the distributed differential game according to claim 1, wherein the obstacle environment model is specifically:
expanding the outline of various shaped obstacles in logistics storage into an ellipse to define a collision prevention area S ik The method comprises the following steps:
Figure QLYQS_18
wherein r is i For automating the safety radius of the guiding trolley i,
Figure QLYQS_19
is the radius of obstacle k, c k (t) is the centroid position of obstacle k at time t, E k =I 2 Is a unit matrix;
defining a sensing region D ik The method comprises the following steps:
Figure QLYQS_20
wherein R is i Is an automated guided vehicle iIs a sensing range of (a);
definition of free region M ik The method comprises the following steps:
Figure QLYQS_21
5. the logistic warehousing optimal coordination method based on the distributed differential game according to claim 1, wherein the step S2 is specifically:
the following preset conditions are given:
(1): the automatic guiding trolley does not slip during running;
(2): the directed interaction topology graph G (v, epsilon) between the automated guided vehicles is fixed and strongly connected;
design the anti-collision rule based on the artificial potential field method:
Figure QLYQS_22
in the formula, the constant χ i (0<χ i < 1) and the like
Figure QLYQS_23
Is constant (I)>
Figure QLYQS_24
The obstacle penalty function for the automated guided vehicle i at time t is defined as follows: />
Figure QLYQS_25
Figure QLYQS_26
A distance penalty function for the automated guided vehicle i; for the problem of trajectory optimization of automated guided vehicles, the penalty function is defined as follows:
Figure QLYQS_27
wherein, gamma i For automatically guiding the deviation angle between the current track of the trolley i and the predefined reference track.
6. The logistic warehousing optimal coordination method based on the distributed differential game according to claim 1, wherein the step S3 is specifically:
the expression form of the cost function for establishing the distributed differential game is as follows:
Figure QLYQS_28
wherein t is f For the end time of the game of automated guided vehicle i, u -i (t) is the control strategy set of the neighbor automatic guiding trolley except the automatic guiding trolley i at the moment t,
Figure QLYQS_29
for the time t the operating cost function of the automated guided vehicle i is defined as +.>
Figure QLYQS_30
u ij (t) is the control strategy of the neighbor automatic guiding trolley j of the automatic guiding trolley i at the moment t, U (t) is the moment t control cost, which is defined as
Figure QLYQS_31
Wherein R is ii ,R ij The adjustable positive weight matrixes are respectively corresponding to the control strategies of the automatic guiding trolley i and the neighbor automatic guiding trolley j;
the objective of the distributed multi-automation guiding trolley system coordination control problem is to design an optimal coordination strategy for each automation guiding trolley, and the automation guiding trolley is required to safely reach a target point within a preset time, and meanwhile, the automation guiding trolley i and an adjacent guiding trolley i are required to safely reach the target pointThe automated guided vehicle can converge to local Nash equilibrium, i.e., the set of t-moment strategies
Figure QLYQS_32
The method meets the following conditions:
Figure QLYQS_33
in the method, in the process of the invention,
Figure QLYQS_34
and (5) an optimal cost function for the automated guided vehicle i at the moment t.
7. The logistic warehousing optimal coordination method based on the distributed differential game according to claim 1, wherein the step S4 specifically includes:
let t be the time additional state:
Figure QLYQS_35
/>
in the method, in the process of the invention,
Figure QLYQS_36
for the t moment phase cost function, +.>
Figure QLYQS_37
For the initial condition +.>
Figure QLYQS_38
Cost for end time;
let t time expansion state be:
Figure QLYQS_39
the corresponding dynamic equation is as follows:
Figure QLYQS_40
in the method, in the process of the invention,
Figure QLYQS_41
the distributed differential game problem is converted into an optimal control problem, and the expression form is as follows:
Figure QLYQS_42
according to the Pontrisia minimum principle, a Hamiltonian is defined as
Figure QLYQS_43
Wherein lambda is i (t) is the Lagrangian multiplier at time t;
the locally optimal coordination strategy satisfies the following differential equation set,
Figure QLYQS_44
and boundary conditions thereof:
p * (0)=p 0 ,
λ i (t f )=0 (20)
wherein p is 0 For automatically guiding initial position of trolley system lambda i (t f ) The Lagrangian multiplier is the last moment;
the only existing condition of the local optimal coordination strategy is set: aiming at the distributed differential game problem of the automatic guided vehicle in a given warehouse logistics, the existing local optimal coordination strategy
Figure QLYQS_45
The satisfied form of (c) is converted into the following matrix form: />
Figure QLYQS_46
In the method, in the process of the invention,
Figure QLYQS_47
for a given matrix, e.g.)>
Figure QLYQS_48
If the matrix is positive, a unique local coordination strategy exists;
analyzing the convergence of the local optimal coordination strategy to the global optimal strategy, wherein the method is defined as follows: for differential gaming of N participants, the set of t-moment policies { u }, if the following inequality holds 1 (t),...,u i (t),...,u N (t) } converges to a global Nash equilibrium solution,
Figure QLYQS_49
in the method, in the process of the invention,
Figure QLYQS_50
strategy for all automatic guided vehicles remaining at time t except for automatic guided vehicle i, and strategy
Figure QLYQS_51
The following inequality is also satisfied:
Figure QLYQS_52
Figure QLYQS_53
is a locally optimal coordination strategy of the automatic guided vehicles relative to the neighbors, and if the topological graph among the automatic guided vehicles is strongly communicated, the method comprises the following steps: 1) For the automated guided vehicle i, each locally optimal coordination strategy is equal, 2) automated guidedThe locally optimal coordination of the carts can converge to global nash equalization if and only if the communication topology of the automated guided carts is strongly connected. />
CN202211729139.0A 2022-12-30 2022-12-30 Logistics storage optimal coordination method based on distributed differential game Pending CN116107276A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211729139.0A CN116107276A (en) 2022-12-30 2022-12-30 Logistics storage optimal coordination method based on distributed differential game

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211729139.0A CN116107276A (en) 2022-12-30 2022-12-30 Logistics storage optimal coordination method based on distributed differential game

Publications (1)

Publication Number Publication Date
CN116107276A true CN116107276A (en) 2023-05-12

Family

ID=86255576

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211729139.0A Pending CN116107276A (en) 2022-12-30 2022-12-30 Logistics storage optimal coordination method based on distributed differential game

Country Status (1)

Country Link
CN (1) CN116107276A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108171459A (en) * 2017-12-29 2018-06-15 长春师范大学 intelligent storage optimization method based on game theory
CN108829105A (en) * 2018-06-21 2018-11-16 浙江工业大学 It is a kind of to dispatch avoidance optimization method based on the warehouse logistics of KM algorithm and Artificial Potential Field Method
CN110751293A (en) * 2019-09-29 2020-02-04 浙江财经大学 Cloud manufacturing multi-task scheduling optimization method based on game theory
CN113159681A (en) * 2021-04-20 2021-07-23 华南理工大学 Multi-type intermodal dynamic path planning method based on game reinforcement learning
CN113393040A (en) * 2021-06-18 2021-09-14 重庆邮电大学工业互联网研究院 Industrial park logistics scheduling method and system based on game theory
WO2022133330A1 (en) * 2020-12-18 2022-06-23 Strong Force Vcn Portfolio 2019, Llc Robot fleet management and additive manufacturing for value chain networks
CN114662381A (en) * 2022-02-24 2022-06-24 上海交通大学 Layered game-based fusion operation method and system for energy traffic system of port ship
CN115411728A (en) * 2022-09-26 2022-11-29 南京理工大学 Multi-microgrid system coordination control method integrating Q learning and potential game

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108171459A (en) * 2017-12-29 2018-06-15 长春师范大学 intelligent storage optimization method based on game theory
CN108829105A (en) * 2018-06-21 2018-11-16 浙江工业大学 It is a kind of to dispatch avoidance optimization method based on the warehouse logistics of KM algorithm and Artificial Potential Field Method
CN110751293A (en) * 2019-09-29 2020-02-04 浙江财经大学 Cloud manufacturing multi-task scheduling optimization method based on game theory
WO2022133330A1 (en) * 2020-12-18 2022-06-23 Strong Force Vcn Portfolio 2019, Llc Robot fleet management and additive manufacturing for value chain networks
CN113159681A (en) * 2021-04-20 2021-07-23 华南理工大学 Multi-type intermodal dynamic path planning method based on game reinforcement learning
CN113393040A (en) * 2021-06-18 2021-09-14 重庆邮电大学工业互联网研究院 Industrial park logistics scheduling method and system based on game theory
CN114662381A (en) * 2022-02-24 2022-06-24 上海交通大学 Layered game-based fusion operation method and system for energy traffic system of port ship
CN115411728A (en) * 2022-09-26 2022-11-29 南京理工大学 Multi-microgrid system coordination control method integrating Q learning and potential game

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
庞明宝, 魏连雨: "区域物流线路网络双层规划研究", 公路交通科技, no. 10, 15 October 2005 (2005-10-15), pages 162 - 166 *

Similar Documents

Publication Publication Date Title
CN110443412B (en) Reinforced learning method for logistics scheduling and path planning in dynamic optimization processing process
CN112833905B (en) Distributed multi-AGV collision-free path planning method based on improved A-x algorithm
CN114296440B (en) AGV real-time scheduling method integrating online learning
CN113156954B (en) Multi-agent cluster obstacle avoidance method based on reinforcement learning
CN104346658A (en) Automatic trolley storage system scheduling method based on improved dynamic banker&#39;s algorithm
CN116400651A (en) Multi-AGV cooperative scheduling method and device for intelligent factory digital twin platform
Yuan et al. Multirobot task allocation in e-commerce robotic mobile fulfillment systems
CN109581987B (en) AGV (automatic guided vehicle) scheduling path planning method and system based on particle swarm optimization
Wang et al. Study on scheduling and path planning problems of multi-AGVs based on a heuristic algorithm in intelligent manufacturing workshop
Takahashi et al. Online optimization of AGV transport systems using deep reinforcement learning
CN116107276A (en) Logistics storage optimal coordination method based on distributed differential game
Xu et al. Research on open-pit mine vehicle scheduling problem with approximate dynamic programming
CN114757591B (en) Multi-vehicle type collaborative sorting scheduling method based on behavior dependency graph
CN114840001A (en) Multi-vehicle collaborative track planning method in closed environment
Zhou et al. AGV Path planning based on improved adaptive genetic algorithm
CN114690754B (en) Repeated operation type unmanned vehicle iterative learning self-optimizing tracking control algorithm
CN115358455A (en) Electric vehicle path optimization method considering battery replacement and hybrid time window constraints
CN115293394A (en) Multi-robot joint scheduling optimization method based on dynamic space-time diagram
CN114347031A (en) Mobile mechanical arm collision avoidance control method and device considering time-varying time lag
Fu et al. Path planning of cooperative robotics and robot team
Li et al. Improved genetic algorithm for multi-agent task allocation with time windows
Huang et al. Scheduling control of AGV system based on game theory
Wu et al. Event-triggered cooperative formation control for multi-agent system with dynamic role assignment
Fang et al. Autonomous docking trajectory planning of heavy AGV based on slip compensation model
Guo et al. Warehouse AGV path planning based on Improved A* algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination