CN114089755B

CN114089755B - Multi-robot task allocation method based on consistency packet algorithm

Info

Publication number: CN114089755B
Application number: CN202111352698.XA
Authority: CN
Inventors: 夏卫国; 李富强; 丁男; 吴迪; 孙希明
Original assignee: Dalian University of Technology
Current assignee: Dalian University of Technology
Priority date: 2021-11-16
Filing date: 2021-11-16
Publication date: 2024-02-02
Anticipated expiration: 2041-11-16
Also published as: CN114089755A

Abstract

The invention belongs to the field of multi-robot cooperative control, and provides a multi-robot task allocation method based on a consistency packet algorithm. Meanwhile, the task allocation problem is realized in a layered mode, namely, robots are allocated to each task area, and then tasks are allocated to the robots in each task area. In addition, considering the condition that single tasks and double tasks coexist in the intelligent warehousing system, in practice, the double tasks need to be completed by cooperation of a plurality of robots. Further, a certain priority relation exists between different tasks, and priorities in different areas serve as influencing factors of task allocation. The extended CBBA algorithm adopted by the invention has high success rate and higher task income when the obtained task allocation result is actually executed.

Description

Multi-robot task allocation method based on consistency packet algorithm

Technical Field

The invention belongs to the field of cooperative control of multiple robots, and particularly relates to a multi-robot task allocation method based on a consistency packet algorithm.

Background

In recent years, with the development of intelligent warehousing technology, the effect of mutually cooperating multiple robots to complete tasks presents a difficult advantage relative to a single robot body, and becomes a vital component in an intelligent warehousing system. Therefore, multi-robot systems are receiving increasing attention for practical application potential. The task allocation is a very critical combination optimization problem in the intelligent warehousing system, and the goal is to reasonably allocate the tasks to the warehousing robots on the premise of considering complex constraint conditions, so that the system performance is effectively improved, and the task allocation is usually realized by reducing the total execution time or the total distance of executing the tasks.

The task allocation quality directly determines the task completion efficiency and cost of the warehousing system, and research strategies about warehousing robots mainly focus on distributed methods, including a consistency algorithm, an auction algorithm, a consistency package algorithm and the like, and have been widely applied to various fields of social life. Such methods, while widely used, still have problems, such as consistency algorithms requiring robots to converge on a consistent situational awareness, may require a significant amount of time and often require the transmission of large amounts of data, which may lead to significant delays; the auction algorithm must somehow communicate the bids for each robot to the auctioneer, limiting the network topologies that can be used; the consistency bag algorithm (CBBA algorithm) mainly considers the application scenario where a single task is executed by one agent, and has a limitation in the scenario where multiple agents are required to cooperatively execute a complex task.

As warehousing systems continue to increase in complexity, systems become larger in size, and there may be numerous and remote picking subregions. Tasks that require multiple robots to cooperatively complete may also exist in each area, and priorities may also exist between different tasks. In order to solve the problem of optimizing the warehousing system under complex constraint, the probability of failure in allocation is increased by adopting a traditional task allocation mode, the real-time requirement cannot be met, and more reasonable solutions are difficult to obtain.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention provides a multi-robot task allocation method based on a consistency packet algorithm, which utilizes a hierarchical extended CBBA algorithm and aims to construct the distributed storage robot scheduling system so as to efficiently realize the allocation of tasks in multiple robots.

The technical proposal of the invention

A multi-robot task allocation method based on a consistency packet algorithm comprises the following steps:

step 1: consider that picking tasks in a warehouse system are distributed in N _a In the individual sub-areas, the task area labels are gathered asThe robot set involved in picking is +.>All robots are classified into two types. Totally N _T The individual tasks need to be sorted, and the task label set is +.>The task area contains N _t The tasks with different priorities are classified into a single task type and a double task type. Wherein, the storage robot I epsilon I can execute L at most _i Tasks, task area M E M needs L at most _a The number of robots needed for picking and executing task J E J is num _j 。

The optimal objective function expression for the task allocation problem is as follows:

wherein x is _mi And x _ij Representing a task allocation decision variable consisting of 0, 1 variables, x _mi =1 indicates that robot i is assigned to task area m, x _ij =1 indicates that task j is assigned to robot i. C (C) _ij Performing benefits for the task of robot i, vector p _i An ordered sequence of tasks for robot i is shown.

Task allocation in each task area also needs to take into account the priority of the task: v (V) _j ＝C _j /dur _j Wherein C _j And dur _j The value and execution time of task j, respectively. And selecting the task with the largest product of the priority and the bidding function as the optimal task of the corresponding robot for distribution. A viable solution to the task allocation problem needs to satisfy both the optimization objective function and this constraint relationship.

Assume that task j has an earliest start time of t _{min_start} At the latest the start time is t _{max_start} . The new bid function considering the complex constraint is expressed as:

wherein C is _j0 A fixed benefit value representing the execution of task j;and->Respectively represent the start time and the end time of the task j, lambda _j V is a time discounting factor _i And f _i Respectively representing the speed and the fuel consumption per unit time of the robot i, D _ij Is the distance from robot i to task j.

Step 2: in the task distribution process, the warehousing robot, the task area and the task all need to store and update some data information structures, and the method specifically comprises the following steps:

the data information structure that each warehousing robot needs to maintain includes:

(1) Task Bundle set (Bundle): task packageRepresents the set of tasks assigned to robot i, |b _i The I represents the length of the task package, where the tasks are arranged in the order of joining the packages.

(2) Execution Path list (Path): execution path listRepresents a list of tasks to be performed assigned to robot i, |p _i And the I represents the length of the list, and the tasks in the list are arranged according to the execution sequence of the robot plan.

(3) Execution Time set (Time):t _in ∈R ⁺ representing a robot execution path list p according to a task _i Time to arrival at each task.

(4) Winner matrix (Winning Uavs): winning robot matrix Z is N _u ×N _T A dimension matrix, the elements in the matrix storing the winner numbers of all tasks currently, where Z _ij =k means that robot i considers robot k to be the winner of task j. The number Sum of non-negative elements in the j-th column of the matrix Z _j Indicating the total number of robots assigned to task j.

(5) Winning bid matrix (wining Bids): winning bid matrix byThe rows of the matrix represent all robot identifications, and the columns of the matrix represent different tasks. The elements are in one-to-one correspondence with the matrix Z, representing the winner's bid for the winning mission. Wherein Y is _ij =0 means that robot i considers task j to have no winner.

(6) Timestamp sets (Time Stamps):s _in ∈R ⁺ the vector represents the time of the last information exchange between the robot i and the adjacent robot, and is an important index of the conflict resolution stage.

Each task area needs to store and update the following data information structure:

(1) Robot bag set (Bundle):|b _m the I represents the length of the bundle, contains all the robot sets assigned to the task area m, and the robots are arranged in the order of joining the bundles.

(2) Winning area matrix (wining Uavs): matrix Z ^A Is N _a ×N _u A dimension matrix, the elements in the matrix storing winning zone numbers of all current robots, whereinThe representation task area m considers task area n to be the winner of robot i.

(3) Winning area bid matrix (wining Bids): from the following componentsThe rows of the matrix represent all task area identifications and the columns of the matrix represent different robots. Each element and matrix Z ^A One-to-one correspondence indicates the bidding of the winning robot by the winning task area. Wherein->Indicating that task area m considers robot i to have no winner.

(4) Timestamp sets (Time Stamps):the vector represents the time of the last information exchange between the task area m and the adjacent task area, and is an important index of the conflict resolution stage.

The tasks comprise a single task and a double task, and the data information structures required to be maintained are respectively as follows:

(1) Single task: multi-robot bag setRepresenting a set of multiple robots assigned to task j, wherein +.>A robot that is beaten.

(2) Double-tasking: double robot bag setRepresenting a set of dual robots assigned to task j, where only two robots can be stored in a package at a time.

The position of all robots I e I is known as (x _i ,y _i ,z _i ) Speed and fuel consumption per unit time are v _i And f _i . The position (x) of all tasks J e J _j ,y _j ,z _j ) Time windowDiscount factor lambda _j Are known. And knowing the specific task T contained by all task areas m.epsilon.M _m . The total profit value obtained by the robot i performing the assigned task is denoted +.>In order to ensure the task allocation efficiency, the task allocation of the warehousing robots is realized in a layered mode, namely, all the warehousing robots are allocated to each task area, and then specific tasks in the task areas are allocated to the warehousing robots. The following steps describe the allocation process in detail.

Step 3: and distributing the warehousing robots among the task areas. The robot allocation of the task area is divided into three steps of task area parameter generation, robot package construction and area conflict resolution. The specific allocation flow for the task area M epsilon M is as follows:

(1) According to the task list T in the task area m _m And generating task area parameters. Specifically including the location (x _m ,y _m ,z _m ) Time windowDiscount factor lambda _m Value C _m And a fixed value C _m0 . Wherein->And->Respectively T _m Is->And->Minimum value->Minimum value corresponds to lambda of task _j And (x) _j ,y _j ,z _j ) I.e. as a task areaλ _m And (x) _m ,y _m ,z _m )，C _m Is T _m All C in (3) _j Average value of the sum, C _m0 Then is T _m All C in (3) _j0 And (5) accumulating the average value of the sums.

(2) The task area m is configured as a robot bag. For robot i not in the task area m bundle,sequentially calculating the benefit value C assigned to the mission region m _mi The method comprises the steps of carrying out a first treatment on the surface of the Will benefit value C _mi Corresponding +.>Comparing if for the same robot C _mi Is greater than->Then set bid flag +.>OtherwiseSelect->Robot corresponding to maximum value of (2), bundle added to task m +.>Updating matrix Z simultaneously ^A Sum matrix Y ^A 。

(3) And carrying out conflict resolution between task areas. After the task area M epsilon M receives the sharing information of the task area n epsilon M, the data information structure updating action rule is as follows:

updating: will beThe value of +.>Will->The value of +.>Resetting: will->Reset to-1, ">Reset to-1

Leaving:and->Does not make any changes

The two steps of robot bag construction of the task area and conflict solution between areas are iterated repeatedly until all robots are distributed, and a robot set obtained by distributing the task area m is expressed as R _m 。

Step 4: and constructing a task package set by all the storage robots in the task area. The task package set construction flow of the warehousing robot i in the task area m is as follows:

(1) Cycling through the robot set R in the task area m _m Taking out robots i epsilon R in sequence _m If |b _i |＜L _i Turning to the step (2), otherwise traversing the next robot;

(2) For task list T in task area m _m Consider an execution path list p _i Not yet contain tasks and new tasks add p _i Two conditions of the likelihood of each position in the database. Then, all the tasks j E T meeting the conditions are fetched _m The sequential computing tasks j are inserted at p _i Edge benefit value obtained at position n:

if C is present at this time _ij The value is larger than 0, the step (3) is switched to, otherwise, the step (1) is returned to;

(3) Considering that each task j in the task area m has priority V _j Selecting C with the largest product _max ＝C _ij ×V _j Task j corresponding to > 0. If num is _j =1, the task is of single task type, turning to step (4); if num is _j The task is of a double-task type and is converted into the step (5);

(4) Single task case: will benefit value C _ij Y corresponding to the current winning bid matrix _ij Comparing if the benefit value C is for the same task _ij Greater than Y _ij Setting an auction flag h _ij =1 go to step (6), otherwise h _ij =0 to step (7).

(5) Double-task case: if num _j ＞Sum _j H is then _ij =1 go to step (6); if num _j ＝Sum _j And C _ij A minimum bid value larger than the current task j is h _ij =1 go to step (6); if the two conditions are not satisfied, the robot i gives up the task j, namely h _ij =0 to step (7).

(6) The best task is denoted as J _i ＝argmax _j C _max ×h _ij Task J _i In the execution path list p _i The best position in (a) is expressed asMeanwhile, updating a data information structure corresponding to the robot i:

(7) Robotic i bid C for task j _ij Resetting to be-1, and returning to the step (2).

(8) When in the task area mRobot set R _m After the traversal is completed, the process jumps to step 5.

Step 5: and carrying out conflict resolution between the storage robots in the task area. Synchronous communication mechanism is adopted among all robots in the task area m, and the warehousing robot i epsilon R _m Receiving k epsilon R of storage robot _m And carrying out conflict resolution after sharing information.

In the conflict resolution stage, after one interaction is completed between robots, the time stamp information and the time stamp s are updated _i The updated formula of (c) is as follows:

wherein g _ik =1 means that there is a communication link between robots i and k, otherwise g _ik =0. Each node has a self-connecting edge, g _ii ＝1。τ _r Is the message reception time. The specific conflict resolution flow is as follows:

(1) Cycling through all conflicting tasks T in task area m _m The conflicting tasks j epsilon T are sequentially fetched _m . If num is _j =1, the conflicting task is of single task type, turning to step (2); if num is _j The conflict task is of a double-task type and is converted into the step (3);

(2) Single task case: robot i directly takes one of three possible actions according to the received allocation information, and then proceeds to step (6):

updating: y is set to _kj Is assigned to Y _ij Will Z _kj Is assigned to Z _ij 。

Resetting: y is set to _ij Reset to-1, Z _ij Reset to-1

Leaving: y is Y _ij And Z _ij Does not make any changes

(3) Double-task case: consider another robot q.epsilon.R _m The specific conflict resolution may be divided into two parts. The first part updates the self-stored information of the robot i to be the latest, and the step (4) is performed. The second part adjusts its own information for robot i,turning to the step (5).

(4) Robot i compares its bidding information with robot k, s _kq ＞s _iq Indicating that robot k has assigned an information update. The robot i stores the latest data and can confirm that the stored information is the latest.

(5) Setting that sender k thinks that robot q performs conflicting task j, while receiver i thinks that robot q does not perform this task, and satisfies i not equal to q and s _kq ＞s _iq Two conditions. If num _j ＞Sum _j Then update Y _ij ＝Y _kj ，Z _ij ＝Z _kj . If num _j ＝Sum _j And Y is _qj Minimum bid value greater than current task jY is then _rj ＝-1，Z _rj ＝-1，Y _ij ＝Y _kj ，Z _ij ＝Z _kj 。

(6) Robot i updates timestamp information s _i Returning to the step (1).

(7) When conflicting task set T in task area m _m After the traversal is completed, the step 4 is skipped.

The two steps of task package construction of the robots and conflict solution among the robots are iterated repeatedly until the interior of each task area converges to a conflict-free task allocation result, and therefore multi-area-based hierarchical task allocation is achieved.

The beneficial effects of the invention are as follows: firstly, complex constraints such as robot speed, task time window constraint, arrival time and travel cost loss are comprehensively considered, and a new bidding function is designed. Meanwhile, the task allocation problem is realized in a layered mode, namely, robots are allocated to each task area, and then tasks are allocated to the robots in each task area. In addition, considering the condition that single tasks and double tasks coexist in the intelligent warehousing system, in practice, the double tasks need to be completed by cooperation of a plurality of robots. Further, a certain priority relation exists between different tasks, and priorities in different areas serve as influencing factors of task allocation. The extended CBBA algorithm adopted by the invention has high success rate and higher task income when the obtained task allocation result is actually executed.

Drawings

FIG. 1 is a general flow chart of the allocation of picking tasks by the warehousing robot of the invention.

Fig. 2 is a flowchart of a specific allocation process in the present invention.

Fig. 3 is a graph of allocation results using a hierarchical based extended CBBA algorithm in accordance with the present invention.

Fig. 4 is a task allocation timing diagram employing a hierarchical-based extended CBBA algorithm in accordance with the present invention.

Fig. 5 is a graph of the allocation results using the basic CBBA algorithm.

Detailed Description

The following describes embodiments of the present invention in further detail with reference to the drawings and technical schemes.

As shown in fig. 1, a multi-robot task allocation method based on a consistency packet algorithm comprises the following steps: step 1: consider that picking tasks in a warehouse system are distributed in N _a Sub-region, task region label set asThe robot set involved in picking is +.>All robots are classified into two types. Totally N _T The individual tasks need to be sorted, and the task label set is +.>The task area contains N _t The tasks with different priorities are classified into a single task type and a double task type. Wherein, the storage robot I epsilon I can execute L at most _i Tasks, task area M E M needs L at most _a The number of robots needed for picking and executing task J E J is num _j 。

(5) Winning bid matrix (wining Bids): winning bid matrix byThe rows of the matrix represent all robot identifications, and the columns of the matrix represent different tasks. The elements are in one-to-one correspondence with the matrix Z, representing the winner's bid for the winning mission. Wherein Y is _ij =0 means that robot i believes the taskj has no winner.

(4) Timestamp sets (Time Stamps):representing task area m and adjacent task area mostThe time of the last information exchange, the vector is an important indicator of the conflict resolution stage.

The position of all robots I e I is known as (x _i ,y _i ,z _i ) Speed and fuel consumption per unit time are v _i And f _i . The position (x) of all tasks J e J _j ,y _j ,z _j ) Time windowDiscount factor lambda _j Are known. And knowing the specific task T contained by all task areas m.epsilon.M _m . The total profit value obtained by the robot i performing the assigned task is denoted +.>In order to ensure the task allocation efficiency, the task allocation of the warehouse robot is realized in a layered manner, namely, all robots are allocated to each task area, then specific tasks in the task areas are allocated to the robots, and a specific allocation flow is shown in fig. 2. The following steps will specifically describe the allocation process.

(1) According to the task list T in the task area m _m And generating task area parameters. Specifically including the location (x _m ,y _m ,z _m ) Time windowDiscount factor lambda _m Value C _m And a fixed value C _m0 . Wherein->And->Respectively T _m Is->And->Minimum value->Minimum value corresponds to lambda of task _j And (x) _j ,y _j ,z _j ) I.e. lambda of the task area _m And (x) _m ,y _m ,z _m )，C _m Is T _m All C in (3) _j Average value of the sum, C _m0 Then is T _m All C in (3) _j0 And (5) accumulating the average value of the sums.

Leaving:and->Does not make any changes

The two steps of robot bag construction of the task area and conflict solution between areas are iterated repeatedly, as shown in fig. 2, until all robots are distributed, and a robot set obtained by distributing the task area m is expressed as R _m 。

(8) Robot set R in task area m _m After the traversal is completed, the process jumps to step 5.

Resetting: y is set to _ij Reset to-1, Z _ij Reset to-1

Leaving: y is Y _ij And Z _ij Does not make any changes

(3) Double-task case: consider another robot q.epsilon.R _m The specific conflict resolution may be divided into two parts. The first part updates the self-stored information of the robot i to be the latest, and the step (4) is performed. The second part adjusts the information of the robot i and goes to the step (5).

(6) Robot i updates timestamp information s _i Returning to the step (1).

The two steps of task package construction of the robot and conflict solution among robots are iterated repeatedly, as shown in fig. 2, until the interior of each task area converges to a conflict-free task allocation result, and therefore multi-area-based hierarchical task allocation is achieved.

Examples:

system simulation environment: interl 2.8GHz,8GB memory PC, windows10 operating system, python3.6 version, pyCharm integrated development environment.

The invention adopts a three-dimensional map model, the height is known, and the coordinate system is a plane coordinate system. Assuming 10 stocker robots, a total of 20 picking tasks in the 4 task areas (A, B, C, D) need to be picked. Initial state information of the warehouse robot and the picking task in the simulation is shown in tables 1 and 2. Wherein the robots participating in picking are divided into two types, the maximum task number which can be allocated to a single robot is 10, and the maximum number of robots can be allocated to a single task area. The tasks to be picked are also classified into two types: single task (1 robot is required), double task (2 robots are required).

Table 1 robot parameter settings

TABLE 2 task parameter settings

Table 3 describes the parameters and allocation results for the respective task areas. In step 3, each task area is based on the internal task list T _m Generating self parameters, and realizing that all the storage robots are distributed to each task area, namely completing the distribution of the first layer.

TABLE 3 task area parameters

FIG. 3 is a final tasking result of an example of the present invention. In the figure, A0-A9 represent storage robots, gray planes represent storage robots divided into 4 task areas, and T0-T19 represent picking tasks. It can be seen that 10 stocker robots can be allocated to each task area and converge to a collision-free task allocation inside each task area. Task area a: a0- > T0- > T4; a1→t8→t12→t13; a8→t0→t8→t4;

task area B: a2- > T9- > T5- > T1; a4→t17→t19; a9- > T5- > T1- > T14- > T18 task area C: a6→t2→t6→t10→t15; a7- > T2- > T6;

task area D: a3→t7→t16→t3; a5→t7→t3→t11;

FIG. 4 is a task allocation timing diagram of an example of the present invention. In the figure, the different types of blocks represent different tasks, the start and end positions of the blocks represent the start and end times assigned to the current task, and the length represents the duration of the current task. From the figure, it is clear that the time periods for 10 robots to perform the respective tasks do not conflict with each other.

Fig. 5 is a result of task allocation using basic CBBA under the same condition. At this time, the task allocation is performed in the whole selection area, and the warehousing robot executes the corresponding picking task according to the allocation result. Comparing fig. 3 and 5, it can be seen that the task allocation using the hierarchical based extended CBBA algorithm results in higher total profits and less algorithm run time.

In summary, the invention provides a hierarchical-based extended CBBA algorithm for task allocation of multiple robots, aiming at the problems of task scheduling in the existing intelligent warehousing system. The algorithm comprehensively considers a plurality of factors such as a plurality of picking subareas, complex environmental constraints and the like, and adopts a two-layer task allocation mode, namely, the robots are allocated to each task area firstly, and then the tasks are allocated to the robots in each task area. The reliability and success rate of task allocation are effectively improved, and the method has important significance for improving the performance of the intelligent warehousing system.

The above description is only of the preferred embodiments of the present invention and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A multi-robot task allocation method based on a consistency packet algorithm is characterized by comprising the following steps:

step 1: consider that picking tasks in a warehouse system are distributed in N _a In the individual sub-areas, the task area labels are gathered asThe robot set involved in picking is +.>All robots are classified into two types; totally N _T The individual tasks need to be sorted, and the task label set is +.>The task area contains N _t The tasks with different priorities are divided into a single task type and a double task type; wherein the method comprises the steps ofThe storage robot I epsilon I can execute L at most _i Tasks, task area M E M needs L at most _a The number of robots needed for picking and executing task J E J is num _j ；

wherein x is _mi And x _ij Representing a task allocation decision variable consisting of 0, 1 variables, x _mi =1 indicates that robot i is assigned to task area m, x _ij =1 indicates that task j is assigned to robot i; c (C) _ij Performing benefits for the task of robot i, vector p _i Representing an ordered task sequence of robot i;

task allocation in each task area also needs to take into account the priority of the task: v (V) _j ＝C _j /dur _j Wherein C _j And dur _j The value and the execution time of the task j are respectively; selecting the task with the largest product of the priority and the bidding function as the optimal task of the corresponding robot for distribution; the feasible solution of the task allocation problem needs to meet the optimization objective function and the constraint relation at the same time;

assume that task j has an earliest start time of t _{min_start} At the latest the start time is t _{max_start} The method comprises the steps of carrying out a first treatment on the surface of the The new bid function considering the complex constraint is expressed as:

wherein C is _j0 A fixed benefit value representing the execution of task j;and->Respectively represent the start time and the end time of the task j, lambda _j V is a time discounting factor _i And f _i Respectively representing the speed and the fuel consumption per unit time of the robot i, D _ij Distance from robot i to task j;

(1) Task bundle set: task packageRepresents the set of tasks assigned to robot i, |b _i The I represents the length of a task package, wherein the tasks are arranged according to the sequence of adding the package;

(2) Execution path list: execution path listRepresents a list of tasks to be performed assigned to robot i, |p _i The I represents the length of the list, and the tasks in the list are arranged according to the execution sequence of the robot plan;

(3) Execution time set:t _in ∈R ⁺ representing a robot execution path list p according to a task _i Time to reach each task;

(4) Winner matrix: winning robot matrix Z is N _u ×N _T A dimension matrix, the elements in the matrix storing the winner numbers of all tasks currently, where Z _ij =k means that robot i considers robot k to be the winner of task j; the number Sum of non-negative elements in the j-th column of the matrix Z _j Representing the total number of robots assigned to task j;

(5) Winning bid matrix: winning bid matrix byThe rows of the matrix represent all robot identifications, and the columns of the matrix represent different tasks; each element corresponds to the matrix Z one by one and represents the bid of a winner on a winning task; wherein Y is _ij =0 means that robot i considers task j to have no winner;

(6) Timestamp set:s _in ∈R ⁺ the time of the last information exchange between the robot i and the adjacent robot is represented, and the vector is an important index of a conflict resolution stage;

(1) Robot bag set:|b _m the I represents the length of the bundle package, comprises all robot sets distributed to the task area m, and the robots are arranged according to the sequence of adding the package;

(2) Winning area matrix: matrix Z ^A Is N _a ×N _u A dimension matrix, the elements in the matrix storing winning zone numbers of all current robots, whereinIndicating that task area m considers task area n to be the winner of robot i;

(3) Winning zone bid matrix: from the following componentsRepresenting, rows of the matrix represent all task area identifications, and columns of the matrix represent different robots; each element and matrix Z ^A One-to-one correspondence, representing the bidding of the winning task area on the winning robot; wherein->Indicating that task area m considers robot i to have no winner;

(4) Timestamp set: the time of the latest information exchange between the task area m and the adjacent task area is represented, and the vector is an important index of a conflict resolution stage;

(1) Single task: multi-robot bag setRepresenting a set of multiple robots assigned to task j, wherein +.>A robot that is beaten;

(2) Double-tasking: double robot bag setRepresenting a set of dual robots assigned to task j, where only two robots can be stored at a time in a package;

the position of all robots I e I is known as (x _i ,y _i ,z _i ) Speed and fuel consumption per unit time are v _i And f _i The method comprises the steps of carrying out a first treatment on the surface of the The position (x) of all tasks J e J _j ,y _j ,z _j ) Time windowDiscount factor lambda _j Are known; and knowing the specific task T contained by all task areas m.epsilon.M _m The method comprises the steps of carrying out a first treatment on the surface of the The total profit value obtained by the robot i performing the assigned task is denoted +.>In order to ensure the efficiency of task allocation, the task allocation of the warehousing robots is realized in a layered mode, namely, all the warehousing robots are allocated to each task area, and then specific tasks in the task areas are allocated to the warehousing robots;

step 3: distributing storage robots among the task areas; the robot allocation of the task area is divided into three steps of task area parameter generation, robot packet construction and area conflict resolution; the specific allocation flow for the task area M epsilon M is as follows:

(1) According to the task list T in the task area m _m Generating task area parameters, including in particular the position (x _m ,y _m ,z _m ) Time windowDiscount factor lambda _m Value C _m And a fixed value C _m0 The method comprises the steps of carrying out a first treatment on the surface of the Wherein->

Andrespectively T _m Is->And->Minimum value->Minimum value corresponds to lambda of task _j And (x) _j ,y _j ,z _j ) I.e. lambda of the task area _m And (x) _m ,y _m ,z _m )，C _m Is T _m All C in (3) _j Average value of the sum, C _m0 Then is T _m All C in (3) _j0 An average value of the accumulated sums;

(2) Carrying out robot bag construction on the task area m; for robot i not in the task area m bundle,sequentially calculating the benefit value C assigned to the mission region m _mi The method comprises the steps of carrying out a first treatment on the surface of the Will benefit value C _mi Corresponding +.>Comparing if for the same robot C _mi Is greater than->Then set bid flag +.>Otherwise->SelectingRobot corresponding to maximum value of (2), bundle added to task m +.>Updating matrix Z simultaneously ^A Sum matrix Y ^A ；

(3) Conflict resolution is carried out between task areas; after the task area M epsilon M receives the sharing information of the task area n epsilon M, the data information structure updating action rule is as follows:

updating: will beThe value of +.>Will->The value of +.>

Resetting: will beReset to-1, ">Resetting to-1;

leaving:and->No change is made;

the two steps of robot bag construction of the task area and conflict solution between areas are iterated repeatedly until all robots are distributed, and a robot set obtained by distributing the task area m is expressed as R _m ；

Step 4: constructing a task package set by all storage robots in the task area; the task package set construction flow of the warehousing robot i in the task area m is as follows:

(2) For task list T in task area m _m Consider an execution path list p _i Not yet contain tasks and new tasks add p _i Two conditions of the likelihood of each position in the database; then, all the tasks j E meeting the conditions are fetchedT _m The sequential computing tasks j are inserted at p _i Edge benefit value obtained at position n:

(3) Considering that each task j in the task area m has priority V _j Selecting C with the largest product _max ＝C _ij ×V _j Task j corresponding to > 0; if num is _j =1, the task is of single task type, turning to step (4); if num is _j The task is of a double-task type and is converted into the step (5);

(4) Single task case: will benefit value C _ij Y corresponding to the current winning bid matrix _ij Comparing if the benefit value C is for the same task _ij Greater than Y _ij Setting an auction flag h _ij =1 go to step (6), otherwise h _ij =0 to step (7);

(5) Double-task case: if num _j ＞Sum _j H is then _ij =1 go to step (6); if num _j ＝Sum _j And C _ij A minimum bid value larger than the current task j is h _ij =1 go to step (6); if the two conditions are not satisfied, the robot i gives up the task j, namely h _ij =0 to step (7);

(7) Robotic i bid C for task j _ij Resetting to be-1, and returning to the step (2);

(8) Robot set R in task area m _m After the traversing is finished, jumping to the step (5);

step 5: conflict resolution is carried out among the storage robots in the task area; synchronous communication mechanism is adopted among all robots in the task area m, and the warehousing robot i epsilon R _m Receiving k epsilon R of storage robot _m After sharing information, carrying out conflict resolution;

wherein g _ik =1 means that there is a communication link between robots i and k, otherwise g _ik =0; each node has a self-connecting edge, g _ii ＝1；τ _r Is the message reception time; the specific conflict resolution flow is as follows:

(1) Cycling through all conflicting tasks T in task area m _m The conflicting tasks j epsilon T are sequentially fetched _m The method comprises the steps of carrying out a first treatment on the surface of the If num is _j =1, the conflicting task is of single task type, turning to step (2); if num is _j The conflict task is of a double-task type and is converted into the step (3);

updating: y is set to _kj Is assigned to Y _ij Will Z _kj Is assigned to Z _ij ；

Resetting: y is set to _ij Reset to-1, Z _ij Resetting to-1;

leaving: y is Y _ij And Z _ij No change is made;

(3) Double-task case: taking into account the otherRobot q epsilon R _m The specific conflict resolution is divided into two parts: the first part updates the self-stored information of the robot i to be the latest, and the step (4) is performed; the second part is that the robot i adjusts own information and the step (5) is converted;

(4) Robot i compares its bidding information with robot k, s _kq ＞s _iq Indicating that robot k distributes information update; the robot i stores the latest data and confirms that the stored information is the latest at present;

(5) Setting that sender k thinks that robot q performs conflicting task j, while receiver i thinks that robot q does not perform this task, and satisfies i not equal to q and s _kq ＞s _iq Two conditions; if num _j ＞Sum _j Then update Y _ij ＝Y _kj ，Z _ij ＝Z _kj The method comprises the steps of carrying out a first treatment on the surface of the If num _j ＝Sum _j And Y is _qj Minimum bid value greater than current task jY is then _rj ＝-1，Z _rj ＝-1，Y _ij ＝Y _kj ，Z _ij ＝Z _kj ；

(6) Robot i updates timestamp information s _i Returning to the step (1);

(7) When conflicting task set T in task area m _m After the traversing is finished, jumping back to the step (4);