CN114995397A

CN114995397A - ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization

Info

Publication number: CN114995397A
Application number: CN202210520624.0A
Authority: CN
Inventors: 杨绍富; 杨越森; 刘庆山
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2022-05-13
Filing date: 2022-05-13
Publication date: 2022-09-02

Abstract

The invention discloses a multi-robot formation algorithm for the ROS platform based on distributed asynchronous optimization. Each robot first determines a formation according to the target to be enclosed and establishes a distributed optimization model. It then checks an event-trigger condition by comparing whether its current update progress lags behind that of the other nodes: if the condition is met, the robot aggregates its neighbors' information and updates its own parameters; if not, the robot neither updates nor communicates. This prevents a node with a high update rate from aggregating neighbor information many times over and thereby violating the constraints. Correct convergence is ensured while keeping ROS-based asynchronous, efficient communication; the blocking behavior of a synchronous algorithm is avoided while the formation is maintained; the algorithm converges faster; and the real-time performance of the robot system is higher.

Description

Multi-robot formation algorithm based on distributed asynchronous optimization and oriented to ROS platform

Technical Field

The invention belongs to the technical field of automation and computers, and particularly relates to a ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization.

Background

Cooperation and formation among multiple robots are of great practical importance. For example, in military or civilian cooperative tasks, a group of robots encloses an object and keeps its formation while the target moves; likewise, aerial unmanned vehicles can film a target in real time from multiple angles. It is therefore valuable to design a dynamic formation algorithm that adapts to the target to be enclosed and to the robots' own requirements.

The optimization problem itself is not difficult to solve. However, traditional distributed optimization algorithms for constrained problems rely on synchronous communication and are therefore incompatible with the ROS platform, which is currently the mainstream platform for robot systems, offers many advantages, and communicates asynchronously.

The publish/subscribe topic mechanism of the ROS platform is inherently asynchronous. An agent publishes its own information on a topic, and each newly published message overwrites the previous one; it receives its neighbors' information through subscribed topics. The pace of each agent depends entirely on the agent itself, so received information may be ahead of or behind the receiver's own progress and is not guaranteed to be current. If a traditional distributed optimization algorithm is deployed in this asynchronous environment, message delays and unbalanced update rates break the constraints the algorithm was designed to maintain. In experiments, the error between the final converged result and the theoretical optimal solution can reach 50%, so existing optimization algorithms are not suitable for ROS-based platforms.
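The effect of unbalanced updates on a sum-type constraint can be illustrated with a minimal Python sketch. It does not use the update rule of the invention; it uses a generic symmetric averaging step as a stand-in, and the weights, step size, and starting values are hypothetical. With symmetric weights a synchronous round leaves the sum of all states unchanged, whereas letting one agent update more often than the others makes the sum drift.

```python
# Illustrative only: a generic averaging step, NOT the update rule of the patent.
# Symmetric weights make a synchronous round preserve the sum of all states;
# letting one agent update more often than the others breaks that invariant.

WEIGHTS = [[0.0, 1.0, 1.0],
           [1.0, 0.0, 1.0],
           [1.0, 1.0, 0.0]]  # hypothetical symmetric neighbor weights
ETA = 0.05                   # hypothetical step size


def step_for(i, x):
    """Return the new state of agent i computed from the current states x."""
    return x[i] + ETA * sum(WEIGHTS[i][j] * (x[j] - x[i]) for j in range(len(x)))


def synchronous_round(x):
    """All agents update at once from the same snapshot (returns a new list)."""
    return [step_for(i, x) for i in range(len(x))]


def unbalanced_round(x, fast_agent, extra_updates):
    """One synchronous round, then the fast agent updates a few more times alone."""
    x = synchronous_round(x)
    for _ in range(extra_updates):
        x[fast_agent] = step_for(fast_agent, x)
    return x


x0 = [-65.0, 10.0, 100.0]  # hypothetical one-dimensional states
sync = x0
unbalanced = x0
for _ in range(50):
    sync = synchronous_round(sync)
    unbalanced = unbalanced_round(unbalanced, fast_agent=0, extra_updates=2)

print("initial sum:    ", sum(x0))
print("synchronous sum:", round(sum(sync), 6))
print("unbalanced sum: ", round(sum(unbalanced), 6))
```

In this toy setting the synchronous sum stays at its initial value while the unbalanced sum moves away from it, which is the kind of constraint violation the paragraph above describes.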

Disclosure of Invention

To address the problem that traditional algorithms in the prior art are not suitable for the ROS platform, the invention provides a multi-robot formation algorithm for the ROS platform based on distributed asynchronous optimization. Each robot first determines a formation according to the target to be enclosed and establishes a distributed optimization model. It then checks an event-trigger condition by comparing whether its current update progress lags behind that of the other nodes: if the condition is met, the robot aggregates its neighbors' information and updates its own parameters; if not, the robot neither updates nor communicates. This prevents a node with a high update rate from aggregating neighbor information many times over and thereby violating the constraints. Correct convergence is ensured while keeping ROS-based asynchronous, efficient communication; the blocking behavior of a synchronous algorithm is avoided while the formation is maintained; the algorithm converges faster; and the real-time performance of the robot system is higher. To achieve this purpose, the invention adopts the following technical solution: a ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization, applied to a system of n robots indexed i = 1, 2, 3, ..., n, comprising the following steps:

S1: determining a formation according to the position of the target to be enclosed and establishing a distributed optimization model, wherein the objective function of the distributed optimization model minimizes the distance from the robot position $x_i$ to the optimal observation position $s_i$, i.e.:

$\min f_i(x_i) = \|x_i - s_i\|^2$

wherein $x_i \in \mathbb{R}^2$ is the position coordinate of robot i, and $s_i \in \mathbb{R}^2$ is the optimal observation position of robot i;

the constraints of the distributed optimization model at least comprise: robot i must lie within its safe region $\Omega_i$ and its observation region $S_i$, and the center of the positions of all robots must coincide with the target to be enclosed, i.e.:

$\text{s.t.}\quad x_i \in \Omega_i$

$x_i \in S_i$

$\frac{1}{n}\sum_{i=1}^{n} x_i = d$

wherein $d \in \mathbb{R}^2$ represents the position of the target to be enclosed;

S2: determining whether the event-trigger condition is met by comparing the current update progress of the robot with that of the other nodes: if robot i has updated more times than the other nodes, robot i pauses and waits for the other agents; when the event-trigger condition is satisfied, robot i performs a local update and proceeds to step S3; the event-trigger condition of robot i is as follows:

wherein $k_i$ is the number of local updates of robot i; $k_j$ is the number of local updates of robot j; $\tau$ is the event-trigger condition parameter; $N_i$ is the neighbor set of robot i;

S3: performing a local update once the event-trigger condition of robot i is satisfied, according to the following formulas:

$k_i = k_i + 1$

where $\eta$ is the step size; $a_{ij}$ is the weight of neighbor j; $\nabla f_i(x_i)$ is the gradient; $v_i$ is the Lagrange multiplier; $I_2$ is the 2 × 2 identity matrix; $\otimes$ is the Kronecker product; and $A = [a_{ij}]_{m \times m}$ is the adjacency matrix of the robot system, in which, for $i \neq j$, $a_{ij} > 0$ if and only if robot i sends information to robot j;

S4: determining whether a termination condition is met; if so, terminating the algorithm and outputting the final position $x_i$; if not, returning to step S3 and repeating the above steps, wherein the termination condition is either an upper limit on the number of iterations or an error tolerance.

Compared with the prior art, this scheme compensates for the shortcomings of ROS topic-based asynchronous communication. An event-trigger condition is added to the synchronous algorithm, turning it into an asynchronous algorithm, and the error between the final converged result and the theoretical optimal solution is reduced to 1%. The robots are thus freed from centralized control: each robot runs an independent program under truly distributed control and still converges to the optimal solution in an asynchronous communication environment. The scheme ensures correct convergence while keeping ROS-based asynchronous, efficient communication, avoids the blocking behavior of a synchronous algorithm while maintaining the formation, converges faster, and improves the real-time performance of the robot system.

Drawings

FIG. 1 is a flow chart of the steps of the algorithm of the present invention;

FIG. 2 shows the final result of unmanned aerial vehicles enclosing a ground trolley on the ROS platform in a test example of the invention;

FIG. 3 plots the sum of the robot position coordinates (i.e., the equality constraint) against the number of iterations for the different algorithms in a test example of the invention.

Detailed Description

The present invention will be further illustrated with reference to the accompanying drawings and specific embodiments, which are to be understood as merely illustrative of the invention and not as limiting the scope of the invention.

Example 1

In the ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization, each robot runs its own program independently and communicates asynchronously. For a system of n robots, the steps carried out by each robot i = 1, 2, 3, ..., n are shown in FIG. 1.

S1, determining a formation according to the target to be enclosed and establishing the model, i.e., the local loss function, the constraints, and the virtual initial position, as follows:

The objective function and constraints of the i-th robot are

$\min f_i(x_i) = \|x_i - s_i\|^2$ (1)

$\text{s.t.}\quad x_i \in \Omega_i$ (2)

$x_i \in S_i$ (3)

where $x_i \in \mathbb{R}^2$ is the position coordinate of robot i, which is also the parameter to be optimized, and $s_i \in \mathbb{R}^2$ is the optimal observation position of robot i. Objective function (1) minimizes the distance from the robot position $x_i$ to the optimal observation position $s_i$.
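Objective function (1) and its gradient, $2(x_i - s_i)$, can be written out in a few lines of Python. This is a minimal sketch: the function names are hypothetical, the step shown is a plain unconstrained gradient step rather than the update of the invention, and the numeric values are borrowed from robot 1 of the test example only for illustration.

```python
def objective(x, s):
    """f_i(x_i) = ||x_i - s_i||^2 for 2-D points given as (x, y) tuples."""
    return (x[0] - s[0]) ** 2 + (x[1] - s[1]) ** 2


def gradient(x, s):
    """Analytic gradient of the objective: 2 * (x_i - s_i)."""
    return (2.0 * (x[0] - s[0]), 2.0 * (x[1] - s[1]))


def descent_step(x, s, eta):
    """One plain gradient-descent step; the constraints are ignored here."""
    g = gradient(x, s)
    return (x[0] - eta * g[0], x[1] - eta * g[1])


x = (-65.0, -30.0)  # starting point (illustrative)
s = (-20.0, -30.0)  # optimal observation position (illustrative)
for _ in range(3):
    x = descent_step(x, s, eta=0.05)
    print(x, objective(x, s))
```

Each step moves the position toward the observation position and lowers the objective, which is the behavior the constrained update has to preserve while also respecting constraints (2) and (3).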

Constraint (2) indicates that robot i must lie within its safe region $\Omega_i = \{x_i \in \mathbb{R}^2 : |x_i - z_i| \le r_1\}$, where $z_i \in \mathbb{R}^2$ is the center of the safe region of robot i and $r_1$ is the distance from the center of the safe region $\Omega_i$ to any of its boundaries, i.e., half the side length of the square region. Constraint (3) indicates that robot i must lie within its observation region $S_i = \{x_i \in \mathbb{R}^2 : |x_i - s_i| < r_2\}$, where $r_2$ is the distance from the center of the observation region $S_i$ to any of its boundaries, i.e., half the side length of the square region. In both sets the inequality is taken componentwise, which is what makes the regions square. To satisfy both constraints, robot i must lie within the intersection of the two square regions $\Omega_i$ and $S_i$, which is obtained by solving the inequalities:

wherein,

constraints (2) and (3) can be converted into the linear inequality constraints (6) and (7). Constraint (4) indicates that the center of the positions of all robots coincides with the target to be enclosed, where $d \in \mathbb{R}^2$ denotes the position of the target to be enclosed. The final problem can thus be written as the following constrained optimization problem:

$\min f_i(x_i) = \|x_i - s_i\|^2$ (5)
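The conversion of the two square-region constraints into linear inequalities can be sketched as follows. The stacked form used here, $M x_i \ge b_i$ with $M$ made of an identity block and a negated identity block, is an assumption chosen for illustration and is not confirmed by the text; the function names are hypothetical, and the strict bound of the observation region is treated as a closed one. The numbers are those of robot 1 in the test example.

```python
def box_intersection(z, r1, s, r2):
    """Intersect the square around z (half side r1) with the square around s (half side r2)."""
    lower = tuple(max(zc - r1, sc - r2) for zc, sc in zip(z, s))
    upper = tuple(min(zc + r1, sc + r2) for zc, sc in zip(z, s))
    if any(lo > hi for lo, hi in zip(lower, upper)):
        raise ValueError("the two squares do not intersect")
    return lower, upper


# Assumed stacking: the first two rows encode x >= lower, the last two encode -x >= -upper.
M = [[1.0, 0.0],
     [0.0, 1.0],
     [-1.0, 0.0],
     [0.0, -1.0]]


def linear_constraint(lower, upper):
    """Return (M, b) such that M x >= b describes the box [lower, upper]."""
    b = [lower[0], lower[1], -upper[0], -upper[1]]
    return M, b


def satisfies(matrix, b, x):
    """Check every row of M x >= b."""
    return all(row[0] * x[0] + row[1] * x[1] >= bi for row, bi in zip(matrix, b))


lower, upper = box_intersection((-65.0, -30.0), 30.0, (-20.0, -30.0), 20.0)
matrix, b = linear_constraint(lower, upper)
print(lower, upper, b)
print(satisfies(matrix, b, (-37.0, -30.0)))  # a point inside the intersection
print(satisfies(matrix, b, (-20.0, -30.0)))  # the observation center, outside the safe region
```

A four-row constraint of this kind would pair naturally with a four-component multiplier per robot, but that pairing is an inference from the dimensions and not a statement of the source.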

S2, determining whether the event-trigger condition is met by comparing the current update progress of the robot with that of the other nodes. The event-trigger condition of robot i is as follows:

where $k_i$ is the number of local updates of robot i. When the event-trigger condition is satisfied, robot i performs a local update. If $\tau = 0$, robot i is triggered only when it is the robot with the fewest updates among its current neighbors. The idea behind the event trigger is as follows: if robot i has updated more times than the other robots, it has aggregated the other robots' information many times over; this repeated aggregation shifts the mean position of the robot system toward the other robots, so that the constraint is no longer satisfied. Therefore, if robot i has updated more times than the other agents, it must pause and wait for them.
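A check of this kind can be sketched in Python. The exact inequality is not reproduced in the text, so the form below, $k_i \le \min_{j \in N_i} k_j + \tau$, is an assumption chosen to match the description (with $\tau = 0$ the robot fires only when no neighbor has fewer updates); the function name and the dictionary layout are hypothetical.

```python
def event_triggered(k_i, neighbor_counts, tau):
    """Assumed trigger rule: robot i may update when k_i <= min_j k_j + tau.

    neighbor_counts maps neighbor id -> last known update count k_j.
    A robot without neighbors is never held back.
    """
    if not neighbor_counts:
        return True
    return k_i <= min(neighbor_counts.values()) + tau


known = {2: 3, 3: 5}                       # last counts heard from neighbors 2 and 3
print(event_triggered(3, known, tau=0))    # tied with the slowest neighbor
print(event_triggered(4, known, tau=0))    # one update ahead: must wait
print(event_triggered(4, known, tau=1))    # tolerated when tau = 1
```

Because the counts come from the most recent messages, a real deployment would evaluate this rule on possibly stale values; the sketch does not model that.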

S3, performing a local update once the event-trigger condition of robot i is satisfied

Initialize the parameters of robot i: $x_i = z_i$, $k_i = 0$, $v_i = (0, 0, 0, 0)$, $M_i$ (1, -1), such that the following is satisfied

When the event-trigger condition is satisfied, the update is performed according to the following formulas:

$k_i = k_i + 1$ (12)

where $I_2$ is the 2 × 2 identity matrix, $(x)^+ = \max\{0, x\}$, and $v_i$ is the Lagrange multiplier. When the event-trigger condition is met, robot i computes the gradient of the objective function and a feasible gradient direction and moves in the negative direction of that feasible gradient direction; it then collects its neighbors' information and moves in the positive direction of the neighbors' feasible gradients; finally it updates its own Lagrange multiplier. The neighbors' information is stored in a buffer (old information is overwritten when new information arrives).
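The overwrite-on-arrival buffer mentioned above can be sketched as a small Python data structure. The class names, field names, and message layout are hypothetical; in a ROS node the `on_message` method would play the role of a subscriber callback, which is not shown here.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple


@dataclass
class NeighborMessage:
    sender: int                 # id of the neighbor that published the message
    update_count: int           # the neighbor's local update count k_j
    payload: Tuple[float, ...]  # whatever quantity the neighbor shares (layout assumed)


@dataclass
class NeighborBuffer:
    latest: Dict[int, NeighborMessage] = field(default_factory=dict)

    def on_message(self, msg: NeighborMessage) -> None:
        """Keep only the most recent message per neighbor; older ones are overwritten."""
        self.latest[msg.sender] = msg

    def update_counts(self) -> Dict[int, int]:
        """Last known update count of every neighbor heard from so far."""
        return {sender: m.update_count for sender, m in self.latest.items()}


buffer = NeighborBuffer()
buffer.on_message(NeighborMessage(sender=2, update_count=1, payload=(0.0, 0.0)))
buffer.on_message(NeighborMessage(sender=2, update_count=3, payload=(1.0, 2.0)))
buffer.on_message(NeighborMessage(sender=3, update_count=2, payload=(5.0, 5.0)))
print(buffer.update_counts())
```

Keeping one slot per neighbor mirrors the way a newly published message replaces the previous one, so the robot always works with the latest information it has received, however old that may be.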

S4, determining whether a termination condition is met. The termination conditions are an upper limit on the number of iterations and an error tolerance; if either is met, the algorithm terminates and outputs the final position $x_i$; if not, it returns to the third step.

Test example:

Taking multiple robots enclosing a ground trolley as an example, the ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization proceeds as follows:

In the first step, the parameters are initialized and the robot model is established. In the experiment, the coordinates of the target to be enclosed are $d = (15, -15)$, three robots are used ($n = 3$), and the target is finally enclosed in a triangular formation.

Virtual initial positions of the robots: $z_1 = (-65, -30)$, $z_2 = (10, 45)$, $z_3 = (100, -60)$.

Optimal observation positions of the robots: $s_1 = (-20, -30)$, $s_2 = (10, 5)$, $s_3 = (55, -50)$, with $r_1 = 30$ and $r_2 = 20$.

Adjacency matrix of the robot system: $A = [a_{ij}]_{3 \times 3}$

The other model parameters are initialized as $k_1 = k_2 = k_3 = 0$, $\eta = 0.05$, $\tau = 1$, $v_1 = v_2 = v_3 = (0, 0, 0, 0)$.

From the above conditions and formulas

it is calculated that

For the i-th robot, the objective function is $f_i(x_i) = \|x_i - s_i\|^2$, with the constraints
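For reference, the optimum of this small test problem can be computed centrally. The sketch below is not the distributed algorithm of the invention; it is a swapped-in centralized solve under stated assumptions: the regions are componentwise boxes, the strict bound of the observation region is treated as closed, and the coupling constraint is that the coordinate sums equal $3d$. Under those assumptions the problem separates per coordinate, and each coordinate is solved by bisection on a common shift. All function names are hypothetical.

```python
Z = [(-65.0, -30.0), (10.0, 45.0), (100.0, -60.0)]  # safe-region centers z_i
S = [(-20.0, -30.0), (10.0, 5.0), (55.0, -50.0)]    # optimal observation positions s_i
R1, R2 = 30.0, 20.0                                 # half side lengths
D = (15.0, -15.0)                                   # target to be enclosed


def clip(value, lo, hi):
    return max(lo, min(hi, value))


def solve_coordinate(s, lower, upper, target_sum, iterations=200):
    """Minimize sum (x_i - s_i)^2 with lower_i <= x_i <= upper_i and sum x_i = target_sum.

    The optimality conditions give x_i = clip(s_i + t, lower_i, upper_i) for one
    shared shift t; the clipped sum is nondecreasing in t, so t is found by bisection.
    """
    if not sum(lower) <= target_sum <= sum(upper):
        raise ValueError("coupling constraint is infeasible for these boxes")
    t_lo, t_hi = -1e4, 1e4
    for _ in range(iterations):
        t = 0.5 * (t_lo + t_hi)
        total = sum(clip(si + t, lo, hi) for si, lo, hi in zip(s, lower, upper))
        if total < target_sum:
            t_lo = t
        else:
            t_hi = t
    t = 0.5 * (t_lo + t_hi)
    return [clip(si + t, lo, hi) for si, lo, hi in zip(s, lower, upper)]


def solve():
    coords = []
    for axis in (0, 1):
        lower = [max(z[axis] - R1, s[axis] - R2) for z, s in zip(Z, S)]
        upper = [min(z[axis] + R1, s[axis] + R2) for z, s in zip(Z, S)]
        centers = [s[axis] for s in S]
        coords.append(solve_coordinate(centers, lower, upper, len(Z) * D[axis]))
    return list(zip(coords[0], coords[1]))


solution = solve()
for i, point in enumerate(solution, start=1):
    print(f"x_{i} = ({point[0]:.3f}, {point[1]:.3f})")
```

Under these assumptions the sketch returns approximately (-35, -20), (10, 15), and (70, -40), whose coordinate sums are 45 and -45, i.e. three times the target position. These values follow from the stated assumptions only and are not results reported in the source.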

In the second step, whether the event-trigger condition is met is determined by comparing the current update progress of the robot with that of the other nodes, as shown in the following formula:

where $k_i$, $i = 1, 2, 3$, is the number of updates of robot i, i.e., the number of times the event-trigger condition of robot i has been met. If robot i has updated more times than the other robots, it has aggregated the other robots' information many times over; this repeated aggregation may shift the mean position of the robot system toward the other robots, so that the constraint is no longer satisfied.

$\tau$ is the event-trigger condition parameter. A larger $\tau$ means the trigger condition tolerates more delay: robot i is triggered more easily, updates more frequently, and converges faster, but the higher delay tolerance also makes the error of the converged result larger.

Therefore, $\tau$ can be increased if convergence speed is the priority and decreased if convergence accuracy is the priority, with $\tau \in \mathbb{N}$. The user can adjust $\tau$ to trade accuracy against speed.
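How $\tau$ limits the imbalance between fast and slow robots can be explored with a small simulation. This is a sketch under assumptions: the trigger rule is taken as $k_i \le \min_j k_j + \tau$ (the exact formula is not reproduced in the text), every robot is a neighbor of every other, each robot sees the others' true current counts with no message delay, and the wake-up probabilities are hypothetical stand-ins for different processing speeds.

```python
import random


def simulate(tau, wake_prob, ticks, seed=0):
    """Simulate update counts of agents that wake at different rates.

    An agent that wakes performs an update only if the assumed trigger rule
    k_i <= min_j k_j + tau holds; otherwise it waits for the others.
    Returns the final counts and the largest spread max(k) - min(k) observed.
    """
    rng = random.Random(seed)
    counts = [0] * len(wake_prob)
    worst_gap = 0
    for _ in range(ticks):
        for i, p in enumerate(wake_prob):
            if rng.random() >= p:
                continue  # agent i is busy or idle this tick
            others = [k for j, k in enumerate(counts) if j != i]
            if counts[i] <= min(others) + tau:
                counts[i] += 1
                worst_gap = max(worst_gap, max(counts) - min(counts))
    return counts, worst_gap


for tau in (0, 1, 3):
    counts, gap = simulate(tau, wake_prob=[0.9, 0.5, 0.2], ticks=500, seed=1)
    print(f"tau={tau}: counts={counts}, largest spread={gap}")
```

In this idealized setting the spread of update counts never exceeds $\tau + 1$, so a small $\tau$ keeps the robots in near lockstep while a larger $\tau$ lets fast robots run further ahead. With delayed or stale counts, as on a real ROS network, the spread could be larger than this bound.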

In the third step, for the i-th robot, if the event-trigger condition is not met, return to the second step; if it is met, the agent is updated as follows:

a. Update of the robot position

where $\eta$ is the step size, $a_{ij}$ is the weight of neighbor j, $\nabla f_i(x_i)$ is the gradient, and $v_i$ is the Lagrange multiplier.

When the event-trigger condition is met, robot i computes the gradient of the objective function and a feasible gradient direction and moves in the negative direction of that feasible gradient direction; it then collects its neighbors' information and moves in the positive direction of the neighbors' feasible gradients, so that the center of the robot system stays at the target to be enclosed.

b. Update of the robot's Lagrange multiplier

c. Finally, update of the robot's own update count: $k_i = k_i + 1$.

After the local update is finished, the robot sends its update count $k_i$ and its local direction to its neighbors.

In the fourth step, whether a termination condition is met is determined. The termination conditions are an upper limit on the number of iterations and an error tolerance; if either is met, the algorithm terminates and outputs the final position $x_i$; if not, it returns to the third step.

FIG. 2 shows the final result of the unmanned aerial vehicles enclosing the ground trolley on the ROS platform. The stars are the initial positions of the unmanned aerial vehicles, and the solid circles are their final positions, i.e., the final output of the algorithm. The solid-line hollow circles are the safe regions of the unmanned aerial vehicles, and the dashed hollow circles are their observation regions. The "×" marks the target to be enclosed, and $s_1$, $s_2$, $s_3$ are the observation centers. FIG. 2 thus illustrates the model and the entire optimization process.

In the test example, three algorithms were tested: the synchronous algorithm (syn), the asynchronous algorithm (asy), and the algorithm of this scheme with the added event-trigger mechanism (asy-dc). FIG. 3 plots the resulting sum of the robot position coordinates (i.e., the equality constraint) against the number of iterations. The synchronous algorithm keeps the sum at $x_{y1} + x_{y2} + x_{y3} = 3 \times d_y = -45$, which equals the ideal coordinate sum of -45. For ROS-based asynchronous communication without the event-trigger mechanism, the final sum is $x_{y1} + x_{y2} + x_{y3} = -71.2$, an error of up to 60%. For the asynchronous algorithm with the event-trigger compensation mechanism provided by the invention, the final sum is $x_{y1} + x_{y2} + x_{y3} = -46.1$, and the error is reduced to 2%. Therefore, while keeping ROS-based asynchronous, efficient communication, the algorithm ensures correct convergence, avoids the blocking behavior of a synchronous algorithm while maintaining the formation, converges faster, and improves the real-time performance of the robot system.
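The error figures can be reproduced as relative deviations of the final coordinate sum from the ideal value $3 \times d_y = -45$. The helper name below is hypothetical, and the error definition (absolute deviation divided by the magnitude of the ideal sum) is an assumption about how the percentages were obtained.

```python
def relative_error(measured, ideal):
    """Absolute deviation from the ideal value, relative to its magnitude."""
    return abs(measured - ideal) / abs(ideal)


ideal_sum = 3 * -15.0  # 3 * d_y
for label, final_sum in (("asy", -71.2), ("asy-dc", -46.1)):
    print(f"{label}: {relative_error(final_sum, ideal_sum):.1%}")
```

With this definition the two cases come out at about 58.2% and 2.4%, in line with the rounded figures of 60% and 2% quoted above.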

It should be noted that the above only illustrates the technical idea of the present invention and does not limit its scope of protection. Those skilled in the art can make various improvements and refinements without departing from the principle of the present invention, and such improvements and refinements fall within the scope of protection of the claims of the present invention.

Claims

1. A ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization, applied to a system of n robots indexed i = 1, 2, 3, ..., n, characterized by comprising the following steps:

S1: determining a formation according to the position of the target to be enclosed and establishing a distributed optimization model, wherein the objective function of the distributed optimization model minimizes the distance from the robot position $x_i$ to the optimal observation position $s_i$, i.e.:

$\min f_i(x_i) = \|x_i - s_i\|^2$

wherein $x_i \in \mathbb{R}^2$ is the position coordinate of robot i, and $s_i \in \mathbb{R}^2$ is the optimal observation position of robot i;

$\text{s.t.}\quad x_i \in \Omega_i$

$x_i \in S_i$

wherein $d \in \mathbb{R}^2$ represents the position of the target to be enclosed;

wherein $k_i$ is the number of local updates of robot i; $k_j$ is the number of local updates of robot j; $\tau$ is the event-trigger condition parameter; $N_i$ is the neighbor set of robot i;

$k_i = k_i + 1$

where $\eta$ is the step size and $a_{ij}$ is the weight of neighbor j,

2. The ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization of claim 1, wherein: in step S1, robot i must lie within the safe region $\Omega_i$ and the observation region $S_i$,

$\Omega_i = \{x_i \in \mathbb{R}^2 : |x_i - z_i| \le r_1\}$,

$S_i = \{x_i \in \mathbb{R}^2 : |x_i - s_i| < r_2\}$

wherein $z_i \in \mathbb{R}^2$ is the center of the safe region of robot i; $r_1$ is the distance from the center of the safe region $\Omega_i$ to any of its boundaries, i.e., half the side length of the square region; $r_2$ is the distance from the center of the observation region $S_i$ to any of its boundaries, i.e., half the side length of the square region.

3. The ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization of claim 2, wherein: in step S1, robot i must lie within the intersection of the two square regions $\Omega_i$ and $S_i$, i.e.:

s.t

x _i ≥b _i ²

wherein,

4. The ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization of claim 2 or 3, characterized in that: in step S2, the convergence speed or accuracy of the model can be adjusted by changing the value of $\tau$, with $\tau \in \mathbb{N}$; $\tau$ can be increased if convergence speed is the priority and decreased if convergence accuracy is the priority.

5. The ROS platform-oriented multi-robot formation algorithm based on distributed asynchronous optimization of claim 4, wherein: in step S3, when the event-trigger condition is satisfied, robot i computes the gradient of the objective function and a feasible gradient direction and moves in the negative direction of that feasible gradient direction, and then collects its neighbors' information and moves in the positive direction of the neighbors' feasible gradients, so that the center of the robot system stays at the target to be enclosed.