CN111340348B

CN111340348B - Distributed multi-agent task cooperation method based on linear time sequence logic

Info

Publication number: CN111340348B
Application number: CN202010108021.0A
Authority: CN
Inventors: 方浩; 田戴荧; 陈杰; 杨庆凯; 曾宪琳; 尉越; 陈仲瑶
Original assignee: Beijing Institute of Technology BIT
Current assignee: Beijing Institute of Technology BIT
Priority date: 2020-02-21
Filing date: 2020-02-21
Publication date: 2022-07-26
Anticipated expiration: 2040-02-21
Also published as: CN111340348A

Abstract

The invention discloses a distributed multi-agent task cooperation method based on linear time sequence logic, which solves the problem of multi-agent task decoupling. Each intelligent agent constructs a self decoupling product type Buchi automatic machine by detecting the coupling edge, and constructs a self action sequence by the decoupling product type Buchi automatic machine; the end points of the coupling edges correspond to actions needing cooperation; when each intelligent agent independently executes the self action sequence by utilizing a decoupling product type Buchi automaton, judging whether the currently executed action and the corresponding trigger condition are in the coupling set, if so, judging that the currently executed action is the action needing cooperation, and requesting the cooperation intelligent agents to cooperate to make the action; when an agent fails, the agent responsible for inheritance inheriting the task of the failed agent is elected.

Description

Distributed multi-agent task cooperation method based on linear time sequence logic

Technical Field

The invention belongs to the field of artificial intelligence, and particularly relates to a distributed multi-agent task cooperation method based on linear time sequence logic.

Background

Linear Temporal Logic (Linear Temporal Logic) is a technical field with research prospects in the field of artificial intelligence at present. The linear sequential logic can model a series of complex tasks with sequential relationships in a programming language, and transfer the language through a graph model, so that the complex sequential relationships can be constrained through a series of conditional transfers. The method is applied in a wide range of fields due to its completeness. In the field of multi-agent, the multi-agent system has the characteristics of complex tasks and coupling in time sequence relation, so that some traditional methods are difficult to model, and linear time sequence logic can well model the complex tasks of the multi-agent system.

For the time sequence logic planning task in the multi-agent system, the existing solutions are as follows:

scheme 1: a centralized method for performing task planning in urban environment by applying linear sequential logic is proposed in literature (Yuhan Chen, Xu Chu Ding, all Stefanscu, and Calin Belta.A. formal approach to deployment of robust robots of in an urea like environment. in International Symposium on Distributed Autonomous robots System,2013.) each robot models the urban environment as a finite state transfer System and transfers its own tasks into corresponding Huchi robots to construct a product formula, Buchi. And then, each robot multiplies the product Buchi automata of all the robots, and an optimal path is planned for each robot through the finally obtained product automata of all the robots.

Scheme 2: the literature (Meng Guo and Dimos V diagnostics. Multi-agent plant configuration under 34(2):218, 235,2015.) constructs a hierarchical hybrid decision-control architecture for distributed multi-agent systems, which requires that each agent is assigned a linear timing logic formula as a task, and the agents cooperate with each other via a request-response communication model.

Scheme 3: document (Meng Guo and Dimos V dimalogonas. task and motion coordination for heterologous multiple systems with local coordination. IEEE Transactions on Automation Science and Engineering,14(2): 797-808, 2017.) this document distributively constructs an action sequence for each robot by predefining the coupling dependencies between the robot tasks in combination with the Dijkstra algorithm and guarantees synchronization of the cooperative actions by a request-response-confirmation communication model.

The prior art scheme is researched in the aspects of multi-agent task decoupling, information transmission during cooperation among multiple machines and processing strategy during node failure of a cluster part, but the effect is not ideal.

Disclosure of Invention

In view of this, the invention provides a distributed multi-agent task cooperation method based on linear sequential logic, which solves the problem of multi-agent task decoupling.

Furthermore, when part of the nodes in the cluster fail, other robots can react to the failure of the part of the robots, so that the cluster has robustness to the failure of the part of the nodes.

In order to solve the technical problem, the invention is realized as follows:

a distributed multi-agent task cooperation method based on linear time sequence logic comprises the following steps:

each intelligent agent constructs a self decoupling product type Buchi automaton and constructs a self action sequence through the decoupling product type Buchi automaton; the decoupling product type Guchi automaton has the following construction mode: each agent adopts a finite state transfer system for describing its working environment

And Buchi automaton describing its own tasks

Constructing a product-type Buchi automaton PBA for guiding task execution, which is expressed as

Constructing a LGBA of a B ü chi automaton of a generalized label, detecting a coupling edge in the LGBA, determining the coupling edge in the PBA according to the projection relation of the LBGA and the PBA, and recording the coupling edge and the trigger condition thereof into a coupling set; the end points of the coupling edges correspond to actions needing cooperation;

when each agent independently executes the action sequence of the agent by using a decoupling product type Buchi automaton, judging whether the currently executed action and the corresponding trigger condition are in the coupling set, if so, judging that the currently executed action is an action needing cooperation; at the moment, the action and the cooperation position which need to be cooperated are broadcasted to other intelligent agents, and the intelligent agent responsible for response makes a cooperation action;

when there is an agent p _i When the intelligent agent fails, the failure event is broadcasted to other intelligent agents; enumerating Agents p responsible for inheritance _j Inheriting a failed agent p _i The task of (2).

Preferably, the recording process of the coupling set specifically includes:

marking nodes of the coupling tasks in the LGBA, and calling the nodes as coupling task nodes; marking child nodes of the coupling task nodes as the coupling task nodes; all edges containing coupled task nodes are marked as coupled edges (q) _i ,q _j ) (ii) a Will couple the edges (q) _i ,q _j ) Corresponding label lambda _ij Set to true; the coupling task refers to a task completion state of a certain intelligent agent, and needs certain actions of other intelligent agents as conditions;

in building PBA, when an edge (q) that does not meet the branch condition occurs _s ,q _g ) Instead of deleting edges directly, the corresponding edge (q) in the LGBA is found according to the projection relationship of LGBA and PBA _i ,q _j ) Judging the found edge (q) _i ,q _j ) Tag lambda of _ij Whether true; if true, the corresponding edge (q) in PBA is set _s ,q _g ) Determining as a coupled edge, and coupling the coupled edge (q) _s ,q _g ) And trigger condition g thereof _sg Add to the coupling set of the PBA.

Preferably, the election agent p responsible for inheritance _j Inheriting failed Agents p _i The task of (1) is as follows:

all agents p receiving the broadcast of the failure event and having the same functionality as the failed agent _j Constructing a united intelligent agent model; the united intelligent agent model is constructed as follows: constructing Agents p _j Completing agent p _i PBA of the task of (1), noted

Establishing

To

The required inheritance task between the initial action nodes A is a jump action sequence; adding the jump action sequence to

Obtaining a combined agent model; wherein, the first and the second end of the pipe are connected with each other,

for failed agent p _i PBA of (3);

selecting an agent inheriting a task according to the cooperation cost indicated by the combined agent model; the agent inheriting the task executes the action sequence according to the joint agent model of the agent, and after the task is completed, the original decoupling product type Buchi automaton is switched back.

Preferably, the building of the united intelligent agent model comprises the following specific steps:

step b1, agent p _j By describing their working environment

And describes an agent p _i Of tasks

Construction of a decoupled product-based Guchi automaton

Step b2, assuming a failed agent p _i Is currently in motion

The node in (1) is

Slave node

Starting to backtrack to a father node and searching for a forward node, wherein the backtrack reaches the initial node of the current task

And the process is finished,

to

All the nodes form a set Q;

step b3, finding description failure agent p _i Working environment

Neutralization of

Contiguous adjacency points, denoted as point sets

Find out

Neutralization of

Contiguous adjacency points, denoted as point sets

Will point set

All ambient tags in (1) _i And with

Combined to form a plurality of new nodes

Form set Y1; set points

All ambient tags in (1) _j Combining with each node in Q to form a set Y2; will be provided with

Node direction shown in the middle set Y2

The nodes shown in the middle set Y1 are connected, and then the path searching operation is carried out to obtain the path information

The combined agent model of the jump action sequence is added.

Preferably, the agent for election inheritance task is: and calculating the cost of the jump action sequence according to the joint agent model, and selecting the agent with the minimum cost as the agent for inheriting the task.

Preferably, the broadcasting the action and the cooperation position needing cooperation to other agents and the broadcasting the failure event to other agents are realized by adopting a gossip information transmission protocol:

establishing a Request message including ^k Reply message Reply ^k Acknowledgement message Confirm ^k Informing message Warning ^k The message group of (1);

when an agent performs an action requiring collaboration, a Request is issued to all other agents ^k A message; receiving the Request ^k The agent of the message calculates the cost of responding to the message and passes the cost through Reply ^k Informing other agents of completing Request through gossip protocol ^k Iteration of the message is carried out, and an agent with the minimum cost is found; finally passing through Confirm ^k End of message to Request ^k Responding to the message;

Warning ^k the information is used for the disabled agent to seek the inheritance task to other agents; sending Warning to other agents when an agent fails ^k Information; receiving Warning ^k The agent of the message firstly constructs a joint agent model, calculates the cost of inheriting the task to reach the task starting point according to the joint agent model, and passes the cost through Reply ^k Informing other agents of completing Request through gossip protocol ^k Iteration of the message is carried out, and the agent with the minimum cost is found to be used as the inheritance agent; finally passing through Confirm ^k End of message pair Warning ^k Responding to the message;

Request ^k message, Reply ^k Message, Confirm ^k Messages, Warning ^k The messages are all provided with version numbers so that the Request ^k Message and Warning ^k The message can realize the whole network broadcast and the election of the responder through the gossip information transmission protocol.

Preferably, the message group is:

wherein k is an agent identification number; sigma _d An action requiring collaboration; pi _h A location requiring cooperation; t is a unit of _m Estimated time to reach a position needing cooperation for the requesting agent;

indicating whether agent k determined to respond to the other agent's request information,

responding to a Request for agent k ^k The time that the message is expected to take,

indicating whether the requested action has been confirmed to be completed,

predicting a time for the responding agent to pass through the collaboration zone; a. the ^k In order to disable the capabilities of the agent,

the message sequence numbers of the messages are respectively, the sequence numbers are version numbers when the messages are generated, and the sequence numbers symbolize the sequence of the message generation.

Has the beneficial effects that:

(1) decoupling of the coupling tasks. When the Buchi automaton for guiding the execution of the tasks is constructed, the LGBA is used for detecting the coupling edge, and the coupling edge is added into each local decoupling product Buchi automaton, so that the process of action planning can be distributed, and the planning time complexity is reduced.

(2) And (4) designing a combined robot model. By backtracking the current state of the failure node and constructing a combined robot model, other robots can complete the succession of tasks on the premise of not destroying the time sequence relation of the tasks, and the cluster can continue to complete established tasks under the condition that partial nodes fail. The invention can greatly reduce the computational complexity of the process and improve the capability of the cluster for dealing with partial node failure.

(3) The combination of the communication model and gossip. In order to ensure the synchronization of the cooperative action, a request-response-confirmation model in the literature is combined with the gossip protocol, and the gossip protocol suitable for the scene is designed, so that the speed of consistent convergence of multiple robots is increased.

Drawings

FIG. 1 is a schematic diagram of a task collaboration method for multiple robots under normal working conditions;

FIG. 2 is a schematic diagram of a task collaboration method in the case of robot failure.

Detailed Description

The following description will be given taking a robot as an agent.

Referring to fig. 1 and 2, the distributed multi-robot task collaboration method based on linear sequential logic of the present invention includes the following steps:

● under normal working conditions, each robot independently constructs a self-decoupling product type Buchi automaton and constructs a self-action sequence through the automaton. The decoupled product Buchi automaton here is the result of adding a coupling set to the product Buchi automaton. The coupling set records the coupling edge and the trigger condition thereof, and the end point of the coupling edge corresponds to the action needing cooperation.

● when each robot independently executes the local action sequence by using a decoupling product type Buchi automaton, judging whether the currently executed action and the corresponding trigger condition are in the coupling set, if so, the currently executed action is the action needing cooperation; at the moment, the action and the cooperation position which need to be cooperated are broadcasted to other robots, and the robot in charge of response makes a cooperative action. If not, the original operation is continued.

● when a robot fails, the failure event is broadcast to other robots. And selecting the robot in charge of inheritance by the robot receiving the failure event, wherein the robot in charge of inheritance adopts the built combined robot model to inherit the task of the failure robot. In the invention, the broadcasting cooperation event and the failure event to other robots are propagated by utilizing the gossip protocol in the cluster, and iteration is carried out until convergence. In addition, the election operation may be implemented by priority or by judgment that the spatial distance is shortest. The embodiment of the invention provides a judgment mode based on a combined robot model and cooperation cost.

Therefore, the invention establishes a new theoretical framework for constructing the product automata and further deeply considers the specific information transmission process when the cooperation is carried out among multiple machines. Firstly, the scheme distributes the process of action planning by detecting the coupling edge and adding the coupling edge into each local decoupling product Buchi automaton, decouples tasks, and turns the planning process to local, so that the calculation complexity is low. Secondly, by constructing a joint agent model, other agents can react to the failure of part of agents, so that the cluster has robustness to the failure of part of nodes. In addition, by adopting the gossip protocol, the time for the multiple machines to converge to be consistent is short, and the amount of transmitted information is small.

The following describes the construction of a decoupling product type Buchi automaton, the processing of a cooperation action, the task inheritance realization of a failure robot and the design of a communication model based on a gossip protocol in detail.

Construction of (I) coupling task decoupling and decoupling Buchi automaton

(1) Finite state transition system

In order to determine the position transition relationship of the robot in the environment, the environment needs to be modeled first, and in the distributed mission planning framework, the environment is modeled as a Finite state transition system (FTS). FTS is defined as follows:

definition 1. finite state transfer system (FTS) consists of one tuple:

wherein:

Π＝{π ₁ ,π ₂ ,...,π _N representing all areas of the rasterized working environment, wherein the areas are N areas; pi is also known as an environmental label, or state;

representing connections of paths between gridsCommunication relation;

indicating an initial position of the robot;

AP represents an atomic proposition describing a task that is not repartitionable;

L:Π→2 ^AP attributes representing task atom propositions that the grid region has;

representing the cost of transferring between grid regions.

State pi _i Is denoted as Post (pi) _i )＝{π _j ∈Π|π _i → _c π _j }. The sequence of movements of the robot can be represented by a sequence of infinite states, τ ═ pi ₁ π ₂ .., wherein, n _i ∈Post(π _i-1 )。

(2) Non-deterministic Buchi automaton

Forming task expressions by combining atomic propositions AP using Linear Temporal Logic (LTL) language

With respect to each LTL expression

There must be a corresponding automatic B ü chi machine (NBA) which is recorded as

Definition 2.

Defined as the five-membered group:

wherein Q is the set of states in the automaton,

representing the initial set of states in the automaton, 2 ^AP Representing a set of alphabets consisting of task atom propositions, delta: Qx 2 ^AP →2 ^Q Representing the transition relationships between states in an automaton, Q _F Representing a set of acceptable states in the automaton.

(3) Multiplication type Buchi automaton

The product-based Buchi automaton is a combination of FTS and NBA and thus contains both environmental and task state information.

Definition 3. Product Buchi Automaton (PBA) is expressed as

Wherein:

wherein Q ∈ Q ^* Nodes in NBA represent task states.

If and only if (π) _i ,π _j ) E → and q _n ∈δ(q _m ,L(π _i ))；q _n Is a node, representing the action performed, L (π) _i ) Is a trigger condition;

Q ₀ ^* ＝{<π,q>|π∈Π ₀ ,q∈Q ₀ is the initial state set; .

Is an acceptable set;

is a weight function:

W ^* (<π _i ,q _m >,<π _j ,q _n >)＝W(π _i ,π _j )

wherein<π _j ,q _n >∈δ(<π _i ,q _m >)。

The coupling task refers to a task completion state of a certain robot, certain actions of other robots are required as conditions, and in order to distribute a task planning process, the task needs to be decoupled in a process of constructing a product type Buchi automaton. Thus increasing the flag bit τ in PBA, forming

Is denoted by τ denotes

Is not a coupled edge. The set of flag bits is called a coupling set. The coupling edge refers to a transfer relationship related to a coupling task in the PBA of each robot, and generally, such a transfer relationship cannot independently complete conversion on the condition that other robots simultaneously perform corresponding actions.

The detection idea of the coupling edge is as follows: constructing a LGBA of a B ü chi automaton of a generalized label, detecting a coupling edge in the LGBA, determining the coupling edge in the PBA according to the projection relation of the LBGA and the PBA, and recording the coupling edge in the PBA and trigger conditions thereof into a coupling set; the endpoints of the coupled edges correspond to actions that require cooperation.

The coupling edge detection process is specifically realized as follows:

and constructing a broad sense label Buchi automaton LGBA of the task. Here, LGBA may be reconstructed or LGBA formed during PBA construction may be extracted. And marking nodes where the coupling tasks are located in the LGBA, and calling the nodes as coupling task nodes. And traversing the child nodes of the coupling task in the LGBA, and marking all the child nodes of all the coupling task nodes as the coupling task nodes. An edge comprising one or two coupled task nodes is denoted as a coupled edge (q) _i ,q _j ). Where q is the end point of an edge. Will couple the edges (q) _i ,q _j ) Corresponding label lambda _ij Set to true. The label can borrow the label in the LGBA, and can also newly set a label.

In the process of building PBA, when the PBA is paired

And

while performing multiplication, traverse

And detecting whether the environment label of the subsequent node meets the transfer condition or not by each edge in the PBA, and if so, obtaining the edge in the PBA. In the existing processing procedure, when an edge (q) not meeting the transfer condition appears _s ,q _g ) And when the data is needed, the data is directly deleted. And the present invention is directed to these edges (q) _s ,q _g ) Not deleting directly, but finding out the sum (q) in LGBA according to the projection relation of LGBA and PBA _s ,q _g ) Corresponding edge (q) _i ,q _j ) Judging the found edge (q) _i ,q _j ) Tag lambda of _ij Whether true; if true, the corresponding edge (q) in PBA is set _s ,q _g ) Determining as a coupled edge, coupling the edges (q) _s ,q _g ) And trigger condition g thereof _sg Added to the coupling set of the PBA.

In the embodiment of the invention, the flag bit tau is constructed into a two-dimensional array, the rows and the columns respectively correspond to the end points of the edges, and the elements in the array record the trigger conditions. The flag bit τ is equivalent to recording the coupling edge and its trigger condition. Then, when the coupling edge (q) is determined _s ,q _g ) Then, the corresponding trigger condition g can be set _sg Element τ (q) recorded to τ _s ,q _g ) In (1).

In the step, the coupling edge is detected and added into each local decoupling product Buchi automaton, so that the action planning process can be distributed, and the planning time complexity is reduced.

Design of combined robot model

In a complex environment, the robot may have a function damage phenomenon due to its own equipment, for example, three robots D _A ，D _B ，D _C ，D _A Is assigned to D _B Task of opening the door, D _B The goods need to be transported through the door D _C Is assigned the task of cleaning the house, if D is the same _A If a problem occurs, the door cannot be opened, the entire cluster cannot complete the task, and if D _C The robot can be at D _A Inheriting D temporarily when a problem occurs _A The robot cluster can complete the task. In order to solve the problem of failure of part of nodes possibly occurring in a cluster and improve the robustness of the method, the invention designs an integration scheme of a failure robot. Let failed robot be p _i When there is an agent p _i When the intelligent agent fails, the failure event is broadcasted to other intelligent agents; all agents p receiving the broadcast of the invalidation event and having the same functionality as the invalidating agent _j And constructing a combined robot model, and selecting the robot inheritance task with the minimum cooperation cost according to the cooperation cost indicated by the combined robot model, so that the robot can inherit the disabled robot task under the condition of minimum time complexity.

The construction idea of the combined robot model is as follows: construction robot p _j Completing robot p _i PBA of the task of

Establishing

To

Obtaining a combined robot model; wherein the content of the first and second substances,

for a disabled robot p _i PBA of (3).

The construction method of the combined robot model comprises the following specific steps:

step b1, marking the robot for constructing the combined robot model as p _j . Robot p _j By describing their working environment

And describe the disabled robot p _i Of tasks

Constructing a decoupling product type Buchi automaton according to step 1

The build has been completed in a previous step, where no repeat build is needed.

Step b2, assume failed robot p _i Is currently in

The node in (1) is

In that

The method comprises the steps of backtracking from a node to a father node and searching for the previous nodes, wherein the previous nodes are all relevant to the current task. Backtracking until reaching the starting node of the current task

And the process is finished, so that the process is finished,

to

All nodes of (a) constitute a set Q. The Q contains the complete task, the start node

And is also a place where the robot inheriting the task needs to access. Since the starting task is coded into a special identifier, the task starting node can be identified by means of the special identifier.

Step b3, finding description failure robot p _i Working environment

Neutralization of

Contiguous adjacency points, denoted as point sets

Find out

Neutralization of

Contiguous adjacency points, denoted as point sets

The adjacent points are two

The closest point, i.e. the point that can be reached by a one-step jump. Set points

All ambient tags in (1) _i And

combined together by permutation and combination to formMultiple new nodes

Forming set Y1. All points in set Y1 are

In (1). Will point set

All the environmental labels in (1) _j The nodes in Q are combined and arranged to form a set Y2. All points in the set Y2 are

In (1). Will be provided with

Node direction shown in the middle set Y2

The nodes shown in the set Y1 make a connection, the connection having a direction, through which the connection is established

To

While simultaneously adding the jump action sequence to

In (1). And then, performing path searching operation to obtain a combined robot model. The path search operation may employ Dijkstra's algorithm. Through the above calculation, generate the slave

Transferring to

Way in which tasks are inheritedAnd the task timing relation is not violated at the same time.

Step b4, selecting the robot with inherited tasks: the cost of the jump action sequence, i.e. the cost of the robot jumping from the current position to the task initial action node a, which may be for example time, or other information, is calculated according to the joint robot model. And selecting the robot with the minimum cost as the robot for inheriting the task.

Step B5, the robot inheriting the task executes the action sequence according to the self combined robot model, and after the task is finished, the original decoupling product type Buchi automaton is switched back

The combined robot model can enable other robots to complete inheritance of tasks on the premise of not destroying the time sequence relation of the tasks, so that the cluster can continue to complete established tasks under the condition that partial nodes fail. Meanwhile, the computational complexity of the process is greatly reduced, and the capability of the cluster for dealing with partial node failures is improved.

Design of communication model

In the framework, in order to avoid the situation that the robot runs a planned path locally and the collaboration is not synchronous, a request-response-confirmation communication model based on gossip is adopted. (Meng Guo and Dimos V collaborative tasks. task and movement correlation for a heterologous multiagent system with local coordinated tasks. IEEE Transactions on Automation Science and Engineering,14(2):797 and 808,2017.) A request-response-validation model is proposed that guarantees the completion of collaborative tasks during a robot task by requesting, responding and validating three types of messages. However, the model is too ideal in practical application, and the process that information propagates in the cluster to reach convergence is not considered. The invention considers the message transmission in the multi-robot cluster in more detail by combining the communication model and the gossip information transmission protocol, and accelerates the message convergence. At the same time increase Warning ^k A message. Under gossip protocol, the format of each messageComprises the following steps:

wherein, Request ^k For request messages, Reply ^k In response to the message, Confirm ^k To acknowledge messages, Warning ^k To inform the message. k is the number of the robot. Sigma _d Filling the trigger condition of the action for the action needing cooperation; pi _h Filling the environment tags in the positions needing cooperation; t is _m And estimating the time of the robot to reach the position needing cooperation for the requesting robot.

Indicates whether the robot k determines to respond to the Request of other robots ^k The message, the element may be an array, the location of the array corresponds to a different robot, and 1/0 at the location indicates whether the corresponding robot responds to the request.

Responding to Request for local robot k ^k The time that the message is expected to take,

in order to confirm whether the assistance action is completed,

the time for the responding robot to pass through the cooperation area is estimated. Burning ^k Messages are prepared for the combined robot model, sent out for a disabled robot, seeking other robots to inherit their own tasks, A ^k In order to disable the capabilities of the robot, the legacy robot is required to have the same capabilities;

the message sequence numbers of the messages are respectively the version numbers of the messages when the messages are generated, and the sequence numbers symbolize the sequence of the message generation.

When the action needing cooperation is executed, the robot constructs a Request message, and the action sigma needing cooperation is added into the Request message _d And a cooperation position pi _h And calculates the time of the self-arrival at the position needing cooperation. And transmitting the Request message in the cluster by using a gossip protocol, iterating until convergence, and making a robot in charge of response perform a cooperative action.

The iterative process of the Request message is as follows: each robot continuously transmits the Request message received by the robot to the neighbor node; when each node receives a Request message, if the node does not generate a Reply message, estimating an estimated cooperation Cost, namely estimating the estimated time spent by the node in response to the Request message; the Cost is compared with t in other received replies _d By comparison, if Cost is small, it is more appropriate to respond to the request, and all 0's are generated

Updating the corresponding position of the self to be 1, and filling the Cost into the corresponding position

In generating a new Reply message, of

Set to be in all Reply messages

The maximum value of (2) plus 1, and propagation is carried out; if a Reply message is generated by itself and the message is stored in the robot, judging other received Reply messages

Whether the message is larger than the message sent by the Reply message; if received

If the message size is larger than the preset value, updating the content in the Reply message stored by the message by using the content in the received Reply message; if received

And if the value is small, the treatment is not carried out. In the end of this process,

and

is updated as information about the robot responding to the request. When new, less costly collaborators appear, only all are the same

The robot updates the messages synchronously, thereby greatly reducing the communication traffic in the network. When gossip protocol is iterated to all robots

The method for judging whether the consistency is reached is that the messages are continuously received for n times for each robot

If the communication Request does not change, the robot responding to the Request is generated, and the communication iteration process is finished, so that the robot responds.

And then, when the triggering condition of the transfer relationship of the robot proposing the Request message is met, the robot proposing the Request message generates a Confirm message, the message is propagated in the cluster, and the received robot compares the cooperative action, deletes the Request message corresponding to the Confirm message and all the generated Reply messages corresponding to the Request. And then back to the task before responding.

The response mechanism of the Warning message is consistent with the Request message, A ^k The same robot processes the message. When the robot fails, transmitting the warming information to other robots; the robot receiving the surfing message firstly constructs a combined robot model, calculates the cost of the inheritance task reaching the task starting point according to the combined robot model, informs other robots of the cost through Reply, completes the whole network iteration of the Request message through a gossip protocol, and finds the robot with the minimum cost as the inheritance robot. Reply ⁱ And σ in the Confirm message _d No recognizable numerical value such as-1 is required to be filled in or otherwise filled in.

In summary, the above description is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A distributed multi-agent task cooperation method based on linear time sequence logic is characterized by comprising the following steps:

each agent constructs its own decoupling product formula

Automaton and by the decoupled product form

The automaton constructs a self-body action sequence; the decoupled product form

The automaton has the following construction mode: each agent adopts the function of describing its working environmentLimited state transfer system

And describing self-tasks

Automatic machine

Constructing a product formula directing task execution

Automaton PBA, denoted as

Constructing generalized labels

An automatic machine LGBA, detecting a coupling edge in the LGBA, determining the coupling edge in the PBA according to the projection relation between the LBGA and the PBA, and recording the coupling edge and the trigger condition thereof into a coupling set; the end points of the coupling edges correspond to actions needing cooperation;

using decoupled product equations at each agent

When the automaton independently executes the action sequence of the automaton, judging whether the currently executed action and the corresponding trigger condition are in the coupling set, if so, judging that the currently executed action is an action needing cooperation; at the moment, the action and the cooperation position which need to be cooperated are broadcasted to other intelligent agents, and the intelligent agent responsible for response makes a cooperation action;

when there is an agent p _i When the intelligent agent fails, the failure event is broadcasted to other intelligent agents; electing agent p responsible for inheritance _j Inheriting a failed agent p _i The task of (1);

the election agent p responsible for inheritance _j Inheriting a failed agent p _i The task of (1) is as follows:

all agents p receiving the broadcast of the failure event and having the same functionality as the failed agent _j Constructing a united intelligent agent model; the united intelligent agent model is constructed as follows: building Agents p _j Completing agent p _i PBA of the task of (1), noted

Establishing

To

The required inheritance task between the initial action nodes A is a jump action sequence; adding the sequence of jump actions to

Obtaining a joint agent model; wherein, the first and the second end of the pipe are connected with each other,

for failed agent p _i PBA of (3);

selecting an agent of an inherited task according to the cooperation cost indicated by the combined agent model; the agent inheriting the task executes the action sequence according to the joint agent model of the agent, and after the task is completed, the agent is switched back to the original decoupling product formula

An automaton.

2. The method according to claim 1, characterized in that the recording process of the coupling set is specifically:

nodes of the coupling tasks are marked in the LGBA and are called as coupling task nodes; marking the child nodes of the coupling task node as the coupling task node; all edges containing the nodes of the coupled task are recordedIs a coupled edge (q) _i ,q _j ) (ii) a Will couple the edges (q) _i ,q _j ) Corresponding label lambda _ij Set to true; the coupling task refers to a task completion state of a certain intelligent agent, and needs certain actions of other intelligent agents as conditions;

in building PBA, when an edge (q) that does not meet the branch condition occurs _s ,q _g ) Instead of deleting the edge directly, the corresponding edge (q) in the LGBA is found according to the projection relation between the LGBA and the PBA _i ,q _j ) Judging the found edge (q) _i ,q _j ) Tag lambda of _ij Whether true; if true, the corresponding edge (q) in PBA is set _s ,q _g ) Determining as a coupled edge, and coupling the coupled edge (q) _s ,q _g ) And trigger condition g thereof _sg Add to the coupling set of the PBA.

3. The method of claim 1, wherein the building of the federated agent model comprises the specific steps of:

step b1, agent p _j By describing their working environment

And describe agent p _i Of tasks

Constructing decoupled product form

Automatic machine

Step b2, assuming failed agent p _i Is currently in motion

The node in (1) is

Slave node

Starting to backtrack to a father node and searching for a previous node, wherein the backtrack reaches the initial node of the current task

And the process is finished, so that the process is finished,

to

All the nodes form a set Q;

step b3, finding out description failure intelligent agent p _i Working environment

Neutralization of

Contiguous adjacency points, denoted as point sets

Find out

Neutralization of

Contiguous adjacency points, denoted as point sets

Will point set

All ambient tags in (1) _i And

combined to form a plurality of new nodes

Form set Y1; will point set

All the environmental labels in (1) _j Combined with each node in Q to form a set Y2; will be provided with

Node direction shown in the middle set Y2

The combined intelligent agent model of the jump action sequence is added.

4. The method of claim 1, wherein the agent that elects the inherited task is: and calculating the cost of the jump action sequence according to the joint agent model, and selecting the agent with the minimum cost as the agent for inheriting the task.

5. The method of claim 1, wherein the broadcasting actions and collaboration locations requiring collaboration to other agents and the broadcasting failure events to other agents are implemented using gossip messaging protocols:

when the intelligent agentWhen the action needing cooperation is executed, the Request is sent to all other agents ^k A message; receiving a Request ^k The agent of the message calculates the cost of responding to the message and passes the cost through Reply ^k Informing other agents of completing Request through gossip protocol ^k Iteration of the message is carried out, and an agent with the minimum cost is found; finally passing through Confirm ^k End of message to Request ^k Responding to the message;

Request ^k message, Reply ^k Message, Confirm ^k Messages, Warning ^k The messages are all provided with version numbers so that the Request ^k Message and Warning ^k The message realizes the whole network broadcast and the election of the responder through the gossip information transmission protocol.

6. The method of claim 5, wherein the set of messages is:

indicating whether agent k determined to respond to the request information of the other agents,

indicating whether the requested action has been confirmed to be completed,