CN113255914A - Method for structurally representing intelligent agent target implementation process - Google Patents

Method for structurally representing intelligent agent target implementation process Download PDF

Info

Publication number
CN113255914A
CN113255914A CN202110401225.8A CN202110401225A CN113255914A CN 113255914 A CN113255914 A CN 113255914A CN 202110401225 A CN202110401225 A CN 202110401225A CN 113255914 A CN113255914 A CN 113255914A
Authority
CN
China
Prior art keywords
target
node
plan
nodes
steps
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110401225.8A
Other languages
Chinese (zh)
Inventor
姚远
吴迪
夏江涵
王晓璇
胡佳仪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN202110401225.8A priority Critical patent/CN113255914A/en
Publication of CN113255914A publication Critical patent/CN113255914A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/027Frames

Abstract

The invention relates to a method for structurally representing the target implementation process of an intelligent agent, which comprises the steps of constructing a multi-element group G based on a target planning graph, wherein the multi-element group G comprises a set of nodes in the target planning graph and directed edge sets of different types among the nodes, generating the target planning graph based on the target of any intelligent agent, selecting a plan and constructing an example to realize the structural execution of the target implementation process of the intelligent agent; the structure for representing the intelligent agent target implementation process more flexibly is provided, a specific use mode of the structure is given, the graph structure is used for representing the partial order execution dependency relationship among the steps based on the directed acyclic graph, the strict full order relationship in the traditional target plan tree is replaced by the partial order execution dependency relationship, the steps without the order dependency relationship are allowed to be executed in an alternative execution order or even in parallel, a more accurate target implementation mode can be represented more objectively, and the repeated reproduction can be realized on the premise that all nodes are generated based on the target plan library.

Description

Method for structurally representing intelligent agent target implementation process
Technical Field
The invention relates to the technical field of computer systems based on specific computing models, in particular to a method for structurally representing an intelligent object implementation process in the field of artificial intelligence.
Background
A Multi-Agent System (MAS) is a System that implements complex intelligence through interaction and cooperation between agents, is used to solve a problem that a single Agent is difficult or impossible to solve, and is widely used in important fields such as urban transportation and industrial manufacturing.
As a common agent structure, a BDI model constructs the mental state of an agent by defining Belief (Belief), goal (Desire) and Intention (intent), and explains the autonomous behavior of the agent in the environment according to the change of the mental state, the BDI agent constructed based on the BDI model has strong autonomous ability, interaction ability and strain ability in a dynamic environment, and the autonomous behavior has good verifiability, and is widely applied to the fields of military, aerospace and the like.
The BDI agent selects different plans (Plan) from a predetermined Plan library to achieve the goal; plan (Plan) specifies the preconditions (preconditions) for its execution and the steps required for its completion, which may be actions (actions) that the agent can directly perform or sub-targets that need to be implemented by the Plan; in the process of achieving the goal, the step that the agent promises to be performed is called the intention (interaction) of the agent to achieve the goal, namely what needs to be done to achieve the goal.
Solving the intent selection problem and the plan selection problem requires a complete, objective, repeatable analysis of the implementation process and steps of the agent objectives.
The most common way to represent the target implementation process at present is a target plan tree (goal-plan tree) structure proposed by Thangarajah et al, and the method adopts an and or tree to represent the relationship between the intelligent object target and the plan, so that the plan selection problem in the deliberate process of the intelligent object is converted into the search and traversal problem of the tree, and through the target plan tree, the intelligent object can track the complete process of target implementation and acquire all possible target implementation paths.
However, the currently defined target planning tree is limited by its own structural features, so that each step in the plan must be executed strictly according to a preset sequence, and the flexibility and expressiveness are insufficient, and the real execution situation cannot be represented. In a real environment, an agent can adjust the execution sequence of target implementation steps at any time under the condition that conditions allow, when the operation sequences of some nodes are exchanged, the change of the sequence does not affect the final target implementation, and a target plan tree cannot be used for the representation, so that the application of the target plan tree in a real scene is limited finally.
Disclosure of Invention
The invention solves the problems in the prior art and provides an optimized method for structurally representing the intelligent agent target realization process.
The technical scheme adopted by the invention is that the method for structurally representing the intelligent agent target implementation process comprises the following steps:
step 1: construction of a target plan based tuple G = (V, E)c,Eb,Eo) Where V is the set of nodes in the target planning graph, Ec、Eb、EoRespectively representing different types of directed edge sets among the nodes;
in the invention, V comprises a target node, a plan node and an action node; for a goal, one or more plans may be included, with a goal being completed if any plan is completed; for a plan, which may include sub-objectives and actions, and one or more sub-plans under the sub-objectives, the plan is completed only if the sub-objectives and actions are completed; also, all sub-goals and actions are referred to as steps.
In the invention, the targets are divided into a top-level target and sub-targets, the top-level target represents the highest state that an intelligent agent wants to reach, all plans, sub-targets and actions in the graph are used for realizing top-level target services, and the sub-targets are generated in the execution process of the plans without specifying the realized plans in advance.
In the present invention, set EcThe directed edge in (1) refers to pointing from a target node x to a planning node y, and satisfies (x, y) epsilon EcMeaning that y is a possible plan to implement x,is a corresponding relation; specifically, for any one target GiWhether it is a top level target or a sub-target, there is at least one related plan in the plan library, one of which needs to be selectedjTo implement GiRepresented in the graph structure as a slave target node NGiStarting from all for implementing GiPlanning node N ofPjAll have a directed edge (N)Gi,NPj)∈EcAnd represents the correspondence between the target and the plan.
In the present invention, set EbThe directed edge in (1) refers to pointing from a planning node x to an action or target node y, and satisfies (x, y) epsilon EbY is the step in plan x, which is a dependency; in particular, a plan P is completedjAll execution steps contained in the plan, plan P, need to be executedjThe steps in (1) can be actions that can be performed directly or sub-objectives that require selection of a plan to implement, represented in the graph structure as a slave plan node NPjGo out to all action or target nodes N it containsSkAll have a directed edge (N)Pj,NSk)∈EbThe dependency between the step and the plan is indicated.
In the present invention, set EoThe directed edge refers to the point from one target or action node x to another target or action node y in the same plan, and satisfies (x, y) epsilon EoIndicating that x must be executed before y, and performing the relation in a partial order; in particular, there may be dependencies or order relationships between steps in the same plan, such as step Sk+1Need to be in step SkAnd step Sk+2Then executed, assuming plan PjAny one of the steps SkAll have a set of pre-stagesφ(S k )So that SkIn thatφ(S k )All steps in (1) are executed after being executed, thenφ(S k )Step (2) is called SkA preliminary step of (S)kAll the preceding steps of (1) have partial order execution relation with the preceding steps, and the step nodes are arranged between the preceding stepsIs represented in the target plan graph structure as for plan PjNode N of any stepSkIf, ifφ(S k )If not, there is a directed edge slave node NSmPointing to node NSkAnd Smφ(S k )
In the present invention, any group (x, y) can only belong to Ec、Eb、EoOne kind of (1).
In the present invention, the target plan graph is a directed acyclic graph.
Step 2: any intelligent agent corresponds to a target plan library, and a target plan graph is generated based on the target of any intelligent agent;
and step 3: selecting a plan and constructing an example based on the target planning map;
and 4, step 4: structured execution of an agent object implementation process is achieved.
Preferably, in step 1, all V, E are in the initial statec、Eb、EoAre all empty sets.
Preferably, in step 1, the nodes are a set including all target nodes, plan nodes, and action nodes in the target plan graph, and the different types of directed edges include a corresponding relationship, a dependent relationship, and a partial order execution relationship.
Preferably, in step 2, the generating of the target plan map for any agent includes the following steps:
step 2.1: constructing an empty Plan Set (PS), and adding all plans in a target plan library of any intelligent agent into the PS;
step 2.2: judging whether the plan set PS is empty, if yes, proceeding step 2.6, otherwise, taking out a plan P from the PSiGenerating a planning node NPiAnd plan node NPiAdding the data to a set V in the tuple G, wherein i is the serial number of the plan in the target plan set, and i is more than or equal to 0;
step 2.3: judgment plan PiEach of the steps S;
if S is an action AnThen generate the corresponding action node NAnIs a reaction of NAnAdding to set V while adding doublets (N)Pi, NAn) Join to set Eb
If S is a sub-target GnThen, the set V is searched to determine whether there is a representation GnNode N ofGn(ii) a If not present or if present but NGnIf the degree of income is 0, the corresponding target node N is generated or reservedGnIs a reaction of NGnAdd to set V and add doublet (N)Pi, NGn) Join to set EbIf present and satisfies NGnIf the degree of income is greater than 0, then N is addedGnCopying one part of all types of nodes and edges in the whole connected set, generating new nodes and edges with different names and the same expression content, and respectively adding the new nodes and edges into the sets with corresponding types in the multi-element group G;
step 2.4: connecting the step nodes with the partial order execution relation according to the partial order execution relation among the step nodes;
step 2.5: find plan PiObject G to be achievedjGenerating a target node NGjJudging the target node NGjIf not, the target node N is determined to be in the set VGjAdding into the set V, if existing, only one target node N existsGjThen a doublet (N) of edges will be representedGj, NPi) Join set EcIf there is more than one target node NGjThen N is addedPiCopying one copy of all types of nodes and edges in the connected set to generate new nodes and edges with different names and the same type, adding the new nodes and edges into the set of corresponding types in the multi-element group G respectively, and representing the two-element group (N) of the edgeGj, NPi) Join set Ec(ii) a Completion PiDeleting P in the plan set PSiReturning to the step 2.2;
step 2.6: and checking the target plan graph, deleting logic redundant edges, and finishing the generation of the target plan graph.
Preferably, in the step 2.4, if there is a different step node NSmAnd NSk,NSmAnd NSkHas a partial order execution relation therebetween, and SmIs SkThe precondition of (2) is to use the binary group (N)Sm, NSk) Join to set Eo
Preferably, the step 3 comprises the steps of:
step 3.1: the intelligent agent determines a top target G according to the requirement0
Step 3.2: search all generated target planning graphs for G0If not with respect to G0The target planning map discards the current target and selects a new target G0Repeating the step 3.2, otherwise, carrying out the next step;
step 3.3: tuple G = (V, E) based on selected target planc,Eb,Eo) For G0Instantiation is carried out;
step 3.4: from the target G0Corresponding target node NG0Initially, a set of executable steps EX is establishedNG0Adding currently executable target node and/or action node to EXNG0
Step 3.5: random slave NG0Selects an executable plan node N from the plan child nodesPk
Preferably, any one of the nodes is provided with a state value; the status values include default, in execution, success, and failure.
Preferably, the initial values of all nodes are defaults.
Preferably, the step 4 comprises the steps of:
step 4.1: will NPkUpdate to executive status of (EX), delete EXNG0N in (1)G0
Step 4.2: will PkAll step nodes S without pre-step in the set of executable steps EXNG0Performing the following steps;
step 4.3: selection of EXNG0Any step node in the step is executed, and the state of the step node is updated to be in execution;
if the step node is an action node, directly executing;
if the node in the step is the sub-target node NGiTo establish a portable deviceColumn step set EXNGiThe current sub-target node NGiIn executable target node or action node joins EXNGiFor EXNGiRepeat step 3.5 and add NGiFrom EXNG0Removing EXNGiAdding EXNG0Performing the following steps;
step 4.4: updating executable step set EXNG0
If step node SjIf the execution is successful, the step node S is executedjIs updated to be successful and removed from the corresponding set of executable steps, updates the set of executable steps EXNG0(ii) a Judging with step node SjIf the step node of the pre-step has other non-executed pre-steps, directly repeating the step 4.3 if the step node is not the other non-executed pre-step, otherwise, directly using the step node SjAdding a step node to EX for a step of a preceding stepNG0In step (2), repeatedly executing step 4.3; set of executable steps EX if updatedNG0If the node is empty, setting the states of the plan node and the corresponding target node as successful to finish the target;
if step node SjIf the execution fails, the step node SjIs updated to failure, the state of the corresponding planning node is also updated to failure, and S is clearedjIs located in the set EXNGi(Sj∈ EXNGi) (ii) a Selecting a target node N belonging to the sameGiGo back to step 4.1 for NGiStep 4.1 and step 4.2 are executed, and step 4.3 is normally executed again; repeating step 4.3 until there are no step nodes that can be executed;
and if all plans fail, updating the state of the corresponding target node to fail.
The invention relates to a method for expressing the goal realizing course of intelligent agent structurally and optimally, construct the multivariate group G based on goal planning chart, the multivariate group includes the set of node in the goal planning chart and different kinds of directed edge sets among the node, the goal based on any intelligent agent generates the goal planning chart, choose the plan and construct the example and then realize the structural execution of the goal realizing course of the intelligent agent; the structure for representing the intelligent agent target implementation process more flexibly is provided, a specific use mode of the structure is given, the graph structure is used for representing the partial order execution dependency relationship among the steps based on the directed acyclic graph, the strict full order relationship in the traditional target plan tree is replaced by the partial order execution dependency relationship, the steps without the order dependency relationship are allowed to be executed in an alternative execution order or even in parallel, a more accurate target implementation mode can be represented more objectively, and the repeated reproduction can be realized on the premise that all nodes are generated based on the target plan library.
The invention has the beneficial effects that:
(1) the method carries out graph structural representation on the targets, plans and actions according to the logical relationship of the targets, plans and actions, and sets the front step nodes of the step nodes in the plans to be a plurality of rather than one nodes, thereby forming a graph structure, eliminating the constraint of strict sequential linear execution and better simulating the implementation process of the intelligent agent target;
(2) because no strict linear execution sequence exists among the step nodes, and no strict sequence constraint limitation exists when the logical relation is expressed, the method can be widely applied to different complex scenes, and the problem of poor expressiveness of the traditional target planning tree is solved;
(3) according to the accessibility of each node in the plan body, whether the nodes in each step in a plan can be parallel or not can be clearly known, so that various different choices are provided for the execution sequence, the flexibility is greatly improved, and the method can be used in the artificial intelligence field such as AI development and the like.
Drawings
FIG. 1 is a schematic view of a target plan of the present invention; in the figure:
the dashed edges represent dependencies between steps and plans, with directions pointing from the plan node to the step node (action or sub-goal);
the solid line edge with double arrows represents the partial order relation among the steps, and the direction is pointed to the node depending on the step by the step node;
the solid line edge with a single arrow represents the corresponding relation between the target and the plan, and the direction is pointed to the plan node by the target node;
FIG. 2 is a flow chart of the present invention.
Detailed Description
The present invention is described in further detail with reference to the following examples, but the scope of the present invention is not limited thereto.
The invention relates to a method for structurally representing the realization process of an intelligent object, wherein all subscripts are used for identifying the objects, sub-objects, plans and actions which are different under the current condition, and are not serial numbers.
As shown in FIG. 1, there is a top level target G0The top level goal may be accomplished through plan P0Or plan P1To achieve plan P0Or plan P1Any of which are considered to reach the top level goal; to plan P0For example, action A needs to be completed1Action A0Action A2And sub-target G1To complete action A1Before completing action A0To complete action A2Before completing action A0And G1So by analogy, the step that actually needs/can be executed first is action A3And action A0At this time, plan P2Completion sub-goal G1Is achieved, then action A is completed1And action A2Based on the executed action A1Action A0Action A2And sub-target G1Execution plan P0Finally reach the top level target G0(ii) a Plan P1The same is true.
The method comprises the following steps:
step 1: construction of a target plan based tuple G = (V, E)c,Eb,Eo) Where V is the set of nodes in the target planning graph, Ec、Eb、EoRespectively representing different types of directed edge sets among the nodes;
in step 1, all V, E are in the initial statec、Eb、EoAre all empty sets.
In the step 1, the nodes are a set including all target nodes, plan nodes and action nodes in the target planning graph, and the different types of directed edges include a corresponding relationship, a subordinate relationship and a partial order execution relationship.
Step 2: any intelligent agent corresponds to a target plan library, and a target plan graph is generated based on the target of any intelligent agent;
in step 2, generating the target plan map for any agent includes the following steps:
step 2.1: constructing an empty Plan Set (PS), and adding all plans in a target plan library of any intelligent agent into the PS;
step 2.2: judging whether the plan set PS is empty, if yes, proceeding step 2.6, otherwise, taking out a plan P from the PSiGenerating a planning node NPiAnd plan node NPiAdding the data to a set V in the tuple G, wherein i is the serial number of the plan in the target plan set, and i is more than or equal to 0;
step 2.3: judgment plan PiEach of the steps S;
if S is an action AnThen generate the corresponding action node NAnIs a reaction of NAnAdding to set V while adding doublets (N)Pi, NAn) Join to set Eb
If S is a sub-target GnThen, the set V is searched to determine whether there is a representation GnNode N ofGn(ii) a If not present or if present but NGnIf the degree of income is 0, the corresponding target node N is generated or reservedGnIs a reaction of NGnAdd to set V and add doublet (N)Pi, NGn) Join to set EbIf present and satisfies NGnIf the degree of income is greater than 0, then N is addedGnCopying one part of all types of nodes and edges in the whole connected set, generating new nodes and edges with different names and the same expression content, and respectively adding the new nodes and edges into the sets with corresponding types in the multi-element group G;
step 2.4: connecting the step nodes with the partial order execution relation according to the partial order execution relation among the step nodes;
in step 2.4, if there are different step nodes NSmAnd NSk,NSmAnd NSkHas a partial order execution relationship therebetween, andSmis SkThe precondition of (2) is to use the binary group (N)Sm, NSk) Join to set Eo
Step 2.5: find plan PiObject G to be achievedjGenerating a target node NGjJudging the target node NGjIf not, the target node N is determined to be in the set VGjAdding into the set V, if existing, only one target node N existsGjThen a doublet (N) of edges will be representedGj, NPi) Join set EcIf there is more than one target node NGjThen N is addedPiCopying one copy of all types of nodes and edges in the connected set to generate new nodes and edges with different names and the same type, adding the new nodes and edges into the set of corresponding types in the multi-element group G respectively, and representing the two-element group (N) of the edgeGj, NPi) Join set Ec(ii) a Completion PiDeleting P in the plan set PSiReturning to the step 2.2;
step 2.6: and checking the target plan graph, deleting logic redundant edges, and finishing the generation of the target plan graph.
In the present invention, step 2.1, the tuple G is actually needed to be constructed initially based on the agent's target plan library, in the present invention the default G exists, and in the output state, V, E in Gc、EbAnd EoAre all empty sets; the target plan library is a plurality of current individual plans, and when a target needs to be realized, the corresponding plan capable of realizing the target is searched in the target plan library.
In the invention, step 2 is to find out all the associated nodes which can achieve the goal, including plan and actions and sub-goals under the plan.
In the present invention, deleting the logical redundant edge in step 2.6 is a conventional technique in the art, that is, for a partial order execution relationship structure of the logical relationship of each step node in a plan, if there is a directed edge e satisfying the basic construction condition of G, and even if it is deleted, the logical constraint of the sequential execution between each step node is still unchanged, then e is called asIs a logical redundant edge; for example, step node A in FIG. 15、A6And G3,A6The pre-step node of (A) is5And G3And A is5The pre-step node of (2) is G3At this time A5To A6The directed edge can be deleted, and the deletion of the edge does not change the logic constraint relation between nodes; in order to simplify the logic constraint relation between nodes executed successively, all the logically redundant edges need to be deleted. After all the deletable edges are deleted, the simplest target planning graph can be obtained, the simplest target planning graph is more concise in expression, and after the redundant logic edges are deleted, when each node updates the completion condition of the preposed node, the nodes related to the redundant logic edges do not need to be processed.
And step 3: selecting a plan and constructing an example based on the target planning map;
the step 3 comprises the following steps:
step 3.1: the intelligent agent determines a top target G according to the requirement0
Step 3.2: search all generated target planning graphs for G0If not with respect to G0The target planning map discards the current target and selects a new target G0Repeating the step 3.2, otherwise, carrying out the next step;
step 3.3: tuple G = (V, E) based on selected target planc,Eb,Eo) For G0Instantiation is carried out;
any node is provided with a state value; the status values include default (default), executing (executing), success (success), and failure (fail).
All nodes have default initial values.
Step 3.4: from the target G0Corresponding target node NG0Initially, a set of executable steps EX is establishedNG0Adding currently executable target node and/or action node to EXNG0
Step 3.5: random slave NG0Selects an executable plan node N from the plan child nodesPk
In the present invention, EX is used when execution is startedNG0= NG0And N isG0Is updated to be in execution.
And 4, step 4: structured execution of an agent object implementation process is achieved.
The step 4 comprises the following steps:
step 4.1: will NPkUpdate to executive status of (EX), delete EXNG0N in (1)G0
Step 4.2: will PkAll step nodes S without pre-step in the set of executable steps EXNG0Performing the following steps;
step 4.3: selection of EXNG0Any step node in the step is executed, and the state of the step node is updated to be in execution;
if the step node is an action node, directly executing;
if the node in the step is the sub-target node NGiEstablishing a set of executable steps EXNGiThe current sub-target node NGiIn executable target node or action node joins EXNGiFor EXNGiRepeat step 3.5 and add NGiFrom EXNG0Removing EXNGiAdding EXNG0Performing the following steps;
step 4.4: updating executable step set EXNG0
If step node SjIf the execution is successful, the step node S is executedjIs updated to be successful and removed from the corresponding set of executable steps, updates the set of executable steps EXNG0(ii) a Judging with step node SjIf the step node of the pre-step has other non-executed pre-steps, directly repeating the step 4.3 if the step node is not the other non-executed pre-step, otherwise, directly using the step node SjAdding a step node to EX for a step of a preceding stepNG0In step (2), repeatedly executing step 4.3; set of executable steps EX if updatedNG0If the node is empty, setting the states of the plan node and the corresponding target node as successful to finish the target;
if step node SjExecution failureThen step node SjIs updated to failure, the state of the corresponding planning node is also updated to failure, and S is clearedjIs located in the set EXNGi(Sj∈EXNGi) (ii) a Selecting a target node N belonging to the sameGiGo back to step 4.1 for NGiStep 4.1 and step 4.2 are executed, and step 4.3 is normally executed again; repeating step 4.3 until there are no step nodes that can be executed;
and if all plans fail, updating the state of the corresponding target node to fail.
In the present invention, EX is updatedNG0And adding NPkNode of executable step (1), NPkUpdate to in-flight and delete NG0Starting execution plan NPkWhen N is presentPkSome of the steps S are without pre-stages, when these steps S are executable, all step nodes are added to the executable step set EX in turnNG0In the method, step nodes with default states are all executable, namely, no precedence order constraint exists, and EX is selected to be executedNG0Any one of the step nodes in (1); for action nodes it can be executed directly, while for sub-target nodes it is necessary to establish a new, included set of executable steps EXNGiAfter completion, EX isNGiAdding EXNG0And (4) performing updating.
In the present invention, in action A0Action A2Sub-target G1For example, when action A0After the execution is completed, action A2Still cannot be executed, must be taken as the child target G1After the execution is finished, action A can be executed2Therefore, all unexecuted prestage nodes need to be added to EXNG0And step 4.3 is performed.
In the invention, the first step is continuously and repeatedly completed, and a new executable set related to a possibly existing sub-target is added until EXNG0An empty set means that all execution step nodes in a plan node that completes the top-level target are successfully executed, and a successful execution of the corresponding plan means that the state of the target node is changedAnd the new execution is successful.
In the invention, in the actual operation, a counting mark can be set, namely the top target is G0The first sub-target is G1And numbered sequentially when G is completeiAnd i is 0, the top level goal is achieved.

Claims (9)

1. A method for structured representation of an intelligent agent object realization process, characterized by: the method comprises the following steps:
step 1: construction of a target plan based tuple G = (V, E)c,Eb,Eo) Where V is the set of nodes in the target planning graph, Ec、Eb、EoRespectively representing different types of directed edge sets among the nodes;
step 2: any intelligent agent corresponds to a target plan library, and a target plan graph is generated based on the target of any intelligent agent;
and step 3: selecting a plan and constructing an example based on the target planning map;
and 4, step 4: structured execution of an agent object implementation process is achieved.
2. A method for structured representation of an intelligent object realization process according to claim 1, characterized in that: in step 1, all V, E are in the initial statec、Eb、EoAre all empty sets.
3. A method for structured representation of an intelligent object realization process according to claim 1, characterized in that: in the step 1, the nodes are a set including all target nodes, plan nodes and action nodes in the target planning graph, and the different types of directed edges include a corresponding relationship, a subordinate relationship and a partial order execution relationship.
4. A method for structured representation of an intelligent object realization process according to claim 1, characterized in that: in step 2, generating the target plan map for any agent includes the following steps:
step 2.1: constructing an empty Plan Set (PS), and adding all plans in a target plan library of any intelligent agent into the PS;
step 2.2: judging whether the plan set PS is empty, if yes, proceeding step 2.6, otherwise, taking out a plan P from the PSiGenerating a planning node NPiAnd plan node NPiAdding the data to a set V in the tuple G, wherein i is the serial number of the plan in the target plan set, and i is more than or equal to 0;
step 2.3: judgment plan PiEach of the steps S;
if S is an action AnThen generate the corresponding action node NAnIs a reaction of NAnAdding to set V while adding doublets (N)Pi, NAn) Join to set Eb
If S is a sub-target GnThen, the set V is searched to determine whether there is a representation GnNode N ofGn(ii) a If not present or if present but NGnIf the degree of income is 0, the corresponding target node N is generated or reservedGnIs a reaction of NGnAdd to set V and add doublet (N)Pi, NGn) Join to set EbIf present and satisfies NGnIf the degree of income is greater than 0, then N is addedGnCopying one part of all types of nodes and edges in the whole connected set, generating new nodes and edges with different names and the same expression content, and respectively adding the new nodes and edges into the sets with corresponding types in the multi-element group G;
step 2.4: connecting the step nodes with the partial order execution relation according to the partial order execution relation among the step nodes;
step 2.5: find plan PiObject G to be achievedjGenerating a target node NGjJudging the target node NGjIf not, the target node N is determined to be in the set VGjAdding into the set V, if existing, only one target node N existsGjThen a doublet (N) of edges will be representedGj, NPi) Join set EcIf there is more than one target node NGjThen N is addedPiCopying one copy of all types of nodes and edges in the connected set to generate new nodes and edges with different names and the same type, adding the new nodes and edges into the set of corresponding types in the multi-element group G respectively, and representing the two-element group (N) of the edgeGj, NPi) Join set Ec(ii) a Completion PiDeleting P in the plan set PSiReturning to the step 2.2;
step 2.6: and checking the target plan graph, deleting logic redundant edges, and finishing the generation of the target plan graph.
5. The method for structured representation of intelligent agent object realization process according to claim 4, characterized in that: in step 2.4, if there are different step nodes NSmAnd NSk,NSmAnd NSkHas a partial order execution relation therebetween, and SmIs SkThe precondition of (2) is to use the binary group (N)Sm, NSk) Join to set Eo
6. A method for structured representation of an intelligent object realization process according to claim 1, characterized in that: the step 3 comprises the following steps:
step 3.1: the intelligent agent determines a top target G according to the requirement0
Step 3.2: search all generated target planning graphs for G0If not with respect to G0The target planning map discards the current target and selects a new target G0Repeating the step 3.2, otherwise, carrying out the next step;
step 3.3: tuple G = (V, E) based on selected target planc,Eb,Eo) For G0Instantiation is carried out;
step 3.4: from the target G0Corresponding target node NG0Initially, a set of executable steps EX is establishedNG0Adding currently executable target node and/or action node to EXNG0
Step 3.5: random slave NG0Plan child node ofTo select an executable plan node NPk
7. The method for structured representation of intelligent agent object realization process according to claim 6, characterized in that: any node is provided with a state value; the status values include default, in execution, success, and failure.
8. A method for structured representation of an intelligent object realization process according to claim 7, characterized in that: all nodes have default initial values.
9. The method for structured representation of intelligent agent object realization process according to claim 6, characterized in that: the step 4 comprises the following steps:
step 4.1: will NPkUpdate to executive status of (EX), delete EXNG0N in (1)G0
Step 4.2: will PkAll step nodes S without pre-step in the set of executable steps EXNG0Performing the following steps;
step 4.3: selection of EXNG0Any step node in the step is executed, and the state of the step node is updated to be in execution;
if the step node is an action node, directly executing;
if the node in the step is the sub-target node NGiEstablishing a set of executable steps EXNGiThe current sub-target node NGiIn executable target node or action node joins EXNGiFor EXNGiRepeat step 3.5 and add NGiFrom EXNG0Removing EXNGiAdding EXNG0Performing the following steps;
step 4.4: updating executable step set EXNG0
If step node SjIf the execution is successful, the step node S is executedjIs updated to be successful and removed from the corresponding set of executable steps, updates the set of executable steps EXNG0(ii) a The step of judgmentNode SjIf the step node of the pre-step has other non-executed pre-steps, directly repeating the step 4.3 if the step node is not the other non-executed pre-step, otherwise, directly using the step node SjAdding a step node to EX for a step of a preceding stepNG0In step (2), repeatedly executing step 4.3; set of executable steps EX if updatedNG0If the node is empty, setting the states of the plan node and the corresponding target node as successful to finish the target;
if step node SjIf the execution fails, the step node SjIs updated to failure, the state of the corresponding planning node is also updated to failure, and S is clearedjIs located in the set EXNGi(ii) a Selecting a target node N belonging to the sameGiGo back to step 4.1 for NGiStep 4.1 and step 4.2 are executed, and step 4.3 is then executed; repeating step 4.3 until there are no step nodes that can be executed;
and if all plans fail, updating the state of the corresponding target node to fail.
CN202110401225.8A 2021-04-14 2021-04-14 Method for structurally representing intelligent agent target implementation process Pending CN113255914A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110401225.8A CN113255914A (en) 2021-04-14 2021-04-14 Method for structurally representing intelligent agent target implementation process

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110401225.8A CN113255914A (en) 2021-04-14 2021-04-14 Method for structurally representing intelligent agent target implementation process

Publications (1)

Publication Number Publication Date
CN113255914A true CN113255914A (en) 2021-08-13

Family

ID=77221043

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110401225.8A Pending CN113255914A (en) 2021-04-14 2021-04-14 Method for structurally representing intelligent agent target implementation process

Country Status (1)

Country Link
CN (1) CN113255914A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114880083A (en) * 2022-03-24 2022-08-09 哈尔滨工业大学(深圳) Optimization method of logic complexity of DAG task execution and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114880083A (en) * 2022-03-24 2022-08-09 哈尔滨工业大学(深圳) Optimization method of logic complexity of DAG task execution and storage medium

Similar Documents

Publication Publication Date Title
US8185421B2 (en) Multi-objective optimization within a constraint management system
CN112364984A (en) Cooperative multi-agent reinforcement learning method
CN108803609B (en) Partially observable automatic driving decision method based on constraint online planning
CN114741886B (en) Unmanned aerial vehicle cluster multi-task training method and system based on contribution degree evaluation
CN112711475A (en) Workflow scheduling method and system based on graph convolution neural network
CN112632744A (en) Combat system architecture modeling method and space exploration algorithm based on hyper-network model
CN114201885B (en) Improved behavior tree-based military force entity behavior simulation element modeling method and system
CN113255914A (en) Method for structurally representing intelligent agent target implementation process
US20190385704A1 (en) Apparatus and method for utilizing a parameter genome characterizing neural network connections as a building block to construct a neural network with feedforward and feedback paths
CN106953801B (en) Random shortest path realization method based on hierarchical learning automaton
Xu et al. Living with artificial intelligence: A paradigm shift toward future network traffic control
Kadokawa et al. Cyclic policy distillation: Sample-efficient sim-to-real reinforcement learning with domain randomization
Zhu et al. A Q-values sharing framework for multi-agent reinforcement learning under budget constraint
CN111507499B (en) Method, device and system for constructing model for prediction and testing method
WO2021194516A1 (en) Data-dependent node-to-node knowledge sharing by regularization in deep learning
CN116244301A (en) Space-time process mechanism model service chain arrangement method based on directed acyclic graph
CN113220437B (en) Workflow multi-target scheduling method and device
De Angelis et al. Logic Fragments: a coordination model based on logic inference
CN110868461B (en) Data distribution method facing heterogeneous bandwidth between nodes in Gaia cluster
Wan et al. Extending the BDI Model with Q-learning in Uncertain Environment
Dehimi et al. A novel distributed dynamic planning approach based on constraint satisfaction.
Dubravin et al. Formal and conceptual definitions of the hybrid model of distributed computings in networks
WO2019246116A1 (en) Apparatus and method for utilizing a parameter genome characterizing neural network connections as a building block to construct a neural network with feedforward and feedback paths
CN111585793B (en) Network service optimization combination method
Xiao et al. Behavioral Cloning Based Model Generation Method for Reinforcement Learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination