CN110989352B - Group robot collaborative search method based on Monte Carlo tree search algorithm - Google Patents

Group robot collaborative search method based on Monte Carlo tree search algorithm Download PDF

Info

Publication number
CN110989352B
CN110989352B CN201911272386.0A CN201911272386A CN110989352B CN 110989352 B CN110989352 B CN 110989352B CN 201911272386 A CN201911272386 A CN 201911272386A CN 110989352 B CN110989352 B CN 110989352B
Authority
CN
China
Prior art keywords
ground mobile
mobile robot
node
robot
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911272386.0A
Other languages
Chinese (zh)
Other versions
CN110989352A (en
Inventor
丁肇红
吴莹莹
温晓静
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Institute of Technology
Original Assignee
Shanghai Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Institute of Technology filed Critical Shanghai Institute of Technology
Priority to CN201911272386.0A priority Critical patent/CN110989352B/en
Publication of CN110989352A publication Critical patent/CN110989352A/en
Application granted granted Critical
Publication of CN110989352B publication Critical patent/CN110989352B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance

Abstract

The invention discloses a group robot collaborative search method based on a Monte Carlo tree search algorithm, and belongs to the technical field of multi-agent target search. The method comprises the following steps: setting a target position according to a target monitoring area of a ground mobile robot group; the ground mobile robot track planning is used for solving the problem of two-dimensional track planning, possible access node sequences in the collaborative search process of the ground mobile robot are determined based on a Monte Carlo tree search algorithm, and probability distribution corresponding to the access sequences of the robot is optimized by a probability descent method; communicating with other ground robots, updating the joint probability distribution of the access sequences of the ground robot group, and selecting a first node in the access sequence with the highest probability as a next access node of the robot; and combining the motion constraint of the ground robot, realizing smooth closed-loop track planning of the ground mobile robot by utilizing a piecewise smooth curve, and solving the optimal observation problem of target search of the ground mobile robot group cooperation area under the time constraint.

Description

Group robot collaborative search method based on Monte Carlo tree search algorithm
Technical Field
The invention belongs to the technical field of multi-agent regional monitoring, and particularly relates to a group robot collaborative searching method based on a Monte Carlo tree searching algorithm.
Background
At present, the multi-agent environment perception technology is mainly used for passively completing tasks such as environment detection, target identification and tracking, real-time positioning, map construction and the like, and the number of related agents is mostly single. In addition, the research field of the ground mobile robot group is mostly concentrated on centralized formation of the robot group, a communication mechanism between the robots, task resource allocation between multiple robots, and the like, and the research field of the collaborative target search of the mobile robot group is rarely studied. With the rapid development of deep learning algorithms, the current popular deep learning mainly focuses on the processing of text, images, videos and other data, but the process is long in time consumption and high in operation complexity, and cannot be applied to an actual multi-robot system. In a complex large-scale dynamic environment, the amount of information required by the robot to interact with the environment is large, and the robot cannot well sense an active target through a deep learning method.
The target search of the mobile robot in the existing literature is focused on the known static environment, the path between a starting point and an end point is obtained by discretizing the environment and adopting the traditional search algorithm, and the path is an absolute shortest path under the resolution of an environment map; the robot track planning is carried out by utilizing an improved A-algorithm and a particle swarm algorithm, but the A-algorithm is slow in searching speed and large in calculation amount, so that the optimal track of a robot group is difficult to find under the condition of ensuring multiple constraints; the planning space division of the particle swarm algorithm is rough, motion constraint conditions are difficult to meet, the space search outside the optional path set cannot be realized, the accuracy of the particle swarm algorithm is not high enough, and the optimal track is difficult to find under the complex environment and the multi-constraint conditions; most mobile robotic target search research focuses on single robotic applications, and rarely involves collaborative searching and perception of group robots.
The Monte Carlo tree search algorithm is a game tree search algorithm which utilizes a Monte Carlo method as evaluation, does not need to introduce excessive field knowledge, and has huge expandability. The upper limit confidence interval strategy is a method for solving the multi-arm bandit problem. The upper confidence interval strategy based on the UCT Monte Carlo tree search algorithm is proved to greatly improve the level of a computer game engine. At present, the algorithm is applied to game development, and a few papers apply the Monte Carlo algorithm to the online trajectory planning of a single robot and mostly concentrate on a two-dimensional space. In the field of robot group target search, no patent exists yet for realizing a target search task by utilizing the algorithm.
Disclosure of Invention
Aiming at large-scale and incompletely known unstructured complex environments, the regional collaborative rapid trajectory planning and target searching problems of the group robots under multiple constraints are solved by using the Monte Carlo tree search algorithm and segmented Dubins planning. The method plans a most effective track for each ground mobile robot, searches and observes targets in an area at the minimum cost and the highest speed, and collects area information in real time to the maximum extent so as to achieve the aim of effectively monitoring a large-scale area.
In order to achieve the above purpose, the technical solution for solving the technical problem is as follows:
a group robot collaborative search method based on a Monte Carlo tree search algorithm comprises the following steps:
step 1: setting parameters including a travel budget time threshold value of a ground mobile robot group and iteration times of a Monte Carlo tree search algorithm; setting n target positions of the ground mobile robot group according to a target monitoring area of the ground mobile robot group; the ground mobile robot group comprises N ground mobile robots moving independently; each ground mobile robot is provided with a vision sensor, the monitoring radius of each ground mobile robot is set according to the sensing distance of each vision sensor, and the monitoring radius is used as the neighborhood of each target position;
step 2: randomly selecting an initial position for each ground mobile robot in an environment to be detected, obtaining an access node sequence by each ground mobile robot in a parallel mode based on a Monte Carlo tree search algorithm and detecting the environment, wherein in the observation process, the more data collected by a vision sensor and a laser radar, the higher the reward value is, forming an access sequence meeting time constraint by the robot group according to a greedy principle, and after observation, returning each ground mobile robot to the respective initial position to respectively form a corresponding closed-loop track;
and step 3: continuously repeating the step 2 until the time budget of the ground mobile robot group is exhausted or the maximum iteration times is reached, and outputting an access node sequence of the ground mobile robot group;
and 4, step 4: and synchronously executing the steps by each ground mobile robot in the ground mobile robot group, and enabling the multi-ground mobile robot to visit the position of the target object as much as possible within the shortest time by the planned track, thereby realizing a quick cooperative target search task.
Further, the step 2 is realized by the following steps:
step 21: an improved method of a Monte Carlo tree search algorithm, namely an upper limit confidence interval tree search algorithm, is adopted, the Monte Carlo tree search method is combined with an upper limit confidence interval formula, and possible access node sequences in the individual collaborative search process of each ground mobile robot are determined;
step 22: optimizing the probability distribution corresponding to the access sequence of the ground mobile robot by a random probability descent method, and completing the probability updating of the access sequence by utilizing the maximum entropy principle, wherein the probability updating formula is as follows:
Figure BDA0002306734620000031
wherein x is a possible access sequence of the ground mobile robot and q isnTo access the probability distribution corresponding to the sequence x,
Figure BDA0002306734620000032
for the objective function f to the probability distribution qnExpectation of (1), H (q)n) Is a probability distribution qnEntropy of (d);
step 23: communicating with other ground mobile robots, updating the joint probability distribution of the access sequences of the ground mobile robot group, and selecting a first node in the access sequence with the highest probability as a next access node of the ground mobile robot;
step 24: the motion of the ground mobile robot is limited by the maximum turning radius and the maximum speed, and the smooth closed-loop track planning of the ground mobile robot is realized by utilizing the segmented Dubins curve in combination with the motion characteristics.
Further, the step 21 specifically includes the following steps:
step 211: selecting the best expansion node as a child node in the search tree, preferentially selecting the node with the highest reward value in unexplored nodes, namely the node with the most observation information by adopting a greedy principle, and selecting the node with the highest upper limit confidence interval value if all the nodes are visited, wherein the upper limit confidence interval calculation formula is as follows:
Figure BDA0002306734620000033
wherein a is the node number, t is the iteration number, NtFor the number of times the node is accessed, Qt(a) A reward estimate for node a;
step 212: further before the selected child node in step 211, randomly expanding leaf nodes at the node, wherein the newly expanded node is not repeated with the previous child node or the child nodes of the search tree of other ground mobile robots;
step 213: calculating the latest reward estimated value of the expanded leaf node sequence obtained in the step 212;
step 214: after traversing all the nodes in the Monte Carlo tree, updating the reward values of the corresponding nodes on the search tree according to the reward estimation value in the step 213 by adopting a back propagation mode;
step 215: and repeating the step 211 and 214 until the reward value of at least one node in the monte carlo tree reaches the preset threshold, and selecting the node corresponding to the maximum reward value from all the reward values reaching the preset threshold as a new access node.
Further, the step 24 specifically includes the following steps:
the simple kinematic model of each wheeled ground robot system employed is as follows:
Figure BDA0002306734620000041
Figure BDA0002306734620000042
Figure BDA0002306734620000043
wherein x isP、ypTheta is the current pose of the chassis of the robot, V is the speed of the robot,μ denotes the turning speed control, the maximum turning rate corresponds to a certain minimum turning radius, and the initial and terminal tangential directions correspond to the initial and terminal coordinates.
Further, the step 3 specifically includes the following steps:
setting the total navigation time T of the ground mobile robot group as an objective function, wherein the expression is as follows:
Figure BDA0002306734620000044
if the group target function of the ground mobile robots is the minimum, returning the track curve of each ground mobile robot, namely returning the path of each ground mobile robot corresponding to the shortest total running time of the ground mobile robots, and searching the target of each ground mobile robot according to the obtained path; wherein χ is the final trajectory of the robot, XiIs the ith segmented Dubins curve of the robot.
Due to the adoption of the technical scheme, compared with the prior art, the invention has the following advantages and positive effects:
1. compared with a single robot target searching method, the group robot collaborative searching method based on the Monte Carlo tree searching algorithm provided by the invention can solve the problem of rapid track planning of a mobile robot group and realize target searching of the mobile robot group. On the premise of meeting the self motion characteristics, the mobile robot group searches and observes the target at the minimum cost and the highest speed, and collects regional information in real time to the maximum extent;
2. the method solves the problem of multi-collaborative track planning of the multi-ground mobile robot, and uses the Monte Carlo tree search algorithm and the Dubins smooth planning to solve the problems of track fast planning and target search in order to improve the algorithm execution efficiency.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly introduced below. It is obvious that the drawings in the following description are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort. In the drawings:
FIG. 1 is a schematic flow chart of a group robot collaborative search method based on a Monte Carlo tree search algorithm according to the present invention;
FIG. 2 is a diagram of a relative position relationship among a ground mobile robot group, a target position and a target position neighborhood related to the group robot collaborative search method based on the Monte Carlo tree search algorithm.
Detailed Description
While the embodiments of the present invention will be described and illustrated in detail with reference to the accompanying drawings, it is to be understood that the invention is not limited to the specific embodiments disclosed, but is intended to cover various modifications, equivalents, and alternatives falling within the scope of the invention as defined by the appended claims.
As shown in fig. 1, the embodiment discloses a group robot collaborative search method based on a monte carlo tree search algorithm, which includes the following steps:
step 1: setting parameters including a travel budget time threshold value of a ground mobile robot group and iteration times of a Monte Carlo tree search algorithm; setting n target positions of the ground mobile robot group according to a target monitoring area of the ground mobile robot group; the ground mobile robot group comprises N ground mobile robots moving independently; each ground mobile robot is mounted with a vision sensor, and a monitoring radius R of each ground mobile robot is set according to a sensing distance of each vision sensor (in the present embodiment, each vision sensor is the same, and therefore the monitoring radius R of each ground mobile robot is equal and is set as a sensing distance of the vision sensor). In order to save the target searching time and improve the efficiency and the adaptability of target searching, the set monitoring radius is used as the neighborhood of each target position, and the robot group can be regarded as achieving the observation effect only by observing the target in the monitoring radius. Fig. 2 is a diagram of relative position relationship between a robot population (in this embodiment, a robot population is composed of 5 ground mobile robots), a target position and a target position neighborhood, where points a, B, C, D, and E are starting points of the robot population on the ground, black points are target positions of a monitoring area, and a gray circular area is a neighborhood of the target position;
step 2: randomly selecting an initial position for each ground mobile robot in an environment to be detected, obtaining an access node sequence by each ground mobile robot in a parallel mode based on a Monte Carlo tree search algorithm and detecting the environment, wherein in the observation process, the more data collected by a vision sensor and a laser radar, the higher the reward value is, forming an access sequence meeting time constraint by the robot group according to a greedy principle, and after observation, returning each ground mobile robot to the respective initial position to respectively form a corresponding closed-loop track; specifically, the steps are realized through the following steps:
step 21: an improved method of a Monte Carlo tree search algorithm, namely an upper limit confidence interval tree search algorithm, is adopted, the Monte Carlo tree search method is combined with an upper limit confidence interval formula, and possible access node sequences in the individual collaborative search process of each ground mobile robot are determined; specifically, the method specifically comprises the following steps:
step 211: selecting the best expansion node as a child node in the search tree, preferentially selecting the node with the highest reward value in unexplored nodes, namely the node with the most observation information by adopting a greedy principle, and selecting the node with the highest upper limit confidence interval value if all the nodes are visited, wherein the upper limit confidence interval calculation formula is as follows:
Figure BDA0002306734620000061
wherein a is the node number, t is the iteration number, NtFor the number of times the node is accessed, Qt(a) A reward estimate for node a;
step 212: further before the selected child node in step 211, randomly expanding leaf nodes at the node, wherein the newly expanded node is not repeated with the previous child node or the child nodes of the search tree of other ground mobile robots;
step 213: calculating the latest reward estimated value of the expanded leaf node sequence obtained in the step 212;
step 214: after traversing all the nodes in the Monte Carlo tree, updating the reward values of the corresponding nodes on the search tree according to the reward estimation value in the step 213 by adopting a back propagation mode;
step 215: and repeating the step 211 and 214 until the reward value of at least one node in the monte carlo tree reaches the preset threshold, and selecting the node corresponding to the maximum reward value from all the reward values reaching the preset threshold as a new access node.
Step 22: optimizing the probability distribution corresponding to the access sequence of the ground mobile robot by a random probability descent method, and completing the probability updating of the access sequence by utilizing the maximum entropy principle, wherein the probability updating formula is as follows:
Figure BDA0002306734620000071
wherein x is a possible access sequence of the ground mobile robot and q isnTo access the probability distribution corresponding to the sequence x,
Figure BDA0002306734620000072
for the objective function f to the probability distribution qnExpectation of (1), H (q)n) Is a probability distribution qnEntropy of (d);
step 23: communicating with other ground mobile robots, updating the joint probability distribution of the access sequences of the ground mobile robot group, and selecting a first node in the access sequence with the highest probability as a next access node of the ground mobile robot;
step 24: the motion of the ground mobile robot is limited by the maximum turning radius and the maximum speed, and the smooth closed-loop track planning of the ground mobile robot is realized by utilizing the segmented Dubins curve in combination with the motion characteristics. Specifically, the steps specifically include the following:
the simple kinematic model of each wheeled ground robot system employed is as follows:
Figure BDA0002306734620000073
Figure BDA0002306734620000074
Figure BDA0002306734620000075
wherein x isP、ypAnd theta is the current pose of the chassis of the robot, V is the speed of the robot, mu represents the turning speed control, the maximum turning speed corresponds to a certain minimum turning radius, and the initial and terminal tangential directions correspond to the initial and terminal coordinates. The Dubins curve is the shortest path connecting two-dimensional planes, which is a practically operable route for a ground mobile robot, under the condition that curvature constraints and prescribed tangential directions of the start and end are satisfied.
And 3, step 3: continuously repeating the step 2 until the time budget of the ground mobile robot group is exhausted or the maximum iteration times is reached, and outputting an access node sequence of the ground mobile robot group; specifically, the steps specifically include the following:
setting the total navigation time T of the ground mobile robot group as an objective function, wherein the expression is as follows:
Figure BDA0002306734620000081
if the group objective function of the ground mobile robots is minimum, returning to the track curve of each ground mobile robot, namely returning to the corresponding ground mobile robot with the shortest total running timeEach ground mobile robot path, and each ground mobile robot carries out target search according to the obtained path; wherein χ is the final trajectory of the robot, XiIs the ith segmented Dubins curve of the robot.
And 4, step 4: and synchronously executing the steps by each ground mobile robot in the ground mobile robot group, so that the planned track enables the multi-ground mobile robot to visit the positions of the target objects as much as possible in the shortest time, and the quick collaborative target searching task is realized.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (4)

1. A group robot collaborative search method based on a Monte Carlo tree search algorithm is characterized by comprising the following steps:
step 1: setting parameters including a travel budget time threshold value of a ground mobile robot group and iteration times of a Monte Carlo tree search algorithm; setting n target positions of the ground mobile robot group according to a target monitoring area of the ground mobile robot group; the ground mobile robot group comprises N ground mobile robots moving independently; each ground mobile robot is provided with a vision sensor, the monitoring radius of each ground mobile robot is set according to the sensing distance of each vision sensor, and the monitoring radius is used as the neighborhood of each target position;
step 2: randomly selecting an initial position for each ground mobile robot in an environment to be detected, obtaining an access node sequence by each ground mobile robot in a parallel mode based on a Monte Carlo tree search algorithm and detecting the environment, wherein in the observation process, the more data collected by a vision sensor and a laser radar, the higher the reward value is, forming an access sequence meeting time constraint by the robot group according to a greedy principle, and after observation, returning each ground mobile robot to the respective initial position to respectively form a corresponding closed-loop track;
the step 2 is realized by the following steps:
step 21: an improved method of a Monte Carlo tree search algorithm, namely an upper limit confidence interval tree search algorithm, is adopted, the Monte Carlo tree search method is combined with an upper limit confidence interval formula, and possible access node sequences in the individual collaborative search process of each ground mobile robot are determined;
step 22: optimizing the probability distribution corresponding to the access sequence of the ground mobile robot by a random probability descent method, and completing the probability updating of the access sequence by utilizing the maximum entropy principle, wherein the probability updating formula is as follows:
Figure FDA0003533746940000011
wherein x is a possible access sequence of the ground mobile robot and q isnTo access the probability distribution corresponding to the sequence x,
Figure FDA0003533746940000012
for the objective function f to the probability distribution qnExpectation of (1), H (q)n) Is a probability distribution qnEntropy of (d);
step 23: communicating with other ground mobile robots, updating the joint probability distribution of the access sequences of the ground mobile robot group, and selecting a first node in the access sequence with the highest probability as a next access node of the ground mobile robot;
step 24: the motion of the ground mobile robot is limited by the maximum turning radius and the maximum speed, and the smooth closed-loop track planning of the ground mobile robot is realized by utilizing the segmented Dubins curve in combination with the motion characteristics;
and step 3: continuously repeating the step 2 until the time budget of the ground mobile robot group is exhausted or the maximum iteration times is reached, and outputting an access node sequence of the ground mobile robot group;
and 4, step 4: and synchronously executing the steps by each ground mobile robot in the ground mobile robot group, and enabling the multi-ground mobile robot to visit the position of the target object as much as possible within the shortest time by the planned track, thereby realizing a quick cooperative target search task.
2. The cooperative group robot search method based on the monte carlo tree search algorithm according to claim 1, wherein the step 21 specifically comprises the steps of:
step 211: selecting the best expansion node as a child node in the search tree, preferentially selecting the node with the highest reward value in unexplored nodes, namely the node with the most observation information by adopting a greedy principle, and selecting the node with the highest upper limit confidence interval value if all the nodes are visited, wherein the upper limit confidence interval calculation formula is as follows:
Figure FDA0003533746940000021
wherein a is the node number, t is the iteration number, NtFor the number of times the node is accessed, Qt(a) A reward estimate for node a;
step 212: further before the selected child node in step 211, randomly expanding leaf nodes at the node, wherein the newly expanded node is not repeated with the previous child node or the child nodes of the search tree of other ground mobile robots;
step 213: calculating the latest reward estimated value of the expanded leaf node sequence obtained in the step 212;
step 214: after traversing all nodes in the Monte Carlo tree, updating the reward values of the corresponding nodes on the search tree by adopting a reverse propagation mode according to the reward estimation value in the step 213;
step 215: and repeating the step 211 and 214 until the reward value of at least one node in the monte carlo tree reaches the preset threshold, and selecting the node corresponding to the maximum reward value from all the reward values reaching the preset threshold as a new access node.
3. The cooperative group robot search method based on the monte carlo tree search algorithm according to claim 1, wherein the step 24 specifically comprises the following steps:
the simple kinematic model of each wheeled ground robot system employed is as follows:
Figure FDA0003533746940000022
Figure FDA0003533746940000023
Figure FDA0003533746940000024
wherein x isP、ypAnd theta is the current pose of the chassis of the robot, V is the speed of the robot, mu represents the turning speed control, the maximum turning speed corresponds to a certain minimum turning radius, and the initial and terminal tangential directions correspond to the initial and terminal coordinates.
4. The cooperative group robot search method based on the monte carlo tree search algorithm according to claim 1, wherein the step 3 specifically comprises the following steps:
setting the total navigation time T of the ground mobile robot group as an objective function, wherein the expression is as follows:
Figure FDA0003533746940000031
if the objective function of the ground mobile robot group is minimum, returning to the track curve of each ground mobile robot, namely returning to the ground mobile robot for the maximum total running timeThe corresponding paths of the ground mobile robots are shortened, and the ground mobile robots carry out target search according to the obtained paths; wherein χ is the final trajectory of the robot, XiIs the ith segmented Dubins curve of the robot.
CN201911272386.0A 2019-12-06 2019-12-06 Group robot collaborative search method based on Monte Carlo tree search algorithm Active CN110989352B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911272386.0A CN110989352B (en) 2019-12-06 2019-12-06 Group robot collaborative search method based on Monte Carlo tree search algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911272386.0A CN110989352B (en) 2019-12-06 2019-12-06 Group robot collaborative search method based on Monte Carlo tree search algorithm

Publications (2)

Publication Number Publication Date
CN110989352A CN110989352A (en) 2020-04-10
CN110989352B true CN110989352B (en) 2022-05-27

Family

ID=70092718

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911272386.0A Active CN110989352B (en) 2019-12-06 2019-12-06 Group robot collaborative search method based on Monte Carlo tree search algorithm

Country Status (1)

Country Link
CN (1) CN110989352B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111679679B (en) * 2020-07-06 2023-03-21 哈尔滨工业大学 Robot state planning method based on Monte Carlo tree search algorithm
CN111934948B (en) * 2020-07-20 2022-04-12 浙江理工大学 EFSM executable test sequence generation method based on Monte Carlo tree search
CN113139644B (en) * 2021-03-24 2024-02-09 北京科技大学顺德研究生院 Information source navigation method and device based on deep Monte Carlo tree search
CN113593228B (en) * 2021-07-26 2022-06-03 广东工业大学 Automatic driving cooperative control method for bottleneck area of expressway
CN114020024B (en) * 2021-11-05 2023-03-31 南京理工大学 Unmanned aerial vehicle path planning method based on Monte Carlo tree search
CN114237303B (en) * 2021-11-17 2022-09-06 中国人民解放军军事科学院国防科技创新研究院 Unmanned aerial vehicle path planning method and device based on Monte Carlo tree search
CN116027807B (en) * 2023-02-01 2023-07-25 南京航空航天大学 Heterogeneous unmanned aerial vehicle group cooperative power inspection method
CN116673968B (en) * 2023-08-03 2023-10-10 南京云创大数据科技股份有限公司 Mechanical arm track planning element selection method and system based on reinforcement learning
CN117346797B (en) * 2023-12-06 2024-03-19 合肥井松智能科技股份有限公司 B spline path planning method based on Monte Carlo search tree

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107038477A (en) * 2016-08-10 2017-08-11 哈尔滨工业大学深圳研究生院 A kind of neutral net under non-complete information learns the estimation method of combination with Q
CN109543872A (en) * 2018-09-19 2019-03-29 北京科技大学 Multirobot is without parking maneuver method and system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9047423B2 (en) * 2012-01-12 2015-06-02 International Business Machines Corporation Monte-Carlo planning using contextual information
US20190107406A1 (en) * 2017-10-09 2019-04-11 Nio Usa, Inc. Systems and methods for trip planning under uncertainty

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107038477A (en) * 2016-08-10 2017-08-11 哈尔滨工业大学深圳研究生院 A kind of neutral net under non-complete information learns the estimation method of combination with Q
CN109543872A (en) * 2018-09-19 2019-03-29 北京科技大学 Multirobot is without parking maneuver method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
群智进化理论及其在智能机器人中的应用;戚骁亚等;《中国工程科学》;20180815(第04期);全文 *

Also Published As

Publication number Publication date
CN110989352A (en) 2020-04-10

Similar Documents

Publication Publication Date Title
CN110989352B (en) Group robot collaborative search method based on Monte Carlo tree search algorithm
CN111240319B (en) Outdoor multi-robot cooperative operation system and method thereof
CN106371445B (en) A kind of unmanned vehicle planning control method based on topological map
Thompson et al. A probabilistic model of human motion and navigation intent for mobile robot path planning
Cunningham et al. DDF-SAM: Fully distributed SLAM using constrained factor graphs
CN110244715B (en) Multi-mobile-robot high-precision cooperative tracking method based on ultra wide band technology
Ye et al. Multi-UAV navigation for partially observable communication coverage by graph reinforcement learning
Zhou et al. Optimal motion strategies for range-only constrained multisensor target tracking
Yang et al. Decentralized cooperative search in UAV's using opportunistic learning
CN110471426A (en) Unmanned intelligent vehicle automatic Collision Avoidance method based on quantum wolf pack algorithm
Liu et al. Evasion of a team of dubins vehicles from a hidden pursuer
CN112857370A (en) Robot map-free navigation method based on time sequence information modeling
Sadhu et al. Aerial-DeepSearch: Distributed multi-agent deep reinforcement learning for search missions
Gu et al. Path planning for mobile robot in a 2.5‐dimensional grid‐based map
Jones et al. Information-guided persistent monitoring under temporal logic constraints
Pires et al. Cooperative localization and mapping with robotic swarms
Frew et al. Target assignment for integrated search and tracking by active robot networks
Ganganath et al. Finding energy-efficient paths on uneven terrains
Saha et al. Real-time robot path planning around complex obstacle patterns through learning and transferring options
Tao et al. Motion planning for slam based on frontier exploration
Qiao et al. Dynamic self-organizing leader-follower control in a swarm mobile robots system under limited communication
CN113778093A (en) AMR autonomous mobile robot path planning method based on improved sparrow search algorithm
Yuan et al. Path-planning-enabled semiflocking control for multitarget monitoring in mobile sensor networks
Peng et al. Cooperative area search for multiple UAVs based on RRT and decentralized receding horizon optimization
Chen et al. Cooperative distributed target tracking algorithm in mobile wireless sensor networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant