CN117848345A - Stepping type unmanned ship path planning method adopting optimization - Google Patents

Stepping type unmanned ship path planning method adopting optimization Download PDF

Info

Publication number
CN117848345A
CN117848345A CN202410029928.6A CN202410029928A CN117848345A CN 117848345 A CN117848345 A CN 117848345A CN 202410029928 A CN202410029928 A CN 202410029928A CN 117848345 A CN117848345 A CN 117848345A
Authority
CN
China
Prior art keywords
node
depth
network model
new
path planning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202410029928.6A
Other languages
Chinese (zh)
Inventor
李梓甜
罗显涛
杨立鑫
刘畅
黄增鸿
徐雍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN202410029928.6A priority Critical patent/CN117848345A/en
Publication of CN117848345A publication Critical patent/CN117848345A/en
Pending legal-status Critical Current

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)

Abstract

The invention is suitable for the technical field of unmanned boats, and particularly relates to a stepping type unmanned boat path planning method adopting optimization. According to the invention, the depth information and the environment information are acquired through the sensor carried by the unmanned ship, and the depth information and the environment information are fused to obtain an environment map; converting the environment map into a grid map, establishing an initial dual-depth Q network model, and training the grid map as input of the initial dual-depth Q network model to obtain a dual-depth Q network model; acquiring current position information of an unmanned ship; taking the current position information as input of the dual-depth Q network model to carry out path planning to obtain a planned path; and carrying out local path planning according to the planned path and the environmental information acquired by the unmanned ship in real time. The invention effectively improves the path planning speed of the unmanned ship, and ensures that the unmanned ship can quickly find a correct and safe path in a complex environment.

Description

Stepping type unmanned ship path planning method adopting optimization
Technical Field
The invention is suitable for the technical field of unmanned boats, and particularly relates to a stepping type unmanned boat path planning method adopting optimization.
Background
The unmanned ship is a small offshore platform with environment sensing and autonomous navigation capabilities and capable of autonomously completing corresponding tasks. Based on the characteristics, the unmanned ship is required to have the capability of fast planning and comfortable navigation when the unmanned ship performs tasks in unknown sea areas, and the unmanned ship is helpful for completing corresponding work tasks.
The general global algorithm used in practical application of unmanned ships is an RRT algorithm, and the main function of the global algorithm is to plan a path as a reference path for local planning. For the conventional RRT algorithm, under the condition of quite complex environment, a great amount of time is required to search paths, and the planned paths have extreme conditions, such as V-shaped paths with extremely small included angles. The huge time overhead is detrimental to autonomous planning itself, which can result in the USV being stationary and the unmanned vessels (unmanned surface vessel, USV) drifting off the sea along the ocean current. The extreme path can cause extremely poor control effect of the USV, even the unmanned ship cannot be controlled, and autonomous navigation fails.
Therefore, a new step-by-step optimization unmanned ship path planning method is needed to solve the above technical problems.
Disclosure of Invention
The invention provides a stepping type unmanned ship path planning method, which aims to improve the path planning speed of an unmanned ship, improve algorithm efficiency and ensure that the unmanned ship can quickly find a correct and safe path in a complex environment.
The stepping optimization unmanned ship path planning method comprises the following steps:
s1, acquiring depth information and environment information through a sensor carried by an unmanned ship, and fusing the depth information and the environment information to obtain an environment map; wherein the environment map includes obstacle position information and passable area information;
s2, converting the environment map into a grid map, establishing an initial dual-depth Q network model, and training the grid map as input of the initial dual-depth Q network model to obtain a dual-depth Q network model;
s3, acquiring current position information of the unmanned ship;
s4, taking the current position information as input of the dual-depth Q network model to carry out path planning to obtain a planned path;
s5, planning a local path according to the planned path and the environmental information acquired by the unmanned ship in real time.
Preferably, step S2 comprises the sub-steps of:
s21, setting parameters of the initial dual depth Q network model;
s22, inputting the grid map into the initial dual depth Q network model for training, and obtaining the dual depth Q network model.
Preferably, the parameters of the initial dual depth Q network model include a learning rate, a discount factor, an activation function, and an action space.
Preferably, step S22 comprises the sub-steps of:
s221, setting a starting point and different target points in the grid map;
s222, inputting the grid map into the initial dual depth Q network model and training through an RMSProp algorithm.
Preferably, the step S4 includes the following substeps:
s41, converting the current position information into grid coordinate information, and inputting the grid coordinate information into the dual depth Q network model to obtain a sampling action;
s42, randomly sampling and taking points in the preset range of the adopted action to obtain sampling points;
s43, collision detection is carried out according to the sampling points through a preset calculation formula, if yes, the sampling points are used as new nodes, and father nodes of the new nodes are selected according to a preset method; if not, returning to the step S42;
s44, judging whether the new node is a target point, if so, storing the new node; if not, the new node is stored and returned to step S42.
Preferably, in step S43, the preset calculation formula is:
wherein x and y represent the x-axis and y-axis coordinates of the sampling point, respectively; xl and yl represent the minimum boundaries of the x-axis and y-axis, respectively; xu and yu represent the maximum boundaries of the x-axis and y-axis, respectively; isfree (x, y) indicates whether the sampling point belongs to a non-obstacle region, if so, the return value is 1, and if not, the return value is 0.
Preferably, in step S43, the preset method is as follows:
selecting a node as a first node in a preset range of the new node, and defining a father node of the first node as a second node;
the first node and the second node form a first vector, the first node and the new node form a second vector, whether an included angle formed between the first vector and the second vector is smaller than 90 degrees is judged, and if yes, the first node is used as a father node of the new node.
Preferably, whether the new node is a target point satisfies the following relation:
wherein N is new Representing the new node; isgol represents a function of determining whether the new node is a target point; x is x new ,y new Representing the x-axis and y-axis coordinates of the new node, respectively, and goalx, goaly representing the x-axis and y-axis coordinates of the target point, respectively.
Compared with the prior art, the method has the advantages that the depth information and the environment information are acquired through the sensor carried by the unmanned ship, and the depth information and the environment information are fused to obtain the environment map; the environment map comprises barrier position information and passable area information; converting the environment map into a grid map, establishing an initial dual-depth Q network model, and training the grid map as input of the initial dual-depth Q network model to obtain a dual-depth Q network model; acquiring current position information of an unmanned ship; taking the current position information as input of the dual-depth Q network model to carry out path planning to obtain a planned path; and carrying out local path planning according to the planned path and the environmental information acquired by the unmanned ship in real time. Therefore, the invention effectively improves the path planning speed of the unmanned ship, and ensures that the unmanned ship can quickly find a correct and safe path in a complex environment.
Drawings
The present invention will be described in detail with reference to the accompanying drawings. The foregoing and other aspects of the invention will become more apparent and more readily appreciated from the following detailed description taken in conjunction with the accompanying drawings. In the accompanying drawings:
FIG. 1 is a flow chart of a stepwise approach to optimized unmanned boat path planning provided by an embodiment of the present invention;
FIG. 2 is a schematic diagram of a grid map using an optimized unmanned boat path planning method in a step-by-step manner, provided by an embodiment of the present invention;
FIG. 3 is a representation of an approximate Q representation of a stepwise approach to optimized unmanned boat path planning provided by an embodiment of the present invention;
FIG. 4 is a schematic path diagram of a step-by-step approach to optimized unmanned boat path planning provided by an embodiment of the present invention;
FIG. 5 is a sector diagram of current position information for a stepwise approach to optimized unmanned boat path planning provided by an embodiment of the present invention;
FIG. 6 is a vector diagram of a stepwise approach to optimized unmanned aerial vehicle path planning provided by an embodiment of the present invention;
fig. 7 is a parent node update diagram of a stepwise approach to optimized unmanned boat path planning provided by an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
Referring to fig. 1 to 7, the present invention provides a step-by-step optimization unmanned ship path planning method, which includes the following steps:
s1, acquiring depth information and environment information through a sensor carried by an unmanned ship, and fusing the depth information and the environment information to obtain an environment map; wherein the environment map includes obstacle position information and passable area information;
in the embodiment of the invention, the unmanned ship fuses the two parts of data by acquiring the depth information of the surrounding environment taken by the depth camera and the environment information scanned by the laser radar, so as to generate an environment map containing information such as obstacle positions, passable areas and the like used in training a network.
S2, converting the environment map into a grid map, establishing an initial dual-depth Q Network model, and training the grid map as input of the initial dual-depth Q Network model to obtain a dual-depth Q Network model (Double Deep Q-Network);
in an embodiment of the present invention, step S2 comprises the sub-steps of:
s21, setting parameters of the initial dual depth Q network model; parameters of the initial dual depth Q network model include a learning rate, a discount factor, an activation function, and an action space.
S22, inputting the grid map into the initial dual depth Q network model for training, and obtaining the dual depth Q network model.
Step S22 comprises the following sub-steps:
s221, setting a starting point and different target points in the grid map;
s222, inputting the grid map into the initial dual depth Q network model and training through an RMSProp algorithm.
Specifically, the dual depth Q network model includes a Q network and a q_target network, the network structure of the Q network and the q_target is a five-layer neural network including 3 hidden layers, the number of neurons of the hidden layers is 20, 40 and 20, the learning rate is 0.0001, the discount factor γ is 0.9, the activation function is shown in formula 1, the greedy strategy (epsilon-greedy) adopted in the training process is shown in formula 2, the action space includes 8 actions of up, down, left, right, up left, down left, up right, down right and the like, and the actions are respectively expressed as D1-D8 by symbols, as shown in formula 3, and the reward function R is shown in formula 4.
Equation 1:
equation 2:
equation 3:
equation 4:
on a grid map, n is selected (n is determined by the actual map), 4 different target points are selected, as shown in fig. 2, n is 18, gray points in the second row and the second column from the left are all starting points, and another gray point is the target point. The RMSProp algorithm is used to train the dual depth Q network model.
After the 4 training data are input into the dual-depth Q network model for training, the Q value calculation in the training process is shown in formula 5, and a specific Q value table needs to be calculated in actual training, for example, the approximate Q value table corresponding to the first graph in fig. 2 is shown in fig. 3, the larger the Q value is, the greater the selection possibility is indicated, and it can be seen from fig. 4 that the value corresponding to the Q table meets the requirement of actual planning.
Equation 5: q (s, a) =r+γ q_target (s ,argmax_a Q(s ,a;θ);θ )
Where Q (s, a) represents the Q value of the selected action a in state s, r is the prize obtained after execution of action a, s 'is the next state entered after execution of action a, γ is a discount factor for adjusting the importance of future prizes, q_target represents the output of the target Q network, argmax_a represents the action selected to have the largest Q value, θ is the parameter of the current policy network (Q network), and θ' is the parameter of the target network (q_target network).
The approximate Q table obtained by the steps can obtain the optimal path from the starting point to the end point, as shown in fig. 4, the well-trained network can obtain correct and effective actions according to the current position information of the unmanned ship, and meets the requirement of guiding planning and sampling.
S3, acquiring current position information of the unmanned ship;
s4, taking the current position information as input of the dual-depth Q network model to carry out path planning to obtain a planned path;
in the embodiment of the present invention, the step S4 includes the following substeps:
s41, converting the current position information into grid coordinate information, and inputting the grid coordinate information into the dual depth Q network model to obtain a sampling action;
s42, randomly sampling and taking points in the preset range of the adopted action to obtain sampling points;
s43, collision detection is carried out according to the sampling points through a preset calculation formula, if yes, the sampling points are used as new nodes, and father nodes of the new nodes are selected according to a preset method; if not, returning to the step S42;
in step S43, the preset calculation formula is:
wherein x and y represent the x-axis and y-axis coordinates of the sampling point, respectively; xl and yl represent the minimum boundaries of the x-axis and y-axis, respectively; xu and yu represent the maximum boundaries of the x-axis and y-axis, respectively; isfree (x, y) indicates whether the sampling point belongs to a non-obstacle region, if so, the return value is 1, and if not, the return value is 0.
Selecting a node as a first node in a preset range of the new node, and defining a father node of the first node as a second node;
the first node and the second node form a first vector, the first node and the new node form a second vector, whether an included angle formed between the first vector and the second vector is smaller than 90 degrees is judged, and if yes, the first node is used as a father node of the new node.
S44, judging whether the new node is a target point, if so, storing the new node; if not, the new node is stored and returned to step S42.
Whether the new node is a target point satisfies the following relation:
wherein N is new Representing the new node; isgol represents a function of determining whether the new node is a target point; x is x new ,y new Representing the x-axis and y-axis coordinates of the new node, respectively, and goalx, goaly representing the x-axis and y-axis coordinates of the target point, respectively.
Specifically, the defined and selected action is denoted as D, Q is an action evaluation value calculated by a network, the input value is in, the output value is out, epsilon in the formula 2 is 0.1, the current position information of the unmanned ship is N (x, y), and a new node (sampling point) N new (x new ,y new ) The sampling action of the DOUBLE DQN output is denoted as D and the sector radius is L (which can be designed according to actual needs).
Container k is the container of the storage path; container q is the container storing the midpoint; the container w is a container storing nodes with an angle larger than 90 degrees;
the target point (endpoint) coordinates are gol (golx, goly).
The minimum boundary xy coordinates of the map are (xl, yl), the maximum boundary xy coordinates are (xu, yu), and the starting point is first stored in the container k at the beginning of the planning.
The current position information N (x, y) is converted into grid coordinates N (x) according to formula 7 nn ,y nn ) Thereafter, N (x) nn ,y nn ) The method is input into a dual depth Q network model to obtain sampling actions D, taking N (x, y) as a center, taking the direction D as a center line, expanding left and right by 30 degrees to form a sector with a radius L (the size of L is determined according to actual needs), and forming an angle of 60 degrees, wherein the starting angle of the sector is θ1=θ0-30 degrees, the ending angle is θ2=θ0+30 degrees, wherein θ0 is an included angle between an NN-type vector and an x-axis, and recording that a point corresponding to the distance L from the direction D is N (x, y) is N (x 1, y 1), as shown in fig. 5. Randomly generating an angle θ between the start angle and the end angle according to equation 6 r Then randomly taking the points according to the formula 8 to obtain a sampling point N new (x new ,y new )。
Equation 6: θ r =θ1+random()*(θ2-θ1)
Equation 7: x is x nn =round((x*R x +y*R y )/X s )
y nn =round((x*R y +y*R z )/Y s )
Wherein R is x 、R y 、R z Is the rotation moment, X of radar scan data s 、Y s Is the resolution of the scan map, round (x) represents rounding.
Equation 8:
wherein u1 and u2 are different uniform random numbers between [0,1 ];
obtaining a sampling point N according to step S42 new For N new Performing collision detection according to formula 9, if not, returning to step S42, and updating N new Coordinate values of (2); if the collision detection condition is met, the sampling point is the new node N new Then further N new Selecting a father node, screening the father node according to the following comfort level conditions (namely a preset method), and selecting a father node N which accords with the selection newparent (xp,yp)。
Equation 9:
the comfort conditions are specifically as follows:
the main evaluation index of the comfort level in the invention is that the included angle between two vectors formed by every three points is not smaller than 90 degrees, and the method is mainly used for avoiding the condition that the path has extreme curvature, and meanwhile, the generated path can not meet the added dynamic constraint and model constraint.
With new node N new (x new ,y new ) In the center, a circle is drawn with a radius r, and all tree nodes in the circle are set F (node 1..node n) assuming that there are n nodes in the circle i .. node n).
Set node n in circle i (x i ,y i ) For point B (xb, yb), point B being the first node, parent node N of point B preparent (xp, yp) is point A (xa, ya), point A being the second node, new node N new (x new ,y new ) For point C, the BA vector and the BC vector (i.e., the first vector and the second vector) are formed, and the angle between the BA and the BC vector is recorded asAs shown in FIG. 6, all nodes of set F are traversed again, corresponding +.>Into the container w.
The BC wire in the rejecting container w collides with the obstacleA kind of electronic device
Traversing the container w to obtain the maximum angle in the containerThe corresponding coordinate is n i (x i ,y i )。
If it isIf the angle is larger than 90 degrees, selecting a node n i (x i ,y i ) For new node N new (x new ,y new ) Is the parent node N of (1) newparent (xp, yp), delete n i (x i ,y i ) Is used for guaranteeing the uniqueness of the child node of each node.
If it isIf the angle is smaller than 90 degrees, taking the node n i (x i ,y i ) The midpoint on the midpoint connection of the corresponding BA and BC vectors is taken as a new point B, the value of point B is updated to be the value of the midpoint, and the updated point B is stored in the container q until the angle is greater than 90 °, as shown in fig. 7. If a proper midpoint is successfully found, the midpoint is selected as the current new node N new (x new ,y new ) The parent node of the corresponding point A and the child node of the corresponding point B are deleted, and meanwhile, the container q is emptied; if the point meeting the condition cannot be found, taking the point N with the angle closest to 90 DEG in the container q mid As a new node N new (x new ,y new ) Is the parent node N of (1) newparent (xp, yp) while pointing the child node corresponding to point A to N mid ,N new Is directed to N by the parent node of (2) mid The uniqueness of the child node of each node is guaranteed.
The new node N obtained according to the above new Judging whether the node is a target node according to a formula 10, if so, storing the new node and the terminal point into the node in turnIn container k, the search is ended; if not, storing the new node into the container k, and then jumping back to the step S42 to continue execution until the new node is the target node, and finding out the path to finish searching.
Equation 10:
s5, planning a local path according to the planned path and the environmental information acquired by the unmanned ship in real time.
In the embodiment of the invention, according to the path planned in the step S4 and the environmental information transmitted by the sensor in real time, the information of the path planned in the step S4 and the environmental information are transmitted to a local planning part for further local planning, and then the generated control instruction is transmitted to a control system part for driving the unmanned ship to run according to the planned track.
Compared with the prior art, the method has the advantages that the depth information and the environment information are acquired through the sensor carried by the unmanned ship, and the depth information and the environment information are fused to obtain the environment map; the environment map comprises barrier position information and passable area information; converting the environment map into a grid map, establishing an initial dual-depth Q network model, and training the grid map as input of the initial dual-depth Q network model to obtain a dual-depth Q network model; acquiring current position information of an unmanned ship; taking the current position information as input of the dual-depth Q network model to carry out path planning to obtain a planned path; and carrying out local path planning according to the planned path and the environmental information acquired by the unmanned ship in real time. Therefore, the invention effectively improves the path planning speed of the unmanned ship, and ensures that the unmanned ship can quickly find a correct and safe path in a complex environment.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
While the embodiments of the present invention have been illustrated and described in connection with the drawings, what is presently considered to be the most practical and preferred embodiments of the invention, it is to be understood that the invention is not limited to the disclosed embodiments, but on the contrary, is intended to cover various equivalent modifications and equivalent arrangements included within the spirit and scope of the appended claims.

Claims (8)

1. The step-by-step optimization unmanned ship path planning method is characterized by comprising the following steps of:
s1, acquiring depth information and environment information through a sensor carried by an unmanned ship, and fusing the depth information and the environment information to obtain an environment map; wherein the environment map includes obstacle position information and passable area information;
s2, converting the environment map into a grid map, establishing an initial dual-depth Q network model, and training the grid map as input of the initial dual-depth Q network model to obtain a dual-depth Q network model;
s3, acquiring current position information of the unmanned ship;
s4, taking the current position information as input of the dual-depth Q network model to carry out path planning to obtain a planned path;
s5, planning a local path according to the planned path and the environmental information acquired by the unmanned ship in real time.
2. The stepwise approach to optimized unmanned boat path planning method according to claim 1, wherein step S2 comprises the sub-steps of:
s21, setting parameters of the initial dual depth Q network model;
s22, inputting the grid map into the initial dual depth Q network model for training, and obtaining the dual depth Q network model.
3. The stepwise approach to optimized unmanned boat path planning method of claim 2, wherein the parameters of the initial dual depth Q network model include learning rate, discount factor, activation function, and action space.
4. The stepwise approach to optimized unmanned boat path planning method according to claim 2, wherein step S22 comprises the sub-steps of:
s221, setting a starting point and different target points in the grid map;
s222, inputting the grid map into the initial dual depth Q network model and training through an RMSProp algorithm.
5. The stepwise approach to optimized unmanned boat path planning method according to claim 1, wherein step S4 comprises the sub-steps of:
s41, converting the current position information into grid coordinate information, and inputting the grid coordinate information into the dual depth Q network model to obtain a sampling action;
s42, randomly sampling and taking points in the preset range of the adopted action to obtain sampling points;
s43, collision detection is carried out according to the sampling points through a preset calculation formula, if yes, the sampling points are used as new nodes, and father nodes of the new nodes are selected according to a preset method; if not, returning to the step S42;
s44, judging whether the new node is a target point, if so, storing the new node; if not, the new node is stored and returned to step S42.
6. The stepping adoption optimization unmanned ship path planning method according to claim 5, wherein in step S43, the preset calculation formula is:
wherein x and y represent the x-axis and y-axis coordinates of the sampling point, respectively; xl and yl represent the minimum boundaries of the x-axis and y-axis, respectively; xu and yu represent the maximum boundaries of the x-axis and y-axis, respectively; isfree (x, y) indicates whether the sampling point belongs to a non-obstacle region, if so, the return value is 1, and if not, the return value is 0.
7. The stepping adoption optimization unmanned ship path planning method according to claim 5, wherein in step S43, the preset method is as follows:
selecting a node as a first node in a preset range of the new node, and defining a father node of the first node as a second node;
the first node and the second node form a first vector, the first node and the new node form a second vector, whether an included angle formed between the first vector and the second vector is smaller than 90 degrees is judged, and if yes, the first node is used as a father node of the new node.
8. The stepwise approach to optimized unmanned boat path planning of claim 5, wherein whether the new node is a target point satisfies the following relationship:
wherein N is new Representing the new node; isgol represents a function of determining whether the new node is a target point; x is x new ,y new Representing x-axis and y-axis coordinates, each representing the new node, each of the golx and goly representing the targetThe x-axis and y-axis coordinates of the point.
CN202410029928.6A 2024-01-08 2024-01-08 Stepping type unmanned ship path planning method adopting optimization Pending CN117848345A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202410029928.6A CN117848345A (en) 2024-01-08 2024-01-08 Stepping type unmanned ship path planning method adopting optimization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202410029928.6A CN117848345A (en) 2024-01-08 2024-01-08 Stepping type unmanned ship path planning method adopting optimization

Publications (1)

Publication Number Publication Date
CN117848345A true CN117848345A (en) 2024-04-09

Family

ID=90545839

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202410029928.6A Pending CN117848345A (en) 2024-01-08 2024-01-08 Stepping type unmanned ship path planning method adopting optimization

Country Status (1)

Country Link
CN (1) CN117848345A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109540136A (en) * 2018-10-25 2019-03-29 广东华中科技大学工业技术研究院 A kind of more unmanned boat collaboration paths planning methods
CN113657292A (en) * 2021-08-19 2021-11-16 东南大学 Vehicle automatic tracking driving method based on deep reinforcement learning
US20220214688A1 (en) * 2021-01-06 2022-07-07 Shanghai University Method and system for controlling multi-unmanned surface vessel collaborative search
CN114942643A (en) * 2022-06-17 2022-08-26 华中科技大学 Construction method and application of USV unmanned ship path planning model
CN115033000A (en) * 2022-07-06 2022-09-09 重庆大学 Dual-target path planning method based on deep reinforcement learning
CN116952239A (en) * 2023-07-06 2023-10-27 大连海事大学 Unmanned ship path planning method based on fusion of improved A and DWA
CN117193296A (en) * 2023-09-05 2023-12-08 广东工业大学 Improved A star unmanned ship path planning method based on high safety

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109540136A (en) * 2018-10-25 2019-03-29 广东华中科技大学工业技术研究院 A kind of more unmanned boat collaboration paths planning methods
US20220214688A1 (en) * 2021-01-06 2022-07-07 Shanghai University Method and system for controlling multi-unmanned surface vessel collaborative search
CN113657292A (en) * 2021-08-19 2021-11-16 东南大学 Vehicle automatic tracking driving method based on deep reinforcement learning
CN114942643A (en) * 2022-06-17 2022-08-26 华中科技大学 Construction method and application of USV unmanned ship path planning model
CN115033000A (en) * 2022-07-06 2022-09-09 重庆大学 Dual-target path planning method based on deep reinforcement learning
CN116952239A (en) * 2023-07-06 2023-10-27 大连海事大学 Unmanned ship path planning method based on fusion of improved A and DWA
CN117193296A (en) * 2023-09-05 2023-12-08 广东工业大学 Improved A star unmanned ship path planning method based on high safety

Similar Documents

Publication Publication Date Title
CN111780777B (en) Unmanned vehicle route planning method based on improved A-star algorithm and deep reinforcement learning
CN112241176B (en) Path planning and obstacle avoidance control method of underwater autonomous vehicle in large-scale continuous obstacle environment
Hu et al. A multiobjective optimization approach for COLREGs-compliant path planning of autonomous surface vehicles verified on networked bridge simulators
CN108803321B (en) Autonomous underwater vehicle track tracking control method based on deep reinforcement learning
CN108873687B (en) Intelligent underwater robot behavior system planning method based on deep Q learning
CN109784201B (en) AUV dynamic obstacle avoidance method based on four-dimensional risk assessment
CN109765890B (en) Multi-USV group collaborative collision avoidance planning method based on genetic algorithm
CN112650246B (en) Ship autonomous navigation method and device
CN114625151A (en) Underwater robot obstacle avoidance path planning method based on reinforcement learning
CN116225016A (en) Multi-agent path planning method based on distributed collaborative depth reinforcement learning model
Du et al. An optimized path planning method for coastal ships based on improved DDPG and DP
Yan et al. Reinforcement Learning‐Based Autonomous Navigation and Obstacle Avoidance for USVs under Partially Observable Conditions
Qu et al. Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment
Blekas et al. RL-based path planning for an over-actuated floating vehicle under disturbances
Yang et al. Autonomous UAV navigation in dynamic environments with double deep Q-networks
Sundarraj et al. Route Planning for an Autonomous Robotic Vehicle Employing a Weight-Controlled Particle Swarm-Optimized Dijkstra Algorithm
CN113064422B (en) Autonomous underwater vehicle path planning method based on double neural network reinforcement learning
Gao et al. An Optimized Path Planning Method for Container Ships in Bohai Bay Based on Improved Deep Q-Learning
Zhang et al. Intelligent vector field histogram based collision avoidance method for auv
Lin et al. Robust unmanned surface vehicle navigation with distributional reinforcement learning
CN117848345A (en) Stepping type unmanned ship path planning method adopting optimization
Palacios-Morocho et al. Multipath planning acceleration method with double deep R-learning based on a genetic algorithm
Zare et al. Continuous control with deep reinforcement learning for autonomous vessels
Kanakakis et al. Evolutionary path planning and navigation of autonomous underwater vehicles
Artusi et al. Path planning for a maritime suface ship based on Deep Reinforcement Learning and weather data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination