CN112650246B - Ship autonomous navigation method and device - Google Patents

Ship autonomous navigation method and device Download PDF

Info

Publication number
CN112650246B
CN112650246B CN202011535524.2A CN202011535524A CN112650246B CN 112650246 B CN112650246 B CN 112650246B CN 202011535524 A CN202011535524 A CN 202011535524A CN 112650246 B CN112650246 B CN 112650246B
Authority
CN
China
Prior art keywords
ship
action
value
navigation
track
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011535524.2A
Other languages
Chinese (zh)
Other versions
CN112650246A (en
Inventor
杨杰
刘今栋
尚午晟
梁奇
韦港文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University of Technology WUT
Original Assignee
Wuhan University of Technology WUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University of Technology WUT filed Critical Wuhan University of Technology WUT
Priority to CN202011535524.2A priority Critical patent/CN112650246B/en
Publication of CN112650246A publication Critical patent/CN112650246A/en
Application granted granted Critical
Publication of CN112650246B publication Critical patent/CN112650246B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/02Control of position or course in two dimensions
    • G05D1/0206Control of position or course in two dimensions specially adapted to water vehicles
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Traffic Control Systems (AREA)
  • Control Of Position, Course, Altitude, Or Attitude Of Moving Bodies (AREA)

Abstract

The invention relates to a ship autonomous navigation method, which comprises the following steps: constructing a ship motion model, and constructing a navigation water area model based on a navigation water area map; based on the navigation water area model, performing global flight path planning by adopting an ant colony optimization algorithm to generate a global reference flight path; based on the ship motion model, performing local risk collision avoidance planning by adopting a Q-learning algorithm to generate a real-time risk collision avoidance strategy; and realizing the autonomous navigation of the ship by combining the global reference track and the local risk collision avoidance strategy. The invention can realize real-time and accurate autonomous navigation when the ship navigates in a large water area range.

Description

Ship autonomous navigation method and device
Technical Field
The invention relates to the technical field of ship control and decision, in particular to a ship autonomous navigation method, a ship autonomous navigation device and a computer storage medium.
Background
With the vigorous development of high and new technologies such as artificial intelligence and the internet of things, the ship is highly valued by researchers as a carrier widely applied to cargo circulation, military cruising and public traffic. Because the motion of the ship is different from land and air vehicles, the ship is greatly influenced by the environment, and a great deal of uncertainty and risk exist in navigation. In order to realize the normal navigation of the ship, two key problems of global track planning and local risk collision avoidance of the ship need to be solved.
For the global track planning problem, various algorithms are currently applied to the research of autonomous navigation of ships, such as Dijkstra algorithm, a-star algorithm, artificial potential field method, particle swarm optimization algorithm, ant colony optimization algorithm, and the like. However, these classical offline methods need to acquire complete environment prior knowledge in advance, which is difficult to be implemented completely at present, and meanwhile, the difficulty of environment modeling is increased, so that the method is not suitable for real-time autonomous navigation of a ship when used alone.
For the problem of local risk collision avoidance, the mainstream method focuses on modeling the environment in real time. For example, complexity clustering is performed by analyzing obstacle information in a certain range of a ship body, dynamic obstacle avoidance is realized by combining an ant colony optimization algorithm, the method has excellent performance on obstacle avoidance, and when a starting point is far away from a target, redundant tracks are easily generated. Other researchers weaken the complexity of environment modeling by adopting a trial and error method, take the influence of sea stormy waves into consideration, and provide an autonomous navigation algorithm based on Q-learning, so that intelligent collision avoidance of the robot in an unknown environment is realized; an artificial potential field method is introduced to the Q-learning algorithm, so that the convergence speed of the Q-learning algorithm is further improved. However, the grid-based Q-learning algorithm is greatly affected by the size of the water area, and autonomous navigation in a large water area will increase track randomness, which is not favorable for track convergence.
Disclosure of Invention
In view of this, it is necessary to provide a ship autonomous navigation method and apparatus, so as to solve the problems that global track planning cannot be applied to real-time navigation, local risk avoidance is not applicable to large water areas, and track convergence is slow.
The invention provides an autonomous navigation method for a ship, which comprises the following steps:
constructing a ship motion model, and constructing a navigation water area model based on a navigation water area map;
based on the sailing water area model, carrying out global track planning by adopting an ant colony optimization algorithm to generate a global reference track;
based on the ship motion model, performing local risk collision avoidance planning by adopting a Q-learning algorithm to generate a real-time risk collision avoidance strategy;
and combining the global reference track and the local risk collision avoidance strategy to realize autonomous navigation of the ship.
Further, constructing a ship motion model, specifically:
establishing a ship model;
the motion of the ship is decomposed into translation along with a central point and rotation around the central point, and the motion of the ship is simplified into motion on a horizontal plane, so that the kinematic description of the ship is obtained;
setting a steering threshold value, and restraining the steering of the ship;
setting a safe area of the ship by taking the central point as a circle center and the safe distance as a radius;
and combining the ship model, the kinematics description, the steering threshold value and the safety region to obtain the ship motion model.
Further, constructing a navigation water area model based on the navigation water area map specifically comprises:
acquiring a navigation water area map, and carrying out edge extraction on obstacles in the navigation water area map;
extracting convex hulls of edges by using a Graham scanning method, setting a minimum side length threshold value and a minimum perimeter threshold value, and filtering the convex hulls of which the side length is smaller than the minimum side length threshold value or the perimeter is smaller than the minimum perimeter threshold value;
zooming the convex hulls equidistantly to enable the convex hulls to completely surround corresponding obstacles;
constructing an overall connected graph of the MAKLINK by the peak of the convex hull, and selecting the middle points, the starting points and the end points of all MAKINK lines in the overall connected graph of the MAKLINK as network nodes;
and connecting adjacent network nodes to obtain a undirected network diagram of the trajectory plan, and taking the undirected network diagram as the sailing water area model.
Further, based on the navigation water area model, performing global flight path planning by adopting an ant colony optimization algorithm to generate a global reference flight path, specifically:
the navigation water area model is a directionless network diagram, and the directionless network diagram comprises a plurality of connecting lines;
acquiring a starting point and an end point, and constructing an initial track from the starting point to the end point based on the undirected network graph;
establishing an objective function of node optimization:
Figure BDA0002852977950000031
wherein the content of the first and second substances,
Figure BDA0002852977950000032
is the length of the flight path,Length(V i (h i ),V i+1 (h i+1 ) Denotes the distance, V, between the ith node and the (i + 1) th node in the track i (h i ) Any point on the ith connecting line which represents the flight path passes through is marked as the ith node in the flight path, V i+1 (h i+1 ) Representing any point on the (i + 1) th connecting line passed by the flight path, and marking as the (i + 1) th node in the flight path, d is the number of the connecting lines passed by the flight path, h i Is a proportionality coefficient of h i ∈[0,1];
The proportionality coefficient being used to adjust V i (h i ) Specific position on the ith connecting line:
V i (h i )=V i (0) +h i (V i (1) -V i (0) );
wherein, V i (0) 、V i (1) Two end points of the ith connecting line through which the flight path passes are respectively arranged;
obtaining a plurality of paths by adjusting the proportional coefficients to form a solution set of feasible paths;
and optimizing the feasible path by adopting an ant colony optimization algorithm to obtain a shortest path as the global reference track.
Further, optimizing the feasible path by adopting an ant colony optimization algorithm to obtain a shortest path as the global reference track, specifically:
constructing a node set of a path according to the undirected network graph, and initializing parameters of an ant colony optimization algorithm;
selecting a next path node for each ant from the node set according to the pheromone;
performing single step updating on the corresponding path selected by the ants:
τ ij (t+1)=(1-ρ)τ ij (t)+ρτ 0
wherein, tau ij (t) denotes the pheromone between the ith node on the previous connection line to the jth node on the next connection line, τ ij (t + 1) is for τ ij (t) after the updateOf the pheromone, tau 0 Is initial pheromone, rho is the volatilization coefficient of the pheromone, and rho belongs to [0,1 ]];
After each ant completes the path selection of one round, the shortest path is selected for pheromone round updating, and the pheromone round updating formula is as follows:
τ ij (t+1)=(1-ρ)τ ij (t)+ρΔτ ij (t);
Δτ ij (t)=1/L *
wherein L is * For the length of the selected path, Δ τ ij (t) represents the sum of pheromones released by all ants between the two points, and t represents the turn;
and checking whether a termination condition is reached, if so, outputting the shortest path in the paths taken by the ants as the global reference track, and otherwise, performing next path optimization.
Further, based on the model of the sailing water area, an initial track is constructed, specifically:
in the constructed undirected network graph, the midpoint of a MAKLINK line is used as an intermediate node, and a Dijkstra algorithm is adopted to search the shortest path from a starting point to an end point to be used as the initial track.
Further, based on the ship motion model, performing local risk collision avoidance planning by using a Q-learning algorithm to generate a real-time risk collision avoidance strategy, specifically:
initializing a state set, an action set and a reward strategy of a ship; the state set comprises a plurality of states of the ship, wherein the states comprise relative position information between obstacles in a safe area of the ship and a current course angle of the ship; the motion set comprises a plurality of corresponding motions of the ship in each state, and the motions comprise translation information and rotation information; the reward strategy comprises a collision condition and a target arrival condition which are fed back by interaction with the environment in the ship navigation process;
calculating the Q value of each action in the action set, and using the Q value to represent the action value;
selecting the action with the highest action value from the action set by adopting an epsilon-greedy strategy;
updating the Q value of each action in the current state by adopting a Q-learning algorithm:
Q(s t ,a t )←Q(s t ,a t )+α[r t +γmaxQ(s t+1 ,a t )-Q(s t ,a t )];
s t ←s t+1
wherein s is t Is in the current state, a t For the selected action, s t+1 To perform action a t Next state of last, r t The reward is fed back under the current state, alpha is the learning rate, gamma is the discount factor, Q(s) t ,a t ) Is a state s t Lower motion a t Q value of (1), Q(s) t+1 ,a t ) Represents a state s t+1 Lower motion a t Max denotes taking the maximum value, and ← denotes updating;
and judging whether the difference between the Q values before and after updating is smaller than a set difference value, if so, outputting the action with the highest Q value as the real-time risk collision avoidance strategy, and otherwise, updating the Q value next time.
Further, the vessel autonomous navigation is realized by combining the global reference track and the local risk collision avoidance strategy, and the method specifically comprises the following steps:
acquiring a starting point, a target point and the corresponding global reference track;
according to the nodes and the node sequence contained in the global reference track, sequentially setting each node as a child target point and navigating to each child target point in sequence;
acquiring barrier information in a current navigation area in real time in the navigation process;
judging whether the ship reaches a current target point, if so, further judging whether the current target point is an end point, if so, finishing navigation, if not, updating the current target point, and if not, turning to the next step;
and judging whether the barrier information is in a safe area of the ship, if so, generating a corresponding real-time risk collision avoidance strategy based on the barrier information, and carrying out risk avoidance according to the real-time risk collision avoidance strategy, otherwise, adjusting the course of the ship to navigate to a current target point, and continuously monitoring whether the barrier information is in the safe area of the ship.
The invention also provides a ship autonomous navigation device, which comprises a processor and a memory, wherein the memory is stored with a computer program, and the computer program is executed by the processor to realize the ship autonomous navigation method.
The present invention also provides a computer storage medium having a computer program stored thereon, which, when executed by a processor, implements the ship autonomous navigation method.
Has the advantages that: the invention provides a ship autonomous navigation method based on an ant colony optimization algorithm and Q-learning, which comprises the steps of firstly constructing a ship motion model, and constructing a navigation water area model based on an original map; then designing a global flight path planning strategy based on an ant colony optimization algorithm, generating a global reference flight path, extracting flight path segmentation nodes, and reducing path randomness caused by Q-learning in subsequent local risk collision avoidance; then designing a local risk collision avoidance strategy based on Q-learning, and guiding real-time risk collision avoidance; finally, combining the global track planning and the local risk collision avoidance strategy to generate an autonomous navigation track based on the navigation water area; according to the difference of the actual model, the model parameters are properly adjusted according to conditions so as to be transplanted to the existing ship autonomous navigation control. The local risk collision avoidance strategy is combined on the basis of global track planning, so that the method can be used for real-time navigation, and meanwhile, the local risk collision avoidance strategy is implemented on the basis of the global track planning, so that the method can be suitable for large water areas.
Drawings
FIG. 1 is a flowchart of a first embodiment of a method for autonomous navigation of a ship according to the present invention;
FIG. 2 is a diagram of a ship motion model according to a first embodiment of the autonomous navigation method for a ship provided by the present invention;
FIG. 3 is a schematic view of a sailing water area of a first embodiment of the autonomous navigation method for a ship provided by the present invention;
FIG. 4 is a diagram of a model of a sailing water area of a first embodiment of the autonomous navigation method for a ship according to the present invention;
FIG. 5a is a length convergence curve in the global reference track optimizing process of the first embodiment of the autonomous navigation method for a ship according to the present invention;
FIG. 5b is a schematic view of a global reference track of the autonomous navigation method for a ship according to the first embodiment of the present invention;
fig. 6a is an accumulated reward convergence curve in the optimization process of the real-time risk collision avoidance strategy of the autonomous navigation method of a ship according to the first embodiment of the present invention;
fig. 6b is a schematic diagram of a real-time risk collision avoidance strategy according to the first embodiment of the autonomous navigation method for a ship provided by the present invention;
FIG. 7a is a diagram illustrating an autonomous navigation track obtained by a first embodiment of the method for autonomous navigation of a ship under a first navigation condition according to the present invention;
fig. 7b is an autonomous navigation track obtained by the first embodiment of the autonomous navigation method for a ship under the second navigation condition.
Detailed Description
The accompanying drawings, which are incorporated in and constitute a part of this application, illustrate preferred embodiments of the invention and together with the description, serve to explain the principles of the invention and not to limit the scope of the invention.
Example 1
As shown in fig. 1, embodiment 1 of the present invention provides a ship autonomous navigation method, including the steps of:
s1, constructing a ship motion model, and constructing a navigation water area model based on a navigation water area map;
s2, performing global flight path planning by adopting an ant colony optimization algorithm based on the navigation water area model to generate a global reference flight path;
s3, based on the ship motion model, performing local risk collision avoidance planning by adopting a Q-learning algorithm to generate a real-time risk collision avoidance strategy;
and S4, realizing autonomous navigation of the ship by combining the global reference track and the local risk collision avoidance strategy.
The embodiment provides a ship autonomous navigation method based on an ant colony optimization algorithm and Q-learning, and the method comprises the steps of firstly, constructing a ship motion model, and constructing a navigation water area model based on an original map; then designing a global flight path planning strategy based on an ant colony optimization algorithm, generating a global reference flight path, extracting flight path segmentation nodes, and reducing path randomness caused by Q-learning in subsequent local risk collision avoidance; then designing a local risk collision avoidance strategy based on Q-learning, and guiding real-time risk collision avoidance; finally, combining the global track planning and the local risk collision avoidance strategy to generate an autonomous navigation track based on the navigation water area; according to the difference of the actual model, the model parameters are properly adjusted according to conditions so as to be transplanted to the existing ship autonomous navigation control.
The ship autonomous navigation method based on the ant colony optimization algorithm and Q-learning can provide effective reference for ship autonomous navigation control.
Preferably, a ship motion model is constructed, specifically:
establishing a ship model;
the motion of the ship is decomposed into translation along with a central point and rotation around the central point, and the motion of the ship is simplified into motion on a horizontal plane, so that the kinematic description of the ship is obtained;
setting a steering threshold value, and restraining the steering of the ship;
setting a safe area of the ship by taking the central point as a circle center and the safe distance as a radius;
and combining the ship model, the kinematics description, the steering threshold value and the safety region to obtain the ship motion model.
When modeling a ship, firstly reducing the dimension of the ship motion, and decomposing the motion of the ship into two parts: the two parts of motions are superposed to form six-degree-of-freedom kinematic description of the ship along with the translation (including longitudinal movement, transverse movement and heaving) and the rotation (including rolling, longitudinal movement and yawing) around the central point. In the problem of navigation planning, the motion of a ship on a horizontal plane is described, so that the problem of autonomous navigation is simplified into a problem of two-dimensional path planning on the horizontal plane;
setting a steering threshold δ max Since the motion of the ship is not simple particle motion and a certain constraint condition is applied to the motion, the steering angle δ of the ship is made smaller than the threshold δ max
A safety zone of the ship is set, and a certain safety margin is reserved for the ship due to the safety of the running of the ship. Therefore, a circle with the center point as the circle center and the safe distance as the radius wraps the whole ship body, and when the minimum distance between the barrier and the center point is smaller than the safe distance, the ship body is considered to have collision danger.
Specifically, fig. 2 shows the ship motion model established by the present embodiment, in which: XYZO denotes a navigation water area coordinate system, XYZO denotes a ship coordinate system, a solid line denotes a ship model, a dotted line denotes a safety area, and l denotes a safety distance.
Preferably, the navigation water area model is constructed based on a navigation water area map, specifically:
acquiring a navigation water area map, and carrying out edge extraction on obstacles in the navigation water area map;
extracting a convex hull of the edge by using a Graham scanning method, setting a minimum side length threshold value and a minimum perimeter threshold value, and filtering the convex hull of which the side length is less than the minimum side length threshold value or the perimeter is less than the minimum perimeter threshold value;
zooming the convex hulls equidistantly to enable the convex hulls to completely surround corresponding obstacles;
constructing an overall connected graph of the MAKLINK by the peak of the convex hull, and selecting the middle points, the starting points and the end points of all MAKINK lines in the overall connected graph of the MAKLINK as network nodes;
and connecting adjacent network nodes to obtain a undirected network diagram of the trajectory plan, and taking the undirected network diagram as the sailing water area model.
Specifically, as shown in fig. 3, in the embodiment of the present invention, a map of a sailing water area is extracted from a height map as an original map, a navigable area is represented as gray, an obstacle area is represented as white, and then a small obstacle that cannot be directly obtained from the map by a ship is artificially added and marked as black.
The original map is preprocessed by adopting methods of edge extraction, convex hull extraction and convex polygon fitting, so that large obstacles are simplified, small obstacles are filtered, an MAKLINK global connectivity graph is constructed, and a water area environment model is simplified;
the embodiment uses Canny operator to carry out edge extraction on the map of the navigation water area. The edge extraction specifically comprises the following steps: smoothing the map of the navigation water area by using a Gaussian filter, calculating the gradient amplitude and the direction of the map of the navigation water area by using first-order partial derivative finite difference, performing non-maximum suppression on the gradient amplitude, and detecting and connecting edges by using a dual-threshold algorithm.
Extracting a convex hull by using a Graham scanning method, setting the minimum side length threshold of a convex polygon (namely the convex hull) as 1/32 of the side length of a map, setting the minimum perimeter threshold as 1/16 of the side length of the map, filtering out small islands and reefs through the minimum side length threshold and the minimum perimeter threshold, and abstracting a large island into the convex polygon;
carrying out equidistant zooming on the convex polygon to enable the convex polygon to completely surround the barrier;
constructing a MAKLINK global connected graph by convex polygon vertexes: selecting the middle points, the starting points and the end points of all the MAKINK lines, and connecting the adjacent points to form a trajectory planning undirected network diagram, as shown in FIG. 4, the black filling blocks in FIG. 4 are convex polygons, the dotted lines are MAKINK lines, all the dotted lines form a MAKINK global connected diagram, the solid lines are connecting lines, and all the solid lines form the undirected network diagram.
Preferably, based on the model of the sailing water area, an ant colony optimization algorithm is adopted to perform global track planning, and a global reference track is generated, specifically:
the navigation water area model is a directionless network diagram, and the directionless network diagram comprises a plurality of connecting lines;
acquiring a starting point and an end point, and constructing an initial track from the starting point to the end point based on the undirected network graph;
establishing an objective function of node optimization:
Figure BDA0002852977950000091
wherein, the first and the second end of the pipe are connected with each other,
Figure BDA0002852977950000092
length (V) as track Length i (h i ),V i+1 (h i+1 ) Denotes the distance between the ith node and the (i + 1) th node in the track, V) i (h i ) Any point on the ith connecting line which represents the flight path passes through is marked as the ith node in the flight path, V i+1 (h i+1 ) Representing any point on the (i + 1) th connecting line passed by the flight path, and marking as the (i + 1) th node in the flight path, d is the number of the connecting lines passed by the flight path, h i Is a proportionality coefficient of h i ∈[0,1];
The proportionality coefficient being used to adjust V i (h i ) Specific position on the ith connecting line:
V i (h i )=V i (0) +h i (V i (1) -V i (0) );
wherein, V i (0) 、V i (1) Two end points of the ith connecting line through which the flight path passes are respectively;
obtaining a plurality of paths by adjusting the proportional coefficients to form a solution set of feasible paths;
and optimizing the feasible path by adopting an ant colony optimization algorithm to obtain a shortest path as the global reference track.
And constructing an initial track in the constructed undirected network graph.
Then, establishing a node optimization function:
marking the connecting line of the central node where the initial track passes as L i Let the end point of the connecting line be V i (0) 、V i (1) Then, any point on the connecting line can be expressed as:
V i (h i )=V i (0) +h i (V i (1) -V i (0) )
wherein h is i Is a proportionality coefficient of h i ∈[0,1],i=1,2…, d, d is the number of nodes passed through;
the node optimization function is established as follows:
Figure BDA0002852977950000101
wherein, length (V) i (h i ),V i+1 (h i+1 ) Represents the distance between two nodes, when i =0, V i (h i ) Denotes the starting point, when i = d, V i (h i ) Indicating an endpoint;
optimizing a track node: by adjusting the proportionality coefficient h i Forming a solution set (h) of feasible paths 1 ,h 2 ,…,h d ) Searching for optimal parameter set by adopting ant colony optimization algorithm, and collecting through proportionality coefficient (h) 1 ,h 2 ,…,h d ) And obtaining the optimized node position, and connecting the starting point, the terminal point and each node to form the global track of the ship.
Preferably, the ant colony optimization algorithm is adopted to optimize the feasible path, and the shortest path is obtained as the global reference track, specifically:
constructing a node set of a path according to the undirected network graph, and initializing parameters of an ant colony optimization algorithm;
selecting a next path node for each ant from the node set according to the pheromone;
and updating the pheromone on the corresponding path selected by the ant in a single step:
τ ij (t+1)=(1-ρ)τ ij (t)+ρτ 0
wherein, tau ij (t) represents the previous connecting line L k To the last connection line L k+1 Pheromone, τ, between the j-th nodes ij (t + 1) is for τ ij (t) pheromone, τ, after update 0 Is initial pheromone, rho is the volatilization coefficient of the pheromone, and rho belongs to [0,1 ]];
After each ant completes the path selection of one round, the shortest path is selected for pheromone round updating, and the pheromone round updating formula is as follows:
τ ij (t+1)=(1-ρ)τ ij (t)+ρΔτ ij (t);
Δτ ij (t)=1/L *
wherein L is * For the length of the selected path, Δ τ ij (t) represents the sum of pheromones released by all ants between the two points, and t represents the turn;
and checking whether a termination condition is reached, if so, outputting the shortest path in the paths taken by the ants as the global reference track, and otherwise, performing next path optimization.
Preferably, based on the model of the sailing water area, an initial track is constructed, specifically:
in the constructed undirected network graph, the midpoint of a MAKLINK line is used as an intermediate node, and a Dijkstra algorithm is adopted to search the shortest path from a starting point to an end point to be used as the initial track.
And (3) taking the middle point of the MAKLINK line as an intermediate node, and searching the shortest path from the starting point to the end point by adopting a Dijkstra algorithm. The embodiment combines the traditional Dijkstra algorithm and the ant colony optimization algorithm on the global flight path planning to enable the global flight path to tend to be shortest, reduces the redundancy of paths, and is beneficial to the transportation efficiency and the economic benefit of ships.
Preferably, based on the ship motion model, a Q-learning algorithm is adopted to perform local risk collision avoidance planning, and a real-time risk collision avoidance strategy is generated, specifically:
initializing a state set, an action set and a reward strategy of the ship; the state set comprises a plurality of states of the ship, wherein the states comprise relative position information between obstacles in a safe area of the ship and a current course angle of the ship; the motion set comprises a plurality of corresponding motions of the ship in each state, and the motions comprise translation information and rotation information; the reward strategy comprises a collision condition and a target arrival condition which are fed back by interaction with the environment in the ship navigation process;
calculating the Q value of each action in the action set, and using the Q value to represent the action value;
selecting the action with the highest action value from the action set by adopting an epsilon-greedy strategy;
updating the Q value of each action in the current state by adopting a Q-learning algorithm:
Q(s t ,a t )←Q(s t ,a t )+α[r t +γmaxQ(s t+1 ,a t )-Q(s t ,a t )];
s t ←s t+1
wherein s is t Is in the current state, a t For the selected action, s t+1 To perform action a t Next state of last, r t The reward is fed back under the current state, alpha is the learning rate, gamma is the discount factor, Q(s) t ,a t ) Is a state s t Lower motion a t Q value of (1), Q(s) t+1 ,a t ) Represents a state s t+1 Lower motion a t Max represents taking the maximum value, and ← represents updating;
and judging whether the difference between the Q values before and after updating is smaller than a set difference value, if so, outputting the action with the highest Q value as the real-time risk collision avoidance strategy, and otherwise, updating the Q value next time.
Correspondingly, the local real-time risk avoidance needs to consider the filtered small-sized obstacles and the avoidance of unidentified obstacles in the navigation water area map, so that a Q-learning algorithm is introduced to the local risk avoidance, and an obstacle avoidance strategy is learned through interaction with the environment, so that the effect of further reducing the difficulty of environment modeling is achieved.
Specifically, a state set, an action set, and a reward policy are initialized. In the prior art, when a ship state set is established, the ship state set is generally established based on a geodetic coordinate system, so that a result obtained by training in a certain environment cannot be transplanted to another new environment; in addition, the state set based on the global environment will increase suddenly with the increase of the sea area, which is not favorable for storage and convergence. Unlike this state set definition method, the state set definition method proposed in this embodiment is as follows: for the construction of a ship state set, taking the position of a ship as the center of a circle, setting a threshold as an obstacle in the circle formed by the radius, and recording the obstacle and the relative position information of the ship as a state by combining the current course angle of the ship as s; the actions comprise advancing and steering, which is marked as a, the action in each state is initialized, the value of the action is represented by a Q value, and the larger the Q value of a certain action is, the higher the value of the action in the state is; in addition, a reward strategy is set to serve as value guidance according to collision conditions and target arrival conditions fed back by interaction with the environment in the ship navigation process.
And selecting the action. The action selection adopts an epsilon-greedy strategy, the action with the highest value is selected according to a certain probability epsilon, otherwise, random selection is carried out, and the mathematical expression is as follows:
Figure BDA0002852977950000131
wherein, a t For the selected action, Q(s) t A) is a state s t The Q value of the lower action a, argmax, represents the value of Q(s) t A) corresponding action at maximum, A represents a selectable action set, p is selection probability, epsilon is greedy degree, epsilon belongs to [0,1]。
And updating the action value. And updating the Q value of the action by adopting a Q-learning algorithm.
Through iteration, the Q value is converged, and the final strategy pi (a | s) = argmaxQ (s, a) is output.
Preferably, the vessel autonomous navigation is realized by combining the global reference track and the local risk collision avoidance strategy, and specifically, the method comprises the following steps:
acquiring a starting point, a target point and the corresponding global reference track;
according to the nodes and the node sequence contained in the global reference track, sequentially setting each node as a child target point and navigating to each child target point in sequence;
acquiring barrier information in a current navigation area in real time in the navigation process;
judging whether the ship reaches a current target point, if so, further judging whether the current target point is an end point, if so, finishing navigation, if not, updating the current target point, and if not, turning to the next step;
and judging whether the barrier information is in a safe area of the ship, if so, generating a corresponding real-time risk collision avoidance strategy based on the barrier information, and carrying out risk avoidance according to the real-time risk collision avoidance strategy, otherwise, adjusting the course of the ship to navigate to a current target point, and continuously monitoring whether the barrier information is in the safe area of the ship.
And after the global track planning and the local risk collision avoidance strategy planning are completed, the connection between the model parameters is properly adjusted according to the difference of the actual model by combining the global track planning and the local risk collision avoidance strategy. Based on a ship model, collision-free autonomous navigation from a starting point to a terminal point can be completed under the condition of acquiring priori knowledge of a navigation water area map, and the specific process comprises the following steps:
s41, inputting a starting point and a target point, and starting an algorithm;
s42, starting from the starting point position, obtaining an intermediate node passing from the starting point to the end point through a global track planning algorithm, setting the intermediate node as a child target point, and keeping the sequence of passing unchanged;
s43, detecting obstacle information in the navigation area through a ship sensing system, and if the current target point is judged to be reached, entering a step S46; if the judgment is dangerous, the step S44 is carried out; if the judgment result is safe, the step S45 is carried out;
s44, adopting a local risk collision avoidance strategy to avoid the current risk, and then switching to the step S43 to judge whether the risk is separated;
s45, adjusting the course to sail towards the current target point, and then turning to the step S43 to judge the dangerous situation;
s46, if the current target point is the end point, the step S47 is carried out, otherwise, the current target point is updated to be the next target point, and the step S43 is carried out;
and S47, finishing the algorithm and outputting the running track of the ship.
According to the ship autonomous navigation model, the original map is preprocessed by adopting edge extraction, convex hull extraction and convex polygon fitting methods, so that large obstacles are simplified, small obstacles are filtered, an MAKLINK global connection diagram is constructed, and a water area environment model is simplified; the traditional Dijkstra algorithm and the ant colony optimization algorithm are combined on the global flight path planning, so that the global flight path tends to be shortest, the redundancy of paths is reduced, and the transportation efficiency and the economic benefit of a ship are facilitated; correspondingly, the local real-time risk collision avoidance needs to consider the small-sized obstacles filtered out from the original map and the collision avoidance of the unidentified obstacles, so that a Q-learning algorithm is introduced to the local risk collision avoidance, a collision avoidance strategy is learned through interaction with the environment, and the effect of further reducing the difficulty of environment modeling is achieved. By combining global track planning and a local risk collision avoidance strategy, the model realizes collision-free autonomous navigation from a starting point to a terminal point of a ship, so that the model can be applied to digital construction of large-scale ports.
1. The ship autonomous navigation model based on the ant colony optimization algorithm and Q-learning makes up for the method in the background art, and only needs less environment priori knowledge to overcome the defect that real-time collision avoidance cannot be realized for the submerged reef which is not identified by the map in the water area, thereby reducing the modeling difficulty.
2. Compared with the prior art mentioned in the background art, the ship autonomous navigation model based on the ant colony optimization algorithm and the Q-learning has the advantages that complex modeling of obstacles by a ship is not needed in real time, the requirement on a ship sensing system is low, the sensing systems with different functional conditions can establish a state set suitable for the functional conditions of the ship and learn according to different types of acquired information, and certain universality is achieved.
3. The traditional Q-learning algorithm is applied to the problems of large state space, low convergence speed and large randomness of planned flight paths in a grid map, and the artificial potential field method mentioned in the background art as a reward strategy still cannot well solve the problems as value guidance. The ship autonomous navigation model based on the ant colony optimization algorithm and Q-learning provided by the invention not only reserves high degree of freedom of navigation caused by a grid method on local obstacle avoidance, but also ensures that the global navigation track always tends to be shortest, and is beneficial to improving the transportation efficiency and economic benefit of ships.
Based on the above reasons, the ship autonomous navigation model based on the ant colony optimization algorithm and Q-learning provided by the invention can provide effective reference for ship autonomous navigation control.
In order to check the effectiveness of the ship autonomous navigation model based on the ant colony optimization algorithm and Q-learning, verification experiments are respectively set in the embodiment to test the performance of the model.
Verification experiment:
and mapping the original map to a sailing water area, and setting the parameter values of the ship model, the global track planning strategy and the local risk collision avoidance strategy as shown in tables 1, 2 and 3 respectively by assuming that the distance between adjacent pixel points is the minimum unit and corresponds to 1km in the water area.
Tables 1,
Figure BDA0002852977950000151
TABLE 2,
Figure BDA0002852977950000152
Figure BDA0002852977950000161
In fig. 5a, it can be clearly seen that when the number of iterations reaches 120, the total length of the path has converged to 650km, and then the convergence speed gradually decreases, in fig. 5b, the Dijkstra algorithm generates an initial path, which is represented by a short dashed line, the path length is 671.36km, and after 500 iterations, the shortest path is 631.53km, which is represented by a long dashed line, i.e. the reference track.
TABLE 3,
Figure BDA0002852977950000162
After 100 learning cycles, the algorithm has substantially converged and has the ability to independently avoid obstacles in the local area shown in fig. 6b, as shown in fig. 6 a.
The ship autonomous navigation model based on the ant colony optimization algorithm and Q-learning provided by the invention is subjected to verification simulation experiments in two different original water domains. In fig. 7a, the track can be roughly divided into 6 sections, in the 5 th section of the track, after the ship detects the surrounding obstacles, the navigation track is adjusted to bypass the obstacles, after the danger is eliminated, the track is adjusted again to sail towards the current target point, and finally the collision-free navigation from the starting point to the target point is realized. In fig. 7b, similar effects are achieved by changing the positions of the start point and the target point.
Example 2
Embodiment 2 of the present invention provides a ship autonomous navigation apparatus, including a processor and a memory, where the memory stores a computer program, and when the computer program is executed by the processor, the ship autonomous navigation apparatus implements the ship autonomous navigation method provided in embodiment 1.
The ship autonomous navigation device provided by the embodiment of the invention is used for realizing the ship autonomous navigation method, so that the ship autonomous navigation device has the technical effects which are not described herein again.
Example 3
Embodiment 3 of the present invention provides a computer storage medium having stored thereon a computer program that, when executed by a processor, implements the vessel autonomous navigation method provided in embodiment 1.
The computer storage medium provided by the embodiment of the invention is used for realizing the ship autonomous navigation method, so that the technical effect of the ship autonomous navigation method is achieved, and the computer storage medium also has the technical effect, and is not repeated herein.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention.

Claims (9)

1. A ship autonomous navigation method is characterized by comprising the following steps:
constructing a ship motion model, and constructing a navigation water area model based on a navigation water area map;
based on the navigation water area model, performing global flight path planning by adopting an ant colony optimization algorithm to generate a global reference flight path;
based on the ship motion model, performing local risk collision avoidance planning by adopting a Q-learning algorithm to generate a real-time risk collision avoidance strategy;
the global reference track and the local risk collision avoidance strategy are combined to realize autonomous navigation of the ship;
based on the ship motion model, performing local risk collision avoidance planning by adopting a Q-learning algorithm to generate a real-time risk collision avoidance strategy, which specifically comprises the following steps:
initializing a state set, an action set and a reward strategy of the ship; the state set comprises a plurality of states of the ship, wherein the states comprise relative position information between obstacles in a safe area of the ship and a current course angle of the ship; the motion set comprises a plurality of corresponding motions of the ship in each state, and the motions comprise translation information and rotation information; the reward strategy comprises a collision condition and a target arrival condition which are fed back by interaction with the environment in the ship navigation process;
calculating the Q value of each action in the action set, and using the Q value to represent the action value;
by using
Figure 97587DEST_PATH_IMAGE001
The strategy selects the action with the highest action value from the action set, and specifically comprises the following steps: action selection adoption
Figure 528569DEST_PATH_IMAGE001
Strategy, with a certain probability
Figure 235625DEST_PATH_IMAGE003
And selecting the action with the highest value, otherwise, randomly selecting, wherein the mathematical expression is as follows:
Figure 334031DEST_PATH_IMAGE004
wherein the content of the first and second substances,
Figure 350397DEST_PATH_IMAGE005
in order for the action to be selected,
Figure 3095DEST_PATH_IMAGE006
is a state
Figure 779422DEST_PATH_IMAGE007
Lower motion
Figure 201176DEST_PATH_IMAGE008
The value of Q of (A) is,
Figure 263810DEST_PATH_IMAGE009
show to get
Figure 528438DEST_PATH_IMAGE010
The corresponding action at the maximum value is carried out,
Figure 233088DEST_PATH_IMAGE011
a set of selectable actions is represented that are,
Figure 650294DEST_PATH_IMAGE012
in order to select the probability of being,
Figure 149409DEST_PATH_IMAGE013
in order to be greedy,
Figure 166912DEST_PATH_IMAGE014
updating the Q value of each action in the current state by adopting a Q-learning algorithm:
Figure 285041DEST_PATH_IMAGE015
Figure 681387DEST_PATH_IMAGE016
wherein the content of the first and second substances,
Figure 741616DEST_PATH_IMAGE017
in the case of the current state of the mobile terminal,
Figure 856203DEST_PATH_IMAGE018
in order for the action to be selected,
Figure 778022DEST_PATH_IMAGE019
to perform an action
Figure 419088DEST_PATH_IMAGE020
In the latter next state, the state of the switch is,
Figure 525584DEST_PATH_IMAGE021
for the reward fed back in the current state,
Figure 2833DEST_PATH_IMAGE022
in order to obtain a learning rate,
Figure 977611DEST_PATH_IMAGE023
in order to be a factor of the discount,
Figure 348550DEST_PATH_IMAGE024
is in a state
Figure 501314DEST_PATH_IMAGE017
Lower motion
Figure 59334DEST_PATH_IMAGE018
The value of Q of (A) is,
Figure 572224DEST_PATH_IMAGE025
indicating a state
Figure 797669DEST_PATH_IMAGE019
Lower motion
Figure 855754DEST_PATH_IMAGE018
The value of Q of (A) is,
Figure 432229DEST_PATH_IMAGE026
it is indicated that the maximum value is taken,
Figure 93018DEST_PATH_IMAGE027
representing an update;
judging whether the difference between the Q values before and after updating is smaller than a set difference value, if so, outputting the action with the highest Q value as the real-time risk collision avoidance strategy, and otherwise, updating the Q value next time;
the state set definition method comprises the following steps: for the construction of a ship state set, the relative position information of an obstacle in a circle formed by taking the position of a ship as the center of the circle and setting a threshold as the radius and the ship is recorded as a state by combining the current course angle of the ship
Figure 297603DEST_PATH_IMAGE028
(ii) a The movements including advancing and turning, are described
Figure 916803DEST_PATH_IMAGE029
The value of an action is represented by a Q value by initializing the action in each state, and a larger Q value of an action indicates a higher value of the action in the state.
2. The vessel autonomous navigation method according to claim 1, characterized by constructing a vessel motion model, specifically:
establishing a ship model;
the motion of the ship is decomposed into translation along with a central point and rotation around the central point, and the motion of the ship is simplified into motion on a horizontal plane, so that the kinematic description of the ship is obtained;
setting a steering threshold value, and restraining the steering of the ship;
setting a safe area of the ship by taking the central point as a circle center and the safe distance as a radius;
and combining the ship model, the kinematics description, the steering threshold value and the safety region to obtain the ship motion model.
3. The vessel autonomous navigation method according to claim 1, wherein the navigation water area model is constructed based on a navigation water area map, specifically:
acquiring a navigation water area map, and carrying out edge extraction on obstacles in the navigation water area map;
extracting convex hulls of edges by using a Graham scanning method, setting a minimum side length threshold value and a minimum perimeter threshold value, and filtering the convex hulls of which the side length is smaller than the minimum side length threshold value or the perimeter is smaller than the minimum perimeter threshold value;
zooming the convex hulls equidistantly to enable the convex hulls to completely surround corresponding obstacles;
constructing an MAKLINK global connected graph by using the convex hull top point, and selecting the middle points, the starting points and the end points of all MAKLINK lines in the MAKLINK global connected graph as network nodes;
and connecting adjacent network nodes to obtain a undirected network diagram of the trajectory plan, and taking the undirected network diagram as the sailing water area model.
4. The ship autonomous navigation method according to claim 1, wherein based on the model of the sailing water area, an ant colony optimization algorithm is used to perform global track planning and generate a global reference track, specifically:
the navigation water area model is a directionless network diagram, and the directionless network diagram comprises a plurality of connecting lines;
acquiring a starting point and an end point, and constructing an initial track from the starting point to the end point based on the undirected network graph;
establishing an objective function of node optimization:
Figure 590361DEST_PATH_IMAGE030
wherein the content of the first and second substances,
Figure 54841DEST_PATH_IMAGE031
as to the length of the flight path,
Figure 113932DEST_PATH_IMAGE032
indicating the first in the track
Figure 638455DEST_PATH_IMAGE033
From node to node
Figure 923942DEST_PATH_IMAGE034
The distance between the individual nodes is such that,
Figure 67479DEST_PATH_IMAGE035
indicating the first track passed
Figure 856443DEST_PATH_IMAGE033
Any point on the connecting line is marked as the second point in the track
Figure 942080DEST_PATH_IMAGE033
The number of the nodes is equal to the number of the nodes,
Figure 449285DEST_PATH_IMAGE036
indicating the first track passed
Figure 396512DEST_PATH_IMAGE034
Any point on the strip connecting line is marked as the first point in the flight path
Figure 305562DEST_PATH_IMAGE034
The number of the nodes is one,
Figure 171887DEST_PATH_IMAGE037
the number of connecting lines through which the flight path passes,
Figure 556601DEST_PATH_IMAGE038
is a coefficient of proportionality that is,
Figure 900995DEST_PATH_IMAGE039
the proportionality coefficient being used for regulation
Figure 805497DEST_PATH_IMAGE040
In the first place
Figure 842723DEST_PATH_IMAGE041
Specific positions on the bar connection lines:
Figure 324520DEST_PATH_IMAGE042
wherein the content of the first and second substances,
Figure 862817DEST_PATH_IMAGE043
Figure 746460DEST_PATH_IMAGE044
respectively the first track passed
Figure 829953DEST_PATH_IMAGE046
Two end points of the bar connection line;
obtaining a plurality of paths by adjusting the proportional coefficients to form a solution set of feasible paths;
and optimizing the feasible path by adopting an ant colony optimization algorithm to obtain a shortest path as the global reference track.
5. The ship autonomous navigation method according to claim 4, wherein an ant colony optimization algorithm is used to optimize feasible paths, and a shortest path is obtained as the global reference track, specifically:
constructing a node set of a path according to the undirected network graph, and initializing parameters of an ant colony optimization algorithm;
selecting a next path node for each ant from the node set according to the pheromone;
and updating the pheromone on the corresponding path selected by the ant in a single step:
Figure 799046DEST_PATH_IMAGE047
wherein the content of the first and second substances,
Figure 875456DEST_PATH_IMAGE048
indicating the previous connection line
Figure 613604DEST_PATH_IMAGE049
From node to the next connection
Figure 992633DEST_PATH_IMAGE050
The pheromone between the individual nodes is,
Figure 324389DEST_PATH_IMAGE051
is a pair of
Figure 814276DEST_PATH_IMAGE048
The updated pheromone is carried out,
Figure 820582DEST_PATH_IMAGE052
is the initial pheromone and is a new pheromone,
Figure 636091DEST_PATH_IMAGE053
is the volatilization coefficient of the pheromone and is,
Figure 455142DEST_PATH_IMAGE054
after each ant completes the path selection of one round, the shortest path is selected for pheromone round updating, and the pheromone round updating formula is as follows:
Figure 748720DEST_PATH_IMAGE055
Figure 195882DEST_PATH_IMAGE056
wherein the content of the first and second substances,
Figure 306927DEST_PATH_IMAGE057
for the length of the selected path or paths,
Figure 737908DEST_PATH_IMAGE058
represents the sum of the pheromones released by all ants between the two points,
Figure 710543DEST_PATH_IMAGE059
representing a round;
and checking whether a termination condition is reached, if so, outputting the shortest path in the paths taken by the ants as the global reference track, and otherwise, performing next path optimization.
6. The vessel autonomous navigation method according to claim 4, wherein an initial track is constructed based on the model of the sailing waters, specifically:
in the constructed undirected network graph, the midpoint of a MAKLINK line is used as an intermediate node, and a Dijkstra algorithm is adopted to search the shortest path from a starting point to an end point to be used as the initial track.
7. The vessel autonomous navigation method according to claim 1, wherein vessel autonomous navigation is realized by combining the global reference track and the local risk collision avoidance strategy, and specifically, the method includes:
acquiring a starting point, a target point and the corresponding global reference track;
according to the nodes and the node sequence contained in the global reference track, sequentially setting each node as a child target point and navigating to each child target point in sequence;
acquiring barrier information in a current navigation area in real time in the navigation process;
judging whether the ship reaches a current target point, if so, further judging whether the current target point is an end point, if so, finishing navigation, if not, updating the current target point, and if not, turning to the next step;
and judging whether the barrier information is in a safe area of the ship, if so, generating a corresponding real-time risk collision avoidance strategy based on the barrier information, and carrying out risk avoidance according to the real-time risk collision avoidance strategy, otherwise, adjusting the course of the ship to navigate to a current target point, and continuously monitoring whether the barrier information is in the safe area of the ship.
8. A ship autonomous navigation apparatus comprising a processor and a memory, the memory having stored thereon a computer program which, when executed by the processor, implements the ship autonomous navigation method according to any one of claims 1 to 7.
9. A computer storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements the vessel autonomous navigation method according to any of claims 1-7.
CN202011535524.2A 2020-12-23 2020-12-23 Ship autonomous navigation method and device Active CN112650246B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011535524.2A CN112650246B (en) 2020-12-23 2020-12-23 Ship autonomous navigation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011535524.2A CN112650246B (en) 2020-12-23 2020-12-23 Ship autonomous navigation method and device

Publications (2)

Publication Number Publication Date
CN112650246A CN112650246A (en) 2021-04-13
CN112650246B true CN112650246B (en) 2022-12-09

Family

ID=75359307

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011535524.2A Active CN112650246B (en) 2020-12-23 2020-12-23 Ship autonomous navigation method and device

Country Status (1)

Country Link
CN (1) CN112650246B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113110453B (en) * 2021-04-15 2022-06-21 哈尔滨工业大学 Artificial potential field obstacle avoidance method based on graph transformation
CN113190026B (en) * 2021-05-10 2022-06-28 河海大学 Energy-saving underwater path planning method based on re-excitation mechanism particle swarm algorithm
CN117452954B (en) * 2023-12-20 2024-04-02 北京海兰信数据科技股份有限公司 Navigation method of ship and terminal equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107037809A (en) * 2016-11-02 2017-08-11 哈尔滨工程大学 A kind of unmanned boat collision prevention method based on improvement ant group algorithm
CN109597417A (en) * 2019-01-14 2019-04-09 哈尔滨工程大学 A kind of more USV multi-agent synergy collision-avoidance planning methods based on collision prevention criterion
AU2020101965A4 (en) * 2020-08-25 2020-10-01 Wuhan University Of Technology A method of navigational early warming for ship encounter, and system thereof

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108820157B (en) * 2018-04-25 2020-03-10 武汉理工大学 Intelligent ship collision avoidance method based on reinforcement learning
CN109241552B (en) * 2018-07-12 2022-04-05 哈尔滨工程大学 Underwater robot motion planning method based on multiple constraint targets
CN109540136A (en) * 2018-10-25 2019-03-29 广东华中科技大学工业技术研究院 A kind of more unmanned boat collaboration paths planning methods
US11119250B2 (en) * 2019-01-15 2021-09-14 International Business Machines Corporation Dynamic adaption of vessel trajectory using machine learning models

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107037809A (en) * 2016-11-02 2017-08-11 哈尔滨工程大学 A kind of unmanned boat collision prevention method based on improvement ant group algorithm
CN109597417A (en) * 2019-01-14 2019-04-09 哈尔滨工程大学 A kind of more USV multi-agent synergy collision-avoidance planning methods based on collision prevention criterion
AU2020101965A4 (en) * 2020-08-25 2020-10-01 Wuhan University Of Technology A method of navigational early warming for ship encounter, and system thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于三维卷积神经网络的点云图像船舶分类方法;任永梅 等;《激光与光电子学进展》;20200831;第57卷(第16期);161022-1-161022-9 *
规则约束下基于深度强化学习的船舶避碰方法;周双林 等;《中国航海》;20200930;第43卷(第3期);27-32+46 *

Also Published As

Publication number Publication date
CN112650246A (en) 2021-04-13

Similar Documents

Publication Publication Date Title
CN112650246B (en) Ship autonomous navigation method and device
Wang et al. Autonomous pilot of unmanned surface vehicles: Bridging path planning and tracking
Chen et al. Optimal time-consuming path planning for autonomous underwater vehicles based on a dynamic neural network model in ocean current environments
Cao et al. Target search control of AUV in underwater environment with deep reinforcement learning
CN111829527A (en) Unmanned ship path planning method based on deep reinforcement learning and considering marine environment elements
Wang et al. Cooperative collision avoidance for unmanned surface vehicles based on improved genetic algorithm
Wang et al. Local path optimization method for unmanned ship based on particle swarm acceleration calculation and dynamic optimal control
CN111665846B (en) Water surface unmanned ship path planning method based on rapid scanning method
Jung et al. An efficient underwater coverage method for multi-AUV with sea current disturbances
Zhou et al. Obstacle avoidance strategy for an autonomous surface vessel based on modified deep deterministic policy gradient
Deraj et al. Deep reinforcement learning based controller for ship navigation
CN110906935A (en) Unmanned ship path planning method
Ma et al. A new coverage path planning algorithm for unmanned surface mapping vehicle based on A-star based searching
Lan et al. Path planning for underwater gliders in time-varying ocean current using deep reinforcement learning
Zhang et al. Intelligent vector field histogram based collision avoidance method for auv
Xia et al. Research on collision avoidance algorithm of unmanned surface vehicle based on deep reinforcement learning
Su et al. A constrained locking sweeping method and velocity obstacle based path planning algorithm for unmanned surface vehicles in complex maritime traffic scenarios
Okereke et al. An Overview of Machine Learning Techniques in Local Path Planning for Autonomous Underwater Vehicles
Guo et al. Mission-driven path planning and design of submersible unmanned ship with multiple navigation states
Zhao et al. Global path planning of unmanned vehicle based on fusion of A∗ algorithm and Voronoi field
Gao et al. An Optimized Path Planning Method for Container Ships in Bohai Bay Based on Improved Deep Q-Learning
CN116952239A (en) Unmanned ship path planning method based on fusion of improved A and DWA
CN117369441A (en) Self-adaptive intelligent ship path planning method considering ship kinematics and COLLEGs
Ishii et al. A self-organizing map based navigation system for an underwater robot
CN116400679A (en) Ship formation algorithm combining rapid travel leveling method and virtual obstacle potential field method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant