CN115407768A - Underwater robot marine organism efficient fishing path planning method - Google Patents

Underwater robot marine organism efficient fishing path planning method Download PDF

Info

Publication number
CN115407768A
CN115407768A CN202210920410.2A CN202210920410A CN115407768A CN 115407768 A CN115407768 A CN 115407768A CN 202210920410 A CN202210920410 A CN 202210920410A CN 115407768 A CN115407768 A CN 115407768A
Authority
CN
China
Prior art keywords
cost
robot
path
fishing
underwater
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210920410.2A
Other languages
Chinese (zh)
Other versions
CN115407768B (en
Inventor
黄海
孙溢泽
张震坤
靳佰达
张云飞
蔡峰春
姜涛
韩鑫悦
王兆群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Engineering University
Original Assignee
Harbin Engineering University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Engineering University filed Critical Harbin Engineering University
Priority to CN202210920410.2A priority Critical patent/CN115407768B/en
Publication of CN115407768A publication Critical patent/CN115407768A/en
Application granted granted Critical
Publication of CN115407768B publication Critical patent/CN115407768B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/02Control of position or course in two dimensions
    • G05D1/0206Control of position or course in two dimensions specially adapted to water vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Manipulator (AREA)

Abstract

The invention provides a path planning method for efficient marine organism fishing of an underwater robot, and belongs to the technical field of path planning. The invention provides an efficient fishing path planning strategy suitable for an underwater environment, which specifically comprises the following steps: firstly, defining fishing loss cost comprising path cost, turning cost and motion cost of a mechanical arm; then defining a collision risk cost consisting of the collision risk of the robot hull and the collision risk of the end effector, and related to the speed of the collision; and finally, establishing a value iterative network, inputting an underwater cost map obtained based on cost calculation, and calculating the average grabbing cost in a two-dimensional discrete environment to obtain underwater information which is beneficial to improving the grabbing task efficiency, so that the efficient fishing path planning of marine organisms of the underwater robot is realized.

Description

Underwater robot marine organism efficient fishing path planning method
Technical Field
The invention belongs to the technical field of path planning, and particularly relates to a path planning method for efficient marine organism fishing of an underwater robot.
Background
With the vigorous development of our country on ocean resources, the demand of people on marine products is increasing day by day, the tendency that an underwater fishing robot replaces a diver for fishing is more obvious, the threat of large underwater pressure and low temperature to the life health of the diver can be reduced, and the underwater grabbing efficiency is also improved. The fishing efficiency of the current underwater robot is far from the difference of human beings, wherein the path of underwater fishing operation is important for improving the fishing efficiency of the fishing robot. The underwater fishing environment faces complex marine organism distribution and obstacle conditions, the traditional underwater robot path planning method is difficult to apply, and efficient machine fishing can be realized only by analyzing underwater target distribution, the position relation of a robot and an obstacle and ocean current conditions.
The patent No. 201811521156.9, which discloses a method for parallel operation of an autonomous underwater robot and geological sampling equipment, provides a method for parallel operation of an autonomous underwater robot and geological sampling equipment. Mainly through the detection to the sample to the full coverage and the location parallel operation of sample scope are carried out, are difficult to combine the distribution and the position that marine organism grows to improve and catch efficiency. Based on the problems, a marine organism efficient catching path planning method for the underwater robot is provided to improve the efficiency of marine organism catching path planning of the underwater robot.
Disclosure of Invention
The invention aims to provide a method for planning a marine organism efficient fishing path of an underwater robot.
The purpose of the invention is realized by the following technical scheme:
a marine organism efficient fishing path planning method for an underwater robot specifically comprises the following steps:
the method comprises the following steps: calculating the fishing loss cost;
defining the fishing loss cost of the underwater robot, wherein the fishing loss cost of the underwater robot consists of the path movement cost of the robot moving to a destination, the bow turning cost in the movement process and the movement consumption cost of a mechanical arm for grabbing at the base bottom near a grabbing target;
step two: calculating collision risk cost;
defining collision risk cost of the underwater robot, wherein the collision risk cost consists of the collision risk of a robot boat body and the collision risk of an end effector, and the collision risk is related to the speed of collision; for a scene with multiple obstacles, calculating the collision risk of the robot on the hull and the end effector of each obstacle;
step three: adopting a value iterative network to realize path planning based on cost calculation;
according to the underwater environment, solving the collision risk cost, and assigning a value to the risk degree of each position of the value iteration network input map to obtain an underwater cost map; and inputting an underwater cost map in a value iterative network to obtain a planned path, calculating the motion consumption cost in a two-dimensional discrete environment, evaluating the grabbing difficulty degree of each task point, obtaining underwater information for improving the grabbing task efficiency, and obtaining the fishing path plan of marine organisms of the underwater robot.
The present invention may further comprise:
1. in the first step, path motion cost, bow turning cost and motion consumption cost of the mechanical arm during grabbing are defined as follows:
(1) Path motion cost f s The definition of (a) is expressed as follows:
Figure RE-GDA0003900350360000021
wherein the content of the first and second substances,
Figure RE-GDA0003900350360000022
which is indicative of the speed of the robot,
Figure RE-GDA0003900350360000023
representing the speed, t, of the ocean current s And t f Representing the start time and the end time of the movement;
(2) Cost f of turning bow t The definition of (a) is expressed as follows:
Figure RE-GDA0003900350360000024
wherein the heading angle is the angle before the change
Figure RE-GDA0003900350360000025
The angle of the change of the heading angle is stopped
Figure RE-GDA0003900350360000026
The bow-turning cost f in the motion process t Is the sum of changes of the heading angle;
(3) The motion consumption cost of the mechanical arm during grabbing is as follows:
Figure RE-GDA0003900350360000027
wherein, theta is Indicating the angle at which the i-th joint begins to change, θ if Angle, f, representing the i-th joint not changing after a time-off a Representing the motion cost of the rotation of each joint;
motion loss cost formula f pick Can be expressed as:
Figure RE-GDA0003900350360000028
wherein v is c Representing the magnitude of the flow rate, C α4 Represents the biological density coefficient, f s 、f t 、f a Respectively representing the path consumption cost, the turning consumption cost, the mechanical arm movement cost and C of the fishing robot in the moving process when the target is grabbed α1 、C α2 、C α3 Respectively representThe distance coefficient, the bow turning coefficient and the rotation coefficient of the mechanical arm movement of the fishing robot are obtained when the target is grabbed.
2. In the second step, the collision risk of the robot hull, the collision risk of the end effector, and the correlation between the collision risk and the collision speed are defined:
(1) Defining the collision risk cost of the robot hull at a task point, namely multiplying the speed by a risk function, and integrating in the grabbing time; the collision risk cost of the robot hull is:
Figure RE-GDA0003900350360000029
A. defining a safety distance epsilon v The expression of (c) is as follows:
Figure RE-GDA00039003503600000210
wherein l rob For the length of the fishing robot, k 1 To a safety factor, v c Is the flow rate of the operating area;
B. defining a collision risk function
Figure RE-GDA0003900350360000031
The following were used:
Figure RE-GDA0003900350360000032
wherein, the first and the second end of the pipe are connected with each other,
Figure RE-GDA0003900350360000033
is the robot-obstacle distance;
C. defining the speed v of the impact of the hull body As follows:
Figure RE-GDA0003900350360000034
wherein p is n Is the position of the robot, (v) x ,v y ,v z ) Is the speed of the collision;
(2) Defining the collision risk cost of the robot end effector at a task point, wherein the collision risk cost is obtained by multiplying the speed by a risk function and integrating in the grabbing time; the collision risk penalty of the robot end effector at the task point is:
Figure RE-GDA0003900350360000035
A. defining an end effector catch Risk function
Figure RE-GDA0003900350360000036
Comprises the following steps:
Figure RE-GDA0003900350360000037
wherein the content of the first and second substances,
Figure RE-GDA0003900350360000038
to capture the distance between the end-effector and the obstacle, d h Is the height of the obstacle, epsilon e For a safe distance between the fishing target and the obstacle, k 1 ,k 2 Is the undetermined coefficient;
B. velocity v of impact of end effector e The expression is as follows:
Figure RE-GDA0003900350360000039
wherein, the first and the second end of the pipe are connected with each other,
Figure RE-GDA00039003503600000310
speed of impact of end-effector, p n Is the position of the robot end effector;
(3) Defining a collision risk cost formula of the robot as follows:
Figure RE-GDA00039003503600000311
wherein the content of the first and second substances,
Figure RE-GDA00039003503600000312
as a function of the risk of the robot hull,
Figure RE-GDA00039003503600000313
is a risk function of the end effector, v body Projection of the velocity of the robot hull in the direction of impact with an obstacle, v e Is the projection of the velocity in the direction of the end effector impacting the obstacle.
3. In the third step, a path planning method based on cost calculation is provided to realize marine organism fishing path planning of the underwater robot, and the path planning process based on the underwater cost calculation is as follows:
(1) Generating a cost map;
the m × m continuous environment is divided into 1 × 1 grid environments with each grid, and each location (i, j) in the grid environment has only two states: the method comprises the steps that with obstacles and without obstacles, under the condition that the distance between a fishing robot and the obstacles is within a safe range, a safe distance is calculated according to the flow velocity, according to the states of eight grids around, collision risk cost basis of each grid environment is calculated through a collision risk cost function and the speed component of the robot in the direction where collision is likely to occur, and the collision risk cost basis is used as a cost value of an environment map under a two-dimensional discrete environment;
(2) Establishing a value iterative network to solve the problem of path planning;
establishing a value iteration network: embedding a value iteration module in a neural network, associating convolution operation and pooling operation of the neural network with a value iteration algorithm in reinforcement learning, endowing the neural network with a planning function, and describing the variable by using a state value function:
v(s)=E[R t+1v (S t+1 )|S t =s]
wherein γ is a discount coefficient;
the optimal strategy is found in the following way, so that the value of the total decision sequence is maximized:
Figure RE-GDA0003900350360000041
inputting an underwater cost map based on cost calculation in a value iterative network to obtain a planned path;
(3) Calculating average grabbing cost;
the average grabbing cost τ s is calculated as follows:
Figure RE-GDA0003900350360000042
wherein q is sum Representing the sum of states, f pick Represents the exercise expenditure cost, num i Indicating the target number of task points, mu 1 And mu 2 Respectively representing the value and the weight of the exercise consumption cost;
cost of exercise consumption f pick Solving through the difference between the actual planning path and the effective motion path; the path difference item is obtained by calculating the difference between the path length planned under the condition of no obstacle and the path length planned under the condition of an obstacle; calculating a difference value between a bow turning angle of a planned path without an obstacle and a bow turning angle under the condition that the obstacle exists to obtain a bow turning angle;
τ s and evaluating the path condition of the task points and the difficulty degree of grabbing, and selecting an optimal grabbing path.
The invention has the beneficial effects that:
aiming at the requirement of efficient path planning, the invention provides a path planning method based on cost calculation, which is characterized in that a collision risk analysis of underwater capture is carried out, a calculation formula of collision risk cost is designed from multiple angles by considering the distance between a robot and an obstacle, the state of the robot and the number of the obstacles, and the collision risk is evaluated. From the underwater motion efficiency, the underwater fishing consumption cost is calculated on the three aspects of the motion path cost, the bow turning cost and the mechanical arm motion cost by analyzing the underwater motion path of the robot, and the influence of ocean current on the motion speed and bow turning of the underwater robot is considered. And calculating the risk cost in the underwater environment by using the collision risk cost to obtain a cost map, and replacing the original map as input to provide more prior information. By designing a cost calculation-based path planning simulation experiment, path planning is carried out by comparing input different map use value iterative networks, cost-based planning is more prone to selecting actions with low risk cost in decision making, the cost of fishing consumption is reduced by keeping away from obstacles, the average grabbing cost is lower than that of planning based on an original value iterative network, and the efficient marine organism fishing path planning of the underwater robot is realized.
Drawings
FIG. 1 is a graph of path motion cost analysis according to the present invention;
FIG. 2 is a collision cost analysis diagram of the present invention;
FIG. 3 is a schematic view of the multi-obstacle collision risk of the present invention;
FIG. 4 is a value iteration network structure of the present invention;
FIG. 5 is a schematic diagram of calculating a collision risk cost value according to the present invention;
FIG. 6 is a flow chart of path planning in accordance with the present invention;
FIG. 7 is a value iteration path planning result of the present invention;
fig. 8 shows the result of path planning for multiple working points of the robot according to the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
A method for planning a marine organism efficient fishing path of an underwater robot is characterized by being suitable for the method for planning the marine organism efficient fishing path of the underwater robot, and specifically comprising the following steps:
the fishing loss cost of the underwater robot is defined to be composed of the path cost of the robot moving to the destination, the turning cost in the moving process and the movement cost of the grabbing mechanical arm when the underwater robot reaches the base near the grabbing target.
The ocean current velocity can be represented to influence the robot speed through making a difference between the robot hull speed and the ocean current velocity and calculating the magnitude, and the path cost of the ocean current on the movement influence of the robot can be represented through integral in the whole movement process. Path motion cost f s The definition of (a) is expressed as follows:
Figure RE-GDA0003900350360000051
Figure RE-GDA0003900350360000052
which is indicative of the speed of the robot,
Figure RE-GDA0003900350360000053
representing the speed, t, of the ocean current s And t f Indicating the start and end moments of the movement.
As shown in fig. 1, if the robot reaches the point n along the red dotted line without an obstacle, the robot needs to rotate counterclockwise by a certain angle to reach the designated pose, and if the robot moves with an obstacle, the robot needs to turn forward clockwise by a certain angle and then continuously turn forward counterclockwise to reach the final pose. According to a yaw moment equation in a submarine six-degree-of-freedom equation, calculating the integral of yaw moment continuously changed in the motion process of the robot in the motion process to serve as the heading cost of the robot, wherein the heading cost f t The definition of (c) is expressed as follows:
Figure RE-GDA0003900350360000061
where ρ represents the density of the execution water area, L represents the length of the robot hull, N' r′ And N' r|r| And r is an angular velocity component around the z axis when the robot performs bow turning motion.
The robot approaches according to the target after reaching the position near the obstacle, so that the target is in the range capable of being grabbed by the robot, when the robot grabs, the mechanical arm has catching consumption cost, the targets at different positions can lead to different motion processes of the mechanical arm, and similarly, under the condition that the obstacle exists, the mechanical arm needs to bypass the obstacle to grab the target. The complexity of the grasping is described by the sum of the rotation angles of each joint of the mechanical arm. Taking a three-joint mechanical arm as an example, from a contracted state to a gripping state of the mechanical arm, theta is Indicates the angle at which the ith joint begins to change, θ if The angle of the ith joint which does not change after changing for a certain time is shown, and the movement cost of the rotation of each joint is f a Then, the motion consumption cost of the mechanical arm during grabbing can be obtained as follows:
Figure RE-GDA0003900350360000062
from the above analysis, the motion loss cost formula f pick Can be expressed as:
Figure RE-GDA0003900350360000063
the cost of consumption is generated in the motion process of the robot, wherein C α4 Representing the biological density coefficient, and is proportional to the number of target organisms in a certain direction in the region, when C α4 The cost of consumption for the entire path is reduced when larger. f. of s ,f t ,f a The path consumption cost, the turning consumption cost and the mechanical arm movement cost of the fishing robot in the moving process during target grabbing are represented respectively. C α1 , C α2 And C α3 Respectively representing the distance coefficient, the turning coefficient and the rotation coefficient of the mechanical arm movement of the fishing robot when the target is grabbed.
Defining the collision risk cost of an underwater robot consists of the collision risk of the robot hull and the collision risk of the end effector and is related to the speed of the collision. For a scene of a plurality of obstacles, calculating the collision risk of the robot on the hull and the end effector of each obstacle;
the required distance for different types of obstacles is the minimum distance between the robot and the obstacle, the situation of the figure 2 is taken as an example to show the state that the robot grabs near the obstacle, a green envelope ring represents a task point, the robot moves near the obstacle to realize grabbing at a plurality of task points, the obstacle and the robot boat body are represented by red minimum envelope circles, and the radii are R respectively rob And R obs The distance between the two circle centers is d 0 Then the distance between the robot and the obstacle is defined as d obs
d obs =d 0 -R obs -R rob
From the viewpoint of safety, it is considered that v Within is a risk, falling to 0 when out of range, and within a safe range the risk is greater when the distance is smaller. The safety distance is the flow velocity v from the working area c In relation to the above, when the flow velocity is increased, the safety distance is exponentially increased, and according to the actual underwater experiment of the fishing robot, when the flow velocity exceeds 0.6m/s, the robot cannot normally move, so that the robot is considered to move at v c The flow velocity is within 0-0.6m/s, the safe distance is increased linearly along with the increase of the flow velocity under the condition that the robot can move normally, and the length l of the fishing robot is different rob Can be based on the parameter k 1 Adjustment of safety distance, k 1 The safety distance is controlled for the safety factor. Safety distance epsilon v The expression of (a) is as follows:
Figure RE-GDA0003900350360000071
collision risk function
Figure RE-GDA0003900350360000072
When the distance between the robot and the obstacle is less than the safety distance, the safety distance is usedThe distance minus the square of the robot-obstacle distance difference represents the degree of risk and divided by the safe distance, which if larger, represents that the risk of collision is smaller.
Figure RE-GDA0003900350360000073
When the robot-obstacle distance d v obs When the distance is smaller, the difference value between the distance and the safety distance is larger, the collision risk is increased, and when the robot collides with an obstacle, d v obs Is 0, the collision risk function reaches a maximum of ε v When the distance d is v obs The greater the risk of collision, the minimum is 0. The collision risk is also inversely proportional to the safe distance, with the reward for collision risk decreasing when the safe distance is greater and increasing otherwise.
The collision risk is also related to the collision speed, when the speed of the speed in the direction of connecting the two enveloping circle centers is larger, the collision risk cost is larger, and the collision speed (v) is assumed to be that an obstacle collided with the robot is fixed x ,v y ,v z ) Can be measured and projected by a Doppler velocimeter equipped with the fishing robot, the speed of the robot is considered as an approaching speed due to estimation, and the speed is projected to the direction of the collision of the robot with the obstacle, and the expression v of the speed is body As follows:
Figure RE-GDA0003900350360000074
and finally, multiplying the speed by a risk function, and integrating in the capturing time to obtain the collision risk cost at the task point. The collision risk penalty of the robot hull is:
Figure RE-GDA0003900350360000081
in a similar way, when the paw executes the grabbing task, collision risk cost can be generated, and the catching risk function of the paw is as follows:
Figure RE-GDA0003900350360000082
to work out the cost function of the working space in combination with the situation of obstacles in the working environment, it is assumed that the obstacles in the fishing environment are all static, passing through the spatial distance
Figure RE-GDA0003900350360000083
Representing the distance between the end effector and the obstacle. d h Representing the height of the obstacle. Epsilon e The safe distance between the fishing target and the obstacle is represented, because the robot lands on the seabed during operation, the robot boat body is not influenced by the flow velocity of ocean currents, the end effector can be considered to be not influenced by the ocean currents and is a fixed value, the length of the mechanical arm can be generally considered, the risk of collision between the mechanical arm and the obstacle exists only within the safe distance, k 1 ,k 2 The undetermined coefficient can be set as the most suitable parameter according to specific situations.
The risk of collision is also related to the speed of collision, and when the speed of the speed in the direction of connecting the centers of the two enveloping circles is higher, the risk cost of collision is higher, and the speed of collision (v) e x ,v e y ,v e z ) Can be measured and projected by the catching robot, and the velocity of the end grab is considered as a constant value due to estimation, and the velocity is projected to the direction of the collision of the robot with the obstacle, and the expression v thereof e As follows:
Figure RE-GDA0003900350360000084
and finally, multiplying the speed by a risk function, and integrating in the capturing time to obtain the collision risk cost at the task point. The collision risk penalty of the robot hull is:
Figure RE-GDA0003900350360000085
in summary, the collision risk cost formula of the robot is as follows:
Figure RE-GDA0003900350360000086
Figure RE-GDA0003900350360000087
as a function of the risk of the robot hull,
Figure RE-GDA0003900350360000088
is a risk function of the end effector, v body Projection of the velocity of the robot hull in the direction of impact against the obstacle, v e Is a projection of the velocity in the direction of the end effector impacting the obstacle.
When the fishing robot grabs, a scene with a plurality of obstacles exists nearby, and the collision risk of the robot on the hull and the paw of each obstacle needs to be calculated respectively, which means that the collision risk increases with the number of obstacles. Assuming that num obstacles, i = {1,2.., num } exist around the time of fishing, the total collision risk penalty for the robot is:
f obs =∑f i obs
through the process analysis of grabbing the operation to underwater robot execution, catch consumption cost and collision risk cost to the robot and carry out the analysis, to sum up, the robot is grabbing total cost under water in the executive task in-process:
f=f pick +f obs
Figure RE-GDA0003900350360000091
f obs =f body +f effector
the structure of the value iterative network is shown in figure 4, when the value iterative network is used for solving the problem of path planning, the first step is to calculate a state action cost function q l+1 (s, a), namely performing convolution operation on the input image at a state s, wherein the number of channels is equal to the number of actions a in the action space to obtain a state action value function corresponding to the actions, and the second step of value iteration is to find out an optimal state value function v through a greedy strategy l+1 (s), which can be regarded as a one-step pooling operation of the obtained state cost function in the state s, and the two operations are expressed by the following formula:
Figure RE-GDA0003900350360000092
q l+1 (s, a) -State action value function representing the l +1 th round of State s in an iteration taking action a
Figure RE-GDA0003900350360000093
Representing the reward obtained by taking action a at state s
Gamma-represents the discount coefficient
Figure RE-GDA0003900350360000094
-representing state transition probabilities
v l+1 (s) -represents the 1 st +1 st round state s state value function in an iteration
After iteration of k rounds, a final state action value function can be obtained, an optimal strategy is obtained through selection, and the optimal strategy is expressed by the following formula:
Figure RE-GDA0003900350360000095
dividing the continuous m × m environment into 1 × 1 grid environments, considering that each position (i, j) in the grid environment is only oneTwo states: and respectively calculating collision risk cost of each grid environment with obstacles and without obstacles, wherein the risk cost value of each grid can be calculated according to the states of eight grids around the grid, and the risk cost value can be used as the cost value of the environment map in the two-dimensional discrete environment. Under the condition that the distance between the fishing robot and the obstacle is within a safe range, a safe distance is calculated according to the flow velocity, and collision risk cost is calculated through a collision risk cost function and the velocity component of the robot on the direction where collision is likely to occur, so that a cost map is generated. As shown in fig. 5, the green triangle is the current position, and the risk cost value corresponding to the two-dimensional discrete grid corresponding to the green triangle is the collision risk cost calculated for the case where two black obstacles are combined with the ocean current
Figure RE-GDA0003900350360000101
And (3) the sum:
f obs =∑f i obs
and finally, calculating the average grabbing cost. The average grabbing cost τ s is calculated as follows:
Figure RE-GDA0003900350360000102
wherein q is sum Representing the sum of states, f pick Represents the exercise expenditure cost, num i Indicating the target number of task points, mu 1 And mu 2 Representing the value and the weight of the cost of the exercise consumption, respectively.
Cost of exercise consumption f pick The solution is made by the difference between the actual planned path and the effective motion path. The path difference item is obtained by calculating the difference between the path length planned under the condition of no obstacle and the path length planned under the condition of an obstacle; the heading turning angle is obtained by calculating the difference between the heading turning angle of the planned path when no obstacle exists and the heading turning angle under the condition that the obstacle exists. By τ s And evaluating the path condition of the task points and the difficulty degree of grabbing, and selecting the optimal grabbing path to realize the high-efficiency marine life catching path planning of the underwater robot.
As shown in fig. 6, in the path planning based on cost calculation, firstly, an obtained underwater cost map is input in a network, secondly, reward images are designed according to different task requirements, a state-action value function of each point is calculated, a planned path based on a value iteration network is obtained, finally, the movement consumption cost under a two-dimensional discrete environment is calculated, the grabbing difficulty degree of each task point is evaluated, the optimal action is selected, a final path is output, and the efficient fishing path planning of marine organisms of the underwater robot is realized.
And simulating the path planning based on the underwater cost calculation, calculating the average grabbing cost according to the simulation result, and judging whether the path planned by the proposed algorithm can improve the safety of the path and reduce the average grabbing cost. In the simulation environment, on a 28m × 28m grid map, (i, j) represents the corresponding position of the grid, the starting point is at the upper left corner of the grid map, the end point is the task point for grabbing, and 30 grabbed objects exist. The planning method is to improve value iterative network, and the input network is an original map and a cost map subjected to cost calculation respectively.
TABLE 1 value index for different input categories
Figure RE-GDA0003900350360000103
As can be seen from the simulation, according to the planning result shown in fig. 5, the planned path (a) of the value iteration network based on the cost map as the input and the path (b) of the value iteration network based on the original input are more inclined to be far away from the obstacle and are shorter, so that the safety and the fishing efficiency of the robot in motion are improved.
For the path planning simulation of multiple operation points, the simulation environment is that the fishing robot catches 8 targets, the positions of the task points and the target number of the task points are known, and the positions of the fishing robot are known. The results obtained using the path planning method based on the value iterative network are shown in FIG. 8
In conclusion, the average grabbing cost of the path based on the cost calculation planning is less than that of the path based on the original value iterative network, because the cost-based planning is more prone to selecting actions with low risk cost during decision making, the fishing consumption cost is reduced due to the fact that the cost-based planning is far away from obstacles, and the path planning is more suitable for planning the path for executing the underwater grabbing task after the underwater collision risk and the risk of motion consumption are considered.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (4)

1. A marine organism efficient fishing path planning method for an underwater robot is characterized by comprising the following steps: the method specifically comprises the following steps:
the method comprises the following steps: calculating the fishing loss cost;
defining the fishing loss cost of the underwater robot, wherein the fishing loss cost of the underwater robot consists of the path movement cost of the robot moving to a destination, the turning cost in the movement process and the movement consumption cost of a mechanical arm for grabbing at the base near a grabbing target;
step two: calculating collision risk cost;
defining a collision risk cost of the underwater robot to be composed of a collision risk of the robot hull and a collision risk of the end effector, and the collision risks are related to a speed of the collision; for a scene with multiple obstacles, calculating the collision risk of the robot on the hull and the end effector of each obstacle;
step three: adopting a value iterative network to realize path planning based on cost calculation;
according to the underwater environment, solving the collision risk cost, and assigning a value to the risk degree of each position of the value iteration network input map to obtain an underwater cost map; and inputting an underwater cost map in a value iterative network to obtain a planned path, calculating the motion consumption cost in a two-dimensional discrete environment, evaluating the grabbing difficulty degree of each task point, obtaining underwater information for improving the grabbing task efficiency, and obtaining the fishing path plan of marine organisms of the underwater robot.
2. The underwater robot marine organism efficient fishing path planning method of claim 1, characterized in that: in the first step, path motion cost, bow turning cost and motion consumption cost of the mechanical arm during grabbing are defined as follows:
(1) Path motion cost f s The definition of (a) is expressed as follows:
Figure FDA0003777307230000011
wherein, the first and the second end of the pipe are connected with each other,
Figure FDA0003777307230000012
which is indicative of the speed of the robot,
Figure FDA0003777307230000013
representing the speed of the ocean current, t s And t f Representing the start time and the end time of the movement;
(2) Cost f of bow turning t The definition of (c) is expressed as follows:
Figure FDA0003777307230000014
wherein the heading angle is the angle before the change
Figure FDA0003777307230000015
The angle at which the heading angle stops changing is
Figure FDA0003777307230000016
The bow-turning cost f in the motion process t Is the sum of changes of the heading angle;
(3) The motion consumption cost of the mechanical arm during grabbing is as follows:
Figure FDA0003777307230000017
wherein, theta is Indicates the angle at which the ith joint begins to change, θ if Angle, f, representing the i-th joint not changing after a time of change a Representing the motion cost of the rotation of each joint;
motion loss cost formula f pick Can be expressed as:
Figure FDA0003777307230000018
wherein v is c Representing the magnitude of the flow rate, C α4 Represents the biological density coefficient, f s 、f t 、f a Respectively representing the path consumption cost, the turning consumption cost, the mechanical arm movement cost and C of the fishing robot in the moving process when the target is grabbed α1 、C α2 、C α3 Respectively representing the distance coefficient, the turning coefficient and the rotation coefficient of the mechanical arm movement of the fishing robot when the target is grabbed.
3. The underwater robot marine organism efficient fishing path planning method of claim 1, characterized in that: in the second step, the collision risk of the robot hull, the collision risk of the end effector, and the correlation between the collision risk and the collision speed are defined:
(1) Defining the collision risk cost of the robot hull at a task point, namely multiplying the speed by a risk function, and integrating in the grabbing time; the collision risk penalty of the robot hull is:
Figure FDA0003777307230000021
A. defining a safety distance epsilon v The expression of (a) is as follows:
Figure FDA0003777307230000022
wherein l rob For the length of the fishing robot, k 1 To a safety factor, v c Is the flow rate of the work area;
B. defining a collision risk function
Figure FDA0003777307230000023
The following were used:
Figure FDA0003777307230000024
wherein the content of the first and second substances,
Figure FDA0003777307230000025
is the robot-obstacle distance;
C. defining the speed v of the impact of the hull body As follows:
Figure FDA0003777307230000026
wherein p is n Is the position of the robot, (v) x ,v y ,v z ) Is the speed of the collision;
(2) Defining the collision risk cost of the robot end effector at a task point, wherein the collision risk cost is obtained by multiplying the speed by a risk function and integrating in the grabbing time; the collision risk penalty of the robot end effector at the task point is:
Figure FDA0003777307230000027
A. defining a catch risk function for an end effector
Figure FDA0003777307230000028
Comprises the following steps:
Figure FDA0003777307230000029
wherein the content of the first and second substances,
Figure FDA00037773072300000210
to catch the distance between the end-effector and the obstacle, d h Is the height of the obstacle, epsilon e For a safe distance between the fishing target and the obstacle, k 1 ,k 2 Is a undetermined coefficient;
B. velocity v of impact of end effector e The expression is as follows:
Figure FDA00037773072300000211
wherein the content of the first and second substances,
Figure FDA00037773072300000212
speed of impact of end-effector, p n Is the position of the robot end effector;
(3) Defining a collision risk cost formula of the robot as follows:
Figure FDA0003777307230000031
wherein the content of the first and second substances,
Figure FDA0003777307230000032
as a function of the risk of the robot hull,
Figure FDA0003777307230000033
is a risk function of the end effector, v body Projection of the velocity of the robot hull in the direction of impact with an obstacle, v e For the end to holdThe speed projection in the direction of the line impact on the obstacle.
4. The underwater robot marine organism efficient fishing path planning method of claim 1, characterized in that: in the third step, a path planning method based on cost calculation is provided to realize marine organism fishing path planning of the underwater robot, and the path planning process based on the underwater cost calculation is as follows:
(1) Generating a cost map;
the m × m continuous environment is divided into 1 × 1 grid environments with each grid, and each location (i, j) in the grid environment has only two states: the method comprises the steps that with obstacles and without obstacles, under the condition that the distance between a fishing robot and the obstacles is within a safe range, a safe distance is calculated according to the flow velocity, according to the states of eight grids around, collision risk cost basis of each grid environment is calculated through a collision risk cost function and the speed component of the robot in the direction where collision is likely to occur, and the collision risk cost basis is used as a cost value of an environment map under a two-dimensional discrete environment;
(2) Establishing a value iterative network to solve the problem of path planning;
establishing a value iteration network: embedding a value iteration module in a neural network, associating convolution operation and pooling operation of the neural network with a value iteration algorithm in reinforcement learning, endowing the neural network with a planning function, and describing the variable by using a state value function:
v(s)=E[R t+1 +γv(S t+1 )|S t =s]
wherein γ is the discount coefficient;
the optimal strategy is found in the following way, so that the value of the total decision sequence is maximum:
Figure FDA0003777307230000035
inputting an underwater cost map based on cost calculation in a value iterative network to obtain a planned path;
(3) Calculating average grabbing cost;
average grab cost τ s The calculation formula of (a) is as follows:
Figure FDA0003777307230000034
wherein q is sum Represents a state value sum, f pick Represents the exercise expenditure cost, num i Indicating the target number of task points, mu 1 And mu 2 Respectively representing the value and the weight of the exercise consumption cost;
cost of exercise consumption f pick Solving through the difference between the actual planning path and the effective motion path; the path difference item is obtained by calculating the difference between the path length planned under the condition of no obstacle and the path length planned under the condition of an obstacle; calculating a difference value between a bow-turning angle of a planned path without obstacles and a bow-turning angle under the condition of the existence of the obstacles to obtain a bow-turning angle;
τ s and evaluating the path condition of the task points and the difficulty degree of grabbing, and selecting the optimal grabbing path.
CN202210920410.2A 2022-08-02 2022-08-02 Underwater robot marine organism efficient catching path planning method Active CN115407768B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210920410.2A CN115407768B (en) 2022-08-02 2022-08-02 Underwater robot marine organism efficient catching path planning method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210920410.2A CN115407768B (en) 2022-08-02 2022-08-02 Underwater robot marine organism efficient catching path planning method

Publications (2)

Publication Number Publication Date
CN115407768A true CN115407768A (en) 2022-11-29
CN115407768B CN115407768B (en) 2023-12-12

Family

ID=84158737

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210920410.2A Active CN115407768B (en) 2022-08-02 2022-08-02 Underwater robot marine organism efficient catching path planning method

Country Status (1)

Country Link
CN (1) CN115407768B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08202445A (en) * 1995-01-20 1996-08-09 Mitsubishi Heavy Ind Ltd Controller of autonomous underwater robot
CN108281043A (en) * 2018-03-23 2018-07-13 上海海事大学 A kind of ship collision Warning System and method for early warning
CN109725650A (en) * 2019-03-08 2019-05-07 哈尔滨工程大学 A kind of AUV barrier-avoiding method under intensive obstacle environment
CN111951606A (en) * 2020-07-29 2020-11-17 武汉理工大学 Ship collision risk assessment and early warning method and system
CN112068576A (en) * 2020-07-21 2020-12-11 华中科技大学 Task-hierarchical timing optimization-based underwater unmanned ship-double mechanical arms cooperative control method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08202445A (en) * 1995-01-20 1996-08-09 Mitsubishi Heavy Ind Ltd Controller of autonomous underwater robot
CN108281043A (en) * 2018-03-23 2018-07-13 上海海事大学 A kind of ship collision Warning System and method for early warning
CN109725650A (en) * 2019-03-08 2019-05-07 哈尔滨工程大学 A kind of AUV barrier-avoiding method under intensive obstacle environment
CN112068576A (en) * 2020-07-21 2020-12-11 华中科技大学 Task-hierarchical timing optimization-based underwater unmanned ship-double mechanical arms cooperative control method
CN111951606A (en) * 2020-07-29 2020-11-17 武汉理工大学 Ship collision risk assessment and early warning method and system

Also Published As

Publication number Publication date
CN115407768B (en) 2023-12-12

Similar Documents

Publication Publication Date Title
JP6854549B2 (en) AUV action planning and motion control methods based on reinforcement learning
Ruan et al. Mobile robot navigation based on deep reinforcement learning
Zhu et al. Task assignment and path planning of a multi-AUV system based on a Glasius bio-inspired self-organising map algorithm
Wu et al. Reinforcement learning and particle swarm optimization supporting real-time rescue assignments for multiple autonomous underwater vehicles
CN109540151A (en) A kind of AUV three-dimensional path planning method based on intensified learning
CN113485371B (en) Underwater multi-AUV path planning method based on improved sparrow search algorithm
CN104794526A (en) Automatic ship anti-collision method optimized by wolf colony search algorithm
CN108196453A (en) A kind of manipulator motion planning Swarm Intelligent Computation method
CN113534668B (en) Maximum entropy based AUV (autonomous Underwater vehicle) motion planning method for actor-critic framework
Ji-Yong et al. Design and vision based autonomous capture of sea organism with absorptive type remotely operated vehicle
Sun et al. A novel fuzzy control algorithm for three-dimensional AUV path planning based on sonar model
Song et al. Method of emergency collision avoidance for unmanned surface vehicle (USV) based on motion ability database
Mina et al. Maneuvering ability-based weighted potential field framework for multi-USV navigation, guidance, and control
Pal et al. Mobile robot navigation using a neural net
CN117315025A (en) Mechanical arm 6D pose grabbing method based on neural network
Wang et al. Adaptive environment modeling based reinforcement learning for collision avoidance in complex scenes
Fu et al. Multi-behavior fusion based potential field method for path planning of unmanned surface vessel
Castaño et al. Trajectory tracking control of rowing pectoral fin-actuated robotic fish
CN115407768A (en) Underwater robot marine organism efficient fishing path planning method
CN116852347A (en) State estimation and decision control method for non-cooperative target autonomous grabbing
CN112124537B (en) Intelligent control method for underwater robot for autonomous absorption and fishing of benthos
Liu et al. UUV path planning method based on QPSO
CN115562313A (en) Autonomous underwater vehicle motion control method for pier flaw detection
Yang et al. Research on dynamic path planning of auv based on forward looking sonar and fuzzy control
Kim et al. Simulation and feasibility test of mini-rovs with auv for the manipulation purpose

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant