CN110430547A

CN110430547A - More AUV collaboration data collection algorithms in UASNs based on Q-learning

Info

Publication number: CN110430547A
Application number: CN201910670534.8A
Authority: CN
Inventors: 韩光洁; 宫爱妮; 王皓; 何宇
Original assignee: Changzhou Campus of Hohai University
Current assignee: Changzhou Campus of Hohai University
Priority date: 2019-07-24
Filing date: 2019-07-24
Publication date: 2019-11-08
Anticipated expiration: 2039-07-24
Also published as: CN110430547B

Abstract

The invention discloses more AUV collaboration data collection algorithms in a kind of UASNs based on Q-learning, include the following steps: to select cluster head according to certain condition, and cluster head is added in other node self-adaptings nearby, form node cluster；The distribution of AUV task is carried out based on contract net algorithm is improved；Path planning is carried out based on Q-learning algorithm, AUV completes data collection according to the path of planning.For the present invention by carrying out reasonable task distribution to multiple AUV, the improving AUV of the task completes efficiency, reduces data collection delay；The message level that data packet is considered in data collection carries out preferential collection to emergency data, realizes the fast and effective processing for emergency data；Path planning is carried out to AUV by using Q-learning, reduces the distance to go and energy consumption of AUV.

Description

More AUV collaboration data collection algorithms in UASNs based on Q-learning

Technical field

The invention belongs to water sound sensor network fields, and in particular to more AUV based on Q-learning in a kind of UASNs Collaboration data collection algorithm.

Background technique

Water sound sensor network is a kind of emerging and promising network technology, can be widely applied to answer under water With, such as underwater environment observation, coastline monitor and protect, take precautions against natural calamities, assisting navigation and mine detect.In recent years, underwater wireless Sensor network is widely applied the concern with many advantages by more and more oceanographers because of it, it can be helped Human perception and monitor wide unplumbed marine environment, the generation of monitoring and early warning Oceanic disasters, for exploration, using and Protection marine resources provide important information and support.

The a large amount of perception datas of transmission are responsible on basis one of of the data collection techniques as UASNs, but incident is high Energy consumption problem.Therefore, how underwater environment feature is combined, equilibrium data efficiency of transmission and node energy consumption have far-reaching significance. Larger due to underwater sensor network, the communication range of underwater node is limited and the spies such as network topology dynamic change Point, underwater data collection techniques carry out data collection usually using AUV.

The method for carrying out data collection using AUV at present, is received according to using the quantity of AUV to be broadly divided into single AUV Collection and multiple AUV carry out two class of data collection.

The first kind is to carry out data collection using single AUV, is collected under water by disposing underwater sensor node Relevant information, AUV navigate by water collector node data in a network, and are transmitted to the Sink of the water surface.

Ilyas et al. publishes an article " AEDG:AUV- on " Procedia Computer Science " in 2015 aided Efficient Data Gathering Routing Protocol for Underwater Wireless Sensor Networks " devises a kind of AUV auxiliary high efficient data capture Routing Protocol (AEDG).One AUV is along scheduled Elliptic orbit is mobile, broadcasts a HP to change information, and select gateway node according to RSSI value and the dump energy of node (GNS).Meanwhile member node (MNS) is associated with shortest path by gateway node, and is transmitted data packet by gateway node To AUV.The article " A Multi-hop Approach " that Khan.JU et al. is delivered on " Sensors " in 2016 proposes A kind of AUV cooperative data transmission gathering algorithm divided based on network.In the algorithm, according to voronoi, she generates a general to Sink Whole network plane is divided into multiple Thiessen polygons.Each polygon distributes an AUV, and plans road according to cluster situation Diameter, while AUV carries out cooperative transmission by the way that an agent node is arranged in each region.SEYED MOHAMMAD GHOREYSHI Et al. article " the Mobile Data Gathering With Hop-Constrained that is delivered on " IEEE ACCESS " Clustering in Underwater Sensor Networks " proposes a kind of Large-scale Mobile data based on cluster and adopts Collect (CMDG) scheme, acquires in data and weighed between delay and energy conservation.It is clustered for acoustic sensor and with shortest Stroke covers its cluster head problem and proposes two kinds of effective algorithms, to obtain approximate optimal solution within the shorter calculating time, discusses Influence of the chance variation to ECMDG performance is adopted in terms of energy-delay tradeoff when network working time and existing mobile data When the time of collection scheme compares long, CMDG can effectively keep shorter path length.

Second class is to carry out data collection using multiple AUV, completes relatively large field by the cooperation between multiple AUV The data collection of scape, the situation for being suitble to network range larger.

" the IEEE International Conference on Mechatronics of Yan, ZP et al. in 2017 And Automation (ICMA) " on publish an article " An Improved Multi-AW Patrol Path Planning Method " considers the unreliability of subsurface communication link, by increasing the number in path, so that path generates redundancy, guarantees number According to being able to carry out timely, accurate transmission.Particle swarm algorithm is constrained, two-staged prediction is introduced, realizes the update to particle Monitoring, avoids the generation of infeasible particle.Weidong Zhou et al. was in " Transactions of the in 2019 Institute of Measurement and Control " on article " the Route planning algorithm that delivers for autonomous underwater vehicles based on the hybrid of particle swarm Optimization algorithm and radial basis function " proposes a kind of based on particle group optimizing (PSO) the underwater robot paths planning method that algorithm is combined with radial basis function (RBF).In improved PSO algorithm, adopt It prevents improved PSO algorithm from falling into local optimum with big city's criterion, is carried out using path of the RBF to PSO algorithmic rule flat It is sliding, to avoid falling into local optimum.Bing Sun et al. was in " IEEE Transactions on Cognitive in 2019 And Developmental Systems " on publish an article " Complete Coverage Autonomous Underwater Vehicles Path Planning Based on Glasius Bio-Inspired Neural Network Algorithm For Discrete and Centralized Programming " proposes a kind of Ge Lasi based on discrete centralization programming Bionic neural network (GBNN) algorithm.Basic modeling is carried out based on more AUV all standing problems of grid chart and neural network, then Devise single AUV complete coverage algorithm based on GBNN algorithm, propose the underwater multi-robot all standing based on GBNN algorithm from Centralized planning method is dissipated, devises collisionless path for the underwater navigation of multiple underwater robots.

Data collection is carried out using single AUV to be suitable in the lesser situation of network size, when network size increases, is held It is also easy to produce biggish data collection delay.Therefore corresponding number is completed usually using multiple AUV cooperation when network size is larger According to the task of collection, but multiple AUV collect need reasonable task divide could completion task more efficiently, while each AUV Path planning be also problem research emphasis.

Summary of the invention

Present invention mainly solves the problem of have: the task of multiple AUV is reasonably distributed, guarantee task can be efficient The completion of rate；Efficient path planning is carried out to AUV, the delay and the energy consumption of AUV of data collection is reduced, extends the network longevity Life.

It realizes above-mentioned technical purpose, reaches above-mentioned technical effect, the invention is realized by the following technical scheme:

More AUV collaboration data collection algorithms based on Q-learning in a kind of UASNs, comprise the following steps:

Step 1: node cluster

Under water in wireless sensor network, node random placement picks out cluster from these nodes according to picking rule Head node is responsible for that the data of cluster interior nodes are collected and are integrated；After cluster head is selected, it can not received as the node of cluster head To the statement message from different cluster heads, the node for receiving message sends addition message to nearest cluster head and nearest cluster head is added Form different node clusters；

Step 2: AUV task distribution

After node clustering, regard each cluster as a collection task；In bidding phase, by cluster head as bid person Information on bidding is issued, the size information including cluster head position, data packet waits bidder to submit a tender；

Bidding period, AUV is as prospective tenderer；After AUV receives information on bidding, according to the content of information on bidding and certainly The location of body and energy state, the cost assessing the income of completion task and paying, decide whether to submit a tender, but each The bid quantity of AUV is limited；The successful AUV that submits a tender becomes the competition that bidder participates in task；

Acceptance of the bid stage, cluster head carry out analytical calculation to the information on bidding being collected into, and select satisfied bidder to complete to appoint Business, and sign contract, it is specified that bid person can only sign contract with a bidder, but bidder can be less than cluster Quantity half bid person signing；

Step 3: AUV path planning

After task is distributed, each AUV has been assigned to task, and AUV determines the access order of task cluster, uses Q- Learning algorithm make rational planning for AUV reach task cluster path.

The picking rule of leader cluster node in above-mentioned steps one are as follows: energy is greater than node primary powerTo prevent cluster head mistake It is early dead；The distance between two neighboring cluster head is greater than network deployment widthIt is relatively equal to guarantee that node is formed by cluster size Weighing apparatus；The node for meeting above-mentioned two condition has 40% probability to be picked as cluster head.

In above-mentioned steps two, when cluster head issues information on bidding as bid person, the initial range of transmission of bidding documents is less than node Communication radiusAnd provide bidding documents effective time be task distribute a time slot length, when beyond effective time not yet When bidder's competitive bidding, then former bidding documents failure, the range of transmission that bid person needs to expand bidding documents retransmit bidding documents.

In above-mentioned steps two, the bid quantity of bidder is limited to Pt, be according to itself shape after bidder receives bidding documents State and the cost for completing task carry out comprehensive assessment, decide whether the competitive bidding for participating in the task；It to be thrown after bidder's assessment When target quantity Mt < Pt, bidding documents information can be continued to, otherwise will not be received again.

In above-mentioned steps three, method that AUV determines the access order of task cluster are as follows:

(3-1) cluster head in t time slot broadcasts a beacon, includes cluster head number V in beacon_k, cluster head coordinate C_k(t), itself Maximum communication radius CR_k, the moving range MR of packet information rank DI, node k_kInformation, the message level of data packet is divided into A, b, c three classes, wherein a class is emergence message, needs quickly to handle；

(3-2) if not receiving the beacon from cluster head, the contact probability of AUV and cluster is 0, if AUV receives the letter of cluster head Mark, then the contact probability for calculating the cluster head of the beacon received is P_t(k, A_i)=Pen (d_t(V_k, A_i), CR_k, MR_k, DI)

Wherein d_t(V_k, A_i) indicate the distance between leader cluster node and AUV,

Wherein (A_x, A_y, A_z) indicate t moment AUV position coordinates, (V_x, V_y, V_z) indicate t moment leader cluster node position Coordinate；

After (3-3) AUV obtains the distance between cluster head of each task cluster, compare big with each cluster contact probability It is small, select the maximum cluster head of contact probability as the next target point of oneself；Pay the utmost attention to the packet information grade in beacon Not, the high data of preferential collection levels of information carry out data collection according to the size of contact probability if levels of information is consistent；

After (3-4) determines next target cluster of AUV through the above steps, the road of part is carried out using Q-learning Diameter planning, updates Q-table, selects the movement of Income Maximum；

(3-5) after AUV reaches target cluster, AUV sends acknowledgement frame, cluster, AUV where notifying other clusters AUV at present Position coordinates and it is expected that the residence time；Due to the influence of water flow, the position of node can change, after collecting to AUV, respectively The cluster head of a cluster retransmits beacon to AUV, after AUV receives beacon, repeats step (3-2), (3-3), (3-4).

In above-mentioned steps (3-4), in Q-table renewal process, the setting method of reward value be according to AUV previous state and Current state is set at a distance from target cluster, and it is as follows to update rule:

Wherein, d_t(V_k, A_j) indicate the distance between t moment AUV and target cluster head, d_t-1(V_k, A_j) indicate t-1 moment AUV The distance between target cluster head.

In above-mentioned steps (3-4), select the movement of Income Maximum as follows:

The movement of setting AUV is selected as centered on AUV, by the actual act spatial model A of AUV be defined as it is upper and lower, The discrete movement of left, right, front and rear 6 respectively represents floating, dive, left-hand rotation, right-hand rotation, advance, several fortune of retrogressing of AUV in water Dynamic state.

Compared with existing water sound sensor network method of data capture, good effect possessed by the present invention are as follows:

(1) cluster head selection is more reasonable, and the size of cluster is relatively uniform, and the quantity of cluster is suitable；

(2) traditional contract net algorithm is improved, the task distribution of AUV is more rationally efficient；

(3) levels of information for considering data packet when data collection is carried out, realizes the priority processing to emergency data；

(4) Q-learning algorithm is used when AUV path planning, selects to reward maximum path by Q-table, have Conducive to the cruise path for reducing AUV, the energy consumption of AUV is reduced.

Detailed description of the invention

Fig. 1 is the network model figure in the present invention Jing Guo sub-clustering；

Fig. 2 is the block diagram of AUV task distribution in the present invention；

Fig. 3 is the flow chart of AUV path planning；

Fig. 4 is AUV movement selection schematic diagram.

Specific embodiment

Below in conjunction with the accompanying drawings and embodiment the invention will be further described.

Step 1: node cluster

As shown in Figure 1, under water in wireless sensor network, node random placement, according to picking rule from these nodes In pick out leader cluster node, be responsible for the data of cluster interior nodes are collected and are integrated；After cluster head is selected, do not become cluster head Node will receive the statement message from different cluster heads, receive the node of message and sent to nearest cluster head and message is added is added Nearest cluster head forms different node clusters；

Step 2: AUV task distribution

As shown in Fig. 2, regarding each cluster as a collection task after node clustering；In bidding phase, by cluster head Information on bidding is issued as bid person, the size information including cluster head position, data packet waits bidder to submit a tender；

Step 3: AUV path planning

In above-mentioned steps three, as shown in figure 3, the method that AUV determines the access order of task cluster are as follows:

As shown in figure 4, the movement of setting AUV is selected as centered on AUV, the actual act spatial model A of AUV is defined For 6 discrete movements of up, down, left, right, before and after, respectively represent AUV floating in water, dive, left-hand rotation, right-hand rotation, advance, after Move back several motion states.

Claims

1. more AUV collaboration data collection algorithms in a kind of UASNs based on Q-learning, which is characterized in that include following step It is rapid:

Step 1: node cluster

Under water in wireless sensor network, node random placement picks out cluster head section according to picking rule from these nodes Point is responsible for that the data of cluster interior nodes are collected and are integrated；After cluster head is selected, it not will receive to come as the node of cluster head From the statement message of different cluster heads, the node for receiving message is added nearest cluster head to nearest cluster head transmission addition message and is formed Different node clusters；

Step 2: AUV task distribution

After node clustering, regard each cluster as a collection task；In bidding phase, issued by cluster head as bid person Information on bidding, the size information including cluster head position, data packet wait bidder to submit a tender；

Bidding period, AUV is as prospective tenderer；After AUV receives information on bidding, according to the content of information on bidding and itself institute The position at place and energy state, the cost assessing the income of completion task and paying, decide whether to submit a tender, but each AUV Bid quantity it is limited；The successful AUV that submits a tender becomes the competition that bidder participates in task；

Acceptance of the bid stage, cluster head carry out analytical calculation to the information on bidding being collected into, select satisfied bidder to complete task, and Contract is signed, it is specified that a bid person can only sign contract with a bidder, but a bidder can be with the number less than cluster The bid person of the half of amount contracts；

Step 3: AUV path planning

After task is distributed, each AUV has been assigned to task, and AUV determines the access order of task cluster, uses Q-learning Algorithm make rational planning for AUV reach task cluster path.

2. more AUV collaboration data collection algorithms in UASNs according to claim 1 based on Q-learning, feature It is, the picking rule of leader cluster node in the step 1 are as follows: energy is greater than node primary powerTo prevent cluster head too early It is dead；The distance between two neighboring cluster head is greater than network deployment widthIt is relatively equal to guarantee that node is formed by cluster size Weighing apparatus；The node for meeting above-mentioned two condition has 40% probability to be picked as cluster head.

3. more AUV collaboration data collection algorithms in UASNs according to claim 1 based on Q-learning, feature It is, in the step 2, when cluster head issues information on bidding as bid person, the initial range of transmission of bidding documents is communicated less than node RadiusAnd provide that the effective time of bidding documents is the length that task distributes a time slot, when beyond effective time bid not yet When person's competitive bidding, then former bidding documents failure, the range of transmission that bid person needs to expand bidding documents retransmit bidding documents.

4. more AUV collaboration data collection algorithms in UASNs according to claim 1 based on Q-learning, feature It is, in the step 2, the bid quantity of bidder is limited to Pt, be according to itself state after bidder receives bidding documents And the cost progress comprehensive assessment of task is completed, decide whether the competitive bidding for participating in the task；It to submit a tender after bidder's assessment Quantity Mt < Pt when, bidding documents information can be continued to, otherwise will not be received again.

5. more AUV collaboration data collection algorithms in UASNs according to claim 1 based on Q-learning, feature It is, in the step 3, method that AUV determines the access order of task cluster are as follows:

(3-1) cluster head in t time slot broadcasts a beacon, includes cluster head number V in beacon_k, cluster head coordinate C_k(t), itself is maximum Communication radius CR_k, the moving range MR of packet information rank DI, node k_kInformation, the message level of data packet is divided into a, b, C three classes, wherein a class is emergence message, needs quickly to handle；

(3-2) if not receiving the beacon from cluster head, the contact probability of AUV and cluster is 0, if AUV receives the beacon of cluster head, The contact probability for calculating the cluster head of beacon received is

P_t(k, A_i)=pen (d_t(V_k,A_i),CR_k,MR_k, DI)

Wherein d_t(V_k,A_i) indicate the distance between leader cluster node and AUV,

Wherein (A_x, A_y, A_z) indicate t moment AUV position coordinates, (V_x, V_y, V_z) indicate t moment leader cluster node position coordinates；

After (3-3) AUV obtains the distance between cluster head of each task cluster, compare the size with each cluster contact probability, selects The maximum cluster head of contact probability is selected as the next target point of oneself；The packet information rank in beacon is paid the utmost attention to, it is excellent The high data of levels of information are first collected, if levels of information is consistent, carry out data collection according to the size of contact probability；

After (3-4) determines next target cluster of AUV through the above steps, advised using the path that Q-learning carries out part It draws, updates Q-table, select the movement of Income Maximum；

(3-5) after AUV reaches target cluster, AUV sends acknowledgement frame, the position of cluster, AUV where notifying other clusters AUV at present Coordinate and it is expected that the residence time；Due to the influence of water flow, the position of node can change, after being collected to AUV, each cluster Cluster head retransmit beacon to AUV, after AUV receives beacon, repeat step (3-2), (3-3), (3-4).

6. more AUV collaboration data collection algorithms in UASNs according to claim 5 based on Q-learning, feature It is, in the step (3-4), in Q-table renewal process, the setting method of reward value is according to AUV previous state and to work as Preceding state is set at a distance from target cluster, and it is as follows to update rule:

Wherein, d_t(V_k,A_j) indicate the distance between t moment AUV and target cluster head, d_t-1(V_k,A_j) indicate t-1 moment AUV and mesh Mark the distance between cluster head.

7. more AUV collaboration data collection algorithms in UASNs according to claim 5 based on Q-learning, feature It is, in the step (3-4), selects the movement of Income Maximum as follows:

Setting AUV movement be selected as centered on AUV, by the actual act spatial model A of AUV be defined as upper and lower, left and right, Forward and backward 6 discrete movements respectively represent AUV floating in water, dive, left-hand rotation, right-hand rotation, advance, retreat several movement shapes State.