CN101835239A - Multi-path delay sensing optimal route selecting method for cognitive network - Google Patents

Multi-path delay sensing optimal route selecting method for cognitive network Download PDF

Info

Publication number
CN101835239A
CN101835239A CN201010120758A CN201010120758A CN101835239A CN 101835239 A CN101835239 A CN 101835239A CN 201010120758 A CN201010120758 A CN 201010120758A CN 201010120758 A CN201010120758 A CN 201010120758A CN 101835239 A CN101835239 A CN 101835239A
Authority
CN
China
Prior art keywords
value
node
bag
path
cognitive
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201010120758A
Other languages
Chinese (zh)
Other versions
CN101835239B (en
Inventor
盛敏
乐天助
史琰
李建东
李红艳
龙春燕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN2010101207580A priority Critical patent/CN101835239B/en
Publication of CN101835239A publication Critical patent/CN101835239A/en
Application granted granted Critical
Publication of CN101835239B publication Critical patent/CN101835239B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention discloses a multi-path delay sensing optimal route selecting method for a cognitive network, which comprises the following steps of: dividing service into different classes; establishing a plurality of paths through route discovery; adopting an end-to-end delay recorded in the route discovery process as an initial value of a Q value; updating the Q value of the path by utilizing a Q learning algorithm, and introducing the estimation of node queue delay and the estimation of channel contending delay during updating; selecting an activating path according to the Q value to send a data package; reducing route control packet overhead by utilizing the Q learning algorithm; and when the plurality of paths cannot meet the requirement of QoS of the service, beginning the process again. The method has the advantages of quick transmission of advanced service, short path delay, high routing efficiency and high network load bearing capacity, and can be used for a cognitive wireless network.

Description

Multi-path delay sensing optimal route selecting method for cognitive network
Technical field
The invention belongs to wireless communication technology field, relate to cognitive multipath optimal route selecting method, be used for cognition wireless network.
Background technology
Cognition network is by adopting suitable study mechanism, as the intensified learning algorithm, can be under the situation that can't obtain complete environmental information, the current state of sensing network, and each parameter of network is reshuffled according to the state that perceives, and then adapt to the network environment that constantly changes, improve the performance of network.The Q learning algorithm can utilize the environment award to seek and carry out optimum behavior under the environmental model condition of unknown as a kind of intensified learning algorithm.Document " Cognitive Network Management with Reinforcement Learning for WirelessMesh Networks " has proposed the Q learning algorithm is applied to reduce the wireless network method for routing that route is controlled expense.This method can autonomous learning and is predicted the state of network, and according to suitable control expense number under the network state quality configuration corresponding states of institute's perception, thereby reach the purpose that reduces network overhead.But employed path is still the fixed route of traditional routing algorithm in this method, can't be adaptive carries out optimal path selection according to states such as current end-to-end time delay of network and loads.
In the document " Packet Routing In Dynamically Changing Networks:A ReinforcementLearning Approach " a kind of method for routing that the Q learning algorithm is applied to Path selection has been proposed.This method has self-learning capability, can the different offered load of perception, and the path of learning time delay minimum under this load.This method can obtain relative less time delay under the situation of high capacity.But this method still is a more rough study to time delay, and the current state of learning network causes the accuracy of sensing results low more accurately.
Summary of the invention
The objective of the invention is to overcome the shortcoming of above-mentioned prior art, the advantages of above-mentioned two kinds of methods is got up, a kind of multi-path delay sensing optimal route selecting method for cognitive network has been proposed, to set up mulitpath, and utilize path delay of time of Q learning algorithm real-time perception different business grade, select best route to send packet according to sensing results is dynamic in multipath; And in perception path delay of time, introduce the estimation of channel competition time delay and different business grade bag queuing delay, improve the accuracy of sensing results.
The object of the present invention is achieved like this:
One. the term explanation
Cognitive Hello bag: be meant the bag that is used for to neighbor node inquiry Q value.
Cognitive Echo bag: be meant the bag that is used for replying the Q value to neighbor node.
RREQ bag: be meant the routing request packet that source node sends.
RREP bag: be meant the routing reply bag that destination node sends.
Hello bag: be meant the bag that is used to notify the existence of this node of neighbor node.
Metric: be meant route cost.
Activated path: be meant the path that is actually used in transmits data packets in the multipath.
Two. step of the present invention
Realize the object of the invention technical scheme, comprise the steps:
(1) according to type of service the business in the network is divided into different priorities, wherein video traffic is a limit priority, and speech business is secondary priority, and data service is a lowest priority;
(2) in routing table, increase the Q codomain of representing video, the estimation in the path delay of time of 3 service with different priority levels of voice-and-data, and they are initialized as " zero ";
(3) in routing table, increase the activation marker territory of corresponding video, 3 different business priority of voice-and-data, and it is initialized as " vacation ";
(4) in route replies RREP bag, increase the path number territory of delegated path sequence number, and it is initialized as " zero ";
(5) between node, increase the cognition inquiry Cognitive Hello bag that transmits the Q value, and Cognitive Echo bag is replied in cognition;
(6) employing of the source node in the network is broadcasted the mode that floods and is sent the RREQ bag, and writes down the time that this sends the RREQ bag;
Whether, according to this node be destination node determine send mode, if this node is a destination node, then reply the RREP bag if (7) receiving the via node that RREQ wraps, otherwise via node will be transmitted the RREQ bag, and note the time that this transmits this RREQ bag;
(8) destination node is when receiving the RREQ bag, read the source node in the bag, and determine whether to reply the RREP bag, if number of times is more than or equal to 3 according to the number of times that in this route finding process, receives from the RREQ of this source node bag, then abandon this RREQ bag, otherwise reply the RREP bag;
(9) receive the via node that RREP wraps, read the value in the path number territory in the RREP bag, estimate to arrive the time delay value of destination node, and it is recorded in the Q codomain of routing table corresponding to this path number, as the initial value of different priorities Q value;
(10) receive the source node that RREP wraps, estimate to arrive the time delay value of destination node, read the value in the path number territory in the RREP bag, then the time delay estimated value is recorded in the routing table in the Q codomain corresponding to this path number, as the initial value of different priorities Q value, the route of finishing between source node and the destination node is set up;
(11) each node of having set up on the path is replied Cognitive Echo bag transmission Q value by mutual cognitive inquiry Cognitive Hello bag and cognition, and upgrades the Q value of the different priorities in the routing table by following formula:
Q t x ( d , y ) i = ( 1 - α ) Q t - 1 x ( d , y ) i + α ( W i + T contention + min z Q t - 1 y ( d , z ) i )
In the formula, α is the study factor, and scope is 0<α<1;
The subscript i of Q value is the service priority of packet, i=1,2,3;
D represents destination node, and x is a present node, and y represents the neighbors of x, and z represents the neighbors of y;
T represents current time, and t-1 represents one constantly;
Q T-1 y(d, z) iArrive the Q value of destination node for a last moment neighbors that from Cognitive Echo bag, reads;
Q T-1 x(d, y) iBe the Q value that goes up the destination node of carving copy node arrival for the moment that from routing table, reads;
Q t x(d, y) iArrive the Q value of destination node for this node after upgrading;
W iFor system of non-intrusion priority queue medium priority is that the queuing time that wraps in the formation of i is estimated, W i = Σ k = 1 3 λ k X k 2 ‾ 2 ( 1 - ρ 1 - · · · - ρ i - 1 ) ( 1 - ρ 1 - · · · - ρ i ) , In the formula, i is a priority of data packets, i=1,2,3; K is an integer, k=1,2,3; λ kFor priority is the data packet arrival rate of k; X k 2It for priority the second order distance of average service time of the packet of k; ρ iUtilance for the bag of priority i;
T ContentionFor the time based on the channel competition mean consumption of 802.11 standard channels is estimated T Contention=(1-P Tr) σ+P TrP sT s+ P Tr(1-P s) T c, in the formula, σ is the unit slot length, T sFor channel is used for successfully transmitting the time of being experienced, T cBe the time that channel bumps and experienced, P TrFor having at least a node to send the probability of bag in n neighbors in any time slot, n is the neighbors number of this node, P Tr=1-(1-τ) n, wherein, τ is that arbitrary node is sending the probability that wraps, P in the time slot arbitrarily sFor bag in any time slot sends probability of successful, P s = nτ ( 1 - τ ) n - 1 1 - ( 1 - τ ) n ;
(12) source node as activated path, is used to send packet according to the paths in the Q value selection multipath of real-time update;
(13) source node adopts the Q learning algorithm to reset the route lifetime of hello cycle and this route;
(14), then forward step (6) to and initiate new route finding process if the Q value in the routing table of source node all can't satisfy the QoS time delay demand of the packet priority that will send.
The present invention compared with prior art has following advantage:
The present invention since each source node destination node between set up and safeguard mulitpath simultaneously, and adopt the Q learning algorithm time delay in these paths is learnt and to be predicted, from multipath, select best path dynamically for use according to study and prediction result, make method for routing to adjust employed path, guaranteed the low time delay of network according to the dynamic change of network environment;
2. the present invention is owing to respectively the time delay of different priorities data being learnt and being predicted, more rational is the suitable path of data selection of different priorities, make the network can be according to the rational Resources allocation of the data of different priorities, balance network load improves network performance;
3. the present invention has added the prediction to node queue's length because when Q value is upgraded, and learns and has predicted the influence of node queue's length to time delay, makes node avoid the bigger node of queue length when the selection path, has reduced the path delay of time;
4. the present invention is because when upgrading the Q value, added prediction, made node when Path selection, avoid the zone of some channel competition fiercenesses, reduced the path delay of time the channel competition time delay, alleviate congestion to a certain extent simultaneously, balance the load of node in the network;
5. the present invention is owing to adopt the state of Q learning algorithm prediction network, and the route lifetime and the hello cycle of reasonably configuration oneself, can effectively reduce network overhead.
Description of drawings
Fig. 1 is a cognitive route theory diagram of the present invention;
Fig. 2 is a routing table entry schematic diagram of the present invention;
Fig. 3 is a route requests RREQ bag schematic diagram of the present invention;
Fig. 4 is a routing reply RREP bag schematic diagram of the present invention;
Fig. 5 is the cognitive inquiry of a present invention Cognitive Hello bag schematic diagram;
Fig. 6 is that Cognitive Echo bag schematic diagram is replied in cognition of the present invention;
Fig. 7 is an example schematic of the present invention.
Embodiment
With reference to Fig. 1, specific implementation of the present invention comprises the steps:
Step 1 is divided into different brackets with the business in the network.
According to type of service the business in the network is divided into different priorities, wherein video traffic is divided into limit priority, and speech business is divided into secondary priority, and data service is divided into lowest priority, and these different business have different service quality QoS requirements for network.
Step 2 increases Q codomain and activation marker territory in routing table, increase the path number territory in the RREP bag, introduces Cognitive Hello bag and Cognitive Echo bag and is used for transmitting between the node Q value.
Described Q codomain is divided into three subdomains, and the Q value of corresponding video, 3 different business priority of voice-and-data is initialized as " zero " respectively;
Described activation marker territory also is divided into 3 subdomains, and the video of respectively corresponding this path correspondence, the activation marker of 3 different business priority of voice-and-data are initialized as " vacation ".
Step 3, the mode that the source node in the network adopts broadcasting to flood send the RREQ bag.
Source node is noted the time that this sends RREQ when sending the RREQ bag, and the channel that adopts 802.11 standards to provide when broadcasting carries out.
Step 4, via node are transmitted the RREQ bag.
Whether via node is that destination node is determined send mode according to this node, if this node is a destination node, then replys the RREP bag, otherwise via node will be transmitted the RREQ bag, and notes the time of transmitting RREQ.
Step 5, destination node are replied the RREP bag.
(5a) destination node is when receiving the RREQ bag, read the source node in the bag, and determine whether to reply the RREP bag, if number of times is more than or equal to 3 according to the number of times that in this route finding process, receives from the RREQ of this source node bag, then abandon this RREQ bag, otherwise reply the RREP bag;
When (5b) replying RREP, destination node is inserted the number of times of answer in the path number territory of RREP, and this number of times is this and replys the path number of setting up;
(5c) behind the answer RREP, reply number of times and add 1.
Step 6, via node are transmitted the RREP bag, and upgrade the routing table entry that arrives destination node.
(6a) via node reads the value in the path number territory in the RREP bag;
(6b) node time that will receive RREP deduct before time of the corresponding RREQ of transmission of record, and the gained result obtained the time delay estimated value of via node to destination node divided by 2;
(6c) the time delay estimated value is recorded in the Q codomain of routing table corresponding to this path number, as the initial value of the Q value subdomain of different priorities.
Step 7, source node are handled the RREP bag, upgrade the routing table entry that arrives destination node, finish the initialization procedure of path foundation and Q value.
(7a) read the value in the path number territory in the RREP bag;
(7b) receive RREP bag at every turn after, on number of path, add 1;
(7c) node time that will receive RREP deduct before record the corresponding RREQ of transmission time and divided by 2, obtain the time delay estimated value of via node to destination node;
(7d) the time delay estimated value is recorded in the Q codomain of routing table corresponding to this path number, as the initial value of the Q value subdomain of different priorities.
Step 8 is utilized the Q learning algorithm to carry out path Q value and is upgraded.
(8a) on the multipath of having set up, each node constructed a Cognitive Hello bag every one second, and insert this node address in the source node territory of Cognitive Hello bag as source node address, the destination node address in path is inserted in the destination node territory of Cognitive Hello bag, next-hop node to the path sends Cognitive Hello bag, and inquiry arrives the Q value of destination node; Each node is when receiving Cognitive Hello bag, read the source node of Cognitive Hello bag and the destination node in path, structure Cognitive Echo bag, the Q value that is recorded in the different priorities of respective path destination node in the local routing table is inserted in the bag, and Cognitive Echo bag is replied to the source node of Cognitive Hello bag; At last, each node reads the Q value from bag when receiving Cognitive Echo bag;
(8b) each node upgrades the Q value as follows after getting access to the Q value of neighbors:
At first, estimate that system of non-intrusion priority queue medium priority is the queuing time in the formation of wrapping in of i:
W i = Σ k = 1 3 λ k X k 2 ‾ 2 ( 1 - ρ 1 - · · · - ρ i - 1 ) ( 1 - ρ 1 - · · · - ρ i )
Wherein, i is a priority of data packets, i=1,2,3; K is an integer, k=1,2,3; λ kFor priority is the data packet arrival rate of k, X k 2For priority is the packet average service time second order distance of k; ρ iUtilance for the bag of priority i;
Secondly, compete the time of mean consumption based on 802.11 standard channel predicted channel:
T contention=(1-P tr)σ+P trP sT s+P tr(1-P s)T c
In the formula, σ is the unit slot length, T sFor channel is used for successfully transmitting the time of being experienced, T cBe the time that channel bumps and experienced, P TrFor having at least a node to send the probability of bag in n neighbors in any time slot, n is the neighbors number of this node, P Tr=1-(1-τ) n, wherein, τ is that arbitrary node is sending the probability that wraps, P in the time slot arbitrarily sFor bag in any time slot sends probability of successful, P s = nτ ( 1 - τ ) n - 1 1 - ( 1 - τ ) n ;
At last, with W iAnd T ContentionTwo value substitution Q values more in the new formula, carry out upgrading according to following more new formula to the Q value:
Q t x ( d , y ) i = ( 1 - α ) Q t - 1 x ( d , y ) i + α ( W i + T contention + min z Q t - 1 y ( d , z ) i )
Wherein, α is the study factor, and scope is 0<α<1; The subscript i of Q value is the priority of bag, i=1,2,3; D represents destination node; X is a present node; Y represents the neighbors of x; Z represents the neighbors of y; T represents current time; T-1 represents one constantly; Q T-1 y(d, z) iArrive the Q value of destination node for a last moment neighbors that from Cognitive Echo bag, reads; Q T-1 x(d, y) iBe the Q value that goes up the destination node of carving copy node arrival for the moment that from routing table, reads; Q t x(d, y) iArrive the Q value of destination node for this node after upgrading;
(8c) the Q value after will upgrading is inserted in the Q codomain of routing table corresponding to this path.
Step 9, source node selects a paths as activated path according to the Q value of real-time update, is used to send packet.
(9a) utilize that paths of setting up at first as the activated path transmits data packets when initial between source node and the destination node, and 3 activation markers under this path number in the routing table all are set to " very ", the activation marker under other path number all is set to " vacation ";
(9b) in the multipath of having set up, each paths is all being safeguarded video, the Q value of 3 service with different priority levels of voice-and-data, source node compares respectively by service priority the Q value in different paths, if after double renewal, certain paths all keeps minimum corresponding to the Q value of a certain priority service, then select this path as activated path, be used to send the packet of this priority, in the source node routing table activation marker under should the described priority in path is set to " very ", in the routing table other path under described priority activation marker be set to " vacation ";
(9c) when source node has packet to send, then select activation marker in the routing table to be used to send this packet for the path of " very " according to priority of data packets and destination node.
Step 10, source node adopts the Q learning algorithm to reset the route lifetime of hello cycle and this route according to the situation of change of Q value in the routing table.
(10a) source node is averaged the video of every paths, the Q value of 3 different priorities of voice-and-data respectively after receiving that Cognitive Echo when bag upgrade the Q value of routing table, obtains the time delay estimated value T in this path Est
(10b) according to above-mentioned time delay estimated value T Est, calculate normalization estimated value in path delay of time γ by following formula:
γ=T est/ete max
In the formula, ete MaxThe maximum of the end-to-end time delay that the expression network allows;
(10c) according to above-mentioned normalization estimated value in path delay of time γ, to describing the Q of network stabilization sValue and instable Q UnsValue is upgraded respectively, the network stabilization Q after upgrade s[t] value and instable Q Uns[t] value:
Q s [ t ] = ∂ * Q s [ t - 1 ] + ( 1 - ∂ ) γ
Q uns [ t ] = ∂ * Q uns [ t - 1 ] + 1 - ∂ γ
In the formula, Q s[t-1] expression node is at the Q of t-1 moment network stabilization sValue; Q s[t] expression node is at the Q of t moment network stabilization sValue; Q Uns[t-1] expression node is at the Q of t-1 moment network instability UnsValue; Q Uns[t] expression node is at the Q of t moment network instability UnsValue;
Figure GSA00000050719300084
Be meant the study factor, span is 0 &le; &PartialD; < 1 ;
(10d) source node is carried out different operating according to upgrading the result, works as Q s[t]>Q UnsWhen [t], perceive the network state instability, reduce the route lifetime and the hello cycle of this route; Work as Q s[t]<Q UnsWhen [t], perceive network state and stablize the route lifetime and the hello cycle that increase this route.
Step 11, node if the Q value in the routing table of source node all can't satisfy the QoS time delay demand of this packet priority, are then initiated new route finding process when sending packet.
According to the performing step of the invention described above, provide following example:
With reference to Fig. 7, small arrow in the example of the present invention on the path represents between each node that mutual cognitive inquiry CognitiveHello bag and cognition reply Cognitive Echo bag and transmit Q value, and CH that marks on the small arrow and CE represent that respectively Cognitive Hello wraps and Cognitive Echo wraps.The selected path that provides and the situation of change of route lifetime have been marked as time passes on the time shaft among Fig. 7.Concrete process prescription is as follows:
Suppose certain constantly source node S priority is arranged is that 2 speech business will send to destination node D.The source node S table of query and routing, discovery does not arrive the route entry of destination node D, then initiates route finding process.Successively set up three paths through the route finding process source node and arrived destination node D, be respectively path S-A-D, path S-B-C-D and S-E-F-D.On above-mentioned three paths, each node wrapped with the mutual Cognitive Hello of its neighbors bag and Cognitive Echo every one second and transmits the Q value, then according to the new video more respectively of new formula more, the Q value of voice-and-data business, the video traffic Q1 value after obtaining upgrading, speech business Q2 value and data service Q3 value.And in 0 to 1 second time, select the S-A-D path as initial value.If at 0 to 2 second therebetween, the Q2 value of path S-B-C-D remains minimum, and then node S will select path S-B-C-D to come the transmitting audio data bag as activated path.In the section, the average (Q1+Q2+Q3)/3 of the different business priority Q value of path S-A-D increases, and then thinks the network environment variation to reduce route lifetime and Hello cycle, as the T of route lifetime from Fig. 7 between supposing at this moment (0,2)Be reduced to T (2,4)Pass in time, route lifetime and activated path that the present invention adopts all can dynamically be adjusted with the variation of network environment.

Claims (4)

1. a multi-path delay sensing optimal route selecting method for cognitive network comprises the steps:
(1) according to type of service the business in the network is divided into different priorities, wherein video traffic is a limit priority, and speech business is secondary priority, and data service is a lowest priority;
(2) in routing table, increase the Q codomain of representing video, the estimation in the path delay of time of 3 service with different priority levels of voice-and-data, and they are initialized as " zero ";
(3) in routing table, increase the activation marker territory of corresponding video, 3 different business priority of voice-and-data, and it is initialized as " vacation ";
(4) in route replies RREP bag, increase the path number territory of delegated path sequence number, and it is initialized as " zero ";
(5) between node, increase the cognition inquiry Cognitive Hello bag that transmits the Q value, and Cognitive Echo bag is replied in cognition;
(6) employing of the source node in the network is broadcasted the mode that floods and is sent the RREQ bag, and writes down the time that this sends the RREQ bag;
Whether, according to this node be destination node determine send mode, if this node is a destination node, then reply the RREP bag if (7) receiving the via node that RREQ wraps, otherwise via node will be transmitted the RREQ bag, and note the time that this transmits this RREQ bag;
(8) destination node is when receiving the RREQ bag, read the source node in the bag, and determine whether to reply the RREP bag, if number of times is more than or equal to 3 according to the number of times that in this route finding process, receives from the RREQ of this source node bag, then abandon this RREQ bag, otherwise reply the RREP bag;
(9) receive the via node that RREP wraps, read the value in the path number territory in the RREP bag, estimate to arrive the time delay value of destination node, and it is recorded in the Q codomain of routing table corresponding to this path number, as the initial value of different priorities Q value;
(10) receive the source node that RREP wraps, estimate to arrive the time delay value of destination node, read the value in the path number territory in the RREP bag, then the time delay estimated value is recorded in the routing table in the Q codomain corresponding to this path number, as the initial value of different priorities Q value, the route of finishing between source node and the destination node is set up;
(11) each node of having set up on the path is replied Cognitive Echo bag transmission Q value by mutual cognitive inquiry Cognitive Hello bag and cognition, and upgrades the Q value of the different priorities in the routing table by following formula:
Q t x ( d , y ) i = ( 1 - &alpha; ) Q t - 1 x ( d , y ) i + &alpha; ( W i + T contention + min z Q t - 1 y ( d , z ) i )
In the formula, α is the study factor, and scope is 0<α<1;
The subscript i of Q value is the service priority of packet, i=1,2,3;
D represents destination node, and x is a present node, and y represents the neighbors of x, and z represents the neighbors of y;
T represents current time, and t-1 represents one constantly;
Q T-1 y(d, z) iArrive the Q value of destination node for a last moment neighbors that from Cognitive Echo bag, reads;
Q T-1 x(d, y) iBe the Q value that goes up the destination node of carving copy node arrival for the moment that from routing table, reads;
Q t x(d, y) iArrive the Q value of destination node for this node after upgrading;
W iFor system of non-intrusion priority queue medium priority is that the queuing time that wraps in the formation of i is estimated, W i = &Sigma; k = 1 3 &lambda; k X k 2 &OverBar; 2 ( 1 - &rho; 1 - &CenterDot; &CenterDot; &CenterDot; - &rho; i - 1 ) ( 1 - &rho; 1 - &CenterDot; &CenterDot; &CenterDot; - &rho; i ) , In the formula, i is a priority of data packets, i=1,2,3; K is an integer, k=1,2,3; λ kFor priority is the data packet arrival rate of k; X k 2It for priority the second order distance of average service time of the packet of k; ρ iUtilance for the bag of priority i;
T ContentionFor the time based on the channel competition mean consumption of 802.11 standard channels is estimated T Contention=(1-P Tr) σ+P TrP sT s+ P Tr(1-P s) T c, in the formula, σ is the unit slot length, T sFor channel is used for successfully transmitting the time of being experienced, T cBe the time that channel bumps and experienced, P TrFor having at least a node to send the probability of bag in n neighbors in any time slot, n is the neighbors number of this node, P Tr=1-(1-τ) n, wherein, τ is that arbitrary node is sending the probability that wraps, P in the time slot arbitrarily sFor bag in any time slot sends probability of successful, P s = n&tau; ( 1 - &tau; ) n - 1 1 - ( 1 - &tau; ) n ;
(12) source node as activated path, is used to send packet according to the paths in the Q value selection multipath of real-time update;
(13) source node adopts the Q learning algorithm to reset the route lifetime of hello cycle and this route;
(14), then forward step (6) to and initiate new route finding process if the Q value in the routing table of source node all can't satisfy the QoS time delay demand of the packet priority that will send.
2. cognitive network route method according to claim 1, wherein step (11) described on the multipath of having set up node reply Cognitive Echo bag transmission Q value by mutual cognitive inquiry Cognitive Hello bag and cognition, carry out according to following steps:
(2a) node of each on the path was every one second structure Cognitive Hello bag, just this node address is inserted in the source node territory of Cognitive Hello bag as source node address, the destination node address in path is inserted in the destination node territory of Cognitive Hello bag, the next-hop node to the path sends the Q value that the inquiry of Cognitive Hello bag arrives destination node;
(2b) node is when receiving Cognitive Hello bag, read the source node of bag and the destination node in path, structure Cognitive Echo bag, the Q value that is recorded in the different priorities of respective path destination node in the local routing table is inserted in the bag, and Cognitive Echo bag is replied to the source node of Cognitive Hello bag;
(2c) each node reads the Q value from bag when receiving Cognitive Echo bag.
3. cognitive network route method according to claim 1, wherein the described source node of step (12) selects a paths as activated path according to the Q value of real-time update, is used to send packet, carries out according to following steps:
(3a) utilize that paths of setting up at first as the activated path transmits data packets when initial between source node and the destination node, and 3 activation markers under this path number in the routing table all are set to " very ", the activation marker under other path number all is set to " vacation ";
(3b) in the multipath of having set up, each paths is all being safeguarded video, the Q value of 3 service with different priority levels of voice-and-data, source node compares respectively by service priority the Q value in different paths, if after double renewal, certain paths all keeps minimum corresponding to the Q value of a certain priority service, then select this path as activated path, be used to send the packet of this priority, in the source node routing table activation marker under should the described priority in path is set to " very ", in the routing table other path under described priority activation marker be set to " vacation ";
(3c) when source node has packet to send, then select activation marker in the routing table to be used to send this packet for the path of " very " according to priority of data packets and destination node.
4. cognitive network route method according to claim 1, wherein the described source node of step (10) adopts the Q learning algorithm to reset the route lifetime of hello cycle and this route, carries out according to following steps:
(4a) source node is averaged the video of every paths, the Q value of 3 different priorities of voice-and-data respectively after receiving that Cognitive Echo when bag upgrade the Q value of routing table, obtains the time delay estimated value T in this path Est
(4b) according to above-mentioned time delay estimated value T Est, calculate normalization estimated value in path delay of time γ by following formula:
γ=T est/ete max
In the formula, ete MaxThe maximum of the end-to-end time delay that the expression network allows;
(4c) according to above-mentioned normalization estimated value in path delay of time γ, to describing the Q of network stabilization sValue and instable Q UnsValue is upgraded respectively, the network stabilization Q after upgrade s[t] value and instable Q Uns[t] value:
Q s [ t ] = &PartialD; * Q s [ t - 1 ] + ( 1 - &PartialD; ) &gamma;
Q uns [ t ] = &PartialD; * Q uns [ t - 1 ] + 1 - &PartialD; &gamma;
In the formula, Q s[t-1] expression node is at the Q of t-1 moment network stabilization sValue; Q s[t] expression node is at the Q of t moment network stabilization sValue; Q Uns[t-1] expression node is at the Q of t-1 moment network instability UnsValue; Q Uns[t] expression node is at the Q of t moment network instability UnsValue;
Figure FSA00000050719200044
Be meant the study factor, span is 0 &le; &PartialD; < 1 ;
(4d) source node is carried out different operating according to upgrading the result, works as Q s[t]>Q UnsWhen [t], perceive the network state instability, reduce the route lifetime and the hello cycle of this route; Work as Q s[t]<Q UnsWhen [t], perceive network state and stablize the route lifetime and the hello cycle that increase this route.
CN2010101207580A 2010-03-09 2010-03-09 Multi-path delay sensing optimal route selecting method for cognitive network Expired - Fee Related CN101835239B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010101207580A CN101835239B (en) 2010-03-09 2010-03-09 Multi-path delay sensing optimal route selecting method for cognitive network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010101207580A CN101835239B (en) 2010-03-09 2010-03-09 Multi-path delay sensing optimal route selecting method for cognitive network

Publications (2)

Publication Number Publication Date
CN101835239A true CN101835239A (en) 2010-09-15
CN101835239B CN101835239B (en) 2012-08-15

Family

ID=42719120

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010101207580A Expired - Fee Related CN101835239B (en) 2010-03-09 2010-03-09 Multi-path delay sensing optimal route selecting method for cognitive network

Country Status (1)

Country Link
CN (1) CN101835239B (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101951346A (en) * 2010-10-27 2011-01-19 迈普通信技术股份有限公司 SIP (Session Initiation Protocol) relay routing method and system thereof
CN102170660A (en) * 2011-05-11 2011-08-31 北京交通大学 Multi-path route constructing method and system based on APDV (Ad Hoc On-demand Distance Vector)
CN102547461A (en) * 2011-12-27 2012-07-04 圆刚科技股份有限公司 System for receiving audio and video and external website data, wireless data distributor and method
CN102739546A (en) * 2012-06-29 2012-10-17 浙江大学 Agricultural internet of things routing method based on resource competition mode
CN103491509A (en) * 2013-09-29 2014-01-01 北京天公瑞丰科技有限公司 Micro-power wireless communication node and micro-power wireless communication network
CN104780566A (en) * 2014-01-14 2015-07-15 中兴通讯股份有限公司 Method and device for realizing flow balance of wireless router terminal
CN105897585A (en) * 2016-04-11 2016-08-24 电子科技大学 Q learning packet transmission method based on delay constraints for ad hoc network
CN106376088A (en) * 2016-11-29 2017-02-01 重庆邮电大学 Channel competition based resource-pool resource distribution method in vehicle-vehicle communication
CN106656807A (en) * 2015-11-02 2017-05-10 华为技术有限公司 Message forwarding method and SDN switch
CN107548120A (en) * 2017-08-11 2018-01-05 北京航空航天大学 Face the method for routing and device of sky communication network
CN108449151A (en) * 2018-05-02 2018-08-24 河海大学常州校区 Frequency spectrum access method in a kind of cognitive radio networks based on machine learning
CN108924051A (en) * 2018-07-04 2018-11-30 河南科技大学 Information centre's network interest package transmitting method based on enhancing study
CN108966309A (en) * 2018-09-20 2018-12-07 电子科技大学 A kind of method for intelligently routing based on intensified learning
CN111356197A (en) * 2019-12-11 2020-06-30 航天行云科技有限公司 Path switching method of multi-node routing and related equipment
CN111988225A (en) * 2020-08-19 2020-11-24 西安电子科技大学 Multi-path routing method based on reinforcement learning and transfer learning

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101057511A (en) * 2004-11-05 2007-10-17 网状网络公司 System and method for providing a congestion-aware routing metric for selecting a route between nodes in a multihopping communication network
CN101350774A (en) * 2008-07-28 2009-01-21 北京邮电大学 Route selection method for intelligent self-perception optical network base on network status
CN101588621A (en) * 2009-06-30 2009-11-25 北京科技大学 Random routing method based on risk assessment in multi-hop cognition wireless network

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101057511A (en) * 2004-11-05 2007-10-17 网状网络公司 System and method for providing a congestion-aware routing metric for selecting a route between nodes in a multihopping communication network
CN101350774A (en) * 2008-07-28 2009-01-21 北京邮电大学 Route selection method for intelligent self-perception optical network base on network status
CN101588621A (en) * 2009-06-30 2009-11-25 北京科技大学 Random routing method based on risk assessment in multi-hop cognition wireless network

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101951346A (en) * 2010-10-27 2011-01-19 迈普通信技术股份有限公司 SIP (Session Initiation Protocol) relay routing method and system thereof
CN102170660A (en) * 2011-05-11 2011-08-31 北京交通大学 Multi-path route constructing method and system based on APDV (Ad Hoc On-demand Distance Vector)
CN102170660B (en) * 2011-05-11 2013-10-30 北京交通大学 Multi-path route constructing method and system based on APDV (Ad Hoc On-demand Distance Vector)
CN102547461A (en) * 2011-12-27 2012-07-04 圆刚科技股份有限公司 System for receiving audio and video and external website data, wireless data distributor and method
CN102547461B (en) * 2011-12-27 2013-08-14 圆刚科技股份有限公司 System for receiving audio and video and external website data, wireless data distributor and method
CN102739546A (en) * 2012-06-29 2012-10-17 浙江大学 Agricultural internet of things routing method based on resource competition mode
CN103491509A (en) * 2013-09-29 2014-01-01 北京天公瑞丰科技有限公司 Micro-power wireless communication node and micro-power wireless communication network
CN103491509B (en) * 2013-09-29 2016-03-30 北京天公瑞丰科技有限公司 A kind of micro power radio communication node and a kind of micro power radio communication network
CN104780566A (en) * 2014-01-14 2015-07-15 中兴通讯股份有限公司 Method and device for realizing flow balance of wireless router terminal
WO2015106526A1 (en) * 2014-01-14 2015-07-23 中兴通讯股份有限公司 Method and apparatus for wireless routing terminal achieving flow equalizing
CN104780566B (en) * 2014-01-14 2019-09-24 南京中兴新软件有限责任公司 A kind of wireless router terminal realizes the method and device of flow equalization
CN106656807A (en) * 2015-11-02 2017-05-10 华为技术有限公司 Message forwarding method and SDN switch
CN105897585A (en) * 2016-04-11 2016-08-24 电子科技大学 Q learning packet transmission method based on delay constraints for ad hoc network
CN105897585B (en) * 2016-04-11 2019-07-23 电子科技大学 A kind of Q study block transmission method of the self-organizing network based on delay constraint
CN106376088B (en) * 2016-11-29 2019-05-31 重庆邮电大学 Resource pool resource allocation methods in truck traffic based on channel competition
CN106376088A (en) * 2016-11-29 2017-02-01 重庆邮电大学 Channel competition based resource-pool resource distribution method in vehicle-vehicle communication
CN107548120B (en) * 2017-08-11 2020-07-03 北京航空航天大学 Routing method and device of air-to-air communication network
CN107548120A (en) * 2017-08-11 2018-01-05 北京航空航天大学 Face the method for routing and device of sky communication network
CN108449151A (en) * 2018-05-02 2018-08-24 河海大学常州校区 Frequency spectrum access method in a kind of cognitive radio networks based on machine learning
CN108449151B (en) * 2018-05-02 2020-11-17 河海大学常州校区 Spectrum access method in cognitive radio network based on machine learning
CN108924051A (en) * 2018-07-04 2018-11-30 河南科技大学 Information centre's network interest package transmitting method based on enhancing study
CN108924051B (en) * 2018-07-04 2021-03-16 河南科技大学 Information center network interest packet forwarding method based on reinforcement learning
CN108966309A (en) * 2018-09-20 2018-12-07 电子科技大学 A kind of method for intelligently routing based on intensified learning
CN108966309B (en) * 2018-09-20 2021-01-12 电子科技大学 Intelligent routing method based on reinforcement learning
CN111356197A (en) * 2019-12-11 2020-06-30 航天行云科技有限公司 Path switching method of multi-node routing and related equipment
CN111988225A (en) * 2020-08-19 2020-11-24 西安电子科技大学 Multi-path routing method based on reinforcement learning and transfer learning
CN111988225B (en) * 2020-08-19 2022-03-04 西安电子科技大学 Multi-path routing method based on reinforcement learning and transfer learning

Also Published As

Publication number Publication date
CN101835239B (en) 2012-08-15

Similar Documents

Publication Publication Date Title
CN101835239B (en) Multi-path delay sensing optimal route selecting method for cognitive network
CN109547351B (en) Routing method based on Q learning and trust model in Ad Hoc network
US20140016645A1 (en) Communication method and communication apparatus
CN101827415B (en) Channel allocating method and system in wireless mesh network
US8559293B2 (en) Wireless communication apparatus, wireless communication network and method of controlling routing selection
Kasana et al. Fuzzy-based channel selection for location oriented services in multichannel VCPS environments
CN103118413B (en) A kind of IPv6 industrial wireless sensor network method for routing based on deterministic schedule
CN102223671A (en) Method and communication equipment for data transmission in wireless multihop network
CN103826283A (en) Routing method and device for nodes in wireless ad hoc network
CN102572994B (en) Wireless self-organizing network fusing routing generation method based on node characteristics
KR20110050653A (en) A method for discovering high throughput routes in wireless mesh networks
CN101711048A (en) Method for choosing gateway in integrating process of mobile self-organization network and Internet
KR20140027161A (en) Routing method and apparatus for setting optimum multi-hop hybrid v-mimo transmission path for wireless ad hoc network
CN101459949B (en) Route establishing method
CN105577547A (en) Multi-Qos based route selection method in mobile self-organizing network
CN105580437A (en) Discovering stable routes in wireless networks
US20100142510A1 (en) Communication apparatus and method for enhancing reliability in low power time division access method of sensor network
CN102143549B (en) Cognitive routing protocol for heterogeneous wireless return network
Jaiswal et al. An optimal QoS-aware multipath routing protocol for IoT based wireless sensor networks
Ahmad et al. Efficient AODV routing based on traffic load and mobility of node in MANET
CN101895952B (en) Multi-route establishment method and parallel data transmission method of wireless sensor network
CN102781060B (en) A kind of method, forward node and wireless network realizing route in the wireless network
CN101754321B (en) Cognitive routing method based on MIMO link
CN107484207A (en) Combine topology control and channel distribution load-balancing method in wireless sensor network
CN104053208A (en) Route method and device based on channel allocation in wireless ad hoc network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120815

Termination date: 20180309

CF01 Termination of patent right due to non-payment of annual fee