CN112714064A

CN112714064A - Power line communication network topology control method, device, equipment and medium

Info

Publication number: CN112714064A
Application number: CN202011603518.6A
Authority: CN
Inventors: 施展
Original assignee: Electric Power Dispatch Control Center of Guangdong Power Grid Co Ltd
Current assignee: Electric Power Dispatch Control Center of Guangdong Power Grid Co Ltd
Priority date: 2020-12-29
Filing date: 2020-12-29
Publication date: 2021-04-27

Abstract

The application discloses a power line communication network topology control method, a device, equipment and a medium, wherein the method comprises the following steps: calculating the signal-to-noise ratio of the current link; if the signal-to-noise ratio belongs to the range of a preset signal-to-noise ratio window, calculating the Q value of each node in the current link; acquiring the state and the action of the node at the current moment, and observing the state of the next moment corresponding to the action at the current moment, wherein the action comprises sending information to a neighbor node; selecting the action with the highest immediate report and the state of the next moment corresponding to the action, and updating the Q value; if the updated Q value is not converged, updating the state and the corresponding Q value at the next moment again; and selecting the most reliable node for communication topology selection until the Q values of all the nodes are converged. The method and the device selectively find out the reliable node to carry out communication topology selection.

Description

Power line communication network topology control method, device, equipment and medium

Technical Field

The present application relates to the field of power line communication technologies, and in particular, to a method, an apparatus, a device, and a medium for controlling topology of a power line communication network.

Background

Because users can change the power supply range and the power supply object frequently, the physical topology of the low-voltage distribution network has certain time-varying property. Meanwhile, the load in the low-voltage distribution network is changed frequently, and the change can cause the change of a communication channel and even cause the interruption of a communication link, thereby causing the change of the logic topology of the low-voltage distribution network. This reduces the reliability of the power line communication.

In recent years, researchers tend to use a flattened architecture in the aspect of topology control, and the proposed method has no learning capability, so that the topology cannot be dynamically adjusted according to the change of a scene to a certain extent, and an effective control mechanism is also lacking in the aspect of network self-healing.

In a traditional clustering algorithm, a network is provided with certain associated node sets to form clusters, and each cluster comprises 1 cluster head node and a plurality of member nodes in the cluster. The cluster head nodes are selected through a certain algorithm or rule, information in the cluster is collected, data fusion processing is carried out, and therefore the nodes which coordinate and manage the work of member nodes in the cluster are completed, and the function of forwarding among clusters needs to be achieved.

The non-overlapping clustering algorithm is used for researching and discussing the aspects of selection, establishment, automatic routing and the like of a narrow-band power line communication data logical link of a low-voltage distribution network, and provides a dynamic routing algorithm and a network reconstruction algorithm based on non-overlapping clustering. The algorithm can dynamically establish, maintain and optimize the power line communication network route according to the change of the channel quality, and ensure the effectiveness of the communication network.

The artificial spider web topology control algorithm provides a communication model based on artificial spider webs, a logic topology network formed by the artificial spider webs is used for replacing a part of layer networks which are mainly responsible for communication protocols in a traditional low-voltage power distribution network, a new automatic routing protocol is formulated, and an artificial spider web simulation model is established.

The genetic algorithm is improved, the thought of a graph traversal algorithm and the advantages of the Dijkstra algorithm are fused, the ecological niche technology optimal retention principle is combined, and the ecological niche dynamic environment adaptation method can be well achieved.

Disclosure of Invention

The application provides a topology control method, a device, equipment and a medium of a power line communication network, which can selectively find out reliable nodes to select communication topology.

In view of the above, a first aspect of the present application provides a power line communication network topology control method, including:

s01: calculating the signal-to-noise ratio of the current link;

s02: if the signal-to-noise ratio belongs to the range of a preset signal-to-noise ratio window, calculating the Q value of each node in the current link;

s03: acquiring the state and action of a node at the current moment, and observing the state of the next moment corresponding to the action of the current moment, wherein the action comprises sending information to a neighbor node;

s04: selecting the action with the highest immediate return and the state of the next moment corresponding to the action, and updating the Q value;

s05: if the updated Q value does not converge, the process returns to step S03;

s06: and if the Q values of all the nodes are converged, selecting the most reliable node for communication topology selection.

Optionally, the method further includes:

if the signal-to-noise ratio does not belong to the preset signal-to-noise ratio window range, calculating the communication success rate of the current link;

and judging the reliability of the link according to the communication success rate.

Optionally, the calculating the Q value of each node in the current link includes:

in the formula: s_tIs the state of the node at time t, S_t∈S^*，S^*Representing a set of states, i.e. the current state S_tAll possible states of (a); a is_tFor the node's action at time t, a_tE.g. A, A represents action set; r is instant report at time t; gamma is a discount factor used for determining the relative proportion of delayed return and immediate return, and gamma is more than or equal to 0 and less than or equal to 1; alpha is a learning factor, 0<α≤1；Q_t+1And Q_tQ values of the nodes at t +1 and t are respectively;

representing the maximum Q value at which the node takes action a at t + 1.

Optionally, the selecting the action with the highest immediate report and the state of the next moment corresponding to the action, and updating the Q value includes:

select action a with the highest immediate return_tAnd the state S of the next time corresponding to the action_t+1Updating the Q value:

optionally, if the Q values of all the nodes converge, selecting the most reliable node for communication topology selection includes:

acquiring a first action with optimal instant return under the condition that the Q value of a node is converged, and acquiring an optimal neighbor node corresponding to the first action;

and selecting communication topology according to the optimal node corresponding to each node.

A second aspect of the present application provides a power line communication network topology control apparatus, the apparatus including:

the first calculating unit is used for calculating the signal-to-noise ratio of the current link;

the second calculation unit is used for calculating the Q value of each node in the current link when the signal-to-noise ratio belongs to the range of a preset signal-to-noise ratio window;

the first acquisition unit is used for acquiring the current time state and action of the node and observing the state of the next time corresponding to the execution of the current time action, wherein the action comprises the step of sending information to a neighbor node;

the updating unit is used for selecting the action with the highest instant report and the state of the next moment corresponding to the action, and updating the Q value;

the iteration unit is used for returning to the first acquisition unit when the updated Q value is not converged;

and the topology selection unit is used for selecting the most reliable node for communication topology selection when the Q values of all the nodes are converged.

Optionally, the method further includes:

the third calculating unit is used for calculating the communication success rate of the current link when the signal-to-noise ratio does not belong to the range of the preset signal-to-noise ratio window;

and the judging unit is used for judging the reliability of the link according to the communication success rate.

Optionally, the topology selecting unit includes:

the second acquisition unit is used for acquiring a first action with optimal instant return under the condition that the Q value of the node is converged and acquiring an optimal neighbor node corresponding to the first action;

and the second topology selection unit is used for carrying out communication topology selection according to the optimal node corresponding to each node.

A third aspect of the present application provides a power line communication network topology control device, the device comprising a processor and a memory:

the memory is used for storing program codes and transmitting the program codes to the processor;

the processor is configured to execute the steps of the power line communication network topology control method according to the first aspect, according to instructions in the program code.

A fourth aspect of the present application provides a computer-readable storage medium for storing program code for performing the method of the first aspect.

According to the technical scheme, the method has the following advantages:

the application provides a topology control method of a power line communication network, which calculates the signal-to-noise ratio of a current link; if the signal-to-noise ratio belongs to the range of a preset signal-to-noise ratio window, calculating the Q value of each node in the current link; acquiring the state and the action of the node at the current moment, and observing the state of the next moment corresponding to the action at the current moment, wherein the action comprises sending information to a neighbor node; selecting the action with the highest immediate report and the state of the next moment corresponding to the action, and updating the Q value; if the updated Q value is not converged, updating the state and the corresponding Q value at the next moment again; and selecting the most reliable node for communication topology selection until the Q values of all the nodes are converged.

According to the method and the device, the reliability of the link can be judged by utilizing the Q learning algorithm model under the condition that the power line communication network is not stable, and the reliable node can be selectively found out so as to select the communication topology mode. Q learning is carried out only in the range of a preset signal-to-noise ratio window by setting the signal-to-noise ratio window to judge the reliability of the link; and the values lower than the window interval and higher than the window interval directly determine the reliability of the link according to the threshold value, thereby reducing the Q learning calculation amount.

Drawings

Fig. 1 is a flowchart of a method of an embodiment of a topology control method for a power line communication network according to the present application;

fig. 2 is a flowchart of a method of another embodiment of a topology control method of a power line communication network according to the present application;

fig. 3 is a device structure diagram of an embodiment of a topology control device of a power line communication network according to the present application.

Detailed Description

In order to make the technical solutions of the present application better understood, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

Fig. 1 is a flowchart of a method of an embodiment of a topology control method for a power line communication network according to the present application, as shown in fig. 1, where fig. 1 includes:

s01: calculating the signal-to-noise ratio of the current link;

it should be noted that, the signal-to-noise ratio of the current link is calculated, and the present application may determine whether the current link is a reliable link through the signal-to-noise ratio.

it should be noted that, if the calculated signal-to-noise ratio falls within the preset signal-to-noise ratio window range, the Q value of each node in the link is calculated, and the calculation method of the Q value of the node is as follows:

in the formula: s_tIs the state of the node at time t, S_te.S (S represents a state set); a is_tFor the node's action at time t, a_te.A (A represents the action set); r is instant report at time t; γ ∈ [ s, a) is a discount factor for determining the relative proportion of delayed return to immediate return, γ ≦ 1 being 0 ≦ γ, if γ is 1, then being as important for future rewards (delayed returns) and current rewards (immediate returns); if gamma is equal to 0, the method does not belong to reinforcement learning, but is a prediction problem, namely, a binary prediction problem which corresponds to any reward when doing what action in what state; alpha is a learning factor, 0<α≤1；Q_t+1And Q_tQ values of the nodes at t +1 and t are respectively; can pass through S_tAnd a_tThe system state at the next moment is obtained,

representing the maximum Q value at which the node takes action a at t + 1.

In the formula, the report function r (i, j) represents an instant report obtained after a node directly/indirectly registers with a gateway, and specifically includes:

in the formula w_iThe link weight of the node i receiving the learning message is obtained; w is a_jThe link weight of the node j which sends the learning message; chi shape_ijIs the hop count between the node j sending the learning message and the node i receiving the learning message. The formula comprehensively considers factors such as link quality, distance and the like and reflects the return acquired by the node when the network changes.

s04: selecting the action with the highest immediate report and the state of the next moment corresponding to the action, and updating the Q value;

it should be noted that, because different application scenarios may have different resources, adjusting α may update the Q value, and the method converges. When α is 1, which is a learning rule in deterministic return, the update formula of Q value is:

the Q learning algorithm recursively learns the best Q value with the winning state and reward and selecting an action at each time t, a representing the learning rate a ∈ (0,1), Q(s) if each state pair is accessed an indefinite number of times and the learning rate a decreases to zero_t,a_t) Convergence to the optimum merit function Q(s) with probability 1_t,a_t) (ii) a The Q learning algorithm is characterized in that historical experiences are accumulated through continuous perception of the environment, and a learning subject can autonomously select an optimal action target through continuous trial and error and continuous reinforcement. The process is as follows: establishing a Q value table, and initializing the Q value Q(s) corresponding to each state-action pair_t,a_t) 0; and judging whether the Q value is converged. If the convergence is reached, the algorithm is completed, and the program is exited; if not, the method comprises the following steps: 1) observing the state s at the current moment_t(ii) a 2) Selecting action a based on current state and policy_t(ii) a 3) Performing action a_tObtaining an immediate report r(s)_t,a_t) (ii) a 4) Observing a new state s_t+1And updating the Q value table until the Q value converges.

S05: if the updated Q value does not converge, the process returns to step S03;

it should be noted that, when the updated Q value is not converged, the new state is taken as the new current time state, an action is selected again according to the current state and the policy, the action is executed to obtain an immediate response, the action with the highest immediate response rate and the state at the next time corresponding to the action are continuously selected, and the Q value is updated until the Q value is converged.

It should be noted that when the Q values of all nodes are converged, the optimal action of each node can be obtained, and since the optimal action refers to an action that the node sends information to a neighboring node and obtains the highest immediate return, the optimal neighboring node transmitted by the node in the next step can be obtained according to the optimal action, that is, the communication topology selection can be performed according to the optimal neighboring node transmitted by the node in the next step.

The present application further provides another embodiment of a topology control method for a power line communication network, as shown in fig. 2, where fig. 2 includes:

s21: calculating the signal-to-noise ratio of the current link;

s22: if the signal-to-noise ratio does not belong to the range of the preset signal-to-noise ratio window, calculating the communication success rate of the current link;

s23: judging the reliability of the link according to the communication success rate;

it should be noted that, if the signal-to-noise ratio of the current link does not belong to the range of the preset signal-to-noise ratio window, the communication success rate of the current link is calculated, and the reliability of the link is determined according to the communication success rate.

Specifically, the channel state may be defined as 3 states, and when the state is 1, the communication success rate CSR is large, the channel condition is good, and the group frame can be correctly received; when the state is 2, the CSR is small, the channel condition is poor, the networking frame can be received, but the analysis error code is large, and the data error exists. When the state is 3, it indicates that the CSR is small, the channel condition is bad, and the networking frame cannot be transmitted.

The channel state is switched between 1, 2 and 3 under the influence of various factors. According to the equivalent state, the group frame is not retransmitted only when the channel state is in the equivalent state 1. When the equivalent state is 2 or 3, the networking frame is considered to be failed to be sent, and retransmission is needed.

In a power line communication network channel environment, setting the communication success rate CSR thresholds of an uplink/downlink as lambda 1 and lambda 2 respectively, and only when the conditions of the CSR thresholds of the uplink/downlink are met, taking corresponding networking action can determine the reliability of a link.

Setting a window from 0 to the maximum signal-to-noise ratio, and only performing Q learning in a window interval to judge the reliability of the link; and if the values are lower than the window interval and higher than the window interval, the reliability of the link is directly judged according to the threshold value, and the calculated amount is reduced.

S24: if the signal-to-noise ratio belongs to the range of a preset signal-to-noise ratio window, calculating the Q value of each node in the current link;

s25: acquiring the state and the action of the node at the current moment, and observing the state of the next moment corresponding to the action at the current moment, wherein the action comprises sending information to a neighbor node;

s26: selecting the action with the highest immediate report and the state of the next moment corresponding to the action, and updating the Q value;

s27: if the updated Q value does not converge, the process returns to step S03;

s28: and acquiring a first action with optimal instant return under the condition that the Q value of the node is converged, and acquiring an optimal neighbor node corresponding to the first action.

S29: and selecting communication topology according to the optimal node corresponding to each node.

It should be noted that when the Q values of all nodes are converged, a first action that each node has the optimal immediate return can be obtained, and since the first action refers to that when the node sends information to a neighboring node, an action with the highest immediate return is obtained, an optimal neighboring node transmitted by the node in the next step can be obtained according to the optimal action, that is, communication topology selection can be performed according to the optimal neighboring node transmitted by each node in the next step.

The above is an embodiment of the method of the present application, and the present application further provides an embodiment of a topology control device for a power line communication network, as shown in fig. 3, where fig. 3 includes:

a first calculating unit 301, configured to calculate a signal-to-noise ratio of a current link;

a second calculating unit 302, configured to calculate a Q value of each node in the current link when the signal-to-noise ratio belongs to a preset signal-to-noise ratio window range;

a first obtaining unit 303, configured to obtain a current state and an action of a node, and observe a next state corresponding to the execution of the current action, where the action includes sending information to a neighboring node;

an updating unit 304, configured to select the action with the highest immediate report and a state of a next moment corresponding to the action, and update the Q value;

an iteration unit 305, configured to return to the first obtaining unit when the updated Q value is not converged;

and a topology selecting unit 306, configured to select the most reliable node for communication topology selection when the Q values of all the nodes converge.

In a specific embodiment, the method further comprises the following steps:

The topology selection unit includes:

The application also provides a topology control device of a power line communication network, which comprises a processor and a memory:

the memory is used for storing the program codes and transmitting the program codes to the processor; the processor is used for executing a power line communication network topology control method in the embodiment of the application according to instructions in the program code.

The present application also provides a computer-readable storage medium for storing program codes for executing a power line communication network topology control method in the embodiments of the present application.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

The terms "first," "second," "third," "fourth," and the like in the description of the present application and in the above-described drawings are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.

It should be understood that in the present application, "at least one" means one or more, "a plurality" means two or more. "and/or" for describing an association relationship of associated objects, indicating that there may be three relationships, e.g., "a and/or B" may indicate: only A, only B and both A and B are present, wherein A and B may be singular or plural. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship. "at least one of the following" or similar expressions refer to any combination of these items, including any combination of single item(s) or plural items. For example, at least one (one) of a, b, or c, may represent: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", wherein a, b, c may be single or plural.

In the several embodiments provided in the present application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.

The above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims

1. A topology control method for a power line communication network, comprising:

s01: calculating the signal-to-noise ratio of the current link;

s05: if the updated Q value does not converge, the process returns to step S03;

2. The power line communication network topology control method according to claim 1, further comprising:

3. The topology control method for power line communication network according to claim 1, wherein said calculating the Q value of each node in the current link comprises:

representing the maximum Q value at which the node takes action a at t + 1.

4. The method according to claim 1, wherein the selecting the action with the highest immediate report and the state at the next time corresponding to the action to update the Q value comprises:

5. the method for controlling topology of power line communication network according to claim 1, wherein said selecting the most reliable node for communication topology selection if Q values of all nodes converge comprises:

6. A power line communication network topology control apparatus, comprising:

7. The power line communication network topology control device according to claim 6, further comprising:

8. The topology control device of power line communication network according to claim 6, wherein the topology selection unit comprises:

9. A power line communication network topology control device, the device comprising a processor and a memory:

the processor is configured to execute the power line communication network topology control method according to any one of claims 1 to 5 according to instructions in the program code.

10. A computer-readable storage medium characterized in that the computer-readable storage medium is configured to store a program code for executing the power line communication network topology control method of any one of claims 1 to 5.