CN112383846A

CN112383846A - Cloud-fog elastic optical network-oriented spectrum resource allocation method for advance reservation request

Info

Publication number: CN112383846A
Application number: CN202011271498.7A
Authority: CN
Inventors: 吴利杰; 朱睿杰; 杨燚; 刘岩; 安致嫄; 舒新建; 赵凌霄
Original assignee: Zhengzhou University; State Grid Corp of China SGCC; Information and Telecommunication Branch of State Grid Henan Electric Power Co Ltd
Current assignee: Zhengzhou University; State Grid Corp of China SGCC; Information and Telecommunication Branch of State Grid Henan Electric Power Co Ltd
Priority date: 2020-11-13
Filing date: 2020-11-13
Publication date: 2021-02-19
Anticipated expiration: 2040-11-13
Also published as: CN112383846B

Abstract

The invention discloses a spectrum resource allocation method for cloud-fog elastic optical network advance reservation request, comprising the steps of: calculating k shortest candidate paths of service requests by using a shortest path algorithm; dividing time slices based on link-based spectrum resources and spectrum slots, obtain the path resource matrix according to the state of each time-spectrum unit, and obtain the time slice and the number of spectrum slots required to process the service request according to the path resource matrix; use the reinforcement learning algorithm to confirm the action of the service request being allocated, and obtain according to the action Reward; confirm whether the allocation plan is valid according to the reward, and if valid, record the allocation plan; the allocation plan includes the start time of the service request scheduling, the shortest candidate path, the time slice required to process the service request and the number of spectrum slots; traverse k in turn The shortest candidate paths are selected, and the allocation scheme that yields the greatest reward is selected. The present invention has good robustness and can maximize the utilization rate of spectrum resources.

Description

Cloud-fog elastic optical network-oriented spectrum resource allocation method for advance reservation request

Technical Field

The invention relates to the technical field of elastic optical networks and cloud-fog communication, in particular to a spectrum resource allocation method for a cloud-fog elastic optical network advance reservation request.

Background

With the rapid development of 5G communication, internet of things (IoT) and virtual reality technologies, traditional cloud computing cannot meet its needs with high latency and huge energy consumption. Edge computing is a good complement to cloud computing, being closer to the device and with lower latency, and the cooperation of cloud computing and edge computing can fuse their advantages and provide higher quality of service. Meanwhile, as the bandwidth requirements of service requests are more and more diversified, new requirements are provided for the network to have the capability of flexibly providing frequency spectrums.

Elastic Optical Networks (EONs) are the underlying networks that are expected to carry flexible requests between cloud computing and edge computing. Based on the OFDM technology, the substrate spectrum resources are cut into independent spectrum time slots, each spectrum time slot usually occupies 6.25GHz or 12.5GHz, and a plurality of spectrum time slots can be efficiently and flexibly provided for arriving requests. Therefore, the application of Elastic Optical Networks (EONs) allows cloud-edge computing and 5G technologies to better improve quality of life.

There are often many service requests for mass data migration or mass data backup between cloud-edge data centers, and these mass data migration or backup service requests do not need to be responded to immediately, and they always have a certain deadline. These service requests are completed before the expiration date, e.g., 8 am the next day. Therefore, these requests are also referred to as Advance Reservation (AR) requests. Due to the introduction of the time domain, these requests can be delayed appropriately to relieve network resource pressure and avoid network congestion. For allocating an advance reservation request, not only the spectrum domain resources but also the time domain should be considered. The request may be successfully allocated if both time resources and spectrum resources meet the requirements.

Routing and Spectrum Allocation (RSA) issues have been a hot issue in EON. Although many researches have researched the problem of large-capacity data transmission in some aspects and most of the researches propose the traditional heuristic RSA algorithm, in static RSA and dynamic RSA, the traditional heuristic RSA algorithm cannot be continuously optimized and is limited by scalability, and the technical problems that service requests cannot be reasonably distributed and processed and the blocking rate is high exist.

Disclosure of Invention

The invention provides a spectrum resource allocation method facing a cloud-fog elastic optical network advance reservation request, which solves the problem of spectrum resource allocation of cross-data center transmission services such as data backup, application data synchronization and virtual machine migration in the existing Internet of things.

S1, for a service request

K shortest candidate paths of the service request r are calculated by using a shortest path algorithm, wherein,

representing the number of services carried by the service request r, s representing the source node, d representing the destinationNode of, t_aAnd t_dRespectively representing the arrival time and the deadline of the service request r;

s2, dividing time slices and frequency spectrum slots based on frequency spectrum resources of each link, obtaining a path resource matrix corresponding to the shortest candidate path in the step S1 according to the state of each time frequency spectrum unit, and obtaining the number n of the time slices needed for processing the service request r according to the path resource matrix_tAnd the number n of spectral slots_f；

S3, the number n of time slices obtained from the step S2_tAnd the number n of spectral slots_fConfirming the action A allocated to the service request R in the path resource matrix obtained in the step S2 by using a reinforcement learning algorithm, and acquiring a corresponding reward R according to the action A;

s4, according to the reward R obtained in the step S3, whether the distribution scheme under the shortest candidate route is effective is confirmed, if yes, the distribution scheme under the shortest candidate route and the corresponding reward R are recorded, and then the step S5 is executed, and if not, the step S5 is directly executed;

the allocation scheme includes a start time t of scheduling of a service request r_sThe shortest candidate path, the number of time slices n required for processing the service request r_tAnd the number n of spectral slots_f；

S5, according to the method of steps S2-S4, traversing k shortest candidate paths in turn, and selecting the distribution scheme generating the maximum reward R as the distribution scheme of the service request R.

The invention has the beneficial effects that: for an incoming advance reservation request, the invention firstly finds k shortest candidate paths by using a shortest path method, traverses each candidate path and calculates available spectrum resources corresponding to each candidate path; different service time and the number of frequency spectrum slots can be allocated to each service request, then the optimal allocation scheme is selected by utilizing the deep neural network, meanwhile, a reward is obtained for each allocation scheme, and the optimal allocation scheme is decided according to the reward; the method has good robustness, can select a proper routing path for all the services of the advance reservation requests and allocate the optimal service time and spectrum resources for each advance reservation request, thereby maximizing the utilization rate of the spectrum resources and reducing the blocking rate and the initial time delay of the service requests.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

FIG. 1 is a schematic flow chart of the present invention.

Fig. 2 is a schematic diagram of the synthesis of the environmental state S.

Fig. 3 is a flow chart of the DQN algorithm.

Fig. 4 is a schematic diagram of a cluster.

FIG. 5 shows the time-frequency spectrum continuity TF_cA statistical representation of the parameters in (1).

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to fig. 1 to 5 in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive effort based on the embodiments of the present invention, are within the scope of the present invention.

A spectrum resource allocation method for a pre-reservation request in a cloud-cloud elastic optical network, as shown in fig. 1, includes the following steps:

s1, for a service request

table s representing the number of services carried by a service request rIndicating a source node, d indicating a destination node, t_aAnd t_dRespectively representing the arrival time and the deadline of the service request r;

the service request

For reserving the request service in advance, each shortest candidate path is composed of one or more links.

S2, dividing time slices and frequency spectrum slots based on frequency spectrum resources of each link, respectively obtaining a path resource matrix corresponding to the shortest candidate path in the step S1 according to the state of each time frequency spectrum unit, and obtaining the number n of the time slices needed for processing the service request r according to the path resource matrix_tAnd the number n of spectral slots_fThe method comprises the following steps:

s21, dividing the link on the shortest candidate path into time slices and frequency spectrum slots, establishing time frequency spectrum units based on the time slices and the frequency spectrum slots, and respectively confirming the state of each time frequency spectrum unit on the link;

the expression of the state of the time spectrum unit is as follows:

wherein S is_(t,f)Representing a time-spectral unit u_(t,f)State of (1), time spectrum unit u_(t,f)Is composed of the t-th time slice and the f-th frequency spectrum slot.

S22, confirming the link resource matrix of the link according to the state of each time spectrum unit on the link obtained in the step S21;

the expression of the link resource matrix is:

in the formula of U_lA link resource matrix representing the link/,

representing a time-spectrum unit u on a link l_(T,F)T represents the number of time slices on link l; f denotes the number of spectrum slots on link i.

S23, confirming the link resource matrix of each link on the shortest candidate path according to the methods of S21 and S22, and confirming the path resource matrix of the shortest candidate path according to the link resource matrix;

the expression of the path resource matrix is as follows:

in the formula of U_PA path resource matrix representing the shortest candidate path P, L representing all link sets comprised by the shortest candidate path P,

representing the time-spectrum unit u on all links in the shortest candidate path P_(T,F)The state of (1).

The path resource matrix represents the state of each time spectrum unit in the shortest candidate path, and the available spectrum resources in the shortest candidate path can be quickly identified according to the path resource matrix.

S24, calculating the service duration time Deltat required by the service request r according to the path resource matrix, and calculating the number n of the time slices required by the service request r according to the service duration time Deltat_tAnd the number n of spectral slots_f；

The number of time slices n_tThe calculation formula of (2) is as follows:

wherein τ represents the size of a time slice, and Δ t represents the service duration of the service request r;

the service duration Δ t is obtained by processing the service request r by respectively trying different start times and using available spectrum resources in the path resource matrix according to the following constraint conditions:

max△t＝t_d-t_a；

t_a≤t_s≤t_d；

τ≤△t≤t_d-t_s；

in the formula, t_sRepresents the starting time of the scheduling of the service request r;

the calculation formula of the service duration time Δ t is as follows:

△t＝t_e-t_s；

in the formula, t_eRepresenting the end time of the service request r scheduling;

the number n of spectrum slots_fThe calculation formula of (2) is as follows:

in the formula, F_slotRepresenting the capacity of a spectrum bin, GB representing the guard bandwidth [. ]]Indicating that the whole is taken.

In this embodiment, the capacity F of one spectrum slot_slotAt 12.5GHZ, the size of a time slice τ is one hour.

in this embodiment, the reinforcement learning algorithm is a DQN algorithm, and step S4 includes the following steps:

s31, as shown in FIG. 2, establishing a resource environment according to the path resource matrix established in step S2, and the number n of time slices required by the service request r_tAnd the number n of spectral slots_fAnd establishing a request environment corresponding to the resource environment, and synthesizing the resource environment and the request environment to obtain an environment state S.

And S32, inputting the environment state S obtained in the step S31 into the evaluate network of the DQN algorithm to obtain an action A, wherein the action A represents the position of the service request r to be distributed in the path resource matrix.

And S33, judging and calculating the reward R corresponding to the position according to the reward mechanism.

The reward mechanism of the reward R is as follows:

in the formula, SRU represents a spectrum resource utilization value, and TSAE represents a time spectrum allocation efficiency; the smaller the spectrum resource utilization value SRU, the better, indicating that more resources may be left for subsequent requests, and therefore, the smaller the SRU,

the larger, i.e. the more awards R; the larger the time-spectrum allocation efficiency TSAE, the better, indicating less spectrum fragmentation in the path resource matrix, i.e., more available resources.

The calculation formula of the frequency spectrum resource utilization value SRU is as follows:

SRU＝(t_e-t_s)×n_t×h(r)；

where h (r) represents the number of route hops from source node s to destination node d;

the calculation formula of the time spectrum allocation efficiency TSAE is as follows:

TSAE＝C_s×R_i×TF_c；

in the formula, C_sDenotes the size of the cluster, R_iIndicating resource idleness, TF_cRepresents temporal spectral continuity;

the calculation of the time spectrum allocation efficiency TSAE comprehensively considers two factors of a cluster and a resource idleness degree on the basis of the time spectrum continuity, so that the spectrum fragmentation can be reduced, and the spectrum resources are utilized to the maximum extent.

The cluster is divided into a position assigned by the service request r and a surrounding areaThe time and frequency spectrum units are connected to form a cluster with the size C_sI.e. the number of time-spectrum units in the cluster; resource idleness degree R_iRepresenting the fraction of time spectrum units in the path resource matrix that are free. As shown in fig. 4, if the allocated location of the service request is available block 1, cluster 1 is formed, and the size C of cluster 1_s64; if the service request is allocated the available block 2, cluster 2 is formed, and the size C of cluster 2_s17; since the number of time spectrum units in available block 1 and available block 2 is the same, the resource idleness R in both cases_iSame as R_i＝0.32。

The time-frequency spectrum continuity TF_cThe calculation formula of (2) is as follows:

in the formula (I), the compound is shown in the specification,

and

representing the number of available spectral blocks, num, along the time axis and the spectral axis, respectively_2uIndicating the number of two consecutive spectral units (along the time axis and the spectral axis, respectively).

Time-frequency spectrum continuity TF_cRepresents the situation of spectral fragmentation in the path resource matrix, as shown in fig. 5, the corresponding TF in fig. 5_c＝1.08。

S4, according to the reward R obtained in the step S3, whether the distribution scheme under the shortest candidate route is effective is confirmed, if yes, the distribution scheme under the shortest candidate route and the corresponding reward R are recorded, and then the step S5 is executed, and if not, the step S5 is directly executed; the allocation scheme includes a start time t of scheduling of a service request r_sThe shortest candidate path, the number of time slices n required for processing the service request r_tAnd the number n of spectral slots_f。

Whether the position allocated in step S3 is occupied can be determined according to the sign of the reward R, and if the reward R is positive, the allocation scheme is valid, and if the reward R is negative, the allocation scheme is invalid.

Preferably, after recording the distribution scheme under the shortest candidate path, the environment state S is synchronously updated according to the action a to obtain a new environment state S_{_}And the experience (S, A, R, S)_{_}) And storing the updated network parameter into an experience pool of the evaluate network, judging whether the set time for updating the network parameter is reached, if so, updating the network parameter, and if not, directly executing the step S5.

As shown in fig. 3, the DQN algorithm includes two networks, namely an evaluate network and a target network, respectively, and the evaluate network is used to calculate an estimated Q value, denoted as Q_evaluate(ii) a the target network is used for calculating an actual Q value, which is marked as Q_target. As shown in fig. 3, according to the set time for updating the network parameters, the evaluate network and the target network extract part of experience (S, a, R, S) from the experience pool at intervals_{_}) The evaluate network obtains Q according to the environment state S_evaluate(S, A), the target network according to the new environment state S_{_}To obtain Q_target(S_{_},A_{_}) Then calculating a loss function from the two Q values, wherein A_{_}Indicating a new environmental state according to S_-The estimated new action.

The loss function is Q_evaluate(S, A) and Q_target(S_{_},A_{_}) The specific formula of the mean square error L is as follows:

L＝E((Q_target(S_{_},A_{_})-Q_evaluate(S,A))²)；

the evaluate network updates the network parameters by adopting a gradient descent method, and the target network copies the updated parameters of the evaluate network, which is the prior art and is not described in detail in this embodiment.

And S5, traversing the k shortest candidate paths in sequence according to the method of the steps S2-S4, and then using the allocation scheme of the maximum reward R generated by the elastic optical network as the spectrum resource allocation scheme of the service request R.

The invention firstly establishes a two-dimensional resource model of frequency domain and time domain facing the service of the advance reservation request, carries out interaction with the environment through reinforcement learning, scores the frequency spectrum resource allocation scheme to optimize the allocation of frequency spectrum resources, and then updates the state of the corresponding time frequency spectrum unit according to the determined allocation scheme to prepare for the arrival of the next service request.

Since Deep Reinforcement Learning (DRL) shows the potential for successful Learning strategies for combinatorial and distributed problems, the present invention relies on obtaining feedback and rewards from the environment, and the DQN algorithm can learn the optimization strategy step by step, and is therefore well suited for decision-making problems. In the research of the static Spectrum Allocation strategy of the advance reservation request service, the optimal solution of the computing Resource of the Integer Linear Programming (ILP), the DRDA method and three traditional heuristic algorithms are compared, the performance of the DRDA in the static RSA problem is tested, and the simulation result shows that the performance of the DRDA method is very close to the optimal solution of the Resource computed by the ILP. In the research of dynamic Spectrum Allocation strategy, the invention provides a Time Spectrum Allocation Efficiency (TSAE) measurement standard for measuring the available resource state in an elastic optical network, a DQN algorithm Allocation scheme is adopted for scoring, simulation test and large-scale network experiment are adopted for comparing DRDA with three traditional heuristic algorithms from three aspects of average TSAE, request blocking rate and average initial delay, and the result shows that the DRDA method has good robustness, and compared with other three heuristic algorithms, the DRDA method keeps lower initial delay while obtaining the lowest blocking rate.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A spectrum resource allocation method for cloud-fog elastic optical network reservation request in advance, is characterized in that, comprises the steps:

S1, for a service request

Use the shortest path algorithm to calculate the k shortest candidate paths of the service request r, where,

Represents the number of services carried by the service request r, s represents the source node, d represents the destination node, t _a and t _d represent the arrival time and deadline of the service request r, respectively;

S2: Divide time slices and spectrum slots based on the spectrum resources of each link, obtain the path resource matrix corresponding to the shortest candidate path in step S1 according to the state of each time spectrum unit, and obtain the processing service request r according to the path resource matrix the required number of time slices n _t and the number of spectral bins n _f ;

S3, according to the number of time slices n _t and the number of spectrum slots n _f obtained in step S2, use the reinforcement learning algorithm to confirm the action A that the service request r is allocated in the path resource matrix obtained in step S2, and obtain the corresponding reward according to the action A R;

S4, confirm whether the allocation scheme under the shortest candidate path is valid according to the reward R obtained in step S3, if valid, record the allocation scheme under the shortest candidate path and the corresponding reward R and then execute step S5, if invalid, execute directly Step S5;

The allocation scheme includes the scheduled start time _ts of the service request r, the shortest candidate path, the number of time slices _nt required to process the service request r, and the number of frequency spectrum slots _nf ;

S5, according to the method of steps S2-S4, traverse the k shortest candidate paths in sequence, and select the allocation scheme that generates the maximum reward R as the allocation scheme of the service request r.

2. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 1, wherein the step S2 comprises the following steps:

S21, the link on the shortest candidate path is divided into time slices and spectrum slots, and time-spectrum units are established based on the time slices and spectrum slots, and the status of each time-spectrum unit on the link is confirmed respectively;

S22, confirming the link resource matrix of the link according to the state of each time spectrum unit on the link obtained in step S21;

S23, confirm the link resource matrix of each link on the shortest candidate path according to the method of step S21 and step S22 respectively, confirm the path resource matrix of this shortest candidate path according to the link resource matrix;

S24: Calculate the service duration Δt required by the service request r according to the path resource matrix, and calculate the number of time slices _nt and the number of spectrum slots nf required by the service request _r according to the service duration Δt.

3. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 2, is characterized in that, in step S24, the calculation formula of described time slice quantity n _t is:

In the formula, τ represents the size of a time slice, and Δt represents the service duration of the service request r;

The calculation formula of the number of spectrum slots n _f is:

In the formula, F _slot represents the capacity of a spectrum slot, GB represents the guard bandwidth, and [*] represents the rounding up.

4. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 2 or 3, characterized in that, the service duration Δt is based on the following constraints, and by trying different The method for processing the service request r at the start time is obtained, and the constraint conditions are:

maxΔt=t _d −t _a ;

t _a ≤t _s ≤t _d ;

τ≤Δt≤t _d -t _s ;

In the formula, _ts represents the start time of the service request r scheduling, and τ represents the size of a time slice;

The calculation formula of the service duration Δt is:

Δt=t _e −t _s ;

In the formula, t _e represents the end time of the service request r scheduling.

5. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 1, wherein the step S3 comprises the following steps:

S31, establish a resource environment according to the path resource matrix established in step S2, establish a request environment corresponding to the resource environment according to the number of time slices n _t and the number of frequency spectrum slots n _f required by the service request r, and perform the resource environment and the request environment. Synthesized to obtain the environmental state S;

S32, the environmental state S obtained in step S31 is input into the evaluate network of the reinforcement learning algorithm to obtain action A, and the action A represents the position where the service request r will be allocated in the path resource matrix;

S33, judge and calculate the reward R corresponding to the position according to the reward mechanism.

6. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 5, characterized in that, in step S33, the reward mechanism is:

In the formula, SRU represents the spectrum resource utilization value, and TSAE represents the time spectrum allocation efficiency.

7. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 6, wherein the calculation formula of the spectrum resource utilization value SRU is:

SRU=(t _e -t _s )×n _t ×h(r);

In the formula, h(r) represents the number of routing hops from the source node s to the destination node d, and _ts and _te represent the start time and end time of the service request r scheduling, respectively.

8. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 6, wherein the calculation formula of the time spectrum allocation efficiency TSAE is:

TSAE=C _s ×R _i ×TF _c ;

In the formula, C _s represents the size of the cluster, R _i represents the resource idleness, and TF _c represents the time-spectrum continuity.

9. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 8, wherein the calculation formula of the time spectrum continuity TF _c is:

In the formula,

and

represents the number of available spectrum blocks along the time axis and spectrum axis, respectively, and num _2u represents the number of two consecutive spectrum units.

10. The spectrum resource allocation method for cloud-fog elastic optical network advance reservation request according to claim 1, wherein in step S4, the allocation scheme and corresponding reward under the shortest candidate path are recorded After R, update the environment state S synchronously according to the action A, obtain the new environment state S _{_} , store the experience (S, A, R, S _{_} ) in the experience pool, and then judge whether the set time for updating network parameters is reached , if yes, use the gradient descent method to update the network parameters, otherwise, go to step S5 directly.