CN114629547B - High-throughput beam hopping scheduling method for differentiated services - Google Patents

Publication number
CN114629547B
Authority
CN
China
Prior art keywords
cluster
satellite
clusters
wave
cell
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210273871.5A
Other languages
Chinese (zh)
Other versions
CN114629547A (en)
Inventor
白卫岗
刘聪俐
李建东
史琰
周笛
李浩然
朱彦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University
Priority to CN202210273871.5A
Publication of CN114629547A
Application granted
Publication of CN114629547B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04B TRANSMISSION
    • H04B7/00 Radio transmission systems, i.e. using radiation field
    • H04B7/14 Relay systems
    • H04B7/15 Active relay systems
    • H04B7/185 Space-based or airborne stations; Stations for satellite systems
    • H04B7/1853 Satellite systems for providing telephony service to a mobile station, i.e. mobile satellite service
    • H04B7/18539 Arrangements for managing radio, resources, i.e. for establishing or releasing a connection
    • H04B7/18543 Arrangements for managing radio, resources, i.e. for establishing or releasing a connection for adaptation of transmission parameters, e.g. power control
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04B TRANSMISSION
    • H04B7/00 Radio transmission systems, i.e. using radiation field
    • H04B7/14 Relay systems
    • H04B7/15 Active relay systems
    • H04B7/185 Space-based or airborne stations; Stations for satellite systems
    • H04B7/1853 Satellite systems for providing telephony service to a mobile station, i.e. mobile satellite service
    • H04B7/18545 Arrangements for managing station mobility, i.e. for station registration or localisation
    • H04B7/18547 Arrangements for managing station mobility, i.e. for station registration or localisation for geolocalisation of a station
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04B TRANSMISSION
    • H04B7/00 Radio transmission systems, i.e. using radiation field
    • H04B7/14 Relay systems
    • H04B7/15 Active relay systems
    • H04B7/185 Space-based or airborne stations; Stations for satellite systems
    • H04B7/1853 Satellite systems for providing telephony service to a mobile station, i.e. mobile satellite service
    • H04B7/18569 Arrangements for system physical machines management, i.e. for construction operations control, administration, maintenance
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W24/00 Supervisory, monitoring or testing arrangements
    • H04W24/02 Arrangements for optimising operational condition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W24/00 Supervisory, monitoring or testing arrangements
    • H04W24/06 Testing, supervising or monitoring using simulated traffic
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W28/00 Network traffic management; Network resource management
    • H04W28/02 Traffic management, e.g. flow control or congestion control
    • H04W28/08 Load balancing or load distribution
    • H04W28/09 Management thereof
    • H04W28/0958 Management thereof based on metrics or performance parameters
    • H04W28/0967 Quality of Service [QoS] parameters
    • H04W28/0975 Quality of Service [QoS] parameters for reducing delays

Abstract

The invention provides a high-throughput beam hopping scheduling method for differentiated services, which mainly addresses the low utilization of on-board satellite resources and the high computational complexity of the prior art. The scheme is implemented as follows: divide the wave position cells within the satellite coverage area into different clusters; establish an inter-cluster load-balancing clustering model according to the principles of load balance among clusters and geographical proximity of the wave position cells within a cluster; solve the clustering model with an immune algorithm to obtain a clustering result; determine a delay tolerance, measured in beam hopping time slots, for each service type according to its delay requirement; establish a beam hopping scheduling model based on the delay tolerance; and set up a dynamic beam scheduling model in each cluster, treat it as a Markov decision process, and solve it with a deep reinforcement learning algorithm to obtain the beam scheduling result. The invention improves resource utilization and reduces computational complexity while guaranteeing throughput, and can be used for satellite resource allocation.

Description

High-throughput beam hopping scheduling method for differentiated services
Technical Field
The invention relates to the field of satellite communication, and in particular to a clustered beam scheduling method for beam hopping satellites. It can be used to allocate satellite resources reasonably to cells with different service requirements in scenarios where service demand within the satellite coverage area changes rapidly.
Background
Early single-beam satellites mostly used global or regional beams, which are wide and therefore have low antenna gain. To cope with rapidly growing traffic demand, multi-beam technology based on spot beams was adopted. A spot beam is narrow and therefore has high gain; combined with frequency reuse, this markedly increases satellite system capacity and allows more traffic to be served. However, with the rapid development of communications and the Internet of Things, terrestrial traffic has become unevenly distributed in space and time, and the high dynamics of low-orbit satellites make the traffic they face even more uneven. Beam hopping technology was proposed to address this problem, improve resource utilization, and avoid as far as possible the situation in which some areas are oversupplied while others are starved, thereby further increasing system capacity. In terms of resource allocation, however, existing beam hopping techniques are usually designed around the non-uniformity of ground traffic demand and focus on finding the optimal throughput to meet the capacity demand of different areas, without considering factors such as service type and delay. In practice, different users have different service delay requirements, so throughput and service delay must be considered together to guarantee user experience. In terms of clustering, uniform clustering, uniform power allocation and full frequency reuse are usually adopted to simplify the system and improve resource utilization, which means the power allocated to each cluster is equal and cannot be adjusted. When inter-cluster resources are identical, uniform clustering that ignores the non-uniformity of traffic among clusters can leave some clusters overloaded and others underloaded.
The patent document "Hopping pattern optimization method and device based on time slot allocation algorithm, and storage medium" (patent application No. 201910675600.0, application publication No. CN 110518956A), filed by the Army Engineering University of the PLA, discloses an improved beam hopping time slot allocation method for a clustered scenario. The method first pre-allocates a number of time slots to each cell and then reallocates them using a co-channel interference distance threshold, which increases system capacity while effectively eliminating the effect of interference on signal quality. However, the method does not consider service delay performance, so real-time services may fail because they wait too long.
A patent document filed by Shanghai Yuanxin Satellite Technology Co., Ltd. (patent application No. 201811070246.0, application publication No. CN 109121147A) discloses a resource scheduling method based on beam hopping. The method represents the satellite coverage area with two three-dimensional matrices. The first is the actual user demand matrix, made three-dimensional by adding a time dimension; the second is the beam hopping service matrix, to which a time dimension is likewise added. The target matrix to be optimized is obtained by multiplying the two matrices and is then solved. The method addresses the matching of satellite capacity to ground demand in the fast-moving scenario of low-orbit satellites. However, as the number of satellite beams and wave position cells grows, the search space of an optimization algorithm that schedules globally increases sharply, and so does the complexity of the algorithm.
Disclosure of Invention
The invention aims to overcome the defects of the prior art by providing a high-throughput beam hopping scheduling method for differentiated services that improves resource utilization and reduces computational complexity while guaranteeing throughput.
The specific idea for achieving this purpose is as follows: a load-balancing clustering model solved by an immune algorithm divides the wave position cells within the satellite coverage area into different clusters, which reduces computational complexity; beam resource scheduling is then completed by constructing a beam resource scheduling model subject to delay tolerance constraints and solving it with a deep reinforcement learning method based on priority playback.
According to this idea, the technical scheme of the invention comprises the following steps:
(1) Generate the clustering result within the satellite coverage area:
1a) Divide the wave position cells within the satellite coverage area into different clusters;
1b) Establish an inter-cluster load-balancing clustering model according to the principles of load balance among clusters and geographical proximity of the wave position cells within a cluster:
[Equation image BDA0003555023570000021: objective function P]
s.t. C_1: d_mn ≤ s, m ∈ M_i and n ∈ M_i
[Equation image BDA0003555023570000022: constraint C_2]
where P is the objective function that minimizes the mean square error of the load among clusters; K is the number of clusters; N is the number of wave position cells; R_j is the traffic demand of cell j; X_ij indicates whether wave position j belongs to cluster i; R is the load mean of the K clusters; C_1 is the distance constraint; C_2 is the constraint that ensures each wave position cell belongs to exactly one cluster; m and n denote two wave position cells in the same cluster; d_mn is the distance between the wave position centers of cells m and n; s is the upper limit of the distance between two wave position centers within a cluster; and M_i is the set of wave position cells belonging to cluster i;
1c) Solve the inter-cluster load-balancing clustering model with an immune algorithm to obtain the clustering result;
(2) Establish the intra-cluster dynamic beam scheduling model:
2a) Determine a delay tolerance, measured in beam hopping time slots, for each service type according to its delay requirement;
2b) Let data packets of different types wait in the on-board buffer queue for downlink transmission: each time a hopping time slot passes, the corresponding delay tolerance is reduced by one, and a packet whose on-board waiting delay exceeds its delay tolerance is discarded and counted as a timeout failure;
2c) Establish a delay-tolerance-based beam hopping scheduling model according to the principles of maximizing the service guarantee rate and minimizing the service timeout failure rate:
[Equation images BDA0003555023570000031 to BDA0003555023570000034: objectives P_1 and P_2 with constraints C_1 and C_2]
where P_1 is the objective of maximizing the service guarantee rate of intra-cluster services;
P_2 is the objective of minimizing the intra-cluster service timeout failure rate;
T is the set of all decision moments of the beam hopping satellite within the coverage time of the same area;
N is the total number of wave position cells in the cluster;
the quantity in image BDA0003555023570000035 is the amount of packets sent to cell n after the end of time slot t_j;
the quantity in image BDA0003555023570000036 is the total amount of packets destined for cell n that the satellite has received up to time slot t_j;
the quantity in image BDA0003555023570000037 is the amount of packets destined for wave position cell n that have failed because they waited too long;
C_1 ensures that exactly one cell in the cluster is scheduled a beam in each hopping time slot;
C_2 ensures that the data packets held on the satellite at the current time do not exceed the maximum limit;
the quantity in image BDA0003555023570000038 indicates whether wave position cell n is illuminated by the working beam in time slot t_j, the value in image BDA0003555023570000039 meaning that it is illuminated and any other value that it is not;
the quantity in image BDA00035550235700000310 is the number of data packets stored in the satellite memory for wave position cell n after the end of time slot t_j, and L is the maximum capacity of the buffer queue of each wave position cell on the satellite;
(3) Establish the dynamic beam scheduling model of step (2) in each cluster obtained in step (1), treat each scheduling problem as a Markov decision process, and solve it by deep reinforcement learning to obtain the beam scheduling result.
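Because the equation images are not reproduced in this text, the two models of steps (1) and (2) can only be sketched from the symbol definitions. The block below is one plausible reading, not the patent's own formulas: the 1/K normalization, the bar on the load mean, the ratio form of P_1 and P_2, and the symbols D, A, F, x and Q (packets sent, packets received, packets timed out, illumination indicator, queue length) are all assumptions introduced for illustration.

```latex
% Hedged reconstruction from the textual definitions; notation is assumed.
\begin{align*}
\min\ & P = \frac{1}{K}\sum_{i=1}^{K}\Bigl(\sum_{j=1}^{N} R_j X_{ij} - \bar{R}\Bigr)^{2},
  \qquad \bar{R} = \frac{1}{K}\sum_{j=1}^{N} R_j \\
\text{s.t.}\ & C_1:\ d_{mn} \le s,\quad m \in M_i \text{ and } n \in M_i \\
 & C_2:\ \sum_{i=1}^{K} X_{ij} = 1,\quad j = 1,\dots,N
\end{align*}

\begin{align*}
\max\ & P_1 = \frac{\sum_{t_j \in T}\sum_{n=1}^{N} D_n^{t_j}}{\sum_{n=1}^{N} A_n} \\
\min\ & P_2 = \frac{\sum_{t_j \in T}\sum_{n=1}^{N} F_n^{t_j}}{\sum_{n=1}^{N} A_n} \\
\text{s.t.}\ & C_1:\ \sum_{n=1}^{N} x_n^{t_j} = 1,\quad x_n^{t_j} \in \{0,1\},\ \forall t_j \in T \\
 & C_2:\ Q_n^{t_j} \le L,\quad \forall n,\ \forall t_j \in T
\end{align*}
```

Here A_n stands for the total number of packets received for cell n over the whole horizon T; other normalizations are equally compatible with the text.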
Compared with the prior art, the invention has the following advantages:
First, computational complexity is reduced. For a satellite system with uniform power allocation and full frequency reuse, the search space of an optimization algorithm that schedules globally grows sharply as the number of satellite beams and wave position cells increases, and the algorithm complexity rises with it. The load-balancing clustering model established by the invention splits one complex task into several subtasks, which shrinks the search space and reduces computational complexity.
Second, the system service guarantee rate is improved. Unlike existing beam hopping scheduling algorithms, the invention takes the differences among services into account during scheduling and provides a beam scheduling model based on delay tolerance. In operation, an optimization problem is established with maximization of the service guarantee rate and minimization of the service timeout failure rate as objective functions, so that system throughput increases, the service guarantee rate improves, and the service timeout failure rate falls.
Description of the drawings:
FIG. 1 is a general flow chart of an implementation of the present invention;
FIG. 2 is a sub-flowchart for solving a load balancing clustering model using an immune algorithm in accordance with the present invention;
FIG. 3 is the wave position cell layout of the present invention;
FIG. 4 is a graph of the clustering results in the present invention;
FIG. 5 is a state reconstruction diagram in the present invention;
FIG. 6 is a graph comparing the deep reinforcement learning convergence rate of the present invention with that of the existing global beam hopping algorithm;
FIG. 7 is a graph comparing the normalized throughput of the present invention with that of different existing beam hopping algorithms;
FIG. 8 is a graph comparing the service coverage of the present invention with that of different existing beam hopping algorithms.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with examples are described in further detail below.
The present example includes two major parts: the first part is to use an immune algorithm to generate a clustering result of load balance, and the second part is to use a deep reinforcement learning algorithm to obtain a beam dynamic scheduling result.
Referring to fig. 1, the implementation steps of this example are as follows:
Part 1: use an immune algorithm to produce a load-balanced clustering result.
Step 1. Divide the wave position cells within the satellite coverage area into different clusters.
Initialize the service request volumes and geographic positions of all cells in the coverage area of the beam hopping satellite, including the relative geographic positions of the 19 wave position cells in the coverage area, the service request volume of each cell, and the wave position cell numbers, as shown in FIG. 3.
Determine the number of clusters from the number K of working beams of a single satellite, so that working beams and clusters correspond one to one and each working beam is responsible for dynamic beam scheduling within one cluster.
In this embodiment the satellite has 3 working beams, so 3 clusters are formed according to this one-to-one correspondence.
Step 2. Establish an inter-cluster load-balancing clustering model according to the principles of load balance among clusters and geographical proximity of the wave position cells within a cluster.
2.1) Determine the service request volume of each wave position cell and compute the sum S of the service request volumes of all cells in the satellite coverage area;
2.2) Compute the load mean of the K clusters
[Equation image BDA0003555023570000051: load mean R]
and establish the load-balancing optimization objective:
[Equation image BDA0003555023570000052: objective function P]
where P is the objective function that minimizes the mean square error of the load among clusters, K is the number of clusters, N is the number of wave position cells, R_j is the traffic demand of cell j, X_ij indicates whether wave position j belongs to cluster i, and R is the load mean of the K clusters;
2.3) Determine the upper limit s of the distance between two wave position centers within a cluster and establish the distance constraint:
d_mn ≤ s, m ∈ M_i and n ∈ M_i
where m and n denote two wave position cells in the same cluster, d_mn is the distance between the wave position centers of cells m and n, and M_i is the set of wave position cells belonging to cluster i.
2.4) Combine the load-balancing optimization objective in 2.2) with the distance constraint in 2.3) to obtain the inter-cluster load-balancing clustering model:
[Equation image BDA0003555023570000053: clustering model with objective P and constraints C_1, C_2]
where P is the objective function that minimizes the mean square error of the load among clusters; K is the number of clusters; N is the number of wave position cells; R_j is the traffic demand of cell j; X_ij indicates whether wave position j belongs to cluster i; R is the load mean of the K clusters; C_1 is the distance constraint; C_2 is the constraint that ensures each wave position cell belongs to exactly one cluster; m and n denote two wave position cells in the same cluster; d_mn is the distance between the wave position centers of cells m and n; s is the upper limit of the distance between two wave position centers within a cluster; and M_i is the set of wave position cells belonging to cluster i.
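As an illustration of how a candidate clustering could be scored against this model, the following minimal sketch evaluates the load-balance objective and counts violations of the distance constraint. It assumes the objective is the mean of squared deviations of cluster loads from the load mean S / K (the exact normalization is not recoverable from the text), and the function names, the cell coordinates and the demand values are hypothetical. Constraint C_2 holds by construction because each cell carries exactly one cluster label.

```python
from itertools import combinations
from math import dist


def cluster_loads(demand, assign, k):
    """Sum the traffic demand R_j of the cells assigned to each of the k clusters."""
    loads = [0.0] * k
    for cell, cluster in enumerate(assign):
        loads[cluster] += demand[cell]
    return loads


def load_balance_objective(demand, assign, k):
    """Assumed form of P: mean squared deviation of cluster loads from S / K."""
    loads = cluster_loads(demand, assign, k)
    mean = sum(demand) / k
    return sum((load - mean) ** 2 for load in loads) / k


def distance_violations(centers, assign, s):
    """Count cells involved in at least one same-cluster pair with d_mn > s (C_1)."""
    bad = set()
    for m, n in combinations(range(len(assign)), 2):
        if assign[m] == assign[n] and dist(centers[m], centers[n]) > s:
            bad.update((m, n))
    return len(bad)


# Hypothetical example: 6 cells on a line, 3 clusters.
demand = [4, 2, 6, 3, 5, 4]
centers = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (10, 0)]
assign = [0, 0, 1, 1, 2, 2]
print(load_balance_objective(demand, assign, 3))  # 2.0
print(distance_violations(centers, assign, s=2))  # 2
```

The violation count plays the role of Y in the penalty term used by the immune algorithm in Step 3.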
Step 3. Solve the inter-cluster load-balancing clustering model with an immune algorithm to obtain the clustering result.
Referring to FIG. 2, this step is implemented as follows:
3.1) Initialize the population of cluster-center wave positions and the memory bank: according to the number of wave position cells and the number of clusters, set the number of iterations N_e, the population size S, the memory bank capacity O, the crossover probability P_c, the mutation probability P_m and the diversity evaluation parameter P_s, and set the current iteration count n = 0; randomly generate an initial population of M cluster-center antibodies, where M is 35 in this embodiment;
3.2) Design the affinity function for the clustering model:
[Equation image BDA0003555023570000061: affinity function A_v]
where A_v is the affinity function, P is the objective function that minimizes the mean square error of the load among clusters, C is the penalty constant applied to solutions that do not satisfy the distance requirement, and Y is the number of wave positions that do not satisfy the distance constraint C_1;
3.3) Compute the fitness value of every individual, that is, the value of the affinity function A_v defined in 3.2);
3.4) Compute the reproduction rate and antibody concentration of every individual:
3.4.1) Compute the affinity between two antibodies:
[Equation image BDA0003555023570000062: antibody-to-antibody affinity S_v,s]
where S_v,s is the affinity between antibodies v and s, k_v,s is the number of identical bits in antibody s and antibody v, and L is the length of an antibody;
3.4.2) Compute the antibody concentration from the result of 3.4.1):
[Equation images BDA0003555023570000063 and BDA0003555023570000064: antibody concentration C_v]
where C_v is the concentration of antibody v, N is the total number of antibodies, and T is a preset threshold;
3.4.3) Compute the reproduction rate from the fitness function and the antibody concentration:
[Equation image BDA0003555023570000065: reproduction rate P]
where P is the reproduction rate, α is a constant, A_v is the fitness function value, and C_v is the antibody concentration;
3.5) According to the reproduction rate, add individuals with high fitness and low antibody concentration to the memory bank, and take the first S individuals to form the parent population;
3.6) Use roulette wheel selection to choose individuals for crossover and mutation, obtaining a new population, then take some individuals out of the memory bank and merge them with it to form the next-generation population;
3.7) Check whether the maximum number of iterations N_e has been reached:
if so, output the optimal clustering result A;
otherwise, repeat 3.3) to 3.6) until the number of iterations reaches N_e, then output the optimal clustering result A.
The clustering result obtained in this embodiment is shown in FIG. 4, where wave position cells of the same color belong to the same cluster.
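The quantities used in steps 3.2) to 3.5) can be sketched as below. Since the equation images are missing, every formula here is an assumption chosen to match the verbal definitions: affinity decreases with the objective P and the violation count Y, antibody similarity is the share of identical positions k_v,s / L, concentration is the share of antibodies whose similarity exceeds T, and the reproduction score rewards high affinity and penalizes high concentration. The antibody encoding (a list of cluster-center cell numbers) and all function names are hypothetical.

```python
def affinity(p, y, penalty=100.0):
    """Assumed A_v: larger when the load objective P and violation count Y are small."""
    return 1.0 / (1.0 + p + penalty * y)


def similarity(v, s):
    """Share of positions where antibodies v and s are identical (k_vs / L)."""
    return sum(a == b for a, b in zip(v, s)) / len(v)


def concentrations(population, threshold):
    """For each antibody, the share of the population more similar to it than T."""
    n = len(population)
    return [
        sum(similarity(v, s) > threshold for s in population) / n
        for v in population
    ]


def reproduction_scores(affinities, concs, alpha=0.7):
    """Assumed ranking score: reward relative affinity, penalize relative concentration."""
    total_a, total_c = sum(affinities), sum(concs)
    return [
        alpha * a / total_a - (1 - alpha) * c / total_c
        for a, c in zip(affinities, concs)
    ]


def select_parents(population, scores, size):
    """Keep the `size` antibodies with the highest reproduction score."""
    order = sorted(range(len(population)), key=lambda i: scores[i], reverse=True)
    return [population[i] for i in order[:size]]


# Hypothetical population: each antibody lists the center cell of 3 clusters.
population = [[0, 5, 9], [0, 5, 9], [0, 5, 12], [3, 7, 14]]
concs = concentrations(population, threshold=0.6)
scores = reproduction_scores([0.5, 0.5, 0.5, 0.5], concs)
print(concs)                                      # [0.75, 0.75, 0.75, 0.25]
print(select_parents(population, scores, 2)[0])   # [3, 7, 14]
```

With equal affinities, the antibody that differs most from the rest of the population is ranked first, which is the diversity-preserving behavior the concentration term is meant to provide.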
Part 2: use a deep reinforcement learning algorithm to obtain the dynamic beam scheduling result.
Step 4. Determine a delay tolerance, measured in beam hopping time slots, for each service type according to its delay requirement.
4.1) Compute the delay a data packet has already experienced in transmission before reaching the destination satellite:
T_delay_1 = T_prop + T_trans
where T_delay_1 is the total delay experienced before reaching the destination satellite, T_prop is the propagation delay, and T_trans is the transmission delay;
4.2) Estimate the transmission delay T_delay_2 of the data packet from the destination satellite to the user terminal;
4.3) Determine the delay limit T_limit in the QoS guarantee of the packet's service type, determine the beam hopping time slot length BH_slot, and compute the residual delay tolerance of the packet:
[Equation image BDA0003555023570000071: residual delay tolerance D_tole]
where D_tole is the residual delay tolerance and T_delay_1 is the total delay experienced in transmission before reaching the destination satellite. In this embodiment, 3 service types are set, with residual delay tolerances of 2, 4 and 20, respectively.
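A small sketch of step 4.3) follows. The formula image is missing, so the computation below is an assumption inferred from the listed quantities: the delay budget left after subtracting T_delay_1 and T_delay_2 from T_limit, expressed in whole hopping slots of length BH_slot and rounded down. The function name and the millisecond values are hypothetical.

```python
import math


def residual_delay_tolerance(t_limit, t_delay_1, t_delay_2, bh_slot):
    """Assumed D_tole: remaining delay budget in whole hopping slots, never negative."""
    remaining = t_limit - t_delay_1 - t_delay_2
    return max(0, math.floor(remaining / bh_slot))


# Hypothetical values in milliseconds.
t_prop, t_trans = 20, 5
t_delay_1 = t_prop + t_trans          # delay already experienced
print(residual_delay_tolerance(t_limit=100, t_delay_1=t_delay_1,
                               t_delay_2=15, bh_slot=10))  # 6
```

Rounding down is the conservative choice: a packet is never credited with a slot it could not fully wait through.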
Step 5. Decrease the residual delay tolerance as time passes.
Data packets of different types wait in the on-board buffer queue for downlink transmission. Each time a hopping time slot passes, the corresponding delay tolerance is reduced by one; when the on-board waiting delay of a packet exceeds its tolerance, the packet is discarded and counted as a timeout failure.
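The aging rule of this step can be sketched with a per-cell queue that maps residual delay tolerance to packet count. The representation and the function name are hypothetical, and the boundary case is an assumption: here a packet whose tolerance reaches zero after the decrement is treated as expired.

```python
from collections import Counter


def advance_one_slot(queue):
    """Age every waiting packet by one hopping slot.

    queue maps residual delay tolerance (in slots) to the number of packets
    with that tolerance. Returns the aged queue and the number of packets
    dropped as timeout failures.
    """
    aged = Counter()
    dropped = 0
    for tolerance, count in queue.items():
        remaining = tolerance - 1
        if remaining <= 0:          # assumed boundary: zero means expired
            dropped += count
        else:
            aged[remaining] += count
    return aged, dropped


# Hypothetical queue: 2 packets with tolerance 1, 3 with tolerance 2, 1 with tolerance 5.
queue = Counter({1: 2, 2: 3, 5: 1})
aged, dropped = advance_one_slot(queue)
print(dict(aged), dropped)  # {1: 3, 4: 1} 2
```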
Step 6. Establish a delay-tolerance-based beam hopping scheduling model according to the principles of maximizing the service guarantee rate and minimizing the service timeout failure rate.
6.1) Characterize the intra-cluster beam hopping system scenario:
Let one working beam serve a cluster of N wave position cells. The service request volume of each cell is expressed in data packets and follows a Poisson distribution with arrival rate λ_i, i = 1, 2, ..., N;
the amount of data stored in the satellite memory for each wave position cell is represented as
[Equation image BDA0003555023570000081]
where the quantity in image BDA0003555023570000082 is the number of data packets stored in the satellite memory for the n-th wave position cell after the end of time slot t_j;
6.2) According to the packet arrivals of each cell, establish the optimization objective functions that maximize the service guarantee rate and minimize the service timeout failure rate:
[Equation images BDA0003555023570000083 and BDA0003555023570000084: objectives P_1 and P_2]
where P_1 is the objective of maximizing the service guarantee rate of intra-cluster services; P_2 is the objective of minimizing the intra-cluster service timeout failure rate; T is the set of all decision moments of the beam hopping satellite within the coverage time of the same area; N is the total number of wave position cells in the cluster; the quantity in image BDA0003555023570000085 is the amount of packets sent to cell n after the end of time slot t_j; the quantity in image BDA0003555023570000086 is the total amount of packets destined for cell n that the satellite has received up to time slot t_j; and the quantity in image BDA0003555023570000087 is the amount of packets destined for wave position cell n that have failed because they waited too long;
6.3) According to the one-to-one correspondence between working beams and clusters, establish the beam constraint:
[Equation image BDA0003555023570000088: beam constraint]
where the quantity in image BDA0003555023570000089 indicates whether wave position cell n is illuminated by the working beam in time slot t_j, the value in image BDA00035550235700000810 meaning that it is illuminated and any other value that it is not;
6.4) Determine the maximum capacity L of the buffer queue of each wave position cell on the satellite and establish the buffer constraint:
[Equation image BDA00035550235700000811: buffer constraint]
where the quantity in image BDA00035550235700000812 is the number of data packets stored in the satellite memory for wave position cell n after the end of time slot t_j, and L is the maximum capacity of the buffer queue of each wave position cell on the satellite;
6.5) Combine the objective functions of 6.2) with the beam constraint of 6.3) and the buffer constraint of 6.4) to obtain the delay-tolerance-based beam hopping scheduling model:
[Equation images BDA0003555023570000091 to BDA0003555023570000094: objectives P_1 and P_2 with constraints C_1 and C_2]
where C_1 is the beam constraint, which ensures that exactly one cell in the cluster is scheduled a beam in each hopping time slot, and C_2 is the buffer constraint, which ensures that the data packets held on the satellite at the current time do not exceed the maximum limit.
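To make the two objectives and two constraints concrete, the sketch below computes them from per-slot logs of a single cluster. It assumes the service guarantee rate is the ratio of packets delivered to packets received and the timeout failure rate is the ratio of packets timed out to packets received; the patent's exact expressions are in the missing images, and all names and sample numbers are hypothetical.

```python
def service_guarantee_rate(sent, received):
    """Assumed P_1: packets delivered over all slots and cells / packets received."""
    return sum(map(sum, sent)) / sum(received)


def timeout_failure_rate(failed, received):
    """Assumed P_2: packets dropped on timeout over all slots and cells / packets received."""
    return sum(map(sum, failed)) / sum(received)


def satisfies_constraints(illumination, queue_lengths, capacity):
    """C_1: exactly one cell illuminated in the slot; C_2: no queue exceeds L."""
    one_beam = sum(illumination) == 1 and all(x in (0, 1) for x in illumination)
    within_buffer = all(q <= capacity for q in queue_lengths)
    return one_beam and within_buffer


# Hypothetical log: 2 slots, 3 cells; rows are slots, columns are cells.
sent = [[3, 0, 0], [0, 2, 0]]
failed = [[0, 1, 0], [0, 0, 1]]
received = [4, 4, 2]                 # total packets received per cell
print(service_guarantee_rate(sent, received))   # 0.5
print(timeout_failure_rate(failed, received))   # 0.2
print(satisfies_constraints([0, 1, 0], [1, 2, 1], capacity=5))  # True
```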
Step 7, in each cluster of the first part, establishing the beam scheduling model in 6.5), and regarding each scheduling model problem as a Markov decision process.
The Markov decision process comprises the design of state, action and reward, the optimization objective function of maximizing service guarantee rate and minimizing service overtime failure rate in the beam scheduling model is converted into reward, the beam constraint condition in the beam scheduling model is converted into action, and the arrival condition of a data packet with delay tolerance in each wave position cell is converted into state, and the method is specifically realized as follows:
7.1 The number matrix of the data packets to be transmitted with different residual delay tolerances in each wave bit queue of the current time slot is designed as follows:
Figure BDA0003555023570000095
wherein, the first and the second end of the pipe are connected with each other,
Figure BDA0003555023570000096
is t j The state at the time of the time slot,
Figure BDA0003555023570000097
is a two-dimensional state matrix;
the matrix
Figure BDA0003555023570000098
The reconstruction is obtained by reconstructing the arrival condition of the data packet in each wave bit buffer queue in the current time slot cluster, and the specific state reconstruction process is shown in fig. 5, wherein a wave bit n represents a cell with the number of n in the cluster, and T is th For the maximum delay tolerance, t, of all traffic types j For the time slots that the data packets go through during the waiting process, the "x", "o" and "Δ" represent the arrival of three different types of service data packets, respectively, and the left side of fig. 5 showsDividing the number of data packets in each wave bit queue in the current time slot cluster according to types to obtain a right graph matrix
Figure BDA0003555023570000099
The row number represents the residual delay tolerance, the column number represents the wave bit cell number, and the value in the row b column of the matrix a represents the number of data packets with the residual delay tolerance a in the wave bit cell b;
7.2 Design action selects the illuminated wave position cell for the current time slot:
Figure BDA0003555023570000101
wherein the content of the first and second substances,
Figure BDA0003555023570000102
is t j Action of selection in time slot, x n Indicating whether the wave position is irradiated by the working beam in the time slot or not, wherein N is the total number of wave position cells in the cluster;
7.3 Design reward is the difference between the number of processed packets and the number of failed packets for the current timeslot:
Figure BDA0003555023570000103
wherein
Figure BDA0003555023570000104
is the reward obtained after the action is executed in time slot t_j,
Figure BDA0003555023570000105
is the total number of data packets processed by the system after the action selected in the current time slot,
Figure BDA0003555023570000106
is the total number of data packets that failed due to timeout in the current time slot.
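The action and reward definitions in 7.2) and 7.3) can be illustrated with a single environment step. The sketch below is hedged: the function `step`, the per-slot `capacity` of the beam, the most-urgent-first service order, and the rule that unserved packets with remaining tolerance 1 time out at the end of the slot are all assumptions made for the example; the source only fixes the one-hot action and the reward as processed minus failed packets.

```python
import numpy as np

def step(state, action, capacity):
    """One hopping slot. state[a - 1, b] = packets in cell b with tolerance a."""
    action = np.asarray(action)
    if action.sum() != 1:
        raise ValueError("exactly one cell must be illuminated per slot")
    cell = int(np.argmax(action))
    state = state.copy()
    processed = 0
    # Serve the illuminated cell, most urgent packets first (assumed policy).
    for row in range(state.shape[0]):
        sent = min(int(state[row, cell]), capacity - processed)
        state[row, cell] -= sent
        processed += sent
    # Unserved packets with tolerance 1 time out (assumed timing rule).
    failed = int(state[0].sum())
    # All surviving packets lose one slot of tolerance.
    next_state = np.zeros_like(state)
    next_state[:-1] = state[1:]
    reward = processed - failed
    return next_state, reward

state = np.array([[1, 2], [0, 1], [3, 0]])
next_state, reward = step(state, action=[0, 1], capacity=2)
print(reward)
print(next_state)
```

New packet arrivals are left out on purpose so that the step stays deterministic and easy to check by hand.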
Step 8: Solve the Markov decision process designed in step 7 by deep reinforcement learning to obtain the beam scheduling result.
8.1) Initialize parameters:
8.1.1) Initialize the scene parameters within a cluster of the beam hopping satellite:
Confirm the wave position cell parameters and the beam parameters in each cluster according to the clustering result obtained in step 3; the wave position cell parameters of a cluster comprise the numbers of the cells in the cluster, the service request quantity of each cell, the service type and the data packet size; the beam parameters comprise the single-beam working bandwidth and the single-beam power;
8.1.2) Initialize the deep reinforcement learning parameters:
Taking the working beam as the agent and the arrival of data packets in the cluster as the environment, set the number of training periods M, the number of time slots T in each period, the learning rate α, the experience pool capacity N_ep, the batch size N_b, the discount factor γ, the network update frequency C, the current network Q, the target network
Figure BDA0003555023570000107
and the greedy factor ε;
8.2) Initialize the state of the current environment to s_t and update the greedy factor ε;
8.3) Take s_t as the input of the Q network to obtain the Q values output by the Q network for all actions, and select action a_t by the ε-greedy method;
8.4) Execute action a_t in state s_t to obtain the new state s_{t+1} and the reward r_t;
8.5) Store the tuple (s_t, a_t, r_t, s_{t+1}) obtained in 8.4) in the experience pool, and update the current environment state to s_{t+1};
8.6) Sample N_b samples from the experience pool to train the Q network, updating the Q network by gradient descent;
8.7) Judge whether the current time slot t reaches the network update frequency:
if t mod C = 1, i.e. the remainder of dividing t by C is 1, update the
Figure BDA0003555023570000111
network; otherwise, do not update the
Figure BDA0003555023570000112
network;
8.8) Judge whether the current time slot t reaches the number of time slots T in each period:
if t = T, judge whether the current iteration round m reaches the number of training periods M:
if m = M, terminate the iteration and output the trained scheduling model;
otherwise, repeat 8.2)-8.7) and continue training;
if t ≠ T, repeat 8.3)-8.7) and continue training.
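The control flow of steps 8.2) to 8.8) can be sketched in a few lines. This is an illustrative toy, not the patented scheduler: a linear Q function stands in for the deep network, and the toy environment (`env_reset`, `env_step`, Poisson arrival rates, beam `capacity`), the linear ε schedule and all numeric values are assumptions chosen so that the example runs quickly.

```python
import numpy as np

rng = np.random.default_rng(0)
N, T_TH = 4, 5                       # cells per cluster, max delay tolerance (toy values)
M, T = 5, 50                         # training periods, slots per period (toy values)
ALPHA, GAMMA, C = 1e-3, 0.9, 20      # learning rate, discount factor, update frequency
N_EP, N_B = 1000, 32                 # experience pool capacity, batch size
EPS_START, EPS_END = 1.0, 0.01       # initial and final exploration rates

def env_reset():
    return rng.poisson(0.5, size=(T_TH, N)).astype(float)

def env_step(state, a, capacity=3):
    """Toy environment: serve cell a, expire tolerance-1 packets, add arrivals."""
    state = state.copy()
    processed = 0.0
    for row in range(T_TH):                       # most urgent packets first
        sent = min(state[row, a], capacity - processed)
        state[row, a] -= sent
        processed += sent
    failed = state[0].sum()
    nxt = np.zeros_like(state)
    nxt[:-1] = state[1:]                          # surviving packets age by one slot
    nxt += rng.poisson(0.2, size=nxt.shape)       # assumed Poisson arrivals
    return nxt, processed - failed

def q_values(params, state):
    W, b = params
    return W @ state.ravel() + b

q = [rng.normal(0, 0.01, size=(N, T_TH * N)), np.zeros(N)]   # current network Q
q_target = [p.copy() for p in q]                              # target network
pool = []                                                     # experience pool

for m in range(1, M + 1):
    s = env_reset()                                           # 8.2) reset state
    eps = EPS_START + (EPS_END - EPS_START) * (m - 1) / (M - 1)   # assumed schedule
    for t in range(1, T + 1):
        if rng.random() < eps:                                # 8.3) epsilon-greedy
            a = int(rng.integers(N))
        else:
            a = int(np.argmax(q_values(q, s)))
        s_next, r = env_step(s, a)                            # 8.4) act
        pool.append((s, a, r, s_next))                        # 8.5) store transition
        if len(pool) > N_EP:
            pool.pop(0)
        s = s_next
        if len(pool) >= N_B:                                  # 8.6) gradient descent
            for i in rng.choice(len(pool), size=N_B, replace=False):
                bs, ba, br, bn = pool[i]
                target = br + GAMMA * np.max(q_values(q_target, bn))
                td = q_values(q, bs)[ba] - target
                q[0][ba] -= ALPHA * td * bs.ravel()
                q[1][ba] -= ALPHA * td
        if t % C == 1:                                        # 8.7) sync target network
            q_target = [p.copy() for p in q]

print(len(pool), np.round(q_values(q, env_reset()), 2))
```

Keeping the target network frozen between synchronizations is what decouples the bootstrap target from the parameters being updated; the linear model is used here only to keep the sketch dependency-free.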
The effect of the present invention can be further illustrated by the following simulation results:
1. Simulation conditions:
Simulation parameters: 3 satellite working beams; 19 wave positions covered by the satellite; single-beam working bandwidth of 100 MHz; single-beam power of 70 W; beam hopping time slot of 20 ms; data packet size of 20 kbits; delay tolerances of 2, 4 and 20 for the three services. Immune algorithm: population size 20, 100 iterations, diversity evaluation parameter 0.95, crossover probability 0.5, mutation probability 0.4, memory bank capacity 15. Deep reinforcement learning: 600 training periods, 1000 time slots per period, learning rate 0.00001, experience pool capacity 100000, batch size 32, discount factor 0.9, update step 20, initial exploration rate 1 and final exploration rate 0.01.
Simulation environment: MATLAB R2018b and Python 3.6.
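For reproduction purposes, the simulation parameters listed above could be collected in a single configuration object. The sketch below only restates the values given in the text; the dictionary layout and all key names are hypothetical, and the unit of the delay tolerances (hopping time slots) is taken from the delay tolerance definition elsewhere in the document.

```python
# Hypothetical configuration layout; values are those listed in the simulation conditions.
SIMULATION_PARAMS = {
    "scenario": {
        "working_beams": 3,
        "wave_positions": 19,
        "beam_bandwidth_mhz": 100,
        "beam_power_w": 70,
        "hop_slot_ms": 20,
        "packet_size_kbit": 20,
        "delay_tolerance_slots": (2, 4, 20),   # one value per service type
    },
    "immune_algorithm": {
        "population_size": 20,
        "iterations": 100,
        "diversity_parameter": 0.95,
        "crossover_probability": 0.5,
        "mutation_probability": 0.4,
        "memory_bank_capacity": 15,
    },
    "deep_rl": {
        "training_periods": 600,
        "slots_per_period": 1000,
        "learning_rate": 1e-5,
        "experience_pool_capacity": 100_000,
        "batch_size": 32,
        "discount_factor": 0.9,
        "update_step": 20,
        "initial_exploration_rate": 1.0,
        "final_exploration_rate": 0.01,
    },
}

rl = SIMULATION_PARAMS["deep_rl"]
total_training_slots = rl["training_periods"] * rl["slots_per_period"]
print(total_training_slots)
```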
2. Simulation content and results
Simulation 1: The deep reinforcement learning convergence rates of the delay-tolerance-based beam hopping clustering scheduling method of the invention and of the existing global beam hopping algorithm are simulated, and the result is shown in Fig. 6. As can be seen from Fig. 6, the method of the invention starts to converge at iteration 400, whereas the global scheduling method starts to converge only at iteration 1200; compared with global scheduling, the scheduling method of the invention improves the convergence rate twofold and reduces the computational complexity.
Simulation 2: The normalized throughput of the delay-tolerance-based beam hopping clustering scheduling method is compared with that of the existing longest-queue-first, polling and random scheduling beam hopping algorithms, and the result is shown in Fig. 7. As can be seen from Fig. 7, when the supply-demand ratio is 110%, the throughput of the method is improved by 6%, 10% and 15% compared with the longest-queue-first, polling and random scheduling algorithms, respectively.
Simulation 3: The service guarantee rate of the delay-tolerance-based beam hopping clustering scheduling method is compared with that of the existing longest-queue-first, polling and random allocation beam hopping algorithms, and the result is shown in Fig. 8. As can be seen from Fig. 8, when the supply-demand ratio is 110%, the service demand guarantee rate of the method is improved by 7%, 11% and 15% compared with the longest-queue-first, polling and random allocation algorithms, respectively.
The foregoing description is only an example of the present invention and is not intended to limit the invention. It will be apparent to those skilled in the art that various changes and modifications in form and detail may be made without departing from the spirit and scope of the invention.

Claims (7)

1. A high-throughput beam hopping scheduling method for differentiated services, characterized by comprising the following steps:
(1) Generating a clustering result within the satellite coverage area:
1a) Dividing the wave position cells within the satellite coverage area into different clusters;
1b) Establishing a clustering model for load balancing among clusters according to the principle of load balancing among clusters and the principle of geographical proximity of the wave position cells within a cluster:
Figure FDA0003959106520000011
s.t. C_1: d_mn ≤ s, m ∈ M_i and n ∈ M_i
Figure FDA0003959106520000012
wherein P is the objective function that minimizes the mean square error of the load among clusters, K is the number of clusters, N is the number of wave position cells, R_j is the traffic demand of cell j, X_ij indicates whether wave position j belongs to cluster i, R is the mean load of the K clusters, C_1 is the distance constraint, C_2 is the constraint ensuring that each wave position cell belongs to only one cluster, m and n denote two wave position cells in the same cluster, d_mn is the distance between the wave position centers of cells m and n, s is the upper limit of the distance between two wave position centers within a cluster, and M_i is the set of wave position cells belonging to cluster i;
1c) Solving the clustering model with balanced load among clusters by using an immune algorithm to obtain a clustering result;
(2) Establishing a cluster beam dynamic scheduling model:
2a) Determining, for each service type, a delay tolerance in units of hopping time slots according to the delay requirement of that service type;
2b) Letting data packets of different types wait in the on-satellite buffer queue to be delivered, wherein the corresponding delay tolerance is reduced by one each time a hopping time slot passes, and a data packet is discarded and regarded as a timeout failure when its on-satellite waiting delay exceeds its delay tolerance;
2c) Establishing a delay-tolerance-based beam hopping scheduling model according to the principles of maximizing the service guarantee rate and minimizing the service timeout failure rate:
Figure FDA0003959106520000021
Figure FDA0003959106520000022
s.t. C_1:
Figure FDA0003959106520000023
C_2:
Figure FDA0003959106520000024
wherein P_1 is the objective of maximizing the service guarantee rate of intra-cluster services, T is the set of all decision moments of the beam hopping satellite within the coverage time of the same area, N is the total number of wave position cells in the cluster,
Figure FDA0003959106520000025
denotes the number of data packets sent to cell n after time slot t_j ends,
Figure FDA0003959106520000026
denotes the total number of data packets destined for cell n received by the satellite up to time slot t_j, P_2 is the objective of minimizing the intra-cluster service timeout failure rate,
Figure FDA0003959106520000027
denotes the number of data packets destined for wave position cell n that failed due to timeout while waiting, C_1 ensures that exactly one cell in the cluster is scheduled a beam in each hopping time slot, C_2 ensures that the data packets stored on the satellite at the current time do not exceed the maximum limit,
Figure FDA0003959106520000028
indicates whether wave position cell n is illuminated by the working beam in time slot t_j,
Figure FDA0003959106520000029
indicates that the cell is illuminated; otherwise it is not illuminated,
Figure FDA00039591065200000210
denotes the number of data packets stored in the satellite memory corresponding to wave position cell n after time slot t_j ends, and L is the maximum capacity of the buffer queue of each wave position cell on the satellite;
(3) Establishing the beam dynamic scheduling model of step (2) in each cluster obtained in step (1), treating the scheduling problem as a Markov decision process, and solving it by deep reinforcement learning to obtain the beam scheduling result.
2. The method of claim 1, wherein in 1a) the wave position cells within the satellite coverage area are divided into different clusters, the number of clusters is determined by the number K of working beams of a single satellite, the working beams correspond one-to-one with the clusters, and each working beam is responsible for dynamic beam scheduling within one cluster.
3. The method according to claim 1, wherein in 1b) the clustering model for load balancing among clusters is established according to the principle of load balancing among clusters and the principle of geographical proximity of the wave position cells within a cluster, implemented as follows:
1b1) Determining the service request quantity of each wave position cell, and calculating the sum S of the service request quantities of all cells in the coverage area of the satellite;
1b2) Calculating the load mean of K clusters
Figure FDA00039591065200000211
and establishing the load balancing optimization objective:
Figure FDA0003959106520000031
wherein P is the objective function that minimizes the mean square error of the load among clusters, K is the number of clusters, N is the number of wave position cells, R_j is the traffic demand of cell j, X_ij indicates whether wave position j belongs to cluster i, and R is the mean load of the K clusters;
1b3) Determining the distance upper limit s of the centers of two wave positions in the cluster, and establishing a distance constraint condition:
d_mn ≤ s, m ∈ M_i and n ∈ M_i
where m and n denote two wave position cells in the same cluster, d_mn is the distance between the wave position centers of cells m and n, and M_i is the set of wave position cells belonging to cluster i;
1b4) Combining the load balancing optimization objective in 1b2) with the distance constraint in 1b3) to obtain the clustering model for load balancing among clusters:
Figure FDA0003959106520000032
s.t. C_1: d_mn ≤ s, m ∈ M_i and n ∈ M_i
C_2:
Figure FDA0003959106520000033
4. The method according to claim 1, wherein in 1c) the clustering model for load balancing among clusters is solved by an immune algorithm, implemented as follows:
1c1) Initializing a central wave position population and a memory library of each cluster;
1c2) For the clustering model, an affinity function is designed:
Figure FDA0003959106520000034
wherein A is v For the affinity function, P is an objective function P that minimizes the mean square error of the load between clusters, C is a penalty constant for solutions that do not satisfy the distance requirement, and Y denotes that the distance constraint C is not satisfied 1 The number of wave bits;
1c3) Sorting the population in a descending order according to the affinity function in 1c 2), selecting the first H individuals to form a parent population, sequentially performing selection, crossing and mutation operations to obtain a new population, taking out partial individuals from a memory bank to jointly form a new generation population,
1c4) Repeating 1c 3) for a plurality of iterations until a maximum number of iterations N is reached e And obtaining an optimal solution.
5. The method of claim 1, wherein in 2a) the delay tolerance in units of hopping time slots is determined according to the delay requirements of different service types, implemented as follows:
2a1) Calculating the delay experienced by the data packet during transmission before reaching the destination satellite:
T_delay_1 = T_prop + T_trans
wherein T_delay_1 is the total delay experienced before reaching the destination satellite, T_prop is the propagation delay, and T_trans is the transmission delay;
2a2) Estimating the transmission delay T_delay_2 of the data packet from the destination satellite to the user terminal;
2a3) Determining the delay limit T_limit in the QoS guarantee of the service type of the data packet, determining the length BH_slot of a beam hopping time slot, and calculating the remaining delay tolerance of the data packet:
Figure FDA0003959106520000041
wherein D_tole is the remaining delay tolerance and T_delay_1 is the total delay experienced during transmission before reaching the destination satellite.
6. The method of claim 1, wherein in 2c) the delay-tolerance-based beam hopping scheduling model is established according to the principles of maximizing the service guarantee rate and minimizing the service timeout failure rate, implemented as follows:
2c1) Characterizing the intra-cluster beam hopping system scenario:
letting one working beam serve a cluster of N wave position cells, and expressing the service request quantity of each cell as data packets whose arrivals follow a Poisson distribution with arrival rate λ_i, i = 1, 2, ..., N;
representing the amount of data stored in the satellite memory corresponding to each wave position cell as
Figure FDA0003959106520000042
wherein
Figure FDA0003959106520000043
denotes the number of data packets stored in the satellite memory corresponding to the n-th wave position cell after time slot t_j ends;
2c2) Establishing an optimization objective function for maximizing service guarantee rate and minimizing service overtime failure rate according to the data packet arrival condition of each cell:
Figure FDA0003959106520000051
Figure FDA0003959106520000052
wherein P_1 is the objective of maximizing the service guarantee rate of intra-cluster services, P_2 is the objective of minimizing the intra-cluster service timeout failure rate, T is the set of all decision moments of the beam hopping satellite within the coverage time of the same area, N is the total number of wave position cells in the cluster,
Figure FDA0003959106520000053
denotes the number of data packets sent to cell n after time slot t_j ends,
Figure FDA0003959106520000054
denotes the total number of data packets destined for cell n received by the satellite up to time slot t_j,
Figure FDA0003959106520000055
denotes the number of data packets destined for wave position cell n that failed due to timeout while waiting;
2c3) According to the one-to-one correspondence relationship between the working beams and the clusters, beam constraint conditions are established:
Figure FDA0003959106520000056
wherein
Figure FDA0003959106520000057
indicates whether wave position cell n is illuminated by the working beam in time slot t_j,
Figure FDA0003959106520000058
indicates that the cell is illuminated; otherwise it is not illuminated;
2c4) Determining the maximum capacity L of each wave bit cell buffer queue on the satellite, and establishing a buffer constraint condition:
Figure FDA0003959106520000059
wherein
Figure FDA00039591065200000510
denotes the number of data packets stored in the satellite memory corresponding to wave position cell n after time slot t_j ends, and L is the maximum capacity of the buffer queue of each wave position cell on the satellite;
2c5) Combining the optimization objective functions of maximizing the service guarantee rate and minimizing the service timeout failure rate in 2c2), the beam constraint in 2c3) and the buffer constraint in 2c4) to obtain the delay-tolerance-based beam hopping scheduling model:
Figure FDA0003959106520000061
Figure FDA0003959106520000062
s.t. C_1:
Figure FDA0003959106520000063
C_2:
Figure FDA0003959106520000064
7. The method according to claim 1, wherein in (3) the beam dynamic scheduling model established in each cluster is treated as a Markov decision process and solved by deep reinforcement learning to obtain the beam scheduling result, implemented as follows:
3a) Designing the state in the deep reinforcement learning algorithm as the matrix of the numbers of data packets to be transmitted with different remaining delay tolerances in each wave position queue in the current time slot:
Figure FDA0003959106520000065
wherein
Figure FDA0003959106520000066
is the state in time slot t_j, and
Figure FDA0003959106520000067
is a two-dimensional state matrix;
3b) Designing the action in the deep reinforcement learning algorithm as selecting the wave position cell to be illuminated in the current time slot:
Figure FDA0003959106520000068
wherein
Figure FDA0003959106520000069
is the action selected in time slot t_j, x_n indicates whether wave position n is illuminated by the working beam in that time slot, and N is the total number of wave position cells in the cluster;
3c) Designing the reward in the deep reinforcement learning algorithm as the difference between the number of data packets processed and the number of data packets that failed in the current time slot:
Figure FDA00039591065200000610
wherein
Figure FDA00039591065200000611
is the reward obtained after the action is executed in time slot t_j,
Figure FDA00039591065200000612
is the total number of data packets processed by the system after the action selected in the current time slot,
Figure FDA00039591065200000613
is the total number of data packets that failed due to timeout in the current time slot;
3d) Taking the working beam as the agent and the arrival of data packets in the cluster as the environment, executing the deep reinforcement learning algorithm to obtain the optimization result.
CN202210273871.5A 2022-03-19 2022-03-19 High-throughput beam hopping scheduling method for differentiated services Active CN114629547B (en)


Publications (2)

Publication Number Publication Date
CN114629547A CN114629547A (en) 2022-06-14
CN114629547B true CN114629547B (en) 2023-01-20

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant