CN117614507A - Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network - Google Patents

Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network Download PDF

Info

Publication number
CN117614507A
CN117614507A CN202311484841.XA CN202311484841A CN117614507A CN 117614507 A CN117614507 A CN 117614507A CN 202311484841 A CN202311484841 A CN 202311484841A CN 117614507 A CN117614507 A CN 117614507A
Authority
CN
China
Prior art keywords
satellite
leo
leo satellite
representing
traffic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311484841.XA
Other languages
Chinese (zh)
Inventor
石金明
辛宁
陈特
李殷乔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Academy of Space Technology CAST
Original Assignee
China Academy of Space Technology CAST
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Academy of Space Technology CAST filed Critical China Academy of Space Technology CAST
Priority to CN202311484841.XA priority Critical patent/CN117614507A/en
Publication of CN117614507A publication Critical patent/CN117614507A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/14Relay systems
    • H04B7/15Active relay systems
    • H04B7/185Space-based or airborne stations; Stations for satellite systems
    • H04B7/1851Systems using a satellite or space-based relay
    • H04B7/18513Transmission in a satellite or space-based system
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B7/00Radio transmission systems, i.e. using radiation field
    • H04B7/14Relay systems
    • H04B7/15Active relay systems
    • H04B7/185Space-based or airborne stations; Stations for satellite systems
    • H04B7/1851Systems using a satellite or space-based relay
    • H04B7/18519Operations control, administration or maintenance
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W28/00Network traffic management; Network resource management
    • H04W28/02Traffic management, e.g. flow control or congestion control
    • H04W28/08Load balancing or load distribution
    • H04W28/09Management thereof
    • H04W28/0917Management thereof based on the energy state of entities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W28/00Network traffic management; Network resource management
    • H04W28/02Traffic management, e.g. flow control or congestion control
    • H04W28/08Load balancing or load distribution
    • H04W28/09Management thereof
    • H04W28/0925Management thereof using policies
    • H04W28/095Management thereof using policies based on usage history, e.g. usage history of devices
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W28/00Network traffic management; Network resource management
    • H04W28/02Traffic management, e.g. flow control or congestion control
    • H04W28/08Load balancing or load distribution
    • H04W28/09Management thereof
    • H04W28/0958Management thereof based on metrics or performance parameters
    • H04W28/0967Quality of Service [QoS] parameters
    • H04W28/0975Quality of Service [QoS] parameters for reducing delays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W84/00Network topologies
    • H04W84/02Hierarchically pre-organised networks, e.g. paging networks, cellular networks, WLAN [Wireless Local Area Network] or WLL [Wireless Local Loop]
    • H04W84/04Large scale networks; Deep hierarchical networks
    • H04W84/06Airborne or Satellite Networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Abstract

The invention relates to a self-adaptive flow unloading method of a high-dynamic topology heaven-earth integrated network, which comprises the steps of collecting information such as heaven-earth integrated network topology, link state, satellite flow load and the like, self-adaptive flow unloading flow of LEO satellites, reinforcement learning model aggregation of GEO satellites and the like. The method is based on a federal reinforcement learning architecture and an algorithm of a GEO satellite and LEO satellite mixed constellation, and intelligent decision is made on a flow unloading request from a ground user terminal, so that the average flow unloading delay of the ground user terminal is optimized, and the problems of unbalanced LEO satellite flow load and overlarge unloading delay caused by isomerism and high dynamic property in an heaven-earth integrated network are solved.

Description

Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network
Technical Field
The invention relates to a low-orbit constellation network flow unloading method applied to the field of space information, and belongs to the technical field of flow distribution in a wireless communication network.
Background
In a conventional satellite communication system, GEO satellites are used for a static topology network, and uninterrupted transmission is performed between the satellites and a ground station, so that wide coverage can be achieved even if only one satellite is deployed. GEO satellites, however, are more suitable for providing delay insensitive communication services because of their longer earth transmission delay. While large LEO constellations can provide high throughput, high bandwidth rates, and low service delays for global coverage, the high speed motion of LEO satellites relative to the ground results in shorter durations of their links with the ground and faster changes in the satellite-to-ground network topology. In addition, it is difficult to satisfy various business demands by only GEO satellites or LEO satellites. For GEO satellites, even though they have a large amount of communication and buffering resources, the long propagation delay of the satellite to the earth cannot meet certain delay sensitive class services. For LEO satellites, relatively little on-board resources are available, which may increase the offloading delay and packet loss rate in traffic offloading. Therefore, an heaven-earth integrated network based on a mixed constellation of GEO satellites and LEO satellites plays an important role in meeting the diversified demands of ground users.
In a typical world-wide integrated network, GEO satellites are generally part of the world-wide backbone network, and LEO satellites are part of the world-wide access network, responsible for setting up inter-satellite links, data exchanges, and relays between the world-wide and ground backbones. In addition, the ground users are unevenly distributed over the coverage of the satellites, which results in the problem that some satellites covering the hot spot area are not available with sufficient resources, while others are too much. Therefore, research on inter-satellite cooperative transmission has important significance for improving the utilization rate of the space-earth integrated network resources.
Because the long-distance transmission among different network segments in the heaven-earth integrated network can cause unstable communication links, most of the existing flow unloading methods face to the ground static topology network, the existing flow unloading methods cannot be directly applied to the heaven-earth integrated network with high dynamic topology, and because of the isomerism and the high dynamic property of the heaven-earth integrated network, flow transmission strategies are designed in a targeted manner so as to adapt to the unbalance of resources and the change of environment. Therefore, there is a need to design an adaptive traffic offload method for a heaven-earth integrated network that is oriented to a high dynamic topology. Reinforcement learning is a learning strategy method that can dynamically give near optimal traffic offload decisions by collecting network states and based on the network states. In addition, in the space-earth integrated network, because the nodes are highly dynamic, centralized control of global information is difficult to realize, so the centralized reinforcement learning method is not suitable for the space-earth integrated network with high dynamic.
Federal learning, as an enhanced decentralization approach, can be deployed on multiple LEO satellites and learn and make decisions in a distributed fashion. The federal learning can effectively utilize local data and satellite-borne computing resources of a plurality of LEO satellites, and save signaling overhead caused by frequent transmission of original training data. Therefore, the self-adaptive flow unloading method for the high-dynamic heaven-earth integrated network can be designed based on federal reinforcement learning.
Disclosure of Invention
The technical solution of the invention is as follows: the self-adaptive flow unloading method of the high-dynamic topology heaven-earth integrated network is provided, and the problems of unbalanced LEO satellite flow load and overlarge unloading delay caused by isomerism and high dynamic property in the heaven-earth integrated network are solved.
The technical scheme of the invention is as follows:
an adaptive traffic offload method for a high dynamic topology heaven and earth integrated network, comprising:
(1) The ground user terminal generates a traffic unloading task and sends a traffic unloading request to LEO satellites in a coverage area where the traffic unloading request is located;
(2) After receiving a flow unloading request of a ground terminal, the LEO satellite receives the unloading flow of the ground terminal;
(3) The LEO satellite makes traffic offload decisions for forwarding offload traffic:
(4) If the unloaded flow is forwarded to the adjacent LEO satellite, the adjacent LEO satellite continuously executes the flow unloading decision process of the step (3);
(5) If the unloading flow is forwarded to the GEO satellite, judging whether a local buffer queue is full, if so, directly discarding the data packet, otherwise, adding the data packet into the buffer queue, then forwarding the data packet to an earth station near the target terminal, and then forwarding the data packet to the target terminal by the earth station.
Further, the step (3) performs a traffic offload decision, specifically:
a) If the target terminal is located in the coverage area of the LEO satellite, forwarding the data to the target terminal through the LEO satellite in a one-hop mode;
b) If the target terminal is not in the LEO satellite coverage area and other satellites are required to forward the traffic, constructing an observation state vector s according to the collected network state information i Will s i Inputting a locally trained reinforcement learning model to obtain a flow unloading decision a i And forwards traffic to neighboring LEO satellites or GEO satellites based on the decision information.
Further, an observation state vector s is constructed i The method specifically comprises the following steps:
s i ={l 0 ,l 1 ,…,l k ,…,l K ,l G ,d 0 ,d 1 ,…,d k ,…,d K ,d G ,
q 0 ,q 1 ,…,q k ,…,q K ,q G ,B 0 ,B 1 ,…,B k ,…,B K ,B G }
wherein l 0 Indicating the link duration of the LEO satellite with a terrestrial user terminal, l k Indicating the link duration of the LEO satellite with the neighboring LEO satellite k, l G Representing a link duration of the LEO satellite with the GEO satellite; d, d 0 Represents the distance of the LEO satellite from the ground user terminal, d k Represents the distance, d, of the LEO satellite from the adjacent LEO satellite k G Representing the distance of the LEO satellite from the GEO satellite; q 0 Represents the buffer queue length, q, of the LEO satellite k Buffer queue length, q, representing adjacent LEO satellite k of the LEO satellite G Representing the buffer queue length of the GEO satellite; b (B) 0 Indicating the available communication bandwidth of the LEO satellite, B k Representing the available communication bandwidth of a neighboring LEO satellite k of the LEO satellite, B G Representing the communication bandwidth available to GEO satellites.
Further, during each cycle, the LEO satellite broadcasts Hello packets to its neighboring LEO satellites, including the buffer queue length, location parameters, motion parameters, and available communication bandwidth local to the LEO satellite.
Further, training the reinforcement learning model is achieved specifically by the following modes:
(b1) At the beginning of each cycle, the LEO satellite calculates a reward R based on the actual discharge delay of the traffic i The network state s in the last period is to be calculated i Traffic offload decision a i And a traffic offload bonus R i Storing in an experience data set as history data;
(b2) Within each period, LEO satellites are randomly sampled in a local empirical data setTraining a reinforcement learning model and updating model parameters by a plurality of samples;
(b3) Periodically polymerizing the reinforcement learning model by the GEO satellite;
(b4) The GEO satellite broadcasts the aggregated reinforcement learning network model parameters to LEO constellations;
(b5) And after the LEO satellite receives the aggregation model parameters of the GEO satellite, updating the local reinforcement learning model parameters.
Further, the LEO satellite calculates a reward R according to the actual unloading delay of the flow i The method specifically comprises the following steps:
the reward function obtained by LEO satellite performing traffic offload decision is defined as
Wherein, kappa i Indicating traffic type, κ S Representing delay-sensitive traffic, κ N Indicating that the traffic is not delay sensitive,is an indication function;
for delay-sensitive traffic, the reward function for offloading traffic by LEO satellites is defined as
Wherein t is i Indicating delay in flow unloading, t max,i Representing a maximum tolerable delay for flow offloading, - Γ being a constant representing a loss due to flow offloading delay exceeding the maximum tolerable delay;
for non-delay sensitive traffic, the reward function for offloading traffic by LEO satellites is defined as
Further, traffic offload decision a i The method comprises the following steps:
a i ={x 0 ,x 1 ,…,x k ,…,x K ,b 0 ,b 1 ,…,b k ,…,b K }
wherein x is 0 Representing data transmission to GEO satellite, x k Representing offloading data to neighboring LEO satellites k, b of the LEO satellite 0 Representing the communication bandwidth allocated for the link between the LEO satellite and the GEO satellite, b k Representing the communication bandwidth allocated for the link between the LEO satellite and the neighboring LEO satellite.
Further, the transmission delay of one hop in the heaven-earth integrated network is calculated by the following formula:
wherein d is ij Represents the distance between node i and node j, c represents the speed of light, l i Indicating the size of the data packet, R jj Representing the transmission rate of the link between node i and node j, q i Representing the cache queue length of node i.
Further, if a data packet from a ground user terminal can be forwarded to a target node by one hop through the LEO satellite, traffic offload is delayed as
t i =t ul +t lu
Wherein t is ul Representing one-hop propagation delay from a user terminal to the LEO satellite, t lu Representing a one-hop transmission delay from the LEO satellite to a target user terminal;
if the data packet cannot be forwarded to the target node by one hop through the LEO satellite, but can be forwarded to the target node by two hops through the LEO satellite adjacent to the data packet, the traffic unloading delay is
t i =t ul +t ln +t nu
Wherein t is ln Representing one-hop propagation delay from the LEO satellite to its neighboring LEO satellites, t nu Representing a one-hop transmission delay from the LEO satellite adjacent satellite to the target user terminal;
if the data packet cannot be forwarded to the target node through the LEO satellite and LEO satellites adjacent to the LEO satellite, the LEO satellite forwards the data packet to the GEO satellite, and the traffic offload delay of the process is that
Wherein t is lg Representing one-hop propagation delay from the LEO satellite to the GEO satellite, t gs Representing one-hop propagation delay from GEO satellite to earth station, t su Representing the transmission delay from the earth station to the user terminal,represents the maximum buffer queue length, q, of a GEO satellite G Representing the buffer queue length of the GEO satellite.
Further, the reinforcement learning model is based on a Q-learning model, and the updating method of model parameters is as follows:
wherein w represents a parameter of the Q-network, eta represents a learning rate, L Q (w) is a loss function calculated as follows:
where λ is the discount factor and Q (s, a; w) is the Q network.
Further, the GEO satellite in step (b 3) periodically aggregates the reinforcement learning model, specifically:
at intervals, the GEO satellite randomly selects N from the LEO constellation C Each LEO satellite sends a model transmission request to the selected LEO satellite;
after the selected LEO satellite receives the request from the GEO satellite, the local reinforcement learning model parameters w are obtained c And number of samples n in training sample set c Uploading to a GEO satellite;
and the GEO satellites perform model aggregation according to the received LEO satellite reinforcement learning model parameters and the number of samples in the training sample set.
Further, model polymerization was performed in the following manner:
wherein w is agg Is the parameters of the reinforced learning network model after aggregation.
Compared with the prior art, the invention has the beneficial effects that:
(1) The invention adopts a self-adaptive flow unloading method based on federal reinforcement learning, and the method utilizes reinforcement learning to carry out real-time intelligent decision on a ground flow unloading request by collecting information such as topology information of an heaven-earth integrated network, states of inter-satellite and satellite-earth links, flow loads of adjacent satellites, available communication bandwidths and the like, so that the method is applicable to the heaven-earth integrated network with high dynamic topology;
(2) In the invention, the LEO satellite only carries out aggregation in the GEO satellite on the initial part of the local training model parameters, so as to update the global reinforcement learning model. The GEO satellite then sends the updated model back to the LEO satellite. Thus, the LEO satellite can quickly make traffic offloading decisions according to the updated model. The process can greatly save redundant original data exchange consumption, and the global model realizes global convergence of integrating all LEO satellite flow unloading characteristics;
(3) According to the invention, the federal reinforcement learning algorithm is deployed on the LEO satellites, so that the cooperative intelligent traffic unloading decision of a plurality of LEO satellites can be realized, the average traffic unloading delay of the user terminal in the heaven-earth integrated network is optimized, and the congestion problem caused by unbalanced load in the network is effectively avoided.
Drawings
FIG. 1 is a schematic diagram of an adaptive traffic offload architecture for an heaven-earth integrated network in accordance with the present invention;
fig. 2 is a schematic diagram of an heaven-earth integrated network traffic offloading flow in an embodiment of the present invention.
Detailed Description
The following describes in further detail the embodiments of the present invention with reference to the accompanying drawings.
The invention provides a self-adaptive flow unloading method of a high-dynamic topology heaven-earth integrated network, in the method, the LEO satellite has the main functions of receiving a flow unloading request from a ground user terminal and deciding whether to forward the flow to an adjacent LEO satellite or a GEO satellite, wherein the flow forwarding decision can be self-adaptively adjusted according to the network state through a reinforcement learning model local to the LEO satellite. The main function of GEO satellites is to forward traffic from LEO satellites to earth stations near the target user terminal while being responsible for collecting reinforcement learning model parameters from LEO satellites and performing periodic model aggregation. The schematic diagram of the adaptive traffic offload architecture of the heaven-earth integrated network in the invention is shown in fig. 1.
In the high dynamic topology heaven-earth integrated network traffic offloading method, the observed state space of the LEO satellite is defined as follows:
s i ={l 0 ,l 1 ,…,l k ,…,l K ,l G ,d 0 ,d 1 ,…,d k ,…,d K ,d G
q 0 ,q 1 ,…,q k ,…,q K ,q G ,B 0 ,B 1 ,…,B k ,…,B K ,B G }
wherein l 0 Indicating the link duration of the LEO satellite with a terrestrial user terminal, l k Indicating the link duration of the LEO satellite with the neighboring LEO satellite k, l G Representing a link duration of the LEO satellite with the GEO satellite; d, d 0 Represents the distance of the LEO satellite from the ground user terminal, d k Represents the distance, d, of the LEO satellite from the adjacent LEO satellite k G Representing the distance of the LEO satellite from the GEO satellite; q 0 Represents the buffer queue length, q, of the LEO satellite k Buffer queue length, q, representing adjacent LEO satellite k of the LEO satellite G Representing the buffer queue length of the GEO satellite; b (B) 0 Indicating the available communication bandwidth of the LEO satellite, B k Representing the available communication bandwidth of a neighboring LEO satellite k of the LEO satellite, B G Representing the communication bandwidth available to GEO satellites.
In addition, the distance between the LEO satellite and the ground user terminal and the distance between the LEO satellite and the adjacent satellite can be calculated by the following formula:
wherein x is i ,y i ,z i The position coordinates of the satellite or ground terminal are represented by the following calculation method:
wherein θ is k Andlatitude and longitude information representing a satellite or ground terminal, respectively. In particular, when h=0, x i ,y i ,z i Indicating the location of the ground terminal.
The traffic offload decision space for LEO satellites is defined as follows:
a i ={x 0 ,x 1 ,…,x k ,…,x K ,b 0 ,b 1 ,…,b k ,…,b K }
wherein x is 0 Representing data transmission to GEO satellite, x k Representing offloading data to neighboring LEO satellites k, b of the LEO satellite 0 Representing the communication bandwidth allocated for the link between the LEO satellite and the GEO satellite, b k Representing the communication bandwidth allocated for the link between the LEO satellite and the neighboring LEO satellite.
In a high dynamic topology heaven-earth integrated network, the main optimization objective of the traffic offload algorithm is to minimize the average traffic offload delay of the user terminals in the heaven-earth integrated network. Thus, traffic offload decision assessment for LEO satellites needs to be based on traffic offload delays. The flow unloading delay calculation method of the single user terminal in the heaven-earth integrated network is as follows:
1. for delay-sensitive traffic, the reward function for offloading traffic by LEO satellites is defined as
Wherein t is i Indicating delay in flow unloading, t max,i Representing the maximum tolerable delay for traffic offloading, - Γ being a constant representing the loss caused by the traffic offloading delay exceeding the maximum tolerable delay.
2. For non-delay sensitive traffic, the bonus function for offloading traffic by LEO satellites is defined as
In summary, the reward function obtained by LEO satellites performing traffic offload decisions is defined as
Wherein kappa is i Indicating traffic type, κ S Representing delay-sensitive traffic, κ N Indicating that the traffic is not delay sensitive,to indicate a function.
The transmission delay of one hop in the heaven-earth integrated network is calculated by the following formula:
wherein d is ij Represents the distance between node i and node j, c represents the speed of light, l i Indicating the size of the data packet, R ij Representing the transmission rate of the link between node i and node j, q i Representing the cache queue length of node i.
In particular, for a certain LEO satellite, if a packet from a terrestrial user terminal can be forwarded to the target node by one hop through the LEO satellite, the traffic offload delay is
t i =t ul +t lu
Wherein t is ul Representing one-hop propagation delay from a user terminal to the LEO satellite, t lu Representing a one-hop transmission delay from the LEO satellite to the target user terminal.
If the data packet cannot be forwarded to the target node by one hop through the LEO satellite, but can be forwarded to the target node by two hops through the LEO satellite adjacent to the LEO satellite, the traffic unloading delay is t i =t ul +t ln +t nu
Wherein t is ln Representing one-hop propagation delay from the LEO satellite to its neighboring LEO satellites, t nu Representing one-hop transmission delay from the LEO satellite's neighboring satellites to the target user terminal.
If the data packet can not be forwarded to the target node through the LEO satellite and LEO satellites adjacent to the LEO satellite, the LEO satellite forwards the data packet to the GEO satellite, and if the buffer queue of the GEO satellite is full, the data packet is directly discarded, otherwise, the data packet is forwarded to an earth station associated with the target node, and then the earth station sends the data packet to the target user terminal, and the traffic unloading delay in the process is that
Wherein t is lg Representing one-hop propagation delay from the LEO satellite to the GEO satellite, t gs Representing one-hop propagation delay from GEO satellite to earth station, t su Representing the transmission delay from the earth station to the user terminal,representing the maximum buffer queue length for GEO satellites.
The transmission rate of the link between the nodes needs to be calculated when the traffic offload delay is calculated, including the transmission rate of the satellite-to-ground link and the transmission rate of the inter-satellite link. The achievable transmission rate of an inter-satellite link can be calculated by the following formula:
wherein P is ts Representing the transmitted power of LEO satellites, G ts Indicating the gain of LEO satellite transmitting antenna, G rs Representing the gain of the LEO satellite receiving antenna, k representing the Boltzmann constant, N s Representing noise temperature of the whole communication system E b /N 0 Representing the ratio of the received energy per bit to the noise spectral density, L f The free space attenuation representing the signal power in the inter-satellite link is calculated as follows:
where c represents the speed of light, d represents the distance between two LEO satellites, and f is the center frequency of the transmitted signal.
The calculation formula of the satellite-ground link reachable transmission rate is as follows:
R SG =Blog 2 (1+SNR SG )
wherein SNR is SG The signal-to-noise ratio, representing the satellite-to-ground link, can be calculated by the following formula:
wherein P is t Representing the transmitting power of the transmitting end, G t Indicating the antenna gain of the transmitting end, G r Indicating the antenna gain of the receiving end, L r Represents the atmospheric attenuation of the satellite-to-ground link, Σp t′ G t′ G r′ L f′ L r′ Indicating interference generated by other satellite-to-ground link transmissions on the satellite-to-ground link, N indicating noise power.
As shown in fig. 2, the flow of the LEO satellite flow unloading method based on federal reinforcement learning of the invention is as follows:
1. the ground user terminal sends a flow unloading request to LEO satellites in the coverage area where the ground user terminal is located;
2. after receiving a flow unloading request of a ground user terminal, the LEO satellite receives unloading flow from the ground user terminal;
3. the LEO satellite makes decisions on forwarding of the offload traffic:
a) If the target node for unloading the data is in the coverage area of the LEO satellite, directly forwarding the data to the target node;
b) If the target node for unloading data is not in the coverage of the LEO satellite, constructing an observation state vector s according to the collected network state information i Will s i Inputting a locally trained reinforcement learning model to obtain a flow unloading decision a i Forwarding the traffic to an adjacent LEO satellite or GEO satellite according to the decision information;
4. if the unloading flow is forwarded to the adjacent LEO satellite, the adjacent LEO satellite continues to execute the flow unloading decision process of the step 3;
5. if the unloading flow is forwarded to the GEO satellite, the data packet is directly discarded if the GEO satellite buffer queue is full, otherwise, the data is forwarded to an earth station near the target terminal, and then the earth station forwards the data to the target terminal.
Training the reinforcement learning model is realized in the following manner:
(1) LEO satellite calculates a reward R based on actual offloading delays of traffic i And will<s i ,a i ,R i ,s i+1 >Storing the experience data set;
(2) Within each period, LEO satellites are randomly sampled in a local empirical data setTraining a reinforcement learning model based on a Q-learning model and updating model parameters by the following method:
wherein w represents a parameter of the Q-network, eta represents a learning rate, L Q (w) is lossThe function is calculated as follows:
(3) The GEO satellites periodically aggregate the reinforcement learning model:
a) GEO random selection N C A plurality of LEO satellites;
b) The selected LEO satellite will locally train model parameters w c And number of samples n in training sample set c Uploading to a GEO satellite;
c) The GEO satellite carries out model aggregation according to the received LEO satellite training model and the sample number, and the aggregation mode is as follows:
(4) The GEO satellite broadcasts the aggregated reinforcement learning network model parameters to LEO constellations;
(5) And after the LEO satellite receives the aggregation model parameters of the GEO satellite, updating the local reinforcement learning model parameters.
The invention is not described in detail in the field of technical personnel common knowledge.

Claims (12)

1. The self-adaptive traffic unloading method of the high-dynamic topology heaven-earth integrated network is characterized by comprising the following steps of:
(1) The ground user terminal generates a traffic unloading task and sends a traffic unloading request to LEO satellites in a coverage area where the traffic unloading request is located;
(2) After receiving a flow unloading request of a ground terminal, the LEO satellite receives the unloading flow of the ground terminal;
(3) The LEO satellite makes traffic offload decisions for forwarding offload traffic:
(4) If the unloaded flow is forwarded to the adjacent LEO satellite, the adjacent LEO satellite continuously executes the flow unloading decision process of the step (3);
(5) If the unloading flow is forwarded to the GEO satellite, judging whether a local buffer queue is full, if so, directly discarding the data packet, otherwise, adding the data packet into the buffer queue, then forwarding the data packet to an earth station near the target terminal, and then forwarding the data packet to the target terminal by the earth station.
2. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 1, wherein: and (3) carrying out flow unloading decision, specifically:
a) If the target terminal is located in the coverage area of the LEO satellite, forwarding the data to the target terminal through the LEO satellite in a one-hop mode;
b) If the target terminal is not in the LEO satellite coverage area and other satellites are required to forward the traffic, constructing an observation state vector s according to the collected network state information i Will s i Inputting a locally trained reinforcement learning model to obtain a flow unloading decision a i And forwards traffic to neighboring LEO satellites or GEO satellites based on the decision information.
3. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 2, wherein: construction of an observed State vector s i The method specifically comprises the following steps:
s i ={l 0 ,l 1 ,…,l k ,…,l K ,l G ,d 0 ,d 1 ,…,d k ,…,d K ,d G ,
q 0 ,q 1 ,…,q k ,…,q K ,q G ,B 0 ,B 1 ,…,B k ,…,B K ,B G }
wherein l 0 Indicating the link duration of the LEO satellite with a terrestrial user terminal, l k Indicating the link duration of the LEO satellite with the neighboring LEO satellite k, l G Representing a link duration of the LEO satellite with the GEO satellite; d, d 0 Represents the distance of the LEO satellite from the ground user terminal, d k Representing the LEO satelliteDistance, d, adjacent LEO satellite k G Representing the distance of the LEO satellite from the GEO satellite; q 0 Represents the buffer queue length, q, of the LEO satellite k Buffer queue length, q, representing adjacent LEO satellite k of the LEO satellite G Representing the buffer queue length of the GEO satellite; b (B) 0 Indicating the available communication bandwidth of the LEO satellite, B k Representing the available communication bandwidth of a neighboring LEO satellite k of the LEO satellite, B G Representing the communication bandwidth available to GEO satellites.
4. The adaptive traffic offload method for a high dynamic topology heaven-earth integration network of claim 3, wherein: during each period, the LEO satellite broadcasts Hello packets to its neighboring LEO satellites, including the buffer queue length, location parameters, motion parameters, and available communication bandwidth local to the LEO satellite.
5. The adaptive traffic offload method for a high dynamic topology heaven-earth integration network of claim 3, wherein: training the reinforcement learning model is realized in the following manner:
(b1) At the beginning of each cycle, the LEO satellite calculates a reward R based on the actual discharge delay of the traffic i The network state s in the last period is to be calculated i Traffic offload decision a i And a traffic offload bonus R i Storing in an experience data set as history data;
(b2) Within each period, LEO satellites are randomly sampled in a local empirical data setTraining a reinforcement learning model and updating model parameters by a plurality of samples;
(b3) Periodically polymerizing the reinforcement learning model by the GEO satellite;
(b4) The GEO satellite broadcasts the aggregated reinforcement learning network model parameters to LEO constellations;
(b5) And after the LEO satellite receives the aggregation model parameters of the GEO satellite, updating the local reinforcement learning model parameters.
6. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 5, wherein: the LEO satellite calculates a reward R according to the actual discharge delay of the flow i The method specifically comprises the following steps:
the reward function obtained by LEO satellite performing traffic offload decision is defined as
Wherein, kappa i Indicating traffic type, κ S Representing delay-sensitive traffic, κ N Indicating that the traffic is not delay sensitive,is an indication function;
for delay-sensitive traffic, the reward function for offloading traffic by LEO satellites is defined as
Wherein t is i Indicating delay in flow unloading, t max,i Representing a maximum tolerable delay for flow offloading, - Γ being a constant representing a loss due to flow offloading delay exceeding the maximum tolerable delay;
for non-delay sensitive traffic, the reward function for offloading traffic by LEO satellites is defined as
7. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 6, wherein: traffic offload decision a i The method comprises the following steps:
a i ={x 0 ,x 1 ,…,x k ,…,x K ,b 0 ,b 1 ,…,b k ,…,b k }
wherein x is 0 Representing data transmission to GEO satellite, x k Representing offloading data to neighboring LEO satellites k, b of the LEO satellite 0 Representing the communication bandwidth allocated for the link between the LEO satellite and the GEO satellite, b k Representing the communication bandwidth allocated for the link between the LEO satellite and the neighboring LEO satellite.
8. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 6, wherein: the transmission delay of one hop in the heaven-earth integrated network is calculated by the following formula:
wherein d is ij Represents the distance between node i and node j, c represents the speed of light, l i Indicating the size of the data packet, R ij Representing the transmission rate of the link between node i and node j, q i Representing the cache queue length of node i.
9. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 8, wherein: if a packet from a ground user terminal can be forwarded to a target node by one hop through the LEO satellite, traffic offload is delayed as
t i =t ul +t lu
Wherein t is ul Representing one-hop propagation delay from a user terminal to the LEO satellite, t lu Representing a one-hop transmission delay from the LEO satellite to a target user terminal;
if the data packet cannot be forwarded to the target node by one hop through the LEO satellite, but can be forwarded to the target node by two hops through the LEO satellite adjacent to the data packet, the traffic unloading delay is
t i =t ul +t ln +t nu
Wherein t is ln Representing one-hop propagation delay from the LEO satellite to its neighboring LEO satellites, t nu Representing a one-hop transmission delay from the LEO satellite adjacent satellite to the target user terminal;
if the data packet cannot be forwarded to the target node through the LEO satellite and LEO satellites adjacent to the LEO satellite, the LEO satellite forwards the data packet to the GEO satellite, and the traffic offload delay of the process is that
Wherein t is lg Representing one-hop propagation delay from the LEO satellite to the GEO satellite, t gs Representing one-hop propagation delay from GEO satellite to earth station, t su Representing the transmission delay from the earth station to the user terminal,represents the maximum buffer queue length, q, of a GEO satellite G Representing the buffer queue length of the GEO satellite.
10. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 9, wherein: the reinforcement learning model is based on a Q-learning model, and the updating method of model parameters is as follows:
wherein w represents a parameter of the Q-network, eta represents a learning rate, L Q (w) is a loss function calculated as follows:
where λ is the discount factor and Q (s, a; w) is the Q network.
11. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 5, wherein: the step (b 3) of periodically polymerizing the reinforcement learning model by the GEO satellite comprises the following specific steps:
at intervals, the GEO satellite randomly selects N from the LEO constellation C Each LEO satellite sends a model transmission request to the selected LEO satellite;
after the selected LEO satellite receives the request from the GEO satellite, the local reinforcement learning model parameters w are obtained c And number of samples n in training sample set c Uploading to a GEO satellite;
and the GEO satellites perform model aggregation according to the received LEO satellite reinforcement learning model parameters and the number of samples in the training sample set.
12. The adaptive traffic offload method for a high dynamic topology heaven and earth integration network of claim 11, wherein: model polymerization is carried out in the following manner:
wherein w is agg Is the parameters of the reinforced learning network model after aggregation.
CN202311484841.XA 2023-11-08 2023-11-08 Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network Pending CN117614507A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311484841.XA CN117614507A (en) 2023-11-08 2023-11-08 Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311484841.XA CN117614507A (en) 2023-11-08 2023-11-08 Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network

Publications (1)

Publication Number Publication Date
CN117614507A true CN117614507A (en) 2024-02-27

Family

ID=89955299

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311484841.XA Pending CN117614507A (en) 2023-11-08 2023-11-08 Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network

Country Status (1)

Country Link
CN (1) CN117614507A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117856877A (en) * 2024-03-06 2024-04-09 广东世炬网络科技有限公司 Data transfer method, device, equipment and storage medium in NTN (network time network)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117856877A (en) * 2024-03-06 2024-04-09 广东世炬网络科技有限公司 Data transfer method, device, equipment and storage medium in NTN (network time network)

Similar Documents

Publication Publication Date Title
CN113038387B (en) Switching judgment method based on Q learning in low orbit satellite network
CN111479306B (en) Q-learning-based flight ad hoc network QoS routing method
US6748233B1 (en) System and method for energy-efficient transmission power control, routing and transmission scheduling in wireless communication networks
US8509098B2 (en) Method and apparatus for identifying network connectivity changes in dynamic networks
CN117614507A (en) Self-adaptive flow unloading method of high-dynamic topology heaven-earth integrated network
CN112994776B (en) Gateway station rain attenuation resisting switching method and device suitable for high-throughput satellite communication
CN107135469B (en) A kind of distributed user cut-in method
CN114900225B (en) Civil aviation Internet service management and access resource allocation method based on low-orbit giant star base
CN105072676B (en) Aeronautical Ad hoc networks Poewr control method based on TDMA agreement
CN114650567A (en) Unmanned aerial vehicle-assisted V2I network task unloading method
Anh et al. A deep reinforcement learning approach for backscatter-assisted relay communications
Xu et al. Spatial location aided fully-distributed dynamic routing for large-scale LEO satellite networks
Han et al. Dynamic routing for software-defined LEO satellite networks based on ISL attributes
Rabjerg et al. Exploiting topology awareness for routing in LEO satellite constellations
Zuo et al. Deep reinforcement learning based load balancing routing for LEO satellite network
Hsu et al. A deep reinforcement learning based routing scheme for LEO satellite networks in 6G
CN107484207A (en) Combine topology control and channel distribution load-balancing method in wireless sensor network
GB2504813A (en) Distributed network scheduling of bandwidth resources using network situation awareness information
WO2001031945A1 (en) System and method for energy-efficient transmission power control, routing and transmission scheduling in wireless communication networks
Li et al. WSN10-5: adaptive multi-copy routing for intermittently connected mobile ad hoc networks
Liu et al. Energy-saving predictive video streaming with deep reinforcement learning
He et al. Load-balanced collaborative offloading for LEO satellite networks
US20230247484A1 (en) Adaptive load balancing in a satellite network
CN114390476A (en) Aviation ad hoc network routing method based on end-to-end quality parameters
Zhou et al. A congestion-aware handover scheme for LEO satellite networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination