CN115001611A - Resource allocation method of hopping beam satellite spectrum sharing system based on reinforcement learning - Google Patents

Resource allocation method of hopping beam satellite spectrum sharing system based on reinforcement learning Download PDF

Info

Publication number
CN115001611A
CN115001611A CN202210542515.9A CN202210542515A CN115001611A CN 115001611 A CN115001611 A CN 115001611A CN 202210542515 A CN202210542515 A CN 202210542515A CN 115001611 A CN115001611 A CN 115001611A
Authority
CN
China
Prior art keywords
satellite
leo
resource allocation
reinforcement learning
hopping
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210542515.9A
Other languages
Chinese (zh)
Other versions
CN115001611B (en
Inventor
任品毅
吴镇国
徐东阳
鲁磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Jiaotong University
Original Assignee
Xian Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Jiaotong University filed Critical Xian Jiaotong University
Priority to CN202210542515.9A priority Critical patent/CN115001611B/en
Publication of CN115001611A publication Critical patent/CN115001611A/en
Application granted granted Critical
Publication of CN115001611B publication Critical patent/CN115001611B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/382Monitoring; Testing of propagation channels for resource allocation, admission control or handover
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B1/00Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/69Spread spectrum techniques
    • H04B1/713Spread spectrum techniques using frequency hopping
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Electromagnetism (AREA)
  • Radio Relay Systems (AREA)

Abstract

The invention discloses a resource allocation method of a hopping beam satellite spectrum sharing system based on reinforcement learning, which comprises the following steps: solving an optimization problem based on reinforcement learning to obtain a Q value table, wherein the optimization problem is constructed by taking the throughput of the maximized LEO satellite system as a target; and performing resource allocation on each LEO user according to the Q value table, and completing the resource allocation of the beam hopping satellite spectrum sharing system based on reinforcement learning.

Description

Resource allocation method of hopping beam satellite spectrum sharing system based on reinforcement learning
Technical Field
The invention belongs to the field of a hopping beam satellite system, and relates to a resource allocation method of a hopping beam satellite spectrum sharing system based on reinforcement learning.
Background
With the development of satellite communication, competition among various countries for spectrum resources becomes a necessary trend. In order to meet the requirement of satellite communication on nonrenewable frequency spectrum resources, frequency spectrum sharing becomes a key field of academic research, satellites exchange ephemeris information to negotiate with each other, and a proper frequency spectrum sharing method is adopted to improve the frequency spectrum utilization rate. Most of the existing spectrum sharing methods set angles by using spatial isolation, reduce co-channel interference by switching beams to realize spectrum sharing, or limit the transmitting power of satellite beams by adaptive power control to further reduce co-channel interference to realize spectrum sharing, but the former needs to realize angle adjustment by optimizing a phased array antenna, has high cost and cannot be applied to a satellite system in a large scale, and the latter limits the transmitting power of satellite beams to have certain influence on the throughput of satellites.
In order to effectively utilize spectrum resources and realize spectrum sharing, the beam hopping technology becomes an applicable means. The beam hopping technology utilizes a small number of beams to perform time-sharing coverage, and based on the time slicing technology, only part of spot beams on a satellite are in a working state at a certain specific moment, so that the same frequency interference can be reduced from the angle of time isolation, and the resource can be fully utilized by the part of working spot beams.
In a hopping beam spectrum sharing system, the capacity of a satellite beam channel and the service condition of a user need to be considered comprehensively during resource allocation, so that the problem of resource allocation of the hopping beam spectrum sharing system is a complex and variable problem. However, the current resource allocation algorithm cannot adapt to a complex environment, has insufficient flexibility and has a certain influence on the throughput of the system.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a resource allocation method of a beam hopping satellite spectrum sharing system based on reinforcement learning, which can improve the throughput of an LEO satellite system.
In order to achieve the above object, the resource allocation method of the hopping beam satellite spectrum sharing system based on reinforcement learning according to the present invention comprises the following steps:
solving an optimization problem based on reinforcement learning to obtain a Q value table, wherein the optimization problem is constructed by taking the throughput of the maximized LEO satellite system as a target;
and performing resource allocation on each LEO user according to the Q value table to complete resource allocation of the hopping beam satellite spectrum sharing system based on reinforcement learning.
The method specifically comprises the following steps:
establishing an optimization problem with the aim of maximizing the throughput of the LEO satellite system;
calculating coordinates of the GEO satellite, the LEO satellite and the user in the geocentric geostationary coordinate system;
calculating an off-axis angle, antenna gain, channel gain and interference gain;
setting the signal-to-interference-and-noise ratios of LEO users and GEO users in different time slots when the LEO satellite works in a wave beam;
introducing reinforcement learning to solve the optimization problem, wherein a Q value, a return function and state information are initialized, an LEO user for resource allocation is selected according to a greedy strategy, the return function is calculated, the signal-to-interference-and-noise ratio is counted, the next state is observed, the Q value is updated according to a Q value updating formula, and repeated operation is continuously carried out on the next time slot until a Q value table tends to be converged;
and performing resource allocation on each LEO user according to the obtained Q value table to complete resource allocation of the hopping beam satellite spectrum sharing system based on reinforcement learning.
With the goal of maximizing throughput of the LEO satellite system, the established optimization problem is as follows:
maxR LEO
Figure BDA0003650808690000031
wherein ,γth For GEO user signal quality threshold, K is the number of clusters, P tot Is the satellite total power.
The antenna gain is:
Figure BDA0003650808690000032
wherein ,Gmax Maximum gain of the antenna, J 1 (. and J) 3 (. to) a first order Bessel function and a third order Bessel function, respectivelyThe root function, sin θ 3dB Is the 3dB angle of the antenna, and theta is the transmitting angle between the satellite transmitting antenna and the interfered user.
The channel gain is:
Figure BDA0003650808690000033
where L is the path loss.
The interference gain is:
Figure BDA0003650808690000034
wherein, L is path loss, θ is the transmitting angle between the satellite transmitting antenna and the interfered user, and γ is the receiving angle of the user in the direction of the interfering satellite.
The signal-to-interference-and-noise ratios of the LEO users and the GEO users are respectively as follows:
Figure BDA0003650808690000041
Figure BDA0003650808690000042
wherein ,
Figure BDA0003650808690000043
and
Figure BDA0003650808690000044
respectively the transmitting power of the LEO satellite and the transmitting power of the GEO satellite,
Figure BDA0003650808690000045
for the bandwidth overlap of LEO users and GEO users,
Figure BDA0003650808690000046
for the bandwidth overlap ratio of GEO users and LEO users,
Figure BDA0003650808690000047
an element of the working matrix for the LEO hop beam, indicating whether or not the LEO beam v is activated in time slot j,
Figure BDA0003650808690000048
and
Figure BDA0003650808690000049
is the power of additive white gaussian noise,
Figure BDA00036508086900000410
and
Figure BDA00036508086900000411
in order to obtain the gain of the channel,
Figure BDA00036508086900000412
and
Figure BDA00036508086900000413
is the interference gain.
The Q value updating formula is as follows:
Figure BDA00036508086900000414
wherein ,
Figure BDA00036508086900000415
as environmental status information, a j Cell selected for beam hopping service of LEO satellite in current time slot, R j+1 Is a function of the reward.
And acquiring a resource allocation decision scheme according to the Q value table, and then performing resource allocation on each LEO user according to the resource allocation decision scheme.
The invention has the following beneficial effects:
the resource allocation method of the hopping beam satellite spectrum sharing system based on reinforcement learning aims at maximizing the throughput of an LEO satellite system during specific operation, establishes an optimization problem, uses reinforcement learning, comprehensively considers the current service flow distribution condition and the satellite resource condition, and considers the internal connection of resource allocation, so that the throughput of the LEO satellite system is increased under the condition of ensuring the communication service quality of GEO users, the resources are fully utilized, and the resource utilization rate is improved.
Drawings
FIG. 1 is a schematic diagram of the system of the present invention;
FIG. 2 is a GEO user carrier-to-interference-plus-noise ratio diagram of the present invention;
FIG. 3 is a GEO user carrier-to-interference-plus-noise ratio diagram of a conventional resource allocation scheme;
FIG. 4 is a graph comparing throughput of the present invention with conventional resource allocation;
FIG. 5 is a graph illustrating the impact of reinforcement learning training times on throughput variability according to the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, not all of the embodiments, and are not intended to limit the scope of the present disclosure. Moreover, in the following description, descriptions of well-known structures and techniques are omitted so as to not unnecessarily obscure the concepts of the present disclosure. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
There is shown in the drawings a schematic block diagram of a disclosed embodiment in accordance with the invention. The figures are not drawn to scale, wherein certain details are exaggerated and possibly omitted for clarity of presentation. The shapes of various regions, layers and their relative sizes and positional relationships shown in the drawings are merely exemplary, and deviations may occur in practice due to manufacturing tolerances or technical limitations, and a person skilled in the art may additionally design regions/layers having different shapes, sizes, relative positions, according to actual needs.
Referring to fig. 1, there are three main devices in the system, namely, a GEO satellite, a LEO satellite and a cell, wherein the GEO satellite continuously performs data transmission on cells 1 to 7, and the LEO satellite serves 49 cells in a mobile process in a beam staring manner, wherein 7 cells and the GEO satellite repeatedly cover, and the LEO satellite exchanges ephemeris information with the GEO satellite to obtain a beam hopping resource allocation algorithm under the condition of acquiring spectrum arrangement of a beam of the GEO satellite, so as to reasonably design a beam hopping plan table; the terminals in the cell receive the requested service data according to the beam hopping schedule received by the satellite.
The resource allocation method of the hopping beam spectrum sharing system based on reinforcement learning comprises the following steps:
1) with the goal of maximizing LEO satellite system throughput at maximum, an optimization problem is established, i.e.
maxR LEO
Figure BDA0003650808690000061
wherein ,
Figure BDA0003650808690000062
for GEO user n at time slot j signal quality, γ th As GEO user signal quality threshold, T ij Allocating an indicator factor, T, to a time slot ij I.e. the resource allocation is obtained for user i in slot j, K is the number of clusters, P i j For the power of beam i in time slot j, P tot Is the total satellite power.
2) Let the coordinate of the GEO satellite in the Earth-centered-Earth-fixed coordinate system under the time slot j be c _ GEO j The coordinate of the LEO satellite in the geocentric geostationary coordinate system under the time slot j is c _ LEO j The coordinates of the GEO user u in the earth center-earth fixed coordinate system under the time slot j are
Figure BDA0003650808690000063
The coordinates of the LEO user v in the geocentric geostationary coordinate system under the time slot j are
Figure BDA0003650808690000064
3) Let the off-axis angle of the GEO satellite in the LEO user direction at time slot j be
Figure BDA0003650808690000065
The off-axis angle of the LEO user in the GEO satellite direction is
Figure BDA0003650808690000071
The off-axis angle of the LEO satellite in the GEO user direction at time slot j is
Figure BDA0003650808690000072
The off-axis angle of the GEO user in the direction of the LEO satellite is
Figure BDA0003650808690000073
wherein :
Figure BDA0003650808690000074
Figure BDA0003650808690000075
Figure BDA0003650808690000076
Figure BDA0003650808690000077
expressed by antenna gain
Figure BDA0003650808690000078
It is known that the off-axis angle affects the antenna gain, where G max Maximum gain of the antenna, J 1 (. and J) 3 (. cndot.) is a first order Bessel function and a third order Bessel function, sin θ 3dB At the 3dB angle of the antenna.
Let the interference gain from GEO satellite to LEO user v be
Figure BDA0003650808690000079
Interference gain from LEO satellite to GEO user u is
Figure BDA00036508086900000710
The channel gain from the GEO satellite to the GEO user u is
Figure BDA00036508086900000711
The channel gain from LEO satellite to LEO user v is
Figure BDA00036508086900000712
wherein :
Figure BDA00036508086900000713
Figure BDA00036508086900000714
Figure BDA00036508086900000715
Figure BDA00036508086900000716
wherein ,Lsg→el 、L sl→eg 、L sg→eg 、L sl→el Respectively the path loss from the GEO satellite to the LEO user, the path loss from the LEO satellite to the GEO user, the path loss from the GEO satellite to the GEO user, and the path loss from the LEO satellite to the LEO user,
Figure BDA0003650808690000081
is the angle between the pointing direction of the GEO transmitting antenna and the connecting line between the transmitting end and the corresponding receiving end of the GEO user, beta j For directing and transmitting LEO transmitting antennasAnd the connection included angle between the transmitting end and the receiving end corresponding to the LEO user.
4) Is provided with
Figure BDA0003650808690000082
And
Figure BDA0003650808690000083
LEO users and GEO users have signal-to-interference-and-noise ratios in different time slots when the LEO satellite works in all wave beams, wherein
Figure BDA0003650808690000084
Figure BDA0003650808690000085
wherein ,
Figure BDA0003650808690000086
and
Figure BDA0003650808690000087
are LEO satellite transmission power and GEO satellite transmission power,
Figure BDA0003650808690000088
for the bandwidth overlap ratio of the LEO users and GEO users,
Figure BDA0003650808690000089
for the bandwidth overlap ratio of GEO users and LEO users,
Figure BDA00036508086900000810
an element of the operation matrix for the LEO hop beam, indicating whether or not the LEO beam v is activated in time slot j,
Figure BDA00036508086900000811
and
Figure BDA00036508086900000812
is additive white gaussian noise power.
5) Is provided with
Figure BDA00036508086900000813
Setting an action a for the environmental status information j Cell selected for beam hopping service of LEO satellite in current time slot and method thereof
Figure BDA00036508086900000814
Correlation, let Q be Q(s) j ,a j ) Is shown in state s j Lower selection action a j The update rule of Q is as follows:
Figure BDA00036508086900000815
wherein ,Rj Is a return function;
Figure BDA00036508086900000816
wherein ,ra And r b Positive and negative numbers with large absolute value, gamma 0 And the value is the GEO user signal-to-interference-and-noise ratio threshold value, and the Q value, the return function value and the state information are 0 during initialization.
6) Selecting action a according to a greedy policy j The greedy policy is:
Figure BDA0003650808690000091
wherein epsilon is the probability of randomly selecting an action, and the LEO satellite serves the user to obtain a corresponding return value R j And updating the status information s j+1 And obtaining a Q value according to a Q value updating formula, and continuously repeating the operation of the next time slot until a complete Q value table is obtained by system convergence, thereby obtaining a resource allocation decision scheme under all time slots.
In summary, the specific process of the present invention is as follows:
1) establishing an optimization problem by taking the maximum throughput of the maximized LEO satellite system as a target;
2) calculating coordinates of the GEO satellite, the LEO satellite and the user in a geocentric geostationary coordinate system, and determining an off-axis angle, antenna gain, channel gain and interference gain of the satellite and the user;
3) setting the signal-to-interference-and-noise ratios of LEO users and GEO users in different time slots when the LEO satellite works in a wave beam;
4) initializing a Q value, a return function and state information, selecting an LEO user for resource allocation according to a greedy strategy, calculating the return function, counting a signal-to-interference-and-noise ratio, observing the next state, and updating according to a Q value updating formula;
5) continuing to repeat the operation of the next time slot until the Q value table tends to converge;
6) and acquiring a resource allocation decision scheme under each time slot according to the Q value table, and performing resource allocation on each LEO user to complete resource allocation of the hopping beam spectrum sharing system based on reinforcement learning.
Fig. 2 is a carrier-to-interference-and-noise ratio diagram of the GEO user in the present invention, and it can be seen that the present invention ensures the information transmission quality of the GEO user in the frequency sharing process, and the information transmission of the GEO user is not affected.
Fig. 3 is a chart of carrier-to-noise ratio of GEO users in the multi-beam satellite fixed allocation method, and it can be seen that, in the spectrum sharing process, the invention does not consider ensuring the information transmission quality of GEO users, so that the co-frequency interference of the LEO satellite affects the GEO users.
Fig. 4 is a comparison graph of throughput of LEO satellites according to the multi-beam satellite fixed allocation method of the present invention, and it can be seen that the overall throughput performance of the satellite system is better in the present invention, and the resource utilization rate is improved.
Fig. 5 is a diagram illustrating the influence of the training times on the fluctuation of throughput in the present invention, and it can be seen that as the training times increase, the variance of the fluctuation of throughput also tends to be stable, so that an appropriate training time can be selected to reduce the influence caused by randomness.

Claims (9)

1. A resource allocation method of a hopping beam satellite spectrum sharing system based on reinforcement learning is characterized by comprising the following steps:
solving an optimization problem based on reinforcement learning to obtain a Q value table, wherein the optimization problem is constructed by taking the throughput of the maximized LEO satellite system as a target;
and performing resource allocation on each LEO user according to the Q value table, and completing the resource allocation of the hopping beam satellite spectrum sharing system based on reinforcement learning.
2. The resource allocation algorithm of the reinforcement learning-based beam-hopping satellite spectrum sharing system according to claim 1, specifically comprising the following steps:
establishing an optimization problem with the aim of maximizing the throughput of the LEO satellite system;
calculating coordinates of the GEO satellite, the LEO satellite and the user in the geocentric geostationary coordinate system;
calculating an off-axis angle, antenna gain, channel gain and interference gain;
setting the signal-to-interference-and-noise ratios of LEO users and GEO users in different time slots when the LEO satellite works in a wave beam;
introducing reinforcement learning to solve the optimization problem, wherein a Q value, a return function and state information are initialized, an LEO user for resource allocation is selected according to a greedy strategy, the return function is calculated, the signal-to-interference-and-noise ratio is counted, the next state is observed, the Q value is updated according to a Q value updating formula, and repeated operation is continuously carried out on the next time slot until a Q value table tends to be converged;
and performing resource allocation on each LEO user according to the obtained Q value table to complete resource allocation of the hopping beam satellite spectrum sharing system based on reinforcement learning.
3. The resource allocation algorithm of the reinforcement learning-based beam-hopping satellite spectrum sharing system according to claim 1, wherein the optimization problem is established with the goal of maximizing the throughput of the LEO satellite system as follows:
max R LEO
Figure FDA0003650808680000021
wherein ,γth Is GEO user signal quality threshold, K is cluster number, P tot Is the total satellite power.
4. The reinforcement learning-based resource allocation algorithm for the frequency spectrum sharing system of the beam hopping satellite according to claim 1, wherein the antenna gain is:
Figure FDA0003650808680000022
wherein ,Gmax Maximum gain of the antenna, J 1 (. o) and J 3 (. cndot.) is a first order Bessel function and a third order Bessel function, sin θ 3dB Is the 3dB angle of the antenna, and theta is the transmission angle between the satellite transmitting antenna and the interfered user.
5. The reinforcement learning-based resource allocation algorithm for the frequency spectrum sharing system of the beam hopping satellite, according to claim 4, wherein the channel gain is:
Figure FDA0003650808680000023
where L is the path loss.
6. The resource allocation algorithm of the reinforcement learning-based beam-hopping satellite spectrum sharing system according to claim 4, wherein the interference gain is:
Figure FDA0003650808680000024
wherein, L is path loss, θ is the transmitting angle between the satellite transmitting antenna and the interfered user, and γ is the receiving angle of the user in the direction of the interfering satellite.
7. The resource allocation algorithm of the reinforcement learning-based beam-hopping satellite spectrum sharing system according to claim 1, wherein the signal-to-interference-and-noise ratios of the LEO users and the GEO users are respectively:
Figure FDA0003650808680000031
Figure FDA0003650808680000032
wherein ,
Figure FDA0003650808680000033
and
Figure FDA0003650808680000034
respectively the transmitting power of the LEO satellite and the transmitting power of the GEO satellite,
Figure FDA0003650808680000035
for the bandwidth overlap ratio of the LEO users and GEO users,
Figure FDA0003650808680000036
for the bandwidth overlap ratio of GEO users and LEO users,
Figure FDA0003650808680000037
an element of the operation matrix for the LEO hop beam, indicating whether or not the LEO beam v is activated in time slot j,
Figure FDA0003650808680000038
and
Figure FDA0003650808680000039
in order to be an additive white gaussian noise power,
Figure FDA00036508086800000310
and
Figure FDA00036508086800000311
in order to obtain the gain of the channel,
Figure FDA00036508086800000312
and
Figure FDA00036508086800000313
is the interference gain.
8. The resource allocation algorithm of the reinforcement learning-based beam-hopping satellite spectrum sharing system according to claim 1, wherein the Q value is updated according to the formula:
Figure FDA00036508086800000314
wherein ,
Figure FDA00036508086800000315
as environmental status information, a j Cell selected for beam hopping service of LEO satellite in current time slot, R j+1 Is a function of the reward.
9. The reinforcement learning-based resource allocation algorithm for the frequency spectrum sharing system of the beam hopping satellite, according to claim 1, wherein a resource allocation decision scheme is obtained according to the Q-value table, and then resource allocation is performed for each LEO user according to the resource allocation decision scheme.
CN202210542515.9A 2022-05-18 2022-05-18 Resource allocation method of beam hopping satellite spectrum sharing system based on reinforcement learning Active CN115001611B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210542515.9A CN115001611B (en) 2022-05-18 2022-05-18 Resource allocation method of beam hopping satellite spectrum sharing system based on reinforcement learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210542515.9A CN115001611B (en) 2022-05-18 2022-05-18 Resource allocation method of beam hopping satellite spectrum sharing system based on reinforcement learning

Publications (2)

Publication Number Publication Date
CN115001611A true CN115001611A (en) 2022-09-02
CN115001611B CN115001611B (en) 2023-09-26

Family

ID=83026716

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210542515.9A Active CN115001611B (en) 2022-05-18 2022-05-18 Resource allocation method of beam hopping satellite spectrum sharing system based on reinforcement learning

Country Status (1)

Country Link
CN (1) CN115001611B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111970047A (en) * 2020-08-25 2020-11-20 桂林电子科技大学 LEO satellite channel allocation method based on reinforcement learning
US20220067063A1 (en) * 2020-08-27 2022-03-03 Industry-Academic Cooperation Foundation, Yonsei University Apparatus and method for adaptively managing sharded blockchain network based on deep q network
CN114189939A (en) * 2021-12-14 2022-03-15 清华大学 Dynamic beam-hopping and wave-band bandwidth allocation method based on multi-agent reinforcement learning
CN114389678A (en) * 2022-01-12 2022-04-22 北京邮电大学 Multi-beam satellite resource allocation method based on decision performance evaluation
CN114499629A (en) * 2021-12-24 2022-05-13 南京邮电大学 Dynamic resource allocation method for beam-hopping satellite system based on deep reinforcement learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111970047A (en) * 2020-08-25 2020-11-20 桂林电子科技大学 LEO satellite channel allocation method based on reinforcement learning
US20220067063A1 (en) * 2020-08-27 2022-03-03 Industry-Academic Cooperation Foundation, Yonsei University Apparatus and method for adaptively managing sharded blockchain network based on deep q network
CN114189939A (en) * 2021-12-14 2022-03-15 清华大学 Dynamic beam-hopping and wave-band bandwidth allocation method based on multi-agent reinforcement learning
CN114499629A (en) * 2021-12-24 2022-05-13 南京邮电大学 Dynamic resource allocation method for beam-hopping satellite system based on deep reinforcement learning
CN114389678A (en) * 2022-01-12 2022-04-22 北京邮电大学 Multi-beam satellite resource allocation method based on decision performance evaluation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张沛等: "基于深度增强学习和多目标优化改进的卫星资源分配算法", 《通信学报》, no. 06 *

Also Published As

Publication number Publication date
CN115001611B (en) 2023-09-26

Similar Documents

Publication Publication Date Title
CN108882245B (en) GEO and LEO cognitive satellite network and dynamic frequency allocation method thereof
CN112134614B (en) Downlink carrier resource allocation method and system for multi-beam communication satellite
Patil et al. Hybrid compression and message-sharing strategy for the downlink cloud radio-access network
CN111417126B (en) On-demand coverage method and device for space-sky-ground cooperative marine communication network
CN111416648A (en) Multi-beam adaptive management method and device for low-earth-orbit satellite system
CN107835528A (en) The resource allocation methods avoided in the ground integrated network of star based on interference
CN108449150A (en) Star based on radio map information ground disturbance coordination method, device and equipment
CN110048755A (en) A kind of millimetre-wave attenuator method and communication system
CN108391316B (en) Wave beam distribution method
Wang et al. Spectrum optimization for cognitive satellite communications with cournot game model
Takahashi et al. Adaptive multi-beam arrangement for improving throughput in an HTS communication system
Wang et al. Dynamic beam hopping of multi-beam satellite based on genetic algorithm
CN111417125B (en) Method and device for optimizing capacity of space-air-ground cooperative marine communication network
CN115474281A (en) Resource allocation method in satellite-ground converged network
CN115441939A (en) Multi-beam satellite communication system resource allocation method based on MADDPG algorithm
CN113873658B (en) Method for allocating beam hopping resources by taking user service weight gain as objective function
Sun et al. Edge intelligence assisted resource management for satellite communication
CN115001611A (en) Resource allocation method of hopping beam satellite spectrum sharing system based on reinforcement learning
Leng et al. User-level scheduling and resource allocation for multi-beam satellite systems with full frequency reuse
Yuan et al. Joint Beam Direction Control and Radio Resource Allocation in Dynamic Multi-beam LEO Satellite Networks
CN113541768B (en) NOMA-based LEO satellite communication system frequency point distribution method
Lyu et al. Beam position and beam hopping design for LEO satellite communications
Khan et al. Reconfigurable Intelligent Surfaces for 6G Non-Terrestrial Networks: Assisting Connectivity from the Sky
WO2019061294A1 (en) Access point device and communication method
Hu et al. Efficient seamless coverage of high throughput satellites with irregular coverage shapes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant