WO2023143200A1 - Beam switching method and apparatus, and processor readable storage medium - Google Patents

Beam switching method and apparatus, and processor readable storage medium Download PDF

Info

Publication number
WO2023143200A1
WO2023143200A1 PCT/CN2023/072416 CN2023072416W WO2023143200A1 WO 2023143200 A1 WO2023143200 A1 WO 2023143200A1 CN 2023072416 W CN2023072416 W CN 2023072416W WO 2023143200 A1 WO2023143200 A1 WO 2023143200A1
Authority
WO
WIPO (PCT)
Prior art keywords
state vector
beam switching
beams
loss function
quality information
Prior art date
Application number
PCT/CN2023/072416
Other languages
French (fr)
Chinese (zh)
Inventor
索士强
秦海超
黄秋萍
苏昕
Original Assignee
大唐移动通信设备有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 大唐移动通信设备有限公司 filed Critical 大唐移动通信设备有限公司
Publication of WO2023143200A1 publication Critical patent/WO2023143200A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W36/00Hand-off or reselection arrangements
    • H04W36/24Reselection being triggered by specific parameters
    • H04W36/30Reselection being triggered by specific parameters by measured or perceived connection quality data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/309Measuring or estimating channel quality parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/309Measuring or estimating channel quality parameters
    • H04B17/318Received signal strength
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/309Measuring or estimating channel quality parameters
    • H04B17/318Received signal strength
    • H04B17/327Received signal code power [RSCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/309Measuring or estimating channel quality parameters
    • H04B17/336Signal-to-interference ratio [SIR] or carrier-to-interference ratio [CIR]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/391Modelling the propagation channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W36/00Hand-off or reselection arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W36/00Hand-off or reselection arrangements
    • H04W36/0005Control or signalling for completing the hand-off
    • H04W36/0083Determination of parameters used for hand-off, e.g. generation or modification of neighbour cell lists
    • H04W36/0085Hand-off measurements
    • H04W36/0094Definition of hand-off measurement parameters
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the present disclosure relates to the field of wireless communication technologies, and in particular, the present disclosure relates to a beam switching method, device, and processor-readable storage medium.
  • the channel between the network node and UE may change due to obstacles or rotational movement of the UE, resulting in reduced beam quality between the network node and UE or quality of other beams than this Better beam quality; in this case, beam switching is required between network nodes and UEs for better system performance.
  • the fixed handover threshold is small, although better system performance can be maintained, it will lead to frequent handover, even unnecessary handover, resulting in increased load overhead; if the fixed handover threshold is large , although the frequency of handover can be reduced, some handovers cannot be triggered in time, resulting in the failure to guarantee the performance of the system.
  • the present disclosure proposes a beam switching method, device, and processor-readable storage medium, so as to solve at least one aspect of the above-mentioned technical defects to a certain extent.
  • a beam switching method performed by a first network node, including:
  • the first quality information respectively corresponding to the beams includes the first quality information sent by at least one user equipment UE to the first network node;
  • the first average channel quality corresponding to a plurality of first beams, and the second average channel quality corresponding to a plurality of second beams in at least one second time period determine the first state vector; at least one second time period in the first before the time period;
  • obtaining first quality information respectively corresponding to a plurality of first beams within a first time period includes:
  • the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  • the first state vector is determined based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period, including:
  • a first state vector is obtained.
  • determining a beam switching threshold corresponding to at least one UE includes:
  • the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  • the first relationship model uses the first index change of at least one UE to The corresponding first parameter and the second parameter corresponding to the beam switching cost are determined.
  • the first relational model is trained by:
  • the relational model is trained, at least including:
  • the model parameters of the relational model are updated to obtain the updated relational model.
  • the relational model is trained, further comprising:
  • the model parameters of the updated relational model are updated to obtain the updated relational model.
  • the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model it further includes:
  • the termination condition is one of the following:
  • the loss function value is less than or equal to the loss function value threshold; or,
  • the loss function value is greater than or equal to the loss function value threshold.
  • the first quality information sent by at least one UE to the first network node is sent to the second network node.
  • a beam switching method is provided, which is performed by a UE, including:
  • the corresponding first quality information, and the average channel quality corresponding to multiple beams in at least one second time period determine the beam switching threshold corresponding to the UE;
  • a beam switching device which is applied to a first network node, and includes a memory, a transceiver, and a processor:
  • transceiver configured to send and receive data under the control of the processor
  • a processor that reads a computer program in memory and does the following:
  • first quality information respectively corresponding to multiple first beams within a first time period Acquiring first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes first quality information sent to the first network node by at least one user equipment UE;
  • the first average channel quality corresponding to a plurality of first beams, and the second average channel quality corresponding to a plurality of second beams in at least one second time period determine the first state vector; at least one second time period in the first before the time period;
  • a beam switching device which is applied to a UE, including a memory, a transceiver, and a processor:
  • transceiver configured to send and receive data under the control of the processor
  • a processor that reads a computer program in memory and does the following:
  • the corresponding first quality information, and the average channel quality corresponding to multiple beams in at least one second time period determine the beam switching threshold corresponding to the UE;
  • the present disclosure provides a beam switching device applied to a first network node, including:
  • the first processing unit is configured to acquire first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes at least one user equipment UE sent to the first network node the first quality information of
  • the second processing unit is configured to determine the first average channel quality corresponding to the multiple first beams based on the first quality information respectively corresponding to the multiple first beams;
  • a third processing unit configured to determine a first state vector based on a first average channel quality corresponding to a plurality of first beams, and a second average channel quality corresponding to a plurality of second beams in at least one second time period; at least one the second time period is before the first time period;
  • the fourth processing unit is configured to determine a beam switching threshold corresponding to at least one UE based on the first state vector.
  • the present disclosure provides a beam switching device applied to a UE, including:
  • the fifth processing unit is configured to send the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams within the first time period , and based on the first quality information respectively corresponding to the multiple beams, and the average channel quality corresponding to the multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
  • the sixth processing unit is configured to receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
  • a processor-readable storage medium stores a computer program, and when the computer program is executed by a processor, the methods described in the first aspect and the second aspect are implemented.
  • a dynamic adjustment of the beam switching threshold corresponding to at least one UE is realized, so that an appropriate beam switching frequency can be guaranteed while system performance can be maintained.
  • FIG. 1 is a schematic diagram of a system architecture provided by an embodiment of the present disclosure
  • FIG. 2 is a schematic flowchart of a beam switching method provided by an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of beam switching provided by an embodiment of the present disclosure.
  • FIG. 4 is a schematic flowchart of another beam switching method provided by an embodiment of the present disclosure.
  • FIG. 5 is a schematic flow diagram of reinforcement learning provided by an embodiment of the present disclosure.
  • FIG. 6 is a schematic flowchart of another beam switching method provided by an embodiment of the present disclosure.
  • FIG. 7 is a schematic flowchart of another beam switching method provided by an embodiment of the present disclosure.
  • FIG. 8 is a schematic diagram of SINR statistics provided by an embodiment of the present disclosure.
  • FIG. 9 is a schematic diagram of SINR statistics provided by an embodiment of the present disclosure.
  • FIG. 10 is a schematic diagram of counting the number of handovers provided by an embodiment of the present disclosure.
  • FIG. 11 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure.
  • FIG. 12 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure.
  • FIG. 13 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure.
  • Fig. 14 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure.
  • Determine B based on A in this application means that the factor A should be considered when determining B. It is not limited to “B can be determined only based on A”, but also includes: “Determine B based on A and C", “Determine B based on A, C and E", "determine C based on A, further determine B based on C” wait. In addition, it may also include A as the condition for determining B, for example, "when A meets the first condition, use the first method to determine B"; another example, "when A meets the second condition, determine B”; another example , "when A satisfies the third condition, determine B based on the first parameter” and so on. Of course, it may also be a condition that takes A as a factor for determining B, for example, "when A satisfies the first condition, use the first method to determine C, and further determine B based on C" and so on.
  • Determine B based on A in this application means that the factor A should be considered when determining B. It is not limited to “B can be determined based on A only”, but also includes: “B is determined based on A and C", “B is determined based on A, C and E", "C is determined based on A, and B is further determined based on C" wait. In addition, it may also include A as the condition for determining B, for example, "when A meets the first condition, use the first method to determine B"; another example, "when A meets the second condition, determine B”; another example , "when A satisfies the third condition, determine B based on the first parameter” and so on. Of course, it may also be a condition that takes A as a factor for determining B, for example, "when A satisfies the first condition, use the first method to determine C, and further determine B based on C" and so on.
  • millimeter wave technology has become a key technology in 5G (5th Generation Mobile Communication Technology, fifth generation mobile communication technology). Due to the high-frequency characteristics of millimeter waves, their links may suffer severe path loss.
  • millimeter wave communication systems need to use beamforming-based Massive MIMO (Multiple-input Multiple-output, large-scale multiple-input More technology) technology.
  • Massive MIMO Multiple-input Multiple-output, large-scale multiple-input More technology
  • Beamforming technology can effectively improve the spectral efficiency of mobile communication systems.
  • the coverage of each beam is limited, and beam switching is required to maintain good system performance.
  • the serving beam will no longer be suitable for the UE, and beam switching is required, or there is an obstacle between the network node and the UE that causes the quality of the current serving beam to drop suddenly, and beam switching is also required.
  • network nodes such as millimeter-wave base stations have a smaller coverage area, so the deployment density of millimeter-wave base stations is higher than that of LTE (Long Term Evolution, long-term evolution) base stations.
  • LTE Long Term Evolution, long-term evolution
  • beam switching occurs not only between mmWave base stations, but also between beams within the same mmWave base station.
  • the dense deployment of mmWave base stations and the use of beams will further increase the frequency of beam switching, which may limit the performance of practical systems, so the beam switching process needs to be optimized.
  • beam scanning and reporting are performed periodically. Based on the results of beam reporting, when the quality of the current serving beam fluctuates, beam switching can be performed to obtain better communication performance.
  • switching parameters can be optimized to improve system performance and reduce the probability of beam failure, while reducing the number of beam switching times to reduce system load overhead.
  • the beam switching based on the measurement report includes two parts of switching: one part is beam switching across base stations, and the other part is beam switching inside the base station. According to actual simulation statistics, most beam switching occurs between different base stations.
  • the UE accesses a network node, such as an AP (Access Point, access point), the beam scanning is performed periodically, and after the UE performs beam measurement on all beams, it saves the best N beam sequence numbers and its quality, and obtain candidate beams, the UE feeds back relevant beam information (such as SINR (Signal to Interference plus Noise Ratio, signal to interference and noise ratio), etc.) to the AP, and the AP determines whether the beam should be switched according to the fixed threshold determination method judge.
  • a network node such as an AP (Access Point, access point)
  • SINR Signal to Interference plus Noise Ratio, signal to interference and noise ratio
  • HM Hysteresis Margin, hysteresis threshold
  • the handover judgment process includes: the AP judges whether the candidate beam enters the beam handover judgment process based on the reported beam quality. If the condition of SINR serve + ⁇ SINR candidate is met, the handover judgment is entered.
  • the judgment process needs to last for TTT time; After the switching judgment process, if the condition of SINR serve + ⁇ SINR candidate is satisfied within the TTT time, beam switching will occur after the judgment ends, and the serving beam will be switched to the candidate beam; if the above conditions are not satisfied at a certain moment, the judgment will be interrupted process, this judgment ends, and beam switching does not occur.
  • the handover decision process select the beam with the largest SINR in the measurement results of each UE in the beam report as the candidate beam and add it to the candidate beam set; The larger beam is the candidate beam, and it will be pushed forward accordingly. If the candidate beam satisfies the requirement of the threshold, the handover decision process is triggered. During the determination process of beam switching, the candidate beam will not be selected by other users, and at the same time, the user will not select other beams for switching determination until a beam switching determination ends.
  • FIG. 1 A schematic diagram of a system architecture provided by an embodiment of the present disclosure is shown in FIG. 1 .
  • the system architecture includes: UEs and network nodes, wherein UEs are, for example, UE110, UE111, UE112, and UE113 in FIG. 1 , and network nodes are, for example, UEs in FIG. Network node 120 and network node 121.
  • the network nodes are deployed in the access network, for example, the network nodes are deployed in the access network NG-RAN (New Generation-Radio Access Network, New Generation Radio Access Network) in the 5G system.
  • the UE and the network node communicate with each other through a certain air interface technology, for example, they may communicate with each other through a cellular technology.
  • the UE involved in the embodiments of the present disclosure may be a device that provides voice and/or data connectivity to a user, a handheld device with a wireless connection function, or other processing devices connected to a wireless modem.
  • Types of UEs include mobile phones, vehicle user terminals, tablets, laptops, personal digital assistants, mobile Internet devices, wearable devices, and the like.
  • the network node involved in this embodiment of the present disclosure may be a base station, and the base station may include multiple cells that provide services for a terminal.
  • the base station can also be called an access point AP, or it can be a device in the access network that communicates with wireless terminal devices through one or more sectors on the air interface, or other names.
  • the network node is operable to interchange received over-the-air frames with Internet Protocol (IP) packets, acting as a router between the wireless terminal device and the rest of the access network, which may include the Internet Protocol (IP) communication network.
  • IP Internet Protocol
  • the network nodes may also coordinate attribute management for the air interface.
  • the network node involved in the embodiments of the present disclosure may be a network device (Base Transceiver Station, BTS) in Global System for Mobile communications (GSM) or Code Division Multiple Access (Code Division Multiple Access, CDMA) ), it can also be a network device (NodeB) in Wide-band Code Division Multiple Access (WCDMA), or it can be an evolved network device in a long term evolution (long term evolution, LTE) system (evolutional Node B, eNB or e-NodeB), 5G base station (gNB) in the 5G network architecture (next generation system), B5G base station in the super 5th generation mobile communication system (B5G), 6G (6th Generation Mobile Communication Technology, The 6G base station in the network architecture of the sixth generation mobile communication technology can also be a home The home evolved base station (Home evolved Node B, HeNB), relay node (relay node), home base station (femto), pico base station (pico), millimeter wave base station, etc.
  • BTS Base Transceiver
  • network nodes may include centralized unit (centralized unit, CU) nodes and distributed unit (distributed unit, DU) nodes, and the centralized unit and distributed unit may also be arranged geographically separately.
  • centralized unit centralized unit, CU
  • distributed unit distributed unit
  • An embodiment of the present disclosure provides a beam switching method, which is executed by the first network node.
  • the flowchart of the method is shown in FIG. 2 , and the method includes:
  • the first network node acquires first quality information respectively corresponding to multiple beams of at least one second network node within the first time period; and receives the first quality information sent by at least one UE.
  • the first quality information may include at least one of SINR, RSRP (Reference Signal Receiving Power, received power), and RSRQ (Reference Signal Receiving Quality, received quality).
  • the first network node obtains first quality information corresponding to 20 first beams in the first time period, that is, in the first time period, the first network node obtains 20 first quality information, and the 20 first beams
  • the piece of quality information includes 10 pieces of first quality information sent by one or more second network nodes to the first network node, and 10 pieces of first quality information sent by the 10 UEs to the first network node respectively.
  • the first network node obtains 20 pieces of first quality information, the first quality information is SINR, and the 20 pieces of first quality information are 20 SINRs, then the average value of these 20 SINRs is calculated, and the The average value of 20 SINRs is used as the first average channel quality.
  • S203 Determine a first state vector based on the first average channel quality corresponding to multiple first beams and the second average channel quality corresponding to multiple second beams in at least one second time period; at least one second time period is in Before the first time period.
  • the sliding window includes N time windows, and these N time windows are respectively the first time window, the second time window, the third time window, ... the Nth time window, the Nth time window
  • the time lengths of one time window, the second time window, the third time window, ... the Nth time window can be preset, and N is a positive integer.
  • the first time period is the first time window, and the first time window is the current window; multiple second time periods can be respectively the second time window, the third time window, ... the Nth time window, wherein, The second time window, the third time window, ...
  • the Nth time window are all historical windows; the average channel quality of the first time window can be the first average channel quality, and the average channel quality of the second time window can be the first The average channel quality of the second time window, the average channel quality of the third time window can be the second average channel quality, ...
  • the average channel quality of the Nth time window can be the second average channel quality; the average channel quality of the second time window, the third The average channel quality of the time window, ... the average channel quality of the Nth time window can be different; the average channel quality of the first time window can be expressed by SINR 1 , the average channel quality of the second time window, and the average channel quality of the third time window
  • the quality, ... the average channel quality of the Nth time window can be represented by SINR 2 , SINR 3 , ... SINR N respectively.
  • the beam switching threshold may include TTT, HM, and the like.
  • the beam switching threshold corresponding to at least one UE is dynamically adjusted, so that while maintaining system performance, an appropriate beam switching frequency can also be guaranteed.
  • obtaining first quality information respectively corresponding to a plurality of first beams within a first time period includes:
  • the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  • the first network node can obtain at least one second The multiple beams of the network node respectively correspond to the first quality information; and receive the first quality information sent by at least one UE.
  • the first state vector is determined based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period, including:
  • a first state vector is obtained.
  • the value of the first information vector and the value of the second information vector are both discrete values
  • the value of the first information vector and the value of the second information vector can be used to represent the signal strength level
  • the signal strength level can be 1 or 2 , 3, ... m, etc., wherein, m is a positive integer.
  • the first information vector is b 1
  • multiple second information vectors can be b 2 , b 3 , ... b n respectively
  • the first state vector is (b 1 , b 2 , b 3 , ... b n ) .
  • determining a beam switching threshold corresponding to at least one UE includes:
  • the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  • the first relationship model may be a table, and the table may be used to characterize the relationship among the first state vector, the reward value, and the beam switching threshold.
  • the reward value may be used to represent the effect of the UE performing beam switching based on the UE's corresponding beam switching threshold.
  • the maximum reward value can be used to represent the best effect of the UE performing beam switching based on the UE's corresponding beam switching threshold.
  • the first relationship model is determined by a first parameter corresponding to a first index change of at least one UE, and a second parameter corresponding to a beam switching cost.
  • the first index may include at least one of channel capacity, signal-to-noise ratio, and block error rate BLER.
  • the first relational model may be a table, and the table may be determined by using a first parameter corresponding to a channel capacity change of at least one UE and a second parameter corresponding to a beam switching cost.
  • the first parameter corresponding to the channel capacity change of at least one UE may be
  • the second parameter corresponding to the beam switching cost may be ⁇ c *H n , where, Indicates that the beam switching threshold corresponding to the state vector results in change in capacity, represents the average value of channel capacity C of at least one UE, ⁇ c is a preset penalty factor, and H n is an average number of handovers.
  • the first relational model can be trained by reinforcement learning.
  • the first relational model is trained by:
  • the relational model is trained, at least including:
  • the model parameters of the relational model are updated to obtain the updated relational model.
  • the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model it further includes:
  • the termination condition is one of the following:
  • the loss function value is less than or equal to the loss function value threshold; or,
  • the loss function value is greater than or equal to the loss function value threshold.
  • the second state vector in the training sample set is input to the relational model, and the beam switching threshold corresponding to the second state vector is determined; based on the beam switching threshold corresponding to the second state vector, Through the reward function, the reward value corresponding to the second state vector is obtained; the second state vector and the reward value corresponding to the second state vector are substituted into the loss function corresponding to the relational model to obtain the loss function value, and based on the loss function value, the relational model Model parameters are updated. Until the termination condition is reached, the relational model when the termination condition is reached is taken as the first relational model.
  • the relational model is trained, further comprising:
  • the model parameters of the updated relational model are updated to obtain the updated relational model.
  • the second state vector is used to represent each training Average channel quality across multiple beams over time period.
  • the reward value corresponding to the second state vector is obtained through the reward function; wherein, the reward function is shown in formula (1):
  • the beam switching threshold corresponding to the second state vector results in change in capacity represents the average value of channel capacity C of at least one UE, ⁇ c is a preset penalty factor, and H n is an average number of handovers.
  • the first relational model can be deployed in a network node.
  • the first quality information sent by at least one UE to the first network node is sent to the second network node.
  • the first network node may share the first quality information as side information with other network nodes (for example, one or more second network nodes); wherein, the side information may be Information shared between network nodes.
  • An embodiment of the present disclosure provides another beam switching method, which is performed by the UE.
  • the flowchart of the method is shown in FIG. 4 , and the method includes:
  • S402. Receive a beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
  • beam switching may be triggered based on the beam switching threshold, so that the UE switches from the serving beam to other beams. beam.
  • the beam switching threshold corresponding to the UE is dynamically adjusted, so that while maintaining system performance, an appropriate beam switching frequency can also be ensured.
  • reinforcement learning in the beam switching scenario is provided.
  • the flow diagram of the reinforcement learning is shown in FIG. 5 , including:
  • the beam switching threshold may include TTT, HM, and the like.
  • each state vector in the training sample set can be used to represent the average channel quality of multiple beams within a training period.
  • a reward function as shown in formula (1) is set.
  • the trained relational model is the first relational model.
  • the relational model is trained.
  • the relational model can be a table, and the table can be a Q-table (Q table) in the Q-learning algorithm, and the table can be used to represent the state vector, award The relationship between the excitation value and the beam switching threshold; input the state vector in the training sample set to the table, and determine the beam switching threshold corresponding to the state vector; based on the beam switching threshold corresponding to the state vector, through the reward function, get the corresponding Reward value; Substitute the state vector and the reward value corresponding to the state vector into the loss function corresponding to the table to obtain the loss function value, and update the model parameters of the table based on the loss function value.
  • Q table Q-table
  • the termination condition input the state vector in the training sample set to the table, and determine the beam switching threshold corresponding to the state vector; based on the beam switching threshold corresponding to the state vector, obtain the reward value corresponding to the state vector through the reward function; The reward value corresponding to the state vector is substituted into the loss function corresponding to the table to obtain the loss function value, and based on the loss function value, the model parameters of the table are updated.
  • the table when the termination condition is reached is used as the first table (relational model after training); wherein, the termination condition can be that the loss function value is less than or equal to the loss function value threshold.
  • the trained relationship model may be the first table, and the first table is deployed in the network nodes.
  • FIG. 6 Another beam switching method is provided in the embodiment of the present disclosure.
  • the schematic flowchart of the method is shown in FIG. 6 , and the method includes:
  • the network node sends a reference signal for beam scanning to the UE.
  • the reference signal may be an SSB (Synchronization Signal Block, synchronization signal block).
  • SSB Synchronization Signal Block, synchronization signal block
  • the UE performs beam measurement to obtain quality information corresponding to the serving beam.
  • the UE feeds back the quality information corresponding to the serving beam and the quality information corresponding to the candidate beam to the network node.
  • the network node uses the quality information corresponding to the serving beam as side information, and shares the side information with other network nodes.
  • step S604 is not required; the quality information corresponding to the serving beam may be SINR, RSRP or RSRQ.
  • the network node determines a state vector based on all side information, and searches for a beam switching threshold corresponding to the UE based on a relationship model obtained through reinforcement learning training.
  • the network node stores all edge information, and all edge information is the edge corresponding to the serving beam information, and side information sent by other network nodes to network nodes.
  • the network node makes a beam switching decision based on the beam switching threshold; if the network node determines that the UE switches from the serving beam to other beams, the network node instructs the UE to switch from the serving beam to other beams, and proceeds to step S607 for execution.
  • other beams may be candidate beams.
  • the UE performs beam switching.
  • the embodiment of the present disclosure provides yet another beam switching method, which is applied in a multi-AP millimeter wave network scenario, and the topology of the scenario is: in a scenario with a radius of 30m, 3 APs and 7 users are evenly deployed; Among them, 3 of the 7 users move along a certain path at a speed of 5 km/h (kilometer/hour), and each of the 3 APs has 32 beams, using hybrid beamforming technology, The cycle of beam scanning is set to 10 ms, and the UE performs beam switching when it meets the conditions for beam switching.
  • a schematic flow chart of the method is shown in Figure 7, and the method includes:
  • the network node determines a beam switching threshold through a greedy strategy.
  • the UE performs beam switching based on the beam switching threshold.
  • the network node determines the second state vector through the sliding window.
  • the sliding window includes 5 time windows, that is, the length of the sliding window is 5, and these 5 time windows are respectively the first time window, the second time window, the third time window, the fourth time window and
  • the time length of the first time window is 10ms
  • the time lengths of the second time window, the third time window, the fourth time window and the fifth time window are all 40ms.
  • the network node updates the Q table through the reward function.
  • the reinforcement learning training is performed according to the reward function shown in formula (1), that is, the training of the relationship model is performed, and the relationship model is a Q table.
  • the penalty factor ⁇ c in the reward function can be set to 0.75.
  • the network node deploys the updated Q table to the network node; respectively go to step S701 and S706 are executed.
  • the network node determines the first state vector, and obtains the beam switching threshold by looking up the Q table.
  • the first state vector may be (b 1 , b 2 , b 3 , . . . b n ).
  • the values of b 1 , b 2 , b 3 ,...b n can be used to characterize the signal strength level, and the signal strength level can be 1, 2, 3,...m, etc., for example, the signal strength level can be 1, 2, 3 , 4 and 5, that is, there are 5 levels of signal strength.
  • the network node performs a beam switching decision.
  • the network node performs periodic beam scanning.
  • the UE performs beam measurement.
  • the UE feeds back the quality information corresponding to the beam; go to step S706 for execution.
  • the quality information may be SINR, RSRP or RSRQ.
  • steps S701-S705 are steps in reinforcement learning training, and the update interval T of each reinforcement learning training may be 10 ms (milliseconds); steps S706-S710 are steps in online execution.
  • Figure 9 is an enlarged view of Figure 8
  • the CDF Cumulative Distribution Function, Cumulative Distribution Function
  • this disclosure is based on the beam switching method of reinforcement learning (such as Q-learning), and the CDF statistical results of the SINR corresponding to the serving beam within 20s; the performance of the beam switching based on reinforcement learning in this disclosure is better.
  • the present disclosure is based on reinforcement learning (such as Q-learning) beam switching, and the number of beam switching occurrences is the least.
  • the embodiment of the present disclosure also provides a beam switching device, which is applied to the first network node.
  • the structural diagram of the device is shown in FIG. Receive and send data.
  • the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by the processor 1310 and various circuits of the memory represented by the memory 1320 are linked together.
  • the bus architecture can also link together various other circuits such as peripherals, voltage regulators, and power management circuits, all of which are well known in the art, therefore, it is not described further herein.
  • the bus interface provides the interface.
  • the transceiver 1300 may be a plurality of elements, including a transmitter and a receiver, providing a unit for communicating with various other devices over transmission media, including wireless channels, wired channels, optical cables, and other transmission media.
  • the processor 1310 is responsible for managing the bus architecture and general processing, and the memory 1320 is used to store computer programs, such as data used by the processor 1310 when performing operations.
  • the processor 1310 may be a central processing device (CPU), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field programmable gate array (Field-Programmable Gate Array, FPGA) or a complex programmable logic device (Complex Programmable Logic Device , CPLD), the processor can also adopt a multi-core architecture.
  • CPU central processing device
  • ASIC Application Specific Integrated Circuit
  • FPGA field programmable gate array
  • CPLD Complex Programmable Logic Device
  • Processor 1310 configured to read the computer program in the memory and perform the following operations:
  • first quality information respectively corresponding to multiple first beams within a first time period Acquiring first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes first quality information sent to the first network node by at least one user equipment UE;
  • the first average channel quality corresponding to a plurality of first beams, and the second average channel quality corresponding to a plurality of second beams in at least one second time period determine the first state vector; at least one second time period in the first before the time period;
  • obtaining first quality information respectively corresponding to a plurality of first beams within a first time period includes:
  • the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  • the first state vector is determined based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period, including:
  • the first average channel quality is quantized and mapped to obtain the first average channel quality corresponding to The value of the first information vector; and the second average channel quality is quantized and mapped to obtain the value of the second information vector corresponding to the second average channel quality;
  • a first state vector is obtained.
  • determining a beam switching threshold corresponding to at least one UE includes:
  • the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  • the first relationship model is determined by a first parameter corresponding to a first index change of at least one UE, and a second parameter corresponding to a beam switching cost.
  • the first relational model is trained by:
  • the relational model is trained, at least including:
  • the model parameters of the relational model are updated to obtain the updated relational model.
  • the relational model is trained, further comprising:
  • the model parameters of the updated relational model are updated to obtain the updated relational model.
  • the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model it further includes:
  • the termination condition is one of the following:
  • the loss function value is less than or equal to the loss function value threshold; or,
  • the loss function value is greater than or equal to the loss function value threshold.
  • the first quality information sent by at least one UE to the first network node is sent to the second network node.
  • an embodiment of the present disclosure also provides a beam switching device, which is applied to a UE.
  • the structural diagram of the device is shown in FIG. data.
  • the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by the processor 1410 and various circuits of the memory represented by the memory 1420 are linked together.
  • the bus architecture can also link together various other circuits such as peripherals, voltage regulators, and power management circuits, etc., which are well known in the art and therefore will not be further described herein.
  • the bus interface provides the interface.
  • Transceiver 1400 may be a plurality of elements, including transmitters and receivers, providing means for communicating with various other devices over transmission media, including wireless channels, wired channels, fiber optic cables, etc. Transmission medium.
  • the user interface 1430 can also be an interface capable of connecting externally and internally to required devices, and the connected devices include but are not limited to keypads, display screens, etc. monitors, speakers, microphones, joysticks, etc.
  • the processor 1410 is responsible for managing the bus architecture and general processing, and the memory 1420 can store data used by the processor 1410 when performing operations.
  • the processor 1410 can be a CPU (central device), ASIC (Application Specific Integrated Circuit, application specific integrated circuit), FPGA (Field-Programmable Gate Array, field programmable gate array) or CPLD (Complex Programmable Logic Device , complex programmable logic device), the processor can also adopt a multi-core architecture.
  • CPU central device
  • ASIC Application Specific Integrated Circuit
  • FPGA Field-Programmable Gate Array, field programmable gate array
  • CPLD Complex Programmable Logic Device , complex programmable logic device
  • the processor is used to execute the method of the second aspect provided by the embodiments of the present disclosure according to the obtained executable instruction by calling the computer program stored in the memory.
  • the processor and memory may also be physically separated.
  • Processor 1410 configured to read the computer program in memory 1420 and perform the following operations:
  • the corresponding first quality information, and the average channel quality corresponding to multiple beams in at least one second time period determine the beam switching threshold corresponding to the UE;
  • the embodiments of the present disclosure also provide a beam switching device, which is applied to the first network node.
  • the structural diagram of the device is shown in FIG. 13 .
  • the beam switching device 80 includes a first processing unit 801.
  • the first processing unit 801 is configured to acquire first quality information corresponding to multiple first beams respectively within a first time period, where the first quality information corresponding to multiple first beams respectively includes at least one user equipment UE sent to the first network The first quality information of the node;
  • the second processing unit 802 is configured to determine the first average channel quality corresponding to the multiple first beams based on the first quality information respectively corresponding to the multiple first beams;
  • the third processing unit 803 is configured to determine the first state vector based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period; at least a second time period preceding the first time period;
  • the fourth processing unit 804 is configured to determine a beam switching threshold corresponding to at least one UE based on the first state vector.
  • the first processing unit 801 is specifically configured to:
  • the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  • the third processing unit 803 is specifically configured to:
  • a first state vector is obtained.
  • the fourth processing unit 804 is specifically configured to:
  • the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  • the first relationship model is determined by a first parameter corresponding to a first index change of at least one UE, and a second parameter corresponding to a beam switching cost.
  • the first relational model is trained by:
  • the relational model is trained, at least including:
  • the model parameters of the relational model are updated to obtain the updated relational model.
  • the relational model is trained, further comprising:
  • the model parameters of the updated relational model are updated to obtain the updated relational model.
  • the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model it further includes:
  • the termination condition is one of the following:
  • the loss function value is less than or equal to the loss function value threshold; or,
  • the loss function value is greater than or equal to the loss function value threshold.
  • the first processing unit 801 is further configured to:
  • the first quality information sent by at least one UE to the first network node is sent to the second network node.
  • the embodiments of the present disclosure also provide a beam switching device, which is applied to a UE.
  • the structural diagram of the device is shown in FIG. Six processing units 902 .
  • the fifth processing unit 901 is configured to send the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the first quality information corresponding to the multiple beams within the first time period information, and based on the first quality information respectively corresponding to the multiple beams, and the average channel quality corresponding to the multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
  • the sixth processing unit 902 is configured to receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
  • the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a processor-readable storage medium.
  • the technical solution of the present application is essentially or part of the contribution to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor (processor) execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disc and other media that can store program codes. .
  • the embodiment of the present disclosure also provides a processor-readable storage medium
  • the essence is to store a computer program, and the computer program is used to implement the steps of any method for determining the time of cell handover provided by any one of the embodiments of the present disclosure or any one of the optional implementations when executed by a processor.
  • the processor-readable storage medium can be any available medium or data storage device that can be accessed by the processor, including but not limited to magnetic storage (e.g., floppy disk, hard disk, tape, magneto-optical disk (MO), etc.), optical storage (e.g., CD, DVD, BD, HVD, etc.), and semiconductor memory (such as ROM, EPROM, EEPROM, non-volatile memory (NAND FLASH), solid-state drive (SSD)), etc.
  • magnetic storage e.g., floppy disk, hard disk, tape, magneto-optical disk (MO), etc.
  • optical storage e.g., CD, DVD, BD, HVD, etc.
  • semiconductor memory such as ROM, EPROM, EEPROM, non-volatile memory (NAND FLASH), solid-state drive (SSD)
  • the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, optical storage, etc.) having computer-usable program code embodied therein.
  • processor-executable instructions may also be stored in a processor-readable memory capable of directing a computer or other programmable data processing device to operate in a specific manner, such that the instructions stored in the processor-readable memory produce a manufacturing product, the instruction device realizes the functions specified in one or more procedures of the flow chart and/or one or more blocks of the block diagram.
  • processor-executable instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented Execution instructions are provided for implementing A step of a function specified in a flow chart process or processes and/or a block diagram block or blocks.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Electromagnetism (AREA)
  • Quality & Reliability (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

Provided in the present disclosure is a beam switching method, comprising: acquiring first quality information respectively corresponding to a plurality of first beams within a first time period, the first quality information respectively corresponding to the plurality of first beams comprising first quality information sent by at least one user equipment (UE) to a first network node; on the basis of the first quality information respectively corresponding to the plurality of first beams, determining a first average channel quality corresponding to the plurality of first beams; determining a first state vector on the basis of the first average channel quality corresponding to the plurality of first beams and a second average channel quality corresponding to a plurality of second beams within at least one second time period, the at least one second time period being earlier than the first time period; and on the basis of the first state vector, determining a beam switching threshold corresponding to the at least one UE. In the method, dynamic adjustment on a beam switching threshold corresponding to at least one UE is achieved, thereby ensuring a proper beam switching frequency.

Description

波束切换方法、装置及处理器可读存储介质Beam switching method, device and processor-readable storage medium
相关申请的交叉引用Cross References to Related Applications
本公开要求于2022年01月26日在中国国家知识产权局提交的题为“波束切换方法、装置及处理器可读存储介质”的中国专利中请No.202210095304.5的优先权,该申请的全部内容通过引用并入于此。This disclosure claims the priority of Chinese Patent Application No. 202210095304.5 entitled "Beam Switching Method, Device, and Processor-Readable Storage Medium" filed at the State Intellectual Property Office of China on January 26, 2022, the entirety of which The contents are hereby incorporated by reference.
技术领域technical field
本公开涉及无线通信技术领域,具体而言,本公开涉及波束切换方法、装置及处理器可读存储介质。The present disclosure relates to the field of wireless communication technologies, and in particular, the present disclosure relates to a beam switching method, device, and processor-readable storage medium.
背景技术Background technique
现有技术中,网络节点和UE(User Equipment,用户设备)之间的信道可能由于障碍物或者UE旋转移动发生改变,从而导致网络节点和UE之间的波束质量降低或者其他波束的质量比该波束质量更好;在这种情况下,为了获得更好的系统性能,网络节点和UE之间需要进行波束切换。在基于固定切换阈值的切换过程中,若固定切换阈值较小,虽然能够维持较好的系统性能,但会导致频繁切换,甚至导致不必要的切换,造成负载开销增加;若固定切换阈值较大,虽然能减少切换的频率,但有的切换不能及时触发,从而导致系统的性能得不到保证。In the prior art, the channel between the network node and UE (User Equipment, user equipment) may change due to obstacles or rotational movement of the UE, resulting in reduced beam quality between the network node and UE or quality of other beams than this Better beam quality; in this case, beam switching is required between network nodes and UEs for better system performance. In the handover process based on the fixed handover threshold, if the fixed handover threshold is small, although better system performance can be maintained, it will lead to frequent handover, even unnecessary handover, resulting in increased load overhead; if the fixed handover threshold is large , although the frequency of handover can be reduced, some handovers cannot be triggered in time, resulting in the failure to guarantee the performance of the system.
发明内容Contents of the invention
本公开针对现有的方式的缺点,提出一种波束切换方法、装置及处理器可读存储介质,以在一定程度上解决上述的技术缺陷中的至少一个方面。Aiming at the shortcomings of the existing methods, the present disclosure proposes a beam switching method, device, and processor-readable storage medium, so as to solve at least one aspect of the above-mentioned technical defects to a certain extent.
第一方面,提供了一种波束切换方法,由第一网络节点执行,包括:In a first aspect, a beam switching method is provided, performed by a first network node, including:
获取第一时间段内多个第一波束分别对应的第一质量信息,多个第一 波束分别对应的第一质量信息包括至少一个用户设备UE发送给第一网络节点的第一质量信息;Acquiring the first quality information respectively corresponding to the multiple first beams in the first time period, the multiple first beams The first quality information respectively corresponding to the beams includes the first quality information sent by at least one user equipment UE to the first network node;
基于多个第一波束分别对应的第一质量信息,确定多个第一波束对应的第一平均信道质量;Based on the first quality information respectively corresponding to the multiple first beams, determine the first average channel quality corresponding to the multiple first beams;
基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;至少一个第二时间段在第一时间段之前;Based on the first average channel quality corresponding to a plurality of first beams, and the second average channel quality corresponding to a plurality of second beams in at least one second time period, determine the first state vector; at least one second time period in the first before the time period;
基于第一状态向量,确定至少一个UE对应的波束切换阈值。Based on the first state vector, determine a beam switching threshold corresponding to at least one UE.
在一个实施例中,获取第一时间段内多个第一波束分别对应的第一质量信息,包括:In an embodiment, obtaining first quality information respectively corresponding to a plurality of first beams within a first time period includes:
获取第一时间段内第二网络节点发送的第一质量信息;并接收至少一个UE发送的第一质量信息;Acquire first quality information sent by the second network node within the first time period; and receive first quality information sent by at least one UE;
其中,第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
在一个实施例中,基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量,包括:In an embodiment, the first state vector is determined based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period, including:
将第一平均信道质量进行量化映射处理,得到第一平均信道质量对应的第一信息向量的值;并将第二平均信道质量进行量化映射处理,得到第二平均信道质量对应的第二信息向量的值;Performing quantization mapping processing on the first average channel quality to obtain the value of the first information vector corresponding to the first average channel quality; performing quantization mapping processing on the second average channel quality to obtain a second information vector corresponding to the second average channel quality value;
基于第一信息向量和至少一个第二信息向量,得到第一状态向量。Based on the first information vector and at least one second information vector, a first state vector is obtained.
在一个实施例中,基于第一状态向量,确定至少一个UE对应的波束切换阈值,包括:In an embodiment, based on the first state vector, determining a beam switching threshold corresponding to at least one UE includes:
将第一状态向量输入至第一关系模型,进行匹配处理,得到与第一状态向量相匹配的一个或多个奖励值,并将一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为UE对应的波束切换阈值;Input the first state vector into the first relationship model, perform matching processing, obtain one or more reward values matching the first state vector, and set the beam switching threshold corresponding to the maximum reward value among the one or more reward values , determined as the beam switching threshold corresponding to the UE;
其中,第一关系模型用于表征第一状态向量、奖励值和波束切换阈值之间的关系。Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
在一个实施例中,第一关系模型通过至少一个UE的第一指标变化对 应的第一参数,以及波束切换代价对应的第二参数确定。In one embodiment, the first relationship model uses the first index change of at least one UE to The corresponding first parameter and the second parameter corresponding to the beam switching cost are determined.
在一个实施例中,第一关系模型是通过以下方式训练得到的:In one embodiment, the first relational model is trained by:
构建训练样本集合;基于训练样本集合,对关系模型进行训练,得到第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain a first relational model;
基于训练样本集合,对关系模型进行训练,至少包括:Based on the training sample set, the relational model is trained, at least including:
将训练样本集合中的第二状态向量输入至关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the relational model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
基于损失函数值,对关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relational model are updated to obtain the updated relational model.
在一个实施例中,基于训练样本集合,对关系模型进行训练,还包括:In one embodiment, based on the training sample set, the relational model is trained, further comprising:
在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
将训练样本集合中的第二状态向量输入至更新后的关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和更新后的关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the updated relationship model;
基于损失函数值,对更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relational model are updated to obtain the updated relational model.
在一个实施例中,在基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值之后,还包括:In one embodiment, after determining the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model, it further includes:
判断是否达到终止条件;Determine whether the termination condition is met;
若确定达到终止条件,则得到第一关系模型;If it is determined that the termination condition is met, the first relational model is obtained;
终止条件为以下中的一项:The termination condition is one of the following:
损失函数值小于或等于损失函数值阈值;或者, The loss function value is less than or equal to the loss function value threshold; or,
损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to the loss function value threshold.
在一个实施例中,将至少一个UE发送给第一网络节点的第一质量信息,发送给第二网络节点。In an embodiment, the first quality information sent by at least one UE to the first network node is sent to the second network node.
第二方面,提供了一种波束切换方法,由UE执行,包括:In a second aspect, a beam switching method is provided, which is performed by a UE, including:
向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使第一网络节点获取第一时间段内多个波束分别对应的第一质量信息,并基于多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定UE对应的波束切换阈值;Sending the first quality information corresponding to the first beam in the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams in the first time period, and based on the multiple beams respectively The corresponding first quality information, and the average channel quality corresponding to multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
接收第一网络节点发送的UE对应的波束切换阈值,并基于波束切换阈值进行波束切换。Receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
第三方面,提供了一种波束切换装置,应用于第一网络节点,包括存储器,收发机,处理器:In a third aspect, a beam switching device is provided, which is applied to a first network node, and includes a memory, a transceiver, and a processor:
存储器,用于存储计算机程序;memory for storing computer programs;
收发机,用于在处理器的控制下收发数据;a transceiver, configured to send and receive data under the control of the processor;
处理器,用于读取存储器中的计算机程序并执行以下操作:A processor that reads a computer program in memory and does the following:
获取第一时间段内多个第一波束分别对应的第一质量信息,多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给第一网络节点的第一质量信息;Acquiring first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes first quality information sent to the first network node by at least one user equipment UE;
基于多个第一波束分别对应的第一质量信息,确定多个第一波束对应的第一平均信道质量;Based on the first quality information respectively corresponding to the multiple first beams, determine the first average channel quality corresponding to the multiple first beams;
基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;至少一个第二时间段在第一时间段之前;Based on the first average channel quality corresponding to a plurality of first beams, and the second average channel quality corresponding to a plurality of second beams in at least one second time period, determine the first state vector; at least one second time period in the first before the time period;
基于第一状态向量,确定至少一个UE对应的波束切换阈值。Based on the first state vector, determine a beam switching threshold corresponding to at least one UE.
第四方面,提供了一种波束切换装置,应用于UE,包括存储器,收发机,处理器:In a fourth aspect, a beam switching device is provided, which is applied to a UE, including a memory, a transceiver, and a processor:
存储器,用于存储计算机程序;memory for storing computer programs;
收发机,用于在处理器的控制下收发数据;a transceiver, configured to send and receive data under the control of the processor;
处理器,用于读取存储器中的计算机程序并执行以下操作: A processor that reads a computer program in memory and does the following:
向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使第一网络节点获取第一时间段内多个波束分别对应的第一质量信息,并基于多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定UE对应的波束切换阈值;Sending the first quality information corresponding to the first beam in the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams in the first time period, and based on the multiple beams respectively The corresponding first quality information, and the average channel quality corresponding to multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
接收第一网络节点发送的UE对应的波束切换阈值,并基于波束切换阈值进行波束切换。Receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
第五方面,本公开提供了一种波束切换装置,应用于第一网络节点,包括:In a fifth aspect, the present disclosure provides a beam switching device applied to a first network node, including:
第一处理单元,用于获取第一时间段内多个第一波束分别对应的第一质量信息,多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给第一网络节点的第一质量信息;The first processing unit is configured to acquire first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes at least one user equipment UE sent to the first network node the first quality information of
第二处理单元,用于基于多个第一波束分别对应的第一质量信息,确定多个第一波束对应的第一平均信道质量;The second processing unit is configured to determine the first average channel quality corresponding to the multiple first beams based on the first quality information respectively corresponding to the multiple first beams;
第三处理单元,用于基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;至少一个第二时间段在第一时间段之前;A third processing unit, configured to determine a first state vector based on a first average channel quality corresponding to a plurality of first beams, and a second average channel quality corresponding to a plurality of second beams in at least one second time period; at least one the second time period is before the first time period;
第四处理单元,用于基于第一状态向量,确定至少一个UE对应的波束切换阈值。The fourth processing unit is configured to determine a beam switching threshold corresponding to at least one UE based on the first state vector.
第六方面,本公开提供了一种波束切换装置,应用于UE,包括:In a sixth aspect, the present disclosure provides a beam switching device applied to a UE, including:
第五处理单元,用于向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使第一网络节点获取第一时间段内多个波束分别对应的第一质量信息,并基于多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定UE对应的波束切换阈值;The fifth processing unit is configured to send the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams within the first time period , and based on the first quality information respectively corresponding to the multiple beams, and the average channel quality corresponding to the multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
第六处理单元,用于接收第一网络节点发送的UE对应的波束切换阈值,并基于波束切换阈值进行波束切换。The sixth processing unit is configured to receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
第七方面,提供了一种处理器可读存储介质,其中,处理器可读存储介质存储有计算机程序,计算机程序被处理器执行时,实现第一方面和第二方面所述的方法。 In a seventh aspect, a processor-readable storage medium is provided, wherein the processor-readable storage medium stores a computer program, and when the computer program is executed by a processor, the methods described in the first aspect and the second aspect are implemented.
本公开实施例提供的技术方案,至少具有如下有益效果:The technical solutions provided by the embodiments of the present disclosure have at least the following beneficial effects:
实现了对至少一个UE对应的波束切换阈值进行动态地调整,从而在能够保持系统性能的同时,也可以保证适当的波束切换频率。A dynamic adjustment of the beam switching threshold corresponding to at least one UE is realized, so that an appropriate beam switching frequency can be guaranteed while system performance can be maintained.
本公开附加的方面和优点将在下面的描述中部分给出,这些将从下面的描述中变得明显,或通过本公开的实践了解到。Additional aspects and advantages of the disclosure will be set forth in part in the description which follows, and will become apparent from the description, or may be learned by practice of the disclosure.
附图说明Description of drawings
为了更清楚地说明本公开实施例中的技术方案,下面将对本公开实施例描述中所需要使用的附图作简单地介绍。In order to illustrate the technical solutions in the embodiments of the present disclosure more clearly, the following briefly introduces the drawings that are required for the description of the embodiments of the present disclosure.
图1为本公开实施例提供的系统架构的示意图;FIG. 1 is a schematic diagram of a system architecture provided by an embodiment of the present disclosure;
图2为本公开实施例提供的一种波束切换方法的流程示意图;FIG. 2 is a schematic flowchart of a beam switching method provided by an embodiment of the present disclosure;
图3为本公开实施例提供的波束切换的示意图;FIG. 3 is a schematic diagram of beam switching provided by an embodiment of the present disclosure;
图4为本公开实施例提供的另一种波束切换方法的流程示意图;FIG. 4 is a schematic flowchart of another beam switching method provided by an embodiment of the present disclosure;
图5为本公开实施例提供的强化学习的流程示意图;FIG. 5 is a schematic flow diagram of reinforcement learning provided by an embodiment of the present disclosure;
图6为本公开实施例提供的又一种波束切换方法的流程示意图;FIG. 6 is a schematic flowchart of another beam switching method provided by an embodiment of the present disclosure;
图7为本公开实施例提供的又一种波束切换方法的流程示意图;FIG. 7 is a schematic flowchart of another beam switching method provided by an embodiment of the present disclosure;
图8为本公开实施例提供的SINR统计的示意图;FIG. 8 is a schematic diagram of SINR statistics provided by an embodiment of the present disclosure;
图9为本公开实施例提供的SINR统计的示意图;FIG. 9 is a schematic diagram of SINR statistics provided by an embodiment of the present disclosure;
图10为本公开实施例提供的切换次数统计的示意图;FIG. 10 is a schematic diagram of counting the number of handovers provided by an embodiment of the present disclosure;
图11为本公开实施例提供的一种波束切换装置的结构示意图;FIG. 11 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure;
图12为本公开实施例提供的一种波束切换装置的结构示意图;FIG. 12 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure;
图13为本公开实施例提供的一种波束切换装置的结构示意图;FIG. 13 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure;
图14为本公开实施例提供的一种波束切换装置的结构示意图。Fig. 14 is a schematic structural diagram of a beam switching device provided by an embodiment of the present disclosure.
具体实施方式Detailed ways
下面详细描述本申请的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本申请,而不能解释为对本申请的限制。 Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present application, and are not construed as limiting the present application.
本技术领域技术人员可以理解,除非特意声明,这里使用的单数形式“一”、“一个”、“所述”和“该”也可包括复数形式。应该进一步理解的是,本申请的说明书中使用的措辞“包括”是指存在所述特征、整数、步骤、操作、元件和/或组件,但是并不排除存在或添加一个或多个其他特征、整数、步骤、操作、元件、组件和/或它们的组。应该理解,当我们称元件被“连接”或“耦接”到另一元件时,它可以直接连接或耦接到其他元件,或者也可以存在中间元件。此外,这里使用的“连接”或“耦接”可以包括无线连接或无线耦接。这里使用的措辞“和/或”包括一个或更多个相关联的列出项的全部或任一单元和全部组合。Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of said features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Additionally, "connected" or "coupled" as used herein may include wireless connection or wireless coupling. The expression "and/or" used herein includes all or any elements and all combinations of one or more associated listed items.
本申请实施例中术语“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。本申请实施例中术语“多个”是指两个或两个以上,其它量词与之类似。The term "and/or" in the embodiments of this application describes the association relationship of associated objects, indicating that there may be three relationships, for example, A and/or B, which may mean: A exists alone, A and B exist simultaneously, and B exists alone These three situations. The character "/" generally indicates that the contextual objects are an "or" relationship. The term "plurality" in the embodiments of the present application refers to two or more, and other quantifiers are similar.
本申请中的“基于A确定B”表示确定B时要考虑A这个因素。并不限于“只基于A就可以确定出B”,还应包括:“基于A和C确定B”、“基于A、C和E确定B”、基于“A确定C,基于C进一步确定B”等。另外还可以包括将A作为确定B的条件,例如,“当A满足第一条件时,使用第一方法确定B”;再例如,“当A满足第二条件时,确定B”等;再例如,“当A满足第三条件时,基于第一参数确定B”等。当然也可以是将A作为确定B的因素的条件,例如,“当A满足第一条件时,使用第一方法确定C,并进一步基于C确定B”等。"Determine B based on A" in this application means that the factor A should be considered when determining B. It is not limited to "B can be determined only based on A", but also includes: "Determine B based on A and C", "Determine B based on A, C and E", "determine C based on A, further determine B based on C" wait. In addition, it may also include A as the condition for determining B, for example, "when A meets the first condition, use the first method to determine B"; another example, "when A meets the second condition, determine B"; another example , "when A satisfies the third condition, determine B based on the first parameter" and so on. Of course, it may also be a condition that takes A as a factor for determining B, for example, "when A satisfies the first condition, use the first method to determine C, and further determine B based on C" and so on.
本申请中的“根据A确定B”表示确定B时要考虑A这个因素。并不限于“只根据A就可以确定出B”,还应包括:“根据A和C确定B”、“根据A、C和E确定B”、根据“A确定C,根据C进一步确定B”等。另外还可以包括将A作为确定B的条件,例如,“当A满足第一条件时,使用第一方法确定B”;再例如,“当A满足第二条件时,确定B”等;再例如,“当A满足第三条件时,基于第一参数确定B”等。当然也可以是将A作为确定B的因素的条件,例如,“当A满足第一条件时,使用第一方法确定C,并进一步基于C确定B”等。 "Determine B based on A" in this application means that the factor A should be considered when determining B. It is not limited to "B can be determined based on A only", but also includes: "B is determined based on A and C", "B is determined based on A, C and E", "C is determined based on A, and B is further determined based on C" wait. In addition, it may also include A as the condition for determining B, for example, "when A meets the first condition, use the first method to determine B"; another example, "when A meets the second condition, determine B"; another example , "when A satisfies the third condition, determine B based on the first parameter" and so on. Of course, it may also be a condition that takes A as a factor for determining B, for example, "when A satisfies the first condition, use the first method to determine C, and further determine B based on C" and so on.
为了更好的理解及说明本公开实施例的方案,下面对本公开实施例中所涉及到的一些技术用语进行简单说明。In order to better understand and illustrate the solutions of the embodiments of the present disclosure, some technical terms involved in the embodiments of the present disclosure will be briefly described below.
(1)波束切换技术(1) Beam switching technology
随着通信系统对数据流量需求的不断增加,毫米波技术成为5G(5th Generation Mobile Communication Technology,第五代移动通信技术)中的关键技术。而由于毫米波的高频率特性,其链路可能会遭受严重的路径损失,为了克服这一缺点,毫米波通信系统需要使用基于波束形成的Massive MIMO(Multiple-input Multiple-output,大规模多入多出技术)技术。波束形成技术能有效地提高移动通信系统的频谱效率,然而每个波束的覆盖范围有限,需要波束切换来维持良好的系统性能。例如当UE离开服务波束的覆盖范围时,该服务波束将不再适合该UE,需要进行波束切换,或网络节点和UE之间存在障碍物使得当前服务波束质量骤降,也需要进行波束切换。With the increasing demand for data traffic in communication systems, millimeter wave technology has become a key technology in 5G (5th Generation Mobile Communication Technology, fifth generation mobile communication technology). Due to the high-frequency characteristics of millimeter waves, their links may suffer severe path loss. In order to overcome this shortcoming, millimeter wave communication systems need to use beamforming-based Massive MIMO (Multiple-input Multiple-output, large-scale multiple-input More technology) technology. Beamforming technology can effectively improve the spectral efficiency of mobile communication systems. However, the coverage of each beam is limited, and beam switching is required to maintain good system performance. For example, when the UE leaves the coverage of the serving beam, the serving beam will no longer be suitable for the UE, and beam switching is required, or there is an obstacle between the network node and the UE that causes the quality of the current serving beam to drop suddenly, and beam switching is also required.
在实际应用场景中,网络节点例如毫米波基站,由于覆盖范围较小,所以毫米波基站部署密度要比LTE(Long Term Evolution,长期演进)基站高。在使用波束的毫米波网络中,波束切换不仅发生在毫米波基站之间,而且还发生在同一毫米波基站内部的波束之间。毫米波基站的密集部署和波束的使用将进一步增加波束切换的频率,这可能限制实际系统的性能,所以需要对波束切换过程进行优化。In practical application scenarios, network nodes such as millimeter-wave base stations have a smaller coverage area, so the deployment density of millimeter-wave base stations is higher than that of LTE (Long Term Evolution, long-term evolution) base stations. In mmWave networks using beams, beam switching occurs not only between mmWave base stations, but also between beams within the same mmWave base station. The dense deployment of mmWave base stations and the use of beams will further increase the frequency of beam switching, which may limit the performance of practical systems, so the beam switching process needs to be optimized.
在波束管理过程中,波束扫描和报告周期地执行,基于波束报告的结果,当前服务波束的质量发生波动时,可以进行波束切换以获得更好的通信性能。针对该类型的波束切换,可以对切换的参数进行优化以提升系统性能并减少波束故障的概率,同时减少波束切换次数以减少系统的负载开销。In the process of beam management, beam scanning and reporting are performed periodically. Based on the results of beam reporting, when the quality of the current serving beam fluctuates, beam switching can be performed to obtain better communication performance. For this type of beam switching, switching parameters can be optimized to improve system performance and reduce the probability of beam failure, while reducing the number of beam switching times to reduce system load overhead.
(2)基于固定阈值(固定切换阈值)的波束切换(2) Beam switching based on fixed threshold (fixed switching threshold)
网络节点例如基站,基于测量报告的波束切换包括两部分的切换:一部分是跨基站的波束切换,另一部分是基站内部的波束切换。经过实际的仿真统计,大部分的波束切换是发生在不同基站之间。因此参考小区切换 的模式中的A3事件,UE在接入网络节点,例如AP(Access Point,接入点)之后,波束扫描周期性地进行,UE对所有波束进行波束测量后,保存最佳的N个波束序号及其质量,并得到候选波束,UE反馈相关的波束信息(例如SINR(Signal to Interference plus Noise Ratio,信号与干扰噪声比)等)至AP,AP依据固定阈值的判定方法进行波束是否进行切换的判断。For a network node such as a base station, the beam switching based on the measurement report includes two parts of switching: one part is beam switching across base stations, and the other part is beam switching inside the base station. According to actual simulation statistics, most beam switching occurs between different base stations. Therefore refer to cell handover In the A3 event in the mode, after the UE accesses a network node, such as an AP (Access Point, access point), the beam scanning is performed periodically, and after the UE performs beam measurement on all beams, it saves the best N beam sequence numbers and its quality, and obtain candidate beams, the UE feeds back relevant beam information (such as SINR (Signal to Interference plus Noise Ratio, signal to interference and noise ratio), etc.) to the AP, and the AP determines whether the beam should be switched according to the fixed threshold determination method judge.
以SINR作为波束质量的参考指标;每次上报所有波束质量进行切换的触发判断。固定阈值的判定方法相关的参数有:TTT(Time-to-trigger,触发时间)、HM(Hysteresis Margin,迟滞阈值)等。HM表示当前服务波束的SINR与目标波束的SINR的最小差值,该阈值记作Δ,TTT表示目标波束满足触发差值的条件持续一定时间。在基于固定阈值的切换设置中,如果测量到其他的波束比当服务波束的质量更好,且超过一定阈值Δ,并且该条件保持了一段时间(TTT),那么就会触发波束切换。切换判定流程包括:AP基于上报的波束质量,判断候选波束是否进入波束切换的判定过程,若满足SINRserve+Δ<SINRcandidate的条件,则进入切换判定,该判定过程需要持续TTT时间;在进入切换判定过程后,若在TTT时间内均满足SINRserve+Δ<SINRcandidate的条件,则在判定结束后发生波束切换,服务波束切换为该候选波束;若某时刻不满足上述条件,则中断判定过程,此次判定结束,不发生波束切换。Use SINR as the reference index of beam quality; report the trigger judgment of handover for all beam qualities each time. The parameters related to the determination method of the fixed threshold include: TTT (Time-to-trigger, trigger time), HM (Hysteresis Margin, hysteresis threshold), etc. HM indicates the minimum difference between the SINR of the current serving beam and the SINR of the target beam, and the threshold is denoted as Δ, and TTT indicates that the target beam satisfies the condition of triggering the difference for a certain period of time. In the fixed-threshold-based handover setting, beam handoff is triggered if the quality of other beams measured to be better than the serving beam exceeds a certain threshold Δ, and this condition persists for a period of time (TTT). The handover judgment process includes: the AP judges whether the candidate beam enters the beam handover judgment process based on the reported beam quality. If the condition of SINR serve +Δ<SINR candidate is met, the handover judgment is entered. The judgment process needs to last for TTT time; After the switching judgment process, if the condition of SINR serve +Δ<SINR candidate is satisfied within the TTT time, beam switching will occur after the judgment ends, and the serving beam will be switched to the candidate beam; if the above conditions are not satisfied at a certain moment, the judgment will be interrupted process, this judgment ends, and beam switching does not occur.
在切换判定流程中,选取波束报告中每个UE测量结果中SINR最大的波束为候选波束,加入候选波束集;若该候选波束为其他用户的服务波束,为避免波束冲突,则选择SINR第二大的波束为候选波束,并以此后推。若候选波束满足阈值的要求,则触发切换判定过程。在波束切换的判定过程中,候选波束不会再被其他用户选择,同时该用户也不会再选择其他波束进行切换判定,直到一次波束切换判定结束。In the handover decision process, select the beam with the largest SINR in the measurement results of each UE in the beam report as the candidate beam and add it to the candidate beam set; The larger beam is the candidate beam, and it will be pushed forward accordingly. If the candidate beam satisfies the requirement of the threshold, the handover decision process is triggered. During the determination process of beam switching, the candidate beam will not be selected by other users, and at the same time, the user will not select other beams for switching determination until a beam switching determination ends.
下面将结合本公开实施例中的附图,对本公开实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本公开一部分实施例,并不是全部的实施例。基于本公开中的实施例,本领域普通技术人员在没 有做出创造性劳动前提下所获得的所有其他实施例,都属于本公开保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present disclosure with reference to the accompanying drawings in the embodiments of the present disclosure. Obviously, the described embodiments are only some of the embodiments of the present disclosure, not all of them. Based on the embodiments in this disclosure, those of ordinary skill in the art All other embodiments obtained under the premise of making creative work belong to the protection scope of the present disclosure.
本公开实施例提供的一种系统架构的示意图如图1所示,该系统架构包括:UE和网络节点,其中,UE例如图1中的UE110、UE111、UE112和UE113,网络节点例如图1中网络节点120和网络节点121。网络节点部署在接入网中,例如,网络节点部署在5G系统中的接入网NG-RAN(New Generation-Radio Access Network,新一代无线接入网)。UE和网络节点之间通过某种空口技术互相通信,例如可以通过蜂窝技术相互通信。A schematic diagram of a system architecture provided by an embodiment of the present disclosure is shown in FIG. 1 . The system architecture includes: UEs and network nodes, wherein UEs are, for example, UE110, UE111, UE112, and UE113 in FIG. 1 , and network nodes are, for example, UEs in FIG. Network node 120 and network node 121. The network nodes are deployed in the access network, for example, the network nodes are deployed in the access network NG-RAN (New Generation-Radio Access Network, New Generation Radio Access Network) in the 5G system. The UE and the network node communicate with each other through a certain air interface technology, for example, they may communicate with each other through a cellular technology.
本公开实施例涉及的UE,可以是指向用户提供语音和/或数据连通性的设备,具有无线连接功能的手持式设备、或连接到无线调制解调器的其他处理设备等。UE的类型包括手机、车辆用户终端、平板电脑、膝上型电脑、个人数字助理、移动上网装置、可穿戴式设备等。The UE involved in the embodiments of the present disclosure may be a device that provides voice and/or data connectivity to a user, a handheld device with a wireless connection function, or other processing devices connected to a wireless modem. Types of UEs include mobile phones, vehicle user terminals, tablets, laptops, personal digital assistants, mobile Internet devices, wearable devices, and the like.
本公开实施例涉及的网络节点可以是基站,该基站可以包括多个为终端提供服务的小区。根据具体应用场合不同,基站又可以称为接入点AP,或者可以是接入网中在空中接口上通过一个或多个扇区与无线终端设备通信的设备,或者其它名称。网络节点可用于将收到的空中帧与网际协议(Internet Protocol,IP)分组进行相互更换,作为无线终端设备与接入网的其余部分之间的路由器,其中接入网的其余部分可包括网际协议(IP)通信网络。网络节点还可协调对空中接口的属性管理。例如,本公开实施例涉及的网络节点可以是全球移动通信系统(Global System for Mobile communications,GSM)或码分多址接入(Code Division Multiple Access,CDMA)中的网络设备(Base Transceiver Station,BTS),也可以是带宽码分多址接入(Wide-band Code Division Multiple Access,WCDMA)中的网络设备(NodeB),还可以是长期演进(long term evolution,LTE)系统中的演进型网络设备(evolutional Node B,eNB或e-NodeB)、5G网络架构(next generation system)中的5G基站(gNB)、超5代移动通信系统(B5G)中的B5G基站、6G(6th Generation Mobile Communication Technology,第六代移动通信技术)网络架构中的6G基站,也可以是家 庭演进基站(Home evolved Node B,HeNB)、中继节点(relay node)、家庭基站(femto)、微微基站(pico)、毫米波基站等,本公开实施例中并不限定。在一些网络结构中,网络节点可以包括集中单元(centralized unit,CU)节点和分布单元(distributed unit,DU)节点,集中单元和分布单元也可以地理上分开布置。The network node involved in this embodiment of the present disclosure may be a base station, and the base station may include multiple cells that provide services for a terminal. Depending on the specific application, the base station can also be called an access point AP, or it can be a device in the access network that communicates with wireless terminal devices through one or more sectors on the air interface, or other names. The network node is operable to interchange received over-the-air frames with Internet Protocol (IP) packets, acting as a router between the wireless terminal device and the rest of the access network, which may include the Internet Protocol (IP) communication network. The network nodes may also coordinate attribute management for the air interface. For example, the network node involved in the embodiments of the present disclosure may be a network device (Base Transceiver Station, BTS) in Global System for Mobile communications (GSM) or Code Division Multiple Access (Code Division Multiple Access, CDMA) ), it can also be a network device (NodeB) in Wide-band Code Division Multiple Access (WCDMA), or it can be an evolved network device in a long term evolution (long term evolution, LTE) system (evolutional Node B, eNB or e-NodeB), 5G base station (gNB) in the 5G network architecture (next generation system), B5G base station in the super 5th generation mobile communication system (B5G), 6G (6th Generation Mobile Communication Technology, The 6G base station in the network architecture of the sixth generation mobile communication technology can also be a home The home evolved base station (Home evolved Node B, HeNB), relay node (relay node), home base station (femto), pico base station (pico), millimeter wave base station, etc. are not limited in the embodiments of the present disclosure. In some network structures, network nodes may include centralized unit (centralized unit, CU) nodes and distributed unit (distributed unit, DU) nodes, and the centralized unit and distributed unit may also be arranged geographically separately.
为使本公开的目的、技术方案和优点更加清楚,下面将结合附图对本公开实施方式作进一步地详细描述。In order to make the purpose, technical solution and advantages of the present disclosure clearer, the implementation manners of the present disclosure will be further described in detail below in conjunction with the accompanying drawings.
本公开实施例中提供了一种波束切换方法,由第一网络节点执行,该方法的流程示意图如图2所示,该方法包括:An embodiment of the present disclosure provides a beam switching method, which is executed by the first network node. The flowchart of the method is shown in FIG. 2 , and the method includes:
S201,获取第一时间段内多个第一波束分别对应的第一质量信息,多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给第一网络节点的第一质量信息。S201. Acquire first quality information respectively corresponding to a plurality of first beams within a first time period, where the first quality information respectively corresponding to a plurality of first beams includes first quality information sent by at least one user equipment UE to a first network node .
具体地,第一网络节点获取第一时间段内至少一个第二网络节点的多个波束分别对应的第一质量信息;并接收至少一个UE发送的第一质量信息。其中,第一质量信息可以包括SINR、RSRP(Reference Signal Receiving Power,接收功率)、RSRQ(Reference Signal Receiving Quality,接收质量)中的至少一项。例如,第一网络节点获得第一时间段内20个第一波束分别对应的第一质量信息,即在第一时间段内,第一网络节点获得了20个第一质量信息,这20个第一质量信息包括一个或多个第二网络节点发送给第一网络节点的10个第一质量信息,以及10个UE分别发送给第一网络节点的第一质量信息。Specifically, the first network node acquires first quality information respectively corresponding to multiple beams of at least one second network node within the first time period; and receives the first quality information sent by at least one UE. Wherein, the first quality information may include at least one of SINR, RSRP (Reference Signal Receiving Power, received power), and RSRQ (Reference Signal Receiving Quality, received quality). For example, the first network node obtains first quality information corresponding to 20 first beams in the first time period, that is, in the first time period, the first network node obtains 20 first quality information, and the 20 first beams The piece of quality information includes 10 pieces of first quality information sent by one or more second network nodes to the first network node, and 10 pieces of first quality information sent by the 10 UEs to the first network node respectively.
S202,基于多个第一波束分别对应的第一质量信息,确定多个第一波束对应的第一平均信道质量。S202. Based on the first quality information respectively corresponding to the multiple first beams, determine first average channel quality corresponding to the multiple first beams.
具体地,例如,第一网络节点获得了20个第一质量信息,第一质量信息为SINR,这20个第一质量信息为20个SINR,则计算这20个SINR的平均值,并将这20个SINR的平均值作为第一平均信道质量。Specifically, for example, the first network node obtains 20 pieces of first quality information, the first quality information is SINR, and the 20 pieces of first quality information are 20 SINRs, then the average value of these 20 SINRs is calculated, and the The average value of 20 SINRs is used as the first average channel quality.
S203,基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;至少一个第二时间段在第一时间段之前。 S203. Determine a first state vector based on the first average channel quality corresponding to multiple first beams and the second average channel quality corresponding to multiple second beams in at least one second time period; at least one second time period is in Before the first time period.
具体地,如图3所示,滑动窗口包括N个时间窗口,这N个时间窗口分别是第一个时间窗口、第二个时间窗口、第三个时间窗口、…第N个时间窗口,第一个时间窗口、第二个时间窗口、第三个时间窗口、…第N个时间窗口的时间长度可以预先设定,N为正整数。第一时间段为第一个时间窗口,第一个时间窗口为当前窗口;多个第二时间段可以分别为第二个时间窗口、第三个时间窗口、…第N个时间窗口,其中,第二个时间窗口、第三个时间窗口、…第N个时间窗口都为历史窗口;第一个时间窗口平均信道质量可以为第一平均信道质量,第二个时间窗口平均信道质量可以为第二平均信道质量、第三个时间窗口平均信道质量可以为第二平均信道质量、…第N个时间窗口平均信道质量可以为第二平均信道质量;第二个时间窗口平均信道质量、第三个时间窗口平均信道质量、…第N个时间窗口平均信道质量之间可以不同;第一个时间窗口平均信道质量可以用SINR1表示,第二个时间窗口平均信道质量、第三个时间窗口平均信道质量、…第N个时间窗口平均信道质量可以分别用SINR2、SINR3、…SINRN表示。Specifically, as shown in Figure 3, the sliding window includes N time windows, and these N time windows are respectively the first time window, the second time window, the third time window, ... the Nth time window, the Nth time window The time lengths of one time window, the second time window, the third time window, ... the Nth time window can be preset, and N is a positive integer. The first time period is the first time window, and the first time window is the current window; multiple second time periods can be respectively the second time window, the third time window, ... the Nth time window, wherein, The second time window, the third time window, ... the Nth time window are all historical windows; the average channel quality of the first time window can be the first average channel quality, and the average channel quality of the second time window can be the first The average channel quality of the second time window, the average channel quality of the third time window can be the second average channel quality, ... The average channel quality of the Nth time window can be the second average channel quality; the average channel quality of the second time window, the third The average channel quality of the time window, ... the average channel quality of the Nth time window can be different; the average channel quality of the first time window can be expressed by SINR 1 , the average channel quality of the second time window, and the average channel quality of the third time window The quality, ... the average channel quality of the Nth time window can be represented by SINR 2 , SINR 3 , ... SINR N respectively.
需要说明的是,考虑到系统性能随时间的变化趋势,以及避免未来某些不必要的波束切换发生,采用滑动窗口可以考虑到历史经验的影响。It should be noted that, taking into account the changing trend of system performance over time and avoiding unnecessary beam switching in the future, the use of sliding windows can take into account the influence of historical experience.
S204,基于第一状态向量,确定至少一个UE对应的波束切换阈值。S204. Based on the first state vector, determine a beam switching threshold corresponding to at least one UE.
具体地,波束切换阈值可以包括TTT、HM等。Specifically, the beam switching threshold may include TTT, HM, and the like.
本公开实施例中,实现了对至少一个UE对应的波束切换阈值进行动态地调整,从而在能够保持系统性能的同时,也可以保证适当的波束切换频率。In the embodiments of the present disclosure, it is realized that the beam switching threshold corresponding to at least one UE is dynamically adjusted, so that while maintaining system performance, an appropriate beam switching frequency can also be guaranteed.
在一个实施例中,获取第一时间段内多个第一波束分别对应的第一质量信息,包括:In an embodiment, obtaining first quality information respectively corresponding to a plurality of first beams within a first time period includes:
获取第一时间段内第二网络节点发送的第一质量信息;并接收至少一个UE发送的第一质量信息;Acquire first quality information sent by the second network node within the first time period; and receive first quality information sent by at least one UE;
其中,第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
在一个实施例中,第一网络节点可以获取第一时间段内至少一个第二 网络节点的多个波束分别对应的第一质量信息;并接收至少一个UE发送的第一质量信息。In one embodiment, the first network node can obtain at least one second The multiple beams of the network node respectively correspond to the first quality information; and receive the first quality information sent by at least one UE.
在一个实施例中,基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量,包括:In an embodiment, the first state vector is determined based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period, including:
将第一平均信道质量进行量化映射处理,得到第一平均信道质量对应的第一信息向量的值;并将第二平均信道质量进行量化映射处理,得到第二平均信道质量对应的第二信息向量的值;Performing quantization mapping processing on the first average channel quality to obtain the value of the first information vector corresponding to the first average channel quality; performing quantization mapping processing on the second average channel quality to obtain a second information vector corresponding to the second average channel quality value;
基于第一信息向量和至少一个第二信息向量,得到第一状态向量。Based on the first information vector and at least one second information vector, a first state vector is obtained.
具体地,第一信息向量的值和第二信息向量的值都为离散值,第一信息向量的值和第二信息向量的值可以用于表征信号强度等级,信号强度等级可以为1、2、3、…m等,其中,m为正整数。又例如,第一信息向量为b1,多个第二信息向量可以分别为b2、b3、…bn,则第一状态向量为(b1,b2,b3,…bn)。Specifically, the value of the first information vector and the value of the second information vector are both discrete values, the value of the first information vector and the value of the second information vector can be used to represent the signal strength level, and the signal strength level can be 1 or 2 , 3, ... m, etc., wherein, m is a positive integer. For another example, the first information vector is b 1 , multiple second information vectors can be b 2 , b 3 , ... b n respectively, then the first state vector is (b 1 , b 2 , b 3 , ... b n ) .
在一个实施例中,基于第一状态向量,确定至少一个UE对应的波束切换阈值,包括:In an embodiment, based on the first state vector, determining a beam switching threshold corresponding to at least one UE includes:
将第一状态向量输入至第一关系模型,进行匹配处理,得到与第一状态向量相匹配的一个或多个奖励值,并将一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为UE对应的波束切换阈值;Input the first state vector into the first relationship model, perform matching processing, obtain one or more reward values matching the first state vector, and set the beam switching threshold corresponding to the maximum reward value among the one or more reward values , determined as the beam switching threshold corresponding to the UE;
其中,第一关系模型用于表征第一状态向量、奖励值和波束切换阈值之间的关系。Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
具体地,第一关系模型可以为表格,该表格可以用于表征第一状态向量、奖励值和波束切换阈值之间的关系。奖励值可以用于表征UE基于UE对应的波束切换阈值,进行波束切换的效果。最大奖励值可以用于表征UE基于UE对应的波束切换阈值,进行波束切换的最好效果。Specifically, the first relationship model may be a table, and the table may be used to characterize the relationship among the first state vector, the reward value, and the beam switching threshold. The reward value may be used to represent the effect of the UE performing beam switching based on the UE's corresponding beam switching threshold. The maximum reward value can be used to represent the best effect of the UE performing beam switching based on the UE's corresponding beam switching threshold.
在一个实施例中,第一关系模型通过至少一个UE的第一指标变化对应的第一参数,以及波束切换代价对应的第二参数确定。In an embodiment, the first relationship model is determined by a first parameter corresponding to a first index change of at least one UE, and a second parameter corresponding to a beam switching cost.
具体地,第一指标可以包括信道容量、信噪比、误块率BLER中的至少一项。 Specifically, the first index may include at least one of channel capacity, signal-to-noise ratio, and block error rate BLER.
在一个实施例中,第一关系模型可以为表格,该表格可以通过至少一个UE的信道容量变化对应的第一参数,以及波束切换代价对应的第二参数确定。In an embodiment, the first relational model may be a table, and the table may be determined by using a first parameter corresponding to a channel capacity change of at least one UE and a second parameter corresponding to a beam switching cost.
具体地,至少一个UE的信道容量变化对应的第一参数可以是波束切换代价对应的第二参数可以是βc*Hn,其中,表示状态向量对应的波束切换阈值导致的容量变化,表示至少一个UE的信道容量C的平均值,βc为预设的惩罚因子,Hn为平均切换次数。Specifically, the first parameter corresponding to the channel capacity change of at least one UE may be The second parameter corresponding to the beam switching cost may be β c *H n , where, Indicates that the beam switching threshold corresponding to the state vector results in change in capacity, represents the average value of channel capacity C of at least one UE, β c is a preset penalty factor, and H n is an average number of handovers.
在一个实施例中,第一关系模型可以通过强化学习进行训练。例如,第一关系模型是通过以下方式训练得到的:In one embodiment, the first relational model can be trained by reinforcement learning. For example, the first relational model is trained by:
构建训练样本集合;基于训练样本集合,对关系模型进行训练,得到第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain a first relational model;
基于训练样本集合,对关系模型进行训练,至少包括:Based on the training sample set, the relational model is trained, at least including:
将训练样本集合中的第二状态向量输入至关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the relational model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
基于损失函数值,对关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relational model are updated to obtain the updated relational model.
在一个实施例中,在基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值之后,还包括:In one embodiment, after determining the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model, it further includes:
判断是否达到终止条件;Determine whether the termination condition is met;
若确定达到终止条件,则得到第一关系模型;If it is determined that the termination condition is met, the first relational model is obtained;
终止条件为以下中的一项:The termination condition is one of the following:
损失函数值小于或等于损失函数值阈值;或者,The loss function value is less than or equal to the loss function value threshold; or,
损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to the loss function value threshold.
具体的,将训练样本集合中的第二状态向量输入至关系模型,确定第二状态向量对应的波束切换阈值;基于第二状态向量对应的波束切换阈值, 通过奖励函数,得到第二状态向量对应的奖励值;将第二状态向量和第二状态向量对应的奖励值代入关系模型对应的损失函数得到损失函数值,并基于损失函数值,对关系模型的模型参数进行更新处理。直至达到终止条件,将达到终止条件时的关系模型作为第一关系模型。Specifically, the second state vector in the training sample set is input to the relational model, and the beam switching threshold corresponding to the second state vector is determined; based on the beam switching threshold corresponding to the second state vector, Through the reward function, the reward value corresponding to the second state vector is obtained; the second state vector and the reward value corresponding to the second state vector are substituted into the loss function corresponding to the relational model to obtain the loss function value, and based on the loss function value, the relational model Model parameters are updated. Until the termination condition is reached, the relational model when the termination condition is reached is taken as the first relational model.
在一个实施例中,基于训练样本集合,对关系模型进行训练,还包括:In one embodiment, based on the training sample set, the relational model is trained, further comprising:
在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
将训练样本集合中的第二状态向量输入至更新后的关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和更新后的关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the updated relationship model;
基于损失函数值,对更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relational model are updated to obtain the updated relational model.
具体地,若第一指标为信道容量,将训练样本集合中的第二状态向量输入至关系模型,基于贪婪策略,确定第二状态向量对应的波束切换阈值;第二状态向量用于表征各训练时间段内的多个波束的平均信道质量。Specifically, if the first index is channel capacity, input the second state vector in the training sample set to the relational model, and determine the beam switching threshold corresponding to the second state vector based on the greedy strategy; the second state vector is used to represent each training Average channel quality across multiple beams over time period.
基于第二状态向量对应的波束切换阈值,通过奖励函数,得到第二状态向量对应的奖励值;其中,奖励函数如公式(1)所示:
Based on the beam switching threshold corresponding to the second state vector, the reward value corresponding to the second state vector is obtained through the reward function; wherein, the reward function is shown in formula (1):
其中,表示第二状态向量对应的波束切换阈值导致的容量变化,表示至少一个UE的信道容量C的平均值,βc为预设的惩罚因子,Hn为平均切换次数。in, Indicates that the beam switching threshold corresponding to the second state vector results in change in capacity, represents the average value of channel capacity C of at least one UE, β c is a preset penalty factor, and H n is an average number of handovers.
第一关系模型可以部署在网络节点中。The first relational model can be deployed in a network node.
在一个实施例中,将至少一个UE发送给第一网络节点的第一质量信息,发送给第二网络节点。In an embodiment, the first quality information sent by at least one UE to the first network node is sent to the second network node.
具体地,第一网络节点可以将第一质量信息作为边信息,与其他网络节点(例如,一个或多个第二网络节点)进行共享;其中,边信息是可以 在各网络节点之间进行共享的信息。Specifically, the first network node may share the first quality information as side information with other network nodes (for example, one or more second network nodes); wherein, the side information may be Information shared between network nodes.
本公开实施例中提供了另一种波束切换方法,由UE执行,该方法的流程示意图如图4所示,该方法包括:An embodiment of the present disclosure provides another beam switching method, which is performed by the UE. The flowchart of the method is shown in FIG. 4 , and the method includes:
S401,向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使第一网络节点获取第一时间段内多个波束分别对应的第一质量信息,并基于多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定UE对应的波束切换阈值。S401. Send the first quality information corresponding to the first beam in the first time period to the first network node; so that the first network node obtains the first quality information corresponding to the multiple beams in the first time period, and based on multiple The first quality information corresponding to the beams and the average channel quality corresponding to the multiple beams in at least one second time period determine the beam switching threshold corresponding to the UE.
S402,接收第一网络节点发送的UE对应的波束切换阈值,并基于波束切换阈值进行波束切换。S402. Receive a beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
需要说明的是,若UE测量到除服务波束之外的其他波束比服务波束(例如第一波束)的质量更好,则可以基于波束切换阈值,触发波束切换,使UE从服务波束切换到其他波束。It should be noted that if the UE measures that the quality of other beams other than the serving beam is better than that of the serving beam (such as the first beam), beam switching may be triggered based on the beam switching threshold, so that the UE switches from the serving beam to other beams. beam.
本公开实施例中,实现了对UE对应的波束切换阈值进行动态地调整,从而在能够保持系统性能的同时,也可以保证适当的波束切换频率。In the embodiments of the present disclosure, it is realized that the beam switching threshold corresponding to the UE is dynamically adjusted, so that while maintaining system performance, an appropriate beam switching frequency can also be ensured.
通过如下实施例来对本公开上述实施例的波束切换方法进行全面详尽的介绍:The beam switching method in the above-mentioned embodiments of the present disclosure is introduced comprehensively and in detail through the following embodiments:
本公开实施例中提供了波束切换场景下的强化学习,该强化学习的流程示意图如图5所示,包括:In the embodiments of the present disclosure, reinforcement learning in the beam switching scenario is provided. The flow diagram of the reinforcement learning is shown in FIG. 5 , including:
S501,预先设置波束切换阈值。S501. Preset a beam switching threshold.
具体地,波束切换阈值可以包括TTT、HM等。Specifically, the beam switching threshold may include TTT, HM, and the like.
S502,构建训练样本集合。S502. Construct a training sample set.
具体地,训练样本集合中的每个状态向量可以用于表征一个训练时间段内的多个波束的平均信道质量。Specifically, each state vector in the training sample set can be used to represent the average channel quality of multiple beams within a training period.
S503,预先设置奖励函数。S503, preset a reward function.
具体地,设置如公式(1)所示的奖励函数。Specifically, a reward function as shown in formula (1) is set.
S504,基于训练样本集合,训练关系模型,得到训练后的关系模型。S504. Based on the training sample set, train the relationship model to obtain the trained relationship model.
具体地,训练后的关系模型为第一关系模型。根据Q-learning算法,基于训练样本集合,训练关系模型。关系模型可以为表格,该表格可以是Q-learning算法中的Q-table(Q表),该表格可以用于表征状态向量、奖 励值和波束切换阈值之间的关系;将训练样本集合中的状态向量输入至表格,确定状态向量对应的波束切换阈值;基于状态向量对应的波束切换阈值,通过奖励函数,得到状态向量对应的奖励值;将状态向量和状态向量对应的奖励值代入表格对应的损失函数得到损失函数值,并基于损失函数值,对表格的模型参数进行更新处理。重复执行以下步骤:将训练样本集合中的状态向量输入至表格,确定状态向量对应的波束切换阈值;基于状态向量对应的波束切换阈值,通过奖励函数,得到状态向量对应的奖励值;将状态向量和状态向量对应的奖励值代入表格对应的损失函数得到损失函数值,并基于损失函数值,对表格的模型参数进行更新处理。直至达到终止条件,将达到终止条件时的表格作为第一表格(训练后的关系模型);其中,终止条件可以为损失函数值小于或等于损失函数值阈值。Specifically, the trained relational model is the first relational model. According to the Q-learning algorithm, based on the training sample set, the relational model is trained. The relational model can be a table, and the table can be a Q-table (Q table) in the Q-learning algorithm, and the table can be used to represent the state vector, award The relationship between the excitation value and the beam switching threshold; input the state vector in the training sample set to the table, and determine the beam switching threshold corresponding to the state vector; based on the beam switching threshold corresponding to the state vector, through the reward function, get the corresponding Reward value; Substitute the state vector and the reward value corresponding to the state vector into the loss function corresponding to the table to obtain the loss function value, and update the model parameters of the table based on the loss function value. Repeat the following steps: input the state vector in the training sample set to the table, and determine the beam switching threshold corresponding to the state vector; based on the beam switching threshold corresponding to the state vector, obtain the reward value corresponding to the state vector through the reward function; The reward value corresponding to the state vector is substituted into the loss function corresponding to the table to obtain the loss function value, and based on the loss function value, the model parameters of the table are updated. Until the termination condition is reached, the table when the termination condition is reached is used as the first table (relational model after training); wherein, the termination condition can be that the loss function value is less than or equal to the loss function value threshold.
S505,将训练后的关系模型部署在网络节点中。S505. Deploy the trained relationship model on the network nodes.
具体地,训练后的关系模型可以为第一表格,将该第一表格部署在网络节点中。Specifically, the trained relationship model may be the first table, and the first table is deployed in the network nodes.
本公开实施例中提供了又一种波束切换方法,该方法的流程示意图如图6所示,该方法包括:Another beam switching method is provided in the embodiment of the present disclosure. The schematic flowchart of the method is shown in FIG. 6 , and the method includes:
S601,网络节点发送用于波束扫描的参考信号给UE。S601, the network node sends a reference signal for beam scanning to the UE.
具体地,参考信号可以是SSB(Synchronization Signal Block,同步信号块)。Specifically, the reference signal may be an SSB (Synchronization Signal Block, synchronization signal block).
S602,UE进行波束测量,得到服务波束对应的质量信息。S602. The UE performs beam measurement to obtain quality information corresponding to the serving beam.
S603,UE将服务波束对应的质量信息和候选波束对应的质量信息反馈给网络节点。S603, the UE feeds back the quality information corresponding to the serving beam and the quality information corresponding to the candidate beam to the network node.
S604,网络节点将服务波束对应的质量信息作为边信息,并将该边信息与其他网络节点进行共享。S604. The network node uses the quality information corresponding to the serving beam as side information, and shares the side information with other network nodes.
需要说明的是,若没有其他网络节点,则不需要步骤S604;服务波束对应的质量信息可以是SINR、RSRP或RSRQ。It should be noted that, if there are no other network nodes, step S604 is not required; the quality information corresponding to the serving beam may be SINR, RSRP or RSRQ.
S605,网络节点基于全部边信息,确定状态向量,并基于强化学习训练得到的关系模型,查找UE对应的波束切换阈值。S605. The network node determines a state vector based on all side information, and searches for a beam switching threshold corresponding to the UE based on a relationship model obtained through reinforcement learning training.
具体地,网络节点存储全部边信息,全部边信息为服务波束对应的边 信息,以及其他网络节点向网络节点发送的边信息。Specifically, the network node stores all edge information, and all edge information is the edge corresponding to the serving beam information, and side information sent by other network nodes to network nodes.
S606,网络节点基于波束切换阈值,进行波束切换决策;若网络节点确定UE从服务波束切换到其他波束,则网络节点指示UE从服务波束切换到其他波束,并转到步骤S607执行。S606, the network node makes a beam switching decision based on the beam switching threshold; if the network node determines that the UE switches from the serving beam to other beams, the network node instructs the UE to switch from the serving beam to other beams, and proceeds to step S607 for execution.
具体地,其他波束可以是候选波束。Specifically, other beams may be candidate beams.
S607,UE进行波束切换。S607, the UE performs beam switching.
本公开实施例中提供了又一种波束切换方法,该方法应用在多AP毫米波网络场景,该场景的拓扑结构为:在半径为30m的场景下,均匀部署3个AP和7个用户;其中,这7个用户中的3个用户以5km/h(公里/时)的运动速度沿某路径进行直线运动,这3个AP中的每个AP有32个波束,采用混合波束成形技术,波束扫描的周期设定为10ms,当UE满足波束切换的条件时进行切换。该方法的流程示意图如图7所示,该方法包括:The embodiment of the present disclosure provides yet another beam switching method, which is applied in a multi-AP millimeter wave network scenario, and the topology of the scenario is: in a scenario with a radius of 30m, 3 APs and 7 users are evenly deployed; Among them, 3 of the 7 users move along a certain path at a speed of 5 km/h (kilometer/hour), and each of the 3 APs has 32 beams, using hybrid beamforming technology, The cycle of beam scanning is set to 10 ms, and the UE performs beam switching when it meets the conditions for beam switching. A schematic flow chart of the method is shown in Figure 7, and the method includes:
S701,网络节点通过贪婪策略,确定波束切换阈值。S701. The network node determines a beam switching threshold through a greedy strategy.
具体地,波束切换阈值的可选项:(HM=1dB和TTT=100ms)、(HM=1dB和TTT=100ms)、(HM=3dB和TTT=100ms)、(HM=5dB和TTT=100ms)、(HM=7dB和TTT=100ms)等。Specifically, the options for the beam switching threshold: (HM=1dB and TTT=100ms), (HM=1dB and TTT=100ms), (HM=3dB and TTT=100ms), (HM=5dB and TTT=100ms), (HM=7dB and TTT=100ms) and so on.
S702,UE基于波束切换阈值,执行波束切换。S702. The UE performs beam switching based on the beam switching threshold.
S703,网络节点通过滑动窗口,确定第二状态向量。S703. The network node determines the second state vector through the sliding window.
例如,滑动窗口包括5个时间窗口,即滑动窗口的长度为5个,这5个时间窗口分别是第一个时间窗口、第二个时间窗口、第三个时间窗口、第四个时间窗口和第五个时间窗口,第一个时间窗口的时间长度为10ms,第二个时间窗口、第三个时间窗口、第四个时间窗口和第五个时间窗口的时间长度都为40ms。For example, the sliding window includes 5 time windows, that is, the length of the sliding window is 5, and these 5 time windows are respectively the first time window, the second time window, the third time window, the fourth time window and For the fifth time window, the time length of the first time window is 10ms, and the time lengths of the second time window, the third time window, the fourth time window and the fifth time window are all 40ms.
S704,网络节点通过奖励函数,更新Q表。S704, the network node updates the Q table through the reward function.
具体地,根据公式(1)所示的奖励函数进行强化学习训练,即进行关系模型的训练,该关系模型为Q表,例如,其中,奖励函数中惩罚因子βc可以设定为0.75。Specifically, the reinforcement learning training is performed according to the reward function shown in formula (1), that is, the training of the relationship model is performed, and the relationship model is a Q table. For example, the penalty factor β c in the reward function can be set to 0.75.
S705,网络节点将更新后的Q表部署到网络节点中;分别转到步骤 S701和S706执行。S705, the network node deploys the updated Q table to the network node; respectively go to step S701 and S706 are executed.
S706,网络节点确定第一状态向量,通过查Q表,得到波束切换阈值。S706. The network node determines the first state vector, and obtains the beam switching threshold by looking up the Q table.
具体地,第一状态向量可以为(b1,b2,b3,…bn)。b1,b2,b3,…bn的值可以用于表征信号强度等级,信号强度等级可以为1、2、3、…m等,例如,信号强度等级取值为1、2、3、4和5,即信号强度等级共有5个等级。Specifically, the first state vector may be (b 1 , b 2 , b 3 , . . . b n ). The values of b 1 , b 2 , b 3 ,...b n can be used to characterize the signal strength level, and the signal strength level can be 1, 2, 3,...m, etc., for example, the signal strength level can be 1, 2, 3 , 4 and 5, that is, there are 5 levels of signal strength.
S707,网络节点进行波束切换判定。S707, the network node performs a beam switching decision.
S708,网络节点进行周期性波束扫描。S708. The network node performs periodic beam scanning.
S709,UE进行波束测量。S709, the UE performs beam measurement.
S710,UE反馈波束对应的质量信息;转到步骤S706执行。S710, the UE feeds back the quality information corresponding to the beam; go to step S706 for execution.
具体地,质量信息可以为SINR、RSRP或RSRQ。Specifically, the quality information may be SINR, RSRP or RSRQ.
需要说明的是,步骤S701-S705为强化学习训练中的步骤,每次强化学习训练的更新间隔T可以为10ms(毫秒);步骤S706-S710为线上执行中的步骤。It should be noted that steps S701-S705 are steps in reinforcement learning training, and the update interval T of each reinforcement learning training may be 10 ms (milliseconds); steps S706-S710 are steps in online execution.
在一个实施例中,如图8和图9所示(图9为图8的放大图),基于固定阈值,服务波束对应的SINR在20s(秒)时间内的CDF(Cumulative Distribution Function,累积分布函数)统计结果;本公开基于强化学习(例如Q-learning)的波束切换方法,服务波束对应的SINR在20s时间内的CDF统计结果;本公开基于强化学习的波束切换在性能方面较优。In one embodiment, as shown in Figure 8 and Figure 9 (Figure 9 is an enlarged view of Figure 8), based on a fixed threshold, the CDF (Cumulative Distribution Function, Cumulative Distribution Function) of the SINR corresponding to the serving beam within 20s (seconds) function) statistical results; this disclosure is based on the beam switching method of reinforcement learning (such as Q-learning), and the CDF statistical results of the SINR corresponding to the serving beam within 20s; the performance of the beam switching based on reinforcement learning in this disclosure is better.
在一个实施例中,如图10所示,本公开基于强化学习(例如Q-learning)的波束切换,发生波束切换的次数是最少的。In one embodiment, as shown in FIG. 10 , the present disclosure is based on reinforcement learning (such as Q-learning) beam switching, and the number of beam switching occurrences is the least.
基于相同的发明构思,本公开实施例还提供了一种波束切换装置,应用于第一网络节点,该装置的结构示意图如图11所示,收发机1300,用于在处理器1310的控制下接收和发送数据。Based on the same inventive concept, the embodiment of the present disclosure also provides a beam switching device, which is applied to the first network node. The structural diagram of the device is shown in FIG. Receive and send data.
其中,在图11中,总线架构可以包括任意数量的互联的总线和桥,具体由处理器1310代表的一个或多个处理器和存储器1320代表的存储器的各种电路链接在一起。总线架构还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路链接在一起,这些都是本领域所公知的, 因此,本文不再对其进行进一步描述。总线接口提供接口。收发机1300可以是多个元件,即包括发送机和接收机,提供用于在传输介质上与各种其他装置通信的单元,这些传输介质包括无线信道、有线信道、光缆等传输介质。处理器1310负责管理总线架构和通常的处理,存储器1320用于存储计算机程序,例如可以存储处理器1310在执行操作时所使用的数据。Wherein, in FIG. 11 , the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by the processor 1310 and various circuits of the memory represented by the memory 1320 are linked together. The bus architecture can also link together various other circuits such as peripherals, voltage regulators, and power management circuits, all of which are well known in the art, Therefore, it is not described further herein. The bus interface provides the interface. The transceiver 1300 may be a plurality of elements, including a transmitter and a receiver, providing a unit for communicating with various other devices over transmission media, including wireless channels, wired channels, optical cables, and other transmission media. The processor 1310 is responsible for managing the bus architecture and general processing, and the memory 1320 is used to store computer programs, such as data used by the processor 1310 when performing operations.
处理器1310可以是中央处埋器(CPU)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或复杂可编程逻辑器件(Complex Programmable Logic Device,CPLD),处理器也可以采用多核架构。The processor 1310 may be a central processing device (CPU), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field programmable gate array (Field-Programmable Gate Array, FPGA) or a complex programmable logic device (Complex Programmable Logic Device , CPLD), the processor can also adopt a multi-core architecture.
处理器1310,用于读取存储器中的计算机程序并执行以下操作:Processor 1310, configured to read the computer program in the memory and perform the following operations:
获取第一时间段内多个第一波束分别对应的第一质量信息,多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给第一网络节点的第一质量信息;Acquiring first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes first quality information sent to the first network node by at least one user equipment UE;
基于多个第一波束分别对应的第一质量信息,确定多个第一波束对应的第一平均信道质量;Based on the first quality information respectively corresponding to the multiple first beams, determine the first average channel quality corresponding to the multiple first beams;
基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;至少一个第二时间段在第一时间段之前;Based on the first average channel quality corresponding to a plurality of first beams, and the second average channel quality corresponding to a plurality of second beams in at least one second time period, determine the first state vector; at least one second time period in the first before the time period;
基于第一状态向量,确定至少一个UE对应的波束切换阈值。Based on the first state vector, determine a beam switching threshold corresponding to at least one UE.
在一个实施例中,获取第一时间段内多个第一波束分别对应的第一质量信息,包括:In an embodiment, obtaining first quality information respectively corresponding to a plurality of first beams within a first time period includes:
获取第一时间段内第二网络节点发送的第一质量信息;并接收至少一个UE发送的第一质量信息;Acquire first quality information sent by the second network node within the first time period; and receive first quality information sent by at least one UE;
其中,第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
在一个实施例中,基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量,包括:In an embodiment, the first state vector is determined based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period, including:
将第一平均信道质量进行量化映射处理,得到第一平均信道质量对应 的第一信息向量的值;并将第二平均信道质量进行量化映射处理,得到第二平均信道质量对应的第二信息向量的值;The first average channel quality is quantized and mapped to obtain the first average channel quality corresponding to The value of the first information vector; and the second average channel quality is quantized and mapped to obtain the value of the second information vector corresponding to the second average channel quality;
基于第一信息向量和至少一个第二信息向量,得到第一状态向量。Based on the first information vector and at least one second information vector, a first state vector is obtained.
在一个实施例中,基于第一状态向量,确定至少一个UE对应的波束切换阈值,包括:In an embodiment, based on the first state vector, determining a beam switching threshold corresponding to at least one UE includes:
将第一状态向量输入至第一关系模型,进行匹配处理,得到与第一状态向量相匹配的一个或多个奖励值,并将一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为UE对应的波束切换阈值;Input the first state vector into the first relationship model, perform matching processing, obtain one or more reward values matching the first state vector, and set the beam switching threshold corresponding to the maximum reward value among the one or more reward values , determined as the beam switching threshold corresponding to the UE;
其中,第一关系模型用于表征第一状态向量、奖励值和波束切换阈值之间的关系。Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
在一个实施例中,第一关系模型通过至少一个UE的第一指标变化对应的第一参数,以及波束切换代价对应的第二参数确定。In an embodiment, the first relationship model is determined by a first parameter corresponding to a first index change of at least one UE, and a second parameter corresponding to a beam switching cost.
在一个实施例中,第一关系模型是通过以下方式训练得到的:In one embodiment, the first relational model is trained by:
构建训练样本集合;基于训练样本集合,对关系模型进行训练,得到第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain a first relational model;
基于训练样本集合,对关系模型进行训练,至少包括:Based on the training sample set, the relational model is trained, at least including:
将训练样本集合中的第二状态向量输入至关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the relational model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
基于损失函数值,对关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relational model are updated to obtain the updated relational model.
在一个实施例中,基于训练样本集合,对关系模型进行训练,还包括:In one embodiment, based on the training sample set, the relational model is trained, further comprising:
在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
将训练样本集合中的第二状态向量输入至更新后的关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向 量对应的奖励值;Based on the beam switching threshold and reward function corresponding to the second state vector, determine the direction of the second state to The reward value corresponding to the amount;
基于第二状态向量、第二状态向量对应的奖励值和更新后的关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the updated relationship model;
基于损失函数值,对更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relational model are updated to obtain the updated relational model.
在一个实施例中,在基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值之后,还包括:In one embodiment, after determining the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model, it further includes:
判断是否达到终止条件;Determine whether the termination condition is met;
若确定达到终止条件,则得到第一关系模型;If it is determined that the termination condition is met, the first relational model is obtained;
终止条件为以下中的一项:The termination condition is one of the following:
损失函数值小于或等于损失函数值阈值;或者,The loss function value is less than or equal to the loss function value threshold; or,
损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to the loss function value threshold.
在一个实施例中,将至少一个UE发送给第一网络节点的第一质量信息,发送给第二网络节点。In an embodiment, the first quality information sent by at least one UE to the first network node is sent to the second network node.
在此需要说明的是,本发明实施例提供的上述装置,能够实现上述方法实施例所实现的所有方法步骤,且能够达到相同的技术效果,在此不再对本实施例中与方法实施例相同的部分及有益效果进行具体赘述。It should be noted here that the above-mentioned device provided by the embodiment of the present invention can realize all the method steps realized by the above-mentioned method embodiment, and can achieve the same technical effect. The part and the beneficial effect are described in detail.
基于相同的发明构思,本公开实施例还提供了一种波束切换装置,应用于UE,该装置的结构示意图如图12所示,收发机1400,用于在处理器1410的控制下接收和发送数据。Based on the same inventive concept, an embodiment of the present disclosure also provides a beam switching device, which is applied to a UE. The structural diagram of the device is shown in FIG. data.
其中,在图12中,总线架构可以包括任意数量的互联的总线和桥,具体由处理器1410代表的一个或多个处理器和存储器1420代表的存储器的各种电路链接在一起。总线架构还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路链接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口提供接口。收发机1400可以是多个元件,即包括发送机和接收机,提供用于在传输介质上与各种其他装置通信的单元,这些传输介质包括,这些传输介质包括无线信道、有线信道、光缆等传输介质。针对不同的用户设备,用户接口1430还可以是能够外接内接需要设备的接口,连接的设备包括但不限于小键盘、显 示器、扬声器、麦克风、操纵杆等。Wherein, in FIG. 12 , the bus architecture may include any number of interconnected buses and bridges, specifically one or more processors represented by the processor 1410 and various circuits of the memory represented by the memory 1420 are linked together. The bus architecture can also link together various other circuits such as peripherals, voltage regulators, and power management circuits, etc., which are well known in the art and therefore will not be further described herein. The bus interface provides the interface. Transceiver 1400 may be a plurality of elements, including transmitters and receivers, providing means for communicating with various other devices over transmission media, including wireless channels, wired channels, fiber optic cables, etc. Transmission medium. For different user equipments, the user interface 1430 can also be an interface capable of connecting externally and internally to required devices, and the connected devices include but are not limited to keypads, display screens, etc. monitors, speakers, microphones, joysticks, etc.
处理器1410负责管理总线架构和通常的处理,存储器1420可以存储处理器1410在执行操作时所使用的数据。The processor 1410 is responsible for managing the bus architecture and general processing, and the memory 1420 can store data used by the processor 1410 when performing operations.
可选的,处理器1410可以是CPU(中央处埋器)、ASIC(Application Specific Integrated Circuit,专用集成电路)、FPGA(Field-Programmable Gate Array,现场可编程门阵列)或CPLD(Complex Programmable Logic Device,复杂可编程逻辑器件),处理器也可以采用多核架构。Optionally, the processor 1410 can be a CPU (central device), ASIC (Application Specific Integrated Circuit, application specific integrated circuit), FPGA (Field-Programmable Gate Array, field programmable gate array) or CPLD (Complex Programmable Logic Device , complex programmable logic device), the processor can also adopt a multi-core architecture.
处理器通过调用存储器存储的计算机程序,用于按照获得的可执行指令执行本公开实施例提供的第二方面的方法。处理器与存储器也可以物理上分开布置。The processor is used to execute the method of the second aspect provided by the embodiments of the present disclosure according to the obtained executable instruction by calling the computer program stored in the memory. The processor and memory may also be physically separated.
处理器1410,用于读取存储器1420中的计算机程序并执行以下操作:Processor 1410, configured to read the computer program in memory 1420 and perform the following operations:
向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使第一网络节点获取第一时间段内多个波束分别对应的第一质量信息,并基于多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定UE对应的波束切换阈值;Sending the first quality information corresponding to the first beam in the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams in the first time period, and based on the multiple beams respectively The corresponding first quality information, and the average channel quality corresponding to multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
接收第一网络节点发送的UE对应的波束切换阈值,并基于波束切换阈值进行波束切换。Receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
在此需要说明的是,本发明实施例提供的上述装置,能够实现上述方法实施例所实现的所有方法步骤,且能够达到相同的技术效果,在此不再对本实施例中与方法实施例相同的部分及有益效果进行具体赘述。It should be noted here that the above-mentioned device provided by the embodiment of the present invention can realize all the method steps realized by the above-mentioned method embodiment, and can achieve the same technical effect. The part and the beneficial effect are described in detail.
基于前述实施例相同的发明构思,本公开实施例还提供了一种波束切换装置,应用于第一网络节点,该装置的结构示意图如图13所示,波束切换装置80,包括第一处理单元801、第二处理单元802、第三处理单元803和第四处理单元804。Based on the same inventive concept as the foregoing embodiments, the embodiments of the present disclosure also provide a beam switching device, which is applied to the first network node. The structural diagram of the device is shown in FIG. 13 . The beam switching device 80 includes a first processing unit 801. A second processing unit 802, a third processing unit 803, and a fourth processing unit 804.
第一处理单元801,用于获取第一时间段内多个第一波束分别对应的第一质量信息,多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给第一网络节点的第一质量信息;The first processing unit 801 is configured to acquire first quality information corresponding to multiple first beams respectively within a first time period, where the first quality information corresponding to multiple first beams respectively includes at least one user equipment UE sent to the first network The first quality information of the node;
第二处理单元802,用于基于多个第一波束分别对应的第一质量信息,确定多个第一波束对应的第一平均信道质量; The second processing unit 802 is configured to determine the first average channel quality corresponding to the multiple first beams based on the first quality information respectively corresponding to the multiple first beams;
第三处理单元803,用于基于多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;至少一个第二时间段在第一时间段之前;The third processing unit 803 is configured to determine the first state vector based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to the multiple second beams in at least one second time period; at least a second time period preceding the first time period;
第四处理单元804,用于基于第一状态向量,确定至少一个UE对应的波束切换阈值。The fourth processing unit 804 is configured to determine a beam switching threshold corresponding to at least one UE based on the first state vector.
在一个实施例中,第一处理单元801,具体用于:In one embodiment, the first processing unit 801 is specifically configured to:
获取第一时间段内第二网络节点发送的第一质量信息;并接收至少一个UE发送的第一质量信息;Acquire first quality information sent by the second network node within the first time period; and receive first quality information sent by at least one UE;
其中,第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one item of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
在一个实施例中,第三处理单元803,具体用于:In one embodiment, the third processing unit 803 is specifically configured to:
将第一平均信道质量进行量化映射处理,得到第一平均信道质量对应的第一信息向量的值;并将第二平均信道质量进行量化映射处理,得到第二平均信道质量对应的第二信息向量的值;Performing quantization mapping processing on the first average channel quality to obtain the value of the first information vector corresponding to the first average channel quality; performing quantization mapping processing on the second average channel quality to obtain a second information vector corresponding to the second average channel quality value;
基于第一信息向量和至少一个第二信息向量,得到第一状态向量。Based on the first information vector and at least one second information vector, a first state vector is obtained.
在一个实施例中,第四处理单元804,具体用于:In one embodiment, the fourth processing unit 804 is specifically configured to:
将第一状态向量输入至第一关系模型,进行匹配处理,得到与第一状态向量相匹配的一个或多个奖励值,并将一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为UE对应的波束切换阈值;Input the first state vector into the first relationship model, perform matching processing, obtain one or more reward values matching the first state vector, and set the beam switching threshold corresponding to the maximum reward value among the one or more reward values , determined as the beam switching threshold corresponding to the UE;
其中,第一关系模型用于表征第一状态向量、奖励值和波束切换阈值之间的关系。Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
在一个实施例中,第一关系模型通过至少一个UE的第一指标变化对应的第一参数,以及波束切换代价对应的第二参数确定。In an embodiment, the first relationship model is determined by a first parameter corresponding to a first index change of at least one UE, and a second parameter corresponding to a beam switching cost.
在一个实施例中,第一关系模型是通过以下方式训练得到的:In one embodiment, the first relational model is trained by:
构建训练样本集合;基于训练样本集合,对关系模型进行训练,得到第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain a first relational model;
基于训练样本集合,对关系模型进行训练,至少包括:Based on the training sample set, the relational model is trained, at least including:
将训练样本集合中的第二状态向量输入至关系模型,确定第二状态向量对应的波束切换阈值; Inputting the second state vector in the training sample set to the relational model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
基于损失函数值,对关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relational model are updated to obtain the updated relational model.
在一个实施例中,基于训练样本集合,对关系模型进行训练,还包括:In one embodiment, based on the training sample set, the relational model is trained, further comprising:
在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
将训练样本集合中的第二状态向量输入至更新后的关系模型,确定第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
基于第二状态向量对应的波束切换阈值和奖励函数,确定第二状态向量对应的奖励值;Determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
基于第二状态向量、第二状态向量对应的奖励值和更新后的关系模型对应的损失函数,确定损失函数值;Determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the updated relationship model;
基于损失函数值,对更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relational model are updated to obtain the updated relational model.
在一个实施例中,在基于第二状态向量、第二状态向量对应的奖励值和关系模型对应的损失函数,确定损失函数值之后,还包括:In one embodiment, after determining the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model, it further includes:
判断是否达到终止条件;Determine whether the termination condition is met;
若确定达到终止条件,则得到第一关系模型;If it is determined that the termination condition is met, the first relational model is obtained;
终止条件为以下中的一项:The termination condition is one of the following:
损失函数值小于或等于损失函数值阈值;或者,The loss function value is less than or equal to the loss function value threshold; or,
损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to the loss function value threshold.
在一个实施例中,第一处理单元801,还用于:In one embodiment, the first processing unit 801 is further configured to:
将至少一个UE发送给第一网络节点的第一质量信息,发送给第二网络节点。The first quality information sent by at least one UE to the first network node is sent to the second network node.
在此需要说明的是,本发明实施例提供的上述装置,能够实现上述方法实施例所实现的所有方法步骤,且能够达到相同的技术效果,在此不再对本实施例中与方法实施例相同的部分及有益效果进行具体赘述。 It should be noted here that the above-mentioned device provided by the embodiment of the present invention can realize all the method steps realized by the above-mentioned method embodiment, and can achieve the same technical effect. The part and the beneficial effect are described in detail.
基于前述实施例相同的发明构思,本公开实施例还提供了一种波束切换装置,应用于UE,该装置的结构示意图如图14所示,波束切换装置90,包括第五处理单元901和第六处理单元902。Based on the same inventive concept as the foregoing embodiments, the embodiments of the present disclosure also provide a beam switching device, which is applied to a UE. The structural diagram of the device is shown in FIG. Six processing units 902 .
第五处理单元901,用于向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使第一网络节点获取第一时间段内多个波束分别对应的第一质量信息,并基于多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定UE对应的波束切换阈值;The fifth processing unit 901 is configured to send the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the first quality information corresponding to the multiple beams within the first time period information, and based on the first quality information respectively corresponding to the multiple beams, and the average channel quality corresponding to the multiple beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
第六处理单元902,用于接收第一网络节点发送的UE对应的波束切换阈值,并基于波束切换阈值进行波束切换。The sixth processing unit 902 is configured to receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
在此需要说明的是,本发明实施例提供的上述装置,能够实现上述方法实施例所实现的所有方法步骤,且能够达到相同的技术效果,在此不再对本实施例中与方法实施例相同的部分及有益效果进行具体赘述。It should be noted here that the above-mentioned device provided by the embodiment of the present invention can realize all the method steps realized by the above-mentioned method embodiment, and can achieve the same technical effect. The part and the beneficial effect are described in detail.
需要说明的是,本申请实施例中对单元的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。It should be noted that the division of units in the embodiment of the present application is schematic, and is only a logical function division, and there may be another division manner in actual implementation. In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个处理器可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a processor-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or part of the contribution to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor (processor) execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disc and other media that can store program codes. .
基于相同的发明构思,本公开实施例还提供了一种处理器可读存储介 质,存储有计算机程序,该计算机程序用于被处理器执行时实现本公开实施例中任意一个实施例或任意一种可选实施方式提供的任意一种小区切换的时间确定方法的步骤。Based on the same inventive concept, the embodiment of the present disclosure also provides a processor-readable storage medium The essence is to store a computer program, and the computer program is used to implement the steps of any method for determining the time of cell handover provided by any one of the embodiments of the present disclosure or any one of the optional implementations when executed by a processor.
处理器可读存储介质可以是处理器能够存取的任何可用介质或数据存储设备,包括但不限于磁性存储器(例如软盘、硬盘、磁带、磁光盘(MO)等)、光学存储器(例如CD、DVD、BD、HVD等)、以及半导体存储器(例如ROM、EPROM、EEPROM、非易失性存储器(NAND FLASH)、固态硬盘(SSD))等。The processor-readable storage medium can be any available medium or data storage device that can be accessed by the processor, including but not limited to magnetic storage (e.g., floppy disk, hard disk, tape, magneto-optical disk (MO), etc.), optical storage (e.g., CD, DVD, BD, HVD, etc.), and semiconductor memory (such as ROM, EPROM, EEPROM, non-volatile memory (NAND FLASH), solid-state drive (SSD)), etc.
本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器和光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, optical storage, etc.) having computer-usable program code embodied therein.
本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机可执行指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机可执行指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowcharts and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It should be understood that each procedure and/or block in the flowchart and/or block diagrams, and combinations of procedures and/or blocks in the flowchart and/or block diagrams can be implemented by computer-executable instructions. These computer-executable instructions can be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine, such that instructions executed by the processor of the computer or other programmable data processing equipment produce Means for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.
这些处理器可执行指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的处理器可读存储器中,使得存储在该处理器可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These processor-executable instructions may also be stored in a processor-readable memory capable of directing a computer or other programmable data processing device to operate in a specific manner, such that the instructions stored in the processor-readable memory produce a manufacturing product, the instruction device realizes the functions specified in one or more procedures of the flow chart and/or one or more blocks of the block diagram.
这些处理器可执行指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现 在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These processor-executable instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented Execution instructions are provided for implementing A step of a function specified in a flow chart process or processes and/or a block diagram block or blocks.
显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的精神和范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。 Obviously, those skilled in the art can make various changes and modifications to the application without departing from the spirit and scope of the application. In this way, if these modifications and variations of the present application fall within the scope of the claims of the present application and their equivalent technologies, the present application is also intended to include these modifications and variations.

Claims (31)

  1. 一种波束切换方法,由第一网络节点执行,其中,包括:A beam switching method, performed by a first network node, including:
    获取第一时间段内多个第一波束分别对应的第一质量信息,所述多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给所述第一网络节点的第一质量信息;Acquire first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes the first quality information sent by at least one user equipment UE to the first network node. quality information;
    基于所述多个第一波束分别对应的第一质量信息,确定所述多个第一波束对应的第一平均信道质量;Based on the first quality information respectively corresponding to the multiple first beams, determine a first average channel quality corresponding to the multiple first beams;
    基于所述多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;所述至少一个第二时间段在所述第一时间段之前;Based on the first average channel quality corresponding to the plurality of first beams and the second average channel quality corresponding to the plurality of second beams in at least one second time period, determine the first state vector; the at least one second time period period is before said first time period;
    基于所述第一状态向量,确定所述至少一个UE对应的波束切换阈值。Based on the first state vector, determine a beam switching threshold corresponding to the at least one UE.
  2. 根据权利要求1所述的方法,其中,所述获取第一时间段内多个第一波束分别对应的第一质量信息,包括:The method according to claim 1, wherein said obtaining first quality information respectively corresponding to a plurality of first beams within a first time period comprises:
    获取第一时间段内第二网络节点发送的第一质量信息;并接收所述至少一个UE发送的第一质量信息;Acquiring the first quality information sent by the second network node within the first time period; and receiving the first quality information sent by the at least one UE;
    其中,所述第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  3. 根据权利要求1所述的方法,其中,所述基于所述多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量,包括:The method according to claim 1, wherein, based on the first average channel quality corresponding to the plurality of first beams and the second average channel quality corresponding to a plurality of second beams in at least one second time period, Determine the first state vector, including:
    将所述第一平均信道质量进行量化映射处理,得到所述第一平均信道质量对应的第一信息向量的值;并将所述第二平均信道质量进行量化映射处理,得到所述第二平均信道质量对应的第二信息向量的值;performing quantization and mapping processing on the first average channel quality to obtain the value of the first information vector corresponding to the first average channel quality; and performing quantization mapping processing on the second average channel quality to obtain the second average The value of the second information vector corresponding to the channel quality;
    基于所述第一信息向量和至少一个所述第二信息向量,得到第一状态向量。A first state vector is obtained based on the first information vector and at least one of the second information vectors.
  4. 根据权利要求1所述的方法,其中,所述基于所述第一状态向量,确定所述至少一个UE对应的波束切换阈值,包括:The method according to claim 1, wherein the determining the beam switching threshold corresponding to the at least one UE based on the first state vector comprises:
    将所述第一状态向量输入至第一关系模型,进行匹配处理,得到与所 述第一状态向量相匹配的一个或多个奖励值,并将所述一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为所述UE对应的波束切换阈值;Input the first state vector into the first relational model, perform matching processing, and obtain the one or more reward values that match the first state vector, and determine a beam switching threshold corresponding to a maximum reward value among the one or more reward values as the beam switching threshold corresponding to the UE;
    其中,所述第一关系模型用于表征所述第一状态向量、奖励值和波束切换阈值之间的关系。Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  5. 根据权利要求4所述的方法,其中,所述第一关系模型通过所述至少一个UE的第一指标变化对应的第一参数,以及波束切换代价对应的第二参数确定。The method according to claim 4, wherein the first relationship model is determined by a first parameter corresponding to a first index change of the at least one UE and a second parameter corresponding to a beam switching cost.
  6. 根据权利要求4所述的方法,其中,所述第一关系模型是通过以下方式训练得到的:The method according to claim 4, wherein the first relational model is obtained through training in the following manner:
    构建训练样本集合;基于所述训练样本集合,对关系模型进行训练,得到所述第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain the first relational model;
    所述基于所述训练样本集合,对关系模型进行训练,至少包括:The training of the relational model based on the training sample set includes at least:
    将所述训练样本集合中的第二状态向量输入至所述关系模型,确定所述第二状态向量对应的波束切换阈值;inputting a second state vector in the training sample set to the relationship model, and determining a beam switching threshold corresponding to the second state vector;
    基于所述第二状态向量对应的波束切换阈值和奖励函数,确定所述第二状态向量对应的奖励值;determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
    基于所述第二状态向量、所述第二状态向量对应的奖励值和所述关系模型对应的损失函数,确定损失函数值;determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
    基于所述损失函数值,对所述关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relationship model are updated to obtain an updated relationship model.
  7. 根据权利要求6所述的方法,其中,所述基于所述训练样本集合,对关系模型进行训练,还包括:The method according to claim 6, wherein said training a relational model based on said training sample set further comprises:
    在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
    将所述训练样本集合中的第二状态向量输入至更新后的关系模型,确定所述第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
    基于所述第二状态向量对应的波束切换阈值和奖励函数,确定所述第二状态向量对应的奖励值;determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
    基于第二状态向量、所述第二状态向量对应的奖励值和所述更新后的 关系模型对应的损失函数,确定损失函数值;Based on the second state vector, the reward value corresponding to the second state vector and the updated The loss function corresponding to the relational model, and determine the value of the loss function;
    基于所述损失函数值,对所述更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relationship model are updated to obtain the updated relationship model.
  8. 根据权利要求6所述的方法,其中,在所述基于第二状态向量、所述第二状态向量对应的奖励值和所述关系模型对应的损失函数,确定损失函数值之后,还包括:The method according to claim 6, wherein, after determining the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model, further comprising:
    判断是否达到终止条件;Determine whether the termination condition is met;
    若确定达到所述终止条件,则得到所述第一关系模型;If it is determined that the termination condition is met, then obtain the first relational model;
    所述终止条件为以下中的一项:The termination condition is one of the following:
    所述损失函数值小于或等于损失函数值阈值;或者,The loss function value is less than or equal to a loss function value threshold; or,
    所述损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to a loss function value threshold.
  9. 根据权利要求1所述的方法,其中,还包括:The method according to claim 1, further comprising:
    将所述至少一个UE发送给所述第一网络节点的第一质量信息,发送给第二网络节点。Sending the first quality information sent by the at least one UE to the first network node to a second network node.
  10. 一种波束切换方法,由UE执行,其中,包括:A beam switching method, performed by a UE, including:
    向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使所述第一网络节点获取所述第一时间段内多个波束分别对应的第一质量信息,并基于所述多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定所述UE对应的波束切换阈值;Sending the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams within the first time period, and based on The first quality information corresponding to the plurality of beams, and the average channel quality corresponding to the plurality of beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
    接收所述第一网络节点发送的所述UE对应的波束切换阈值,并基于所述波束切换阈值进行波束切换。receiving the beam switching threshold corresponding to the UE sent by the first network node, and performing beam switching based on the beam switching threshold.
  11. 一种波束切换装置,应用于第一网络节点,包括存储器,收发机,处理器,其中:A beam switching device, applied to a first network node, includes a memory, a transceiver, and a processor, wherein:
    存储器,用于存储计算机程序;memory for storing computer programs;
    收发机,用于在所述处理器的控制下收发数据;a transceiver, configured to send and receive data under the control of the processor;
    处理器,用于读取所述存储器中的计算机程序并执行以下操作:a processor for reading the computer program in said memory and performing the following operations:
    获取第一时间段内多个第一波束分别对应的第一质量信息,所述多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给所述 第一网络节点的第一质量信息;Acquire first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes at least one user equipment UE sent to the first quality information of the first network node;
    基于所述多个第一波束分别对应的第一质量信息,确定所述多个第一波束对应的第一平均信道质量;Based on the first quality information respectively corresponding to the multiple first beams, determine a first average channel quality corresponding to the multiple first beams;
    基于所述多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;所述至少一个第二时间段在所述第一时间段之前;Based on the first average channel quality corresponding to the plurality of first beams and the second average channel quality corresponding to the plurality of second beams in at least one second time period, determine the first state vector; the at least one second time period period is before said first time period;
    基于所述第一状态向量,确定所述至少一个UE对应的波束切换阈值。Based on the first state vector, determine a beam switching threshold corresponding to the at least one UE.
  12. 根据权利要求11所述的装置,其中,所述获取第一时间段内多个第一波束分别对应的第一质量信息,包括:The device according to claim 11, wherein said acquiring first quality information respectively corresponding to a plurality of first beams within a first time period comprises:
    获取第一时间段内第二网络节点发送的第一质量信息;并接收所述至少一个UE发送的第一质量信息;Acquiring the first quality information sent by the second network node within the first time period; and receiving the first quality information sent by the at least one UE;
    其中,所述第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  13. 根据权利要求11所述的装置,其中,所述基于所述多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量,包括:The apparatus according to claim 11, wherein, based on the first average channel quality corresponding to the multiple first beams and the second average channel quality corresponding to multiple second beams in at least one second time period, Determine the first state vector, including:
    将所述第一平均信道质量进行量化映射处理,得到所述第一平均信道质量对应的第一信息向量的值;并将所述第二平均信道质量进行量化映射处理,得到所述第二平均信道质量对应的第二信息向量的值;performing quantization and mapping processing on the first average channel quality to obtain the value of the first information vector corresponding to the first average channel quality; and performing quantization mapping processing on the second average channel quality to obtain the second average The value of the second information vector corresponding to the channel quality;
    基于所述第一信息向量和至少一个所述第二信息向量,得到第一状态向量。A first state vector is obtained based on the first information vector and at least one of the second information vectors.
  14. 根据权利要求11所述的装置,其中,所述基于所述第一状态向量,确定所述至少一个UE对应的波束切换阈值,包括:The apparatus according to claim 11, wherein the determining the beam switching threshold corresponding to the at least one UE based on the first state vector comprises:
    将所述第一状态向量输入至第一关系模型,进行匹配处理,得到与所述第一状态向量相匹配的一个或多个奖励值,并将所述一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为所述UE对应的波束切换阈值;Inputting the first state vector into the first relational model, performing matching processing, obtaining one or more reward values matching the first state vector, and rewarding the largest reward among the one or more reward values The beam switching threshold corresponding to the value is determined as the beam switching threshold corresponding to the UE;
    其中,所述第一关系模型用于表征所述第一状态向量、奖励值和波束切换阈值之间的关系。 Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  15. 根据权利要求14所述的装置,其中,所述第一关系模型通过所述至少一个UE的第一指标变化对应的第一参数,以及波束切换代价对应的第二参数确定。The apparatus according to claim 14, wherein the first relationship model is determined by a first parameter corresponding to a first index change of the at least one UE and a second parameter corresponding to a beam switching cost.
  16. 根据权利要求14所述的装置,其中,所述第一关系模型是通过以下方式训练得到的:The device according to claim 14, wherein the first relationship model is obtained by training in the following manner:
    构建训练样本集合;基于所述训练样本集合,对关系模型进行训练,得到所述第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain the first relational model;
    所述基于所述训练样本集合,对关系模型进行训练,至少包括:The training of the relational model based on the training sample set includes at least:
    将所述训练样本集合中的第二状态向量输入至所述关系模型,确定所述第二状态向量对应的波束切换阈值;inputting a second state vector in the training sample set to the relationship model, and determining a beam switching threshold corresponding to the second state vector;
    基于所述第二状态向量对应的波束切换阈值和奖励函数,确定所述第二状态向量对应的奖励值;determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
    基于所述第二状态向量、所述第二状态向量对应的奖励值和所述关系模型对应的损失函数,确定损失函数值;determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
    基于所述损失函数值,对所述关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relationship model are updated to obtain an updated relationship model.
  17. 根据权利要求16所述的装置,其中,所述基于所述训练样本集合,对关系模型进行训练,还包括:The device according to claim 16, wherein said training a relational model based on said training sample set further comprises:
    在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
    将所述训练样本集合中的第二状态向量输入至更新后的关系模型,确定所述第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
    基于所述第二状态向量对应的波束切换阈值和奖励函数,确定所述第二状态向量对应的奖励值;determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
    基于第二状态向量、所述第二状态向量对应的奖励值和所述更新后的关系模型对应的损失函数,确定损失函数值;determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the updated relationship model;
    基于所述损失函数值,对所述更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relationship model are updated to obtain the updated relationship model.
  18. 根据权利要求16所述的装置,其中,在所述基于第二状态向量、 所述第二状态向量对应的奖励值和所述关系模型对应的损失函数,确定损失函数值之后,还包括:The apparatus according to claim 16, wherein, in said second state vector based, The reward value corresponding to the second state vector and the loss function corresponding to the relationship model, after determining the loss function value, further include:
    判断是否达到终止条件;Determine whether the termination condition is met;
    若确定达到所述终止条件,则得到所述第一关系模型;If it is determined that the termination condition is met, then obtain the first relational model;
    所述终止条件为以下中的一项:The termination condition is one of the following:
    所述损失函数值小于或等于损失函数值阈值;或者,The loss function value is less than or equal to a loss function value threshold; or,
    所述损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to a loss function value threshold.
  19. 根据权利要求11所述的装置,其中,所述处理器读取所述存储器中的计算机程序并执行的操作还包括:The apparatus according to claim 11, wherein the operations performed by the processor to read the computer program in the memory further include:
    将所述至少一个UE发送给所述第一网络节点的第一质量信息,发送给第二网络节点。Sending the first quality information sent by the at least one UE to the first network node to the second network node.
  20. 一种波束切换装置,应用于UE,包括存储器,收发机,处理器,其中:A beam switching device, applied to a UE, including a memory, a transceiver, and a processor, wherein:
    存储器,用于存储计算机程序;memory for storing computer programs;
    收发机,用于在所述处理器的控制下收发数据;a transceiver, configured to send and receive data under the control of the processor;
    处理器,用于读取所述存储器中的计算机程序并执行以下操作:a processor for reading the computer program in said memory and performing the following operations:
    向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使所述第一网络节点获取所述第一时间段内多个波束分别对应的第一质量信息,并基于所述多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定所述UE对应的波束切换阈值;Sending the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the first quality information respectively corresponding to the multiple beams within the first time period, and based on The first quality information corresponding to the plurality of beams, and the average channel quality corresponding to the plurality of beams in at least one second time period, determine the beam switching threshold corresponding to the UE;
    接收所述第一网络节点发送的所述UE对应的波束切换阈值,并基于所述波束切换阈值进行波束切换。receiving the beam switching threshold corresponding to the UE sent by the first network node, and performing beam switching based on the beam switching threshold.
  21. 一种波束切换装置,应用于第一网络节点,包括:A beam switching device, applied to a first network node, comprising:
    第一处理单元,用于获取第一时间段内多个第一波束分别对应的第一质量信息,所述多个第一波束分别对应的第一质量信息包括至少一个用户设备UE发送给所述第一网络节点的第一质量信息;The first processing unit is configured to obtain first quality information respectively corresponding to multiple first beams within a first time period, where the first quality information respectively corresponding to multiple first beams includes at least one user equipment UE sent to the first quality information of the first network node;
    第二处理单元,用于基于所述多个第一波束分别对应的第一质量信息,确定所述多个第一波束对应的第一平均信道质量; A second processing unit, configured to determine a first average channel quality corresponding to the plurality of first beams based on the first quality information respectively corresponding to the plurality of first beams;
    第三处理单元,用于基于所述多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量;所述至少一个第二时间段在所述第一时间段之前;A third processing unit, configured to determine a first state vector based on a first average channel quality corresponding to the plurality of first beams and a second average channel quality corresponding to a plurality of second beams in at least one second time period; said at least one second time period precedes said first time period;
    第四处理单元,用于基于所述第一状态向量,确定所述至少一个UE对应的波束切换阈值。A fourth processing unit, configured to determine a beam switching threshold corresponding to the at least one UE based on the first state vector.
  22. 根据权利要求21所述的装置,其中,所述第一处理单元获取第一时间段内多个第一波束分别对应的第一质量信息,包括:The device according to claim 21, wherein the first processing unit acquires first quality information respectively corresponding to a plurality of first beams within a first time period, comprising:
    获取第一时间段内第二网络节点发送的第一质量信息;并接收所述至少一个UE发送的第一质量信息;Acquiring the first quality information sent by the second network node within the first time period; and receiving the first quality information sent by the at least one UE;
    其中,所述第一质量信息包括信号与干扰噪声比SINR、接收功率RSRP、接收质量RSRQ中的至少一项。Wherein, the first quality information includes at least one of signal to interference and noise ratio SINR, received power RSRP, and received quality RSRQ.
  23. 根据权利要求21所述的装置,其中,所述第三处理单元基于所述多个第一波束对应的第一平均信道质量,以及至少一个第二时间段内多个第二波束对应的第二平均信道质量,确定第一状态向量,包括:The apparatus according to claim 21, wherein the third processing unit is based on the first average channel quality corresponding to the plurality of first beams and the second channel quality corresponding to the plurality of second beams in at least one second time period. Average channel quality, determine the first state vector, comprising:
    将所述第一平均信道质量进行量化映射处理,得到所述第一平均信道质量对应的第一信息向量的值;并将所述第二平均信道质量进行量化映射处理,得到所述第二平均信道质量对应的第二信息向量的值;performing quantization and mapping processing on the first average channel quality to obtain the value of the first information vector corresponding to the first average channel quality; and performing quantization mapping processing on the second average channel quality to obtain the second average The value of the second information vector corresponding to the channel quality;
    基于所述第一信息向量和至少一个所述第二信息向量,得到第一状态向量。A first state vector is obtained based on the first information vector and at least one of the second information vectors.
  24. 根据权利要求21所述的装置,其中,所述第四处理单元基于所述第一状态向量,确定所述至少一个UE对应的波束切换阈值,包括:The apparatus according to claim 21, wherein the fourth processing unit determines the beam switching threshold corresponding to the at least one UE based on the first state vector, comprising:
    将所述第一状态向量输入至第一关系模型,进行匹配处理,得到与所述第一状态向量相匹配的一个或多个奖励值,并将所述一个或多个奖励值中的最大奖励值对应的波束切换阈值,确定为所述UE对应的波束切换阈值;Inputting the first state vector into the first relational model, performing matching processing, obtaining one or more reward values matching the first state vector, and rewarding the largest reward among the one or more reward values The beam switching threshold corresponding to the value is determined as the beam switching threshold corresponding to the UE;
    其中,所述第一关系模型用于表征所述第一状态向量、奖励值和波束切换阈值之间的关系。Wherein, the first relationship model is used to characterize the relationship among the first state vector, reward value and beam switching threshold.
  25. 根据权利要求24所述的装置,其中,所述第一关系模型通过所述至少一个UE的第一指标变化对应的第一参数,以及波束切换代价对应 的第二参数确定。The apparatus according to claim 24, wherein the first relationship model changes the first parameter corresponding to the first indicator of the at least one UE, and the beam switching cost corresponds to The second parameter of is determined.
  26. 根据权利要求24所述的装置,其中,所述第一关系模型是通过以下方式训练得到的:The device according to claim 24, wherein the first relational model is trained by:
    构建训练样本集合;基于所述训练样本集合,对关系模型进行训练,得到所述第一关系模型;Constructing a training sample set; based on the training sample set, training the relational model to obtain the first relational model;
    所述基于所述训练样本集合,对关系模型进行训练,至少包括:The training of the relational model based on the training sample set includes at least:
    将所述训练样本集合中的第二状态向量输入至所述关系模型,确定所述第二状态向量对应的波束切换阈值;inputting a second state vector in the training sample set to the relationship model, and determining a beam switching threshold corresponding to the second state vector;
    基于所述第二状态向量对应的波束切换阈值和奖励函数,确定所述第二状态向量对应的奖励值;determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
    基于所述第二状态向量、所述第二状态向量对应的奖励值和所述关系模型对应的损失函数,确定损失函数值;determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model;
    基于所述损失函数值,对所述关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the relationship model are updated to obtain an updated relationship model.
  27. 根据权利要求26所述的装置,其中,所述基于所述训练样本集合,对关系模型进行训练,还包括:The device according to claim 26, wherein said training a relational model based on said training sample set further comprises:
    在未达到终止条件时,重复执行以下步骤:When the termination condition is not met, the following steps are repeated:
    将所述训练样本集合中的第二状态向量输入至更新后的关系模型,确定所述第二状态向量对应的波束切换阈值;Inputting the second state vector in the training sample set to the updated relationship model, and determining the beam switching threshold corresponding to the second state vector;
    基于所述第二状态向量对应的波束切换阈值和奖励函数,确定所述第二状态向量对应的奖励值;determining a reward value corresponding to the second state vector based on the beam switching threshold and the reward function corresponding to the second state vector;
    基于第二状态向量、所述第二状态向量对应的奖励值和所述更新后的关系模型对应的损失函数,确定损失函数值;determining a loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the updated relationship model;
    基于所述损失函数值,对所述更新后的关系模型的模型参数进行更新,得到更新后的关系模型。Based on the loss function value, the model parameters of the updated relationship model are updated to obtain the updated relationship model.
  28. 根据权利要求26所述的装置,其中,在所述基于第二状态向量、所述第二状态向量对应的奖励值和所述关系模型对应的损失函数,确定损失函数值之后,还包括: The device according to claim 26, wherein, after determining the loss function value based on the second state vector, the reward value corresponding to the second state vector, and the loss function corresponding to the relationship model, further comprising:
    判断是否达到终止条件;Determine whether the termination condition is met;
    若确定达到所述终止条件,则得到所述第一关系模型;If it is determined that the termination condition is met, then obtain the first relational model;
    所述终止条件为以下中的一项:The termination condition is one of the following:
    所述损失函数值小于或等于损失函数值阈值;或者,The loss function value is less than or equal to a loss function value threshold; or,
    所述损失函数值大于或等于损失函数值阈值。The loss function value is greater than or equal to a loss function value threshold.
  29. 根据权利要求21所述的装置,其中,所述第一处理单元还用于:The device according to claim 21, wherein the first processing unit is further configured to:
    将所述至少一个UE发送给所述第一网络节点的第一质量信息,发送给第二网络节点。Sending the first quality information sent by the at least one UE to the first network node to the second network node.
  30. 一种波束切换装置,应用于UE,包括:A beam switching device, applied to a UE, comprising:
    第五处理单元,用于向第一网络节点发送第一时间段内第一波束对应的第一质量信息;以使所述第一网络节点获取所述第一时间段内多个波束分别对应的第一质量信息,并基于所述多个波束分别对应的第一质量信息,以及至少一个第二时间段内多个波束对应的平均信道质量,确定所述UE对应的波束切换阈值;The fifth processing unit is configured to send the first quality information corresponding to the first beam within the first time period to the first network node; so that the first network node obtains the quality information respectively corresponding to the multiple beams within the first time period First quality information, and based on the first quality information respectively corresponding to the plurality of beams, and the average channel quality corresponding to the plurality of beams in at least one second time period, determine a beam switching threshold corresponding to the UE;
    第六处理单元,用于接收所述第一网络节点发送的所述UE对应的波束切换阈值,并基于所述波束切换阈值进行波束切换。The sixth processing unit is configured to receive the beam switching threshold corresponding to the UE sent by the first network node, and perform beam switching based on the beam switching threshold.
  31. 一种处理器可读存储介质,所述处理器可读存储介质存储有计算机程序,所述计算机程序被处理器执行时,实现权利要求1至9中任一项所述的方法或权利要求10所述的方法。 A processor-readable storage medium, the processor-readable storage medium stores a computer program, and when the computer program is executed by a processor, the method according to any one of claims 1 to 9 or claim 10 is implemented the method described.
PCT/CN2023/072416 2022-01-26 2023-01-16 Beam switching method and apparatus, and processor readable storage medium WO2023143200A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210095304.5A CN116546579A (en) 2022-01-26 2022-01-26 Beam switching method, device and processor readable storage medium
CN202210095304.5 2022-01-26

Publications (1)

Publication Number Publication Date
WO2023143200A1 true WO2023143200A1 (en) 2023-08-03

Family

ID=87454714

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/072416 WO2023143200A1 (en) 2022-01-26 2023-01-16 Beam switching method and apparatus, and processor readable storage medium

Country Status (2)

Country Link
CN (1) CN116546579A (en)
WO (1) WO2023143200A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017196243A1 (en) * 2016-05-13 2017-11-16 Telefonaktiebolaget Lm Ericsson (Publ) Low-power channel-state-information reporting mode
CN107547115A (en) * 2016-06-29 2018-01-05 中兴通讯股份有限公司 The method and device that a kind of narrow beam takes over seamlessly
CN112311446A (en) * 2020-10-20 2021-02-02 陕西航天技术应用研究院有限公司 Satellite beam switching method and system based on multiple dimensions
CN112929893A (en) * 2019-12-06 2021-06-08 大唐移动通信设备有限公司 Method and device for switching receiving wave beams

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017196243A1 (en) * 2016-05-13 2017-11-16 Telefonaktiebolaget Lm Ericsson (Publ) Low-power channel-state-information reporting mode
CN107547115A (en) * 2016-06-29 2018-01-05 中兴通讯股份有限公司 The method and device that a kind of narrow beam takes over seamlessly
CN112929893A (en) * 2019-12-06 2021-06-08 大唐移动通信设备有限公司 Method and device for switching receiving wave beams
CN112311446A (en) * 2020-10-20 2021-02-02 陕西航天技术应用研究院有限公司 Satellite beam switching method and system based on multiple dimensions

Also Published As

Publication number Publication date
CN116546579A (en) 2023-08-04

Similar Documents

Publication Publication Date Title
US11483720B2 (en) Communications device and method
US9408121B2 (en) Method and apparatus for enhancing measurement in wireless communication system
US10701621B2 (en) Small cell discovery in a communication network
US9426675B2 (en) System and method for adaptation in a wireless communications system
EP4287574A1 (en) Model update method and apparatus in communication system, and storage medium
Su et al. A self-optimizing mobility management scheme based on cell ID information in high velocity environment
Huang et al. Self-adapting handover parameters optimization for SDN-enabled UDN
CN114071612B (en) Method, device and storage medium for updating primary cell of secondary cell group
WO2011042414A1 (en) Femtocell base station
Gadam et al. Review of adaptive cell selection techniques in LTE-advanced heterogeneous networks
WO2021012166A1 (en) Wireless communication method and device
KR20170003970A (en) Apparatus and method for controlling soft switch proportion
Tashan et al. Voronoi-Based Handover Self-Optimization Technique For Handover Ping-Pong Reduction in 5G Networks
WO2023143200A1 (en) Beam switching method and apparatus, and processor readable storage medium
EP3162113B1 (en) Methods, nodes and system for enabling redistribution of cell load
Venkatesh et al. Optimizing handover in LTE using SON system by handling mobility robustness
Sun et al. Data driven smart load balancing in wireless networks
Venkatesan et al. Interference Mitigation Approach using Massive MIMO towards 5G networks
CN115250529A (en) Data transmission method, base station and storage medium
Wang et al. Investigating the Impact of Variables on Handover Performance in 5G Ultra-Dense Networks
Zhang et al. Multi-Connectivity Mobility Management in Downlink FD-RAN: A Learning Based Approach
WO2024032328A1 (en) Configuration information processing method and apparatus
US11785525B2 (en) Path selection for integrated access and backhaul
WO2024022055A1 (en) Candidate cell determination method and apparatus
WO2024027414A1 (en) Random access resource determining method, terminal, apparatus, and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23746082

Country of ref document: EP

Kind code of ref document: A1