WO2024055739A1 - Method for determining uplink channel, and terminal and network device - Google Patents

Method for determining uplink channel, and terminal and network device Download PDF

Info

Publication number
WO2024055739A1
WO2024055739A1 PCT/CN2023/107293 CN2023107293W WO2024055739A1 WO 2024055739 A1 WO2024055739 A1 WO 2024055739A1 CN 2023107293 W CN2023107293 W CN 2023107293W WO 2024055739 A1 WO2024055739 A1 WO 2024055739A1
Authority
WO
WIPO (PCT)
Prior art keywords
channel
uplink channel
sta
reinforcement learning
idle
Prior art date
Application number
PCT/CN2023/107293
Other languages
French (fr)
Chinese (zh)
Inventor
王和俊
王滨后
徐芳
孙可欣
谢刚
Original Assignee
青岛海尔智能技术研发有限公司
海尔智家股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 青岛海尔智能技术研发有限公司, 海尔智家股份有限公司 filed Critical 青岛海尔智能技术研发有限公司
Publication of WO2024055739A1 publication Critical patent/WO2024055739A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/309Measuring or estimating channel quality parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/382Monitoring; Testing of propagation channels for resource allocation, admission control or handover
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/391Modelling the propagation channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L5/00Arrangements affording multiple use of the transmission path
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the present application relates to the field of wireless communication technology, for example, to a method for determining an uplink channel, a terminal, and network equipment.
  • the cooperative scheduling methods between multiple APs mainly include C-OFDMA (Coordinated Orthogonal Frequency-Division Multiple Access, Coordinated Orthogonal Frequency Division Multiple Access) and CBF (Coordinated Beamforming, Coordinated Beamforming).
  • C-OFDMA Coordinatd Orthogonal Frequency-Division Multiple Access
  • CBF Coordinated Beamforming
  • the AP Wired Access Point, wireless access point
  • OFDMA Orthogonal Frequency-Division Multiple Access, Orthogonal Frequency Division Multiple Access
  • RU Resource Unit, spectrum resource block
  • allocating appropriate RUs to STA requires sufficient CSI (Channel State Information) or sufficient channel estimation;
  • C-OFDMA uses orthogonal channels to suppress interference and transmit correctly, but in In the case of loss of coordination, the AP cannot guarantee that the channels are orthogonal, so it cannot suppress interference through orthogonal channels.
  • the utilization rate of the channel will also be reduced.
  • the channel information fed back by the STA to the AP under CBF is incomplete, making it difficult for the AP to effectively estimate the channel.
  • Embodiments of the present disclosure provide a method, terminals, and network equipment for determining uplink channels, thereby completing channel allocation when the CSI is unknown or the CSI information is incomplete, thereby improving system throughput and improving channel efficiency. Utilization.
  • the method for determining the uplink channel, applied to terminal equipment STA includes:
  • the trigger instruction is used to trigger the STA to sense the channel status
  • the channel status will be sensed again after the set backoff time.
  • accessing the uplink channel for data transmission includes:
  • the performance information of the corresponding channel is updated according to the transmission result, and the new data packet arrival indication is received;
  • the performance information of the corresponding channel is updated according to the transmission result, and the data transmission instruction is re-executed.
  • determining the uplink channel based on the performance information of the idle channel includes:
  • the idle channel with the best ability to successfully transmit data packets is determined as the uplink channel.
  • the determination of the uplink channel includes:
  • the uplink channel is determined according to the output of the reinforcement learning-based uplink channel selection model.
  • the training of the uplink channel selection model based on reinforcement learning includes:
  • the uplink channel selection model based on reinforcement learning is trained according to the reward parameter R t to obtain the channel corresponding to the system action that maximizes the reward parameter R t as the uplink channel.
  • the establishment of the uplink channel selection model based on reinforcement learning includes:
  • C t represents the average network throughput at time t
  • N represents the total number of STAs
  • training the reinforcement learning-based uplink channel selection model according to the reward parameter Rt includes:
  • Q t represents the Q value of the current state
  • Q t+1 represents the Q value of the next state moment
  • represents the learning rate of reinforcement learning, with a value of (0,1)
  • represents the emphasis on historical rewards.
  • the value is (0,1)
  • maxQ t (S′,A′) represents the maximum Q value of all possible action strategies at the next moment.
  • the method for determining an uplink channel, applied to an access point AP includes:
  • the trigger instruction is used to trigger the STA to sense the channel status
  • a terminal device including a processor and a memory
  • the memory is used to store a computer program
  • the processor is used to call and run the program stored in the memory, and perform the above-mentioned determination.
  • Upstream channel method including a processor and a memory, the memory is used to store a computer program, the processor is used to call and run the program stored in the memory, and perform the above-mentioned determination. Upstream channel method.
  • a network device including a processor and a communication interface.
  • the communication interface is used to communicate with other network devices; the processor is used to run a set of programs, so that the network device implements the above-mentioned functions. Method to determine the upstream channel.
  • the method, terminal and network device for determining the uplink channel provided by the embodiments of the present disclosure can achieve the following technical effects:
  • the terminal STA senses the channel status and determines the uplink channel based on the channel performance in the idle channel for data transmission. In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
  • Figure 1 is a schematic diagram of an environmental system according to an embodiment of the present disclosure
  • Figure 2 is a schematic flowchart of a method for determining an uplink channel provided by an embodiment of the present disclosure
  • Figure 3 is a schematic flowchart of another method for determining an uplink channel provided by an embodiment of the present disclosure
  • Figure 4 is a schematic diagram of the training process of the uplink channel selection model based on reinforcement learning in an embodiment of the present disclosure
  • Figure 5 is a schematic flowchart of another method for determining an uplink channel provided by an embodiment of the present disclosure
  • Figure 6 is a schematic flowchart of another method for determining an uplink channel provided by an embodiment of the present disclosure
  • Figure 7 is an application schematic diagram of an embodiment of the present disclosure.
  • Figure 8 is a schematic diagram of a terminal device provided by an embodiment of the present disclosure.
  • Figure 9 is a schematic diagram of a network device provided by an embodiment of the present disclosure.
  • the character "/" indicates that the preceding and following objects are in an "or" relationship.
  • A/B indicates: A or B.
  • a and/or B means: A or B, or A and B.
  • correspondence can refer to an association relationship or a binding relationship.
  • correspondence between A and B refers to an association relationship or a binding relationship between A and B.
  • AP represents a wireless access point, which may be a router, a gateway or a combined router-gateway.
  • STA represents a user terminal, which may be a mobile terminal or station connected to the AP via a communication connection function to obtain access to AP system resources (eg, network). It can be a cellular phone, a cordless phone, a Session Initiation Protocol (SIP) phone, a Wireless Local Loop (WLL) station, a Personal Digital Assistant (Personal Digital Assistant, PDA) device, a device with wireless communication capabilities Handheld devices, computing devices or other processing devices connected to wireless modems, vehicle-mounted devices, wearable devices, next-generation communication systems such as terminal devices in NR networks, or future evolved Public Land Mobile Networks (PLMN) Terminal equipment in the network, etc.
  • AP system resources eg, network
  • AP system resources eg, network
  • It can be a cellular phone, a cordless phone, a Session Initiation Protocol (SIP) phone, a Wireless Local Loop (WLL) station, a Personal Digital Assistant (Personal Digital Assistant, PDA) device, a
  • virtual reality Virtual Reality, VR
  • AR Augmented Reality
  • industrial control industrial control
  • Wireless terminal equipment in self-driving self driving
  • wireless terminal equipment in remote medical remote medical
  • wireless terminal equipment in smart grid smart grid
  • transportation safety transportation safety
  • Figure 1 shows a schematic diagram of an environmental system provided by an embodiment of the present disclosure.
  • the environmental system includes multiple APs and multiple STAs.
  • each AP can access one or more STAs; each STA can also access one or more APs.
  • an AP can connect to a STA and establish at least two channels. As shown in Figure 1, STA1 and AP1; one AP can also be connected to two STAs and establish at least two channels. As shown in Figure 1, STA2, AP2, and AP3.
  • Each STA can obtain channel sensing information and data transmission information of one or more channels between it and the accessed AP. Each STA can also sense and obtain the interference power of other APs in the group to the STA.
  • each channel needs to be responsible for carrying uplink and downlink data.
  • the STA and AP will perform carrier sense multiple access (CSMA)/enhanced distributed channel access. (enhanced distributed channel access, EDCA) backoff, an air interface collision may occur after data is sent. If an air interface collision occurs, the data transmission fails and needs to be resent.
  • CSMA carrier sense multiple access
  • EDCA enhanced distributed channel access
  • the downlink needs to wait for the uplink transmission to complete before data can be sent.
  • the uplink needs to wait for the downlink sending to complete before data can be sent. Therefore, uplink and downlink data may collide, and the waiting time for data transmission may be extended, affecting the channel utilization and data throughput of the system.
  • the uplink channel in this embodiment is responsible for transmitting information data from the STA to the AP.
  • An idle channel refers to an unoccupied channel among the multiple channels accessed.
  • the idle channel can be determined among the accessed channels by searching for a channel that emits an idle signal, or by searching for a carrier-free channel.
  • the above-mentioned environment system may also include other network entities such as network controllers and mobility management entities, which are not limited in the embodiments of the present application.
  • embodiments of the present disclosure provide a method for determining an uplink channel, so that the STA can determine the uplink channel in the allocated resources and access the upload data.
  • this method is applied to terminal equipment STA, including:
  • Step S201 The STA receives a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel state.
  • the trigger command is used to inform the STA that the data packet has arrived, and it can sense the channel status to transmit the data packet.
  • Step S202 If there is an idle channel, the STA determines the uplink channel based on the performance information of the idle channel, and accesses the uplink channel for data transmission.
  • Step S203 If there is no idle channel, the STA senses the channel status again after the set backoff time.
  • the STA begins to sense the channel status. If no idle channel is sensed, the data packet backs off and continues sensing; if the idle state is sensed, the uplink channel is determined in the idle state based on the performance information of the idle state. , for data transmission.
  • the STA can complete the uplink channel selection, which can achieve greater throughput improvement, maximize channel utilization, reduce the probability of collision between channels, and improve Spectral efficiency of the system.
  • determine the uplink channel based on the performance information of the idle channel including:
  • the idle channel with the best ability to successfully transmit data packets is determined as the uplink channel.
  • the ability to successfully transmit a data packet may be determined by the historical transmission success rate of the data packet and/or the channel sensing weight.
  • the channel with the highest historical data packet transmission success rate is determined as the uplink channel.
  • the historical transmission success rate can be determined.
  • the channel with the highest channel sensing weight is determined as the uplink channel.
  • the channel sensing weight can generally be calculated and obtained by the STA using the spectrum sensing algorithm on the corresponding channel, and is used to represent the channel quality.
  • Reinforcement learning is an online learning algorithm.
  • the agent interacts with the external environment through a reward mechanism and adjusts its behavior according to the reward value obtained in the environment, allowing the agent to learn. And adapt to the external environment, prompting the agent to choose the behavior that can obtain the maximum reward for itself in the environment.
  • the characteristics of reinforcement learning and adapting to the external environment can be applied to the channel selection between the STA and the AP, so that the STA can learn the changing channel status as an agent, and finally select the one that successfully transmits the data packet in the idle channel.
  • the idle channel with the best capability is used as the uplink channel to reduce channel status scanning overhead and improve channel detection probability. This achieves the purpose of achieving greater throughput improvement, maximizing channel utilization, reducing the probability of collisions between channels, and improving the spectrum efficiency of the system.
  • an embodiment of the present disclosure provides a method for determining an uplink channel, which is applied to the STA in Figure 1 to determine the uplink channel between the STA and the AP through the data processing method of reinforcement learning.
  • the method includes:
  • Step S301 The STA receives a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel state.
  • Step S302 When there is an idle channel, construct an uplink channel selection model based on reinforcement learning based on the network average throughput optimization problem.
  • the STA senses the channel status again after the set backoff time.
  • the uplink channel selection model based on reinforcement learning is constructed, including a state set, an action set and a reward function.
  • the establishment of the uplink channel selection model based on reinforcement learning includes:
  • the signal-to-interference-to-noise ratio refers to the ratio of the signal to the sum of interference and noise in the system.
  • the channel information between the STA and the AP is used to establish the uplink channel selection model, so that when selecting the uplink channel, the status of each channel can be combined to ensure that the system throughput meets the requirements.
  • Step S303 With the goal of maximizing the average network throughput, the channel state information and performance information are input into the uplink channel selection model based on reinforcement learning for training, and the average network throughput is obtained.
  • the training of the reinforcement learning-based uplink channel selection model includes:
  • the uplink channel selection model based on reinforcement learning is trained according to the reward parameter R t to obtain the channel corresponding to the system action that maximizes the reward parameter R t as the uplink channel.
  • the reward parameter R t is used to represent the average value of the perception weight and channel transmission weight of the selected uplink channel at time t.
  • training a resource allocation decision-making model based on reinforcement learning based on the reward parameter R t includes: using the following method as the update rule for reinforcement learning:
  • Q t represents the Q value of the current state
  • Q t+1 represents the Q value of the next state moment
  • represents the learning rate of reinforcement learning, with a value of (0,1)
  • represents the emphasis on historical rewards.
  • the value is (0,1)
  • maxQ t (S′,A′) represents the maximum Q value of all possible action strategies at the next moment.
  • Figure 4 shows a schematic diagram of reinforcement learning training in an embodiment of the present disclosure to illustrate the above steps.
  • the reinforcement learning in this embodiment uses the Q-Learning algorithm.
  • the agent performs actions in the environment to obtain certain rewards to perceive the environment, thereby learning a mapping strategy from state to action to maximize the reward value.
  • STA is used as an agent for reinforcement learning and performs data processing as an intelligent agent. Based on the mutual interference information between APs and channel idle conditions received by the STA, the reinforcement learning algorithm is used to achieve reasonable and effective uplink channel selection. Through the process of continuous interaction between the agent STA and the environment, feedback is obtained from the environment, and then the action of the agent STA is changed to realize the adjustment of the uplink channel selection action.
  • STA first obtains mutual interference information between APs and channel idle conditions as channel performance information and State information S 0 , the agent STA takes action A 0 in the S 0 environment as a channel selection decision, and feeds it back to the AP in the environment.
  • the actions taken by STA can be selected according to the greedy strategy.
  • the agent STA After the agent STA makes a channel selection decision, it performs access and data transmission according to the selected uplink channel. Determine the reward parameter R 1 based on the system throughput and provide feedback to the STA; and send the next state S 1 including mutual interference information between APs and channel idle conditions to the STA.
  • the STA After receiving the reward parameter R 1 and the environment status S 1 , the STA updates the Q value table according to the update rules of reinforcement learning, and takes action A 1 to the environment as an uplink channel selection decision. After receiving action A 1 , the environment state changes from state S 1 to S 2 , and the reward parameter R 2 is fed back.
  • STA gets the reward parameter R 2 and state S 2 , updates the Q value table, and takes action A 2 ; gets the reward parameter R 3 and state S 3 , updates the Q value table, and takes action A 3 .
  • This cycle is continued until the system throughput reaches the maximum, that is, the reward parameter Rt reaches the maximum.
  • the purpose of reducing interference and improving throughput is achieved.
  • a Q value is used for each channel in the table to represent the level of channel transmission quality; when a data packet arrives, the STA begins to sense the idle channel. If no idle channel is sensed, the data packet backs off. , continue sensing; if an idle channel is sensed, use the Q-Learning mechanism to learn the uplink channel selection strategy.
  • the Q-learning learning process includes: determining the action A t+1 at this moment based on the previous state S t , then updating the state S t+1 , and feeding back a reward R t . Through learning, the STA will select a channel with the best transmission quality among the idle channels for transmission. The transmission quality here is measured by the success rate of historical transmission data packets.
  • the STA can take action according to the greedy decision-making strategy through Q-Learning, that is, select among the idle channels with probability ⁇ , and finally determine the uplink channel.
  • Step S304 When the average network throughput reaches the maximum value, determine the uplink channel according to the output of the uplink channel selection model based on reinforcement learning.
  • the system's corresponding action A t is used as the optimal strategy to determine the corresponding uplink channel selection action.
  • Step S305 accessing an uplink channel for data transmission.
  • the terminal STA makes decisions by sensing the channel status and the number of idle channels, selects the channel with the highest channel quality for data transmission, and feeds the reward back to the environment while updating the next state.
  • the uplink channel is determined based on channel performance in the idle channel for data transmission. In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
  • Figure 5 shows a method for determining the uplink channel to illustrate the perceived channel conditions when a data packet arrives, And use reinforcement learning to select the channel to be accessed to complete the uplink transmission to the AP.
  • an embodiment of the present disclosure provides a method for determining an uplink channel, which is applied to the STA in Figure 1 to determine the uplink channel between the STA and the AP through the data processing method of reinforcement learning.
  • the method includes:
  • Step S501 The STA receives a trigger instruction; the trigger instruction includes a data packet arrival indication.
  • Step S502 The STA senses whether there is an idle channel.
  • Step S503 If there is no idle channel, data packet backoff is performed, and the channel status is sensed again after the set backoff time.
  • the set backoff duration is determined by a random distribution with mean ⁇ .
  • Step S504 If there is an idle channel, use the Q-learning algorithm to output the channel selection decision as the uplink channel through the uplink channel selection model based on reinforcement learning.
  • Step S505 Access the uplink channel for data transmission. And the action set and reward parameters in step S504 are updated according to the selected channel action and the system throughput change after the selection.
  • Step S506 After the data transmission is successful, the information required by the Q-learning algorithm in step S504 is updated according to the transmission result. Update the status set in step S504 according to the transmission result, and return to step S501 to receive a new data packet arrival indication.
  • Step S507 After the data transmission fails, update the information required by the Q-learning algorithm in step S504 according to the transmission result, and return to step S502 to re-execute the data transmission instruction.
  • the status set in step S504 is updated according to the transmission result.
  • the terminal STA makes decisions by sensing the channel status and the number of idle channels, selects the channel with the highest channel quality for data transmission, and feeds the reward back to the environment while updating the next state. And continue to update the environment status after making a decision. Update the two situations that exist after data packet transmission into the Q-learning learning process. After the transmission is successful, after updating the environment status, this data transmission ends, waiting for the arrival of new data packets, and entering the next round of data transmission; after the transmission fails, after updating the environment status, it is necessary to enter the retransmission mechanism and re-sense. channel for data packet transmission.
  • the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
  • Figure 6 shows a method for determining the uplink channel, applied to the AP in the environment system shown in Figure 1, including:
  • Step S601 the AP sends a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel status.
  • the AP sends a trigger command to the STA to obtain the data cache information fed back by the STA, and triggers the STA to sense the channel status for data transmission.
  • the AP can send a BSRP buffer status report poll frame (Buffer Status Report Poll, BSRP) to cause the STA to send a buffer status report frame (Buffer Status Report, BSR).
  • Buffer Status Report Poll Buffer Status Report Poll
  • BSR Buffer Status Report
  • Step S602 The AP receives the data transmitted by the STA through the uplink channel; the uplink channel is determined by the STA based on the performance information of the idle channel.
  • the AP After receiving the data transmitted by the STA through the uplink channel, the AP also sends an acknowledgment character (ACK) to the STA to indicate receipt of the uploaded data.
  • ACK acknowledgment character
  • the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
  • Figure 7 shows an application diagram of a method for determining an uplink channel.
  • the method for determining the uplink channel includes the following steps:
  • Step S701 the AP sends a BRSP to the STA, requesting to obtain the STA's data cache information;
  • Step S702 the STA sends a BSR to the AP to feed back the data cache information
  • Step S703 The STA senses the current status of all channels. If it senses that there are multiple idle channels, it enters the Q-learning learning process environment and selects the idle channel with the best ability to successfully transmit data packets as the uplink channel. If there is no channel idle, the data packet will back off for a period of time before being transmitted. The back off time is subject to a random distribution with mean ⁇ .
  • Step S704 The STA accesses the uplink channel and transmits data.
  • Step S705 The AP receives the data transmitted by the STA and sends an ACK to the STA to indicate receipt.
  • the terminal STA senses the channel status, determines the uplink channel based on the channel performance in the idle channel, and performs data transmission.
  • the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , reduce the impact of interference between multiple APs on data transmission, and improve the spectrum efficiency of the system.
  • an embodiment of the present disclosure provides a terminal device, including a processor 800 and a memory 801 .
  • the memory 801 is used to store computer programs, and the processor 800 is used to call and run the programs stored in the memory, and perform the above-mentioned method for determining the uplink channel.
  • the device also includes a communication interface 802 and a bus 803.
  • the communication interface 802 is used to communicate with other network devices; the processor 800, the communication interface 802, and the memory 801 can communicate with each other through the bus 803.
  • an embodiment of the present disclosure provides a network device, including a processor 900 and a memory 901 .
  • the memory 901 is used to store computer programs, and the processor 900 is used to call and run the programs stored in the memory, and perform the above-mentioned method for determining the uplink channel.
  • the device also includes a communication interface 902 and a bus 903.
  • the communication interface 902 is used to communicate with other network devices; the processor 900, the communication interface 902, and the memory 901 can communicate with each other through the bus 903.
  • the above-mentioned logical instructions in the memory 901 can be implemented in the form of software functional units and can be stored in a computer-readable storage medium when sold or used as an independent product.
  • the memory 901 can be used to store software programs, computer-executable programs, such as program instructions/modules corresponding to the methods in the embodiments of the present disclosure.
  • the processor 900 executes program instructions/modules stored in the memory 901 to execute functional applications and data processing, that is, to implement the method for determining the uplink channel in the above embodiment.
  • the memory 901 may include a stored program area and a stored data area, where the stored program area may store an operating system and at least one application program required for a function; the stored data area may store data created according to the use of the terminal device, etc.
  • the memory 901 may include high-speed random access memory and may also include non-volatile memory.
  • Embodiments of the present disclosure provide a computer-readable storage medium that stores computer-executable instructions, and the computer-executable instructions are configured to execute the above method for determining an uplink channel.
  • Embodiments of the present disclosure provide a computer program product.
  • the computer program product includes a computer program stored on a computer-readable storage medium.
  • the computer program includes program instructions. When the program instructions are executed by a computer, the The computer executes the above method for determining the uplink channel.
  • An embodiment of the present disclosure provides a computer program that, when executed by a computer, causes the computer to implement the above method for determining an uplink channel.
  • the above-mentioned computer-readable storage medium may be a transient computer-readable storage medium or a non-transitory computer-readable storage medium.
  • the technical solution of the embodiments of the present disclosure may be embodied in the form of a software product.
  • the computer software product is stored in a storage medium and includes one or more instructions to enable a computer device (which may be a personal computer, a server, or a network equipment, etc.) to perform all or part of the steps of the method described in the embodiments of the present disclosure.
  • the aforementioned storage media can be non-transitory storage media, including: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk, etc.
  • the term “and/or” as used in this application refers to an or any and all possible combinations of one or more of the associated listed.
  • the term “comprise” and its variations “comprises” and/or “comprising” etc. refer to stated features, integers, steps, operations, elements, and/or The presence of a component does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groupings of these.
  • an element defined by the statement “comprises a" does not exclude the presence of additional identical elements in a process, method or apparatus including the stated element.
  • each embodiment may focus on its differences from other embodiments, and the same and similar parts among various embodiments may be referred to each other.
  • the relevant parts can be referred to the description of the method part.
  • the disclosed methods and products can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units may only be a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be combined. Either it can be integrated into another system, or some features can be ignored, or not implemented.
  • the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms.
  • each functional unit in the embodiment of the present disclosure may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more components for implementing the specified logical function(s).
  • Executable instructions may occur out of the order noted in the figures. For example, two consecutive blocks can actually execute essentially in parallel, it They can sometimes be executed in reverse order, depending on the functionality involved.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Physics & Mathematics (AREA)
  • Electromagnetism (AREA)
  • Quality & Reliability (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The present application relates to the technical field of wireless communications. Disclosed is a method for determining an uplink channel. The method is applied to a station (STA) device, and comprises: receiving a trigger instruction, wherein the trigger instruction is used for triggering an STA to sense a channel state; if there is an idle channel, determining an uplink channel according to performance information of the idle channel, and accessing the uplink channel for data transmission; and if there is no idle channel, sensing the channel state again after a set backoff duration. In this way, in the case of communication disorder, etc. in which CSI is unknown or CSI is incomplete, selection of an uplink channel can be completed by means of an STA, such that a relatively large increase in the throughput can be achieved, and the channel utilization rate is maximized, thereby reducing the probability of inter-channel collision, and improving the spectrum efficiency of a system. Further disclosed in the present application are a terminal and a network device.

Description

用于确定上行信道的方法及终端、网络设备Method and terminal and network equipment for determining uplink channel
本申请基于申请号为202211128492.3、申请日为2022年9月16日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。This application is filed based on a Chinese patent application with application number 202211128492.3 and a filing date of September 16, 2022, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is hereby incorporated into this application as a reference.
技术领域Technical field
本申请涉及无线通信技术领域,例如涉及一种用于确定上行信道的方法及终端、网络设备。The present application relates to the field of wireless communication technology, for example, to a method for determining an uplink channel, a terminal, and network equipment.
背景技术Background technique
目前,在多AP间的协同调度的方式主要包括C-OFDMA(Coordinated Orthogonal Frequency-Division Multiple Access,协同正交频分多址)和CBF(Coordinated Beamforming,协同波束赋形)。Currently, the cooperative scheduling methods between multiple APs mainly include C-OFDMA (Coordinated Orthogonal Frequency-Division Multiple Access, Coordinated Orthogonal Frequency Division Multiple Access) and CBF (Coordinated Beamforming, Coordinated Beamforming).
其中,在C-OFDMA中,AP(Wireless Access Point,无线访问接入点)协调为所有STA(Station,终端)共享OFDMA(Orthogonal Frequency-Division Multiple Access,正交频分多址))资源,并使不同STA采用正交的时间和频率,以避免RU(Resource Unit,频谱资源块)冲突。一方面,将适当的RU分配给STA需要充足的CSI(Channel State Information,信道状态信息)或者充分的信道估计;另一方面,C-OFDMA是通过正交信道来抑制干扰并正确传输,但是在失协的情况下,AP无法保证信道是正交的,因此也就无法通过正交信道抑制干扰。另外,在保证信道正交时,也会降低信道的利用率。而CBF下STA反馈给AP的信道信息是不完整的,从而AP很难对信道进行有效估计。Among them, in C-OFDMA, the AP (Wireless Access Point, wireless access point) coordinates to share OFDMA (Orthogonal Frequency-Division Multiple Access, Orthogonal Frequency Division Multiple Access) resources for all STAs (Station, terminal), and Let different STAs use orthogonal times and frequencies to avoid RU (Resource Unit, spectrum resource block) conflicts. On the one hand, allocating appropriate RUs to STA requires sufficient CSI (Channel State Information) or sufficient channel estimation; on the other hand, C-OFDMA uses orthogonal channels to suppress interference and transmit correctly, but in In the case of loss of coordination, the AP cannot guarantee that the channels are orthogonal, so it cannot suppress interference through orthogonal channels. In addition, when ensuring channel orthogonality, the utilization rate of the channel will also be reduced. However, the channel information fed back by the STA to the AP under CBF is incomplete, making it difficult for the AP to effectively estimate the channel.
此外,在WiFi6中信道估计是在HE-SIG-B字段完成的,这个字段长度是有限的,所以无法实现完整的信道估计。想要获得完整的CSI就要进行多次的传输,这样又会大大降低信道的效率。In addition, in WiFi6, channel estimation is completed in the HE-SIG-B field. The length of this field is limited, so complete channel estimation cannot be achieved. To obtain complete CSI, multiple transmissions are required, which will greatly reduce the efficiency of the channel.
因此,现有的两种方案CBF和C-OFDMA都是需要完整的CSI才可以实现信道的共享,然而在通信失协的情况下,AP很难准确知道其他AP所使用信道的情况,所以很难给出信道估计的情况,也就是说在通信失协的情况下,这两种方案可行性不强。Therefore, the two existing solutions, CBF and C-OFDMA, require complete CSI to achieve channel sharing. However, in the case of communication loss, it is difficult for APs to accurately know the channels used by other APs, so it is very difficult to It is difficult to give channel estimation, that is to say, when communication is out of coordination, these two solutions are not feasible.
发明内容Contents of the invention
为了对披露的实施例的一些方面有基本的理解,下面给出了简单的概括。所述概括不 是泛泛评述,也不是要确定关键/重要组成元素或描绘这些实施例的保护范围,而是作为后面的详细说明的序言。In order to provide a basic understanding of some aspects of the disclosed embodiments, a simplified summary is provided below. The generalization is not This is a general review, and it is not intended to identify key/important constituent elements or delineate the protection scope of these embodiments, but is intended to serve as a prelude to the detailed description that follows.
本公开实施例提供了一种用于确定上行信道的方法及终端、网络设备,从而在未知CSI或CSI信息不完整的情况下,完成信道的分配,以实现系统吞吐量的提升,提高信道的利用率。Embodiments of the present disclosure provide a method, terminals, and network equipment for determining uplink channels, thereby completing channel allocation when the CSI is unknown or the CSI information is incomplete, thereby improving system throughput and improving channel efficiency. Utilization.
在一些实施例中,所述用于确定上行信道的方法,应用于终端设备STA,包括:In some embodiments, the method for determining the uplink channel, applied to terminal equipment STA, includes:
接收触发指令;所述触发指令用于触发STA感知信道状态;Receive a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel status;
若存在空闲信道,则根据空闲信道的性能信息确定上行信道,接入所述上行信道进行数据传输;If an idle channel exists, determine the uplink channel based on the performance information of the idle channel, and access the uplink channel for data transmission;
若不存在空闲信道,则在设定的退避时长后再次感知信道状态。If there is no idle channel, the channel status will be sensed again after the set backoff time.
可选地,所述接入所述上行信道进行数据传输,包括:Optionally, accessing the uplink channel for data transmission includes:
在数据传输成功后,根据传输结果更新对应信道的性能信息,并接收新的数据包到达指示;After the data transmission is successful, the performance information of the corresponding channel is updated according to the transmission result, and the new data packet arrival indication is received;
在数据传输失败后,根据传输结果更新对应信道的性能信息,并重新执行数据传输指令。After the data transmission fails, the performance information of the corresponding channel is updated according to the transmission result, and the data transmission instruction is re-executed.
可选地,所述根据空闲信道的性能信息确定上行信道,包括:Optionally, determining the uplink channel based on the performance information of the idle channel includes:
获取空闲信道的性能信息;Obtain performance information of idle channels;
将成功传输数据包的能力最优的空闲信道,确定为上行信道。The idle channel with the best ability to successfully transmit data packets is determined as the uplink channel.
可选地,所述上行信道的确定,包括:Optionally, the determination of the uplink channel includes:
构建基于强化学习的上行信道选择模型;Construct an uplink channel selection model based on reinforcement learning;
将信道状态信息、性能信息输入到基于强化学习的上行信道选择模型中训练,并获得网络平均吞吐量;Input channel status information and performance information into the uplink channel selection model based on reinforcement learning for training, and obtain the average network throughput;
在网络平均吞吐量达到最大值的情况下,根据所述基于强化学习的上行信道选择模型的输出确定上行信道。When the average network throughput reaches the maximum value, the uplink channel is determined according to the output of the reinforcement learning-based uplink channel selection model.
可选地,基于强化学习的上行信道选择模型的训练,包括:Optionally, the training of the uplink channel selection model based on reinforcement learning includes:
将信道状态信息、性能信息作为强化学习的状态集 Use channel state information and performance information as state sets for reinforcement learning
其中,表示STA在信道上的信道感知权重的集合;表示t时刻第k个STA在第m个信道上的感知权重;in, Represents the set of channel sensing weights of STA on the channel; Represents the sensing weight of the k-th STA on the m-th channel at time t;
表示STA在信道上的数据包传输权重的集合;表示t时刻第k个STA在第m个信道上的数据包传输权重感知权重; Represents the set of data packet transmission weights of STA on the channel; Indicates the perceived weight of the data packet transmission weight of the k-th STA on the m-th channel at time t;
将所述状态集S输入基于强化学习的上行信道选择模型中训练,得到动作集A={f1,f2,...,fM},表示STA采取对应于在M个空闲信道中选择上行信道的动作的集合;The state set S is input into the uplink channel selection model based on reinforcement learning for training, and the action set A={f 1 , f 2 ,..., f M } is obtained, indicating that the STA takes actions corresponding to selecting among M idle channels. A collection of actions on the uplink channel;
根据所述状态集确定奖励参数表示第k个STA在第m个信道传输的即时奖励;Determine reward parameters based on the state set Represents the immediate reward of the k-th STA transmitting on the m-th channel;
根据所述奖励参数Rt训练所述基于强化学习的上行信道选择模型,以得到使奖励参数Rt最大的系统动作对应的信道,作为所述上行信道。The uplink channel selection model based on reinforcement learning is trained according to the reward parameter R t to obtain the channel corresponding to the system action that maximizes the reward parameter R t as the uplink channel.
可选地,所述基于强化学习的上行信道选择模型的建立,包括:
Optionally, the establishment of the uplink channel selection model based on reinforcement learning includes:
其中,Ct表示t时刻的网络平均吞吐量;N表示STA的总数;表示t时刻第k个STA的信干噪比。Among them, C t represents the average network throughput at time t; N represents the total number of STAs; Indicates the signal-to-interference-noise ratio of the kth STA at time t.
可选地,所述根据所述奖励参数Rt训练所述基于强化学习的上行信道选择模型,包括:Optionally, training the reinforcement learning-based uplink channel selection model according to the reward parameter Rt includes:
将如下方式,作为强化学习的更新规则:
Use the following method as the update rule for reinforcement learning:
其中,Qt表示当前状态的Q值,Qt+1表示下一状态时刻的Q值;α表示的强化学习的学习率,取值为(0,1);β表示对历史奖励的重视程度,取值为(0,1);表示即时奖励;maxQt(S′,A′)表示下一时刻所有可能动作策略的最大Q值。Among them, Q t represents the Q value of the current state, Q t+1 represents the Q value of the next state moment; α represents the learning rate of reinforcement learning, with a value of (0,1); β represents the emphasis on historical rewards. , the value is (0,1); represents the immediate reward; maxQ t (S′,A′) represents the maximum Q value of all possible action strategies at the next moment.
在一些实施例中,该用于确定上行信道的方法,应用于接入点AP,包括:In some embodiments, the method for determining an uplink channel, applied to an access point AP, includes:
发送触发指令;所述触发指令用于触发STA感知信道状态;Send a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel status;
接收STA通过上行信道传输的数据;所述上行信道是所述STA根据空闲信道的性能信息确定的。Receive data transmitted by the STA through the uplink channel; the uplink channel is determined by the STA based on the performance information of the idle channel.
在一些实施例中,提供一种终端设备,包括处理器和存储器,所述存储器用于存储计算机程序,所述处理器用于调用并运行所述存储器中存储的程序,执行如上述的用于确定上行信道的方法。In some embodiments, a terminal device is provided, including a processor and a memory, the memory is used to store a computer program, the processor is used to call and run the program stored in the memory, and perform the above-mentioned determination. Upstream channel method.
在一些实施例中,提供一种网络设备,包括处理器和通信接口,通信接口用于与其它网络设备进行通信;处理器用于运行一组程序,以使得所述网络设备实现如上述的用于确定上行信道的方法。In some embodiments, a network device is provided, including a processor and a communication interface. The communication interface is used to communicate with other network devices; the processor is used to run a set of programs, so that the network device implements the above-mentioned functions. Method to determine the upstream channel.
本公开实施例提供的用于确定上行信道的方法、终端及网络设备,可以实现以下技术效果: The method, terminal and network device for determining the uplink channel provided by the embodiments of the present disclosure can achieve the following technical effects:
通过终端STA感知信道状态,并在空闲信道中根据信道性能确定上行信道,进行数据传输。这样,能够在通信失协等未知CSI信息或CSI信息不完整的情况下,通过STA完成上行信道的选择,可以实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,提高系统的频谱效率。The terminal STA senses the channel status and determines the uplink channel based on the channel performance in the idle channel for data transmission. In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
以上的总体描述和下文中的描述仅是示例性和解释性的,不用于限制本申请。The above general description and the following description are exemplary and explanatory only and are not intended to limit the application.
附图说明Description of drawings
一个或多个实施例通过与之对应的附图进行示例性说明,这些示例性说明和附图并不构成对实施例的限定,附图中具有相同参考数字标号的元件示为类似的元件,附图不构成比例限制,并且其中:One or more embodiments are exemplified by corresponding drawings. These exemplary descriptions and drawings do not constitute limitations to the embodiments. Elements with the same reference numerals in the drawings are shown as similar elements. The drawings are not limited to scale and in which:
图1是本公开实施例的环境系统示意图;Figure 1 is a schematic diagram of an environmental system according to an embodiment of the present disclosure;
图2是本公开实施例提供的一个用于确定上行信道的方法的流程示意图;Figure 2 is a schematic flowchart of a method for determining an uplink channel provided by an embodiment of the present disclosure;
图3是本公开实施例提供的另一个用于确定上行信道的方法的流程示意图;Figure 3 is a schematic flowchart of another method for determining an uplink channel provided by an embodiment of the present disclosure;
图4是本公开实施例中基于强化学习的上行信道选择模型的训练过程示意图;Figure 4 is a schematic diagram of the training process of the uplink channel selection model based on reinforcement learning in an embodiment of the present disclosure;
图5是本公开实施例提供的另一个用于确定上行信道的方法的流程示意图;Figure 5 is a schematic flowchart of another method for determining an uplink channel provided by an embodiment of the present disclosure;
图6是本公开实施例提供的另一个用于确定上行信道的方法的流程示意图;Figure 6 is a schematic flowchart of another method for determining an uplink channel provided by an embodiment of the present disclosure;
图7是本公开实施例的一个应用示意图;Figure 7 is an application schematic diagram of an embodiment of the present disclosure;
图8是本公开实施例提供的一个终端设备的示意图;Figure 8 is a schematic diagram of a terminal device provided by an embodiment of the present disclosure;
图9是本公开实施例提供的一个网络设备的示意图。Figure 9 is a schematic diagram of a network device provided by an embodiment of the present disclosure.
具体实施方式Detailed ways
为了能够更加详尽地了解本公开实施例的特点与技术内容,下面结合附图对本公开实施例的实现进行详细阐述,所附附图仅供参考说明之用,并非用来限定本公开实施例。在以下的技术描述中,为方便解释起见,通过多个细节以提供对所披露实施例的充分理解。然而,在没有这些细节的情况下,一个或多个实施例仍然可以实施。在其它情况下,为简化附图,熟知的结构和装置可以简化展示。In order to be able to understand the features and technical contents of the embodiments of the present disclosure in more detail, the implementation of the embodiments of the present disclosure is described in detail below in conjunction with the accompanying drawings. The attached drawings are for reference only and are not used to limit the embodiments of the present disclosure. In the following technical description, for the convenience of explanation, a full understanding of the disclosed embodiments is provided through multiple details. However, one or more embodiments can still be implemented without these details. In other cases, to simplify the drawings, well-known structures and devices can be simplified for display.
本公开实施例的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本公开实施例的实施例。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含。The terms "first", "second", etc. in the description and claims of the embodiments of the present disclosure and the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It should be understood that data so used are interchangeable under appropriate circumstances for the purposes of the embodiments of the disclosure described herein. Furthermore, the terms "including" and "having" and any variations thereof are intended to cover non-exclusive inclusion.
除非另有说明,术语“多个”表示两个或两个以上。 Unless otherwise stated, the term "plurality" means two or more.
本公开实施例中,字符“/”表示前后对象是一种“或”的关系。例如,A/B表示:A或B。In the embodiment of the present disclosure, the character "/" indicates that the preceding and following objects are in an "or" relationship. For example, A/B indicates: A or B.
术语“和/或”是一种描述对象的关联关系,表示可以存在三种关系。例如,A和/或B,表示:A或B,或,A和B这三种关系。The term "and/or" is an association relationship describing objects, indicating that three relationships can exist. For example, A and/or B means: A or B, or A and B.
术语“对应”可以指的是一种关联关系或绑定关系,A与B相对应指的是A与B之间是一种关联关系或绑定关系。The term "correspondence" can refer to an association relationship or a binding relationship. The correspondence between A and B refers to an association relationship or a binding relationship between A and B.
本公开实施例中,AP表示无线接入点,可以是路由器、网关或组合路由器网关。In the embodiment of the present disclosure, AP represents a wireless access point, which may be a router, a gateway or a combined router-gateway.
STA表示用户终端,可以是经由通信连接功能连接到AP以获得对AP系统资源(如网络)的接入的移动终端或站。可以是蜂窝电话、无绳电话、会话启动协议(Session Initiation Protocol,SIP)电话、无线本地环路(Wireless Local Loop,WLL)站、个人数字助理(Personal Digital Assistant,PDA)设备、具有无线通信功能的手持设备、计算设备或连接到无线调制解调器的其它处理设备、车载设备、可穿戴设备、下一代通信系统例如NR网络中的终端设备,或者未来演进的公共陆地移动网络(Public Land Mobile Network,PLMN)网络中的终端设备等。也可以是手机(Mobile Phone)、平板电脑(Pad)、带无线收发功能的电脑、虚拟现实(Virtual Reality,VR)终端设备、增强现实(Augmented Reality,AR)终端设备、工业控制(industrial control)中的无线终端设备、无人驾驶(self driving)中的无线终端设备、远程医疗(remote medical)中的无线终端设备、智能电网(smart grid)中的无线终端设备、运输安全(transportation safety)中的无线终端设备、智慧城市(smart city)中的无线终端设备或智慧家庭(smart home)中的无线终端设备等。STA represents a user terminal, which may be a mobile terminal or station connected to the AP via a communication connection function to obtain access to AP system resources (eg, network). It can be a cellular phone, a cordless phone, a Session Initiation Protocol (SIP) phone, a Wireless Local Loop (WLL) station, a Personal Digital Assistant (Personal Digital Assistant, PDA) device, a device with wireless communication capabilities Handheld devices, computing devices or other processing devices connected to wireless modems, vehicle-mounted devices, wearable devices, next-generation communication systems such as terminal devices in NR networks, or future evolved Public Land Mobile Networks (PLMN) Terminal equipment in the network, etc. It can also be a mobile phone (Mobile Phone), tablet computer (Pad), computer with wireless transceiver function, virtual reality (Virtual Reality, VR) terminal equipment, augmented reality (Augmented Reality, AR) terminal equipment, industrial control (industrial control) Wireless terminal equipment in self-driving (self driving), wireless terminal equipment in remote medical (remote medical), wireless terminal equipment in smart grid (smart grid), transportation safety (transportation safety) Wireless terminal equipment, wireless terminal equipment in smart city (smart city) or wireless terminal equipment in smart home (smart home), etc.
图1表示本公开实施例提供的环境系统示意图。Figure 1 shows a schematic diagram of an environmental system provided by an embodiment of the present disclosure.
结合图1所示,该环境系统包括多个AP和多个STA。As shown in Figure 1, the environmental system includes multiple APs and multiple STAs.
在上述多个系统中,每个AP可以接入一个或多个STA;每个STA也可以接入一个或多个AP。In the multiple systems mentioned above, each AP can access one or more STAs; each STA can also access one or more APs.
示例地,一个AP可以与一个STA相连接,并建立至少两条信道。如图1中的STA1与AP1;一个AP也可以与两个STA相连接,并建立至少两条信道。如图1中的STA2与AP2、AP3。For example, an AP can connect to a STA and establish at least two channels. As shown in Figure 1, STA1 and AP1; one AP can also be connected to two STAs and establish at least two channels. As shown in Figure 1, STA2, AP2, and AP3.
每个STA可以获取其与所接入的AP之间的一个或多个信道的信道感知信息及数据传输信息。各STA之间还可感测获取组内其他AP对本STA的干扰功率。Each STA can obtain channel sensing information and data transmission information of one or more channels between it and the accessed AP. Each STA can also sense and obtain the interference power of other APs in the group to the STA.
在现有技术中,虽然一个STA可以连接多个AP,该STA可以和多个AP之间建立多条双工信道,但是每条信道都需要负责上行和下行的数据承载。信道发送数据之前,STA和AP会进行载波侦听多址接入(carrier sense multiple access,CSMA)/增强分布式信道接入 (enhanced distributed channel access,EDCA)退避,数据发送后可能会发生空口碰撞,如果发生空口碰撞,则数据发送失败,需要重新发送。当信道上下行均有数据发送时,如果AP或STA正在发送上行数据,则下行需要等待上行发送完毕,才能进行数据发送。如果正在发送下行数据,则上行需要等待下行发送完毕,才能进行数据发送。因此,可能会导致上下行数据发生碰撞,且数据发送的等待时延长,影响系统的信道利用率和数据吞吐量。In the existing technology, although one STA can connect to multiple APs and establish multiple duplex channels with multiple APs, each channel needs to be responsible for carrying uplink and downlink data. Before sending data on the channel, the STA and AP will perform carrier sense multiple access (CSMA)/enhanced distributed channel access. (enhanced distributed channel access, EDCA) backoff, an air interface collision may occur after data is sent. If an air interface collision occurs, the data transmission fails and needs to be resent. When data is sent both uplink and downlink on the channel, if the AP or STA is sending uplink data, the downlink needs to wait for the uplink transmission to complete before data can be sent. If downlink data is being sent, the uplink needs to wait for the downlink sending to complete before data can be sent. Therefore, uplink and downlink data may collide, and the waiting time for data transmission may be extended, affecting the channel utilization and data throughput of the system.
在本实施例中的上行信道负责从STA向AP传输信息数据。The uplink channel in this embodiment is responsible for transmitting information data from the STA to the AP.
空闲信道是指接入的多信道中未被占用的信道。一般地,可以通过搜索发出空闲信号的信道,或搜索无载波信道在接入的信道中确定空闲信道。An idle channel refers to an unoccupied channel among the multiple channels accessed. Generally, the idle channel can be determined among the accessed channels by searching for a channel that emits an idle signal, or by searching for a carrier-free channel.
通过搭建上述的网络架构,以及AP与STA间的信息交互、数据处理,获取在信道状态难以估计的情况下,基于该用于确定上行信道的方法,完成信道的分配。以实现系统吞吐量的提升,提高信道的利用率。By building the above network architecture, as well as information interaction and data processing between AP and STA, when the channel status is difficult to estimate, channel allocation is completed based on this method for determining the uplink channel. In order to improve the system throughput and improve channel utilization.
在一些情况下,上述的环境系统还可以包括网络控制器、移动管理实体等其他网络实体,本申请实施例对此不作限定。In some cases, the above-mentioned environment system may also include other network entities such as network controllers and mobility management entities, which are not limited in the embodiments of the present application.
基于上述的环境系统,本公开实施例提供一种用于确定上行信道的方法,使得STA能够在所分配的资源中确定上行信道,并接入上传数据。Based on the above environmental system, embodiments of the present disclosure provide a method for determining an uplink channel, so that the STA can determine the uplink channel in the allocated resources and access the upload data.
如图2所示,该方法应用于终端设备STA,包括:As shown in Figure 2, this method is applied to terminal equipment STA, including:
步骤S201,STA接收触发指令;触发指令用于触发STA感知信道状态。Step S201: The STA receives a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel state.
这里,触发指令用于告知STA数据包到达,可以进行感知信道状态,以传输数据包。Here, the trigger command is used to inform the STA that the data packet has arrived, and it can sense the channel status to transmit the data packet.
步骤S202,若存在空闲信道,则STA根据空闲信道的性能信息确定上行信道,接入上行信道进行数据传输。Step S202: If there is an idle channel, the STA determines the uplink channel based on the performance information of the idle channel, and accesses the uplink channel for data transmission.
步骤S203,若不存在空闲信道,则STA在设定的退避时长后再次感知信道状态。Step S203: If there is no idle channel, the STA senses the channel status again after the set backoff time.
这样,当有数据包到达时,STA开始感知信道状态,如果没有感知到空闲信道,则数据包退避,继续感知;如果感知到空闲状态,则根据空闲状态的性能信息在空闲状态中确定上行信道,以进行数据传输。能够在通信失协等未知CSI信息或CSI信息不完整的情况下,通过STA完成上行信道的选择,可以实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,提高系统的频谱效率。In this way, when a data packet arrives, the STA begins to sense the channel status. If no idle channel is sensed, the data packet backs off and continues sensing; if the idle state is sensed, the uplink channel is determined in the idle state based on the performance information of the idle state. , for data transmission. When the CSI information is unknown or incomplete, such as communication loss, the STA can complete the uplink channel selection, which can achieve greater throughput improvement, maximize channel utilization, reduce the probability of collision between channels, and improve Spectral efficiency of the system.
可选地,根据空闲信道的性能信息确定上行信道,包括:Optionally, determine the uplink channel based on the performance information of the idle channel, including:
获取空闲信道的性能信息;Obtain performance information of idle channels;
将成功传输数据包的能力最优的空闲信道,确定为上行信道。The idle channel with the best ability to successfully transmit data packets is determined as the uplink channel.
这里,成功传输数据包的能力可以通过数据包历史传输成功率和/或信道感知权重确定。 Here, the ability to successfully transmit a data packet may be determined by the historical transmission success rate of the data packet and/or the channel sensing weight.
例如,将数据包历史传输成功率最高的信道,确定为上行信道。For example, the channel with the highest historical data packet transmission success rate is determined as the uplink channel.
这样,通过获取一个或多个空闲信道性能信息中的历史传输数据,即可确定历史传输成功率。历史传输成功率越高,发生信道碰撞等影响传输质量的可能越低。In this way, by obtaining historical transmission data in one or more idle channel performance information, the historical transmission success rate can be determined. The higher the historical transmission success rate, the lower the possibility of channel collisions affecting transmission quality.
又例如,将信道感知权重最高的信道,确定为上行信道。For another example, the channel with the highest channel sensing weight is determined as the uplink channel.
信道感知权重,一般可以通过STA在对应信道上利用频谱感知算法计算获取,用于表示信道质量。The channel sensing weight can generally be calculated and obtained by the STA using the spectrum sensing algorithm on the corresponding channel, and is used to represent the channel quality.
近年来,基于强化学习的研究越来越广泛,强化学习是一种在线学习算法,智能体和外部环境通过奖励机制进行交互,根据在环境中得到的奖励值来调整自身行为,使得智能体学习并适应外部环境,促使智能体选择能够在环境中获得使自身最大奖励的行为。可将强化学习的这种学习并适应于外部环境的特性应用于STA与AP间的信道选择中,使STA作为智能体对变化的信道状态进行学习,最终在空闲信道中选择成功传输数据包的能力最优的空闲信道作为上行信道,以减少信道状态扫描开销并提高信道检测概率。达到实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,提高系统的频谱效率的目的。In recent years, research based on reinforcement learning has become more and more extensive. Reinforcement learning is an online learning algorithm. The agent interacts with the external environment through a reward mechanism and adjusts its behavior according to the reward value obtained in the environment, allowing the agent to learn. And adapt to the external environment, prompting the agent to choose the behavior that can obtain the maximum reward for itself in the environment. The characteristics of reinforcement learning and adapting to the external environment can be applied to the channel selection between the STA and the AP, so that the STA can learn the changing channel status as an agent, and finally select the one that successfully transmits the data packet in the idle channel. The idle channel with the best capability is used as the uplink channel to reduce channel status scanning overhead and improve channel detection probability. This achieves the purpose of achieving greater throughput improvement, maximizing channel utilization, reducing the probability of collisions between channels, and improving the spectrum efficiency of the system.
下面,结合具体方案对上述方案进行说明。Below, the above solutions will be described with reference to specific solutions.
如图3所示,本公开实施例提供一种用于确定上行信道的方法,应用于图1中的STA,以通过强化学习的数据处理方式,确定STA与AP间的上行信道。该方法包括:As shown in Figure 3, an embodiment of the present disclosure provides a method for determining an uplink channel, which is applied to the STA in Figure 1 to determine the uplink channel between the STA and the AP through the data processing method of reinforcement learning. The method includes:
步骤S301,STA接收触发指令;触发指令用于触发STA感知信道状态。Step S301: The STA receives a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel state.
步骤S302,在存在空闲信道的情况下,根据网络平均吞吐量优化问题,构建基于强化学习的上行信道选择模型。Step S302: When there is an idle channel, construct an uplink channel selection model based on reinforcement learning based on the network average throughput optimization problem.
若不存在空闲信道,则STA在设定的退避时长后再次感知信道状态。If there is no idle channel, the STA senses the channel status again after the set backoff time.
这里,所构建的基于强化学习的上行信道选择模型,包括状态集、动作集和奖励函数。Here, the uplink channel selection model based on reinforcement learning is constructed, including a state set, an action set and a reward function.
可选地,该基于强化学习的上行信道选择模型的建立,包括:
Optionally, the establishment of the uplink channel selection model based on reinforcement learning includes:
其中,Ct表示t时刻的网络平均吞吐量;N表示STA的总数;表示t时刻第k个STA的信干噪比。其中,信干噪比是指系统中信号与干扰和噪声之和的比。Among them, C t represents the average network throughput at time t; N represents the total number of STAs; Indicates the signal-to-interference-noise ratio of the kth STA at time t. Among them, the signal-to-interference-to-noise ratio refers to the ratio of the signal to the sum of interference and noise in the system.
如此,将STA与AP间的信道信息用于上行信道选择模型的建立,使得选择上行信道时,能够结合各信道的状态,以使得系统吞吐量达到要求。In this way, the channel information between the STA and the AP is used to establish the uplink channel selection model, so that when selecting the uplink channel, the status of each channel can be combined to ensure that the system throughput meets the requirements.
步骤S303,以网络平均吞吐量最大为目标,将信道状态信息、性能信息输入到基于强化学习的上行信道选择模型中训练,并获得网络平均吞吐量。 Step S303: With the goal of maximizing the average network throughput, the channel state information and performance information are input into the uplink channel selection model based on reinforcement learning for training, and the average network throughput is obtained.
可选地,该基于强化学习的上行信道选择模型的训练,包括:Optionally, the training of the reinforcement learning-based uplink channel selection model includes:
将信道状态信息、性能信息作为强化学习的状态集 Use channel state information and performance information as state sets for reinforcement learning
其中,表示STA在信道上的信道感知权重的集合;表示t时刻第k个STA在第m个信道上的感知权重;in, Represents the set of channel sensing weights of STA on the channel; Represents the sensing weight of the k-th STA on the m-th channel at time t;
表示STA在信道上的数据包传输权重的集合;表示t时刻第k个STA在第m个信道上的数据包传输权重感知权重; Represents the set of data packet transmission weights of STA on the channel; Indicates the perceived weight of the data packet transmission weight of the k-th STA on the m-th channel at time t;
将状态集S输入基于强化学习的上行信道选择模型中训练,得到动作集A={f1,f2,...,fM},表示STA采取对应于在M个空闲信道中选择上行信道的动作的集合;The state set S is input into the uplink channel selection model based on reinforcement learning for training, and the action set A={f 1 , f 2 ,..., f M } is obtained, indicating that the STA takes steps corresponding to selecting the uplink channel among M idle channels. a collection of actions;
根据状态集确定奖励参数表示t时刻第k个STA在第m个信道传输的即时奖励;Determine reward parameters based on state set Represents the immediate reward of the k-th STA transmitting on the m-th channel at time t;
根据奖励参数Rt训练基于强化学习的上行信道选择模型,以得到使奖励参数Rt最大的系统动作对应的信道,作为上行信道。The uplink channel selection model based on reinforcement learning is trained according to the reward parameter R t to obtain the channel corresponding to the system action that maximizes the reward parameter R t as the uplink channel.
这里,奖励参数Rt用于表示t时刻所选择的上行信道的感知权重和信道传输权重的平均值。Here, the reward parameter R t is used to represent the average value of the perception weight and channel transmission weight of the selected uplink channel at time t.
进一步地,根据奖励参数Rt训练基于强化学习的资源分配决策模型,包括:将如下方式,作为强化学习的更新规则:
Further, training a resource allocation decision-making model based on reinforcement learning based on the reward parameter R t includes: using the following method as the update rule for reinforcement learning:
其中,Qt表示当前状态的Q值,Qt+1表示下一状态时刻的Q值;α表示的强化学习的学习率,取值为(0,1);β表示对历史奖励的重视程度,取值为(0,1);表示即时奖励;maxQt(S′,A′)表示下一时刻所有可能动作策略的最大Q值。Among them, Q t represents the Q value of the current state, Q t+1 represents the Q value of the next state moment; α represents the learning rate of reinforcement learning, with a value of (0,1); β represents the emphasis on historical rewards. , the value is (0,1); represents the immediate reward; maxQ t (S′,A′) represents the maximum Q value of all possible action strategies at the next moment.
图4中示出了本公开实施例中强化学习训练的示意图,以对上述步骤进行说明。Figure 4 shows a schematic diagram of reinforcement learning training in an embodiment of the present disclosure to illustrate the above steps.
本实施例中的强化学习,采用Q-Learning算法。通过智能体在环境中执行动作获得一定的奖励来感知环境,从而学习到由状态到动作的映射策略来最大化奖励值。The reinforcement learning in this embodiment uses the Q-Learning algorithm. The agent performs actions in the environment to obtain certain rewards to perceive the environment, thereby learning a mapping strategy from state to action to maximize the reward value.
在图4中,将STA作为强化学习的代理,作为智能体进行数据处理。根据STA接收的AP之间的相互干扰信息及信道空闲情况,利用强化学习算法实现合理有效的上行信道选择。通过智能体STA与环境不断交互的过程,从环境中得到反馈,进而改变智能体STA的动作,实现上行信道选择动作的调整。In Figure 4, STA is used as an agent for reinforcement learning and performs data processing as an intelligent agent. Based on the mutual interference information between APs and channel idle conditions received by the STA, the reinforcement learning algorithm is used to achieve reasonable and effective uplink channel selection. Through the process of continuous interaction between the agent STA and the environment, feedback is obtained from the environment, and then the action of the agent STA is changed to realize the adjustment of the uplink channel selection action.
具体地,STA首先获取AP之间相互干扰信息及信道空闲情况,作为信道性能信息和 状态信息S0,智能体STA在S0环境下采取行动A0作为信道选择决策,并反馈给环境中的AP。这里,STA所采取的动作可以根据贪婪策略进行选择。Specifically, STA first obtains mutual interference information between APs and channel idle conditions as channel performance information and State information S 0 , the agent STA takes action A 0 in the S 0 environment as a channel selection decision, and feeds it back to the AP in the environment. Here, the actions taken by STA can be selected according to the greedy strategy.
智能体STA进行信道选择决策后,根据所选择的上行信道进行接入及数据传输。根据系统吞吐量确定奖励参数R1,向STA进行反馈;并向STA发送包含AP之间的相互干扰信息及信道空闲情况的下一状态S1。STA收到奖励参数R1和环境状态的S1后,根据强化学习的更新规则,更新Q值表,并对环境采取行动A1作为上行信道选择决策。环境状态接收到动作A1后由状态S1变化到S2,并且反馈奖励参数R2。即STA得到奖励参数R2、状态S2,更新Q值表,采取A2行动;得到奖励参数R3、状态S3,更新Q值表,采取A3行动。以此进行循环,直到系统吞吐量最大,即奖励参数Rt达到最大。最终达到减少干扰,提高吞吐量的目的。After the agent STA makes a channel selection decision, it performs access and data transmission according to the selected uplink channel. Determine the reward parameter R 1 based on the system throughput and provide feedback to the STA; and send the next state S 1 including mutual interference information between APs and channel idle conditions to the STA. After receiving the reward parameter R 1 and the environment status S 1 , the STA updates the Q value table according to the update rules of reinforcement learning, and takes action A 1 to the environment as an uplink channel selection decision. After receiving action A 1 , the environment state changes from state S 1 to S 2 , and the reward parameter R 2 is fed back. That is, STA gets the reward parameter R 2 and state S 2 , updates the Q value table, and takes action A 2 ; gets the reward parameter R 3 and state S 3 , updates the Q value table, and takes action A 3 . This cycle is continued until the system throughput reaches the maximum, that is, the reward parameter Rt reaches the maximum. Ultimately, the purpose of reducing interference and improving throughput is achieved.
通过Q值表的更新,表中对每条信道都用一个Q值代表其信道传输质量的高低;在有数据包到达时,STA开始感知空闲信道,如果没有感知到空闲信道,则数据包退避,继续感知;如果感知到空闲信道,利用Q-Learning机制对上行信道选择策略进行学习。Q-learning学习过程包括:根据上一状态St决定此时刻的动作At+1,然后更新状态St+1,并且会反馈一个奖励Rt。STA通过学习会在空闲信道中选择一个传输质量最好的信道进行传输,这里的传输质量的衡量是历史传输数据包成功率的高低。并根据奖励参数更新Q值表中的Q值。这样,根据Q值对信道进行排序,可得到信道传输质量排序列表。当接收数据包达到信息后,STA可通过Q-Learning按照贪婪决策策略来采取行动,即以ε的概率在空闲信道中进行选择,最终确定上行信道。Through the update of the Q value table, a Q value is used for each channel in the table to represent the level of channel transmission quality; when a data packet arrives, the STA begins to sense the idle channel. If no idle channel is sensed, the data packet backs off. , continue sensing; if an idle channel is sensed, use the Q-Learning mechanism to learn the uplink channel selection strategy. The Q-learning learning process includes: determining the action A t+1 at this moment based on the previous state S t , then updating the state S t+1 , and feeding back a reward R t . Through learning, the STA will select a channel with the best transmission quality among the idle channels for transmission. The transmission quality here is measured by the success rate of historical transmission data packets. And update the Q value in the Q value table according to the reward parameters. In this way, by sorting the channels according to the Q value, a sorted list of channel transmission quality can be obtained. When the received data packet reaches the information, the STA can take action according to the greedy decision-making strategy through Q-Learning, that is, select among the idle channels with probability ε, and finally determine the uplink channel.
步骤S304,在网络平均吞吐量达到最大值的情况下,根据基于强化学习的上行信道选择模型的输出确定上行信道。Step S304: When the average network throughput reaches the maximum value, determine the uplink channel according to the output of the uplink channel selection model based on reinforcement learning.
将奖励参数Rt达到最大值时,系统对应的动作At作为最优策略,以确定相应的上行信道选择动作。When the reward parameter R t reaches the maximum value, the system's corresponding action A t is used as the optimal strategy to determine the corresponding uplink channel selection action.
步骤S305,接入上行信道进行数据传输。Step S305: accessing an uplink channel for data transmission.
如此,通过基于强化学习的上行信道选择模型,通过终端STA感知信道状态以及空闲信道的数量来做出决策,选择信道质量最高的信道进行数据传输,并将奖励反馈给环境同时更新下一状态。以在空闲信道中根据信道性能确定上行信道,进行数据传输。这样,能够在通信失协等未知CSI信息或CSI信息不完整的情况下,通过STA完成上行信道的选择,可以实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,提高系统的频谱效率。In this way, through the uplink channel selection model based on reinforcement learning, the terminal STA makes decisions by sensing the channel status and the number of idle channels, selects the channel with the highest channel quality for data transmission, and feeds the reward back to the environment while updating the next state. The uplink channel is determined based on channel performance in the idle channel for data transmission. In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
图5示出了一种用于确定上行信道的方法,以说明在有数据包到达时感知信道情况, 并利用强化学习选择所要接入的信道以完成向AP的上行传输的步骤。Figure 5 shows a method for determining the uplink channel to illustrate the perceived channel conditions when a data packet arrives, And use reinforcement learning to select the channel to be accessed to complete the uplink transmission to the AP.
如图5所示,本公开实施例提供一种用于确定上行信道的方法,应用于图1中的STA,以通过强化学习的数据处理方式,确定STA与AP间的上行信道。该方法包括:As shown in Figure 5, an embodiment of the present disclosure provides a method for determining an uplink channel, which is applied to the STA in Figure 1 to determine the uplink channel between the STA and the AP through the data processing method of reinforcement learning. The method includes:
步骤S501,STA接收触发指令;触发指令包括数据包到达指示。Step S501: The STA receives a trigger instruction; the trigger instruction includes a data packet arrival indication.
步骤S502,STA感知是否存在空闲信道。Step S502: The STA senses whether there is an idle channel.
步骤S503,若不存在空闲信道,则进行数据包退避,并在设定的退避时长后再次感知信道状态。设定的退避时长服从均值为λ的随机分布确定。Step S503: If there is no idle channel, data packet backoff is performed, and the channel status is sensed again after the set backoff time. The set backoff duration is determined by a random distribution with mean λ.
步骤S504,若存在空闲信道,利用Q-learning算法通过基于强化学习的上行信道选择模型输出信道选择决策,作为上行信道。Step S504: If there is an idle channel, use the Q-learning algorithm to output the channel selection decision as the uplink channel through the uplink channel selection model based on reinforcement learning.
步骤S505,接入上行信道进行数据传输。并根据所选择的信道动作、以及选择后系统吞吐量变化更新步骤S504中的动作集和奖励参数。Step S505: Access the uplink channel for data transmission. And the action set and reward parameters in step S504 are updated according to the selected channel action and the system throughput change after the selection.
步骤S506,在数据传输成功后,根据传输结果更新步骤S504中Q-learning算法所需的信息。根据传输结果更新步骤S504中的状态集,并返回步骤S501接收新的数据包到达指示。Step S506: After the data transmission is successful, the information required by the Q-learning algorithm in step S504 is updated according to the transmission result. Update the status set in step S504 according to the transmission result, and return to step S501 to receive a new data packet arrival indication.
步骤S507,在数据传输失败后,根据传输结果更新步骤S504中Q-learning算法所需的信息,并返回步骤S502重新执行数据传输指令。根据传输结果更新步骤S504中的状态集。Step S507: After the data transmission fails, update the information required by the Q-learning algorithm in step S504 according to the transmission result, and return to step S502 to re-execute the data transmission instruction. The status set in step S504 is updated according to the transmission result.
如此,通过基于强化学习的上行信道选择模型,通过终端STA感知信道状态以及空闲信道的数量来做出决策,选择信道质量最高的信道进行数据传输,并将奖励反馈给环境同时更新下一状态。并在做出决策以后继续对环境状态进行更新。将数据包传输后存在的两种情况更新到Q-learning学习过程中。在传输成功后,在更新环境状态之后,本次数据传输结束,等待新的数据包到达,进入下一轮数据传输;在传输失败后,在更新环境状态之后,要进入重传机制,重新感知信道,以进行数据包的传输。这样,能够在通信失协等未知CSI信息或CSI信息不完整的情况下,通过STA完成上行信道的选择,可以实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,提高系统的频谱效率。In this way, through the uplink channel selection model based on reinforcement learning, the terminal STA makes decisions by sensing the channel status and the number of idle channels, selects the channel with the highest channel quality for data transmission, and feeds the reward back to the environment while updating the next state. And continue to update the environment status after making a decision. Update the two situations that exist after data packet transmission into the Q-learning learning process. After the transmission is successful, after updating the environment status, this data transmission ends, waiting for the arrival of new data packets, and entering the next round of data transmission; after the transmission fails, after updating the environment status, it is necessary to enter the retransmission mechanism and re-sense. channel for data packet transmission. In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
图6示出了一种用于确定上行信道的方法,应用于图1所示环境系统中的AP,包括:Figure 6 shows a method for determining the uplink channel, applied to the AP in the environment system shown in Figure 1, including:
步骤S601,AP发送触发指令;触发指令用于触发STA感知信道状态。Step S601, the AP sends a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel status.
这里,AP通过向STA发送触发指令,以获取STA反馈的数据缓存信息,并触发STA进行信道状态感知,从而进行数据传输。AP可以发送BSRP缓存状态报告轮询帧(Buffer Status Report Poll,BSRP)来使STA发送缓存状态报告帧(Buffer Status Report,BSR)。 Here, the AP sends a trigger command to the STA to obtain the data cache information fed back by the STA, and triggers the STA to sense the channel status for data transmission. The AP can send a BSRP buffer status report poll frame (Buffer Status Report Poll, BSRP) to cause the STA to send a buffer status report frame (Buffer Status Report, BSR).
步骤S602,AP接收STA通过上行信道传输的数据;上行信道是STA根据空闲信道的性能信息确定的。Step S602: The AP receives the data transmitted by the STA through the uplink channel; the uplink channel is determined by the STA based on the performance information of the idle channel.
在接收到STA通过上行信道传输的数据后,AP还向STA发送确认字符(Acknowledge character,ACK)以表示接收到上传数据。After receiving the data transmitted by the STA through the uplink channel, the AP also sends an acknowledgment character (ACK) to the STA to indicate receipt of the uploaded data.
这样,能够在通信失协等未知CSI信息或CSI信息不完整的情况下,通过STA完成上行信道的选择,可以实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,提高系统的频谱效率。In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , improve the spectral efficiency of the system.
图7示出了一种用于确定上行信道的方法的应用示意图。Figure 7 shows an application diagram of a method for determining an uplink channel.
在该实际应用中,该用于确定上行信道的方法包括以下步骤:In this practical application, the method for determining the uplink channel includes the following steps:
步骤S701,AP向STA发送BRSP,请求获取STA的数据缓存信息;Step S701, the AP sends a BRSP to the STA, requesting to obtain the STA's data cache information;
步骤S702,STA向AP发送BSR,反馈数据缓存信息;Step S702, the STA sends a BSR to the AP to feed back the data cache information;
步骤S703,STA感知当前所有信道状态,如果感知到有多个空闲信道,则进入Q-learning学习过程环境,选择成功传输数据包的能力最优的空闲信道作为上行信道。如果没有信道空闲,数据包将退避一段时间再进行传输,退避时间是服从均值为λ的随机分布。Step S703: The STA senses the current status of all channels. If it senses that there are multiple idle channels, it enters the Q-learning learning process environment and selects the idle channel with the best ability to successfully transmit data packets as the uplink channel. If there is no channel idle, the data packet will back off for a period of time before being transmitted. The back off time is subject to a random distribution with mean λ.
步骤S704,STA接入上行信道并传输数据。Step S704: The STA accesses the uplink channel and transmits data.
步骤S705,AP接收到STA传输的数据,向STA发送ACK以表示接收。Step S705: The AP receives the data transmitted by the STA and sends an ACK to the STA to indicate receipt.
如此,通过终端STA感知信道状态,并在空闲信道中根据信道性能确定上行信道,进行数据传输。这样,能够在通信失协等未知CSI信息或CSI信息不完整的情况下,通过STA完成上行信道的选择,可以实现较大的吞吐量提升,并最大化信道利用率,降低信道间碰撞的概率,降低多AP间干扰对数据传输的影响,提高系统的频谱效率。In this way, the terminal STA senses the channel status, determines the uplink channel based on the channel performance in the idle channel, and performs data transmission. In this way, in the case of unknown CSI information or incomplete CSI information such as communication loss, the uplink channel selection can be completed through the STA, which can achieve a greater throughput improvement, maximize channel utilization, and reduce the probability of collision between channels. , reduce the impact of interference between multiple APs on data transmission, and improve the spectrum efficiency of the system.
结合图8所示,本公开实施例提供一种终端设备,包括处理器800和存储器801。存储器801用于存储计算机程序,处理器800用于调用并运行存储器中存储的程序,执行如上述的用于确定上行信道的方法。As shown in FIG. 8 , an embodiment of the present disclosure provides a terminal device, including a processor 800 and a memory 801 . The memory 801 is used to store computer programs, and the processor 800 is used to call and run the programs stored in the memory, and perform the above-mentioned method for determining the uplink channel.
可选地,该设备还包括通信接口802和总线803。通信接口802用于与其它网络设备进行通信;处理器800、通信接口802、存储器801可以通过总线803完成相互间的通信。Optionally, the device also includes a communication interface 802 and a bus 803. The communication interface 802 is used to communicate with other network devices; the processor 800, the communication interface 802, and the memory 801 can communicate with each other through the bus 803.
结合图9所示,本公开实施例提供一种网络设备,包括处理器900和存储器901。存储器901用于存储计算机程序,处理器900用于调用并运行存储器中存储的程序,执行如上述的用于确定上行信道的方法。As shown in FIG. 9 , an embodiment of the present disclosure provides a network device, including a processor 900 and a memory 901 . The memory 901 is used to store computer programs, and the processor 900 is used to call and run the programs stored in the memory, and perform the above-mentioned method for determining the uplink channel.
可选地,该设备还包括通信接口902和总线903。通信接口902用于与其它网络设备进行通信;处理器900、通信接口902、存储器901可以通过总线903完成相互间的通信。 Optionally, the device also includes a communication interface 902 and a bus 903. The communication interface 902 is used to communicate with other network devices; the processor 900, the communication interface 902, and the memory 901 can communicate with each other through the bus 903.
此外,上述的存储器901中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。In addition, the above-mentioned logical instructions in the memory 901 can be implemented in the form of software functional units and can be stored in a computer-readable storage medium when sold or used as an independent product.
存储器901作为一种计算机可读存储介质,可用于存储软件程序、计算机可执行程序,如本公开实施例中的方法对应的程序指令/模块。处理器900通过运行存储在存储器901中的程序指令/模块,从而执行功能应用以及数据处理,即实现上述实施例中用于确定上行信道的方法。As a computer-readable storage medium, the memory 901 can be used to store software programs, computer-executable programs, such as program instructions/modules corresponding to the methods in the embodiments of the present disclosure. The processor 900 executes program instructions/modules stored in the memory 901 to execute functional applications and data processing, that is, to implement the method for determining the uplink channel in the above embodiment.
存储器901可包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序;存储数据区可存储根据终端设备的使用所创建的数据等。此外,存储器901可以包括高速随机存取存储器,还可以包括非易失性存储器。The memory 901 may include a stored program area and a stored data area, where the stored program area may store an operating system and at least one application program required for a function; the stored data area may store data created according to the use of the terminal device, etc. In addition, the memory 901 may include high-speed random access memory and may also include non-volatile memory.
本公开实施例提供了一种计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令设置为执行上述用于确定上行信道的方法。Embodiments of the present disclosure provide a computer-readable storage medium that stores computer-executable instructions, and the computer-executable instructions are configured to execute the above method for determining an uplink channel.
本公开实施例提供了一种计算机程序产品,所述计算机程序产品包括存储在计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行上述用于确定上行信道的方法。Embodiments of the present disclosure provide a computer program product. The computer program product includes a computer program stored on a computer-readable storage medium. The computer program includes program instructions. When the program instructions are executed by a computer, the The computer executes the above method for determining the uplink channel.
本公开实施例提供了一种计算机程序,当所述计算机程序被计算机执行时,使所述计算机实现上述用于确定上行信道的方法。An embodiment of the present disclosure provides a computer program that, when executed by a computer, causes the computer to implement the above method for determining an uplink channel.
上述的计算机可读存储介质可以是暂态计算机可读存储介质,也可以是非暂态计算机可读存储介质。The above-mentioned computer-readable storage medium may be a transient computer-readable storage medium or a non-transitory computer-readable storage medium.
本公开实施例的技术方案可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括一个或多个指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本公开实施例所述方法的全部或部分步骤。而前述的存储介质可以是非暂态存储介质,包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等多种可以存储程序代码的介质,也可以是暂态存储介质。The technical solution of the embodiments of the present disclosure may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes one or more instructions to enable a computer device (which may be a personal computer, a server, or a network equipment, etc.) to perform all or part of the steps of the method described in the embodiments of the present disclosure. The aforementioned storage media can be non-transitory storage media, including: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk, etc. A medium that can store program code or a temporary storage medium.
以上描述和附图充分地示出了本公开的实施例,以使本领域的技术人员能够实践它们。其他实施例可以包括结构的、逻辑的、电气的、过程的以及其他的改变。实施例仅代表可能的变化。除非明确要求,否则单独的部件和功能是可选的,并且操作的顺序可以变化。一些实施例的部分和特征可以被包括在或替换其他实施例的部分和特征。而且,本申请中使用的用词仅用于描述实施例并且不用于限制权利要求。如在实施例以及权利要求的描述中使用的,除非上下文清楚地表明,否则单数形式的“一个”(a)、“一个”(an)和“所述”(the)旨在同样包括复数形式。类似地,如在本申请中所使用的术语“和/或”是指包含一个 或一个以上相关联的列出的任何以及所有可能的组合。另外,当用于本申请中时,术语“包括”(comprise)及其变型“包括”(comprises)和/或包括(comprising)等指陈述的特征、整体、步骤、操作、元素,和/或组件的存在,但不排除一个或一个以上其它特征、整体、步骤、操作、元素、组件和/或这些的分组的存在或添加。在没有更多限制的情况下,由语句“包括一个…”限定的要素,并不排除在包括所述要素的过程、方法或者设备中还存在另外的相同要素。本文中,每个实施例重点说明的可以是与其他实施例的不同之处,各个实施例之间相同相似部分可以互相参见。对于实施例公开的方法、产品等而言,如果其与实施例公开的方法部分相对应,那么相关之处可以参见方法部分的描述。The foregoing description and drawings illustrate embodiments of the disclosure sufficiently to enable those skilled in the art to practice them. Other embodiments may incorporate structural, logical, electrical, process, and other changes. The examples represent only possible variations. Unless expressly required, individual components and features are optional and the order of operations may vary. Portions and features of some embodiments may be included in or substituted for those of other embodiments. Furthermore, the words used in this application are used only to describe the embodiments and not to limit the claims. As used in the description of the embodiments and the claims, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. . Similarly, the term "and/or" as used in this application refers to an or any and all possible combinations of one or more of the associated listed. In addition, when used in this application, the term "comprise" and its variations "comprises" and/or "comprising" etc. refer to stated features, integers, steps, operations, elements, and/or The presence of a component does not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or groupings of these. Without further limitation, an element defined by the statement "comprises a..." does not exclude the presence of additional identical elements in a process, method or apparatus including the stated element. In this article, each embodiment may focus on its differences from other embodiments, and the same and similar parts among various embodiments may be referred to each other. For the methods, products, etc. disclosed in the embodiments, if they correspond to the method part disclosed in the embodiment, then the relevant parts can be referred to the description of the method part.
本领域技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,可以取决于技术方案的特定应用和设计约束条件。所述技术人员可以对每个特定的应用来使用不同方法以实现所描述的功能,但是这种实现不应认为超出本公开实施例的范围。所述技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art will appreciate that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented with electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software may depend on the specific application and design constraints of the technical solution. The skilled person may use different methods to implement the described functionality for each specific application, but such implementations should not be considered to be beyond the scope of the disclosed embodiments. The skilled person can clearly understand that for the convenience and simplicity of description, the specific working processes of the systems, devices and units described above can be referred to the corresponding processes in the foregoing method embodiments, and will not be described again here.
本文所披露的实施例中,所揭露的方法、产品(包括但不限于装置、设备等),可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,可以仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另外,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例。另外,在本公开实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In the embodiments disclosed herein, the disclosed methods and products (including but not limited to devices, equipment, etc.) can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units may only be a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined. Either it can be integrated into another system, or some features can be ignored, or not implemented. In addition, the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms. The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or they may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to implement this embodiment. In addition, each functional unit in the embodiment of the present disclosure may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
附图中的流程图和框图显示了根据本公开实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它 们有时也可以按相反的顺序执行,这可以依所涉及的功能而定。在附图中的流程图和框图所对应的描述中,不同的方框所对应的操作或步骤也可以以不同于描述中所披露的顺序发生,有时不同的操作或步骤之间不存在特定的顺序。例如,两个连续的操作或步骤实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这可以依所涉及的功能而定。框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。 The flowcharts and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more components for implementing the specified logical function(s). Executable instructions. In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks can actually execute essentially in parallel, it They can sometimes be executed in reverse order, depending on the functionality involved. In the descriptions corresponding to the flowcharts and block diagrams in the accompanying drawings, operations or steps corresponding to different blocks may also occur in a sequence different from that disclosed in the description, and sometimes there is no specific distinction between different operations or steps. order. For example, two consecutive operations or steps may actually be performed substantially in parallel, or they may sometimes be performed in reverse order, depending on the functionality involved. Each block in the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, may be implemented by special purpose hardware-based systems that perform the specified functions or actions, or may be implemented using special purpose hardware implemented in combination with computer instructions.

Claims (12)

  1. 一种用于确定上行信道的方法,应用于终端设备STA,其特征在于,所述方法包括:A method for determining an uplink channel, applied to terminal equipment STA, characterized in that the method includes:
    接收触发指令;所述触发指令用于触发STA感知信道状态;Receive a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel status;
    若存在空闲信道,则根据空闲信道的性能信息确定上行信道,接入所述上行信道进行数据传输;If an idle channel exists, determine the uplink channel based on the performance information of the idle channel, and access the uplink channel for data transmission;
    若不存在空闲信道,则在设定的退避时长后再次感知信道状态。If there is no idle channel, the channel status will be sensed again after the set backoff time.
  2. 根据权利要求1所述的方法,其特征在于,所述接入所述上行信道进行数据传输,包括:The method according to claim 1, characterized in that said accessing the uplink channel for data transmission includes:
    在数据传输成功后,根据传输结果更新对应信道的性能信息,并接收新的数据包到达指示;After the data transmission is successful, the performance information of the corresponding channel is updated according to the transmission result, and the new data packet arrival indication is received;
    在数据传输失败后,根据传输结果更新对应信道的性能信息,并重新执行数据传输指令。After the data transmission fails, the performance information of the corresponding channel is updated according to the transmission result, and the data transmission instruction is re-executed.
  3. 根据权利要求1或2所述的方法,其特征在于,所述根据空闲信道的性能信息确定上行信道,包括:The method according to claim 1 or 2, characterized in that determining the uplink channel according to the performance information of the idle channel includes:
    获取空闲信道的性能信息;Obtain performance information of idle channels;
    将成功传输数据包的能力最优的空闲信道,确定为上行信道。The idle channel with the best ability to successfully transmit data packets is determined as the uplink channel.
  4. 根据权利要求3所述的方法,其特征在于,所述上行信道的确定,包括:The method according to claim 3, characterized in that the determination of the uplink channel includes:
    构建基于强化学习的上行信道选择模型;Construct an uplink channel selection model based on reinforcement learning;
    将信道状态信息、性能信息输入到基于强化学习的上行信道选择模型中训练,并获得网络平均吞吐量;Input channel status information and performance information into the uplink channel selection model based on reinforcement learning for training, and obtain the average network throughput;
    在网络平均吞吐量达到最大值的情况下,根据所述基于强化学习的上行信道选择模型的输出确定上行信道。When the average network throughput reaches the maximum value, the uplink channel is determined according to the output of the reinforcement learning-based uplink channel selection model.
  5. 根据权利要求4所述的方法,其特征在于,基于强化学习的上行信道选择模型的训练,包括:The method according to claim 4, characterized in that the training of the uplink channel selection model based on reinforcement learning includes:
    将信道状态信息、性能信息作为强化学习的状态集 Use channel state information and performance information as state sets for reinforcement learning
    其中,表示STA在信道上的信道感知权重的集合;表示t时刻第k个STA在第m个信道上的感知权重;in, Represents the set of channel sensing weights of STA on the channel; Represents the sensing weight of the k-th STA on the m-th channel at time t;
    表示STA在信道上的数据包传输权重的集合;表示t时刻第k个STA在第m个信道上的数据包传输权重感知权重; Represents the set of data packet transmission weights of STA on the channel; Indicates the perceived weight of the data packet transmission weight of the k-th STA on the m-th channel at time t;
    将所述状态集S输入基于强化学习的上行信道选择模型中训练,得到动作集A={f1, f2,…, fM},表示STA采取对应于在M个空闲信道中选择上行信道的动作的集合;The state set S is input into the uplink channel selection model based on reinforcement learning for training, and the action set A={f 1 , f 2 ,..., f M } is obtained, indicating that the STA takes steps corresponding to selecting the uplink channel among M idle channels. a collection of actions;
    根据所述状态集确定奖励参数表示第k个STA在第m个信道传输的即时奖励;Determine reward parameters based on the state set Represents the immediate reward of the k-th STA transmitting on the m-th channel;
    根据所述奖励参数Rt训练所述基于强化学习的上行信道选择模型,以得到使奖励参数Rt最大的系统动作对应的信道,作为所述上行信道。The uplink channel selection model based on reinforcement learning is trained according to the reward parameter R t to obtain the channel corresponding to the system action that maximizes the reward parameter R t as the uplink channel.
  6. 根据权利要求5所述的方法,其特征在于,所述基于强化学习的上行信道选择模型的建立,包括:
    The method according to claim 5, characterized in that the establishment of the uplink channel selection model based on reinforcement learning includes:
    其中,Ct表示t时刻的网络平均吞吐量;N表示STA的总数;表示t时刻第k个STA的信干噪比。Among them, C t represents the average network throughput at time t; N represents the total number of STAs; Indicates the signal-to-interference-noise ratio of the kth STA at time t.
  7. 根据权利要求5所述的方法,其特征在于,所述根据所述奖励参数Rt训练所述基于强化学习的上行信道选择模型,包括:The method of claim 5, wherein training the reinforcement learning-based uplink channel selection model according to the reward parameter R t includes:
    将如下方式,作为强化学习的更新规则:
    Use the following method as the update rule for reinforcement learning:
    其中,Qt表示当前状态的Q值,Qt+1表示下一状态时刻的Q值;α表示的强化学习的学习率,取值为(0,1);β表示对历史奖励的重视程度,取值为(0,1);表示即时奖励;maxQt(S′,A′)表示下一时刻所有可能动作策略的最大Q值。Among them, Q t represents the Q value of the current state, Q t+1 represents the Q value of the next state moment; α represents the learning rate of reinforcement learning, with a value of (0,1); β represents the emphasis on historical rewards. , the value is (0,1); represents the immediate reward; maxQ t (S′,A′) represents the maximum Q value of all possible action strategies at the next moment.
  8. 一种用于确定上行信道的方法,应用于接入点AP,其特征在于,包括:A method for determining an uplink channel, applied to an access point AP, is characterized by including:
    发送触发指令;所述触发指令用于触发STA感知信道状态;Send a trigger instruction; the trigger instruction is used to trigger the STA to sense the channel status;
    接收STA通过上行信道传输的数据;所述上行信道是所述STA根据空闲信道的性能信息确定的。Receive data transmitted by the STA through the uplink channel; the uplink channel is determined by the STA based on the performance information of the idle channel.
  9. 一种终端设备,其特征在于,包括处理器和存储器,所述存储器用于存储计算机程序,所述处理器用于调用并运行所述存储器中存储的程序,执行如权利要求1至7所述的用于确定上行信道的方法。A terminal device, characterized in that it includes a processor and a memory, the memory is used to store a computer program, the processor is used to call and run the program stored in the memory, and execute the steps as described in claims 1 to 7 Method used to determine the upstream channel.
  10. 一种网络设备,其特征在于,包括处理器和通信接口,通信接口用于与其它网络设备进行通信;处理器用于运行一组程序,以使得所述网络设备实现如权利要求8所述的用于确定上行信道的方法。 A network device, characterized in that it includes a processor and a communication interface, the communication interface is used to communicate with other network devices; the processor is used to run a set of programs, so that the network device implements the use as claimed in claim 8 method for determining the uplink channel.
  11. 一种计算机程序,当所述计算机程序被计算机执行时,使所述计算机实现如权利要求1至7所述的用于确定上行信道的方法。A computer program, when the computer program is executed by a computer, causes the computer to implement the method for determining an uplink channel as claimed in claims 1 to 7.
  12. 一种计算机程序产品,所述计算机程序产品包括存储在计算机可读存储介质上的计算机指令,当所述程序指令被计算机执行时,使所述计算机实现如权利要求1至7所述的用于确定上行信道的方法。 A computer program product. The computer program product includes computer instructions stored on a computer-readable storage medium. When the program instructions are executed by a computer, the computer implements the methods described in claims 1 to 7. Method to determine the upstream channel.
PCT/CN2023/107293 2022-09-16 2023-07-13 Method for determining uplink channel, and terminal and network device WO2024055739A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211128492.3A CN117768075A (en) 2022-09-16 2022-09-16 Method, terminal and network equipment for determining uplink channel
CN202211128492.3 2022-09-16

Publications (1)

Publication Number Publication Date
WO2024055739A1 true WO2024055739A1 (en) 2024-03-21

Family

ID=90274172

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/107293 WO2024055739A1 (en) 2022-09-16 2023-07-13 Method for determining uplink channel, and terminal and network device

Country Status (2)

Country Link
CN (1) CN117768075A (en)
WO (1) WO2024055739A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101257714A (en) * 2008-04-08 2008-09-03 浙江大学 Across layer self-adapting paralleling channel allocating method of cognized radio system
WO2017036258A1 (en) * 2015-09-02 2017-03-09 华为技术有限公司 Contention access method, contention access device, base station and contention access system
CN111342920A (en) * 2020-01-10 2020-06-26 重庆邮电大学 Channel selection method based on Q learning
WO2020172022A1 (en) * 2019-02-21 2020-08-27 Google Llc User-equipment-coordination set for a wireless network using an unlicensed frequency band
WO2021155763A1 (en) * 2020-02-05 2021-08-12 维沃移动通信有限公司 Data transmission method and apparatus for unlicensed band, and communication device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101257714A (en) * 2008-04-08 2008-09-03 浙江大学 Across layer self-adapting paralleling channel allocating method of cognized radio system
WO2017036258A1 (en) * 2015-09-02 2017-03-09 华为技术有限公司 Contention access method, contention access device, base station and contention access system
WO2020172022A1 (en) * 2019-02-21 2020-08-27 Google Llc User-equipment-coordination set for a wireless network using an unlicensed frequency band
CN111342920A (en) * 2020-01-10 2020-06-26 重庆邮电大学 Channel selection method based on Q learning
WO2021155763A1 (en) * 2020-02-05 2021-08-12 维沃移动通信有限公司 Data transmission method and apparatus for unlicensed band, and communication device

Also Published As

Publication number Publication date
CN117768075A (en) 2024-03-26

Similar Documents

Publication Publication Date Title
JP7499413B2 (en) Resource selection method, device and terminal device
CN107113782B (en) System and method for avoiding interference in digital communications
EP3817492B1 (en) Apparatus and methods for tb ppdu alignment for multi-link triggered uplink access in a wireless network
KR102604211B1 (en) Wireless LAN base station device, and wireless LAN terminal device
EP3662710B1 (en) Multi-tid a-mpdu transmission
CN110139353A (en) A kind of method and relevant apparatus of multi-access point AP coordinating transmissions
CN106471750B (en) Full duplex wireless communication on devices with limited echo cancellation capability
CN110113808A (en) System and method for power control
CN107211323A (en) System and method for transmitting data in WLAN multi-user transmission chance
CN107736049A (en) The method and apparatus of transmitting uplink data
CN103052077B (en) A kind of WLAN (wireless local area network) OBSS website sky point interference avoidance method
WO2007061014A1 (en) Wireless communication method and wireless communication apparatus
CN109218361A (en) The method and base station that a kind of pair of internet-of-things terminal is scheduled
CN109314965A (en) Method and apparatus for MU resource request
JP7176050B2 (en) Wireless communication method using TXOP and wireless communication terminal using the same
CN108476476A (en) Power indicates system and method
CN105515734B (en) Method and apparatus for the mixed automatic repeat request for promoting uplink to transmit
CN107409431A (en) A kind of data transfer control method and access point, website
KR20230144638A (en) Sharing RTA traffic and EDCA TXOP
CN105144781A (en) System and method for speed frame exchange
WO2024055739A1 (en) Method for determining uplink channel, and terminal and network device
WO2022121726A1 (en) Resource allocation method and apparatus
KR20220104644A (en) Method and apparatus for low latency communicaiton in communication system supporting multi-link
US20170280424A1 (en) Resource allocation signaling in uplink frames in wireless networks
CN107041006A (en) The method and apparatus for transmitting data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23864475

Country of ref document: EP

Kind code of ref document: A1