WO2023231165A1 - 一种基于Stackelberg博弈的多频段群智频谱感知方法 - Google Patents

一种基于Stackelberg博弈的多频段群智频谱感知方法 Download PDF

Info

Publication number
WO2023231165A1
WO2023231165A1 PCT/CN2022/107291 CN2022107291W WO2023231165A1 WO 2023231165 A1 WO2023231165 A1 WO 2023231165A1 CN 2022107291 W CN2022107291 W CN 2022107291W WO 2023231165 A1 WO2023231165 A1 WO 2023231165A1
Authority
WO
WIPO (PCT)
Prior art keywords
sensing
users
user
demand
reward
Prior art date
Application number
PCT/CN2022/107291
Other languages
English (en)
French (fr)
Inventor
朱琦
郭晓敏
Original Assignee
南京邮电大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京邮电大学 filed Critical 南京邮电大学
Publication of WO2023231165A1 publication Critical patent/WO2023231165A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B17/00Monitoring; Testing
    • H04B17/30Monitoring; Testing of propagation channels
    • H04B17/382Monitoring; Testing of propagation channels for resource allocation, admission control or handover
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the invention belongs to the field of communication technology, and specifically relates to a multi-frequency group intelligence spectrum sensing method based on Stackelberg game.
  • Cognitive Radio technology can greatly improve spectrum utilization through spectrum sharing.
  • Spectrum sensing is an important part of cognitive radio technology.
  • Spectrum holes refer to idle frequency bands that are legally used but not occupied by authorized users (AU).
  • Cognitive radio can allow secondary users (SU) to access the spectrum holes.
  • AU authorized users
  • SU secondary users
  • the process of detecting and confirming the existence of authorized users through spectrum sensing technology is particularly important.
  • Secondary user spectrum sensing usually uses the energy detection method, which does not require prior knowledge of the authorized user. It determines whether the authorized user exists by calculating whether the accumulated energy in the frequency band exceeds the threshold value.
  • the perception results of a single secondary user on the licensed frequency band are often unreliable, and the collaborative spectrum sensing performance of multiple secondary users is better than that of a single secondary user. User perception performance, therefore multiple secondary users are usually used for collaborative spectrum sensing.
  • Literature [10] proposes a cooperative spectrum sensing algorithm based on SU classification, which introduces an incentive mechanism to encourage more SUs to actively participate in detection.
  • the algorithm divides secondary users into ordinary secondary users (OSU) and relay secondary users according to channel conditions.
  • User (RSU) first each SU decides whether to participate by calculating the utility function, then the OSU sends the detected data to the nearby RSU, and then the RSU sends the received data together with the local detection data to the fusion center.
  • the above-mentioned research only focuses on spectrum sensing in a single frequency band. In actual systems, multiple frequency bands often need to be occupied, and this research does not consider the optimization of secondary user perception costs, resulting in high sensing costs.
  • the present invention introduces crowd intelligence sensing technology into spectrum sensing, considers multi-band scenarios, and proposes a multi-band group intelligence spectrum sensing method based on Stackelberg game to reduce the sensing cost of cooperative sensing secondary users by optimizing sensing time.
  • the purpose of this invention is to overcome the shortcomings of sensing a single frequency band in the existing technology and provide a multi-band crowd intelligence spectrum sensing method based on Stackelberg game.
  • the sensing demand is The secondary user recruits appropriate collaborative sensing secondary users to complete the sensing task, improves the enthusiasm for collaborative sensing, optimizes the sensing time of the cooperative sensing secondary user during the game process, saves sensing costs, and improves sensing performance.
  • the present invention adopts the following technical solutions.
  • Secondary users the set of secondary users with perceived needs is expressed as The set of collaboration-aware secondary users is expressed as
  • the method includes the following steps:
  • Step 1 Construct a subordinate layer optimization problem and deduce that there is a Nash equilibrium solution for the cooperative sensing secondary user game: comprehensively consider the detection probability and sensing time to define the utility function of the cooperative sensing secondary user to maximize the utility of the cooperative sensing secondary user;
  • Step 2 Construct a leadership optimization problem: comprehensively considering the detection probability and task reward after voting and fusion, the utility function of the perceived demand sub-user is defined to maximize the utility of the perceived demand sub-user;
  • Step 3 Construct the problem of payment of rewards from the sensing demand secondary users to the collaborative sensing secondary users into a multi-band crowd intelligence spectrum sensing system model based on Stackelberg game.
  • the sensing demand secondary users are the leadership layer
  • the collaborative sensing secondary users are the subordinate layers.
  • each cooperative sensing secondary user can sense all frequency bands, but can only sense one frequency band at the same time;
  • Step 4 The sensing demand sub-user releases tasks and initial rewards to the collaborative sensing sub-user for the frequency band to be sensed, and initializes the maximum utility of the sensing demand sub-user. is 0;
  • Step 5 All collaborative sensing secondary users calculate their utility under the current reward based on the reward and sensing time, and select the sensing time corresponding to the maximum utility by optimizing the sensing time.
  • the collaborative sensing secondary user calculates the corresponding detection probability and cost based on the sensing time. And transmit the data pairs (sensing time, detection probability, cost-based quotation) to the sensing demand secondary users;
  • Step 6 The demand-sensing secondary user sends the recruitment intention and the price of payment to the collaboration-sensing secondary user with high detection probability according to its remuneration;
  • Step 7 If the collaborative sensing sub-user is recruited by multiple sensing demand sub-users at the same time, by comparing the price options provided by multiple sensing demand sub-users, the sensing task with higher reward can be added;
  • Step 8 Perceived demand sub-users calculate utility under current remuneration. If the utility value is higher than Then record the reward, increase the reward value with a step size ⁇ under the limit of the maximum reward B max , release a new reward and repeat Steps 5-Step 8 until the error of the utility value of the user in two consecutive times of perceived demand is less than ⁇ ;
  • Step 9 Use the reward corresponding to the optimal utility of the sensing demand sub-user obtained in Step 8 as the final reward.
  • the collaborative sensing secondary user determines the final sensing time based on the reward and uploads the sensing data to the sensing demand secondary user to obtain the final judgment. result.
  • T j represents the set of collaborative secondary users participating in frequency band sensing task j.
  • the cost c ij of collaborative sensing secondary users is:
  • ⁇ and ⁇ represent weighting coefficients
  • t ij represents the sensing time of cooperative sensing secondary user i in sensing band task j
  • d ij represents the distance between cooperative sensing secondary user i and sensing demand secondary user j.
  • p ij represents the reward obtained by the collaborative sensing secondary user i
  • c ij represents the cost consumed by the collaborative sensing secondary user i to complete the frequency band sensing task j
  • B j represents the reward released by sensing demand secondary user j
  • ⁇ and ⁇ represent weighting coefficients
  • t ij represents the sensing time of cooperative sensing secondary user i sensing band task j
  • d ij represents the distance between the collaboration sensing sub-user i and the sensing demand sub-user j.
  • the detection probability formula of cooperative sensing secondary user i sensing frequency band j is:
  • p f represents the false alarm probability of secondary user i
  • ⁇ ij represents the signal-to-noise ratio of secondary user i sensing frequency band j
  • t ij represents the sensing time of secondary user i sensing frequency band j
  • f s represents the sampling frequency, usually a fixed value
  • the Q function is a complementary cumulative distribution function
  • represents the weighting coefficient
  • B j represents the reward released by sensing demand sub-user j.
  • the detection probability of sensing task j after the sensing demand secondary users are fused through voting is expressed as:
  • the sampling frequency of the cooperative sensing secondary user is 10kHz
  • the false alarm probability is 0.1
  • Wireless signal transmission considers large-scale fading, and its fading coefficient is 4,
  • the decision threshold value of the voting fusion criterion is N/2.
  • the present invention has the following advantages and beneficial effects:
  • the method of the present invention models the demand-sensing sub-users and the collaboration-sensing sub-users as the leadership and subordinate layers of the Stackelberg game respectively.
  • the optimal strategies of the demand-sensing sub-users and the collaboration-sensing sub-users are obtained, and the leadership game is played
  • Optimizing the reward in the medium optimizes the utility of the sensing demand sub-users
  • optimizing the sensing time in the subordinate layer game optimizes the utility of the collaborative sensing sub-users.
  • the present invention combines spectrum sensing with crowd intelligence sensing, considers multiple sensing needs of secondary users working in different frequency bands, and recruits cooperative sensing secondary users to complete tasks to obtain the usage of different frequency bands.
  • one cooperative sensing secondary user Only one frequency band can be sensed at the same time.
  • the cooperative sensing secondary user sends the sensing result to the sensing demand secondary user.
  • the sensing demand secondary user integrates the results of multiple cooperative sensing sub-users to obtain more accurate sensing results.
  • the present invention considers that multiple secondary users with sensing needs working in different frequency bands need to sense different frequency bands. Secondary users with sensing needs issue frequency band sensing tasks and recruit cooperative sensing secondary users to obtain the usage of the frequency band.
  • the cooperative sensing sub-users recruited by each sensing demand sub-user are not determined in advance, but change with the game process according to the detection probability, sensing time and quotation of the cooperative sensing sub-users.
  • the utility of the sensing demand secondary user in the present invention is defined as comprehensive consideration of detection probability and reward.
  • the utility of the collaborative sensing secondary user is defined as reward minus cost.
  • the reward is related to the detection probability, the cost is related to the sensing time, and the cooperative sensing secondary user and sensing demand. related to the distance between secondary users.
  • the present invention considers reverse selection when selecting a collaborative sensing secondary user.
  • a collaborative sensing secondary user is only sent a recruitment intention by one sensing needs secondary user, the collaborative sensing secondary user will complete the sensing task.
  • a collaboration sensing sub-user is sent recruitment intentions by multiple sensing demand sub-users at the same time, the sub-user can join the task with the highest reward by comparing the remuneration price options given by multiple sensing demand sub-users.
  • Figure 1 is a method flow chart of an embodiment of the present invention.
  • Figure 2 is a schematic diagram of a Stackelberg game system model according to an embodiment of the present invention.
  • the present invention is a multi-band crowd intelligence spectrum sensing method based on Stackelberg game.
  • This method models the problem of sensing demand secondary users paying rewards to collaborative sensing secondary users as a Stackelberg game system model, where the sensing demand secondary users are in the game model.
  • the leadership layer, collaborative sensing sub-user is the subordinate layer in the game model.
  • Sensing demand sub-users publish frequency band sensing tasks and initial rewards.
  • Each collaborative sensing sub-user optimizes its own utility by optimizing sensing time and sends sensing data to sensing demand sub-users. Sensing demand sub-users continuously update their rewards to make them more efficient. The utility is optimized and the final judgment result is obtained.
  • this method comprehensively considers the detection probability and reward to define the utility of the sensing demand secondary user, and optimizes the reward through the game to obtain the best utility.
  • this method comprehensively considers the detection probability and sensing time to define The utility of the secondary users is collaboratively sensed, and the sensing time is optimized according to the rewards released by the sensing needs to obtain the best utility. The derivation proves that there is a Nash equilibrium in the optimization of the sensing time.
  • FIG. 2 is a schematic diagram of a Stackelberg game system model according to an embodiment of the present invention.
  • the system scene is a circular area with N collaborative sensing sub-users and M sensing demand sub-users randomly distributed.
  • the sampling frequency of the cooperative sensing secondary user is 10kHz
  • the false alarm probability is 0.1
  • wireless signal transmission considers large-scale fading
  • the fading coefficient is 4,
  • the decision threshold value of the voting fusion criterion is N/2.
  • the sensing demand sub-users will pay rewards to the sub-users who provide sensing results.
  • the secondary users existing in the system model of the present invention are divided into two parts.
  • the first part of the secondary users work on different frequency bands respectively. If they want to use the authorized frequency band without affecting the authorized users, they first need to issue sensing tasks, and then recruit other idle secondary users for collaborative spectrum sensing to obtain the usage of the frequency band.
  • the set of these secondary users with needs is called the sensing demand secondary user set, expressed as The other part is the idle secondary users.
  • After receiving the tasks issued by the sensing demand secondary users they sense and upload the sensing results through the smart devices they carry.
  • the set of these idle secondary users is called the collaborative sensing secondary user set.
  • a multi-band crowd intelligence spectrum sensing method based on Stackelberg game of the present invention includes the following steps:
  • Step1 Construct a subordinate layer optimization problem, and deduce that there is a Nash equilibrium solution for the collaborative sensing secondary user game: comprehensively considering the detection probability and sensing time, the utility function of the cooperative sensing secondary user is defined, that is, the optimization problem of the subordinate layer is to make the cooperative sensing secondary user Maximize utility.
  • T j represents the set of collaborative secondary users participating in frequency band sensing task j.
  • Collaboration-aware secondary users Completing the sensing task requires a cost, including the cost of sensing frequency band consumption and the cost of uploading sensing data.
  • the cost of sensing frequency band consumption is related to the sensing time t ij .
  • the cost of uploading sensing data is related to the cost of collaborative sensing secondary users. and perceived needs sub-users It is related to the distance between them, so the cost c ij of collaborative sensing secondary users is defined as follows:
  • ⁇ and ⁇ represent weighting coefficients
  • t ij represents the sensing time of cooperative sensing secondary user i in sensing band task j
  • d ij represents the distance between cooperative sensing secondary user i and sensing demand secondary user j.
  • p ij represents the collaboration-aware secondary user
  • the reward obtained, c ij represents the collaboration-aware secondary user
  • the cost of completing the frequency band sensing task Represents the detection probability of cooperative sensing secondary user i sensing band task j
  • B j represents the reward released by sensing demand secondary user j
  • ⁇ and ⁇ represent weighting coefficients
  • t ij represents the sensing time of cooperative sensing secondary user i sensing band task j
  • d ij represents the distance between the collaboration sensing sub-user i and the sensing demand sub-user j.
  • the secondary user senses whether the spectrum of the authorized user is in use through the energy detection method.
  • the detection probability formula of cooperative sensing secondary user i sensing frequency band j is expressed as:
  • p f represents the false alarm probability of secondary user i
  • ⁇ ij represents the signal-to-noise ratio of secondary user i sensing frequency band j
  • t ij represents the sensing time of secondary user i sensing frequency band j
  • f s represents the sampling frequency, usually a fixed value
  • the Q function is a complementary cumulative distribution function
  • the signal-to-noise ratio ⁇ ij are all positive values, so the second part of K' Less than 0, third part is less than 0, and because K ⁇ 0, the first part of K' is less than 0, so K' is less than 0, and because U ij is about the first part of the second-order partial derivative of t ij is greater than 0, so it can be seen that the second-order partial derivative of U ij with respect to t ij Right now
  • the utility function U ij is a strictly convex function about t ij , and there is a unique optimal solution.
  • the first-order partial derivative of U ij with respect to t ij has a positive value.
  • Step 2 Construct the leadership optimization problem: comprehensively considering the detection probability and task reward after voting and fusion, the utility function of the perceived demand sub-user is defined, that is, the leadership optimization problem is to maximize the utility of the perceived demand sub-user.
  • represents the weighting coefficient
  • B j represents the reward released by sensing demand sub-user j.
  • Each sensing demand secondary user uses the voting fusion criterion to process the sensing results submitted by multiple collaborative sensing secondary users. After voting fusion, the detection probability of sensing task j is expressed as:
  • Step3 Construct the problem of payment of rewards from the sensing demand secondary users to the collaborative sensing secondary users into a multi-band crowd intelligence spectrum sensing system model based on Stackelberg game.
  • the sensing demand secondary users are the leadership layer and the collaborative sensing secondary users are the subordinate layers.
  • each cooperative sensing secondary user can sense all frequency bands, but can only sense one frequency band at the same time;
  • Step 4 The sensing demand secondary user issues tasks and initial rewards to the collaborative sensing secondary user for the frequency band to be sensed, and initializes the maximum utility of the sensing demand secondary user. is 0;
  • Step5 All cooperative sensing secondary users calculate their utility under the current reward based on the reward and sensing time, and select the sensing time corresponding to the maximum utility by optimizing the sensing time.
  • the cooperative sensing secondary user calculates the corresponding detection probability and cost based on the sensing time. And transmit the data (sensing time, detection probability, cost-based quotation) to the sensing demand secondary users;
  • Step6 The demand-sensing secondary user sends the recruitment intention and the price of payment to the collaboration-sensing secondary user with high detection probability according to its remuneration;
  • Step7 If the collaborative sensing sub-user is recruited by multiple sensing demand sub-users at the same time, by comparing the price options provided by multiple sensing demand sub-users, the sensing task with the highest reward can be added;
  • Step8 The perceived demand sub-user calculates the utility under the current remuneration. If the utility value is higher than Then record the reward, increase the reward value with a step size ⁇ under the limit of the maximum reward B max , release a new reward and repeat Steps 5-Step 8 until the error of the utility value of the user in two consecutive times of perceived demand is less than ⁇ ;
  • Step9 Use the reward corresponding to the optimal utility of the sensing demand secondary user obtained in Step 8 as the final reward.
  • the collaborative sensing secondary user determines the final sensing time based on the reward and uploads the sensing data to the sensing demand secondary user to obtain the final judgment. result.
  • the present invention proposes a multi-band crowd intelligence spectrum sensing method based on Stackelberg game for spectrum sensing scenarios, combined with crowd intelligence sensing technology.
  • This method models the problem of payment of rewards from sensing demand sub-users to collaboration sensing sub-users as a Stackelberg game model, where sensing demand sub-users are the leadership in the game model and collaboration sensing sub-users are the subordinate layers in the game model.
  • the utility of sensing demand sub-users is defined by comprehensive consideration of detection probability and reward, and the reward is optimized through the game to obtain the best utility
  • the utility of collaborative sensing secondary users is defined by comprehensive consideration of detection probability and sensing time.
  • Utility according to the perceived demand, the rewards released by users are optimized to obtain the best utility by optimizing the sensing time, and the derivation proves that there is a Nash equilibrium in the optimization of sensing time.

Landscapes

  • Physics & Mathematics (AREA)
  • Electromagnetism (AREA)
  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明公开了一种基于Stackelberg博弈的多频段群智频谱感知方法,将感知需求次用户向协作感知次用户支付报酬问题建模为博弈模型,其中前者是领导层,后者是从属层。感知需求次用户发布频段感知任务与初始报酬,各协作感知次用户通过优化感知时间使自身效用最优并将感知数据发送给感知需求次用户,感知需求次用户不断更新报酬使效用最优并得到最终判决结果。本发明在领导层博弈中综合考虑检测概率和报酬定义了感知需求次用户效用,通过博弈优化报酬获得最佳效用,从属层博弈中综合考虑检测概率和感知时间定义了协作感知次用户效用,根据感知需求次用户发布报酬优化感知时间以获得最佳效用,推导证明了感知时间的优化存在纳什均衡。

Description

一种基于Stackelberg博弈的多频段群智频谱感知方法 技术领域
本发明属于通信技术领域,具体涉及一种基于Stackelberg博弈的多频段群智频谱感知方法。
背景技术
随着智能终端设备的急剧增加,频谱资源日益紧缺,认知无线电技术(Cognitive Radio,CR)可以通过频谱共享大大提高频谱利用率,频谱感知是认知无线电技术的重要环节。频谱空洞是指授权用户(Authorized User,AU)合法使用但未被占用的空闲频段,认知无线电可以将次用户(Second User,SU)机会接入到频谱空洞,但要想实现空闲频谱资源的接入,通过频谱感知技术检测确认授权用户是否存在这一过程尤为重要。
次用户频谱感知通常采用能量检测法,不需要知道授权用户的先验知识,通过计算频段积累的能量是否超过门限值来判断授权用户是否存在。然而面对无线环境中阴影效应、多径衰落、噪声不确定等不利因素的影响,单个次用户对授权频段的感知结果往往是不可靠的,而多个次用户协作频谱感知性能优于单个次用户的感知性能,因此通常采用多个次用户协作频谱感知。关于多用户协作频谱感知的许多研究都是默认次用户无偿感知授权用户是否存在,这在生活中是不现实的,因为次用户在感知过程中会消耗时间、能量、内存等计算资源,所以多用户感知结果虽然更准确,但次用户可能不愿意无偿参与频谱感知过程。因此将激励机制引入频谱感知可以有效解决这一问题,激励机制通过支付报酬的方式补偿次用户的成本,鼓励他们积极参与协作频谱感知。
文献[10]提出了一种基于SU分类的协作频谱感知算法,引入激励机制来鼓励更多的SU积极参与检测,该算法根据信道条件将次用户分为普通次用户(OSU)和中继次用户(RSU),首先每个SU通过计算效用函数决定是否参与,然后OSU将检测到的数据发送给附近的RSU,接着RSU将收到的数据与本地检测数据一同传送给融合中心。
上述研究只针对单个频段进行频谱感知,在实际系统中,往往需要占用多个频段,并且该研究没有考虑次用户感知成本优化问题,造成感知代价较大。本发明将群智感知技术引入到频谱感知中,考虑多频段的场景,提出了一种基于Stackelberg博弈的多频段群智频谱感知方法,通过优化感知时间减少协作感知次用户的感知成本。
[10]LI Peijun,HAN Bo,LI Heng,et al.The research of spectrum sensing based on SU classification in cognitive LTE-A network[C]//2019 IEEE 3rd Information Technology,Networking,Electronic and Automation Control Conference,Chengdu,China.IEEE,2019:1917-1921.
发明内容
本发明的目的在于克服现有技术感知单个频段的缺陷,提供一种基于Stackelberg博弈的多频段群智频谱感知方法,通过领导层感知需求次用户和从属层协作感知次用户各自博弈,为感知需求次用户招募到合适的协作感知次用户完成感知任务,提高协作感知积极性,在博弈过程中优化协作感知次用户感知时间节约感知成本,提高感知性能。
为解决上述技术问题,本发明采用以下技术方案。
一种基于Stackelberg博弈的多频段群智频谱感知方法,将感知需求次用户向协作感知次用户支付报酬的问题建模为Stackelberg博弈系统模型,其中感知需求次用户是博弈模型中的领导层,协作感知次用户是博弈模型中的从属层;所述系统的场景是一个圆形区域,随机分布有N个协作感知次用户和M个感知需求次用户,取M=2,即存在两个感知需求次用户;所述的感知需求次用户的集合,表示为
Figure PCTCN2022107291-appb-000001
所述的协作感知次用户的集合表示为
Figure PCTCN2022107291-appb-000002
所述方法包括以下步骤:
Step1、构建从属层优化问题,并推导协作感知次用户博弈存在纳什均衡解:综合考虑检测概率、感知时间定义协作感知次用户的效用函数,使协作感知次用户的效用最大化;
Step2、构建领导层优化问题:综合考虑经表决融合后的检测概率和任务报酬定义了感知需求次用户的效用函数,使感知需求次用户的效用最大化;
Step3、将感知需求次用户向协作感知次用户支付报酬的问题构建成基于Stackelberg博弈的多频段群智频谱感知系统模型,在博弈模型中感知需求次用户是领导层,协作感知次用户是从属层,每个协作感知次用户可以感知所有频段,但同时只能感知一个频段;
Step4、感知需求次用户对于要感知的频段向协作感知次用户发布任务及初始报酬,初始化感知需求次用户的最大效用
Figure PCTCN2022107291-appb-000003
为0;
Step5、所有协作感知次用户根据报酬及感知时间计算其在当前报酬下的效用,通过优化感知时间选择效用最大时对应的感知时间,协作感知次用户根据该感知时间计算对应的检测概率与成本,并将数据对(感知时间,检测概率,基于成本产生的报价)传送给感知需求次用户;
Step6、感知需求次用户根据其报酬向检测概率高的协作感知次用户发送招募意愿及支付报酬的价格;
Step7、若协作感知次用户被多个感知需求次用户同时招募时,通过比较多个感知需求次用户提供的价格选择能够获得报酬多的感知任务加入;
Step8、感知需求次用户计算在当前报酬下的效用,如果该效用值高于
Figure PCTCN2022107291-appb-000004
则将该报酬记录下来,在最大报酬B max的限制下以步长μ增加报酬值,发布新的报酬并重复Step5-Step8,直到相邻两次感知需求次用户的效用值误差小于δ;
Step9、将Step8中得到的感知需求次用户效用最优时对应的报酬作为最终的报酬,协作感知次用户根据该报酬确定最终的感知时间并将感知数据上传给感知需求次用户,得到最终的判决结果。
具体的,协作感知次用户
Figure PCTCN2022107291-appb-000005
的报酬p ij为:
Figure PCTCN2022107291-appb-000006
其中
Figure PCTCN2022107291-appb-000007
表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬,T j表示参与频段感知任务j的协作次用户集合。
具体的,协作感知次用户的成本c ij为:
c ij=β×t ij+γ×d ij                               (2)
其中β和γ表示加权系数,t ij表示协作感知次用户i感知频段任务j的感知时间,d ij表示协作感知次用户i和感知需求次用户j之间的距离。
具体的,协作感知次用户
Figure PCTCN2022107291-appb-000008
的效用为:
Figure PCTCN2022107291-appb-000009
其中p ij表示协作感知次用户i获得的报酬,c ij表示协作感知次用户i完成频段感知任务j消耗的成本,
Figure PCTCN2022107291-appb-000010
表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬,β和γ表示加权系数,t ij表示协作感知次用户i感知频段任务j的感知时间,d ij表示协作感知次用户i和感知需求次用户j之间的距离。
具体的,协作感知次用户i感知频段j的检测概率公式为:
Figure PCTCN2022107291-appb-000011
其中,p f表示次用户i的虚警概率,γ ij表示次用户i感知频段j的信噪比,t ij表示次用户i感知频段j的感知时间,f s表示采样频率,通常是一个定值,Q函数是一个互补的累积分布函数,表达式为:
Figure PCTCN2022107291-appb-000012
具体的,感知需求次用户
Figure PCTCN2022107291-appb-000013
通过招募协作感知次用户
Figure PCTCN2022107291-appb-000014
完成相应频段感知任务得到的效用表示为:
Figure PCTCN2022107291-appb-000015
其中α表示加权系数,
Figure PCTCN2022107291-appb-000016
表示感知需求次用户经过表决融合之后得到频段j的检测概率,
Figure PCTCN2022107291-appb-000017
表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬。
具体的,所述感知需求次用户经过表决融合后感知任务j的检测概率表示为:
Figure PCTCN2022107291-appb-000018
其中
Figure PCTCN2022107291-appb-000019
表示次用户i感知频段j的虚警概率,
Figure PCTCN2022107291-appb-000020
表示次用户i感知频段j的检测概率。
优选的,所述的协作感知次用户的采样频率取10kHz,虚警概率取0.1,加权系数α=8,β=1,γ=0.3,无线信号传输考虑大尺度衰落,其衰落系数取4,表决融合准则的判决门限值取N/2。
与现有技术相比,本发明具有以下优点和有益效果:
1、本发明方法将感知需求次用户与协作感知次用户分别建模为Stackelberg博弈的领导层和从属层,通过博弈得到感知需求次用户和协作感知次用户各自的最优策略,在领导层博弈中优化报酬使感知需求次用户效用最优,在从属层博弈中优化感知时间使协作感知次用户效用最优。
2、本发明将频谱感知与群智感知结合,考虑工作在不同频段的多个感知需求次用户招募协作感知次用户完成任务来获得不同的频段的使用情况,在该场景中一个协作感知次用户同时只能感知一个频段,协作感知次用户将感知结果发送给感知需求次用户,感知需求次用户融合多个协作感知次用户的结果,得到更准确的感知结果。
3、本发明考虑工作在不同频段的多个感知需求次用户需要对不同的频段进行感知,有感知需求次用户发布频段感知任务,分别招募协作感知次用户获得频段的使用情况。每个感知 需求次用户招募到的协作感知次用户不是事先确定好的,而是根据协作感知次用户的检测概率、感知时间以及报价,随着博弈的过程而变化。
4、本发明感知需求次用户的效用定义为综合考虑检测概率以及报酬,协作感知次用户的效用定义为报酬减去成本,报酬与检测概率有关,成本与感知时间以及协作感知次用户与感知需求次用户之间的距离有关。
5、本发明在协作感知次用户选择时考虑反向选择,当一个协作感知次用户仅被一个感知需求次用户发送招募意愿时,该协作感知次用户就完成该感知任务。当一个协作感知次用户被多个感知需求次用户同时发送招募意愿时,该次用户通过比较多个感知需求次用户给出的报酬价格选择可以使自己获得报酬最多的任务加入。
附图说明
图1为本发明的一个实施例的方法流程图。
图2为本发明的一个实施例的Stackelberg博弈系统模型示意图。
具体实施方式
本发明的一种基于Stackelberg博弈的多频段群智频谱感知方法,该方法将感知需求次用户向协作感知次用户支付报酬的问题建模为Stackelberg博弈系统模型,其中感知需求次用户是博弈模型中的领导层,协作感知次用户是博弈模型中的从属层。感知需求次用户发布频段感知任务与初始报酬,每个协作感知次用户通过优化感知时间使得自身的效用最优并将感知数据发送给感知需求次用户,感知需求次用户通过不断的更新报酬使其效用达到最优,并得到最终的判决结果。在领导层博弈中,该方法综合考虑检测概率和报酬定义了感知需求次用户的效用,通过博弈优化报酬以获得最佳效用,在从属层博弈中,该方法综合考虑检测概率和感知时间定义了协作感知次用户的效用,根据感知需求次用户发布的报酬优化感知时间以获得最佳效用,并且推导证明了感知时间的优化存在纳什均衡。
下面结合附图和实施例对本发明做进一步详细说明。
图2为本发明一个实施例的Stackelberg博弈系统模型示意图。如图2所示,系统的场景是一个圆形区域,随机分布着N个协作感知次用户和M个感知需求次用户,本发明取M=2,即存在两个感知需求次用户。在本实施例中,协作感知次用户的采样频率取10kHz,虚警概率取0.1,加权系数α=8,β=1,γ=0.3,无线信号传输考虑大尺度衰落,并且衰落系数取4,表决融合准则的判决门限值取N/2。为了激励协作感知次用户完成感知任务,感知需求次用户会向提供感知结果的次用户支付报酬。
本发明在系统模型中存在的次用户分为两部分,第一部分次用户分别工作在不同的频段上,想要在不影响授权用户情况下使用授权频段的次用户,首先需要发布感知任务,然后招 募其他空闲次用户进行协作频谱感知获得频段的使用情况,这些有需求的次用户组成的集合称为感知需求次用户集合,表示为
Figure PCTCN2022107291-appb-000021
另外一部分是空闲次用户,在接收到感知需求次用户发布的任务以后,他们通过自身携带的智能设备进行感知并上传感知结果,这些空闲次用户组成的集合称为协作感知次用户集合
Figure PCTCN2022107291-appb-000022
如图1所示,本发明的一种基于Stackelberg博弈的多频段群智频谱感知方法,包括以下步骤:
Step1:构建从属层优化问题,并推导协作感知次用户博弈存在纳什均衡解:综合考虑检测概率、感知时间定义了协作感知次用户的效用函数,即从属层的优化问题就是使协作感知次用户的效用最大化。
协作感知次用户
Figure PCTCN2022107291-appb-000023
完成频段感知任务可以从感知需求次用户
Figure PCTCN2022107291-appb-000024
那里得到报酬,并且得到的报酬与其自身的检测概率有关,所以协作感知次用户
Figure PCTCN2022107291-appb-000025
的报酬p ij定义为:
Figure PCTCN2022107291-appb-000026
其中
Figure PCTCN2022107291-appb-000027
表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬,T j表示参与频段感知任务j的协作次用户集合。
协作感知次用户
Figure PCTCN2022107291-appb-000028
完成感知任务需要消耗成本,包括感知频段消耗的成本和上传感知数据消耗的成本,感知频段消耗的成本与感知时间t ij有关,上传感知数据消耗的成本与协作感知次用户
Figure PCTCN2022107291-appb-000029
和感知需求次用户
Figure PCTCN2022107291-appb-000030
之间距离有关,因此协作感知次用户的成本c ij的定义如下:
c ij=β×t ij+γ×d ij                              (2)
其中β和γ表示加权系数,t ij表示协作感知次用户i感知频段任务j的感知时间,d ij表示协作感知次用户i与感知需求次用户j之间的距离。
所以协作感知次用户
Figure PCTCN2022107291-appb-000031
的效用定义为:
Figure PCTCN2022107291-appb-000032
其中p ij表示协作感知次用户
Figure PCTCN2022107291-appb-000033
获得的报酬,c ij表示协作感知次用户
Figure PCTCN2022107291-appb-000034
完成频段感知任务消耗的成本,
Figure PCTCN2022107291-appb-000035
表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬,β和γ表示加权系数,t ij表示协作感知次用户i感知频段任务j的感知时间,d ij表示协作感知次用户i与感知需求次用户j之间的距离。
对于协作感知次用户
Figure PCTCN2022107291-appb-000036
而言,为了获得更多的报酬,需要向感知需求次用户
Figure PCTCN2022107291-appb-000037
提交最佳的检测概率,假设检测概率中只有感知时间是可以由协作感知次用户
Figure PCTCN2022107291-appb-000038
自身决定的,为了使得协作感知次用户
Figure PCTCN2022107291-appb-000039
效用最优,协作感知次用户
Figure PCTCN2022107291-appb-000040
通过博弈可以确定自己最优的感知时间,从而获得最优的检测概率,因此,从属层协作感知次用户
Figure PCTCN2022107291-appb-000041
的优化问题表示为:
Figure PCTCN2022107291-appb-000042
在认知无线电频谱感知中,次用户通过能量检测法来感知授权用户的频谱是否在使用,协作感知次用户i感知频段j的检测概率公式表示为:
Figure PCTCN2022107291-appb-000043
其中,p f表示次用户i的虚警概率,γ ij表示次用户i感知频段j的信噪比,t ij表示次用户i感知频段j的感知时间,f s表示采样频率,通常是一个定值,Q函数是一个互补的累积分布函数,表达式为:
Figure PCTCN2022107291-appb-000044
为了使次用户的检测概率具有参考意义,要求
Figure PCTCN2022107291-appb-000045
Figure PCTCN2022107291-appb-000046
Figure PCTCN2022107291-appb-000047
Figure PCTCN2022107291-appb-000048
关于t ij的一阶偏导数表示为:
Figure PCTCN2022107291-appb-000049
进而,U ij关于t ij的一阶偏导数表示为:
Figure PCTCN2022107291-appb-000050
进而,U ij关于t ij的二阶偏导数表示为:
Figure PCTCN2022107291-appb-000051
其中,
Figure PCTCN2022107291-appb-000052
因为任务预算B j、感知时间t ij、采样频率f s、检测概率
Figure PCTCN2022107291-appb-000053
信噪比γ ij都是正值,所以K'中第二部分
Figure PCTCN2022107291-appb-000054
小于0,第三部分
Figure PCTCN2022107291-appb-000055
小于0,又因为K<0,所以K'中第一部分
Figure PCTCN2022107291-appb-000056
小于0,所以K'小于0,又因为U ij关于t ij二阶偏导的前一部分
Figure PCTCN2022107291-appb-000057
大于0,从而可知U ij关于t ij的二阶偏导数
Figure PCTCN2022107291-appb-000058
Figure PCTCN2022107291-appb-000059
的效用函数U ij是关于t ij的严格凸函数,存在唯一的最优解。
由于U ij关于t ij的二阶偏导数恒为负值,意味着U ij关于t ij的一阶偏导数单调递减,又因为K<0,即
Figure PCTCN2022107291-appb-000060
所以有
Figure PCTCN2022107291-appb-000061
假设当K=0时,可得
Figure PCTCN2022107291-appb-000062
从而有
Figure PCTCN2022107291-appb-000063
即U ij关于t ij的一阶偏导数存在正值。
假设当K→-∞时,可得t ij→∞,从而有
Figure PCTCN2022107291-appb-000064
由于β>0,所以当t ij→∞时,
Figure PCTCN2022107291-appb-000065
即U ij关于t ij的一阶偏导数存在负值。
因此若
Figure PCTCN2022107291-appb-000066
的最大值大于0,则最优的感知时间
Figure PCTCN2022107291-appb-000067
可以通过下列方程组得到:
Figure PCTCN2022107291-appb-000068
Figure PCTCN2022107291-appb-000069
的最大值小于0,则
Figure PCTCN2022107291-appb-000070
效用最大时对应的感知时间为
Figure PCTCN2022107291-appb-000071
因此,
Figure PCTCN2022107291-appb-000072
的感知时间博弈存在唯一纳什均衡解,即
Figure PCTCN2022107291-appb-000073
检测概率博弈存在唯一纳什均衡解。
Step2:构建领导层优化问题:综合考虑经表决融合后的检测概率和任务报酬定义了感知需求次用户的效用函数,即领导层的优化问题就是使感知需求次用户的效用最大化。
考虑
Figure PCTCN2022107291-appb-000074
的效用与发布的报酬以及
Figure PCTCN2022107291-appb-000075
感知相应频段的检测概率有关,通过向
Figure PCTCN2022107291-appb-000076
发放报酬可以激励更多的协作感知次用户参与感知。感知需求次用户
Figure PCTCN2022107291-appb-000077
通过招募
Figure PCTCN2022107291-appb-000078
完成相应频段感知任务得到的效用定义为:
Figure PCTCN2022107291-appb-000079
其中α表示加权系数,
Figure PCTCN2022107291-appb-000080
表示感知需求次用户经过表决融合之后得到频段j的检测概率,
Figure PCTCN2022107291-appb-000081
表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬。每个感知需求次用户采用表决融合准则对多个协作感知次用户提交的感知结果进行处理,经过表决融合后感知任务j的检测概率表示为:
Figure PCTCN2022107291-appb-000082
其中
Figure PCTCN2022107291-appb-000083
表示次用户i感知频段j的虚警概率,
Figure PCTCN2022107291-appb-000084
表示次用户i感知频段j的检测概率。因此,领导层感知需求次用户
Figure PCTCN2022107291-appb-000085
的优化问题表示为:
Figure PCTCN2022107291-appb-000086
假设每个感知需求次用户支付给协作感知次用户的总报酬不超过B max,那么在0<B j≤B max的范围内一定存在一个最优的报酬
Figure PCTCN2022107291-appb-000087
使得感知需求次用户的效用函数值最大。
Step3:将感知需求次用户向协作感知次用户支付报酬的问题构建成基于Stackelberg博弈的多频段群智频谱感知系统模型,在博弈模型中感知需求次用户是领导层,协作感知次用户是从属层,每个协作感知次用户可以感知所有频段,但同时只能感知一个频段;
Step4:感知需求次用户对于要感知的频段向协作感知次用户发布任务及初始报酬,初始化感知需求次用户的最大效用
Figure PCTCN2022107291-appb-000088
为0;
Step5:所有协作感知次用户根据报酬及感知时间计算其在当前报酬下的效用,通过优化感知时间选择效用最大时对应的感知时间,协作感知次用户根据该感知时间计算对应的检测概率与成本,并将数据对感知时间,检测概率,基于成本产生的报价)传送给感知需求次用户;
Step6:感知需求次用户根据其报酬向检测概率高的协作感知次用户发送招募意愿及支付报酬的价格;
Step7:若协作感知次用户被多个感知需求次用户同时招募时,通过比较多个感知需求次用户提供的价格选择能够获得报酬多的感知任务加入;
Step8:感知需求次用户计算在当前报酬下的效用,如果该效用值高于
Figure PCTCN2022107291-appb-000089
则将该报酬记录下来,在最大报酬B max的限制下以步长μ增加报酬值,发布新的报酬并重复Step5-Step8,直到相邻两次感知需求次用户的效用值误差小于δ;
Step9:将Step8中得到的感知需求次用户效用最优时对应的报酬作为最终的报酬,协作感知次用户根据该报酬确定最终的感知时间并将感知数据上传给感知需求次用户,得到最终的 判决结果。
综上所述,本发明针对频谱感知场景,结合群智感知技术,提出了一种基于Stackelberg博弈的多频段群智频谱感知方法。该方法将感知需求次用户向协作感知次用户支付报酬的问题建模为Stackelberg博弈模型,其中感知需求次用户是博弈模型中的领导层,协作感知次用户是博弈模型中的从属层。在领导层博弈中,综合考虑检测概率和报酬定义了感知需求次用户的效用,通过博弈优化报酬以获得最佳效用;在从属层博弈中,综合考虑检测概率和感知时间定义了协作感知次用户的效用,根据感知需求次用户发布的报酬通过优化感知时间以获得最佳效用,并且推导证明了感知时间的优化存在纳什均衡。

Claims (8)

  1. 一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,将感知需求次用户向协作感知次用户支付报酬的问题建模为Stackelberg博弈系统模型,其中感知需求次用户是博弈模型中的领导层,协作感知次用户是博弈模型中的从属层;所述系统的场景是一个圆形区域,随机分布有N个协作感知次用户和M个感知需求次用户,取M=2,即存在两个感知需求次用户;所述的感知需求次用户的集合,表示为
    Figure PCTCN2022107291-appb-100001
    所述的协作感知次用户的集合表示为
    Figure PCTCN2022107291-appb-100002
    所述方法包括以下步骤:
    Step1、构建从属层优化问题,并推导协作感知次用户博弈存在纳什均衡解:综合考虑检测概率、感知时间定义协作感知次用户的效用函数,使协作感知次用户的效用最大化;
    Step2、构建领导层优化问题:综合考虑经表决融合后的检测概率和任务报酬定义了感知需求次用户的效用函数,使感知需求次用户的效用最大化;
    Step3、将感知需求次用户向协作感知次用户支付报酬的问题构建成基于Stackelberg博弈的多频段群智频谱感知系统模型,在博弈模型中感知需求次用户是领导层,协作感知次用户是从属层,每个协作感知次用户可以感知所有频段,但同时只能感知一个频段;
    Step4、感知需求次用户对于要感知的频段向协作感知次用户发布任务及初始报酬,初始化感知需求次用户的最大效用
    Figure PCTCN2022107291-appb-100003
    为0;
    Step5、所有协作感知次用户根据报酬及感知时间计算其在当前报酬下的效用,通过优化感知时间选择效用最大时对应的感知时间,协作感知次用户根据该感知时间计算对应的检测概率与成本,并将数据对(感知时间,检测概率,基于成本产生的报价)传送给感知需求次用户;
    Step6、感知需求次用户根据其报酬向检测概率高的协作感知次用户发送招募意愿及支付报酬的价格;
    Step7、若协作感知次用户被多个感知需求次用户同时招募时,通过比较多个感知需求次用户提供的价格选择能够获得报酬多的感知任务加入;
    Step8、感知需求次用户计算在当前报酬下的效用,如果该效用值高于
    Figure PCTCN2022107291-appb-100004
    则将该报酬记录下来,在最大报酬B max的限制下以步长μ增加报酬值,发布新的报酬并重复Step5-Step8,直到相邻两次感知需求次用户的效用值误差小于δ;
    Step9、将Step8中得到的感知需求次用户效用最优时对应的报酬作为最终的报酬,协作感知次用户根据该报酬确定最终的感知时间并将感知数据上传给感知需求次用户,得到最终的判决结果。
  2. 根据权利要求1所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,协作感知次用户
    Figure PCTCN2022107291-appb-100005
    的报酬p ij为:
    Figure PCTCN2022107291-appb-100006
    其中
    Figure PCTCN2022107291-appb-100007
    表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬,T j表示参与频段感知任务j的协作次用户集合。
  3. 根据权利要求1所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,协作感知次用户的成本c ij为:
    c ij=β×t ij+γ×d ij  (2)
    其中β和γ表示加权系数,t ij表示协作感知次用户i感知频段任务j的感知时间,d ij表示协作感知次用户i和感知需求次用户j之间的距离。
  4. 根据权利要求1所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,协作感知次用户
    Figure PCTCN2022107291-appb-100008
    的效用为:
    Figure PCTCN2022107291-appb-100009
    其中p ij表示协作感知次用户i获得的报酬,c ij表示协作感知次用户i完成频段感知任务j消耗的成本,
    Figure PCTCN2022107291-appb-100010
    表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬,β和γ表示加权系数,t ij表示协作感知次用户i感知频段任务j的感知时间,d ij表示协作感知次用户i和感知需求次用户j之间的距离。
  5. 根据权利要求1所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,协作感知次用户i感知频段j的检测概率公式为:
    Figure PCTCN2022107291-appb-100011
    其中,p f表示次用户i的虚警概率,γ ij表示次用户i感知频段j的信噪比,t ij表示次用户i感知频段j的感知时间,f s表示采样频率,通常是一个定值,Q函数是一个互补的累积分布函数, 表达式为:
    Figure PCTCN2022107291-appb-100012
  6. 根据权利要求1所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,感知需求次用户
    Figure PCTCN2022107291-appb-100013
    通过招募协作感知次用户
    Figure PCTCN2022107291-appb-100014
    完成相应频段感知任务得到的效用表示为:
    Figure PCTCN2022107291-appb-100015
    其中α表示加权系数,
    Figure PCTCN2022107291-appb-100016
    表示感知需求次用户经过表决融合之后得到频段j的检测概率,
    Figure PCTCN2022107291-appb-100017
    表示协作感知次用户i感知频段任务j的检测概率,B j表示感知需求次用户j发布的报酬。
  7. 根据权利要求1所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,所述感知需求次用户经过表决融合后感知任务j的检测概率表示为:
    Figure PCTCN2022107291-appb-100018
    其中
    Figure PCTCN2022107291-appb-100019
    表示次用户i感知频段j的虚警概率,
    Figure PCTCN2022107291-appb-100020
    表示次用户i感知频段j的检测概率。
  8. 根据权利要求1至7任一项所述的一种基于Stackelberg博弈的多频段群智频谱感知方法,其特征在于,所述的协作感知次用户的采样频率取10kHz,虚警概率取0.1,加权系数α=8,β=1,γ=0.3,无线信号传输考虑大尺度衰落,其衰落系数取4,表决融合准则的判决门限值取N/2。
PCT/CN2022/107291 2022-05-30 2022-07-22 一种基于Stackelberg博弈的多频段群智频谱感知方法 WO2023231165A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210598013.8A CN115102648B (zh) 2022-05-30 2022-05-30 一种基于Stackelberg博弈的多频段群智频谱感知方法
CN202210598013.8 2022-05-30

Publications (1)

Publication Number Publication Date
WO2023231165A1 true WO2023231165A1 (zh) 2023-12-07

Family

ID=83288924

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/107291 WO2023231165A1 (zh) 2022-05-30 2022-07-22 一种基于Stackelberg博弈的多频段群智频谱感知方法

Country Status (2)

Country Link
CN (1) CN115102648B (zh)
WO (1) WO2023231165A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117118547A (zh) * 2022-10-24 2023-11-24 湖南科技学院 认知无线电网络中优化能效与感知性能的动态分组算法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080167064A1 (en) * 2006-12-11 2008-07-10 Yeheskel Bar-Ness Method and system for decentralized power control of a multi-antenna access point using game theory
CN105528248A (zh) * 2015-12-04 2016-04-27 北京邮电大学 多任务合作应用下的群智感知激励机制
CN109492887A (zh) * 2018-10-25 2019-03-19 浙江工商大学 基于势博弈理论的移动群智感知激励方法
CN110149161A (zh) * 2019-05-10 2019-08-20 南京邮电大学 一种基于Stackelberg博弈的多任务协作频谱感知方法
CN110798273A (zh) * 2019-10-21 2020-02-14 南京邮电大学 一种基于次用户效用最优的协作频谱感知方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8805428B2 (en) * 2010-10-27 2014-08-12 Dynamic Invention Llc Cooperative spectrum sensing in cognitive radio networks
CN104780007A (zh) * 2015-04-03 2015-07-15 南京邮电大学 基于联盟博弈的多用户协作频谱感知方法
CN112333715A (zh) * 2015-09-15 2021-02-05 索尼公司 电子设备及频谱管理方法、基站侧的电子设备及方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080167064A1 (en) * 2006-12-11 2008-07-10 Yeheskel Bar-Ness Method and system for decentralized power control of a multi-antenna access point using game theory
CN105528248A (zh) * 2015-12-04 2016-04-27 北京邮电大学 多任务合作应用下的群智感知激励机制
CN109492887A (zh) * 2018-10-25 2019-03-19 浙江工商大学 基于势博弈理论的移动群智感知激励方法
CN110149161A (zh) * 2019-05-10 2019-08-20 南京邮电大学 一种基于Stackelberg博弈的多任务协作频谱感知方法
CN110798273A (zh) * 2019-10-21 2020-02-14 南京邮电大学 一种基于次用户效用最优的协作频谱感知方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BAHARLOUEI AKRAM, JABBARI BIJAN: "A stackelberg game spectrum sensing scheme in cooperative cognitive radio networks", 2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), IEEE, 1 April 2012 (2012-04-01) - 4 April 2012 (2012-04-04), pages 2215 - 2219, XP093115540, ISBN: 978-1-4673-0435-1, DOI: 10.1109/WCNC.2012.6214160 *

Also Published As

Publication number Publication date
CN115102648B (zh) 2023-07-25
CN115102648A (zh) 2022-09-23

Similar Documents

Publication Publication Date Title
CN104579523B (zh) 认知无线网络频谱感知和接入决策联合的优化方法
Elhabyan et al. A Pareto optimization-based approach to clustering and routing in Wireless Sensor Networks
Huang et al. Efficient workload allocation and user-centric utility maximization for task scheduling in collaborative vehicular edge computing
Long et al. Non-cooperative power control for wireless ad hoc networks with repeated games
Ren et al. Pricing and distributed power control in wireless relay networks
Lu et al. Optimization of task offloading strategy for mobile edge computing based on multi-agent deep reinforcement learning
WO2023231165A1 (zh) 一种基于Stackelberg博弈的多频段群智频谱感知方法
CN110798273B (zh) 一种基于次用户效用最优的协作频谱感知方法
Wang et al. Game-theoretic algorithm designs and analysis for interactions among contributors in mobile crowdsourcing with word of mouth
Cao et al. Cognitive radio networks with heterogeneous users: How to procure and price the spectrum?
Wu et al. Optimal energy-efficient sensing in cooperative cognitive radio networks
Shu et al. A game theoretic approach for energy‐efficient communications in multi‐hop cognitive radio networks
CN110149161B (zh) 一种基于Stackelberg博弈的多任务协作频谱感知方法
Fantacci et al. A d2d-aided federated learning scheme with incentive mechanism in 6G networks
CN112261704B (zh) 一种基于可充电无线传感器网络的重复博弈路由方法
Pilloni et al. IoT_ProSe: Exploiting 3GPP services for task allocation in the Internet of Things
Haldorai et al. Energy Efficient Network Selection for Cognitive Spectrum Handovers
Jiang et al. Age of information-based computation offloading and transmission scheduling in mobile edge computing-enabled iot networks
Khalifa et al. Enhanced cooperative behavior and fair spectrum allocation for intelligent IoT devices in cognitive radio networks
Su et al. Dynamic resource allocation in queue-constrained and delay-sensitive vehicular networks
CN108631895B (zh) 一种基于贝叶斯博弈的频谱感知次用户激励方法
Lai et al. Optimal wireless information and energy transmissions for UAV-enabled cognitive communication systems
Zhong et al. Rate-aware fuzzy clustering and stable sensor association for load balancing in WSNs
CN106998571B (zh) Ad Hoc对等网络无中心分布式快速共识方法
Guo et al. Joint opportunistic power and rate allocation for wireless ad hoc networks: An adaptive particle swarm optimization approach

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22944478

Country of ref document: EP

Kind code of ref document: A1