CN111182571B

CN111182571B - Long-term joint optimization method for base station activation control and beam forming

Info

Publication number: CN111182571B
Application number: CN202010031744.5A
Authority: CN
Inventors: 马孟园; 林静然; 杨健; 利强; 邵怀宗
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2020-01-13
Filing date: 2020-01-13
Publication date: 2021-02-02
Anticipated expiration: 2040-01-13
Also published as: CN111182571A

Abstract

The invention discloses a long-term joint optimization method for base station activation control and beamforming, which comprises the following steps: initializing a central processing unit, a network control unit and an auxiliary processing unit in the MISO network, wherein the network control unit collects and processes information such as channels in the network, service states of base stations and the like, the central processing unit performs corresponding analysis according to received current environment information, the network control unit opens or closes a designated base station, then a forming beam is calculated and configured by combining channel information, meanwhile, a power consumption indicating quantity is fed back, the auxiliary processing unit receives and stores historical data of system operation, the data stored by the auxiliary processing unit is utilized to assist the central processing unit to update the weight of the neural network, and the auxiliary system operates normally. The invention provides a self-adaptive online solution, which reduces network power and solves the problems of low environmental adaptability, complex operation, poor practicability, high operation and maintenance cost and large capital investment in the prior art.

Description

Long-term joint optimization method for base station activation control and beam forming

Technical Field

The invention belongs to the technical field of wireless communication, and particularly relates to a long-term joint optimization method for base station activation control and beam forming.

Background

In the context of heterogeneous networks, many different types of base stations are often deployed within a cell, and maintaining the operation of these base stations consumes a significant amount of energy. Base station activation control reduces network energy consumption by shutting down operating base stations that are serving redundancies within a cell, a technique that has received significant attention in recent decades. Base station active control and beamforming typically work together to force the network to serve as many users as possible within an acceptable range of QoS. Since these problems are NP-hard (NP-hard refers to the absence of an algorithm with polynomial time to solve the problem) mixed integer programming, researchers have often adopted convex interval relaxation or applied sparsity constraints to achieve acceptable complexity processing.

Although the existing schemes have differences in formula derivation and algorithm design, the past research is to activate and deactivate the base station based on the instant Channel State Information (CSI), so the prior art has the following problems:

(1) the operation state of the base station can be changed at any time, so that the management of the network tends to be complex in actual operation, and the practicability of the activation control technology is low;

(2) the burden overhead of the system due to management is heavy, the operation and maintenance cost of the system is high, and the capital investment is large.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides a long-term joint optimization method of base station activation control and beam forming, which is simple and convenient to operate, reduces operation and maintenance cost and saves capital investment, and the base station activation control in a period of time is considered to control the operation state switching frequency of a base station so as to maintain the flexibility of base station activation and simultaneously keep the stability of the operation state of the base station to a certain degree, thereby improving the practicability and solving the problems of low environmental adaptability, complex operation, low practicability, high operation and maintenance cost and large capital investment in the prior art.

In order to achieve the purpose of the invention, the invention adopts the technical scheme that:

a long-term joint optimization method for base station activation control and beam forming comprises the following steps:

s1, initializing parameters and configurations of a central processing unit, a network control unit and an auxiliary processing unit in the MISO network;

s2, collecting and processing the service state information of the channel and the base station in the network by using the network control unit, and outputting the processed information to the central processing unit;

s3, analyzing by using the central processing unit according to the received current environment information, and returning the base station activation and closing scheme to the network control unit;

s4, the network control unit is used for turning on or off the appointed base station according to the received scheme, then the shaped beam is calculated and configured by combining the channel information, and the power consumption indication quantity in the network is fed back;

s5, utilizing the auxiliary processing unit to receive and save the history data of the system operation, including environment information, base station configuration scheme and power consumption indication amount;

s6, assisting the central processing unit to update the weight of the neural network by using the data stored by the auxiliary processing unit, and assisting the normal operation of the system;

and S7, judging whether the network service is finished, if not, returning to the step S2, otherwise, finishing the process.

Further, in step S1, initializing parameters and configurations of the central processing unit, the network control unit and the auxiliary processing unit in the MISO network includes:

setting the number K of base stations, the number M of antennas at a transmitting end, the number I of downlink users and the number T of division time slots in a network;

setting the maximum operating power P of the base station_maxNoise power σ²Lowest signal-to-noise ratio gamma of user link and control quantity lambda for maintaining operation overhead of base station₁Control quantity lambda for switching working state of base station₂；

Setting the neural network structure of the central processing unit, initializing the network weight theta and controlling parameters alpha and X_fInitializing an auxiliary processing unit memory upper limit D and an auxiliary control parameter X_b。

Further, in step S2, the network control unit is used to collect and process the current channel in the network and the service status information of the base station at the previous time, and output the processed information to the central processing unit, which is represented as:

s_t＝φ(H_t,α_t-1)

wherein s is_tThe current environment information after being processed; h_t,α_t-1Respectively serving current channel information and last time base station service state information in the networkPhi (-) is a mapping function within the network control unit.

Further, in step S4, the network control unit is utilized to turn on or off the designated base station according to the received scheme, and then combine the channel information H_tCalculating and configuring shaped beam W_tWhile feeding back the power consumption indicator r in the network_tThe method comprises the following steps:

s4-1, setting auxiliary variables and intermediate parameters;

s4-2, initializing iteration control parameters;

s4-3, updating the current objective function value, wherein the updating formula is as follows:

wherein the content of the first and second substances,

in order to update the value of the objective function,

the value is the next generation objective function value, and n is the iteration number indicating quantity;

s4-4, calculating the next generation beam forming matrix according to the auxiliary variable and the intermediate parameter, and updating the auxiliary variable and the intermediate parameter;

s4-5, calculating a next generation objective function value, updating an inner layer iteration parameter, judging whether the next generation objective function value, the current objective function value and an inner layer iteration number indicating quantity meet requirements, if so, outputting a beam forming matrix and entering the step S4-6, otherwise, entering the step S4-3;

s4-6, according to the calculated beam forming matrix W_tLast time base station service state indication quantity alpha_t-1And current base station service state indication quantity alpha_tAn indication of the power consumption r in the output network_t。

Further, in step S4-4, the method calculates a next generation beamforming matrix according to the auxiliary variable and the intermediate parameter, and updates the auxiliary variable and the intermediate parameter, including the following sub-steps:

s4-4-1, calculating the next generation beam forming matrix of the inner layer according to the current auxiliary variable and the current intermediate parameter, wherein the calculation formula is as follows:

wherein, W_kA beamforming matrix for a base station K to users in the network;

s4-4-2, updating an auxiliary variable F, wherein the updating formula is as follows:

definition of

Wherein f isⁱIs the i-th row of the F,

is fⁱThe (i) th element of (a),

is fⁱRemoving

Vector of remaining elements, yⁱIs the i-th row of the Y,

is yⁱThe (i) th element of (a),

is yⁱRemoving

Vectors of the remaining elements, beta being a parameter satisfying a constraint condition [ ·]⁺Taking a non-negative value, wherein 1 is a column vector with all elements being 1;

s4-4-3, updating the auxiliary variable U, wherein the updating formula is as follows:

wherein, F_uAnd phi_uAre respectively Fⁿ⁺¹And phiⁿRemoving the left sub-matrix of the last column;

s4-4-4, updating the intermediate parameters { Ψ, Φ }, wherein the updating formula is as follows:

further, in step S4-5, the formula for determining whether the next-generation objective function value, the current objective function value, and the indication quantity of the number of iterations in the inner layer meet the requirement is:

wherein the content of the first and second substances,

in order to update the value of the objective function,

and the value is an objective function of the next generation, N is an iteration number indicating quantity, N is the maximum iteration number, and err is the difference value of the objective function before and after iteration.

Further, in the step S4-6, the beamforming matrix W is calculated according to the calculated beamforming matrix W_tLast time base station service state indication quantity alpha_t-1And current base station service state indication quantity alpha_tAn indication of the power consumption r in the output network_tThe calculation formula of (2) is as follows:

Δ＝max(0,γ-γ_mm)

P_all(t)＝P_t(t)+λ₁P_m(t)+λ₂P_s(t)

wherein, C₁As a feedback quantity amplitude control factor, C₂Penalty factor, P, for violating user link quality of service requirements_t(t),P_m(t),P_s(t) Total Transmission Power, maintenance base station operating Power and base station State switching Power, λ, of base stations in the slotted t network, respectively₁,λ₂Respectively maintaining the control quantity of the base station operation overhead and the control quantity of the base station working state switching, wherein gamma is a preset lower limit of user service quality, and gamma is_mmAnd taking the upper limit value of the user service quality in the network when the decision a is taken for the time slot t.

Further, in step S5, the receiving and storing the history data of the system operation by using the auxiliary processing unit specifically includes:

the environmental information, the base station configuration scheme, the power consumption indication amount and the environmental information at the next time are recorded in the memory of the auxiliary processing unit as a tuple each time.

Further, in step S6, the data stored in the auxiliary processing unit is used to assist the central processing unit in updating the neural network weight, so as to assist the normal operation of the system, which specifically includes the following sub-steps:

s6-1, uniformly sampling X from the memory by using the auxiliary processing unit_bThe experience bar group (s, a, r, s') is output to the central processing unit;

s6-2, preparing a label, which is expressed as:

s6-4, every interval X of central processing unit_fStep updating step, target neural network B^-(s|θ^-) The weight is updated to be the weight theta of the neural network B (s | theta) to be estimated^-←θ。

The invention has the following beneficial effects:

(1) the invention has obvious advantages in the aspects of base station management control and communication link switching in the network, ensures the QoS condition, simultaneously reduces the system overhead and the extra resource cost, reduces the operation and maintenance cost of the system, and saves the capital investment;

(2) the invention divides a period into a plurality of time slots, models the channel into a block fading channel, and approximates the channel in the time slot to a flat fading channel, thereby greatly simplifying the processing of the subsequent problems, further reducing the operation complexity and improving the practicability.

Drawings

Fig. 1 is a flow chart of a long-term joint optimization method of base station activation control and beamforming according to the present invention;

FIG. 2 is a schematic of the overall design of the present invention;

fig. 3 is a graph comparing the results of power consumption simulation for the present invention and the conventional scheme.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

As shown in fig. 1 and fig. 2, an embodiment of the present invention provides a long-term joint optimization method for base station active control and beamforming, including the following steps S1 to S7:

in this embodiment, initializing parameters and configurations of a Central Processing Unit (CPU), a Network Control Unit (NCU), and an Auxiliary Processing Unit (APU) in an MISO (multiple-input-single-output) network according to the present invention includes:

in this embodiment, the present invention utilizes the network control unit to collect and process the current channel and the last-time base station service status information in the network, and outputs the processed information to the central processing unit, which is represented as:

s_t＝φ(H_t,α_t-1)

wherein s is_tThe current environment information after being processed; h_t,α_t-1Respectively, current channel information in the network and the last time base station service state information, phi (-) is a mapping function in the network control unit.

in this embodiment, the base station activation and deactivation scheme referred to in the present invention is the base station status indication vector α_tBase station status indication vector α_tIs 0 or 1, indicating that the corresponding base station is activated or deactivated.

in the embodiment, the invention utilizes the network control unit to turn on or off the appointed base station according to the received scheme, and then combines the channel information H_tCalculating and configuring shaped beam W_tWhile feeding back the power consumption indicator r in the network_tThe method further comprises the following steps:

s4-1, setting auxiliary variables and intermediate parameters;

s4-2, initializing iteration control parameters;

wherein the content of the first and second substances,

in order to update the value of the objective function,

s4-4, calculating the next generation beamforming matrix according to the auxiliary variable and the intermediate parameter, and updating the auxiliary variable and the intermediate parameter, including the following steps:

wherein, W_kBeamforming matrices, C, for base stations K to users in the network^M×IThe complex matrix of M × I is represented by the formula

Wherein, U_kAnd Ψ_kIs an intermediate parameter, δ is a parameter satisfying a constraint condition, and ρ is an iterative convergence rate control quantity.

If it is

δ is 0;

otherwise, search delta by dichotomy>0 satisfies

definition of

Wherein f isⁱIs the i-th row of the F,

is fⁱThe (i) th element of (a),

is fⁱRemoving

Vector of remaining elements, yⁱIs the i-th row of the Y,

is yⁱThe (i) th element of (a),

is yⁱRemoving

s4-5, calculating the next generation objective function value, updating the inner layer iteration parameter, and judging whether the next generation objective function value, the current objective function value and the inner layer iteration frequency indication quantity meet the requirements, wherein the judgment formula is as follows:

wherein the content of the first and second substances,

in order to update the value of the objective function,

If yes, outputting a beamforming matrix and entering the step S4-6, otherwise, entering the step S4-3;

s4-6, according to the calculated beam forming matrix W_tLast moment base station clothesTraffic status indicator quantity alpha_t-1And current base station service state indication quantity alpha_tAn indication of the power consumption r in the output network_tThe calculation formula is as follows:

Δ＝max(0,γ-γ_mm)

P_all(t)＝P_t(t)+λ₁P_m(t)+λ₂P_s(t)

wherein, C₁As a feedback quantity amplitude control factor, C₂Penalty factor, P, for violating user link quality of service requirements_t(t),P_m(t),P_s(t) Total Transmission Power, maintenance base station operating Power and base station State switching Power, λ, of base stations in the slotted t network, respectively₁,λ₂Respectively maintaining the control quantity of the base station operation overhead and the control quantity of the base station working state switching, wherein gamma is a preset lower limit of user service quality, and gamma is_mmThe upper limit of the user service quality in the network when the decision a is taken for the time slot t can be calculated by the dichotomy and is expressed as

P_m(t)＝p_m||a_t||₁

P_s(t)＝p_s||a_t-a_t-1||₁

Wherein p is_mAnd p_sRespectively, the power consumed for maintaining the operation of the base station and the power consumed for switching the service state of the base station are set to be a proper normal number.

in this embodiment, the present invention provides the environment information s each time_tBase station configuration scheme a_t-1Power, powerConsumption indicating quantity r_tAnd the next time environmental information s_t+1Recorded as a tuple in the auxiliary processing unit memory.

in this embodiment, the data stored in the auxiliary processing unit is used to assist the central processing unit to update the neural network weight, and the normal operation of the auxiliary system is assisted, which specifically includes the following steps:

s6-1, uniformly sampling X from the memory by using the auxiliary processing unit_bA set of experience bars (s, a, r, s '), where s, a, r, s' respectively denote s for any t_t、a_t-1、r_t、s_t+1And output to the central processing unit;

s6-2, preparing a label, which is expressed as:

wherein z represents a label and a' is represented at s_t+1A temporal base station configuration scheme;

Wherein v ═ B (s | θ) is the network to be estimated; v. of^-＝B^-(s|θ^-) Is a target network; v. of_aB (s, a | θ) represents a value v corresponding to the selected base station configuration scheme a_a(ii) a Where theta and theta^-Representing a neural network weight parameter;

is shown in (2)^KValue vectors corresponding to the base station configuration schemes; theta and target theta^-Representing the network weight parameter.

In the embodiment of the invention, a cell with 5 4-antenna base stations BS and 5 single-antenna users is considered, K is 5, I is 5, the users are randomly distributed in a hexagonal cell with the radius of 1km, and the base station is positioned in the center of the cell. Suppose a transmission process is divided into 10 transmission time slots, and T is 10; in the specific implementation process, a standard complex Gaussian is used to generate a channel matrix H, the variance of the channel matrix H follows a probability distribution model of a cell and a user, and lambda is set₂50, user signal-to-noise ratio minimum threshold gamma 1, and transmitting power p_s1, variance of noise σ²Base station operation maintenance power p 1_mBase station service state switching power p 1_s1, feedback quantity amplitude control factor C₁＝10³Penalty factor C for violating user link QoS requirement₂＝10⁶The CPU control parameter α is 0.3, X_f5, the APU memory upper limit D is 10, and the auxiliary control parameter X_b＝10；

Calculating the beam forming matrix W of each base station in each time slot_kThe iteration number indicating amount n is 0, the iteration convergence rate control amount ρ is 10, and the minimum difference err between the target values before and after the iteration is 10^-2The maximum number of iterations N is 100. The simulation performance of the invention and the traditional scheme is changed when the control quantity lambda is changed₁The comparison is carried out under the value, and the simulation results of the two algorithms are shown in figure 3.

The long-term combined optimization method for base station activation control and beam forming, which is simple and convenient to operate, reduces operation and maintenance cost and saves capital investment, improves the practicability and solves the problems of low environmental adaptability, complex operation, low practicability, high operation and maintenance cost and large capital investment in the prior art.

It will be appreciated by those of ordinary skill in the art that the embodiments described herein are intended to assist the reader in understanding the principles of the invention and are to be construed as being without limitation to such specifically recited embodiments and examples. Those skilled in the art can make various other specific changes and combinations based on the teachings of the present invention without departing from the spirit of the invention, and these changes and combinations are within the scope of the invention.

Claims

1. A long-term joint optimization method for base station activation control and beamforming is characterized by comprising the following steps:

s1, initializing parameters and configurations of the hub processing unit, the network control unit and the auxiliary processing unit in the MISO network, including:

Setting the neural network structure of the central processing unit, initializing the network weight theta and controlling parameters alpha and X_fInitializing an auxiliary processing unit memory upper limit D and an auxiliary control parameter X_b；

S2, using the network control unit to collect and process the channel in the network and the service state information of the base station, outputting the processed information to the central processing unit, and showing as:

s_t＝φ(H_t,α_t-1)

wherein s is_tThe current environment information after being processed; h_t,α_t-1Respectively the current channel information and the last time base station service state information in the network, phi (-) is a mapping function in the network control unit;

s3, analyzing by using the central processing unit according to the received current environment information, and returning the base station activation and closing scheme to the network control unit; wherein the base station activation and deactivation scheme is a base station state indication vector alpha_tBase station status indication vector α_tThe element of (1) is 0 or 1, which indicates that the corresponding base station is activated or closed;

s4, the network control unit is used to turn on or off the appointed base station according to the received scheme, then the shaped beam is calculated and configured by combining the channel information, and the power consumption indication quantity in the network is fed back, the method comprises the following steps:

s4-1, setting auxiliary variables and intermediate parameters;

s4-2, initializing iteration control parameters;

wherein the content of the first and second substances,

in order to update the value of the objective function,

wherein, W_kA beamforming matrix for a base station K to users in the network;

definition of

Wherein f isⁱI row of F, F_i ⁱIs fⁱThe (i) th element of (a),

is fⁱRemoving f_i ⁱVector of remaining elements, yⁱIs the i-th row of the Y,

is yⁱThe (i) th element of (a),

is yⁱRemoving

s4-6, according to the calculated beam forming matrix W_tLast time base station service state indication quantity alpha_t-1And current base station service state indication quantity alpha_tAn indication of the power consumption r in the output network_t；

S5, receiving and storing historical data of system operation, including environment information, base station configuration scheme, and power consumption indication amount, by using the auxiliary processing unit, specifically:

recording the environment information, the base station configuration scheme, the power consumption indication quantity and the environment information at the next moment as a tuple in the memory of the auxiliary processing unit each time;

s6, assisting the central processing unit to update the weight of the neural network by using the data stored by the auxiliary processing unit, and assisting the normal operation of the system, specifically comprising the following steps:

s6-1, uniformly sampling X from the memory by using the auxiliary processing unit_bThe experience bar group (s, a, r, s') is output to the central processing unit; wherein s, a, r, s' each represent s of any t_t、a_t-1、r_t、s_t+1；

S6-2, preparing a label, which is expressed as:

s6-4, pivot processingUnit per interval X_fStep updating step, target neural network B^-(s|θ^-) The weight is updated to be the weight theta of the neural network B (s | theta) to be estimated^-←θ；

2. The long-term joint optimization method for base station activation control and beamforming as claimed in claim 1, wherein in step S4-5, the formula for determining whether the requirement is met according to the next-generation objective function value, the current objective function value and the indication quantity of the number of inner-layer iterations is:

wherein the content of the first and second substances,

in order to update the value of the objective function,

3. The long-term joint optimization method for base station active control and beamforming as claimed in claim 1, wherein in step S4-6, based on the calculated beamforming matrix W_tLast time base station service state indication quantity alpha_t-1And current base station service state indication quantity alpha_tAn indication of the power consumption r in the output network_tThe calculation formula of (2) is as follows:

Δ＝max(0,γ-γ_mm)

P_all(t)＝P_t(t)+λ₁P_m(t)+λ₂P_s(t)