CN113242066B

CN113242066B - Multi-cell large-scale MIMO communication intelligent power distribution method

Info

Publication number: CN113242066B
Application number: CN202110505185.1A
Authority: CN
Inventors: 黄永明; 吴珩; 张征明; 张铖; 葛瑶
Original assignee: Southeast University
Current assignee: Southeast University
Priority date: 2021-05-10
Filing date: 2021-05-10
Publication date: 2022-07-26
Anticipated expiration: 2041-05-10
Also published as: CN113242066A

Abstract

The invention discloses a multi-cell large-scale MIMO communication intelligent power distribution method, which comprises the steps of firstly generating a data set according to a traditional model-based optimal power distribution method, taking position information and channel state information of all users in a network as input characteristics, and taking the result of optimal power distribution as an output label, thereby generating a sample. The number of users contained in each sample of the data set is random. And further providing a seq2seq method based on a long-short term memory network, wherein a data set is used for training to obtain a mapping relation between the position information of the user and a power distribution strategy. The invention utilizes the communication network environment characteristics and the channel information to mine the relationship among the user characteristics, the power distribution and the system performance, designs the data-driven power distribution method, breaks through the limitation that the traditional method based on the neural network can only process the scene of fixed user number, and can flexibly adapt to the real-time change condition of the user number and the position only by training a single network.

Description

Multi-cell large-scale MIMO communication intelligent power distribution method

Technical Field

The invention relates to a multi-cell large-scale MIMO communication intelligent power distribution method, and belongs to the technical field of wireless communication.

Background

To meet the ever-increasing demands for communication equipment and data traffic, massive MIMO technology has been proposed and developed in the last few years. A base station in a large-scale MIMO network is provided with hundreds of antennas to obtain spatial multiplexing gain, and tens of users can be served on the same time-frequency resource, so that the frequency spectrum efficiency is effectively improved, and the base station becomes one of key technologies of 5G communication. The pilot pollution and the interference between users are usually the limitations of the MIMO system, and the interference is usually reduced and the spectrum efficiency is improved by reasonably allocating resources. And because a large-scale MIMO system is configured with a large-scale antenna array, a network can obtain larger spatial transmission gain, and a channel depends on large-scale fading, so that the problem of resource allocation is easier to solve.

Power allocation is a key technology for increasing the capacity of a communication system, and has been widely studied in academia. However, the traditional model-based method has the problem of high solving complexity. In view of the rapid development of data science and artificial intelligence, a communication system power control technology based on a neural network is developed. The method has the advantages that the full-connection neural network is used for fitting the traditional power distribution strategies such as the maximum minimum rate, the maximum rate, the rate and the like in the large-scale MIMO system by using the deep learning technology, a reasonable power distribution scheme can be provided under the condition that the position information of a user is known, and the performance of the method approaches to the traditional algorithm, and meanwhile, the complexity and the processing time of the optimization process are greatly reduced; there are also scholars who train a deep neural network to perform centralized power distribution in a de-cellular massive MIMO scenario based on dynamic cooperative clustering, or train a deep neural network through each access point, performing distributed power distribution using only locally available information as input. There have also been some studies on the decision to learn power allocation using the advantages of reinforcement learning. The methods fully utilize the capabilities of a neural network learning complex mode and approximation function mapping, and can greatly reduce the complexity and processing time of on-line execution while approximating the performance to the traditional algorithm. The limitation is that these methods are only for a fixed number of massive MIMO networks, i.e. both the number of cells (or access point number) and the number of users are fixed.

However, in practical communication scenarios, the number of users served at different times per cell is constantly changing, which is not in accordance with the assumptions made in previous studies. Although the deep neural network has flexibility and strong function fitting capability, it can only be applied to the problem that the input and target can be reasonably coded by vectors of fixed dimensions, which brings huge limitation to solving the wireless communication problem by using the neural network. At present, the relevant research is still insufficient, and a deep neural network based on residual error intensive connection of a convolutional neural network is designed by scholars for solving the problem and is used for processing the number of dynamically changed active users, so that the mapping of the large-scale fading coefficient and the power distribution strategy of the users is realized, the method limits the maximum number of users which can be served by each cell, and the large-scale fading coefficient of the number of inactive users is set to be 0. Thus, setting the number of the maximum service users in the fixed network to 0 inevitably causes a certain loss to the performance. From another point of view, the invention is inspired by successfully applying seq2seq neural network to process the problem of translation of variable-length text in the field of text processing, converts the problem of power distribution of the number of dynamic users into the problem of mapping from sequences of indefinite length to sequences, and learns the mapping relation by using the network of the seq2seq structure.

Disclosure of Invention

The technical problem is as follows: the invention aims to solve the defects that the existing power distribution method based on a model is high in solving complexity and only suitable for a fixed communication network scale, and provides a power distribution method based on a seq2seq neural network aiming at a scene of dynamic change of the number of multi-cell large-scale MIMO users. The method fully utilizes the relation between the user information and the power distribution mined by the neural network, can reduce the solving complexity while ensuring that the performance approaches the traditional method, and more importantly, can flexibly adapt to the scene of the change of the number and the position of the users in the communication network at any moment.

The technical scheme is as follows: in order to achieve the purpose, the invention adopts the technical scheme that: a multi-cell large-scale MIMO communication intelligent power distribution method. The technical scheme comprises the following steps:

the method comprises the following steps: and establishing a multi-cell large-scale MIMO system model with dynamically changed user number, and designing an objective function.

Step two: randomly setting the number and the position of users in each cell, solving an optimal power distribution strategy by using a max product SINR method, collecting user position information, channel state information and a corresponding optimal power distribution result, carrying out appropriate data preprocessing, taking the user position information and the channel state information in a network as input characteristics, taking the power distribution result as an output label, and collecting for multiple times to form a data set.

Step three: and constructing a seq2seq network model for processing variable-length sequences of input and output.

Step four: and (4) training the data set generated in the step two by using the network built in the step three.

Step five: and inputting a sequence generated by the user position and the channel state information in the current network into the trained seq2seq network model to obtain an optimal power distribution sequence in the current scene, and performing inverse normalization processing to obtain an optimal power distribution result.

In the first step, a multi-cell large-scale MIMO system model with dynamically changed user number is established. Suppose a massive MIMO communication network has L cells, each of which has a base station equipped with M antennas. Assuming that the number of users in each cell changes dynamically, the number of users served by the ith cell is k _i The number of users served by each cell in the network can be expressed as K ═ K _i I 1, a, L, with a _i ＝{1,2,...,k _i Represents the user number set in the ith cell, and the size of the set is k _i Determining that N is k ₁ +k ₂ +...+k _L Is the total number of all users in the current system.

Channel usage between user n and base station j in cell l

To indicate that the base station of cell l transmits downlink signals

Wherein

Is a signal sent to user n in cell l, using a precoding vector

To control the spatial directionality of the transmission, the precoding vector satisfies | | w _l,n || ² ＝1，ρ _l,n Indicating the transmission power allocated by the base station to user n in cell i. Further, the downlink spectrum efficiency of the user n in the cell l of the massive MIMO system is:

wherein

Representing the ratio, σ, of data for downlink transmission in each coherent block ² Representing the noise power, the superscript dl for SINR represents the downlink, E { · } represents the expectation, | · | represents the absolute value operation,

denotes w _l,n The conjugate transpose of (c).

For convenience of representation, the above spectral efficiency is re-expressed as:

wherein

a _l,n And b _i,k,l,n Mean channel gain and mean interference gain are indicated separately.

The objective function considered in step one is:

wherein

Indicating the total transmission power that can be provided by a base station.

And step two, generating a data set. In each data acquisition process, the number k of users in each cell is randomly set in a network area _l And location x of each user _l,n Then calculating large scale fading coefficient

And channel correlation matrix

Obtaining estimated channel vector by using channel estimation method based on minimum mean square error algorithm

Computing a precoding vector w _l,n . Averaging the estimated channels by Monte Carlo to obtain { a _l,n And { b }and _i,k,l,n }. Power allocation strategy for maximizing objective function by traditional geometric programming solution

Definition S ═ (S) ₁ ,s ₂ ,...,s _N ) Is an ordered sequence of numbers in which s _i The vector (G, D, a, I) is a one-dimensional vector for representing information about the user I in the system, and G ═ G ₁ ,g ₂ ,...,g _L ) Denotes the channel gain between the user and L base stations, D ═ D ₁ ,d ₂ ) Representing usersPosition information, where a denotes a signal amount transmitted by a user, and I denotes an amount of (I) transmitted by a user ₁ ,i ₂ ,...,i _N-1 ) The amount of interference to that user for other users in the system. The information of the N users together form an ordered sequence S. Taking S as an input feature, ρ ^* As an output tag. One sample is formed for one data acquisition, resulting in a data set with a large number of samples. To better fit the network, a logarithmic transformation and normalization process is used on the samples.

The size of I is fixed to the maximum number of users which can be served by the network, and the I of each sample is supplemented to the fixed length by adopting a processing mode of filling 0 at the tail end.

And in the third step, a network model is built, and the built seq2seq model is divided into two parts, namely an encoder part and a decoder part.

In order to avoid the limitation that the traditional Recurrent Neural Network (RNN) can have long-term dependence, the encoder part uses a long-term memory network (LSTM) to fully mine the correlation of users at different positions in the communication network. The input dimension of the encoder is set to the dimension of the information of the single user described in step two, i.e. s _i Of (c) is measured. The encoder reads the sequences in sequence in the order of the sequence S, thereby converting the sequences of indefinite length into intermediate vectors of fixed dimensions. The processing of the encoder can be expressed as:

wherein x is _t Is an input variable at time t, W is a weight matrix of the neural network unit, b represents bias, σ (-) and g (-) represent sigmoid function and tanh function, respectively, f _t 、i _t 、C _t 、

And o _t All are intermediate variables in the process, h _t For the output variable of LSTM, i.e. the hidden state obtained after the input information passes through the encoder of the seq2seq network, assuming that the input of the sequence is completed at time t equal to N, then h _N All input information is processed and converted into a context vector with fixed length.

The decoder part is another LSTM network and needs to ensure that its hidden layer dimension is the same as the dimension of the context vector. And then connecting a full connection layer, converting the output of each step into a value, and performing inverse normalization processing on the value to obtain the power value which is calculated by the seq2seq model and is distributed to the current user.

Before training the established neural network, firstly, initializing parameters of the model, and setting the batch size n _batch Training iteration number n _epoch . Each time n is input _batch The characteristic part S of each sample is processed by an encoder to obtain an intermediate variable h _N Inputting the input data into a decoder end, setting a guiding strength factor, selecting the output of the previous step as the current input variable with a certain probability in each step, or else, using the corresponding label value as the input variable, and then obtaining the output vector of the model

Calculating a loss function

And updating model parameters by adopting an adaptive moment estimation optimization method. And freezing the network parameters after the training is finished.

The algorithm application process in the fifth step is specifically as follows: and preprocessing the user position and the channel state information in the current network to generate a sequence with a fixed format, inputting the sequence into a trained neural network model to obtain an optimal power distribution sequence in the current scene, and performing inverse normalization processing on the optimal power distribution sequence to obtain an optimal power distribution result.

Has the beneficial effects that: compared with the prior art, the technical scheme of the invention has the following beneficial technical effects:

the invention provides a multi-cell large-scale MIMO communication intelligent power distribution method, which considers the situation that the number and the position of users change in real time in an actual scene. The long-time and short-time memory network is utilized to mine the correlation of users at different positions, and the mapping relation between the user position information and the channel state information and the power distribution is fitted, so that the performance can approach the traditional algorithm, and the method can adapt to the scene that the number and the position of the users are constantly changed in real time.

Drawings

Fig. 1 is a flowchart of an intelligent power allocation method for multi-cell large-scale MIMO communication according to the present invention.

Fig. 2 is a structural diagram of a seq2seq neural network constructed in the embodiment of the present invention.

Fig. 3 and 4 are graphs comparing the performance of an intelligent power allocation method for multi-cell massive MIMO communication according to an embodiment of the present invention with that of a conventional method.

Detailed Description

The present invention will now be described in further detail with reference to the drawings and detailed description of exemplary embodiments of the invention, it being understood that the embodiments described are merely illustrative of some, and not restrictive, of the invention. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.

The following is an implementation manner of the intelligent power allocation method suitable for multi-cell large-scale MIMO communication of the present invention:

Step two: the number and the position of users in each cell are randomly set, an optimal power distribution strategy is solved by using a traditional optimization method, user position information, channel state information and a corresponding optimal power distribution result are collected, appropriate data preprocessing is carried out, the user position information and the channel state information in a network are used as input characteristics, the power distribution result is used as an output label, and a data set is formed by multiple times of collection.

In the first step, a multi-cell large-scale MIMO system model with dynamically changed user number is established. The embodiment of the invention assumes that a large-scale MIMO communication network is provided with 4 cells, the coverage area of each cell is 0.5km multiplied by 0.5km, and the center of each cell is provided with a base station which is provided with 100 antennas. Assuming that the number of users in each cell is not fixed, the number of users served by the ith cell is k _i The number of users served in each cell in the network is represented as K ═ K _i 1.. times.l., using a, i ═ 1 _i ＝{1,2,...,k _i Denotes the set of user numbers in the ith cell, the size of the set is denoted by k _i Determining that N is k ₁ +k ₂ +...+k _L Is the total number of all users in the current system. In the embodiment of the invention, the user set K is set to be 0,5]Random permutation of integers of intervals. Each user is located more than 35m from the base station. Assuming a communication bandwidth of 10MHz, the total receiver noise power σ ² Set to-94 dBm. The pilot multiplexing factor is 1. The uplink transmit power ρ of each user equipment is 20 dBm. The number of monte carlo simulations was set to 100.

Channel usage between user n and base station j in cell l

It is shown that in the present embodiment, it is assumed that it obeys the rayleigh fading distribution:

wherein the content of the first and second substances,

is a space correlation matrix obtained by the base station side, and the normalized trace of the matrix

The large-scale fading coefficient simulating geometric path loss and shadow fading represents the average channel gain between one antenna of the base station j and the user n of the cell l, and is modeled as:

where γ -148dB represents the median of the channel gain at a reference distance of 1km between the user and the base station, α -3.76 is the path loss coefficient,

is the distance between the user and the base station on a two-dimensional plane.

The channel estimation adopts a standard minimum mean square error channel estimation method, and the estimation of the channel between the base station j and the user n in the cell l can be:

wherein, the first and the second end of the pipe are connected with each other,

is a noise, and is a sound wave,

set P _n ＝{l′∈{1,...,L}:n≤k _l′ Denotes the index set of all cells containing the user number n, the error of channel estimation

Independently of

Let the base station of cell l transmit downlink signals

Wherein

Is a signal sent to user n in cell l, using a precoding vector

To control the spatial directionality of the transmission, the precoding vector satisfies | | w _l,n || ² ＝1，ρ _l,n Indicating the transmission power. The following precoding scheme is used in the embodiments of the present invention:

wherein the content of the first and second substances,

then, the achievable downlink spectrum efficiency of the user n in the cell l of the massive MIMO system is:

wherein

Representing the ratio of data for downlink transmission in each coherent block.

The spectral efficiency is re-expressed as:

wherein

a _l,n And b _i,k,l,n Respectively, mean channel gain and mean interference gain.

The objective function of the first step is designed as follows:

and step two, generating a data set. When generating a sample, the number k of users in each cell needs to be randomly set in the network area _l And location of individual users

Then calculating large scale fading coefficient

And channel correlation matrix

Obtaining an estimated channel vector using a minimum mean square error based channel estimation method

According to the aboveCalculating a precoding vector w according to the precoding scheme _l,n . Averaging the estimated channels by Monte Carlo to obtain { a _l,n And { b }and _i,k,l,n }. Power allocation strategy for maximizing objective function by traditional geometric programming solution

This method of solving the optimal solution of power allocation by geometric programming can be called max product SINR method.

Definition S ═ (S) ₁ ,s ₂ ,...,s _N ) Is an ordered sequence of numbers in which s _i The vector (G, D, a, I) is a one-dimensional vector for representing information about the user I in the system, and G ═ G ₁ ,g ₂ ,g ₃ ,g ₄ ) Denotes the channel gain between the user and L base stations, D ═ D ₁ ,d ₂ ) Indicating the location information of the user, a ═ a indicating the amount of signal transmitted by the user, and I ═ I indicating the amount of signal transmitted by the user ₁ ,i ₂ ,...,i _N-1 ) The amount of interference to that user for other users in the system. The information of the N users together form an ordered sequence S. It should be noted that the size of each sample is different since the total number of users at a time varies. But each s in the input features due to network limitations _i Must be fixed, s _i The size of G, D, A is fixed, only the size of I varies with N, so the size of I is fixed as the maximum number of users that can be served by the network, in this embodiment, since the user set K is set to [0,5 ]]The integers of the interval are randomly arranged, so that the maximum number of users in the network is 14, and the I of each sample is supplemented to 14 values by adopting a processing mode of filling 0 at the tail. Then s _i Is fixed at 20.

Taking S as an input feature, ρ ^* As an output tag. A random scatter of the user at a time generates one sample, ultimately generating a data set with a large number of samples. To better fit the network, a logarithmic conversion and normalization process is used on the samples. 200000 samples were generated.

And (3) building a network model in the third step, wherein the built seq2seq model is divided into two parts, namely an encoder part and a decoder part.

In order to avoid the limitation of long-term dependence of the traditional Recurrent Neural Network (RNN), the encoder part uses a long-term memory network (LSTM) network to fully mine the relevance of users at different positions in the communication network. The input dimension of the encoder is set to the dimension of the information of the single user described in step two, i.e. s _i Dimension of (c), 20. The number of neurons in each hidden layer is 128. The encoder reads the sequence sequentially in the order of the sequence S, thereby converting the indefinite length sequence into a 128-dimensional hidden state. h is _N Is an intermediate variable with fixed length which is converted by processing all input information.

The decoder part is another LSTM network and needs to guarantee that its hidden layer dimension and the dimension of the intermediate variables are the same. And then connecting a full connection layer, converting the output of each step into a value, and performing inverse normalization processing on the value to obtain the power value which is calculated by the seq2seq model and is distributed to the current user.

The model training process in the step four specifically comprises the following steps: firstly, initializing the parameters of the model, and setting the batch size n _batch 8, training iteration number n _epoch 40. Each time n is input _batch The characteristic S of each sample is processed by an encoder to obtain an intermediate variable h _N Inputting the input data into a decoder end, setting a guiding strength factor, selecting the output of the previous step as the current input variable with a certain probability in each step, or else, using the corresponding label value as the input variable, and then obtaining the output vector of the model

Calculating a loss function

The algorithm application process in the step five specifically comprises the following steps: and preprocessing the user position and the channel state information in the current network to generate a sequence with a fixed format, inputting the sequence into a trained neural network model to obtain an optimal power distribution sequence in the current scene, and performing inverse normalization processing on the optimal power distribution sequence to obtain an optimal power distribution result.

The invention evaluates the multi-cell large-scale MIMO communication intelligent power distribution method provided by the invention through simulation experiments. After a seq2seq network model is trained and frozen, power allocation is performed by respectively using an average allocation method, a max product SINR allocation method and the intelligent allocation method based on the seq2seq network, and fig. 3 is a cumulative probability distribution (CDF) curve of user average Spectral Efficiency (SE) obtained by using the three methods. It can be found that the average allocation method obviously has the worst performance, and the trained seq2seq power allocation model is very close to the max product SINR power allocation method.

The samples in the test set contain a variety of user numbers. Fig. 4 compares CDF curves of the system and the rate obtained by performing power allocation respectively by the method of the present patent and the max product SINR method when there are 10, 11, 12, 13, and 14 users in the system respectively. The power distribution model based on the seq2seq network can be well adapted to the condition of the change of the number of users, the calculation complexity is greatly reduced, and meanwhile, the system and the speed performance can be ensured to approach the traditional method under the scene of various numbers of users.

Those skilled in the art can adaptively change the modules in the embodiment and set them in an optimization method or apparatus different from the embodiment. Specifically, a plurality of modules in the embodiment may be combined into one module, or one module may be divided into a plurality of sub-modules, which is applied to a method or an apparatus of the same technical idea as the embodiment.

Claims

1. A multi-cell large-scale MIMO communication intelligent power distribution method is characterized by comprising the following steps:

the method comprises the following steps: establishing a multi-cell large-scale MIMO system model with dynamically changed user number, and designing a target function;

step two: randomly setting the number and the position of users in each cell, solving an optimal power distribution strategy by using a max product SINR method, collecting user position information, channel state information and corresponding optimal power distribution results, carrying out data preprocessing, taking the user position information and the channel state information in a network as input characteristics, taking the power distribution results as output labels, and collecting for multiple times to form a data set;

step three: building a seq2seq network model for processing variable-length sequences of input and output, wherein the seq2seq represents a sequence-to-sequence;

step four: training the data set generated in the step two by using the seq2seq network model built in the step three;

step five: inputting a sequence generated by a user position and channel state information in a current network into a trained seq2seq network model to obtain an optimal power distribution sequence in a current scene, and performing inverse normalization processing to obtain an optimal power distribution result;

the concrete process of constructing the seq2seq network model in the third step is as follows:

the constructed seq2seq model is divided into two parts: an encoder section and a decoder section;

the encoder part uses a long-term memory network to mine the correlation of users at different positions in the communication network; the encoder sequentially reads the sequences according to the sequence order, thereby converting the sequences with indefinite length into intermediate vectors with fixed dimensionality; the processing of the encoder is represented as:

wherein x _t Is an input variable at time t, W is a weight matrix of the neural network, b represents bias, σ (-) and g (-) represent sigmoid function and tanh function, respectively, f _t 、i _t 、C _t 、

And o _t All are intermediate variables in the process, h _t Assuming that the input of the sequence is completed at time t N, h is the hidden state obtained by the LSTM output variable at time t, i.e. the input information after passing through the encoder of the seq2seq network _N All input information is processed and converted into a context vector with fixed length; subscript f corresponds to forget gate, W _f Weight matrix representing forgetting gate, b _f Representing a forgetting gate bias parameter, i representing an input gate, c representing a state unit, and o representing an output gate; LSTM represents a long-and-short term memory network;

the decoder part is another LSTM network whose hidden layer dimension is the same as that of the context vector; the LSTM network is connected with a full connection layer, the output of each step is converted into a value, and the value is subjected to reverse normalization processing to obtain a power value which is calculated by a seq2seq model and is allocated to a current user;

the seq2seq network model training process in the fourth step specifically comprises the following steps:

firstly, initializing parameters of a seq2seq network model, and setting a batch size n _batch And training iteration number n _epoch (ii) a Each time n is input _batch The characteristic S of each sample is processed by an encoder to obtain an intermediate variable h _N Inputting the input data into a decoder end, setting a guiding strength factor, selecting the output of the previous step as the current input variable with a certain probability in each step, or else, using the corresponding label value as the input variable, and then obtaining the output vector of the model

Calculating a loss function

ρ ^* Updating model parameters by adopting an adaptive moment estimation optimization method for a power distribution strategy for maximizing the objective function; and freezing the network parameters after the training is finished.

2. The intelligent power allocation method for multi-cell massive MIMO communication according to claim 1, wherein the specific step of establishing the multi-cell massive MIMO system model with dynamically changing user number in step one comprises:

the large-scale MIMO communication network is assumed to have L cells, each cell is provided with a base station and M antennas; assuming that the number of users in each cell changes dynamically, the number of users served by the ith cell is k _i The number of users served in each cell in the network is represented as K ═ K _i I 1, a, L, with a _i ＝{1,2,...,k _i Denotes the set of user numbers in the ith cell, the size of the set is denoted by k _i Determining N ═ k ₁ +k ₂ +...+k _L The total number of all users in the current system;

channel usage between user n and base station j in cell l

To indicate that the base station of cell l transmits downlink signals

Wherein

Is a signal sent to user n in cell l, using a precoding vector

To control the spatial directivity of the transmission, the precoding vector is satisfied

ρ _l,n Represents the transmission power allocated by the base station to user n in cell l; the downlink spectrum efficiency of the user n in the cell l is as follows:

wherein

Representing the ratio, σ, of data used for downlink transmission in each coherent block ² Representing the noise power, the superscript dl of the SINR represents the downlink, E {. cndot. } represents the expectation, l | - | represents the absolute value operation,

denotes w _l,n The ranges of L and j are [1, L ]]，

Is a complex set of dimensions M, p _i,k Represents the transmission power allocated by the base station to user k in cell i,

representing the channel between user n and base station i in cell i.

3. The intelligent power allocation method for multi-cell massive MIMO communication according to claim 2, wherein for convenience of representation, the spectrum efficiency can be expressed as:

wherein

4. The intelligent power allocation method for multi-cell massive MIMO communication according to claim 2, wherein the objective function of step one is designed to:

wherein

Indicating the total transmission power that can be provided by a base station.

5. The intelligent power allocation method for multi-cell massive MIMO communication according to claim 1, wherein the specific steps of the second step comprise:

in each data acquisition process, the number k of users in each cell is randomly set in a network area _l And location x of each user _l,n Then calculating large scale fading coefficient

And channel correlation matrix

Computing a precoding vector w _l,n (ii) a Averaging the estimated channels by Monte Carlo to obtain { a } _l,n And { b }and _i,k,l,n }; power allocation strategy for maximizing objective function by traditional geometric programming solution

Definition S ═ (S) ₁ ,s ₂ ,...,s _N ) Is an ordered sequence of numbers in which s _i The vector (G, D, a, I) is a one-dimensional vector for representing information about the user I in the system, and G ═ G ₁ ,g ₂ ,...,g _L ) Denotes the channel gain between the user and L base stations, D ═ D ₁ ,d ₂ ) Indicates the location information of the user, a ═ a) indicates the amount of signals transmitted by the user, and I ═ I indicates the amount of signals transmitted by the user ₁ ,i ₂ ,...,i _N-1 ) The interference amount of other users in the system to the user; the information of N users jointly form an ordered sequence S; taking S as an input feature, ρ ^* As an output tag; forming a sample by primary data acquisition, and carrying out logarithmic transformation and standardization treatment on the sample to finally generate a data set;

subscript l denotes the l cell, subscript n denotes the n user, superscript j denotes the j base station, a _l,n And b _i,k,l,n Respectively representing the average channel gain and the average interference gain, s _i Wherein i has a range of [1, N ]]。

6. The intelligent power allocation method for multi-cell massive MIMO communication as claimed in claim 5, wherein the size of I is fixed to the maximum number of users that the network can serve, and the I of each sample is supplemented to a fixed length by a processing method of filling 0 at the end.