CN113055111A

CN113055111A - Channel modeling method and system based on Bayesian optimization

Info

Publication number: CN113055111A
Application number: CN202110240013.6A
Authority: CN
Inventors: 柴利; 乐程放; 唐慧; 杨君
Original assignee: Wuhan University of Science and Engineering WUSE
Current assignee: Wuhan University of Science and Engineering WUSE
Priority date: 2021-03-04
Filing date: 2021-03-04
Publication date: 2021-06-29
Anticipated expiration: 2041-03-04
Also published as: CN113055111B

Abstract

The invention discloses a channel modeling method and a system based on Bayesian optimization, belonging to the technical field of wireless communication, wherein the modeling method uses a neural network of a back propagation algorithm as a channel modeling model, but is different from other artificial experiments for searching the hyperparameters of the model.

Description

Channel modeling method and system based on Bayesian optimization

Technical Field

The invention belongs to the technical field of wireless communication, and particularly relates to a channel modeling method and system based on Bayesian optimization.

Background

Along with the popularization and the use of wireless communication and wired networks, terminals like mobile phones, automobile-mounted systems, smart homes and the like are deeply fused with the internet, the number of the terminals accessing the internet and the wireless data transmission flow are greatly increased, the 4 th generation mobile communication technology (4G) which is being used in a large scale cannot meet the rapid development of mobile communication, internet of vehicles and internet of things, and the 5 th generation mobile communication technology (5G) provides a solution for people. At present, the frequency band used by 5G commercial in China is the middle frequency band below 6GHz, but available and developed frequency band resources below 6GHz are almost used, and frequency spectrum resources below 6GHz cannot meet the requirements of modern data transmission, namely low time delay, high capacity and large connection. As such, the industry is beginning to research the work of higher frequency bands, wherein millimeter wave wireless communication is one of the core technologies to solve the problem, and is the trend of future 5G development.

The main scenes used by the 5G technology are very complex and mainly classified into four categories: continuous wide area coverage, high capacity of hot spots, low power consumption, large connection, low delay and high reliability, the former two use scenes are the main scenes of the current large-scale commercial 4 th generation mobile communication technology (4G), the latter two scenes are the application scenes newly developed by the 5 th generation mobile communication technology (5G), and the characteristic that the 4G technology cannot well support the internet data transmission is solved. Millimeter waves play an important role in these four main scenarios, and the performance thereof in information transmission is affected by the characteristics of the wireless channel. The 5G channel characteristics mainly have some characteristics such as time delay, path loss, arrival angle, etc., and the study of these channel characteristics in a 5G scenario is a necessary step in the present study of millimeter wave wireless communication. In 5G wireless communication technology, channel characteristics may exhibit different characteristics in a usage scenario, and in order to improve system performance in the 5G usage scenario, research on new characteristics of a channel is necessary. Millimeter wave wireless communication still has the characteristics of wavelength length, is favorable to the miniaturization of system transmitting terminal radio frequency hardware, and the wave beam of millimeter wave is more narrow simultaneously, and the directive property is stronger, and anti-interference ability is stronger than 4G technique in the information transmission process, effectively reduces wireless communication system's energy consumption.

In millimeter wave wireless communication, methods for channel simulation and modeling can be mainly classified into three categories: the first category is deterministic modeling methods. The method is characterized in that modeling is carried out under a specific propagation environment, and the method comprises two methods of ray tracing and impulse response; the second category is statistical modeling methods. The method is based on the statistical characteristics of environmental parameters for modeling, and comprises a geometric model based on scatterer distribution, a parameterized model and a modeling method based on spatial correlation. The third type is semi-deterministic modeling, which adopts the characteristics of the first two channel models and makes up the limitations of statistical models and the shortcomings of deterministic model complexity. At present, the propagation characteristic parameters of the wireless channel are learned by utilizing the powerful capability of machine learning, the corresponding relation between the channel parameters and the physical communication scene characteristics is established, the advantages of the traditional deterministic channel model and the random statistical channel model are considered, the complexity is low, the system can better conform to the actual environment, and most wireless channel models can be accurately calculated. The main modeling method mainly comprises a random geometric modeling method and a correlation matrix method.

Researchers have applied many machine learning algorithms to the field of wireless communications. Reference documents: LV L.A novel wireless channel model with multiplex feed-forward network [ C]The wireless communication network is modeled according to a feedforward neural network, a communication channel model based on a multilayer FNN is established, an improved BP algorithm and a modeling method of a simulation result are provided by utilizing the excellent learning characteristic of the neural network, existing problems and solving methods are analyzed, and meanwhile, the simulation result shows that the FNN model can well track the time-varying characteristic of a non-stationary wireless channel. Reference documents: BERHAD N J, IBNKAHLA M,

F.Statistical analysis of a two-layer backpropagation algorithm used for modeling nonlinear memoryless channels:The single neuron case[J].IEEE transactions on signal processing,1997,45(3) 747-756. Bershad N J et al trained the network on the BP neural network using zero mean Gaussian data and studied the transient and convergence properties of the network. At the same time, the authors have studied the influence of neural network structure, weights, initial conditions and learning speed step size on the network mean square error. For example, researchers have used radial basis function neural networks (RBF-NN) and Relevance Vector Machines (RVM) to predict path loss and estimate angle of arrival for wireless transmissions, respectively. The following neural network approach is also included: generating a channel impulse response by constructing a cluster kernel by using an Artificial Neural Network (ANN); directly extracting channel impulse response from the measurement data by using a LASSO algorithm; extracting channel characteristics by using a Convolutional Neural Network (CNN) to identify different wireless channels; 11 independent CNN networks were used to learn and predict 11 channel statistical characteristic parameters, respectively. The 11 independent CNN networks share the same set of input features, and the output labels are respectively 11 channel parameters (such as arrival angle mean, delay mean, arrival angle spread, delay spread, and the like), so that the same set of input features are used for learning and predicting a plurality of channel parameters at the same time, and the corresponding relation between the plurality of channel parameters and a communication scene is established.

Statistical modeling relies primarily on measurements of the channel, and derives various important statistical properties of the channel from a large number of actual measurements. Due to the large growth of mobile services and mobile users, microcellular and picocellular systems find application in a wide variety of propagation environments. In these propagation environments, the size of the cell radius is reduced by a lot, so that the statistical similarity between the propagation environments disappears, and the statistical model cannot be applied in this case.

The deterministic modeling needs detailed information of a propagation environment, the accuracy of the information affects the accuracy of channel modeling, and a ray tracing technology and a time domain finite difference method which are mainstream methods for deterministic modeling have respective defects, for example, in the ray tracing technology, rays which cannot reach a receiving point are abandoned at the beginning due to the technical characteristics of a mirror image method, so that the method is very difficult to generate scatterers in a complex environment, can be only applied in a simple geometric environment, and has a narrow application range; the ray transmitting method needs a receiving ball to receive rays, the radius of the receiving ball needs to be accurately set, and the calculated result is influenced when the radius is too large or too small. Although the time-domain finite difference method has high accuracy, the method needs detailed details of a propagation environment, has a large unknown quantity, is a big problem on how to solve, and has a complex solving algorithm.

Semi-deterministic modeling is a modeling method between statistical modeling and deterministic modeling, and at present, channel modeling by using a machine learning algorithm is a trend of semi-deterministic modeling. Neural networks that employ back-propagation algorithms have found many applications in channel modeling as an important algorithm for machine learning. However, in the channel modeling of the neural network, there are many hyper-parameters, such as learning rate, activation function, the number of neurons in a certain layer, batch size, etc., and these hyper-parameters need to artificially set the value of each hyper-parameter. Although the hyper-parameters set by the engineers with great experience can lead the model to obtain better results, the method for setting the hyper-parameters often cannot obtain satisfactory results in the use process of the ordinary people and has no theoretical explanation. Meanwhile, the hyper-parameters set by engineers with abundant experience are only limited to a certain specific scene, and if the set hyper-parameters are separated from the scene, the set hyper-parameters can not be used any more, so that the method has the advantage of strong adaptability, and the hyper-parameters which accord with the scene can only be set again according to the characteristics of the scene.

Disclosure of Invention

Aiming at the defects or improvement requirements in the prior art, the invention provides a channel modeling method and system based on Bayesian optimization, aiming at establishing a channel model based on a convolutional neural network, learning and predicting 6 channel parameters by using 6 input features, establishing a corresponding relation between the channel parameters and a communication scene, searching for optimal hyper-parameters through a Bayesian optimization algorithm, and finally obtaining an optimal model corresponding to the optimal hyper-parameters.

To achieve the above object, according to an aspect of the present invention, there is provided a channel modeling method based on bayesian optimization, including:

building a simulation environment in whichArranging a plurality of receiving antennas and a plurality of transmitting antennas, forming a group of receiving and transmitting antennas by any receiving antenna and any transmitting antenna, obtaining three-dimensional coordinates of the receiving antennas and the transmitting antennas in each group of receiving and transmitting antennas as input parameters, and simulating each group of receiving and transmitting antennas to obtain channel parameters corresponding to each group of receiving and transmitting antennas, wherein the input parameters form a characteristic data set X_dataA measurement data set Y consisting of channel parameters_data；

Feature data set X_dataRandom division into training sets X_trainCross validation set X_validationAnd test set X_testMeasuring data set Y_dataRandom division into training sets Y_trainCross validation set Y_validationAnd test set Y_test；

Constructing a convolutional neural network model consisting of an input layer, a convolutional layer, a full-link layer and an output layer, determining a cost function and an evaluation index of the convolutional neural network model, randomly setting an initial hyper-parameter value of the convolutional neural network model, and setting a hyper-parameter search space, an iteration number N and a proxy model of a Bayesian optimization algorithm as a Gaussian process;

feature data set X of training set_trainAnd metrology data set Y_trainInputting the data into a convolutional neural network model, training the convolutional neural network model by using a back propagation algorithm, and simultaneously, carrying out cross validation on a feature data set X of a set_validationAnd metrology data set Y_validationInputting the data into a convolutional neural network model, and calculating an evaluation index decision coefficient of a cross validation set and a corresponding hyper-parameter by using a Bayesian optimization algorithm;

performing circulation according to the iteration times N, determining the hyper-parameter of the next circulation according to an EI acquisition function after each circulation until the evaluation index decision coefficient and the corresponding hyper-parameter of the N-circulation cross validation sets are obtained, and selecting the corresponding hyper-parameter combination as the optimal hyper-parameter combination when the evaluation index decision coefficient of the cross validation set is minimum;

feature data set X of test set_testAnd metrology data set Y_testCorresponding to the optimum hyper-parametric combination obtained after input into the loopAnd in the optimal convolutional neural network model, calculating to obtain an evaluation index decision coefficient of the test set, and verifying the effect of the optimal convolutional neural network model.

In some alternative embodiments, the composition is prepared by

Determining a cost function J of a convolutional neural network model_mseFrom

Determining an evaluation index decision coefficient R of a convolutional neural network model²Wherein m represents the number of data groups, y_iThe measured actual value of the ith group of data is shown,

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

In some alternative embodiments, the composition is prepared by

Calculating evaluation index decision coefficient R of cross validation set² _val。

In some alternative embodiments, the composition is prepared by

By adjusting the hyper-parameters

So that is at

Finding optimal hyper-parameters in

Wherein,

the optimal hyper-parameter is represented by,

representing a hyper-parametric search space,

a set of hyper-parameters is represented,

representing the optimized objective function.

In some alternative embodiments, the composition is prepared by

Determining a hyper-parameter for a next cycle, wherein,

is the acquisition function, v^*Is that

The current value of the optimum function is,

is a function of the cumulative density of a standard normal distribution,

and

mean and standard deviation, respectively, D_1:tIs an observed data set.

According to another aspect of the present invention, there is provided a channel modeling system based on bayesian optimization, comprising:

the simulation module is used for building a simulation environment, a plurality of receiving antennas and a plurality of transmitting antennas are arranged in the simulation environment, any receiving antenna and any transmitting antenna form a group of receiving and transmitting antennas, and the receiving and transmitting antennas in each group are obtainedThree-dimensional coordinates of the receiving antenna and the transmitting antenna are used as input parameters, and simulation is carried out on each group of receiving and transmitting antennas to obtain channel parameters corresponding to each group of receiving and transmitting antennas, wherein the input parameters form a characteristic data set X_dataA measurement data set Y consisting of channel parameters_data；

A data set dividing module for dividing the characteristic data set X_dataRandom division into training sets X_trainCross validation set X_validationAnd test set X_testMeasuring data set Y_dataRandom division into training sets Y_trainCross validation set Y_validationAnd test set Y_test；

The initialization module is used for constructing a convolutional neural network model composed of an input layer, a convolutional layer, a full-link layer and an output layer, determining a cost function and an evaluation index of the convolutional neural network model, randomly setting an initial hyper-parameter value of the convolutional neural network model, and setting a hyper-parameter search space, iteration times N and a proxy model of a Bayesian optimization algorithm as a Gaussian process;

a training module for converting the feature data set X of the training set_trainAnd metrology data set Y_trainInputting the data into a convolutional neural network model, training the convolutional neural network model by using a back propagation algorithm, and simultaneously, carrying out cross validation on a feature data set X of a set_validationAnd metrology data set Y_validationInputting the data into a convolutional neural network model, and calculating an evaluation index decision coefficient of a cross validation set and a corresponding hyper-parameter by using a Bayesian optimization algorithm; performing circulation according to the iteration times N, determining the hyper-parameter of the next circulation according to an EI acquisition function after each circulation until the evaluation index decision coefficient and the corresponding hyper-parameter of the N-circulation cross validation sets are obtained, and selecting the corresponding hyper-parameter combination as the optimal hyper-parameter combination when the evaluation index decision coefficient of the cross validation set is minimum;

a verification module for testing the feature data set X of the test set_testAnd metrology data set Y_testInputting the data into an optimal convolutional neural network model corresponding to the optimal hyper-parameter combination obtained after circulation, and calculating to obtain an evaluation index decision system of the test setAnd verifying the effect of the optimal convolutional neural network model.

In some alternative embodiments, the composition is prepared by

Determining a cost function J of a convolutional neural network model_mseFrom

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

In some alternative embodiments, the composition is prepared by

In some alternative embodiments, the composition is prepared by

By adjusting the hyper-parameters

So that is at

Finding optimal hyper-parameters in

Wherein,

the optimal hyper-parameter is represented by,

representing a hyper-parametric search space,

a set of hyper-parameters is represented,

representing the optimized objective function.

In some alternative embodiments, the composition is prepared by

Determining a hyper-parameter for a next cycle, wherein,

is the acquisition function, v^*Is that

The current value of the optimum function is,

is a function of the cumulative density of a standard normal distribution,

and

mean and standard deviation, respectively, D_1:tIs an observed data set.

According to another aspect of the invention, a computer-readable storage medium is provided, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of any of the above.

In general, compared with the prior art, the above technical solution contemplated by the present invention can achieve the following beneficial effects:

1. the channel modeling method builds a required communication scene based on electromagnetic simulation software Wireless Insite. The Wireless suite (WI) is electromagnetic simulation software based on Ray Tracing, and can simulate electromagnetic propagation and Wireless communication in various complex environments such as cities, indoor areas and suburban areas to directly obtain channel characteristic parameters. Therefore, the channel modeling method does not depend on expensive channel measurement, and various important statistical characteristics of the channel do not need to be obtained from a large number of actual measurements. The statistical model can be applied to other propagation environments mainly by means of statistical similarity among the propagation environments, channel parameters of different propagation environments can be obtained by the modeling method without the statistical similarity among the propagation environments, and the channel modeling method is used in different propagation environments only by building corresponding communication scenes in the simulation module, so that the method has the advantages of universality and expandability.

2. Because the Wireless instruments can simulate electromagnetic propagation and Wireless communication in various environments such as cities, indoor areas and suburban areas, the modeling method can be applied to simple propagation environments and complex propagation environments, is not limited by propagation environment conditions, only needs to build a communication scene and set corresponding communication conditions, and software automatically calculates channel parameters according to a ray tracing method, so that the calculation complexity is low, and the calculation accuracy is high.

3. The modeling method uses a neural network of a back propagation algorithm as a model for channel modeling, but is different from other artificial experiments for searching the hyperparameters of the model, the Bayesian optimization algorithm is adopted for automatically searching the optimal hyperparameters, the defects that the time for searching the hyperparameters through the artificial experiments is long and the precision is low are overcome, meanwhile, the hyperparameters searched through the artificial experiments can only be suitable for a specific propagation environment, the hyperparameters are required to be searched through experiments again when the optimal hyperparameters are changed to another propagation environment, and the modeling method has the advantage of strong adaptability, can be applied to any propagation environment, and has the advantage of strong adaptability.

Drawings

Fig. 1 is a schematic flow chart of a channel modeling method based on bayesian optimization according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.

Fig. 1 is a schematic flowchart of a channel modeling method based on bayesian optimization according to an embodiment of the present invention, where the method shown in fig. 1 includes the following steps:

s1: building a simulation environment, arranging a plurality of receiving antennas and a plurality of transmitting antennas in the simulation environment, forming a group of receiving and transmitting antennas by any receiving antenna and any transmitting antenna, obtaining three-dimensional coordinates of the receiving antennas and the transmitting antennas in each group of receiving and transmitting antennas as input parameters, and simulating each group of receiving and transmitting antennas to obtain channel parameters corresponding to each group of receiving and transmitting antennas;

in the embodiment of the invention, Wireless InSite software can be used for building a simulation environment, and the frequency can be 100GHz for mainly simulating an outdoor scene according to the requirements of the invention. Through setting the Wireless inite software, for example, setting 300 transmit-receive antennas, 6 input parameters in total of three-dimensional coordinates of the transmit-receive antennas can be obtained, and 6 channel parameters can be output through simulation: the average horizontal arrival angle, the average vertical arrival angle, the average horizontal departure angle, the average vertical departure angle, the path loss, and the received power, and finally 90000 sets of data can be obtained, and the size of the data set is 90000 × 12.

S2: composing feature data sets X from input parameters_dataA measurement data set Y consisting of channel parameters_dataThe feature data set X_dataRandom division into training sets X_trainCross validation set X_validationAnd test set X_testMeasuring data set Y_dataRandom division into training sets Y_trainCross validation set Y_validationAnd test set Y_test；

In the embodiment of the invention, 90000 groups of data containing 6 input parameters of three-dimensional coordinates of the transmitting and receiving antenna are called a characteristic data set X_dataThe 90000 sets of data comprising 6 channel parameters are referred to as a measurement data set Y_dataRandomly dividing 90000 groups of data into a training set, a cross validation set and a test set, wherein the percentage of the training set, the cross validation set and the test set in 90000 groups of data can be 60%, 20% and 20%, and X is_train、X_validationAnd X_testSequentially representing the feature data set in the training set, the feature data set in the cross validation set, and the feature data set in the test set, Y_train、Y_validationAnd Y_testThe measurement data sets in the training set, the cross validation set and the test set are sequentially represented.

S3: constructing a convolutional neural network model consisting of an input layer, a convolutional layer, a full-link layer and an output layer, determining a cost function and an evaluation index of the convolutional neural network model, randomly setting an initial hyper-parameter value of the convolutional neural network model, and setting a hyper-parameter search space, an iteration number N and a proxy model of a Bayesian optimization algorithm as a Gaussian process;

in the embodiment of the invention, the network structure of the convolutional neural network model is composed of an input layer, a convolutional layer, a fully-connected layer and an output layer, wherein the convolutional layer can have 3 layers, and the fully-connected layer can have 3 layers, so that the basic structure of the convolutional neural network model is determined, and the convolutional neural network model contains 5 hyper-parameters: there are learning rate, type of activation function, batch size, number of convolutional layer convolutional kernels per layer, and number of fully-connected layer neurons per layer.

In the embodiment of the invention, the cost function J of the convolutional neural network model_mse：

In formula (1): m represents the number of data groups;

y_ithe measured actual value of the ith group of data is represented;

the measured predicted value of the ith data is shown.

Evaluation index determination coefficient R²：

In formula (2): m represents the number of data groups;

y_ithe measured actual value of the ith group of data is represented;

the measured predicted value of the ith data is shown;

the mean value of the measured actual values is shown.

In the embodiment of the invention, the hyperparametric search space of the Bayesian optimization algorithm is set

The iteration number N and the proxy model are Gaussian processes, and in a Bayesian optimization algorithm, the hyper-parameters are used

So that is at

Finding optimal hyper-parameters in

Namely:

in formula (3):

representing an optimal hyper-parameter;

representing a hyper-parametric search space;

representing a set of hyper-parameters;

representing the optimized objective function.

S4: feature data set X of training set_trainAnd metrology data set Y_trainInputting into a convolutional neural network, and obtaining a feature data set X_trainAs input to the convolutional neural network, and a data set Y is measured_trainAs the output of the network, training a convolutional neural network by using a back propagation algorithm; feature data set X of cross validation set_validationAs input to the network, a metrology data set Y_validationThe output of the network is input into a convolutional neural network, and an evaluation index determination coefficient R of a cross validation set is calculated² _valNamely:

in formula (4): m represents the number of data sets, y_iThe measured actual value of the ith group of data is shown,

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

S5: manually randomly setting a set of hyper-parameters

Constructing an initial convolution network structure, circulating for N times according to the iteration times N, repeating the step S4 in each circulation, and determining the hyper-parameter of the next circulation according to the EI acquisition function, namely:

in formula (5):

is the acquisition function, v^*Is that

The current value of the optimum function is,

is a function of the cumulative density of a standard normal distribution,

and

mean and standard deviation, respectively, D_1:tIs the observed data set and t represents time.

And obtaining evaluation index decision coefficients and corresponding hyper-parameters of the N-time circulation cross validation sets, and selecting a hyper-parameter combination with the minimum evaluation index decision coefficient of the cross validation sets as an optimal hyper-parameter combination.

S6: feature data set X of test set_testAnd metrology data set Y_testInput into the optimized hyper-parameter obtained after the loop according to step S5

In the corresponding optimal model, calculating to obtain the evaluation index decision coefficient of the test setR² _valAnd verifying the effect of the optimal model.

The present invention is further illustrated by the following detailed description, without limiting the scope of the invention.

(1) Firstly, Wireless InSite software is used for building a simulation environment, the requirements of the invention are met, an outdoor scene is mainly simulated, and the frequency is 100 GHz. Through setting the Wireless InSite software, 300 transceiving antennas are set, 6 input parameters of the three-dimensional coordinates of the transceiving antennas can be obtained, and 6 channel parameters can be output through simulation: the average horizontal arrival angle, the average vertical arrival angle, the average horizontal departure angle, the average vertical departure angle, the path loss, and the received power, and finally 90000 groups of data (see table 1 for details) can be obtained, and the size of the data set is 90000 × 12.

190000 groups of data in Table

(2) The 90000 groups of data containing the three-dimensional coordinates of the transmitting and receiving antennas are referred to as a feature data set X_dataAs shown in Table 2, 90000 sets of data containing 6 channel parameters are referred to as a measurement data set Y_dataAs shown in Table 3, 90000 groups of data are randomly divided into a training set, a cross validation set and a test set, wherein the training set, the cross validation set and the test set account for 60%, 20% and 20% of the 90000 groups of data in sequence, and X is_train、X_validationAnd X_testSequentially representing the feature data set in the training set, the feature data set in the cross validation set, and the feature data set in the test set, Y_train、Y_validationAnd Y_testThe measurement data sets in the training set, the cross validation set and the test set are sequentially represented.

Table 2 characteristic data set X_data

TABLE 3 metrology data set Y_data

X in Table 4_trainX in Table 5_validationAnd X in Table 6_testSequentially represent the feature data sets in the training set, the cross-validation set, and the test set, e.g., Y in Table 7_trainY in Table 8_validationAnd Y in Table 9_testThe measurement data sets in the training set, the cross validation set and the test set are sequentially represented.

TABLE 4 feature data set X in training set_train

Table 5 feature data set X in cross-validation set_validation

TABLE 6 feature data set X in test set_test

TABLE 7 measurement data set Y in training set_train

TABLE 8Metrology data set Y in cross-validation set_validation

TABLE 9 measurement data set Y in test set_test

(3) The network structure of the convolutional neural network model is composed of an input layer, a convolutional layer, a fully-connected layer and an output layer, wherein the convolutional layer has 3 layers, the fully-connected layer has 3 layers, so that the basic structure of the convolutional neural network is determined, and the convolutional neural network model contains 5 hyper-parameters: there are learning rate, type of activation function, batch size, number of convolutional layer convolutional kernels per layer, and number of fully-connected layer neurons per layer.

(4) Determining cost function and evaluation index of model, and cost function J of convolutional neural network_mse：

Wherein m represents the number of data sets, y_iThe measured actual value of the ith group of data is shown,

the measured predicted value of the ith data is shown.

Evaluation index determination coefficient R²：

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

(5) Hyperparametric search space for setting Bayesian optimization algorithm

As shown in Table 10, the number of iterations 50 and the proxy model are Gaussian processes, and are optimized by the hyperparameters in the Bayesian optimization algorithm

So that is at

Finding optimal hyper-parameters in

Namely:

wherein,

the optimal hyper-parameter is represented by,

representing a hyper-parametric search space,

a set of hyper-parameters is represented,

representing the optimized objective function.

TABLE 10 hyper-parametric search space

Hyper-parameter	Learning rate	Activating a function	Batch size	Number of convolution kernels per convolution layer	Number of convolution kernels per fully-connected layer	Power
							Range	55.15	91.79	235.15	90.55	0.00	115.06

(6) Feature data set X of training set_trainAnd metrology data set Y_trainInputting the data into a convolutional neural network, and training the convolutional neural network by using a back propagation algorithm; feature data set X of cross validation set_validationAnd metrology data set Y_validationInputting the result into a convolutional neural network, and calculating an evaluation index decision coefficient R of a cross validation set² _valNamely:

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

(7) Manually randomly setting a set of hyper-parameters

And (3) building an initial convolutional network structure, circulating for 50 times according to the iteration times, repeating the step (6) in each circulation, and determining the hyper-parameter of the next circulation according to the EI acquisition function, namely:

determining a hyper-parameter for a next cycle, wherein,

is the acquisition function, v^*Is that

The current value of the optimum function is,

is a function of the cumulative density of a standard normal distribution,

and

mean and standard deviation, respectively, D_1:tIs an observed data set. The decision coefficients and corresponding hyperparameters of the 50-time cycle cross validation sets are obtained, and the hyperparameter combination with the minimum decision coefficient of the cross validation set is selected as the optimal hyperparameter combination, as shown in table 11.

(8) Feature data set X of test set_testAnd metrology data set Y_testInputting the optimal hyperparameter obtained after circulation according to the step (7)

As shown in table 11, in the corresponding optimal model, the evaluation index determination coefficient R of the test set is calculated² _testAs shown in table 11, the effect of the optimal model was verified.

TABLE 11 optimal hyperparameters

Hyper-parameter	Learning rate	Activating a function	Batch size	Number of convolution kernels per convolution layer	Number of convolution kernels per fully-connected layer
							0.006890958660007832	relu	1000	200,52,200	200,111,97

It should be noted that, according to the implementation requirement, each step/component described in the present application can be divided into more steps/components, and two or more steps/components or partial operations of the steps/components can be combined into new steps/components to achieve the purpose of the present invention.

It will be understood by those skilled in the art that the foregoing is only a preferred embodiment of the present invention, and is not intended to limit the invention, and that any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims

1. A channel modeling method based on Bayesian optimization is characterized by comprising the following steps:

building a simulation environment, arranging a plurality of receiving antennas and a plurality of transmitting antennas in the simulation environment, forming a group of receiving and transmitting antennas by any receiving antenna and any transmitting antenna, obtaining three-dimensional coordinates of the receiving antennas and the transmitting antennas in each group of receiving and transmitting antennas as input parameters, and simulating each group of receiving and transmitting antennas to obtain channel parameters corresponding to each group of receiving and transmitting antennas, wherein the input parameters form a characteristic data set X_dataA measurement data set Y consisting of channel parameters_data；

feature data set X of training set_trainAnd metrology data set Y_trainInputting the data into a convolutional neural network model, and training convolution by using a back propagation algorithmNeural network model, simultaneously cross-validating the feature data set X of the set_validationAnd metrology data set Y_validationInputting the data into a convolutional neural network model, and calculating an evaluation index decision coefficient of a cross validation set and a corresponding hyper-parameter by using a Bayesian optimization algorithm;

feature data set X of test set_testAnd metrology data set Y_testAnd inputting the test result into an optimal convolutional neural network model corresponding to the optimal hyper-parameter combination obtained after circulation, calculating to obtain an evaluation index decision coefficient of the test set, and verifying the effect of the optimal convolutional neural network model.

2. The method of claim 1, wherein the method is performed by

Determining a cost function J of a convolutional neural network model_mseFrom

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

3. The method of claim 2Is characterized by that

4. The method of claim 3, wherein the method is performed by

By adjusting the hyper-parameters

So that is at

Finding optimal hyper-parameters in

Wherein,

the optimal hyper-parameter is represented by,

representing a hyper-parametric search space,

a set of hyper-parameters is represented,

representing the optimized objective function.

5. The method of claim 4, wherein the method is performed by

Determining a hyper-parameter for a next cycle, wherein,

is the acquisition function, v^*Is that

The current value of the optimum function is,

is a function of the cumulative density of a standard normal distribution,

and

mean and standard deviation, respectively, D_1:tIs an observed data set.

6. A channel modeling system based on Bayesian optimization, comprising:

the simulation module is used for building a simulation environment, a plurality of receiving antennas and a plurality of transmitting antennas are arranged in the simulation environment, any receiving antenna and any transmitting antenna form a group of receiving and transmitting antennas, three-dimensional coordinates of the receiving antennas and the transmitting antennas in each group of receiving and transmitting antennas are obtained and used as input parameters, simulation is carried out on each group of receiving and transmitting antennas to obtain channel parameters corresponding to each group of receiving and transmitting antennas, and the input parameters form a characteristic data set X_dataA measurement data set Y consisting of channel parameters_data；

a verification module for testing the feature data set X of the test set_testAnd metrology data set Y_testAnd inputting the test result into an optimal convolutional neural network model corresponding to the optimal hyper-parameter combination obtained after circulation, calculating to obtain an evaluation index decision coefficient of the test set, and verifying the effect of the optimal convolutional neural network model.

7. The system of claim 6, wherein the system is comprised of

Determining a cost function J of a convolutional neural network model_mseFrom

Determining convolution spiritDetermination of coefficient R by evaluation index of network model²Wherein m represents the number of data groups, y_iThe measured actual value of the ith group of data is shown,

indicating the measured predicted value of the i-th group of data,

the mean value of the measured actual values is shown.

8. The system of claim 7, wherein the system is comprised of

And calculating an evaluation index decision coefficient of the cross validation set. By

By adjusting the hyper-parameters

So that is at

Finding optimal hyper-parameters in

Wherein,

the optimal hyper-parameter is represented by,

representing a hyper-parametric search space,

a set of hyper-parameters is represented,

representing the optimized objective function.

9. The system of claim 8, wherein the system is comprised of

Determining a hyper-parameter for a next cycle, wherein,

is the acquisition function, v^*Is that

The current value of the optimum function is,

is a function of the cumulative density of a standard normal distribution,

and

mean and standard deviation, respectively, D_1:tIs an observed data set.

10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 5.