CN113709779B

CN113709779B - Cellular network fault diagnosis method

Info

Publication number: CN113709779B
Application number: CN202110862707.3A
Authority: CN
Inventors: 朱晓荣; 吴铭骁; 何明坤; 肖芳
Original assignee: Nanjing University of Posts and Telecommunications
Current assignee: Nanjing University of Posts and Telecommunications
Priority date: 2021-07-29
Filing date: 2021-07-29
Publication date: 2023-06-02
Anticipated expiration: 2041-07-29
Also published as: CN113709779A

Abstract

The invention discloses a cellular network fault diagnosis method, which comprises the following steps: Step 1, determining the network fault data set; Step 2, obtaining the network fault data set after dimension reduction; The fault data set is expressed in the form of a feature matrix; the label information of the network fault data set after dimensionality reduction in step 2 is expressed in the form of a label matrix; the introduced weight matrix is converted into an adjacency matrix with only 0 and 1 matrix elements; Step 4. Fault diagnosis based on graph convolutional neural network. This new cellular network fault diagnosis method deeply studies the intelligent fault diagnosis of heterogeneous wireless networks, combines the big data processing method to analyze the similarity between samples, converts the existing network fault parameter data set into graph structure data, and utilizes graph convolution The neural network extracts features from the graph structure data to complete the classification task for the sample nodes and predict the fault type of the cell.

Description

A method for diagnosing cellular network faults

技术领域Technical Field

本发明涉及通信技术领域，特别是一种蜂窝网络故障诊断方法。The present invention relates to the field of communication technology, and in particular to a cellular network fault diagnosis method.

背景技术Background Art

近年来，面对移动数据流量的激增和各种服务的不同需求，异构蜂窝网络逐渐成为提升系统容量的重要方法之一。但随着蜂窝网络规模的扩大和复杂性的增加，对于蜂窝网络的操作和维护任务也变得复杂繁琐。虽然端到端用户在吞吐量和延迟方面的体验得到了显著改善，但是蜂窝网络也变得更容易出现故障。当故障发生或将要发生时，如何预测和定位它们已经成为了一个巨大的挑战。In recent years, in the face of the surge in mobile data traffic and the different demands of various services, heterogeneous cellular networks have gradually become one of the important ways to improve system capacity. However, with the expansion of the scale and complexity of cellular networks, the operation and maintenance tasks of cellular networks have also become complicated and cumbersome. Although the end-to-end user experience in terms of throughput and latency has been significantly improved, cellular networks have also become more prone to failures. When failures occur or are about to occur, how to predict and locate them has become a huge challenge.

以往传统的网络故障检测和诊断主要是通过人工手动操作完成的，难以准确获得网络症状与故障类别之间的映射关系，同时也会耗费大量的人力物力。In the past, traditional network fault detection and diagnosis were mainly completed through manual operations, which made it difficult to accurately obtain the mapping relationship between network symptoms and fault categories, and also consumed a lot of manpower and material resources.

随着人工智能的快速发展，目前最常用的智能故障诊断方法大多都是基于机器学习的，智能故障诊断已经从传统的基于模型的故障诊断方法转变为数据驱动的故障诊断方法。基于机器学习的网络故障诊断技术通过挖掘大量训练数据中的信息来学习网络事件之间的映射关系，然后根据这些映射关系建立故障诊断模型，最后将训练好的模型应用于新观测到的网络症状上，以实现对网络状态的预测。然而，现有的故障诊断解决方案大多使用机器学习中的有监督学习方法，这些故障诊断方法大多依赖于海量数据集，并且要求数据集中的样本具有足够的标记信息。但是实际情况中，获取标记信息十分困难，标记样本往往非常耗时，进行人工类别标记的成本太高，而且历史数据集过少，获取成本也很昂贵。这些故障诊断方法大多是基于有监督学习的微调和分类，大量未标记样本信息不能得到充分利用，未标记样本数据就被浪费了。With the rapid development of artificial intelligence, most of the most commonly used intelligent fault diagnosis methods are based on machine learning. Intelligent fault diagnosis has changed from traditional model-based fault diagnosis methods to data-driven fault diagnosis methods. Network fault diagnosis technology based on machine learning learns the mapping relationship between network events by mining information from a large amount of training data, then establishes a fault diagnosis model based on these mapping relationships, and finally applies the trained model to the newly observed network symptoms to achieve the prediction of network status. However, most of the existing fault diagnosis solutions use supervised learning methods in machine learning. Most of these fault diagnosis methods rely on massive data sets and require that the samples in the data sets have sufficient labeled information. However, in actual situations, it is very difficult to obtain labeled information, labeling samples is often very time-consuming, the cost of manual category labeling is too high, and the historical data sets are too few and the acquisition cost is also expensive. Most of these fault diagnosis methods are based on fine-tuning and classification of supervised learning, and a large amount of unlabeled sample information cannot be fully utilized, and the unlabeled sample data is wasted.

综上，针对4G/5G异构无线网故障诊断过程中，如何在有效标记信息非常少的情况下进行准确网络故障诊断的问题，是本领域技术人员亟待解决的技术问题。In summary, in the process of 4G/5G heterogeneous wireless network fault diagnosis, how to perform accurate network fault diagnosis when there is very little effective marking information is a technical problem that needs to be urgently solved by technical personnel in this field.

发明内容Summary of the invention

本发明所要解决的技术问题是克服现有技术的不足而提供一种蜂窝网络故障诊断方法，本发明结合大数据处理方法分析了样本间的相似特性，深入研究了异构无线网络的智能故障诊断算法，将已有的网络故障参数数据集转换成图结构数据，利用图卷积神经网络从图数据中提取特征，完成对于样本节点的分类任务，预测出小区的故障类型。The technical problem to be solved by the present invention is to overcome the shortcomings of the prior art and provide a cellular network fault diagnosis method. The present invention combines the big data processing method to analyze the similar characteristics between samples, deeply studies the intelligent fault diagnosis algorithm of heterogeneous wireless networks, converts the existing network fault parameter data set into graph structure data, and uses the graph convolutional neural network to extract features from the graph data, completes the classification task of sample nodes, and predicts the fault type of the cell.

本发明为解决上述技术问题采用以下技术方案：The present invention adopts the following technical solutions to solve the above technical problems:

根据本发明提出的一种蜂窝网络故障诊断方法，包括以下步骤：A cellular network fault diagnosis method proposed in the present invention comprises the following steps:

步骤1、确定网络故障数据集；具体如下：Step 1: Determine the network fault data set; the details are as follows:

步骤101、网络故障数据集中的数据包括原始特征参数和标签类别，定义了非网络故障情况以及5种网络故障作为标签类别，其中，5种网络故障分别为上行干扰、下行干扰、覆盖空洞、空口故障和基站故障；Step 101: The data in the network fault data set includes original feature parameters and label categories, and defines non-network fault conditions and five types of network faults as label categories, wherein the five types of network faults are uplink interference, downlink interference, coverage holes, air interface failures, and base station failures;

步骤102、将识别故障小区时所用到的所有关键性能指标作为原始特征参数，原始特征参数包括16个特征参数，这16个特征参数包括参考信号接收功率、参考信号接收质量、上行链路分组丢失率、下行链路分组丢失率、上行链路信噪比、下行链路信噪比、无线资源控制连接建立成功率、演进无线接入承载建立成功率、掉话率、切换成功率、上行平均吞吐量、下行平均吞吐量、节点传出平均吞吐量、节点传入平均吞吐量和切换时延和链路误码率；Step 102: all key performance indicators used in identifying a faulty cell are used as original characteristic parameters, and the original characteristic parameters include 16 characteristic parameters, including reference signal received power, reference signal received quality, uplink packet loss rate, downlink packet loss rate, uplink signal-to-noise ratio, downlink signal-to-noise ratio, radio resource control connection establishment success rate, evolved radio access bearer establishment success rate, call drop rate, handover success rate, uplink average throughput, downlink average throughput, node outgoing average throughput, node incoming average throughput, handover delay, and link bit error rate;

步骤2、采用XGBoost算法，从原始特征参数中选取使XGBoost算法的诊断准确率最高的前n个特征参数作为网络故障数据集中数据的特征参数，从而得到降维后的网络故障数据集，0＜n＜16；Step 2: Using the XGBoost algorithm, select the first n feature parameters that make the XGBoost algorithm have the highest diagnostic accuracy from the original feature parameters as the feature parameters of the data in the network fault data set, thereby obtaining a network fault data set after dimensionality reduction, 0＜n＜16;

步骤3、图数据转换，具体如下：Step 3: Graph data conversion, as follows:

步骤301、将步骤2降维后的网络故障数据集用特征矩阵的形式来表示，特征矩阵中的每个行向量分别对应于降维后的网络故障数据集中的一条除类别信息外的特征参数向量；Step 301: The network fault data set after the dimension reduction in step 2 is represented in the form of a feature matrix, where each row vector in the feature matrix corresponds to a feature parameter vector other than the category information in the network fault data set after the dimension reduction;

步骤302、将步骤2降维后的网络故障数据集的标签信息用标签矩阵的形式表示；在标签矩阵中，标记数据的标签行向量是独热编码的形式，而未标记数据的标签行向量则为零向量；标记数据是指带标签的数据，未标记数据是不带标签的数据；Step 302: The label information of the network fault data set after the dimension reduction in step 2 is represented in the form of a label matrix; in the label matrix, the label row vector of the labeled data is in the form of one-hot encoding, and the label row vector of the unlabeled data is a zero vector; the labeled data refers to the data with a label, and the unlabeled data is the data without a label;

步骤303、将降维后的网络故障数据集映射成一个无向图G＝(V,E)，无向图由两种类型的元素组成，即节点集V和边集E；引入一个权重矩阵来表示无向图中节点之间的相似度，权重矩阵中的元素是各节点对间的相似度，而各节点对之间的相似度是通过两两节点计算欧氏距离并进行归一化得到的；再通过设置的参数阈值，将权重矩阵中的全部元素与该参数阈值进行比较，若当前元素大于该参数阈值，则将该元素置为1，否则置为0；从而将权重矩阵转换为矩阵元素只有0和1的邻接矩阵，邻接矩阵表示了节点间的相邻关系；Step 303: Map the network fault data set after dimensionality reduction into an undirected graph G = (V, E), where the undirected graph consists of two types of elements, namely, a node set V and an edge set E; introduce a weight matrix to represent the similarity between nodes in the undirected graph, where the elements in the weight matrix are the similarities between each node pair, and the similarity between each node pair is obtained by calculating the Euclidean distance between two nodes and normalizing them; then compare all elements in the weight matrix with the parameter threshold through a set parameter threshold, and if the current element is greater than the parameter threshold, set the element to 1, otherwise set it to 0; thereby converting the weight matrix into an adjacency matrix whose matrix elements are only 0 and 1, and the adjacency matrix represents the adjacent relationship between nodes;

步骤4、基于图卷积神经网络的故障诊断，具体如下：Step 4: Fault diagnosis based on graph convolutional neural network, as follows:

步骤401、调整图卷积神经网络的参数并选择层次结构，构建图卷积神经网络模型；Step 401: Adjust the parameters of the graph convolutional neural network and select a hierarchical structure to build a graph convolutional neural network model;

步骤402、步骤301的特征矩阵和步骤302的标签矩阵是图卷积神经网络的输入，用来训练图卷积神经网络模型中的参数，并根据训练后的图卷积神经网络模型、步骤303的邻接矩阵、采用层与层之间的传播公式获得各节点的高阶聚合特征属性；将各节点的高阶聚合特征属性输入图卷积神经网络模型中的Softmax层，得到最终的故障分类诊断结果。The feature matrix of step 402, step 301 and the label matrix of step 302 are the input of the graph convolutional neural network, which are used to train the parameters in the graph convolutional neural network model, and according to the trained graph convolutional neural network model, the adjacency matrix of step 303, and the propagation formula between layers, the high-order aggregate feature attributes of each node are obtained; the high-order aggregate feature attributes of each node are input into the Softmax layer in the graph convolutional neural network model to obtain the final fault classification diagnosis result.

作为本发明所述的一种蜂窝网络故障诊断方法进一步优化方案，步骤2具体如下：As a further optimization scheme of the cellular network fault diagnosis method according to the present invention, step 2 is specifically as follows:

步骤201、使用XGBoost算法，获得原始特征参数中的每个特征参数的重要性评分；Step 201: Use the XGBoost algorithm to obtain the importance score of each feature parameter in the original feature parameters;

步骤202、根据特征参数的重要性评分的大小，对特征参数进行降序排序；Step 202: sort the feature parameters in descending order according to the importance scores of the feature parameters;

步骤203、根据步骤202中排好序的原始特征参数，从中选取使XGBoost算法的诊断准确率最高的前n个特征参数作为网络故障数据集中数据的特征参数，从而得到降维后的网络故障数据集，0＜n＜16；Step 203: According to the original feature parameters sorted in step 202, the first n feature parameters that make the diagnosis accuracy of the XGBoost algorithm the highest are selected as the feature parameters of the data in the network fault data set, thereby obtaining a network fault data set after dimensionality reduction, 0＜n＜16;

作为本发明所述的一种蜂窝网络故障诊断方法进一步优化方案，图卷积神经网络模型中的参数包括图卷积层层数、dropout层概率大小、以及滤波器矩阵大小。As a further optimization scheme of a cellular network fault diagnosis method described in the present invention, the parameters in the graph convolutional neural network model include the number of graph convolutional layers, the probability size of the dropout layer, and the size of the filter matrix.

本发明采用以上技术方案与现有技术相比，具有以下技术效果：Compared with the prior art, the present invention adopts the above technical solution and has the following technical effects:

(1)本发明是基于半监督的故障诊断算法，减少了用于训练的标记数据的数量；(1) The present invention is based on a semi-supervised fault diagnosis algorithm, which reduces the amount of labeled data used for training;

(2)本发明使用XGBoost算法来进行最优特征参数组合的求解，有效避免了模型训练速度降低，以及产生维数灾难等问题；(2) The present invention uses the XGBoost algorithm to solve the optimal feature parameter combination, which effectively avoids the problems of reducing the model training speed and generating dimensionality disaster;

(3)充分利用了网络关键性能指标参数与故障类型之间复杂的非线性关系，并很好地利用样本与样本之间的相似性联系，性能得到进一步提高；(3) The complex nonlinear relationship between network key performance indicator parameters and fault types is fully utilized, and the similarity between samples is well utilized to further improve the performance;

(4)合理构建图卷积神经网络层次并合理设置卷积神经网络参数，提高模型故障诊断的有效性和可靠性。(4) Rationally construct the graph convolutional neural network hierarchy and reasonably set the convolutional neural network parameters to improve the effectiveness and reliability of model fault diagnosis.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1是本发明一种4G/5G异构无线网络场景图。FIG1 is a 4G/5G heterogeneous wireless network scenario diagram of the present invention.

图2是本发明中数据的结构图。FIG. 2 is a structural diagram of data in the present invention.

图3是本发明中图卷积神经网络的层次结构图。FIG3 is a hierarchical structure diagram of the graph convolutional neural network in the present invention.

具体实施方式DETAILED DESCRIPTION

下面结合附图对本发明的技术方案做进一步的详细说明：The technical solution of the present invention is further described in detail below in conjunction with the accompanying drawings:

本发明提出了一种新的基于图卷积神经网络的智能故障诊断算法模型，该模型充分利用了标记数据的标记信息和无标记数据的特征参数信息，在仅使用少量标记过的标记数据用于训练的情况下，就能够达到很好的诊断准确率。模型能快速检测出网络故障，而且能进一步识别出可能的网络故障类型，从而加快故障小区的恢复速度。This paper proposes a new intelligent fault diagnosis algorithm model based on graph convolutional neural network, which makes full use of the label information of labeled data and the characteristic parameter information of unlabeled data, and can achieve good diagnostic accuracy when only a small amount of labeled labeled data is used for training. The model can quickly detect network faults and further identify possible network fault types, thereby accelerating the recovery speed of faulty cells.

一种4G/5G异构无线网络场景图如图1所示，内容主要包括最优特征组合选择。本发明考虑宏小区，微小区和毫微微小区交叉重叠覆盖的异构无线网络场景，在这种场景下，由于网络的多样性，系统更加复杂，网络管理也更加困难。本发明考虑此场景下的网络故障诊断与预测，首先需要分析衡量网络性能的关键性能指标以及常见的网络故障，这部分是构建网络故障诊断模型的前期工作。A 4G/5G heterogeneous wireless network scenario diagram is shown in Figure 1, and the content mainly includes the selection of the optimal feature combination. The present invention considers a heterogeneous wireless network scenario with overlapping coverage of macro cells, micro cells and femto cells. In this scenario, due to the diversity of the network, the system is more complex and network management is more difficult. The present invention considers the network fault diagnosis and prediction in this scenario. First, it is necessary to analyze the key performance indicators that measure network performance and common network faults. This part is the preliminary work for building a network fault diagnosis model.

1)特征选取，包括以下步骤：1) Feature selection, including the following steps:

步骤101，根据路测得来的真实数据集并结合实际情况，选取了非网络故障情况以及另外5种常见的网络故障作为网络故障数据集的标签类别，其中5种网络故障分别是上行干扰、下行干扰、覆盖空洞、空口故障和基站故障；Step 101, based on the real data set obtained from the route test and combined with the actual situation, non-network fault conditions and five other common network faults are selected as label categories of the network fault data set, where the five network faults are uplink interference, downlink interference, coverage holes, air interface failures, and base station failures;

步骤102，将识别故障小区时所用到的关键性能指标参数作为样本数据的原始特征参数，包括参考信号接收功率、参考信号接收质量、上行链路分组丢失率、下行链路分组丢失率、上行链路信噪比、下行链路信噪比、无线资源控制连接建立成功率、演进无线接入承载建立成功率、掉话率、切换成功率、上行平均吞吐量、下行平均吞吐量、节点传出平均吞吐量、节点传入平均吞吐量、切换时延、链路误码率共16种关键性能指标参数。Step 102, the key performance indicator parameters used to identify the faulty cell are used as the original characteristic parameters of the sample data, including reference signal reception power, reference signal reception quality, uplink packet loss rate, downlink packet loss rate, uplink signal-to-noise ratio, downlink signal-to-noise ratio, radio resource control connection establishment success rate, evolved radio access bearer establishment success rate, call drop rate, switching success rate, uplink average throughput, downlink average throughput, node outgoing average throughput, node incoming average throughput, switching delay, link bit error rate, a total of 16 key performance indicator parameters.

2)最优特征组合选择，如果数据集中的数据样本包含过多的特征参数，不仅不利于接下来图的邻接矩阵的构建，导致生成的邻接矩阵不合理，而且在故障诊断阶段，还会降低模型训练的速度，产生维数灾难等问题。因此，需要根据网络参数的重要性，选择少量重要的网络参数来识别故障小区。包括以下步骤：2) Optimal feature combination selection. If the data samples in the data set contain too many feature parameters, it will not only be detrimental to the construction of the adjacency matrix of the subsequent graph, resulting in an unreasonable adjacency matrix, but also reduce the speed of model training in the fault diagnosis stage, causing dimensionality disaster and other problems. Therefore, it is necessary to select a small number of important network parameters to identify faulty cells based on the importance of network parameters. The following steps are included:

步骤201，使用XGBoost算法进行最优特征组合的选择，XGBoost会创建多个提升树，XGBoost在训练的每次迭代中，都会在先前模型损失函数的梯度下降方向上建立新的决策树模型，从而使得之前模型的预测值和真实值之间的残差逐渐减少。XGBoost算法可以为每个特征参数获得一个重要性评分，属性在模型中用于构建决策树的次数越多，就越重要；Step 201, use the XGBoost algorithm to select the optimal feature combination. XGBoost will create multiple boosted trees. In each iteration of training, XGBoost will build a new decision tree model in the gradient descent direction of the previous model loss function, so that the residual between the predicted value and the true value of the previous model is gradually reduced. The XGBoost algorithm can obtain an importance score for each feature parameter. The more times an attribute is used to build a decision tree in the model, the more important it is;

步骤202，利用XGBoost框架的特征重要性排序功能，根据特征参数的重要性评分对16个特征参数进行降序排序；Step 202, using the feature importance sorting function of the XGBoost framework, sorting the 16 feature parameters in descending order according to the importance scores of the feature parameters;

步骤203，根据步骤202中排好序的原始特征参数中选取使XGBoost算法的诊断准确率最高的前n个特征参数作为降维后的网络故障数据集的特征参数，0＜n＜16；Step 203, selecting the first n feature parameters that make the diagnosis accuracy of the XGBoost algorithm the highest from the original feature parameters sorted in step 202 as the feature parameters of the network fault data set after dimensionality reduction, 0＜n＜16;

3)图数据转换，需要将数据集中的数据转换成图数据这种非欧式数据，从而符合图卷积神经网络的输入格式。如图2所示，本发明所提出的方法首先需要把原始的网络故障数据集映射成一个无向图G＝(V,E)，图由两种类型的元素组成，即节点集V和边集E。在本发明中，原始数据集中的每个样本数据对应于图中的一个节点，每个节点都拥有各自的属性特征，而节点与节点之间的相似性度量可以通过边的权值来衡量。包括以下步骤：3) Graph data conversion, the data in the data set needs to be converted into graph data, a non-Euclidean data, so as to conform to the input format of the graph convolutional neural network. As shown in Figure 2, the method proposed in the present invention first needs to map the original network fault data set into an undirected graph G = (V, E), and the graph consists of two types of elements, namely the node set V and the edge set E. In the present invention, each sample data in the original data set corresponds to a node in the graph, each node has its own attribute characteristics, and the similarity measure between nodes can be measured by the edge weight. It includes the following steps:

步骤301，将降维后的网络故障数据集中样本数据的特征参数用特征矩阵的形式表示，特征矩阵中的每个行向量分别对应于降维后的网络故障数据集中的一条样本数据去除类别信息后所构成的特征参数向量；Step 301, the characteristic parameters of the sample data in the network fault data set after dimensionality reduction are expressed in the form of a characteristic matrix, and each row vector in the characteristic matrix corresponds to a characteristic parameter vector formed by removing the category information from a piece of sample data in the network fault data set after dimensionality reduction;

步骤302，将原始数据集中样本的标签信息用标签矩阵的形式表示，矩阵中标记样本的标签信息使用类似于独热编码的行向量来表示，分别对应于小区当前所遭受的不同种故障类别。而未标记数据的对应的行向量则为零向量，表示标记信息未知；Step 302: The label information of the samples in the original data set is represented in the form of a label matrix. The label information of the labeled samples in the matrix is represented by a row vector similar to one-hot encoding, which corresponds to the different types of faults currently suffered by the cell. The row vector corresponding to the unlabeled data is a zero vector, indicating that the label information is unknown.

步骤303，先构建一个权重矩阵来记录图中节点之间的相似度，权重矩阵中的元素w_ij，即各节点对间的相似度是通过两两计算欧氏距离并归一化后得到的，即

其中，w_ij表示节点x_i和x_j之间的相似性度量，x_i和x_j分别表示第i和第j两个不同的样本，δ被称为高斯带宽参数，可根据实际情况自主定义。然后，设置合理的参数阈值α，将权重矩阵中的所有元素与该阈值α进行比较，值大于该阈值的元素w_ij，重新置为1，否则全部置为0。于是，把权重矩阵转换为元素只有0和1的邻接矩阵A，其中0表示两个样本之间没有关系，而1表示两个样本之间相似性程度很高。Step 303, first construct a weight matrix to record the similarity between nodes in the graph. The elements w _ij in the weight matrix, i.e., the similarity between each node pair, are obtained by calculating the Euclidean distance between each pair and normalizing them, i.e.

Among them, w _ij represents the similarity measure between nodes _xi and x _j , _xi and x _j represent two different samples, δ, which is called the Gaussian bandwidth parameter and can be defined according to the actual situation. Then, a reasonable parameter threshold α is set, and all elements in the weight matrix are compared with the threshold α. The elements w _ij with values greater than the threshold are reset to 1, otherwise all are reset to 0. Thus, the weight matrix is converted into an adjacency matrix A with only 0 and 1 elements, where 0 indicates that there is no relationship between the two samples, and 1 indicates that the similarity between the two samples is very high.

4)基于图卷积神经网络的故障诊断，为了提高网络故障故障诊断的准确率，在训练GCN前，需要确定最佳的神经网络结构和参数，本发明为了说明GCN的结构与工作流程，如图3所示展示了本发明定义的一个GCN模型，包括以下步骤：4) Fault diagnosis based on graph convolutional neural network. In order to improve the accuracy of network fault diagnosis, it is necessary to determine the optimal neural network structure and parameters before training GCN. In order to illustrate the structure and workflow of GCN, the present invention shows a GCN model defined by the present invention as shown in FIG3, which includes the following steps:

步骤401，选择合理的GCN参数以及层次结构，构建图卷积神经网络。GCN的结构和参数选择主要包括图卷积层层数、各图卷积层的滤波器矩阵大小以及dropout层概率大小。按照实际情况，当图卷积层层数取2时最佳，此时既可以合理地聚合各中心节点的邻居节点的特征参数信息，也可以避免由于卷积层层数过多导致训练参数过多而造成的过拟合问题。dropout层的概率大小一般选取0.25或者0.5合适，滤波器矩阵大小需要依照前一层卷积层的大小来决定，同时还得考虑运算中属性降维等因素；Step 401, select reasonable GCN parameters and hierarchical structure to construct a graph convolutional neural network. The structure and parameter selection of GCN mainly include the number of graph convolution layers, the size of the filter matrix of each graph convolution layer, and the probability size of the dropout layer. According to actual conditions, it is best when the number of graph convolution layers is 2. At this time, it can not only reasonably aggregate the feature parameter information of the neighboring nodes of each central node, but also avoid the overfitting problem caused by too many training parameters due to too many convolution layers. The probability size of the dropout layer is generally selected to be 0.25 or 0.5. The size of the filter matrix needs to be determined according to the size of the previous convolution layer, and factors such as attribute dimensionality reduction in the operation must also be considered;

步骤402，GCN构建完成后，将步骤301中的特征矩阵和步骤302中的标签矩阵作为GCN的输入，根据GCN定义的层与层之间的传播公式，

通过划分训练集，训练图卷积神经网络模型，利用训练完成的GCN模型去聚合各节点以及邻居节点的特征参数，得到各节点的高阶聚合特征属性。其中，H^(l+1)表示当前图卷积层输出的各节点的高阶聚合特征属性所构成的矩阵，H^(l)是之前第l层的输出，

表示步骤303中的邻接矩阵加了自环，

是关于

的度矩阵，σ表示激活函数，W^(l)表示第l层图卷积层的可训练权重矩阵；Step 402, after GCN is constructed, the feature matrix in step 301 and the label matrix in step 302 are used as the input of GCN. According to the propagation formula between layers defined by GCN,

By dividing the training set and training the graph convolutional neural network model, the trained GCN model is used to aggregate the feature parameters of each node and its neighboring nodes to obtain the high-order aggregate feature attributes of each node. Among them, H ^(l+1) represents the matrix composed of the high-order aggregate feature attributes of each node output by the current graph convolution layer, and H ^(l) is the output of the previous lth layer.

It means that the adjacency matrix in step 303 has self-loops added.

About

, σ represents the activation function, and W ^(l) represents the trainable weight matrix of the l-th graph convolutional layer;

步骤403，将得到的各节点的高阶聚合特征属性输入Softmax层，得到最终的故障分类诊断结果。Softmax激活函数需要被应用在最后一层图卷积层输出的特征矩阵的每一行上。诊断结果会以节点级的特征矩阵Z表示，而矩阵Z中的每一行类似于独热编码的行向量所对应的网络故障就分别代表当前小区所遭受的不同种网络故障类别。Step 403, input the obtained high-order aggregated feature attributes of each node into the Softmax layer to obtain the final fault classification diagnosis result. The Softmax activation function needs to be applied to each row of the feature matrix output by the last graph convolution layer. The diagnosis result will be represented by the node-level feature matrix Z, and each row in the matrix Z is similar to the network fault corresponding to the row vector of the one-hot encoding, which represents the different types of network faults suffered by the current cell.

以上所述，仅为本发明的具体实施方式，但本发明的保护范围并不局限于此，任何熟悉本技术领域的技术人员在本发明揭露的技术范围内，可轻易想到的变化或替换，都应涵盖在本发明的保护范围内。The above description is only a specific implementation mode of the present invention, but the protection scope of the present invention is not limited thereto. Any changes or substitutions that can be easily thought of by any technician familiar with the technical field within the technical scope disclosed by the present invention should be covered by the protection scope of the present invention.

Claims

1. A cellular network fault diagnosis method, is characterized in that, comprises the following steps:

Step 1. Determine the network fault data set; the details are as follows:

Step 101, the data in the network fault data set includes original characteristic parameters and label categories, and defines non-network fault conditions and 5 types of network faults as label categories, wherein the 5 types of network faults are uplink interference, downlink interference, coverage holes, and air interfaces Faults and base station failures;

Step 102, using all key performance indicators used when identifying the faulty cell as original characteristic parameters, the original characteristic parameters include 16 characteristic parameters, and these 16 characteristic parameters include reference signal received power, reference signal received quality, uplink packet loss rate, downlink packet loss rate, uplink signal-to-noise ratio, downlink signal-to-noise ratio, radio resource control connection establishment success rate, evolved radio access bearer establishment success rate, call drop rate, handover success rate, uplink average Throughput, downlink average throughput, node outgoing average throughput, node incoming average throughput, switching delay and link bit error rate;

Step 2. Using the XGBoost algorithm, select the first n characteristic parameters that make the diagnosis accuracy of the XGBoost algorithm the highest from the original characteristic parameters as the characteristic parameters of the data in the network fault data set, so as to obtain the network fault data set after dimensionality reduction, 0< n<16;

Step 3, graph data conversion, as follows:

Step 301, express the network fault data set after dimension reduction in step 2 in the form of a feature matrix, and each row vector in the feature matrix corresponds to a feature parameter in the network fault data set after dimension reduction except category information vector;

Step 302, express the label information of the network fault data set after dimension reduction in step 2 in the form of a label matrix; in the label matrix, the label row vectors of labeled data are in the form of one-hot encoding, while the label row vectors of unlabeled data Then it is a zero vector; labeled data refers to data with labels, and unlabeled data refers to data without labels;

Step 303, map the network fault data set after dimensionality reduction into an undirected graph G=(V, E), the undirected graph is composed of two types of elements, namely node set V and edge set E; introduce a weight matrix To represent the similarity between nodes in the undirected graph, the elements in the weight matrix are the similarity between each node pair, and the similarity between each node pair is calculated by the Euclidean distance between two nodes and normalized Obtained; then compare all the elements in the weight matrix with the parameter threshold through the set parameter threshold, if the current element is greater than the parameter threshold, set the element to 1, otherwise set to 0; thus transform the weight matrix It is an adjacency matrix whose matrix elements only have 0 and 1, and the adjacency matrix represents the adjacency relationship between nodes;

Step 4. Fault diagnosis based on the graph convolutional neural network, as follows:

Step 401, adjusting the parameters of the graph convolutional neural network and selecting a hierarchical structure to construct a graph convolutional neural network model;

Step 402, the feature matrix of step 301 and the label matrix of step 302 are the input of the graph convolutional neural network, which are used to train the parameters in the graph convolutional neural network model, and according to the trained graph convolutional neural network model, step 303 The adjacency matrix and the propagation formula between layers are used to obtain the high-order aggregate feature attributes of each node; the high-order aggregate feature attributes of each node are input into the Softmax layer in the graph convolutional neural network model to obtain the final fault classification and diagnosis result.

2. A kind of cellular network fault diagnosis method according to claim 1, is characterized in that, step 2 is specifically as follows:

Step 201, using the XGBoost algorithm to obtain the importance score of each feature parameter in the original feature parameters;

Step 202, sort the feature parameters in descending order according to the importance scores of the feature parameters;

Step 203, according to the original characteristic parameters sorted in step 202, select the first n characteristic parameters that make the diagnosis accuracy of the XGBoost algorithm the highest as the characteristic parameters of the data in the network fault data set, so as to obtain the network fault data after dimension reduction Set, 0<n<16.

3. A kind of cellular network fault diagnosis method according to claim 1, is characterized in that, the parameter in the graph convolution neural network model comprises graph convolution layer number of layers, dropout layer probability size and filter matrix size.