TWI735942B - System and method for predicting and preventing obstacles of network communication equipment based on machine learning - Google Patents

System and method for predicting and preventing obstacles of network communication equipment based on machine learning Download PDF

Info

Publication number
TWI735942B
TWI735942B TW108132058A TW108132058A TWI735942B TW I735942 B TWI735942 B TW I735942B TW 108132058 A TW108132058 A TW 108132058A TW 108132058 A TW108132058 A TW 108132058A TW I735942 B TWI735942 B TW I735942B
Authority
TW
Taiwan
Prior art keywords
data
network
management
kpi
network communication
Prior art date
Application number
TW108132058A
Other languages
Chinese (zh)
Other versions
TW202112103A (en
Inventor
吳銘晏
林裕祥
曾則翔
許真民
Original Assignee
中華電信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中華電信股份有限公司 filed Critical 中華電信股份有限公司
Priority to TW108132058A priority Critical patent/TWI735942B/en
Publication of TW202112103A publication Critical patent/TW202112103A/en
Application granted granted Critical
Publication of TWI735942B publication Critical patent/TWI735942B/en

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

The invention discloses system and method for predicting and preventing obstacles of network communication equipment based on machine learning. First, collect performance management (PM) data, configuration management (CM) data and obstacle events of network communication equipment, convert the performance management (PM) data and the configuration management (CM) data into a key performance indicator (KPI), and use the key performance indicator (KPI) as an input factor and the obstacle event as an output result to build a machine learning model as an obstacle diagnosis model. Second, enter a new key performance indicator (KPI) converted from new performance management (PM) data and new configuration management (CM) data into the obstacle diagnosis model to predict or diagnose possibility of obstacles of the network communication equipment, and then adjust the equipment parameters of the network communication equipment to prevent the occurrence of obstacles.

Description

基於機器學習預測與防範網路通訊設備發生障礙之系統及方法 System and method for predicting and preventing obstacles in network communication equipment based on machine learning

本發明是關於一種預測與防範網路通訊設備發生障礙之技術,特別是指一種基於機器學習預測與防範網路通訊設備發生障礙之系統及方法。 The present invention relates to a technology for predicting and preventing obstacles in network communication equipment, in particular to a system and method for predicting and preventing obstacles in network communication equipment based on machine learning.

以往小型基地台(Small Cell)設備發生障礙時,由小型基地台設備本身發送障礙事件與告警至網管系統,以便通知維運人員查測小型基地台設備發生障礙之根源。又,在小型基地台設備發生障礙之前,網管系統亦會收集小型基地台設備之效能及組態等資訊,這些資訊皆為時序性資料,亦即有時間相依的特性,且小型基地台設備發生障礙之原因也與小型基地台設備之效能及組態的改變有相當程度的關係。 In the past, when the small cell equipment was obstructed, the small cell equipment itself sent obstruction events and alarms to the network management system to notify the maintenance personnel to investigate the root cause of the small cell equipment obstruction. In addition, before the small base station equipment is obstructed, the network management system will also collect information such as the performance and configuration of the small base station equipment. These information are all time-dependent data, that is, have time-dependent characteristics, and the small base station equipment occurs The reason for the obstacle is also related to the change of the performance and configuration of the small base station equipment to a considerable extent.

在一現有技術中,提出一種電信網路障礙根源分析的系統與方法,其主要應用資訊分享技術於網路障礙根源分析,先利用分散各地的 網管人員作為網路問題的觀測點,並藉由自動關聯、收集與分享網管人員的動態與網路障礙維修紀錄,以達到網路障礙根源分析目的。然而,此現有技術必須基於傳統規則模式(Rule-Based),並主要由網管人員需要定期回報障礙資訊與維修紀錄,以致無法達到智慧維運(AIOps)之目標。 In an existing technology, a system and method for the root cause analysis of telecommunications network obstacles is proposed. It mainly applies information sharing technology to the root cause analysis of network obstacles. Network administrators serve as observation points for network problems, and by automatically associating, collecting and sharing network administrators’ dynamics and network obstacle maintenance records to achieve the purpose of root cause analysis of network obstacles. However, this existing technology must be based on the traditional rule-based model, and network management personnel need to report obstacle information and maintenance records regularly, so that the goal of intelligent maintenance (AIOps) cannot be achieved.

因此,如何提供一種新穎或創新之預測與防範網路通訊設備(如小型基地台設備)發生障礙之技術,實已成為本領域技術人員之一大研究課題。 Therefore, how to provide a novel or innovative technology for predicting and preventing obstacles in network communication equipment (such as small base station equipment) has become a major research topic for those skilled in the art.

本發明提供一種新穎或創新之基於機器學習預測與防範網路通訊設備發生障礙之系統及方法,能快速預測或診斷網路通訊設備是否發生障礙。 The present invention provides a novel or innovative system and method based on machine learning to predict and prevent obstacles in network communication equipment, which can quickly predict or diagnose whether network communication equipment has obstacles.

本發明之基於機器學習預測與防範網路通訊設備發生障礙之系統包括:至少一網路通訊設備;一網管模組,係收集來自網路通訊設備之效能管理(Performance Management;PM)資料、組態管理(Configuration Management;CM)資料與障礙事件;以及一診斷模組,係將網管模組所收集之網路通訊設備之效能管理(PM)資料與組態管理(CM)資料計算出或轉換成相應之關鍵績效指標(Key Performance Indicator;KPI),且診斷模組以關鍵績效指標(KPI)作為輸入因子及以障礙事件作為輸出結果來建置機器學習模型作為障礙診斷模型,其中,診斷模組係將由網路通訊設備之新效能管理(PM)資料與新組態管理(CM)資料計算出或轉換而成之新關鍵績效指標(KPI)輸入障礙診斷模型,以由診斷模組透過障礙診斷模型利用新關鍵績 效指標(KPI)預測或診斷網路通訊設備是否可能發生障礙,俾於預測或診斷出網路通訊設備可能發生障礙時,由診斷模組調整網路通訊設備之設備參數以防範或防止網路通訊設備發生障礙。 The system for predicting and preventing obstacles in network communication equipment based on machine learning of the present invention includes: at least one network communication equipment; State management (Configuration Management; CM) data and failure events; and a diagnostic module, which calculates or converts the performance management (PM) data and configuration management (CM) data of the network communication equipment collected by the network management module The corresponding key performance indicator (KPI) is established, and the diagnosis module uses the key performance indicator (KPI) as the input factor and the obstacle event as the output result to build the machine learning model as the obstacle diagnosis model. Among them, the diagnosis module The set is to input the new key performance indicators (KPI) calculated or converted from the new performance management (PM) data and the new configuration management (CM) data of the network communication equipment into the obstacle diagnosis model, so that the diagnosis module can pass the obstacle Diagnosis model utilizes new key performance Performance indicators (KPI) predict or diagnose whether the network communication equipment may be malfunctioning. When it is predicted or diagnosed that the network communication equipment may be malfunctioning, the diagnostic module adjusts the equipment parameters of the network communication equipment to prevent or prevent the network Obstacles to communication equipment.

本發明之基於機器學習預測與防範網路通訊設備發生障礙之方法包括:令一網管模組收集來自至少一網路通訊設備之效能管理(PM)資料、組態管理(CM)資料與障礙事件;令一診斷模組將網管模組所收集之網路通訊設備之效能管理(PM)資料與組態管理(CM)資料計算出或轉換成相應之關鍵績效指標(KPI),且令診斷模組以關鍵績效指標(KPI)作為輸入因子及以障礙事件作為輸出結果來建置機器學習模型作為障礙診斷模型;以及令診斷模組將由網路通訊設備之新效能管理(PM)資料與新組態管理(CM)資料計算出或轉換而成之新關鍵績效指標(KPI)輸入障礙診斷模型,以令診斷模組透過障礙診斷模型利用新關鍵績效指標(KPI)預測或診斷網路通訊設備是否可能發生障礙,俾於預測或診斷出網路通訊設備可能發生障礙時,令診斷模組調整網路通訊設備之設備參數以防範或防止網路通訊設備發生障礙。 The method for predicting and preventing network communication equipment obstacles based on machine learning of the present invention includes: making a network management module collect performance management (PM) data, configuration management (CM) data and obstacle events from at least one network communication equipment ; Make a diagnostic module calculate or convert the performance management (PM) data and configuration management (CM) data of the network communication equipment collected by the network management module into the corresponding key performance indicators (KPI), and make the diagnostic model The group uses key performance indicators (KPI) as input factors and obstacle events as output results to build machine learning models as obstacle diagnosis models; and the diagnosis module will be set by the new performance management (PM) data and new groups of network communication equipment The new key performance indicators (KPI) calculated or converted from the state management (CM) data are input into the obstacle diagnosis model, so that the diagnosis module can use the new key performance indicators (KPI) to predict or diagnose whether the network communication equipment is Obstacles may occur. When it is predicted or diagnosed that the network communication equipment may have an obstacle, the diagnostic module is asked to adjust the equipment parameters of the network communication equipment to prevent or prevent the network communication equipment from malfunctioning.

為讓本發明之上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明。在以下描述內容中將部分闡述本發明之額外特徵及優點,且此等特徵及優點將部分自所述描述內容可得而知,或可藉由對本發明之實踐習得。本發明之特徵及優點借助於在申請專利範圍中特別指出的元件及組合來認識到並達到。應理解,前文一般描述與以下詳細描述二者均僅為例示性及解釋性的,且不欲約束本發明所欲主張之範圍。 In order to make the above-mentioned features and advantages of the present invention more comprehensible, embodiments are specifically described below in conjunction with the accompanying drawings. In the following description, the additional features and advantages of the present invention will be partially explained, and these features and advantages will be partly known from the description, or can be learned by practicing the present invention. The features and advantages of the present invention are realized and achieved by means of the elements and combinations specifically pointed out in the scope of the patent application. It should be understood that both the foregoing general description and the following detailed description are only illustrative and explanatory, and are not intended to limit the scope of the present invention.

1‧‧‧基於機器學習預測與防範網路通訊設備發生障礙之系統 1. A system based on machine learning to predict and prevent obstacles in network communication equipment

10‧‧‧網路通訊設備 10‧‧‧Network communication equipment

20‧‧‧網管模組 20‧‧‧Network Management Module

21‧‧‧伺服器 21‧‧‧Server

22‧‧‧資料庫 22‧‧‧Database

30‧‧‧診斷模組 30‧‧‧Diagnostic Module

31‧‧‧權重評估演算法 31‧‧‧Weight evaluation algorithm

40‧‧‧障礙診斷模型 40‧‧‧Disorder diagnosis model

50‧‧‧用戶設備 50‧‧‧User Equipment

60‧‧‧安全閘道器 60‧‧‧Security Gateway

70‧‧‧演進節點B閘道器 70‧‧‧Evolution Node B Gateway

80‧‧‧核心網路 80‧‧‧Core network

90‧‧‧關鍵績效指標(KPI)公式資料庫 90‧‧‧Key Performance Indicator (KPI) Formula Database

100‧‧‧機器學習模型架構 100‧‧‧Machine learning model architecture

101‧‧‧障礙事件日誌 101‧‧‧Obstacle Event Log

102‧‧‧障礙標籤 102‧‧‧Barrier label

103‧‧‧PM與CM資料 103‧‧‧PM and CM data

104‧‧‧關鍵績效指標(KPI) 104‧‧‧Key Performance Indicators (KPI)

105‧‧‧新PM與CM資料 105‧‧‧New PM and CM data

106‧‧‧新關鍵績效指標(KPI) 106‧‧‧New Key Performance Indicators (KPI)

110‧‧‧資料集 110‧‧‧Data Collection

111‧‧‧訓練資料集 111‧‧‧Training Data Set

112‧‧‧驗證資料集 112‧‧‧Verification Data Set

120‧‧‧機器學習模型 120‧‧‧Machine Learning Model

121‧‧‧softmax演算法 121‧‧‧softmax algorithm

122‧‧‧退出層 122‧‧‧Exit layer

CM‧‧‧組態管理(資料) CM‧‧‧Configuration Management (Data)

IPSec‧‧‧網際網路安全協定(連接介面) IPSec‧‧‧Internet Security Protocol (connection interface)

KPI‧‧‧關鍵績效指標 KPI‧‧‧Key Performance Indicators

PM‧‧‧效能管理(資料) PM‧‧‧Performance Management (Data)

S1‧‧‧連接介面 S1‧‧‧Connecting interface

S11至S15、S21至S24‧‧‧步驟 Steps S11 to S15, S21 to S24‧‧‧

WAN‧‧‧廣域網路 WAN‧‧‧Wide Area Network

第1圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之系統及其網路通訊設備之網路架構之示意圖;第2圖為本發明之網路通訊設備及其網路架構中有關網路障礙特徵之示意圖;第3圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之系統之實施例示意圖;第4圖為本發明之機器學習模型架構及其機器學習模組之示意圖;第5A圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之方法中有關障礙診斷模型之建置流程圖;第5B圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之方法中有關網路通訊設備之障礙診斷流程圖;第6A圖、第6B圖與第6C圖分別為本發明中第一測試案例之模型驗證準確率、模型驗證精確率與模型驗證招回率之曲線圖;以及第7A圖、第7B圖與第7C圖分別為本發明中第二測試案例之模型驗證準確率、模型驗證精確率與模型驗證招回率之曲線圖。 Figure 1 is a schematic diagram of the system of the present invention based on machine learning to predict and prevent obstacles in network communication equipment and the network architecture of the network communication equipment; Figure 2 is the network communication equipment of the present invention and its network architecture Figure 3 is a schematic diagram of an embodiment of the system for predicting and preventing obstacles in network communication equipment based on machine learning in the present invention; Figure 4 is the machine learning model architecture and machine learning of the present invention The schematic diagram of the module; Figure 5A is the flow chart of the construction of the obstacle diagnosis model in the method of predicting and preventing network communication equipment based on machine learning in the present invention; Figure 5B is the prediction and prevention based on machine learning of the present invention The flow chart for the diagnosis of the obstacles of the network communication equipment in the method for the obstacles of the network communication equipment; Fig. 6A, Fig. 6B and Fig. 6C are respectively the model verification accuracy rate and the model verification accuracy rate of the first test case in the present invention A graph with the model verification recall rate; and Figures 7A, 7B, and 7C are the graphs of the model verification accuracy rate, model verification accuracy rate, and model verification recall rate of the second test case of the present invention, respectively .

以下藉由特定的具體實施形態說明本發明之實施方式,熟悉此技術之人士可由本說明書所揭示之內容了解本發明之其他優點與功效, 亦可因而藉由其他不同的具體等同實施形態加以施行或應用。 The following describes the implementation of the present invention with specific specific embodiments. Those familiar with this technology can understand the other advantages and effects of the present invention from the content disclosed in this specification. It can also be implemented or applied by other different specific equivalent embodiments.

第1圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之系統1及其網路通訊設備10之網路架構之示意圖。同時,基於機器學習預測與防範網路通訊設備發生障礙之系統1之主要技術內容如下,其餘技術內容相同於第2圖至第7C圖之說明,於此不再重覆敘述。 Figure 1 is a schematic diagram of the system 1 and the network architecture of the network communication device 10 based on machine learning to predict and prevent obstacles in network communication equipment of the present invention. At the same time, the main technical content of the system 1 based on machine learning to predict and prevent obstacles in network communication equipment is as follows, and the rest of the technical content is the same as the description in Figures 2 to 7C, and will not be repeated here.

如第1圖所示,基於機器學習預測與防範網路通訊設備發生障礙之系統1主要包括至少一(如複數)網路通訊設備10、一網管模組20、一診斷模組30與一關鍵績效指標(KPI)公式資料庫90(見第3圖),亦可進一步包括複數用戶設備(UE)50、至少一安全閘道器(Security Gateway;SecGW)60、至少一演進節點B閘道器(eNode Gateway)70與一核心網路80。例如,網路通訊設備10可為小型基地台(Small Cell)設備,亦可為無線基地台(Wi-Fi AP)設備或家庭閘道器(Home Gateway)設備等用戶端設備(Customer Premise Equipment;CPE)。網管模組20可為EMS(Element Management System;網元管理系統)網管模組或EMS網管系統,並具有伺服器21與資料庫22(見第3圖)。診斷模組30可為診斷伺服器(Diagnosis Server;DS),並具有權重評估演算法31。但是,本發明並不以此為限。 As shown in Figure 1, the system 1 based on machine learning to predict and prevent obstacles in network communication equipment mainly includes at least one (such as plural) network communication equipment 10, a network management module 20, a diagnostic module 30, and a key Performance indicator (KPI) formula database 90 (see Figure 3), may further include a plurality of user equipment (UE) 50, at least one security gateway (Security Gateway; SecGW) 60, and at least one evolved node B gateway (eNode Gateway) 70 and a core network 80. For example, the network communication device 10 may be a small cell (Small Cell) device, or may be a wireless base station (Wi-Fi AP) device or a home gateway (Home Gateway) device and other customer premise equipment (Customer Premise Equipment; CPE). The network management module 20 can be an EMS (Element Management System) network management module or an EMS network management system, and has a server 21 and a database 22 (see Figure 3). The diagnosis module 30 may be a diagnosis server (Diagnosis Server; DS), and has a weight evaluation algorithm 31. However, the present invention is not limited to this.

網管模組20可收集來自網路通訊設備10之效能管理(PM)資料、組態管理(CM)資料與障礙事件。診斷模組30可將網管模組20所收集之網路通訊設備10之效能管理(PM)資料與組態管理(CM)資料計算出或轉換成相應之關鍵績效指標(KPI),且診斷模組30能以關鍵績效指標(KPI)作為輸入因子及以障礙事件作為輸出結果來建置機器學習模型120(見第4圖)作為障礙診斷模型40。同時,診斷模組30可將由網路通訊設備10之 新效能管理(PM)資料與新組態管理(CM)資料計算出或轉換而成之新關鍵績效指標(KPI)輸入障礙診斷模型40,以由診斷模組30透過障礙診斷模型40利用新關鍵績效指標(KPI)預測或診斷網路通訊設備10是否可能發生障礙,俾於預測或診斷出網路通訊設備10可能發生障礙時,由診斷模組30調整網路通訊設備10之設備參數以防範或防止網路通訊設備10發生障礙。 The network management module 20 can collect performance management (PM) data, configuration management (CM) data, and obstacle events from the network communication device 10. The diagnostic module 30 can calculate or convert the performance management (PM) data and configuration management (CM) data of the network communication equipment 10 collected by the network management module 20 into corresponding key performance indicators (KPI), and the diagnostic model The group 30 can use key performance indicators (KPIs) as input factors and obstacle events as output results to build a machine learning model 120 (see Figure 4) as the obstacle diagnosis model 40. At the same time, the diagnostic module 30 can be used by the network communication equipment 10 The new key performance indicator (KPI) calculated or converted from the new performance management (PM) data and the new configuration management (CM) data is input into the obstacle diagnosis model 40, so that the diagnosis module 30 uses the new key through the obstacle diagnosis model 40 Performance indicators (KPI) predict or diagnose whether the network communication device 10 may be malfunctioning. When it is predicted or diagnosed that the network communication device 10 may be malfunctioning, the diagnostic module 30 adjusts the device parameters of the network communication device 10 to prevent Or to prevent the network communication equipment 10 from malfunctioning.

網路通訊設備10可透過例如廣域網路(Wide Area Network;WAN)之網路連接至安全閘道器(SecGW)60,並由安全閘道器(SecGW)60透過例如S1連接介面或IPSec(Internet Protocol Security;網際網路安全協定)連接介面以連接至演進節點B閘道器70,再由演進節點B閘道器70透過例如S1連接介面延伸至例如EPC(Evolved Packet Core;演進封包核心)網路之核心網路80。 The network communication device 10 can be connected to a secure gateway (SecGW) 60 through a network such as a wide area network (Wide Area Network; WAN), and the secure gateway (SecGW) 60 can use, for example, an S1 connection interface or an IPSec (Internet Protocol Security (Internet Security Protocol) connection interface to connect to the evolved node B gateway 70, and then the evolved node B gateway 70 extends to, for example, the EPC (Evolved Packet Core) network through the S1 connection interface, for example Road core network 80.

網管模組20(如EMS網管模組或EMS網管系統)透過通訊協定納管所有網路通訊設備10(如小型基地台設備)之供裝、監控、參數設定或(及)關鍵績效指標(KPI)之管理等網管功能,以由網路通訊設備10(如小型基地台設備)固定每一段時間(如每15分鐘)上傳效能管理(PM)資料、組態管理(CM)資料、設備紀錄(Log)至網管模組20,再由診斷模組30依據效能管理(PM)資料、組態管理(CM)資料與3GPP定義之LTE(Long Term Evolution;長期演進技術)關鍵績效指標(KPI)規範計算各網路通訊設備10的關鍵績效指標(KPI)。 The network management module 20 (such as EMS network management module or EMS network management system) manages the supply, monitoring, parameter setting or (and) key performance indicators (KPIs) of all network communication equipment 10 (such as small base station equipment) through the communication protocol ( Log) to the network management module 20, and then the diagnostic module 30 according to the performance management (PM) data, configuration management (CM) data and the LTE (Long Term Evolution; Long Term Evolution) key performance indicator (KPI) specifications defined by 3GPP Calculate the key performance indicators (KPI) of each network communication device 10.

再者,第5代(5G)網路之建置初期為加強服務涵蓋範圍,並提供穩定與高效的無線網路環境,採用例如長期演進技術(LTE)之小型基地台設備等網路通訊設備10扮演著重要的角色。又,有鑑於全球部署之網路 通訊設備10(如小型基地台設備)有大量增加趨勢,當網路通訊設備10發生障礙時,電信營運商需要立即找出障礙點,並派工排除問題,以確保其服務正常運作。然而,網路通訊設備10之障礙發生因素繁多,且障礙點可能來自網路通訊設備10本身、演進節點B閘道器70或核心網路80,亦或是網路通訊設備10彼此之間的通訊網路障礙。所以,在最短時間內找出網路通訊設備10之障礙發生原因,將有效幫助電信營運商改善用戶體驗。 Furthermore, the 5th generation (5G) network was initially built to enhance service coverage and provide a stable and efficient wireless network environment, using network communication equipment such as long-term evolution technology (LTE) small base station equipment. 10 plays an important role. Also, in view of the global deployment of networks Communication equipment 10 (such as small base station equipment) has a large increase trend. When network communication equipment 10 is obstructed, telecom operators need to immediately identify the obstacle and dispatch workers to eliminate the problem to ensure the normal operation of its service. However, there are many factors that cause obstacles to the network communication equipment 10, and the obstacles may come from the network communication equipment 10 itself, the evolved node B gateway 70 or the core network 80, or between the network communication equipment 10 and each other. Communication network barriers. Therefore, finding out the cause of the obstacle of the network communication device 10 in the shortest time will effectively help the telecommunication operator to improve the user experience.

因此,本發明能在初始階段介接網路通訊設備10之資訊,並大量收集網路通訊設備10之效能管理(PM)資料、組態管理(CM)資料、設備紀錄(Log)與障礙事件等資訊,以運用如第4圖所示之機器學習模型120(例如包括複數遞歸神經網路(Recurrent Neural Network;RNN)層與複數神經元之遞歸神經網路模型)建構機器學習模型架構(如遞歸神經網路架構),再計算最適的類別權重或時序權重,進而快速診斷網路通訊設備10發生障礙之機率。 Therefore, the present invention can interface with the information of the network communication equipment 10 in the initial stage, and collect a large amount of performance management (PM) data, configuration management (CM) data, equipment records (Log) and failure events of the network communication equipment 10 And other information to use the machine learning model 120 shown in Figure 4 (for example, a recurrent neural network model including a complex recurrent neural network (Recurrent Neural Network; RNN) layer and a complex number of neurons) to construct a machine learning model architecture (such as Recursive neural network architecture), and then calculate the most suitable category weight or time sequence weight, and then quickly diagnose the probability of failure of the network communication device 10.

又,隨著無線網路架構的演進,網路通訊設備10(如小型基地台設備)之間的資料路由與交換將愈來愈複雜。因此,網路通訊設備10之障礙診斷若以傳統規則模式(Rule-Based)進行,將耗時大量時間追查每個節點之狀況。是以,本發明採用新興的機器學習模型120(見第4圖)或機器學習技術,將有助於例如電信營運商在第5代(5G)網路或更先進網路中,藉由診斷模組建立知識基礎模式(Knowledge-Based)之障礙診斷方法,以達成智慧維運(Artificial Intelligence for IT Operations;AIOps)之目標。 In addition, with the evolution of wireless network architecture, data routing and exchange between network communication devices 10 (such as small base station devices) will become more and more complicated. Therefore, if the fault diagnosis of the network communication device 10 is performed in a traditional rule-based mode, it will take a lot of time to trace the status of each node. Therefore, the present invention adopts the emerging machine learning model 120 (see Figure 4) or machine learning technology, which will help, for example, telecom operators in the 5th generation (5G) network or more advanced network, by diagnosis The module establishes a knowledge-based obstacle diagnosis method to achieve the goal of Artificial Intelligence for IT Operations (AIOps).

另外,本發明提供或設計有關障礙事件之權重評估演算法31,能在不平衡的網路通訊設備10之資料集中提升模型學習準確率。因此, 本發明能用於各種網路通訊設備10之障礙偵測與預防,以減少電信運營商在網路通訊設備10之維運成本與時間。 In addition, the present invention provides or designs the weight evaluation algorithm 31 related to obstacle events, which can improve the accuracy of model learning in the data set of the unbalanced network communication device 10. therefore, The present invention can be used for obstacle detection and prevention of various network communication equipment 10 to reduce the maintenance cost and time of the network communication equipment 10 by the telecommunication operator.

第2圖為本發明之網路通訊設備10(如小型基地台設備)及其網路架構中有關網路障礙特徵之示意圖。如圖所示,網路通訊設備10可包括例如下列11項障礙事件:[1]設備之PCI(Peripheral Component Interconnect;週邊構件互連)衝突、[2]設備未取得PCI(週邊構件互連)、[3]設備之CPU(Central Processing Unit;中央處理器)負載過高、[4]設備之記憶體負載過高、[5]設備之溫度超過臨界值、[6]設備之時間同步失敗、[7]設備之IP(Internet Protocol;網際網路協定)位址衝突、[8]設備之MAC(Media Access Control;媒體存取控制)位址衝突、[9]設備之韌體更新失敗或遺失、[10]設備交遞(Handover)之X2連接介面斷線、[11]設備上傳PM(效能管理)資料、組態管理(CM)資料失敗。上述11項障礙事件皆為網路通訊設備10(如小型基地台設備)本身所產生的障礙事件日誌101或事件紀錄日誌(見第4圖),這些障礙事件將嚴重影響網路通訊設備10提供網路服務(如無線網路服務)之品質,甚至使得網路通訊設備10無法提供網路服務。 Figure 2 is a schematic diagram of the network communication equipment 10 (such as a small base station equipment) of the present invention and related network obstacle characteristics in its network architecture. As shown in the figure, the network communication device 10 may include, for example, the following 11 obstacle events: [1] PCI (Peripheral Component Interconnect) conflict of the device, [2] The device does not obtain PCI (Peripheral Component Interconnect) , [3] The CPU (Central Processing Unit) load of the device is too high, [4] The memory load of the device is too high, [5] The temperature of the device exceeds the critical value, [6] The time synchronization of the device fails, [7] Device IP (Internet Protocol) address conflict, [8] Device MAC (Media Access Control) address conflict, [9] Device firmware update failed or missing , [10] The X2 connection interface of the device handover (Handover) was disconnected, [11] The device failed to upload PM (performance management) data and configuration management (CM) data. The 11 obstacle events mentioned above are all the obstacle event log 101 or event log (see Figure 4) generated by the network communication equipment 10 (such as small base station equipment). These obstacle events will seriously affect the network communication equipment 10 to provide The quality of network services (such as wireless network services) even makes the network communication device 10 unable to provide network services.

網管模組20(如EMS網管模組或EMS網管系統)納管網路通訊設備10(如小型基地台設備)可包括例如下列4項障礙事件:[1]設備之供裝失敗、[2]設備之參數取得失敗、[3]設備之參數設定失敗、[4]設備之災防告警系統(Public Warning System;PWS)之參數設定失敗。上述4項障礙事件皆為網管模組20(如EMS網管模組或EMS網管系統)針對設備管理所產生的障礙事件日誌101或事件紀錄日誌(見第4圖),雖不影響網路通訊設備10提供網路服務(如無線網路服務),但卻無法即時監控與優化網路通 訊設備10之效能管理(PM)資料之參數。 The network management module 20 (such as the EMS network management module or the EMS network management system) to manage the network communication equipment 10 (such as the small base station equipment) may include, for example, the following 4 obstacles: [1] Failure to install the equipment, [2] Failed to obtain the parameters of the equipment, [3] Failed to set the parameters of the equipment, [4] Failed to set the parameters of the equipment's disaster prevention warning system (Public Warning System; PWS). The above 4 obstacle events are all the obstacle event log 101 or event log (see Figure 4) generated by the network management module 20 (such as EMS network management module or EMS network management system) for equipment management, although it does not affect the network communication equipment 10 Provide network services (such as wireless network services), but cannot monitor and optimize network communication in real time Parameters of the performance management (PM) data of the telecommunications equipment 10.

在網路(如廣域網路或局部網路)之安全閘道器(SecGW)60中,可包括例如下列3項障礙事件:[1]IPSec(網際網路安全協定)通道(Tunnel)建置失敗、[2]S1-MME(Mobility Management Entity;移動管理實體)連接介面斷線、[3]IPSec通道出現意外斷線。上述3項障礙事件皆為網路通訊設備10(如小型基地台設備)連接安全閘道器(SecGW)60所產生的障礙事件日誌101或事件紀錄日誌(見第4圖),這些障礙事件將嚴重影響網路通訊設備10提供網路服務。 The security gateway (SecGW) 60 of a network (such as a wide area network or a local network) can include, for example, the following 3 obstacles: [1] IPSec (Internet Security Protocol) tunnel establishment failure , [2] S1-MME (Mobility Management Entity) connection interface is disconnected, [3] IPSec channel is unexpectedly disconnected. The above three obstacle events are all the obstacle event log 101 or event log (see Figure 4) generated by the network communication equipment 10 (such as small base station equipment) connected to the secure gateway (SecGW) 60. These obstacle events will be Seriously affect the network communication equipment 10 to provide network services.

第3圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之系統1之實施例示意圖。如圖所示,網路通訊設備10(如小型基地台設備)定期透過通訊協定將效能管理(PM)資料與組態管理(CM)資料傳送至網管模組20(如EMS網管模組或EMS網管系統)之伺服器21(如自動組態伺服器(Auto Configuration Server,ACS)),以將效能管理(PM)資料與組態管理(CM)資料儲存於網管模組20(或伺服器21)之資料庫22(如ACS資料庫)中,且網路通訊設備10將所發生有關障礙事件之障礙事件日誌101(事件紀錄日誌)一併轉發至網管模組20以記錄於伺服器21或資料庫22中。 Figure 3 is a schematic diagram of an embodiment of the system 1 for predicting and preventing obstacles in network communication equipment based on machine learning of the present invention. As shown in the figure, the network communication equipment 10 (such as small base station equipment) regularly transmits performance management (PM) data and configuration management (CM) data to the network management module 20 (such as EMS network management module or EMS) through the communication protocol. The server 21 of the network management system (such as Auto Configuration Server (ACS)) to store performance management (PM) data and configuration management (CM) data in the network management module 20 (or server 21) ) In the database 22 (such as the ACS database), and the network communication device 10 forwards the obstacle event log 101 (event record log) related to the obstacle event that has occurred to the network management module 20 for recording on the server 21 or Database 22.

又,網管模組20之伺服器21接收例如約二百多個效能管理(PM)資料與組態管理(CM)資料之參數,並透過表現層狀態轉換應用程式介面(Representational State Transfer Application Programming Interface;Rest API)將效能管理(PM)資料與組態管理(CM)資料之參數傳送至診斷模組30(如診斷伺服器),且由診斷模組30透過關鍵績效指標(KPI)公式資料庫90將效能管理(PM)資料與組態管理(CM)資料之參數計算出或轉換成相 應之關鍵績效指標(KPI),以依據關鍵績效指標(KPI)呈現網路通訊設備10(如小型基地台設備)之無線網路品質資訊。而且,診斷模組30(如診斷伺服器)記錄所有設備每一段時間(如每15分鐘)的89項關鍵績效指標(KPI)與事件日誌,同時進行設備發生障礙診斷之資料前處理、合併、模型建置與障礙預測。 In addition, the server 21 of the network management module 20 receives, for example, more than two hundred performance management (PM) data and configuration management (CM) parameters, and uses the Representational State Transfer Application Programming Interface (Representational State Transfer Application Programming Interface). ; Rest API) The parameters of performance management (PM) data and configuration management (CM) data are sent to the diagnostic module 30 (such as a diagnostic server), and the diagnostic module 30 uses the key performance indicator (KPI) formula database 90 Calculate or convert the parameters of performance management (PM) data and configuration management (CM) data into relative According to the key performance indicators (KPI), the wireless network quality information of the network communication equipment 10 (such as small base station equipment) is presented based on the key performance indicators (KPI). Moreover, the diagnostic module 30 (such as a diagnostic server) records 89 key performance indicators (KPIs) and event logs of all equipment for a period of time (such as every 15 minutes), and at the same time performs data pre-processing, merging, and Model building and obstacle prediction.

下列表1為本發明之關鍵績效指標(KPI)與公式。在第3圖所示網路通訊設備10(如小型基地台設備)之關鍵績效指標(KPI)與障礙事件日誌101(事件紀錄日誌)之資料處理中,網路通訊設備10之無線網路品質資訊可由例如89項關鍵績效指標(KPI)呈現,亦可包括時間與設備識別碼(ID)。每一項關鍵績效指標(KPI)皆有相對應之公式,而公式內之參數即為網路通訊設備10定期帶上來之效能管理(PM)資料與組態管理(CM)資料之參數。 Table 1 below shows the key performance indicators (KPI) and formulas of the present invention. In the data processing of the key performance indicators (KPI) of the network communication equipment 10 (such as small base station equipment) and the obstacle event log 101 (event record log) shown in Figure 3, the wireless network quality of the network communication equipment 10 The information can be presented by, for example, 89 key performance indicators (KPIs), and can also include time and device identification numbers (ID). Each key performance indicator (KPI) has a corresponding formula, and the parameters in the formula are the parameters of performance management (PM) data and configuration management (CM) data regularly brought up by the network communication device 10.

Figure 108132058-A0101-12-0010-1
Figure 108132058-A0101-12-0010-1

下列表2為本發明之障礙事件日誌之內容。舉例而言,設備的障礙事件日誌可分為18種障礙類別,每一筆障礙事件日誌被記錄於第3圖所示資料庫22(如ACS資料庫)中,且診斷模組30(如診斷伺服器)會定期讀取資料庫22之最新日誌。從障礙事件日誌之內容可得知障礙發生之日期 時間(例如2018-06-23 09:05:18)、設備識別碼(例如B22539-LTE)與事件內容(例如發生來自ip…之s1連接介面斷線)。從上述關鍵績效指標(KPI)之資訊與障礙事件日誌之內容,可以由障礙發生之日期時間與設備識別碼(ID)合併二者資料。此外,倘若同一個設備在一段時間(如15分鐘)內發生二筆障礙,則合併的過程將會有二筆相同的關鍵績效指標(KPI)之資訊分別對應二筆障礙事件日誌之內容。 Table 2 below is the content of the obstacle event log of the present invention. For example, the obstacle event log of the equipment can be divided into 18 types of obstacles. Each obstacle event log is recorded in the database 22 (such as the ACS database) shown in Figure 3, and the diagnostic module 30 (such as the diagnostic servo Device) will periodically read the latest log of the database 22. From the content of the obstacle event log, you can know the date of the obstacle occurrence Time (e.g. 2018-06-23 09:05:18), device identification code (e.g. B22539-LTE) and event content (e.g. s1 connection interface disconnection from ip...). From the information of the above-mentioned key performance indicators (KPI) and the content of the obstacle event log, the date and time of the occurrence of the obstacle can be combined with the equipment identification code (ID). In addition, if two obstacles occur on the same device within a period of time (such as 15 minutes), there will be two identical key performance indicator (KPI) information corresponding to the content of the two obstacle event logs during the merging process.

Figure 108132058-A0101-12-0011-2
Figure 108132058-A0101-12-0011-2

在資料前處理部分,第3圖所示網路通訊設備10帶上來之效能管理(PM)資料與組態管理(CM)資料之參數偶爾會造成少數參數在計算關鍵績效指標(KPI)時,使關鍵績效指標(KPI)之分母為0,以致計算結果錯誤。因此,診斷模組30可依據各別的障礙事件類別,採用各個障礙事件類別之各項關鍵績效指標(KPI)之平均數、眾數或零填補關鍵績效指標(KPI)之空值。另外,診斷模組30可採用歸一化(Normalization)來優化關鍵績效指標(KPI)之數值,因有些關鍵績效指標(KPI)之數值極大,如上行(Uplink)吞吐量與下行(Downlink)吞吐量(例如以Kbit為單位);或者,診斷模組30 對於有些關鍵績效指標(KPI)之數值是簡單以真假值(如True:1或Flase:0)表示,可將每項關鍵績效指標(KPI)之數值縮放至[0,1]之間,以加速後續模型收斂。然後,診斷模組30可透過標籤編碼(Label Encoding)與一獨熱編碼(One Hot Encoding)將解析後有關網路通訊設備10之(如18+1類)障礙事件類別與正常類別轉換成(如N*19維)多維之矩陣。 In the data pre-processing part, the parameters of performance management (PM) data and configuration management (CM) data brought by the network communication device 10 shown in Figure 3 occasionally cause a small number of parameters to be used in the calculation of key performance indicators (KPI). Make the denominator of the key performance indicator (KPI) 0, so that the calculation result is wrong. Therefore, the diagnostic module 30 can use the average, mode or zero of each key performance indicator (KPI) of each obstacle event category to fill the empty value of the key performance indicator (KPI) according to the respective obstacle event category. In addition, the diagnostic module 30 can use normalization to optimize the value of key performance indicators (KPI), because some key performance indicators (KPI) have extremely large values, such as Uplink throughput and Downlink throughput Quantity (for example, in Kbit); or, the diagnostic module 30 For some key performance indicators (KPI) values are simply true and false values (such as True:1 or Flase:0). The value of each key performance indicator (KPI) can be scaled to [0,1]. To speed up the subsequent model convergence. Then, the diagnostic module 30 can use Label Encoding and One Hot Encoding to convert the parsed network communication equipment 10 (such as category 18+1) obstacle event category and normal category into ( Such as N*19 dimension) multi-dimensional matrix.

本發明從多個關鍵績效指標(KPI)之間的數值變化發現單一障礙事件可同時與多個關鍵績效指標(KPI)之數值變化產生連動。例如,若第2圖所示某一網路通訊設備10(如小型基地台設備)連接安全閘道器(SecGW)60後,突然發生S1-MME(S1-移動管理實體)連接介面斷線,此時設備的S1連接介面所接收之位元組(Bytes received of S1 link interface)之訊務量將會降低,而設備的可用率(Availability Rate)也將降低,但因網路通訊設備10(如小型基地台設備)上仍有相關用戶設備50(見第1圖)持續連線,故用戶設備50之連接數量仍維持不變,故從上述資訊中得知關鍵績效指標(KPI)之間的數值變化與障礙事件有關連存在。所以,本發明之診斷模組30採用例如長短期記憶(Long Short-Term Memory,LSTM)模型作為基礎,以找出關鍵績效指標(KPI)之間的關聯性,進而建置第4圖所示具有機器學習模型120(如遞歸神經網路模型)之機器學習模型架構100(如遞歸神經網路模型架構)。 The present invention finds from the numerical changes between multiple key performance indicators (KPI) that a single obstacle event can be linked to the numerical changes of multiple key performance indicators (KPI) at the same time. For example, if a network communication device 10 (such as a small base station device) shown in Figure 2 is connected to a secure gateway (SecGW) 60, the S1-MME (S1-Mobile Management Entity) connection interface is suddenly disconnected, At this time, the traffic of Bytes received of S1 link interface of the device will decrease, and the availability rate of the device will also decrease. However, due to the 10 ( If there are still related user equipment 50 (see Figure 1) on the small base station equipment), the number of connections of the user equipment 50 remains unchanged. Therefore, it is known from the above information that the key performance indicators (KPI) are between The value change of is related to the obstacle event. Therefore, the diagnostic module 30 of the present invention uses, for example, a long short-term memory (Long Short-Term Memory, LSTM) model as a basis to find out the correlation between key performance indicators (KPIs), and then builds as shown in Figure 4 A machine learning model framework 100 (such as a recurrent neural network model) with a machine learning model 120 (such as a recurrent neural network model).

第4圖為本發明之機器學習模型架構100(如遞歸神經網路模型架構)及其機器學習模型120(如遞歸神經網路模型)之示意圖。如圖所示,在機器學習模型架構100中,可將障礙事件日誌101(事件紀錄日誌)經資料處理(如前處理或合併)以產生障礙標籤102後傳送至資料集110,並將 效能管理(PM)資料與組態管理(CM)資料103經資料處理(如前處理或合併)以產生關鍵績效指標(KPI)104後傳送至資料集110。 FIG. 4 is a schematic diagram of the machine learning model architecture 100 (such as the recurrent neural network model architecture) and its machine learning model 120 (such as the recurrent neural network model) of the present invention. As shown in the figure, in the machine learning model architecture 100, the obstacle event log 101 (event record log) can be processed (such as pre-processing or merging) to generate obstacle tags 102 and then sent to the data set 110, and Performance management (PM) data and configuration management (CM) data 103 are processed (such as pre-processing or merging) to generate key performance indicators (KPI) 104 and then sent to the data set 110.

又,經資料處理後的資料集110分為訓練資料集111與驗證資料集112,可藉由訓練資料集111對例如二層長短期記憶(LSTM)之遞歸神經網路(RNN)進行預訓練,再加入例如四層神經網路(Neural Network;NN),接著由softmax演算法121分析出例如18種障礙類別與正常(normal)類別。在機器學習模型120之訓練過程中,可藉由驗證資料集112驗證機器學習模型120之準確率(accuracy),並在多回合的迭代訓練後,將每個隱藏層(如二層遞歸神經網路(RNN)加上四層神經網路(NN)共六個隱藏層)加入退出層(dropout layer)122中,以防止機器學習模型120之過度擬合(overfitting)。然後,由診斷模組30依據新效能管理(PM)資料與新組態管理(CM)資料105所產生新關鍵績效指標(KPI)106之資訊驗證機器學習模型120之準確率。 In addition, the data set 110 after data processing is divided into a training data set 111 and a verification data set 112. The training data set 111 can be used to pre-train a recurrent neural network (RNN) such as a two-layer long short-term memory (LSTM) , For example, a four-layer neural network (Neural Network; NN) is added, and then the softmax algorithm 121 analyzes, for example, 18 obstacle categories and normal categories. During the training process of the machine learning model 120, the accuracy of the machine learning model 120 can be verified by the verification data set 112, and after multiple rounds of iterative training, each hidden layer (such as a two-layer recurrent neural network) Road (RNN) plus four layers of neural network (NN) (total six hidden layers) are added to the dropout layer 122 to prevent overfitting of the machine learning model 120. Then, the diagnostic module 30 verifies the accuracy of the machine learning model 120 based on the information of the new key performance indicator (KPI) 106 generated by the new performance management (PM) data and the new configuration management (CM) data 105.

下列表3為本發明第1圖中有關障礙事件之權重評估演算法31。舉例而言,設備發生障礙共分為18種類別,而每一種障礙發生之次數並非相同,因此在調和第4圖所示機器學習模型120(如遞歸神經網路模型)之訓練過程中,可由診斷模組30使用權重評估演算法31來優化或調整每一種障礙事件發生之類別權重,以提高分類準確率。 Table 3 below is the weight evaluation algorithm 31 of the obstacle event in Figure 1 of the present invention. For example, there are 18 types of equipment obstacles, and the number of occurrences of each obstacle is not the same. Therefore, in the training process of reconciling the machine learning model 120 (such as the recurrent neural network model) shown in Figure 4, it can be The diagnostic module 30 uses the weight evaluation algorithm 31 to optimize or adjust the category weight of each obstacle event occurrence, so as to improve the classification accuracy.

Figure 108132058-A0101-12-0014-3
Figure 108132058-A0101-12-0014-3

例如,在上述表3與第1圖有關障礙事件之權重評估演算法31中,可定義class_sampleSize代表每一項障礙類別之發生次數,total_sample代表總次數,mu代表在class_sampleSize中擁有最多次數佔total_sample的比例(即

Figure 108132058-A0101-12-0014-4
)。每個障礙事件類別之類別權重為
Figure 108132058-A0101-12-0014-5
,且類別權重之數值最小為1,並以log作為權重評估的正規化。權重評估演算法31主要表示當某一類障礙事件發生次數愈多時,機器學習模型120之訓練過程中,障礙事件之類別權重愈小;反之,當某一類障礙事件發生次數愈少時,機器學習模型120之訓練過程中,障礙事件之類別權重愈大。例如,class_sampleSize={c0:2813,c1:78,c2:1014,c3:510,c4:7914,c5:348},total_sample=12677,mu=12677/7914=1.6018,
Figure 108132058-A0101-12-0014-6
,並依此類推c1至c5的權重,其中c0至c5表示類別。 For example, in the above Table 3 and Figure 1 in the weight evaluation algorithm for obstacle events 31, you can define class_sampleSize to represent the number of occurrences of each obstacle category, total_sample to represent the total number of times, and mu to represent the number of occurrences in class_sampleSize that account for the largest number of total_sample Ratio (i.e.
Figure 108132058-A0101-12-0014-4
). The category weight of each obstacle event category is
Figure 108132058-A0101-12-0014-5
, And the minimum value of the category weight is 1, and log is used as the normalization of the weight evaluation. The weight evaluation algorithm 31 mainly means that when the number of occurrences of a certain type of obstacle event increases, the weight of the type of obstacle event becomes smaller during the training process of the machine learning model 120; conversely, when the number of occurrences of a certain type obstacle event decreases, the machine learning In the training process of the model 120, the larger the weight of the obstacle event category. For example, class_sampleSize={c0:2813,c1:78,c2:1014,c3:510,c4:7914,c5:348}, total_sample=12677, mu=12677/7914=1.6018,
Figure 108132058-A0101-12-0014-6
, And so on, the weights of c1 to c5, where c0 to c5 represent categories.

下列以一個具體實施例說明本發明之運作方式。首先,如第 1圖所示網路通訊設備10之網路架構,以監管網路通訊設備10(如小型基地台設備)為例,假設某一電信運營商目前部署約750個網路通訊設備10於全台地區,包括全台之超商、宴會廳、高鐵、醫院與警察局等公共場所。同時,網路通訊設備10(如小型基地台設備)透過網路(如廣域網路)之連接至安全閘道器(SecGW)60,並由網管模組20(如EMS網管模組或EMS網管系統)透過通訊協定納管網路通訊設備10之供裝、監控、參數設定或(及)關鍵績效指標(KPI)之管理等功能。 A specific example is given below to illustrate the operation of the present invention. First of all, as the first Figure 1 shows the network architecture of the network communication equipment 10, taking the supervision of the network communication equipment 10 (such as small base station equipment) as an example, suppose a certain telecom operator currently deploys about 750 network communication equipment 10 in the whole station Areas, including public places such as supermarkets, banquet halls, high-speed rail, hospitals and police stations throughout Taiwan. At the same time, network communication equipment 10 (such as small base station equipment) is connected to a secure gateway (SecGW) 60 through a network (such as a wide area network), and is managed by a network management module 20 (such as an EMS network management module or an EMS network management system). ) Manage functions such as installation, monitoring, parameter setting, or (and) key performance indicator (KPI) management of the network communication equipment 10 through the communication protocol.

接著,如第3圖所示網路通訊設備10(如小型基地台設備)之網路架構。網路通訊設備10每一段時間(如每15分鐘)透過通訊協定傳送效能管理(PM)資料與組態管理(CM)資料至網管模組20之伺服器21(如自動組態伺服器(ACS))以儲存於資料庫22中,並將設備所發生有關障礙事件之障礙事件日誌一併轉發至伺服器21以記錄於資料庫22(如ACS資料庫)。又,伺服器21接收例如約二百多個效能管理(PM)資料與組態管理(CM)資料之參數,以由表現層狀態轉換應用程式介面(Rest API)傳送效能管理(PM)資料與組態管理(CM)資料之參數至診斷模組30(如診斷伺服器),且由診斷模組30將效能管理(PM)資料與組態管理(CM)資料之參數透過關鍵績效指標(KPI)公式資料庫90轉換成關鍵績效指標(KPI),以依據關鍵績效指標(KPI)呈現網路通訊設備10(如小型基地台設備)之無線網路品質資訊。然後,診斷模組30記錄所有設備每一段時間(如每15分鐘)的89項關鍵績效指標(KPI)與事件日誌,同時進行設備發生障礙診斷之資料前處理、合併、機器學習模型120(見第4圖)之建置與障礙預測。 Next, as shown in Figure 3, the network architecture of the network communication device 10 (such as a small base station device). The network communication device 10 transmits performance management (PM) data and configuration management (CM) data to the server 21 of the network management module 20 (such as the automatic configuration server (ACS) )) to store in the database 22, and forward the obstacle event log of the equipment related obstacle events to the server 21 for recording in the database 22 (such as the ACS database). In addition, the server 21 receives, for example, more than two hundred performance management (PM) data and configuration management (CM) data parameters, and transmits performance management (PM) data and parameters through the presentation layer state transition application program interface (Rest API) The configuration management (CM) data parameters are sent to the diagnostic module 30 (such as a diagnostic server), and the performance management (PM) data and configuration management (CM) data parameters are passed through the key performance indicator (KPI) by the diagnostic module 30 ) The formula database 90 is converted into a key performance indicator (KPI) to present the wireless network quality information of the network communication device 10 (such as a small base station device) according to the key performance indicator (KPI). Then, the diagnostic module 30 records 89 key performance indicators (KPIs) and event logs of all equipment for a period of time (such as every 15 minutes), and at the same time performs data pre-processing, merging, and machine learning model 120 (see Figure 4) Construction and obstacle prediction.

此具體實施例收集107年5月1日至8月30日(共計123 日),有關設備的效能管理(PM)資料、組態管理(CM)資料、設備紀錄(Log)、設備發生的障礙事件日誌(障礙事件紀錄)等資料。這些資料於第4圖所示診斷模組30(如診斷伺服器)經關鍵績效指標(KPI)計算、資料處理與合併後,例如共有5,050,010筆關鍵績效指標(KPI)之資訊,每一筆關鍵績效指標(KPI)之資訊具有89個關鍵績效指標(KPI),其中的10,986筆關鍵績效指標(KPI)之資訊有障礙事件紀錄。 This specific example collects from May 1 to August 30, 107 (a total of 123 Day), related equipment performance management (PM) data, configuration management (CM) data, equipment record (Log), equipment incident log (obstacle event record) and other data. These data are calculated, processed and merged by key performance indicators (KPI) in the diagnostic module 30 (such as the diagnostic server) shown in Figure 4. For example, a total of 5,050,010 key performance indicators (KPI) information, each key performance KPI information has 89 key performance indicators (KPIs), of which 10,986 key performance indicators (KPI) information have a record of obstacles.

如第4圖所示之機器學習模型架構100(如遞歸神經網路模型架構)。例如,訓練資料集111在107年5月1日至8月15日,共有4,099,340筆關鍵績效指標(KPI)之資訊及9,834筆障礙事件紀錄。驗證資料集112在107年8月16日至8月30日,共有891,815筆關鍵績效指標(KPI)之資訊及1,053筆障礙事件紀錄。測試資料集在107年8月31日,共有58,855筆關鍵績效指標(KPI)之資訊及99筆障礙事件紀錄。機器學習模型120之歷元(Epoch)約1000次,批量大小(Batch Size)約128至256,機器學習模型120之隱藏層之激活函數(activation function)採用ReLU(Rectified Linear Unit;修正線性單元)與softmax演算法121,各隱藏層約有64至128個神經元,最後分為19種類別(包括18種障礙類別與1種正常類別),其中網路通訊設備10(如小型基地台設備)之障礙項目如第2圖所示。 The machine learning model architecture 100 shown in Figure 4 (such as the recurrent neural network model architecture). For example, in the training data set 111 from May 1 to August 15, 107, there were 4,099,340 key performance indicator (KPI) information and 9,834 obstacle event records. From August 16th to August 30th, 2007, the verification data set 112 has a total of 891,815 key performance indicator (KPI) information and 1,053 obstacle event records. The test data set was on August 31, 107, with a total of 58,855 key performance indicator (KPI) information and 99 obstacle event records. The Epoch of the machine learning model 120 is about 1000 times, and the batch size is about 128 to 256. The activation function of the hidden layer of the machine learning model 120 uses ReLU (Rectified Linear Unit; modified linear unit). Compared with the softmax algorithm 121, each hidden layer has about 64 to 128 neurons, and finally is divided into 19 categories (including 18 obstacle categories and 1 normal category), of which 10 are network communication equipment (such as small base station equipment) The obstacles are shown in Figure 2.

第5A圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之方法中有關障礙診斷模型40之建置流程圖,且一併參閱第1圖。在第5A圖之障礙診斷模型40之建置流程中,由第1圖所示網管模組20收集網路通訊設備10之效能管理(PM)資料、組態管理(CM)資料、設備紀 錄(Log)與障礙事件等設備資訊,再由第1圖所示診斷模組30(如診斷伺服器)依據效能管理(PM)資料與組態管理(CM)資料計算出或轉換成例如LTE關鍵績效指標(KPI)之關鍵績效指標(KPI),以進一步建置障礙診斷模型40。 FIG. 5A is a flow chart of the establishment of the obstacle diagnosis model 40 in the method for predicting and preventing the occurrence of obstacles in network communication equipment based on machine learning of the present invention, and refer to FIG. 1 as well. In the construction process of the obstacle diagnosis model 40 in Fig. 5A, the network management module 20 shown in Fig. 1 collects performance management (PM) data, configuration management (CM) data, and equipment records of the network communication equipment 10 Equipment information such as logs and obstacle events is calculated or converted into LTE by the diagnostic module 30 (such as a diagnostic server) shown in Figure 1 based on performance management (PM) data and configuration management (CM) data. Key performance indicators (KPIs) are key performance indicators (KPIs) to further build obstacle diagnosis models40.

舉例而言,在第5A圖之步驟S11中,由第1圖所示網管模組20(如EMS網管模組或EMS網管系統)收集來自網路通訊設備10(如小型基地台設備)之效能管理(PM)資料、組態管理(CM)資料、設備紀錄(Log)與障礙事件等設備資訊。接著,在第5A圖之步驟S12中,由第1圖所示診斷模組30(如診斷伺服器)連接或通訊網管模組20,以將來自網路通訊設備10之效能管理(PM)資料與組態管理(CM)資料等設備資訊,透過例如3GPP規範之LTE關鍵績效指標(KPI)計算出或轉換成相應之關鍵績效指標(KPI)。然後,在第5A圖之步驟S13、步驟S14及步驟S15中,由診斷模組30建立有關障礙事件之權重評估演算法31以計算或調整障礙事件之權重,並由診斷模組30以關鍵績效指標(KPI)作為輸入因子及以障礙事件作為輸出結果來建置機器學習模型作為障礙診斷模型40,再由診斷模組30輸出障礙診斷模型40。 For example, in step S11 in Figure 5A, the network management module 20 (such as an EMS network management module or an EMS network management system) shown in Figure 1 collects the performance from the network communication equipment 10 (such as small base station equipment) Management (PM) data, configuration management (CM) data, equipment records (Log) and equipment information such as obstacle events. Then, in step S12 of FIG. 5A, the diagnostic module 30 (such as a diagnostic server) shown in FIG. 1 is connected to or the communication network management module 20 to transfer the performance management (PM) data from the network communication device 10 Equipment information such as configuration management (CM) data is calculated or converted into corresponding key performance indicators (KPIs) through, for example, the LTE key performance indicators (KPI) specified by 3GPP. Then, in step S13, step S14, and step S15 in Figure 5A, the diagnostic module 30 establishes a weight evaluation algorithm 31 related to the obstacle event to calculate or adjust the weight of the obstacle event, and the diagnostic module 30 determines the key performance The index (KPI) is used as an input factor and the obstacle event is used as an output result to build a machine learning model as the obstacle diagnosis model 40, and then the diagnosis module 30 outputs the obstacle diagnosis model 40.

第5B圖為本發明之基於機器學習預測與防範網路通訊設備發生障礙之方法中有關網路通訊設備10之障礙診斷流程圖,且一併參閱第1圖。在第5B圖之網路通訊設備10之障礙診斷流程中,由網管模組20收集來自網路通訊設備10之新效能管理(PM)資料與新組態管理(CM)資料,以由診斷模組30將新效能管理(PM)資料與新組態管理(CM)資料計算出或轉換成相應之新關鍵績效指標(KPI),再將新關鍵績效指標(KPI)輸入障礙診斷模型40。然後,由診斷模組30透過障礙診斷模型40利用新關鍵績效 指標(KPI)預測或診斷網路通訊設備10是否可能發生或出現障礙,俾於預測或診斷出網路通訊設備10可能發生或出現障礙時,由診斷模組30調整網路通訊設備10之設備參數,例如調整用戶設備50之連線數上限來減少網路通訊設備10之負載,以防範或防止網路通訊設備10發生障礙。 FIG. 5B is a flowchart of the obstacle diagnosis of the network communication device 10 in the method for predicting and preventing network communication device obstacles based on machine learning of the present invention, and also refer to FIG. 1. In the failure diagnosis process of the network communication equipment 10 in Fig. 5B, the network management module 20 collects the new performance management (PM) data and the new configuration management (CM) data from the network communication equipment 10 for the diagnosis model The group 30 calculates or converts the new performance management (PM) data and the new configuration management (CM) data into corresponding new key performance indicators (KPI), and then inputs the new key performance indicators (KPI) into the obstacle diagnosis model 40. Then, the diagnostic module 30 utilizes the new key performance through the obstacle diagnosis model 40 Indicators (KPI) predict or diagnose whether the network communication device 10 may occur or appear to be faulty. When it is predicted or diagnosed that the network communication device 10 may occur or appear to be faulty, the diagnostic module 30 adjusts the equipment of the network communication device 10 Parameters, such as adjusting the upper limit of the number of connections of the user equipment 50 to reduce the load of the network communication device 10, to prevent or prevent the network communication device 10 from malfunctioning.

舉例而言,在第5B圖之步驟S21中,由第1圖所示網管模組20(如EMS網管模組或EMS網管系統)收集來自網路通訊設備10(如小型基地台設備)之效能管理(PM)資料、組態管理(CM)資料、設備紀錄(Log)與障礙事件等設備資訊。接著,在第5B圖之步驟S22中,由第1圖所示診斷模組30(如診斷伺服器)連接或通訊網管模組20,以將來自網路通訊設備10之效能管理(PM)資料與組態管理(CM)資料等設備資訊,透過例如3GPP規範之LTE關鍵績效指標(KPI)計算出或轉換成相應之關鍵績效指標(KPI)。然後,在第5B圖之步驟S23及步驟S24中,由診斷模組30輸入網路障礙特徵至障礙診斷模型40,並在發生障礙時,由診斷模組30依據障礙診斷模型40預防網路通訊設備10之障礙與調整網路通訊設備10之設備參數。 For example, in step S21 in Figure 5B, the network management module 20 (such as EMS network management module or EMS network management system) shown in Figure 1 collects the performance from the network communication equipment 10 (such as small base station equipment) Management (PM) data, configuration management (CM) data, equipment records (Log) and equipment information such as obstacle events. Then, in step S22 of FIG. 5B, the diagnostic module 30 (such as a diagnostic server) shown in FIG. 1 is connected to or the communication network management module 20 to transfer the performance management (PM) data from the network communication device 10 Equipment information such as configuration management (CM) data is calculated or converted into corresponding key performance indicators (KPIs) through, for example, the LTE key performance indicators (KPI) specified by 3GPP. Then, in step S23 and step S24 in Figure 5B, the diagnosis module 30 inputs the network obstacle characteristics to the obstacle diagnosis model 40, and when an obstacle occurs, the diagnosis module 30 prevents network communication according to the obstacle diagnosis model 40 Obstacles of the device 10 and adjustment of the device parameters of the network communication device 10.

第6A圖、第6B圖與第6C圖分別為本發明中第一測試案例之模型驗證準確率(accuracy)、模型驗證精確率(precision)與模型驗證招回率(recall)之曲線圖,第7A圖、第7B圖與第7C圖分別為本發明中第二測試案例之模型驗證準確率、模型驗證精確率與模型驗證招回率之曲線圖。同時,此具體實施例可分為二種測試案例,第6A圖至第6C圖所示第一測試案例未採用本發明中有關障礙事件之權重評估演算法(即未調整障礙事件之權重),第7A圖至第7C圖所示第二測試案例有採用本發明中有關障 礙事件之權重評估演算法(即有調整障礙事件之權重)。 Fig. 6A, Fig. 6B and Fig. 6C are graphs of model verification accuracy, model verification accuracy and model verification recall rate (recall) of the first test case of the present invention, respectively. Fig. 7A, Fig. 7B and Fig. 7C are respectively the graphs of the model verification accuracy rate, the model verification accuracy rate and the model verification recall rate of the second test case of the present invention. At the same time, this specific embodiment can be divided into two types of test cases. The first test case shown in Fig. 6A to Fig. 6C does not use the obstacle event weight evaluation algorithm of the present invention (that is, the weight of the obstacle event is not adjusted). The second test case shown in Fig. 7A to Fig. 7C has the related obstacles in the present invention. The algorithm for evaluating the weight of obstacle events (that is, the weight of events with adjustment obstacles).

第一測試案(未採用本發明之權重評估演算法來調整障礙事件之權重)在次數1000時,第6A圖所示之模型驗證準確率於點A1達到98.3%(即0.983),第6B圖所示之模型驗證精確率於點A2達到27.5%(即0.275),第6C圖所示之模型驗證招回率於點A3達到98%(即0.98),且F1分數(F1-Scroe)為42.95。反之,第二測試案(有採用本發明之權重評估演算法來調整障礙事件之權重)在次數1000時,第7A圖所示之模型驗證準確率於點B1達到96%(即0.96),第7B圖所示之模型驗證精確率於點B2達到53.5%,第7C圖所示之模型驗證招回率於點B3達到92.2%,且F1分數為67.66。從上述測試結果來看,第7A圖至第7C圖所示第二測試案有採用本發明之權重評估演算法以調整障礙事件之權重時,模型驗證精確率大幅提升26%,而模型驗證準確率與模型驗證招回率雖較第一測試案有小幅下降2%至6%,但皆仍維持在90%以上,且F1分數提升24.71。因此,第一測試案與第二測試案之測試結果可參閱下列表4所示。 The first test case (the weight evaluation algorithm of the present invention is not used to adjust the weight of the obstacle event) when the number of times is 1000, the model verification accuracy shown in Figure 6A reaches 98.3% (ie 0.983) at point A1, Figure 6B The model verification accuracy rate shown in point A2 reached 27.5% (ie 0.275), the model verification recall rate shown in Figure 6C reached 98% (ie 0.98) at point A3, and the F1 score (F1-Scroe) was 42.95 . On the contrary, in the second test case (the weight evaluation algorithm of the present invention is used to adjust the weight of the obstacle event) when the number of times is 1000, the model verification accuracy shown in Figure 7A reaches 96% (ie 0.96) at point B1. The model verification accuracy rate shown in Figure 7B reached 53.5% at point B2, the model verification recall rate shown in Figure 7C reached 92.2% at point B3, and the F1 score was 67.66. From the above test results, when the second test case shown in Fig. 7A to Fig. 7C adopts the weight evaluation algorithm of the present invention to adjust the weight of obstacle events, the model verification accuracy rate is greatly improved by 26%, and the model verification is accurate. Although the accuracy rate and model verification recall rate decreased slightly from 2% to 6% compared with the first test case, they still remained above 90%, and the F1 score increased by 24.71. Therefore, the test results of the first test case and the second test case can be seen in Table 4 below.

Figure 108132058-A0101-12-0019-7
Figure 108132058-A0101-12-0019-7

綜上,本發明之基於機器學習預測與防範網路通訊設備發生障礙之系統及方法可至少具有下列特色、優點或技術功效。 In summary, the system and method for predicting and preventing obstacles in network communication equipment based on machine learning of the present invention can at least have the following characteristics, advantages, or technical effects.

一、本發明採用新興的機器學習技術,將有助於例如電信營運商在第5代(5G)網路或更先進網路中建立知識基礎模式之障礙診斷方法,以達成智慧維運(AIOps)之目標,俾預測與防範網路通訊設備發生障礙。 1. The present invention adopts emerging machine learning technology, which will help, for example, telecom operators establish a knowledge-based model of obstacle diagnosis method in the 5th generation (5G) network or more advanced network, so as to achieve intelligent maintenance (AIOps). The goal of) is to predict and prevent obstacles to network communication equipment.

二、本發明能在初始階段介接網路通訊設備之資訊,並大量收集網路通訊設備之效能管理(PM)資料、組態管理(CM)資料、設備紀錄(Log)與障礙事件等資訊,以建構機器學習模型(如遞歸神經網路模型)和計算最適的類別權重,進而快速預測或診斷網路通訊設備是否發生障礙或其機率。 2. The present invention can interface with the information of network communication equipment in the initial stage, and collect a large amount of information such as performance management (PM) data, configuration management (CM) data, equipment log (Log) and obstacle events of the network communication equipment , In order to construct a machine learning model (such as a recurrent neural network model) and calculate the most suitable category weight, and then quickly predict or diagnose whether the network communication equipment is malfunctioning or its probability.

三、本發明提供或設計有關障礙事件之權重評估演算法,能在不平衡的網路通訊設備之資料集中提升模型學習準確率。而且,在調和機器學習模型(如遞歸神經網路模型)之訓練過程中,使用權重評估演算法能優化或調整每一種障礙事件發生之類別權重,以提高分類準確率。 3. The present invention provides or designs a weight evaluation algorithm related to obstacle events, which can improve the accuracy of model learning in the data set of unbalanced network communication equipment. Moreover, in the training process of reconciling machine learning models (such as recurrent neural network models), the use of weight evaluation algorithms can optimize or adjust the category weights of each obstacle event to improve the classification accuracy.

四、本發明能用於各種網路通訊設備之障礙偵測與預防,以減少例如電信運營商在網路通訊設備之維運成本與時間。 4. The present invention can be used for obstacle detection and prevention of various network communication equipment, so as to reduce the maintenance cost and time of, for example, the telecommunication operator in the network communication equipment.

五、本發明可能應用之產業為電信產業、通訊產業等,且可能應用之產品為網路管理服務與系統、通訊設備管理服務與系統等。 5. The possible applications of the present invention are the telecommunications industry, communication industry, etc., and the possible applications of the products are network management services and systems, communication equipment management services and systems, etc.

上述實施形態僅例示性說明本發明之原理、特點及其功效,並非用以限制本發明之可實施範疇,任何熟習此項技藝之人士均能在不違背本發明之精神及範疇下,對上述實施形態進行修飾與改變。任何運用本發明所揭示內容而完成之等效改變及修飾,均仍應為申請專利範圍所涵蓋。 因此,本發明之權利保護範圍,應如申請專利範圍所列。 The above embodiments are only illustrative of the principles, features and effects of the present invention, and are not intended to limit the scope of implementation of the present invention. Anyone familiar with the art can comment on the above without departing from the spirit and scope of the present invention. Modifications and changes to the implementation form. Any equivalent changes and modifications made by using the content disclosed in the present invention should still be covered by the scope of the patent application. Therefore, the protection scope of the present invention should be as listed in the scope of the patent application.

1‧‧‧基於機器學習預測與防範網路通訊設備發生障礙之系統 1. A system based on machine learning to predict and prevent obstacles in network communication equipment

10‧‧‧網路通訊設備 10‧‧‧Network communication equipment

20‧‧‧網管模組 20‧‧‧Network Management Module

30‧‧‧診斷模組 30‧‧‧Diagnostic Module

31‧‧‧權重評估演算法 31‧‧‧Weight evaluation algorithm

40‧‧‧障礙診斷模型 40‧‧‧Disorder diagnosis model

50‧‧‧用戶設備 50‧‧‧User Equipment

60‧‧‧安全閘道器 60‧‧‧Security Gateway

70‧‧‧演進節點B閘道器 70‧‧‧Evolution Node B Gateway

80‧‧‧核心網路 80‧‧‧Core network

CM‧‧‧組態管理(資料) CM‧‧‧Configuration Management (Data)

IPSec‧‧‧網際網路安全協定(連接介面) IPSec‧‧‧Internet Security Protocol (connection interface)

KPI‧‧‧關鍵績效指標 KPI‧‧‧Key Performance Indicators

PM‧‧‧效能管理(資料) PM‧‧‧Performance Management (Data)

S1‧‧‧連接介面 S1‧‧‧Connecting interface

WAN‧‧‧廣域網路 WAN‧‧‧Wide Area Network

Claims (20)

一種基於機器學習預測與防範網路通訊設備發生障礙之系統,包括:至少一網路通訊設備;一網管模組,係收集來自該網路通訊設備之效能管理(PM)資料、組態管理(CM)資料與障礙事件;以及一診斷模組,係將該網管模組所收集之該網路通訊設備之效能管理(PM)資料與組態管理(CM)資料計算出或轉換成相應之關鍵績效指標(KPI),且該診斷模組以該關鍵績效指標(KPI)作為輸入因子及以該障礙事件作為輸出結果來建置具有隱藏層與退出層之機器學習模型作為障礙診斷模型,其中,在該機器學習模型之訓練過程中,藉由驗證資料集驗證具有該隱藏層與該退出層之該機器學習模型之準確率,且將該機器學習模型之該隱藏層加入該退出層中,及其中,該診斷模組係將由該網路通訊設備之新效能管理(PM)資料與新組態管理(CM)資料計算出或轉換而成之新關鍵績效指標(KPI)輸入為具有該隱藏層與該退出層之該機器學習模型之該障礙診斷模型,以由該診斷模組透過為具有該隱藏層與該退出層之該機器學習模型之該障礙診斷模型利用該新關鍵績效指標(KPI)預測或診斷該網路通訊設備是否可能發生障礙,俾於預測或診斷出該網路通訊設備可能發生障礙時,由該診斷模組調整該網路通訊設備之設備參數以防範或防止該網路通訊設備發生障礙。 A system based on machine learning to predict and prevent obstacles in network communication equipment, including: at least one network communication device; a network management module, which collects performance management (PM) data and configuration management ( CM) data and obstacle events; and a diagnostic module, which calculates or converts the performance management (PM) data and configuration management (CM) data of the network communication equipment collected by the network management module into the corresponding key Performance indicators (KPI), and the diagnostic module uses the KPI as an input factor and the obstacle event as an output result to build a machine learning model with a hidden layer and an exit layer as the obstacle diagnosis model, where: During the training process of the machine learning model, verify the accuracy of the machine learning model with the hidden layer and the exit layer by verifying the data set, and add the hidden layer of the machine learning model to the exit layer, and Among them, the diagnostic module inputs new key performance indicators (KPIs) calculated or converted from new performance management (PM) data and new configuration management (CM) data of the network communication equipment as having the hidden layer The obstacle diagnosis model of the machine learning model with the exit layer, so that the diagnosis module uses the new key performance indicator (KPI) through the obstacle diagnosis model of the machine learning model with the hidden layer and the exit layer Predict or diagnose whether the network communication equipment may be malfunctioning. When it is predicted or diagnosed that the network communication equipment may be malfunctioning, the diagnostic module adjusts the equipment parameters of the network communication equipment to prevent or prevent the network Obstacles to communication equipment. 如申請專利範圍第1項所述之系統,其中,該診斷模組採用該機器學習模型建立知識基礎模式之障礙診斷方法,且該機器學習模型為包括複數遞歸神經網路(RNN)層與複數神經元之遞歸神經網路模型。 For example, the system described in item 1 of the scope of patent application, wherein the diagnostic module uses the machine learning model to establish a knowledge-based obstacle diagnosis method, and the machine learning model includes a complex recurrent neural network (RNN) layer and complex numbers Recurrent neural network model of neurons. 如申請專利範圍第1項所述之系統,其中,該網路通訊設備每一段時間上傳該效能管理(PM)資料、該組態管理(CM)資料與設備紀錄(Log)至該網管模組,且該網管模組納管該網路通訊設備之供裝、監控、參數設定或該關鍵績效指標(KPI)之管理。 Such as the system described in item 1 of the scope of patent application, wherein the network communication device uploads the performance management (PM) data, the configuration management (CM) data and the device log (Log) to the network management module every period of time , And the network management module manages the installation, monitoring, parameter setting of the network communication equipment or the management of the key performance indicator (KPI). 如申請專利範圍第1項所述之系統,其中,該網管模組係具有一伺服器或資料庫,供該網路通訊設備將該效能管理(PM)資料與該組態管理(CM)資料傳送至該網管模組以儲存於該伺服器或資料庫中,且該網路通訊設備係將有關該障礙事件之障礙事件日誌一併轉發至該網管模組以記錄於該伺服器或資料庫中。 For example, the system described in item 1 of the scope of patent application, wherein the network management module has a server or database for the network communication equipment to perform the performance management (PM) data and the configuration management (CM) data Send to the network management module for storage in the server or database, and the network communication equipment forwards the obstacle event log related to the obstacle event to the network management module for recording in the server or database middle. 如申請專利範圍第1項所述之系統,更包括一關鍵績效指標(KPI)公式資料庫,其中,該網管模組係透過表現層狀態轉換應用程式介面(Rest API)將該效能管理(PM)資料與該組態管理(CM)資料之參數傳送至該診斷模組,以由該診斷模組透過該關鍵績效指標(KPI)公式資料庫將該效能管理(PM)資料與組態管理(CM)資料之參數計算出或轉換成相應之該關鍵績效指標(KPI)。 For example, the system described in item 1 of the scope of patent application further includes a key performance indicator (KPI) formula database, in which the network management module uses the performance management (PM) ) Data and parameters of the configuration management (CM) data are sent to the diagnostic module, so that the diagnostic module uses the key performance indicator (KPI) formula database to manage the performance management (PM) data and configuration ( CM) data parameters are calculated or converted into corresponding key performance indicators (KPI). 如申請專利範圍第1項所述之系統,其中,該診斷模組更建立有關該障礙事件之權重評估演算法,以在該機器學習模型之訓練過程中,由該診斷模組使用該權重評估演算法來優化或調整該障礙事件之類別權重。 For example, the system described in item 1 of the scope of patent application, wherein the diagnostic module further establishes a weight evaluation algorithm related to the obstacle event, so that the diagnostic module uses the weight evaluation during the training process of the machine learning model Algorithm to optimize or adjust the category weight of the obstacle event. 如申請專利範圍第1項所述之系統,其中,該診斷模組計算該關鍵績效指標(KPI)時,係採用各個障礙事件類別之關鍵績效指標(KPI)之平均數、眾數或零填補該關鍵績效指標(KPI)之空值。 For example, the system described in item 1 of the scope of patent application, wherein the diagnostic module calculates the key performance indicator (KPI) by using the average, mode, or zero fill of the key performance indicators (KPI) of each obstacle category The null value of the key performance indicator (KPI). 如申請專利範圍第1項所述之系統,其中,該診斷模組係採用歸一化來優化該關鍵績效指標(KPI)之數值,或以真假值表示該關鍵績效指標(KPI)之數值。 Such as the system described in item 1 of the scope of patent application, wherein the diagnostic module adopts normalization to optimize the value of the key performance indicator (KPI), or the value of the key performance indicator (KPI) is expressed as a true or false value . 如申請專利範圍第1項所述之系統,其中,該診斷模組係透過標籤編碼與一獨熱編碼將有關該網路通訊設備之障礙事件類別與正常類別轉換成多維之矩陣。 For example, in the system described in item 1 of the scope of patent application, the diagnostic module converts the obstacle event category and the normal category of the network communication equipment into a multi-dimensional matrix through a tag code and a one-hot code. 如申請專利範圍第1項所述之系統,其中,該診斷模組係採用長短期記憶(LSTM)模型作為基礎,以找出該關鍵績效指標(KPI)之間的關聯性,進而建置該機器學習模型。 For example, the system described in item 1 of the scope of the patent application, wherein the diagnostic module uses a long and short-term memory (LSTM) model as a basis to find the correlation between the key performance indicators (KPI), and then build the Machine learning model. 一種基於機器學習預測與防範網路通訊設備發生障礙之方法,包括:令一網管模組收集來自至少一網路通訊設備之效能管理(PM)資料、組態管理(CM)資料與障礙事件;令一診斷模組將該網管模組所收集之該網路通訊設備之效能管理(PM)資料與組態管理(CM)資料計算出或轉換成相應之關鍵績效指標(KPI),且令該診斷模組以該關鍵績效指標(KPI)作為輸入因子及以該障礙事件作為輸出結果來建置具有隱藏層與退出層之機器學習模型作為障礙診斷模型,其中,在該機器學習模型之訓練過程中,藉由驗證資料集驗證具有該隱藏層與該退出層之該機器學習模型之準確率,且將該機器學習模型之該隱藏層加入該退出層中;以及令該診斷模組將由該網路通訊設備之新效能管理(PM)資料與新組態管理(CM)資料計算出或轉換而成之新關鍵績效指標(KPI)輸入為具有該隱藏層與該退出層之該機器學習模型之該障礙診斷模型,以令該診斷模組透過 為具有該隱藏層與該退出層之該機器學習模型之該障礙診斷模型利用該新關鍵績效指標(KPI)預測或診斷該網路通訊設備是否可能發生障礙,俾於預測或診斷出該網路通訊設備可能發生障礙時,令該診斷模組調整該網路通訊設備之設備參數以防範或防止該網路通訊設備發生障礙。 A method for predicting and preventing obstacles in network communication equipment based on machine learning includes: ordering a network management module to collect performance management (PM) data, configuration management (CM) data and obstacle events from at least one network communication equipment; Make a diagnostic module calculate or convert the performance management (PM) data and configuration management (CM) data of the network communication equipment collected by the network management module into corresponding key performance indicators (KPI), and make the The diagnosis module uses the key performance indicator (KPI) as an input factor and the obstacle event as an output result to build a machine learning model with a hidden layer and an exit layer as an obstacle diagnosis model. In the training process of the machine learning model In the process, the accuracy of the machine learning model with the hidden layer and the exit layer is verified by the verification data set, and the hidden layer of the machine learning model is added to the exit layer; and the diagnostic module will be used by the network The input of the new key performance indicator (KPI) calculated or converted from the new performance management (PM) data and the new configuration management (CM) data of the communication equipment is the machine learning model with the hidden layer and the exit layer. The obstacle diagnosis model so that the diagnosis module can pass The obstacle diagnosis model, which is the machine learning model with the hidden layer and the exit layer, uses the new key performance indicator (KPI) to predict or diagnose whether the network communication device may have an obstacle, so as to predict or diagnose the network When the communication equipment may be obstructed, the diagnostic module is made to adjust the equipment parameters of the network communication equipment to prevent or prevent the network communication equipment from malfunctioning. 如申請專利範圍第11項所述之方法,其中,該診斷模組係採用該機器學習模型建立知識基礎模式之障礙診斷方法,且該機器學習模型為包括複數遞歸神經網路(RNN)層與複數神經元之遞歸神經網路模型。 For example, the method described in item 11 of the scope of patent application, wherein the diagnostic module uses the machine learning model to establish a knowledge-based obstacle diagnosis method, and the machine learning model includes a complex recurrent neural network (RNN) layer and A recurrent neural network model of complex neurons. 如申請專利範圍第11項所述之方法,更包括令該網路通訊設備每一段時間上傳該效能管理(PM)資料、該組態管理(CM)資料與設備紀錄(Log)至該網管模組,且令該網管模組納管該網路通訊設備之供裝、監控、參數設定或該關鍵績效指標(KPI)之管理。 For example, the method described in item 11 of the scope of patent application further includes enabling the network communication device to upload the performance management (PM) data, the configuration management (CM) data and the device log (Log) to the network management module every period of time. Group, and make the network management module take over the supply, monitoring, parameter setting of the network communication equipment or the management of the key performance indicator (KPI). 如申請專利範圍第11項所述之方法,更包括令該網路通訊設備將該效能管理(PM)資料與該組態管理(CM)資料傳送至該網管模組以儲存於該網管模組之一伺服器或資料庫中,且令該網路通訊設備將有關該障礙事件之障礙事件日誌一併轉發至該網管模組以記錄於該伺服器或資料庫中。 For example, the method described in item 11 of the scope of patent application further includes making the network communication device transmit the performance management (PM) data and the configuration management (CM) data to the network management module for storage in the network management module In a server or database, the network communication device is allowed to forward the obstacle event log related to the obstacle event to the network management module for recording in the server or database. 如申請專利範圍第11項所述之方法,更包括令該網管模組透過表現層狀態轉換應用程式介面(Rest API)將該效能管理(PM)資料與該組態管理(CM)資料之參數傳送至該診斷模組,以令該診斷模組透過一關鍵績效指標(KPI)公式資料庫將該效能管理(PM)資料與組態管理(CM)資料之參數計算出或轉換成相應之該關鍵績效指標(KPI)。 For example, the method described in item 11 of the scope of patent application further includes the parameters of the performance management (PM) data and the configuration management (CM) data through the presentation layer state transition application program interface (Rest API) of the network management module Send to the diagnostic module so that the diagnostic module calculates or converts the parameters of the performance management (PM) data and configuration management (CM) data into the corresponding parameters through a key performance indicator (KPI) formula database. Key performance indicators (KPI). 如申請專利範圍第11項所述之方法,更包括令該診斷模組建立有關該障礙事件之權重評估演算法,以在該機器學習模型之訓練過程中,令該診斷模組使用該權重評估演算法來優化或調整該障礙事件之類別權重。 For example, the method described in item 11 of the scope of patent application further includes making the diagnostic module establish a weight evaluation algorithm related to the obstacle event, so as to make the diagnostic module use the weight evaluation during the training process of the machine learning model Algorithm to optimize or adjust the category weight of the obstacle event. 如申請專利範圍第11項所述之方法,更包括令該診斷模組計算該關鍵績效指標(KPI)時,採用各個障礙事件類別之關鍵績效指標(KPI)之平均數、眾數或零填補該關鍵績效指標(KPI)之空值。 For example, the method described in item 11 of the scope of the patent application also includes the use of the average, mode or zero filling of the key performance indicators (KPI) of each obstacle category when the diagnostic module calculates the key performance indicator (KPI) The null value of the key performance indicator (KPI). 如申請專利範圍第11項所述之方法,更包括令該診斷模組採用歸一化來優化該關鍵績效指標(KPI)之數值,或以真假值表示該關鍵績效指標(KPI)之數值。 For example, the method described in item 11 of the scope of patent application further includes making the diagnostic module use normalization to optimize the value of the key performance indicator (KPI), or express the value of the key performance indicator (KPI) with a true or false value . 如申請專利範圍第11項所述之方法,更包括令該診斷模組透過標籤編碼與一獨熱編碼將有關該網路通訊設備之障礙事件類別與正常類別轉換成多維之矩陣。 For example, the method described in item 11 of the scope of patent application further includes making the diagnostic module convert the obstacle event category and the normal category of the network communication equipment into a multi-dimensional matrix through a tag code and a one-hot code. 如申請專利範圍第11項所述之方法,更包括令該診斷模組採用長短期記憶(LSTM)模型作為基礎,以找出該關鍵績效指標(KPI)之間的關聯性,進而建置該機器學習模型。 For example, the method described in item 11 of the scope of the patent application further includes making the diagnostic module use the long-term short-term memory (LSTM) model as the basis to find the correlation between the key performance indicators (KPI), and then build the Machine learning model.
TW108132058A 2019-09-05 2019-09-05 System and method for predicting and preventing obstacles of network communication equipment based on machine learning TWI735942B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW108132058A TWI735942B (en) 2019-09-05 2019-09-05 System and method for predicting and preventing obstacles of network communication equipment based on machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW108132058A TWI735942B (en) 2019-09-05 2019-09-05 System and method for predicting and preventing obstacles of network communication equipment based on machine learning

Publications (2)

Publication Number Publication Date
TW202112103A TW202112103A (en) 2021-03-16
TWI735942B true TWI735942B (en) 2021-08-11

Family

ID=76035711

Family Applications (1)

Application Number Title Priority Date Filing Date
TW108132058A TWI735942B (en) 2019-09-05 2019-09-05 System and method for predicting and preventing obstacles of network communication equipment based on machine learning

Country Status (1)

Country Link
TW (1) TWI735942B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI793604B (en) * 2021-05-12 2023-02-21 國立陽明交通大學 Method for fault diagnosis in communication network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102111797A (en) * 2011-02-15 2011-06-29 大唐移动通信设备有限公司 Fault diagnosis method and fault diagnosis equipment
TW201630395A (en) * 2014-09-25 2016-08-16 貝那得 摩赫畢 Methods and apparatus for hybrid access to a core network based on proxied authentication
CN106713063A (en) * 2015-11-18 2017-05-24 德科仕通信(上海)有限公司 VoIP network packet loss fault detection method
TWI644203B (en) * 2017-12-28 2018-12-11 中華電信股份有限公司 System and method for monitoring cloud network convergence

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102111797A (en) * 2011-02-15 2011-06-29 大唐移动通信设备有限公司 Fault diagnosis method and fault diagnosis equipment
TW201630395A (en) * 2014-09-25 2016-08-16 貝那得 摩赫畢 Methods and apparatus for hybrid access to a core network based on proxied authentication
CN106713063A (en) * 2015-11-18 2017-05-24 德科仕通信(上海)有限公司 VoIP network packet loss fault detection method
TWI644203B (en) * 2017-12-28 2018-12-11 中華電信股份有限公司 System and method for monitoring cloud network convergence

Also Published As

Publication number Publication date
TW202112103A (en) 2021-03-16

Similar Documents

Publication Publication Date Title
CN106941423A (en) Failure cause localization method and device
CN114629802B (en) Service awareness-based power communication backbone network quality assessment method
CN112564964B (en) Fault link detection and recovery method based on software defined network
US20120263072A1 (en) Ethernet traffic statistics and analysis method and system
JPH0366253A (en) Method and system for monitoring communication network and communication network element
CN105469317B (en) A kind of reliability of power communication network analysis method
US20090219823A1 (en) Method and apparatus for automating hub and spoke internet protocol virtual private network trouble diagnostics
CN113542039A (en) Method for positioning 5G network virtualization cross-layer problem through AI algorithm
CN112637006A (en) Power communication gateway key node and influence domain analysis method
CN109246495A (en) A kind of optical network service method for evaluating quality of oriented multilayer, multi objective
WO2019137052A1 (en) Method and device for network operation and maintenance
CN112463892A (en) Early warning method and system based on risk situation
TWI735942B (en) System and method for predicting and preventing obstacles of network communication equipment based on machine learning
CN117014471A (en) Engineering thing networking safety monitoring system based on artificial intelligence
CN115022908A (en) Method for predicting and positioning abnormity of core network and base station transmission network
CN117041089B (en) Equipment monitoring system and method for remote data center
CN105682131B (en) Radio reception device communication quality monitoring method, system and monitoring device
Manzanilla-Salazar et al. ENodeB failure detection from aggregated performance KPIs in smart-city LTE infrastructures
CN114726708A (en) Network element equipment fault prediction method and system based on artificial intelligence
CN107528631A (en) Link error detection diagnostic method and device based on digital fiber compartment system
CN102783087A (en) Associative alarm method and device based on management layers
CN111404742A (en) Industry interconnection framework towards wisdom supply chain
Sakharova et al. Mathematical metamodel of the process of functioning of the intelligent management system of the Carrier Ethernet network
CN215954134U (en) Large-scale real-time data communication state monitoring system for environment-friendly energy enterprise
WO2022126705A1 (en) Method for making decision on intervention occasion in case of fault in telecommunication core network