KR20210003004A

KR20210003004A - Method and apparatus for abnormality diagnose and prediction based on ensemble model

Info

Publication number: KR20210003004A
Application number: KR1020190079005A
Authority: KR
Inventors: 김낙우; 이준기; 이현용; 이병탁; 이동수
Original assignee: 한국전자통신연구원
Priority date: 2019-07-01
Filing date: 2019-07-01
Publication date: 2021-01-11

Abstract

The present invention relates to a technique for processing an ensemble by optimizing primary output results in various models in applying a diagnosis and prediction technique for abnormal situations occurring in facilities and devices. A sequence of methods of the present invention configures and processes a learning network for constructing an optimal ensemble model according to the estimation of each probability distribution by using a response variable having an adaptive threshold value and a specific probability distribution outputted from the various models as an input of the ensemble model. Through this, in diagnosing an abnormal situation related to arbitrary facilities or devices, the abnormality diagnosis technique based on the optimal ensemble model can be implemented.

Description

TECHNICAL FIELD [Method and apparatus for abnormality diagnose and prediction based on ensemble model}

본 발명은 설비 및 장치에서 발생하는 이상 상황에 대한 진단 및 예측 기법을 적용함에 있어, 여러 모델에서의 1차 출력 결과를 최적화하여 앙상블을 처리하기 위한 기법에 관한 것이다. 보다 구체적으로, 본 발명은 상기 여러 모델이 출력하는 적응적 임계치 및 특정한 확률분포를 갖는 반응변수를 앙상블 모델의 입력으로 하여 각 확률분포 추정에 따른 최적의 앙상블 모델을 구축하는 학습망의 구성에 관한 것이다.The present invention relates to a technique for processing an ensemble by optimizing primary output results from various models in applying a diagnosis and prediction technique for abnormal situations occurring in facilities and devices. More specifically, the present invention relates to the configuration of a learning network for constructing an optimal ensemble model according to estimation of each probability distribution by using an adaptive threshold output from the various models and a response variable having a specific probability distribution as inputs of an ensemble model. will be.

종래의 앙상블 기반 분류 예측 기법은 Random forest 등의 Bagging 기법을 이용하여 다수의 결정 트리로부터 분류 또는 예측 값을 평균이나, 곱, 혹은 과반수 투표 등의 방식으로 결과를 도출하는 방법, 혹은 약한 학습기를 여러 개로 연결하여 이전의 약한 모델에서 적합하지 못한 훈련 샘플에 가중치를 두고 훈련하여 최종적으로 강한 학습기를 생성하는 기법인 Boosting 기법 등의 시도가 활발히 이루어졌다. 최근에는 이러한 Bagging이나 Boosting 기법뿐만 아니라, 심층학습 기반의 앙상블 분류 기법이 널리 활용되고 있다. 심층학습 기반에서의 앙상블 분류 기법은 모델 내 계층 상에 Dropout 기법을 적용하여 기존의 방법과 유사한 앙상블 처리 효과를 볼 수 있다.The conventional ensemble-based classification prediction technique uses a bagging technique such as random forest to classify or predict values from multiple decision trees to derive results by means of averaging, multiplication, or majority voting, or several weak learners. Active attempts such as the Boosting technique, a technique that connects with dogs and trains with weights on training samples that were not suitable in the previous weak model, finally creates a strong learner have been actively made. Recently, not only such a bagging or boosting technique, but also an ensemble classification technique based on deep learning has been widely used. The ensemble classification method based on deep learning can see the effect of ensemble processing similar to the existing method by applying the Dropout method on the layer within the model.

Bagging 기법은 훈련 데이터세트를 여러 개로 구분하여 각각의 데이터세트를 독립적으로 학습시킨다. 이에, 각 Bagging 모형에서의 가중치를 앙상블하여 평균하기 때문에 각각의 Bagging 모형을 독립 모델로 볼 수 있으며, 이러한 Bagging 모형을 통해 반응변수의 Variance를 감소시키는 효과를 볼 수 있다. The bagging technique divides training datasets into several and trains each dataset independently. Therefore, since the weights in each bagging model are ensembled and averaged, each bagging model can be viewed as an independent model, and through such a bagging model, the effect of reducing the variance of the response variable can be seen.

Boosting 기법은 훈련 데이터세트를 여러 개로 구분하는 형태는 Bagging 기법과 동일하지만, 연환결합(cascading)된 이전 분류기에서 학습 시 에러가 큰 데이터에 대해서는 다음 분류기에서의 추출 확률을 더 높이도록 조정하여 적절한 학습이 되도록 하는 방법으로, 분류기 성능에 따른 가중치를 주기도 한다. 이러한 Boosting 모형을 통해 반응변수의 Bias를 감소시키는 효과를 낼 수 있다.The boosting method divides the training data set into multiple pieces in the same way as the Bagging method, but for data with large errors when training in the cascading previous classifier, it is appropriate to learn by adjusting the extraction probability in the next classifier to be higher. As a way to achieve this, a weight is also given according to the classifier performance. Through this boosting model, the effect of reducing the bias of the response variable can be achieved.

이러한 종래의 컴퓨터 비전 분야에서 활용하는 기법인 Bagging이나 Boosting기법 등과 근래에 유행하고 있는 심층학습 기반의 모델 등을 결합한 앙상블 기법이 활용되는 사례도 있다. 앙상블 기법은 Bagging, Boosting, 그리고 심층학습 모델을 각각 구성하고, 그 반응변수의 결과를 조화시켜 처리하는 방법이다. 이 때의 앙상블 방법은 출력 변수의 평균, 곱, 그리고 과반수 투표 등을 활용한다.There is also a case in which an ensemble technique that combines a deep learning-based model that is popular in recent years, such as bagging or boosting techniques, which are techniques used in the conventional computer vision field, is used. The ensemble technique is a method of constructing bagging, boosting, and deep learning models, respectively, and harmonizing and processing the results of the response variables. In this case, the ensemble method uses the average of the output variables, the product, and the majority vote.

그러나, 기존의 앙상블 방법은 1단계 출력에서의 임계치에 대한 고려없이 반응변수 값만 2단계 앙상블 모형의 입력 변수로 활용함으로써, 1단계에서 활용된 모형 반응변수 확률분포의 치우침(skewness)을 제대로 반영하지 못하는 문제가 있었다. 1단계 모형 결과에 따라 반응변수의 확률분포는 균등분포, 지수분포, 베타분포, 웨이블(Weibull) 분포, 그리고 정규분포 등의 다양한 형태를 가질 수 있는데, 기존의 방법은 이러한 분포 특성을 제대로 반영하지 못하는 한계점이 있다.However, the existing ensemble method does not properly reflect the skewness of the probability distribution of the model response variable used in the first step by using only the response variable value as the input variable of the second-stage ensemble model without considering the threshold in the first step output. There was a problem that couldn't. Depending on the results of the first-stage model, the probability distribution of the response variable can have various forms such as uniform distribution, exponential distribution, beta distribution, Weibull distribution, and normal distribution, but the existing methods properly reflect these distribution characteristics. There are limitations that cannot be done.

본 발명은 다양한 형태의 학습 모형에 기반한 앙상블 모델을 구현하는데 있어, 각 개별 모델의 반응변수를 채택하기 위해 최적의 확률분포 모형을 적용함으로써, 각 개별 모델의 적응적 임계치를 앙상블 모델 학습에 새롭게 반영하여 학습하도록 한다. 즉, 앙상블 모델에 있어 이전 개별 모델의 반응변수만으로 학습하는 것이 아닌, 적응적 임계치도 함께 포함하여 학습할 수 있는 모형을 구현한다.In the present invention, in implementing an ensemble model based on various types of learning models, by applying an optimal probability distribution model to adopt the response variable of each individual model, the adaptive threshold of each individual model is newly reflected in the ensemble model learning. To learn. In other words, in the ensemble model, we implement a model that can learn by including adaptive thresholds, rather than learning only by the response variables of the previous individual models.

이를 통해 임의의 설비나 장치와 관련된 이상 상황을 진단하는 데에 있어, 최적의 앙상블 모형에 기반한 이상 진단 기법의 구현이 가능하다.Through this, in diagnosing an abnormal situation related to an arbitrary facility or device, it is possible to implement an abnormality diagnosis technique based on an optimal ensemble model.

본 발명은 앙상블 모델을 구현하는 방법의 하나로서, 계층적 학습 모형 구성 시에 1단계 출력을 통해 각 개별 모델이 갖는 반응변수의 임계치 및 확률분포를 생성하고, 생성된 반응변수의 임계치 및 확률분포를 2단계 학습 모형의 입력 변수로 이용함으로써, 1단계 반응변수의 확률분포 추정에 따른 최적화 학습에 기반한 2단계 학습 모형을 구성하는 방법이 제공된다. 이 때, 1단계 반응변수에 대한 최적화 학습은 2단계 학습 모형에 있어 1단계의 각 모형 중요도에 따른 가중치 곱과 연동하여 구성한다.The present invention is one of the methods of implementing an ensemble model, and when constructing a hierarchical learning model, the threshold and probability distribution of the response variables of each individual model are generated through one-step output, and the threshold and probability distribution of the generated response variables A method of constructing a two-stage learning model based on optimized learning based on estimation of the probability distribution of a first-stage response variable is provided by using as an input variable of a two-stage learning model. In this case, the optimization learning for the first-stage response variable is constructed in conjunction with the weight product according to the importance of each model in the first-stage in the second-stage learning model.

이를 구현하기 위하여 본 발명은, 앙상블 모델 기반 이상 진단 및 예측 방법으로서, 훈련데이터를 입력하여 개별 모델에서의 최적화 학습을 수행하는 단계, 적응적 임계치(Threshold) 및 반응 변수 값(Probability)을 출력하는 단계, 각 모델의 임계치 및 반응 변수 값에 기반하여 앙상블 모델의 최적화 학습을 수행하는 단계, 앙상블 모델에 기반하여 이상을 진단하고 예측하는 단계를 포함하는 앙상블 모델 기반 이상 진단 및 예측 방법을 제공한다.In order to implement this, the present invention is a method for diagnosing and predicting anomalies based on an ensemble model, in which training data is input to perform optimization learning in an individual model, and an adaptive threshold and a response variable value (Probability) are output. An ensemble model-based abnormality diagnosis and prediction method is provided, including: performing optimization learning of the ensemble model based on the threshold values and response variable values of each model, and diagnosing and predicting the abnormality based on the ensemble model.

또한 본 발명은 훈련데이터를 입력하여 개별 모델에서의 최적화 학습을 수행하는 수단, 적응적 임계치(Threshold) 및 반응 변수 값(Probability)을 출력하는 수단, 각 모델의 임계치 및 반응 변수 값에 기반하여 앙상블 모델의 최적화 학습을 수행하는 수단, 앙상블 모델에 기반하여 이상을 진단하고 예측하는 수단을 포함하는 앙상블 모델 기반 이상 진단 및 예측 장치를 제공한다.In addition, the present invention provides a means for performing optimization learning in individual models by inputting training data, a means for outputting an adaptive threshold and a response variable value, and an ensemble based on the threshold and response variable values of each model. An ensemble model-based anomaly diagnosis and prediction apparatus including a means for performing optimization learning of a model and a means for diagnosing and predicting an anomaly based on an ensemble model is provided.

이상에서 소개한 본 발명의 구성 및 작용은 이후에 도면과 함께 설명하는 구체적인 실시예를 통하여 더욱 명확해질 것이다.The configuration and operation of the present invention introduced above will become more apparent through specific embodiments described with reference to the drawings.

본 발명은 종래의 발명과 달리, 앙상블 기반의 계층적 학습 모형 구성 시 이전 단계 출력에서의 각 개별 모델이 갖는 반응변수의 임계치 및 확률분포를 다음 단계 학습 모형의 입력 변수로 이용함으로써, 이전 단계에서의 반응변수의 확률분포 추정에 기반하여 앙상블 모형을 최적화 학습하는 것이 가능하다. Unlike the conventional invention, the present invention uses the threshold value and probability distribution of the response variable of each individual model in the output of the previous stage as input variables of the next stage learning model when constructing an ensemble-based hierarchical learning model. It is possible to optimize learning of the ensemble model based on the estimation of the probability distribution of the response variable of.

본 발명을 전력선에서의 부분 방전 발생에 대한 이상 신호 진단 분야에서 활용하면, 여러 학습기를 통해 전력선에 대한 이상 신호 발생 여부를 판별한 후, 앞선 1차 학습기들의 각 출력 변수 및 임계치 묶음에 대한 확률분포 추정 기반의 2차 앙상블 학습기를 통해 최적의 부분 방전 이상 신호 진단이 가능해진다.If the present invention is used in the field of diagnosing abnormal signals for the occurrence of partial discharge in a power line, after determining whether an abnormal signal has occurred for the power line through several learners, the probability distribution for each output variable and threshold bundle of the preceding primary learners An optimal partial discharge abnormal signal diagnosis is possible through an estimation-based secondary ensemble learner.

도 1은 앙상블 모델 기반 이상 진단 및 예측 처리 방법 개념도, 도 2는 앙상블 모델 기반 이상 진단 및 예측 방법 순서도, 도 3은 앙상블 모델처리부 블록도, 도 4는 앙상블 모델처리부 순서도, 도 5는 여러 학습모형에서 추정된 반응변수 확률 모형 예시, 도 6은 여러 학습모형에서 추정된 임계치 결과 예시1 is a conceptual diagram of an ensemble model-based abnormality diagnosis and prediction processing method, FIG. 2 is a flow chart of an ensemble model-based abnormality diagnosis and prediction method, FIG. 3 is a block diagram of an ensemble model processing unit, FIG. 4 is a flow chart of an ensemble model processing unit, and FIG. 5 is several learning models. An example of a probability model of a response variable estimated from, Figure 6 is an example of a threshold result estimated from several learning models

본 발명의 이점 및 특징, 그리고 이들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 기술되어 있는 실시예를 참조하면 명확해질 것이다. 그러나 본 발명은 이하에서 개시되는 실시예에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 것이며, 단지 본 실시예는 본 발명의 개시가 완전하도록 하며, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명은 청구항의 기재에 의해 정의된다. Advantages and features of the present invention, and a method of achieving them will become apparent with reference to the embodiments described in detail together with the accompanying drawings. However, the present invention is not limited to the embodiments disclosed below, but will be implemented in various forms different from each other, only this embodiment makes the disclosure of the present invention complete, and those skilled in the art to which the present invention pertains. It is provided to fully inform the scope of the invention to the person, and the invention is defined by the description of the claims.

한편, 본 명세서에서 사용된 용어는 실시예를 설명하기 위한 것이며 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급하지 않는 한 복수형도 포함한다. 명세서에서 사용되는 "포함한다(comprises)" 또는 "포함하는(comprising)"은 언급된 구성요소, 단계, 동작 및/또는 소자 이외의 하나 이상의 다른 구성요소, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다.On the other hand, terms used in the present specification are for explaining examples and are not intended to limit the present invention. In this specification, the singular form also includes the plural form unless specifically stated in the phrase. As used herein, "comprises" or "comprising" refers to the presence of one or more other components, steps, actions and/or elements other than the recited elements, steps, actions and/or elements, or Does not exclude addition.

이하, 본 발명의 바람직한 실시예를 첨부 도면을 참조하여 상세히 설명한다. 각 도면의 구성요소들에 참조부호를 부가함에 있어서, 동일한 구성요소들에 대해서는 비록 다른 도면상에 표시되더라도 가급적 동일한 부호를 부여하고 또한 본 발명을 설명함에 있어, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있는 경우에는 그 상세한 설명을 생략한다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. In adding reference numerals to elements of each drawing, even though they are indicated on different drawings, the same elements are assigned the same reference numerals as much as possible, and in describing the present invention, a detailed description of related known configurations or functions If the gist of the present invention may be obscure, detailed description thereof will be omitted.

도 1은 앙상블 모델 기반 이상 진단 및 예측 방법의 처리 개념도이다. 먼저, 훈련 데이터세트(train dataset)(10) 전체 혹은 랜덤 샘플 처리된 데이터세트가 각각의 개별모델처리부(20A, 20B, ??, 20N)로 입력된다. 개별모델처리부(20A, 20B, ??, 20N)는 입력된 훈련데이터세트(10)를 이용하여 가중치 훈련을 수행한 후, 예측 모형을 완성하여 반응변수(혹은 출력변수)(probability) 및 모델 기반의 데이터 이상 유무를 판정하기 위한 임계치(threshold)를 출력한다.1 is a conceptual diagram illustrating an ensemble model-based abnormality diagnosis and prediction method. First, the entire training dataset 10 or a dataset processed with random samples is input to each individual model processing unit 20A, 20B, ??, and 20N. Individual model processing units (20A, 20B, ??, 20N) perform weight training using the input training data set 10, and then complete the prediction model to provide a response variable (or output variable) (probability) and model-based A threshold for determining whether there is an abnormality in the data of is outputted.

이 때, 반응변수의 확률분포 모형은 다양한 형태로 분포가능한데, 예를 들어, 균등분포(도 5의 a), 지수분포, 베타분포(도 5의 b, c), 웨이블 분포, 그리고 정규분포(도 5의 d) 등의 연속형 확률분포로 나타난다. At this time, the probability distribution model of the response variable can be distributed in various forms, for example, uniform distribution (Fig. 5a), exponential distribution, beta distribution (Fig. 5b, c), wavel distribution, and normal distribution. It appears as a continuous probability distribution such as (d in Fig. 5).

또한 도 6에서 보는 바와 같이, 각각의 학습모형에서 추정된 예측 결과로부터의 이상 유무 판정을 위한 임계치는 출력된 반응변수 확률분포의 중앙값이나 평균값에서 나타나는 것이 아니라, 각 학습 모형에 상관없이 특정 값을 갖는다. In addition, as shown in Fig. 6, the threshold value for determining the presence or absence of an abnormality from the prediction results estimated in each learning model is not indicated by the median or average value of the output probability distribution of the response variable, but a specific value regardless of each learning model. Have.

이러한 결과로 인해, 현재 단계에서의 반응변수 결과값을 다음 단계인 앙상블 모델처리부(30)에서 직접 입력 받아 처리할 때, 그 학습 결과가 최적의 성능을 나타내지 않을 확률이 높아진다. 즉, 도 5에서 보이는 바와 같이, 1단계 모형에서의 반응변수 결과값이 일반적으로 치우침(skewness) 현상을 보인다.Due to this result, when the response variable result value in the current step is directly input and processed by the ensemble model processing unit 30, which is the next step, the probability that the learning result does not show optimal performance increases. That is, as shown in FIG. 5, the response variable result value in the first-stage model generally exhibits a skewness phenomenon.

따라서, 본 발명에서는 반응변수 결과값을 임계치와 함께 학습하여, 결과값을 보정하는 프로세스를 다음 단계인 앙상블 모델처리부(30)에서 수행한다. 앙상블 모델처리부(30)에서는 각 개별 모델의 반응변수 및 임계치 기반의 최적화 학습을 수행하며, 시험 데이터세트(test dataset)(40)를 통해 최종적인 이상 진단 및 시각화 과정(50)을 수행하여 진단 결과(Diagnostic result)를 출력한다. Accordingly, in the present invention, the process of correcting the result value by learning the response variable result value together with the threshold value is performed in the ensemble model processing unit 30, which is a next step. The ensemble model processing unit 30 performs optimization learning based on response variables and thresholds of each individual model, and performs a final abnormality diagnosis and visualization process 50 through a test dataset 40 to obtain a diagnosis result. (Diagnostic result) is output.

이상의 과정을 도 2의 앙상블 모델 기반 이상 진단 및 예측 방법 순서도를 참조하여 각 단계별로 설명한다.The above process will be described for each step with reference to the flow chart of a method for diagnosing and predicting an abnormality based on an ensemble model of FIG. 2.

S10: 훈련데이터 입력S10: input training data

훈련데이터는 시간 순서에 따른 시퀀셜(sequential) 정형데이터가 되도록 전처리 작업을 수행한다. 즉, 훈련데이터는 그 데이터의 특성에 따라 영상, 이미지, 음성, 텍스트, 숫자 등의 형태로 나타난다. 영상이나 이미지, 음성 등의 데이터는 훈련데이터로의 입력을 위한 정형화된 시퀀셜 데이터 구성을 위해 FFT(Fast Fourier Transform), DCT(Discrete Cosine Transform), MFCC(Mel Frequency Cepstral Coefficient) 등의 전처리 변환을 거쳐 외부 파일로 저장되거나 모델에 직접 입력하여 처리된다. The training data is pre-processed to become sequential structured data according to a time sequence. That is, the training data appears in the form of images, images, voices, texts, numbers, etc. according to the characteristics of the data. Data such as video, image, and audio are subjected to preprocessing transformation such as FFT (Fast Fourier Transform), DCT (Discrete Cosine Transform), MFCC (Mel Frequency Cepstral Coefficient), etc. It is saved as an external file or directly input to the model for processing.

S20: 각 개별 모델에서의 최적화 학습S20: Optimization training on each individual model

단계 S10에서의 입력데이터의 형태는 데이터 특성에 따라 다양한 구조를 갖기 때문에, 단계 S20에서는 각 개별 모델처리부(20A, 20B, …, 20N)에서 데이터 특성에 따른 입력데이터를 각각 처리한다. 예를 들어, 입력데이터는 영상, 이미지, 음성, 텍스트, 숫자 등의 특성에 따라 XGBoost, LightGBM, CNN(Convolutional Neural Network), RNN(Recurrent Neural Network) 등의 개별 학습 모델을 성능 분석에 기초하여 다르게 구성할 수 있다. 또는 비슷한 데이터 특성을 갖는 입력데이터를 여러 군집으로 나누고, 군집별로 모델을 달리 적용하여 최적화 학습을 수행하는 것도 가능하다.Since the form of the input data in step S10 has various structures according to the data characteristics, in step S20, each individual model processing unit 20A, 20B, ..., 20N processes input data according to the data characteristics, respectively. For example, the input data is based on the performance analysis of individual training models such as XGBoost, LightGBM, CNN (Convolutional Neural Network), RNN (Recurrent Neural Network), etc., depending on the characteristics of video, image, voice, text, and numbers. Configurable. Alternatively, it is possible to divide input data having similar data characteristics into several clusters and perform optimization learning by applying different models for each cluster.

S30: 적응적 임계치(Threshold) 및 반응변수 값(Probability)출력S30: Output of adaptive threshold and response variable value (Probability)

S20에서의 각 입력데이터의 최적화 학습을 통한 단계 S30에서의 모델 출력은 데이터의 정상/비정상을 구분하는 적응적 임계치(threshold)와 모델을 통한 입력데이터의 사후분포를 포함한 반응변수 값(probability)으로 나타난다. 여기서 임계치는 동일한 표본에 대해 여러 임계값의 자동적인 입력을 통해 출력에서의 예측 성능을 분석함으로써 적응적으로 최적의 임계치 선택이 가능하다.The model output in step S30 through optimizing learning of each input data in S20 is an adaptive threshold that distinguishes between normal and abnormal data and the probability of a response variable including the posterior distribution of the input data through the model. appear. Here, the threshold value can be adaptively selected by analyzing the prediction performance at the output through automatic input of several threshold values for the same sample.

S40: 각 모델의 임계치 및 반응 변수 값 기반 앙상블 모델 최적화 학습S40: Ensemble model optimization training based on threshold and response variable values of each model

S40 단계에서는 각 모델의 반응 변수값과 임계치를 기반으로 앙상블 모델을 구현한다. 각 모델의 예측 성능에 대한 기대값은 입력데이터의 모델 출력에 대한 사후확률 분포와 모델의 예측 성능, 그리고 모델 가중치의 곱에 비례한다. 이러한 각 모델의 결과를 앙상블하기 위한 최적의 가중치 파라미터를 구하는 학습과정이 앙상블 모델 최적화 학습이다. 예를 들어, 사후 확률분포가 낙타의 쌍봉형태일 때 쌍봉형태의 정중앙 값을 갖는 임계치의 가중치와 사후 확률분포가 단일 봉우리 형태일 때 봉우리 정상값을 갖는 임계치의 가중치는 가중치의 신뢰성이 전혀 다르게 된다. 즉, 앙상블 모델 학습에서는 각 모델의 사후확률분포모델, 예측성능, 임계치를 입력하여 최적화 학습이 진행된다.In step S40, the ensemble model is implemented based on the response variable value and the threshold value of each model. The expected value for the prediction performance of each model is proportional to the product of the posterior probability distribution for the model output of the input data, the prediction performance of the model, and the model weight. Ensemble model optimization learning is a learning process to find optimal weight parameters for ensemble the results of each of these models. For example, when the posterior probability distribution is in the form of a camel's bimodal, the weight of the critical value with the central value of the bimodal shape and the weight of the critical value with the peak normal value when the posterior probability distribution is in the form of a single peak have completely different weight reliability. . That is, in ensemble model training, optimization learning is performed by inputting the posterior probability distribution model, prediction performance, and threshold values of each model.

S50: 앙상블 모델 기반 이상 진단 예측S50: Ensemble model-based abnormality diagnosis prediction

S40 단계에서의 최적화 학습이 끝난 앙상블 모델은 적응적 임계치와 예측 성능을 출력한다. S50에서 시험데이터세트를 통해 앙상블 모델의 이상 진단 예측 정확도를 확인할 수 있다.The ensemble model after optimization learning in step S40 outputs an adaptive threshold and prediction performance. In S50, the accuracy of prediction of abnormality diagnosis of the ensemble model can be checked through the test data set.

S60: 시험데이터 기반 이상 진단 결과 시각화S60: Visualization of abnormality diagnosis results based on test data

S50 단계에서의 이상 진단 결과의 정확성 확인을 위해 S60 단계에서는 모델의 가중치 파라미터 등을 기반으로 시각화한다. 예를 들어, 영상이나 이미지에서의 이상 진단 모델의 타당성 확인을 위해 수행된 모델에 Grad-CAM 기법 등을 적용하여 이상 데이터 영역 등을 시각적으로 확인할 수 있다.In order to check the accuracy of the abnormality diagnosis result in step S50, in step S60, it is visualized based on the weight parameter of the model. For example, by applying a Grad-CAM technique or the like to a model performed to check the validity of an abnormality diagnosis model in an image or image, the abnormal data area can be visually checked.

도 3은 도 1에 나타낸 앙상블 모델처리부(30)의 블록도를 나타낸다. 앙상블 모델처리부(30)는 입력처리부(310), 데이터분석부(320), 특징처리부(330), 모델학습부(340), 이상진단부(350), 그리고 진단출력부(360)로 구성되어 있다.3 shows a block diagram of the ensemble model processing unit 30 shown in FIG. 1. The ensemble model processing unit 30 is composed of an input processing unit 310, a data analysis unit 320, a feature processing unit 330, a model learning unit 340, an abnormality diagnosis unit 350, and a diagnosis output unit 360. have.

입력처리부(310)는 이전 개별모델처리부(도 1의 20)로부터 전달된 임계치 및 반응변수 값을 저장하는 모듈이다. 데이터분석부(320)는 반응변수 값의 확률분포를 추정한다. 이 때, 전달받은 반응변수 값의 확률분포는 균등분포, 지수분포, 베타분포, 웨이블 분포, 정규분포 등의 다양한 연속형 확률분포일 수 있는데, 이러한 유형의 분포 중 가장 유사한 하나를 선택하여 각 개별모델 반응변수 값 별로 추정한다.The input processing unit 310 is a module that stores a threshold value and a response variable value transmitted from the previous individual model processing unit (20 in FIG. 1). The data analysis unit 320 estimates a probability distribution of values of a response variable. At this time, the probability distribution of the received response variable values can be various continuous probability distributions such as uniform distribution, exponential distribution, beta distribution, wavel distribution, and normal distribution. Estimate for each individual model response variable value.

특징처리부(330)는 각 개별모델 별 확률분포에서의 임계치 처리를 통해 수정된 임계치 및 모델 별 가중치 및 모델 별 반응변수와의 곱을 통해 모델학습부(340)로 입력될 변수 값을 처리한다. 이 때, 모델학습부(340)의 입력 변수 값은 다음의 수학식 1을 이용한다. The feature processing unit 330 processes a variable value to be input to the model learning unit 340 by multiplying the modified threshold value through the processing of the threshold value in the probability distribution for each individual model, the weight for each model, and the response variable for each model. In this case, the value of the input variable of the model learning unit 340 is Equation 1 below.

수학식 1에서 데이터의 확률밀도함수를

라 할 때, 1차 개별모델처리부(20) 이후의 사후 확률 밀도함수는

가 되고(여기서,

: 개별모델처리부에서의 가중치 파라미터, x: 임계치 확률 변수), 특정 임계치 j상의 i번째 개별모델처리부(20)에서의 사후 확률밀도 값은

이 된다.In Equation 1, the probability density function of the data

When d, the posterior probability density function after the first individual model processing unit 20 is

Becomes (where,

: The weight parameter in the individual model processing unit, x: the threshold random variable), the posterior probability density value in the i-th individual model processing unit 20 above the specific threshold value j

Becomes.

또한, 수학식 1에서,

는 i번째 개별모델처리부의 성능에 따른 가중치 파라미터이고,

는 i번째 개별모델처리부의 입력변수에 대한 반응변수의 확률 값이 된다. 일례로,

가 정규분포를 따른다 가정하면, x의 특정 임계치 j가 확률밀도함수의 평균값일 때,

값이 최대값을 갖게 되고, x의 특정 임계치 j의 치우침 정도에 따라

값이 달라지게 된다. 이 때의 정규분포 함수는 도 5의 (D)를 참고한다.Also, in Equation 1,

Is a weighting parameter according to the performance of the i-th individual model processor,

Is the response variable for the input variable of the i-th individual model processing unit. It becomes a probability value. For example,

Assuming that is a normal distribution, when a specific threshold j of x is the average value of the probability density function,

The value will have a maximum value, and depending on the degree of bias of the specific threshold j of x

The value will be different. For the normal distribution function at this time, refer to FIG. 5D.

모델학습부(340)는 특징처리부(330)에서 처리된 각 개별모델 별 수정된 임계치와 모델 별 가중치 및 모델 별 반응변수를 입력 받아 학습모델을 훈련시킨다. 이상진단부(350)는 훈련된 학습모델의 적응적 임계치 및 확률변수 값으로부터 시험데이터의 이상진단 여부를 확인하고, 진단출력부(360)를 통해 시각화해서 출력한다. The model learning unit 340 receives the corrected threshold value for each individual model processed by the feature processing unit 330, a weight for each model, and a response variable for each model to train a learning model. The abnormality diagnosis unit 350 checks whether or not the test data is abnormally diagnosed from the adaptive threshold value and the random variable value of the trained learning model, and visualizes and outputs it through the diagnosis output unit 360.

도 4에 이러한 앙상블 모델처리부(30)의 처리 프로세스를 도시하였으며, 각 단계별로 설명한다.4 shows the processing process of the ensemble model processing unit 30, and each step will be described.

S110: 각 모델에서의 적응적 임계치(Threshold) 및 반응변수 값(Probability)입력. 여기서 앙상블 모델의 입력은 각각의 입력데이터에 대한 여러 개별적인 학습 모델의 출력이다. 이 때, 출력값은 데이터의 정상/비정상을 구분하는 임계치와 개별학습 모델을 통한 입력데이터의 사후분포를 포함한 반응 변수 값이다.S110: Input of adaptive threshold and probability for each model. Here, the input of the ensemble model is the output of several individual learning models for each input data. In this case, the output value is a value of a response variable including a threshold for classifying normal/abnormal data and a posterior distribution of input data through an individual learning model.

S120: 각 모델 별 가중치 및 적응적 임계치, 반응변수 값 기반의 앙상블 모델 학습. 각각의 개별 학습 모델 반응 변수값과 임계치를 기반으로 하여 앙상블 모델을 학습한다. 각 모델의 예측 성능에 대한 기대값은 입력데이터의 모델 출력에 대한 사후확률 분포와 모델의 예측 성능, 그리고 모델 가중치의 곱에 비례한다. 이러한 각 모델의 결과를 앙상블 하기 위한 최적의 가중치 파라미터를 구하는 학습과정이 앙상블 모델 최적화 학습이다. 즉, 앙상블 모델 학습에서는 각 모델의 사후확률분포모델, 예측성능, 임계치를 입력으로 하여 최적화 학습이 진행된다.S120: Ensemble model training based on weights, adaptive thresholds, and response variable values for each model. The ensemble model is trained based on each individual learning model response variable value and threshold. The expected value for the prediction performance of each model is proportional to the product of the posterior probability distribution for the model output of the input data, the prediction performance of the model, and the model weight. Ensemble model optimization learning is a learning process to find the optimal weight parameters to ensemble the results of each of these models. That is, in ensemble model learning, optimization learning is performed by inputting the posterior probability distribution model, prediction performance, and threshold values of each model.

S130: 앙상블 모델 기반 적응적 임계치(Threshold) 및 예측성능치(Accuracy) 출력. 최적화 학습이 끝난 앙상블 모델은 적응적 임계치와 예측 성능을 출력한다. 훈련데이터세트와 구분된 시험데이터세트를 통해 앙상블 모델의 이상 진단 예측 정확도를 확인한다.S130: Ensemble model-based adaptive threshold and predictive performance value (Accuracy) output. The ensemble model after optimization learning outputs the adaptive threshold and prediction performance. Through the training data set and the separate test data set, the accuracy of prediction of abnormality diagnosis of the ensemble model is checked.

이상에서 설명한 본 발명은 전력선에서의 부분 방전 발생에 대한 이상 신호 진단에 활용이 가능하다. 예를 들어, 1차로 Xgboost, LightGBM, Lstm-Attention 등 여러 학습기를 통해 전력선에 대한 이상 신호 발생 여부를 판별한 후, 앞선 1차 학습기들의 각 출력 변수 및 임계치 묶음에 대한 확률분포 추정 기반의 2차 앙상블 학습기를 통해 최적의 부분 방전 이상 신호를 진단할 수 있다.The present invention described above can be utilized in diagnosing an abnormal signal for the occurrence of partial discharge in a power line. For example, after determining whether an abnormal signal has occurred on the power line through various learners such as Xgboost, LightGBM, Lstm-Attention, etc. as the first order, the second order based on the estimation of the probability distribution for each output variable and threshold bundle of the preceding first learners. An optimal partial discharge abnormal signal can be diagnosed through the ensemble learner.

이상에서 설명한 본 발명은 장치 측면 또는 방법적 측면으로 실시가능한데, 특히 본 발명의 각 구성요소의 기능(function) 또는 과정(process)은 DSP(digital signal processor), 프로세서, 컨트롤러, ASIC(application-specific IC), 프로그래머블 로직소자(FPGA 등), 기타 전자소자 중의 적어도 하나 그리고 이들의 조합이 포함되는 하드웨어 요소로써 구현 가능하다. 또한 하드웨어 요소와 결합되어 또는 독립적으로 소프트웨어로써도 구현 가능한데, 이 소프트웨어는 기록매체에 저장 가능하다.The present invention described above can be implemented in terms of a device or a method. In particular, a function or process of each component of the present invention is a digital signal processor (DSP), a processor, a controller, and an application-specific application-specific (ASIC). IC), programmable logic devices (FPGA, etc.), and at least one of other electronic devices, and a combination of these can be implemented as hardware elements. It can also be implemented as software in combination with hardware elements or independently, and this software can be stored on a recording medium.

본 발명의 바람직한 실시예를 통하여 본 발명의 구성을 상세히 설명하였으나, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명이 그 기술적 사상이나 필수적인 특징을 변경하지 않고서 본 명세서에 개시된 내용과는 다른 구체적인 형태로 실시될 수 있다는 것을 이해할 수 있을 것이다. 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 본 발명의 보호범위는 상기 상세한 설명보다는 후술한 특허청구범위에 의하여 정해지며, 특허청구의 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태는 본 발명의 기술적 범위에 포함되는 것으로 해석되어야 한다.Although the configuration of the present invention has been described in detail through a preferred embodiment of the present invention, those of ordinary skill in the art to which the present invention pertains to the present invention are different from the contents disclosed in the present specification without changing the technical idea or essential features. It will be appreciated that it can be implemented in other specific forms. It should be understood that the embodiments described above are illustrative in all respects and not limiting. The scope of protection of the present invention is determined by the claims described later rather than the detailed description, and all changes or modifications derived from the scope of the claims and their equivalent concepts should be interpreted as being included in the technical scope of the present invention. .

10: 훈련 데이터세트, 40: 시험 데이터세트, 50: 이상 진단 및 시각화 과정10: training dataset, 40: test dataset, 50: abnormality diagnosis and visualization process

Claims

As an ensemble model-based abnormality diagnosis and prediction method,
Inputting training data to perform optimization learning on individual models,
Outputting an adaptive threshold and a probability of a response variable,
Performing optimization learning of the ensemble model based on the threshold value and response variable value of each model,
An ensemble model-based abnormality diagnosis and prediction method comprising the step of diagnosing and predicting an abnormality based on the ensemble model.