KR20200010671A

KR20200010671A - System and method for fault diagnosis of equipment based on machine learning

Info

Publication number: KR20200010671A
Application number: KR1020180078640A
Authority: KR
Inventors: 안민정; 이홍철; 최영진; 홍진석; 권상현; 김명소; 곽유영; 변성준
Original assignee: 에임시스템 주식회사; 고려대학교 산학협력단
Priority date: 2018-07-06
Filing date: 2018-07-06
Publication date: 2020-01-31
Also published as: KR102501884B1

Abstract

According to the present invention, a machine learning-based facility failure diagnosis method comprises a failure diagnosis process using a multiplayer perception (MLP) model, in which a priority of variables contributing to a failure is estimated by calculating a failure diagnostic indicator P_(iL) with a weight extracted by training the MLP having a regression weight as an input.

Description

System and method for fault diagnosis of equipment based on machine learning

본 발명은 기계 학습 기반의 설비 이상 진단 시스템 및 방법에 관한 것으로, 보다 상세하게는 데이터 독립변수간의 Regression Weight 계수값을 입력으로 하여 MLP 모델을 학습시켜 추출된 지표를 통해 이상의 원인이 되는 변수의 우선순위를 산출하는 시스템 및 방법에 관한 것이다.The present invention relates to a system learning diagnosis system and method based on machine learning. More specifically, priority is given to variables that cause abnormalities through indicators extracted by learning MLP models by inputting regression weight coefficient values between data independent variables. A system and method for calculating ranks.

설비 시스템에 결함이나 손상이 발생할 경우, 시스템의 가동 중단을 야기할 수 있으며 이로 인한 사회적, 경제적 피해의 규모는 막대하다. 따라서 고장이 발생하기 전에 예방정비를 실행하는 것이 중요하며 현재 대부분의 시스템은 일정한 시간 간격을 두고 실시하는 정기적 예방정비에 의존하고 있다. 그러나 정기적 예방정비는 부품의 실제 결함 유무와 관계없이 일정한 주기에 의해 실시되므로 정상 부품의 불필요한 교체로 인한 비용 손실을 발생시키고 갑작스런 시스템의 고장을 예방하는 데에 한계가 있다. 따라서 이러한 문제들을 해결하기 위해 최근 고장예지 및 건전성관리(PHM: prognostics and health management)기술이 활발히 연구되고 있다. 고장예지 및 건전성관리 기술은 설비 시스템의 상태를 실시간으로 감시하여 시스템의 이상을 조기에 감지하고 미래에 발생할 고장을 미리 예측한다. 따라서 적절한 조치를 미리 취함으로써 불필요한 유지보수 비용을 줄일 수 있고, 시스템의 안전성과 신뢰성을 높여 재앙적인 사고를 예방할 수 있다.Defects or damage to the plant's systems can lead to downtime, and the social and economic consequences of this are enormous. Therefore, it is important to implement preventive maintenance before failure occurs. Currently, most systems rely on regular preventive maintenance at regular intervals. However, the regular preventive maintenance is carried out at regular intervals regardless of the actual defect of the parts, there is a limit to prevent the sudden failure of the system, resulting in cost loss due to unnecessary replacement of the normal parts. Therefore, prognostics and health management (PHM) technology has been actively researched recently to solve these problems. Fault prediction and health management technology monitors the condition of the facility system in real time to detect system failures early and predict future failures in advance. Therefore, by taking appropriate measures in advance, unnecessary maintenance costs can be reduced, and disasters can be prevented by increasing the safety and reliability of the system.

고장예지 및 건전성관리 기술에는 크게 두 가지 방법이 있다. 첫째로, 데이터주도방법(datadriven approach)은 데이터를 이용하여 시스템의 신뢰성, 건전성 정보를 통계적 방법으로 추론하는 것이다. 기계학습(machine learning)기법은 가장 많이 사용되는 기법으로 건전성인자와 고장 여부의 관계를 훈련하여 미래고장을 예측할 수 있다. 이 방법은 물리적 손상 모델을 구현하기 어려운 다변량 시스템에 적용이 가능하다는 장점이 있지만, 훈련을 위해 많은 양의 데이터가 필요하다. 다음으로 모델기반방법(model based approach)은 물리적 고장 모델을 기반으로 고장을 진단 및 예지한다. 이 방법은 정확도가 높으며 적은 양의 고장 데이터로 고장 진단이 가능하다. 모델의 변수를 바꿔줌으로써 다양한 운행 환경에서도 적용 가능하다. 그러나, 고장 메커니즘 파악이 어렵거나 모델 변수의 수가 매우 많은 경우 모델이 실제 고장 메커니즘을 온전히 구현하지 못하므로 적용 분야가 한정적이다.There are two main methods of failure prediction and health management techniques. First, the datadriven approach is to infer the reliability and soundness information of the system by statistical method using data. Machine learning is the most widely used technique and can predict future failure by training the relationship between health factor and failure. This method has the advantage of being applicable to multivariate systems where physical damage models are difficult to implement, but requires a large amount of data for training. Next, the model based approach diagnoses and predicts failures based on physical failure models. This method is highly accurate and allows fault diagnosis with a small amount of fault data. By changing the parameters of the model, it can be applied to various driving environments. However, if the failure mechanism is difficult to identify or the number of model variables is very large, the application field is limited because the model does not fully implement the actual failure mechanism.

공정 이상치 탐지 및 분류 그리고 진단에 대한 기술은 SPC 차트, k-NN, 데이터마이닝 기법, 인공신경망 등과 같은 다양한 방법론의 기술들이 연구되었고 진행되고 있다. 현 공정에서는 대표적인 공정관리도 방법으로써 SPC차트를 많이 사용한다. 하지만 SPC 차트는 관리한계도가 높아지는 현상 때문에 이상치에 대해 효과적으로 탐지하기 어려웠고 판정의 어려움으로 실 공정에서 나오는 데이터의 정상/비정상에 대한 판단여부도 불확실한 점이 있었다. Techniques for process outlier detection and classification and diagnosis have been studied and progressed in various methodologies such as SPC charts, k-NN, data mining techniques, and neural networks. In the current process, SPC charts are frequently used as a representative process control method. However, SPC charts were difficult to detect outliers effectively due to the increased management limit, and there was uncertainty about the normal / abnormality of data from the actual process due to the difficulty of judgment.

본 발명이 이루고자 하는 기술적 과제는 데이터 독립변수간의 Regression Weight 계수값을 입력으로 하여 MLP 모델을 학습시켜 추출된 지표를 통해 이상의 원인이 되는 변수의 우선순위를 산출할 수 있는 기계학습 기반의 설비 이상 진단 시스템 및 방법을 제공하는데 있다.The technical problem to be achieved by the present invention is the machine learning-based facility fault diagnosis that can calculate the priority of the variable causing the abnormality through the extracted index by learning the MLP model by inputting the regression weight coefficient values between the data independent variables To provide a system and method.

상기 기술적 과제를 해결하기 위한 본 발명에 따른 기계 학습 기반의 설비 이상 진단 방법은, Regression weight를 입력으로 하는 다층신경망(Multilayer Perceptron) 모델을 학습시켜 추출된 weight로 이상 진단 지표 P_iL을 계산하여 이상의 원인이 되는 변수의 우선순위를 산출하는, MLP 모델을 이용한 상기 이상 진단 과정을 포함한다.Machine learning-based facility failure diagnosis method according to the present invention for solving the technical problem, by calculating a multi-layer neural network (Multilayer Perceptron) model with the input regression weight to calculate the abnormality diagnostic index P _iL The abnormal diagnosis process using the MLP model, which calculates the priority of the causative variable, is included.

MLP 모델을 이용한 상기 이상 진단 과정은, 독립변수와 종속변수를 정의하고, 데이터 표준화를 수행하고, 선형 회귀분석의 계수 β를 추출하고, β를 input으로 정의하는 데이터 입력 정의 과정; Training Data로 학습하고, Validation data로 모델을 검증하고, 최적의 MLP 구조를 결정하고, Test Data로 모델을 검증하는 MLP 학습 과정; 및 변수 별 weight로 P_iL을 계산하고, 데이터에 따라 P_iL을 해석하고, 영향력이 큰 독립변수를 선택하는 MLP 진단 과정을 포함할 수 있다.The abnormality diagnosis process using the MLP model may include: a data input definition process of defining independent and dependent variables, performing data standardization, extracting coefficient β of linear regression analysis, and defining β as input; MLP learning process for learning with Training Data, validating the model with Validation data, determining the optimal MLP structure, and validating the model with Test Data; And an MLP diagnosis process for calculating P _{iL based} on weight for each variable, interpreting P _iL according to data, and selecting an independent variable having a high influence.

상기 데이터 입력 정의 과정은, 독립변수와 종속변수를 정의하고 선형 회귀분석을 수행하여 추출된 계수 β를 MLP 모델의 input으로 사용할 수 있다.In the data input definition process, the coefficient β extracted by defining the independent variable and the dependent variable and performing linear regression analysis may be used as an input of the MLP model.

상기 MLP 학습 과정은, hidden layer, node의 수를 조정하여 최적의 MLP 구조를 결정하고 Test data로 검증할 수 있다.In the MLP learning process, an optimal MLP structure may be determined by adjusting the number of hidden layers and nodes and verified with test data.

상기 MLP 진단 과정은, 이상 진단의 지표로 P_iL 값을 계산하여 데이터 타입에 따른 해석 방법을 결정할 수 있다.In the MLP diagnosis process, an analysis method according to a data type may be determined by calculating a P _iL value as an indicator of an abnormal diagnosis.

상기된 본 발명에 의하면, 데이터 독립변수간의 Regression Weight 계수값을 입력으로 하여 MLP 모델을 학습시켜 추출된 지표를 통해 이상의 원인이 되는 변수의 우선순위를 산출할 수 있는 기계학습 기반의 설비 이상 진단 시스템 및 방법을 제공할 수 있다.According to the present invention described above, machine learning based facility fault diagnosis system that can calculate the priority of the variable causing the abnormality through the extracted index by learning the MLP model by inputting the regression weight coefficient value between the data independent variables And methods.

도 1은 본 발명의 일 실시예에 따른 기계 학습 기반의 설비 이상 탐지/분류/진단 시스템의 전체적인 구성을 나타낸다.
도 2는 본 발명의 일 실시예에 따른 기계 학습 기반의 설비 이상 탐지/분류/진단 시스템의 전체 공정도를 나타낸다.
도 3은 데이터 전처리 과정의 일 예를 보여준다.
도 4는 CVD(Chemical vapor Deposition) 설비와 스토커(Stocker) 설비로부터 수집되는 데이터의 일 예를 보여준다.
도 5는 차원 축소 과정을 설명하는 도면이다.
도 6은 차원 축소 결과로서, 1차 차원 축소의 예를 보여준다.
도 7은 차원 축소 결과로서, 2차 차원 축소의 예를 보여준다.
도 8은 안정화 구간을 설정하는 과정의 일 예를 보여준다.
도 9는 특질 추출의 일 예를 보여준다.
도 10은 이상 탐지 과정의 일 예를 보여준다.
도 11은 SVR 모델을 적용하여 이상을 탐지하는 과정을 나타낸다.
도 12는 SVR 학습 과정에서 선택할 수 있는 SVR 커널들의 예를 보여준다.
도 13은 Golden Line과 Margin의 예를 보여준다.
도 14는 SVR 예측 결과 이상으로 탐지된 데이터의 예를 보여준다.
도 15는 k-NNDD 모델을 적용하여 이상을 탐지하는 과정을 나타낸다.
도 16은 DTW 알고리즘의 개념을 나타내는 도면이다.
도 17은 계층적 군집화를 통해 새로운 관측치로부터 정상 관측치가 얻어지는 것을 나타낸 도면이다.
도 18은 비모수 추정 방법을 활용한 k-NNDD의 이상치 탐지 기법의 개념을 나타낸다.
도 19는 SSE로 Elbow point를 선택하여 군집수를 결정하는 것과 DTW 거리를 기반으로 한 계층적 군집화 결과를 나타낸다.
도 20은 SSE와 실루엣 통계량의 개념을 나타낸다.
도 21은 DTW distance를 기반으로 한 계층적 군집화 결과를 나타낸다.
도 22는 기울기를 대표 값으로 k-means를 수행한 분리형 군집화 결과를 나타낸다.
도 23은 이상 분류 라벨링 결과(3 class)를 나타낸다.
도 24는 k-NNDD 모델을 통해 novelty score를 계산하는 것을 나타낸다.
도 25는 k-NNDD 모델을 이용한 이상 분류 과정을 나타낸다.
도 26은 k-NNDD 모델 분류 결과의 예를 나타낸다.
도 27은 RNN-LSTM 모델의 개념을 나타낸다.
도 28은 RNN-LSTM 모델을 통해 이상을 분류하는 과정을 나타낸다.
도 29는 RNN-LSTM 모델의 Accuracy와 Cost의 예를 보여준다.
도 30은 MLP 모델의 다층신경망 모델을 나타낸다.
도 31은 데이터셋에 따라 가중치를 반영하여 k-NNDD, RNN, MLP 모델을 선정하는 것의 예를 나타낸다.
도 32는 MLP 모델을 통해 이상 진단 지표 P_iL을 계산하는 과정을 나타낸다.
도 33은 MLP 모델을 이용한 이상 진단 과정을 나타낸다.
도 34는 MLP 모델 적용 결과의 예를 나타낸다.Figure 1 shows the overall configuration of the machine abnormality detection / classification / diagnosis system based on machine learning in accordance with an embodiment of the present invention.
Figure 2 shows the overall process of the machine learning-based facility abnormality detection / classification / diagnostic system according to an embodiment of the present invention.
3 shows an example of a data preprocessing process.
4 shows an example of data collected from a chemical vapor deposition (CVD) facility and a stocker facility.
5 is a diagram illustrating a dimensional reduction process.
6 shows an example of primary dimension reduction as a result of dimension reduction.
7 shows an example of secondary dimension reduction as a result of dimension reduction.
8 shows an example of a process of setting a stabilization section.
9 shows an example of feature extraction.
10 shows an example of an anomaly detection process.
11 illustrates a process of detecting an abnormality by applying an SVR model.
12 shows an example of SVR kernels that can be selected in the SVR learning process.
13 shows examples of Golden Line and Margin.
14 shows an example of data detected above an SVR prediction result.
15 shows a process of detecting an abnormality by applying a k-NNDD model.
16 is a diagram illustrating the concept of a DTW algorithm.
FIG. 17 shows that normal observations are obtained from new observations through hierarchical clustering.
18 illustrates a concept of an outlier detection technique of k-NNDD using a nonparametric estimation method.
19 shows the determination of the number of clusters by selecting the elbow point with the SSE and the hierarchical clustering results based on the DTW distance.
20 illustrates the concept of SSE and silhouette statistics.
21 shows a hierarchical clustering result based on the DTW distance.
22 shows the results of isolated clustering in which k-means were performed with a representative value of the slope.
Fig. 23 shows abnormal classification labeling results (3 class).
24 illustrates calculating the novelty score through the k-NNDD model.
25 shows an abnormal classification process using a k-NNDD model.
26 shows an example of a k-NNDD model classification result.
27 shows a concept of an RNN-LSTM model.
28 illustrates a process of classifying an abnormality through the RNN-LSTM model.
29 shows an example of the accuracy and the cost of the RNN-LSTM model.
30 shows a multilayer neural network model of the MLP model.
31 shows an example of selecting a k-NNDD, RNN, and MLP model by reflecting weights according to a data set.
32 illustrates a process of calculating the abnormal diagnosis index P _iL through the MLP model.
33 shows an abnormal diagnosis process using the MLP model.
34 shows an example of an MLP model application result.

이하에서는 도면을 참조하여 본 발명의 바람직한 실시예들을 상세히 설명한다. 이하 설명 및 첨부된 도면들에서 실질적으로 동일한 구성요소들은 각각 동일한 부호들로 나타냄으로써 중복 설명을 생략하기로 한다. 또한 본 발명을 설명함에 있어 관련된 공지기능 혹은 구성에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우 그에 대한 상세한 설명은 생략하기로 한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description and the accompanying drawings, substantially identical components are represented by the same reference numerals, and thus, redundant descriptions will be omitted. In addition, in describing the present invention, when it is determined that a detailed description of related known functions or configurations may unnecessarily obscure the subject matter of the present invention, a detailed description thereof will be omitted.

도 1은 본 발명의 일 실시예에 따른 기계 학습 기반의 설비 이상 탐지/분류/진단 시스템의 전체적인 구성을 나타낸다.Figure 1 shows the overall configuration of the machine abnormality detection / classification / diagnosis system based on machine learning in accordance with an embodiment of the present invention.

본 발명의 일 실시예에 따른 설비 이상 탐지/분류/진단 시스템은 설비의 이상을 실시간으로 탐지하고 분류한 후 이상이 발생한 원인을 진단하는 시스템으로서, 데이터 전처리, 특질 추출, 이상 탐지, 이상 분류 라벨링, 이상 분류, 이상 진단의 과정을 포함한다.Facility fault detection / classification / diagnosis system according to an embodiment of the present invention is a system for detecting and classifying facility faults in real time and diagnosing the cause of the fault, including data preprocessing, feature extraction, fault detection, fault classification labeling. Includes, process of classification, diagnosis of abnormalities.

도 2는 본 발명의 일 실시예에 따른 기계 학습 기반의 설비 이상 탐지/분류/진단 시스템의 전체 공정도를 나타낸다. Figure 2 shows the overall process of the machine learning-based facility abnormality detection / classification / diagnostic system according to an embodiment of the present invention.

본 발명의 일 실시예에 따른 설비 이상 탐지/분류/진단 시스템은 설비 타입 별로 분석 모델을 선택하여 적용할 수 있는 유연성 있고 확장 가능한 프로세스로 구성된다. 또한 이상 탐지, 분류, 진단 과정은 각 모델 별로 학습(learning)과 예측(prediction)의 두 페이스(phase)로 구성된다.Facility failure detection / classification / diagnosis system according to an embodiment of the present invention is composed of a flexible and scalable process that can be applied to select the analysis model for each type of equipment. In addition, anomaly detection, classification, and diagnosis process consists of two phases of learning and prediction for each model.

데이터 전처리 과정은 데이터 수집, 차원 축소, 안정화구간 설정 과정을 포함한다.The data preprocessing process includes data collection, dimension reduction, and stabilization interval setting.

특질 추출 과정은 데이터 영역 별 지표 추출 과정을 포함한다.The feature extraction process includes an index extraction process for each data area.

이상 탐지 과정은 SVR 모델 적용, k-NNDD 모델 적용 과정을 포함한다.The anomaly detection process includes the SVR model application and the k-NNDD model application process.

이상 분류 라벨링 과정은 군집 수 결정 및 군집화 과정을 포함한다.The abnormal classification labeling process includes cluster number determination and clustering.

이상 분류 과정은 k-NNDD 모델 적용, RNN-LSTM 모델 적용, MLP 모델 적용 과정을 포함한다.The classification process includes applying k-NNDD model, applying RNN-LSTM model, and applying MLP model.

이상 진단 과정은 MLP 모델 적용 과정을 포함한다.The abnormal diagnosis process includes applying the MLP model.

도 3은 데이터 전처리 과정의 일 예를 보여준다. 3 shows an example of a data preprocessing process.

데이터 전처리 과정에서는, 설비에서 수집되는 대량의 계측 데이터를 수집하여 유의미한 변수를 추출하고, 해당 변수의 데이터 내에서 안정화 구간을 사전 정의된 규칙에 의해 자동으로 설정한다. In the data preprocessing process, a large amount of measurement data collected at the facility is collected to extract meaningful variables, and the stabilization interval is automatically set according to a predefined rule in the data of the variable.

도 4는 CVD(Chemical vapor Deposition) 설비와 스토커(Stocker) 설비로부터 수집되는 데이터의 일 예를 보여준다. 설비 타입에 따라 저장되는 데이터의 형태와 안정화 구간에 대한 정의가 다르기 때문에, 데이터 수집 과정에서 설비 별 초기 데이터를 로딩하기 위한 프로그래밍이 필요하다.4 shows an example of data collected from a chemical vapor deposition (CVD) facility and a stocker facility. Since the types of data stored and the definition of stabilization intervals differ according to the facility type, programming to load initial data for each facility is necessary in the data collection process.

도 5는 차원 축소 과정을 설명하는 도면이다. 5 is a diagram illustrating a dimensional reduction process.

변수가 많을 경우 모델을 학습하고 예측하는 연산의 계산 복잡도가 증가하고, 노이즈로 인한 성능 손실 발생할 수 있으므로 비슷한 결과를 산출할 수 있는 최소의 변수를 선택한다. 변수 선택은 도메인 지식이 풍부한 엔지니어의 정성적 판단이 가장 중요하다. 정성적 판단이 어려울 경우 표준화 프로세스(종속변수 제거 및 특이값 분해)를 적용하여 정량적 기준으로 변수를 선택할 수 있다.If there are many variables, the computational complexity of learning and predicting the model increases, and performance loss due to noise may occur, so select the minimum variable that can produce similar results. The choice of variables is most important for qualitative judgment by engineers with domain knowledge. If qualitative judgment is difficult, variables can be selected on a quantitative basis by applying standardization processes (removal of dependent variables and singular value decomposition).

도 5를 참조하면, 1차로, 전체 데이터에서 종속 변수를 제거하여 차원을 축소한다. Referring to FIG. 5, first, the dimension is reduced by removing the dependent variable from the entire data.

VIF 값은 다음 수학식으로 표현된다. The VIF value is represented by the following equation.

VIF(Variance Inflation Factor)는 회귀분석에서 변수들 간 독립성을 확인하기 위해 쓰이는 지표로 해당 변수의 VIF 값이 높을수록 다른 독립변수들과 상관관계가 높음을 의미한다.The variation inflation factor (VIF) is used to check the independence between variables in regression analysis. The higher the VIF value of the variable, the higher the correlation with other independent variables.

여기서, 단계적 VIF 방법을 사용할 수 있다. 즉, 1. 각 변수들의 VIF 값을 모두 측정하고; 2. 가장 큰 VIF 값을 가진 변수를 제거하고; 3. 남은 변수들의 VIF 값을 측정하고; 4. 2,3 과정을 모든 변수가 임계값(예컨대 10) 이하가 될 때까지 반복하여 최종적으로 남은 변수를 선택한다. Here, a stepped VIF method can be used. That is, 1. Measure all VIF values of each variable; 2. remove the variable with the largest VIF value; 3. measure the VIF values of the remaining variables; 4. Repeat steps 2 and 3 until all variables are below the threshold (eg 10) to select the last remaining variable.

2차로, 특이값 분해 과정을 통해, 종속변수가 제거된 데이터에서 차원을 축소한다. SVD(Singular Value Decomposition)에서, 특이값 분해는 행렬 A를 회전부분 V, U와 증폭부분 ∑(특이값)로 분해하여 ∑의 원소들 중 큰 값들을 추출하여 차원을 축소한다. 이때 Truncated SVD 방법을 사용할 수 있다. A'와 코사인 유사도가 가장 높은 X들로 최종 변수를 선택한다. Secondly, the singular value decomposition process reduces dimensions in the data from which the dependent variable is removed. In SVD (Singular Value Decomposition), singular value decomposition decomposes matrix A into rotation parts V, U and amplification part ∑ (specific values) to extract the larger values of the elements of ∑ to reduce the dimension. At this time, the Truncated SVD method can be used. The final variable is chosen as X with the highest cosine similarity to A '.

도 6은 차원 축소 결과로서, 1차 차원 축소의 예를 보여준다. 도 6을 참조하면, VIF 테스트로 171개 변수 중 49개 변수를 추출하였다. 6 shows an example of primary dimension reduction as a result of dimension reduction. Referring to FIG. 6, 49 variables of the 171 variables were extracted by the VIF test.

도 7은 차원 축소 결과로서, 2차 차원 축소의 예를 보여준다. 도 7을 참조하면, SVD 테스트로 49개 변수 중 40개 변수를 추출하였다. 2차 차원 축소를 통해 유의미한 변수가 추출되었다. 7 shows an example of secondary dimension reduction as a result of dimension reduction. Referring to FIG. 7, 40 variables of 49 variables were extracted by the SVD test. Significant variables were extracted by the second dimension reduction.

도 8은 안정화 구간을 설정하는 과정의 일 예를 보여준다. 8 shows an example of a process of setting a stabilization section.

안정화 구간 설정 과정에서, 계측 데이터의 유형에 따라 주요 관리 대상이 되는 안정화 구간의 정의는 다르나 일반화시킬 수 있는 기준을 적용하여 안정화 구간을 설정할 수 있다.During the stabilization interval setting process, Depending on the type of measurement data, the definition of the stabilization period, which is the main management target, is different, but the stabilization period can be set by applying generalizable criteria.

구체적으로, 전체 데이터를 X-bar Chart에 입력하여 UCL, LCL값을 추출하고, 관리도 내에 처음(다만 5회 연속 등으로 수정 가능)으로 데이터가 들어오는 지점을 안정화 구간의 시작으로 설정하고, 한 공정신호의 영역을 시작, 안정화구간, 끝의 세 스텝으로 구분한다.Specifically, extract the UCL and LCL values by inputting the entire data into the X-bar chart, and set the point where the data enters first in the control chart (but can be modified five times in a row) as the start of the stabilization period. The process signal area is divided into three steps: start, stabilization section and end.

도 9는 특질 추출의 일 예를 보여준다.9 shows an example of feature extraction.

데이터 영역 별 지표 추출로서, 선택한 변수의 계측 데이터는 시계열 데이터이므로 다양한 분석 모델을 적용하기 위해 한 사이클의 계측 데이터를 대표할 수 있는 특질(설비 타입에 따라 중요도가 다를 수 있음)을 추출한다.As index data extraction by data area, measurement data of selected variables are time series data, so to apply various analytical models, features that can represent measurement data of one cycle (importance may vary depending on equipment type) are extracted.

이상 탐지 과정은 이상 분류와 진단을 위해 대량의 데이터로부터 이상 탐지 모델을 생성하여 이상 데이터를 추출하는 과정이다.An anomaly detection process is a process of extracting anomaly data by generating an anomaly detection model from a large amount of data for classification and diagnosis of anomalies.

구체적으로, One class classification 중 경계선을 통한 분류 방법(boundary method)인 SVR과 거리기반의 Novelty detection 방법인 k-NNDD 모델을 적용하여 공통적으로 탐지된 이상데이터를 수집하여 라벨링한다. 이때 0은 정상을, 1은 이상을 나타낸다.Specifically, the SVR, which is a boundary method, and the k-NNDD model, which is a distance-based novel detection method, are collected and labeled in common among one class classification. 0 represents normal and 1 represents abnormal.

도 10은 이상 탐지 과정의 일 예를 보여준다. 10 shows an example of an anomaly detection process.

이상 탐지 과정에서는 SVR 모델을 적용한다. 정상 상태의 주기신호에 대한 평균을 나태는 함수(Golden Line)를 추정하기 위하여 비선형 regression 기법인 SVR 모델을 적용한다. 추정된 함수에 관리한계선(Margin)을 설정하고 이 경계를 벗어난 데이터(Outlier)의 비율(예컨대 Threshold, 10% 등)로 이상 여부를 판단할 수 있다.In the anomaly detection process, the SVR model is applied. The SVR model, a nonlinear regression technique, is applied to estimate the Golden Line, which represents the average of steady-state periodic signals. A margin of control may be set in the estimated function, and the abnormality may be determined by the ratio of data outliers (eg, Threshold, 10%, etc.).

도 11은 SVR 모델을 적용하여 이상을 탐지하는 과정을 나타낸다. 11 illustrates a process of detecting an abnormality by applying an SVR model.

데이터 전처리 과정에서 데이터 표준화를 수행한다. SVR 모델이 다양한 타입의 계측 데이터에 범용적으로 적용될 수 있도록 데이터를 표준화한다.Data normalization is performed during data preprocessing. Standardize the data so that the SVR model can be universally applied to various types of measurement data.

SVR 학습 과정에서, SVR 커널을 선택하고, 커널 별 파라미터를 설정하고, Golden Line을 설정하고, Margin을 설정한다. 이때 데이터에 따라 적합한 SVR 커널을 선택하고 데이터를 학습하여 정상 데이터의 대표 함수와 관리한계선(Margin)을 설정한다.In the SVR learning process, the SVR kernel is selected, kernel-specific parameters are set, Golden Line is set, and Margin is set. At this time, the appropriate SVR kernel is selected according to the data and the data are learned to set the representative function and the margin of management of the normal data.

SVR 예측 과정에서, 이상 데이터 수를 추출하고 Threshold 기준 이상을 탐지한다. 학습 단계에서 설정된 관리한계선(Margin)에서 벗어나는 데이터의 개수를 수집하여 사용자 정의된 Threshold를 기준으로 이상 여부를 판단하여 라벨링한다.In the SVR prediction process, the number of abnormal data is extracted and the threshold threshold is detected. The number of data that deviates from the margin set in the learning stage is collected and the label is determined based on the user-defined threshold.

도 12는 SVR 학습 과정에서 선택할 수 있는 SVR 커널들의 예를 보여준다. Laplacian Kernel은 Parameter가 분산이므로 비교적 처리 속도가 빠르고 시계열 데이터 특성에 적합하다.12 shows an example of SVR kernels that can be selected in the SVR learning process. Laplacian Kernel is distributed in parameters, so it is relatively fast and suitable for time series data characteristics.

SVR 학습 과정에서, Golden Line 설정 시, N개 cycle의 SVR-line (Laplacian Kernel)의 Median 값을 Golden Line으로 정의할 수 있다.In SVR learning process, when setting Golden Line, Median value of SVR-line (Laplacian Kernel) of N cycles can be defined as Golden Line.

Margin 설정 시, N초 별 SVR 값에 Whiskers (IQR, Interquartile Range에 1.5를 곱한 값) 방법을 적용하여 Margin으로 설정하고 이 경계를 벗어나는 데이터를 이상(비율로 설정)으로 탐지할 수 있다.When setting Margin, Whiskers (IQR, Interquartile Range multiplied by 1.5) method is applied to SVR value for every N seconds, and it can be set to Margin, and data beyond this boundary can be detected as abnormal (set as ratio).

도 13은 Golden Line과 Margin의 예를 보여준다.13 shows examples of Golden Line and Margin.

SVR 예측 과정에서, 학습 단계에서 설정된 Margin을 벗어나는 데이터를 이상(비율로 설정, default : 10%)으로 탐지할 수 있다.In the SVR prediction process, data beyond the margin set in the learning phase can be detected as an abnormality (set as a ratio, default: 10%).

도 14는 SVR 예측 결과 이상으로 탐지된 데이터의 예를 보여준다.14 shows an example of data detected above an SVR prediction result.

또한 이상 탐지 과정에서는 k-NNDD (k-Nearest Neighbor Data Description) 모델을 적용한다.In addition, anomaly detection process uses a k-NDD (k-Nearest Neighbor Data Description) model.

k-NNDD 모델은 거리 기반의 이상 탐지 방법으로 novelty score를 계산하여 정상 집단에 속하는지 여부를 판단한다. Novelty score란 새로운 관측치와 이웃패턴 간의 거리를 비율적으로 계산한 분류 척도이며, 새로운 관측치와 이웃 패턴간의 거리가 멀수록 이상치로 분류할 확률을 높게 추정한다.The k-NNDD model uses the distance-based anomaly detection method to calculate novelty scores to determine whether they belong to the normal population. Novelty score is a classification measure that calculates the distance between a new observation and a neighboring pattern proportionally. The greater the distance between a new observation and a neighboring pattern is, the more likely it is to be classified as an outlier.

도 15는 k-NNDD 모델을 적용하여 이상을 탐지하는 과정을 나타낸다.15 shows a process of detecting an abnormality by applying a k-NNDD model.

데이터 전처리 과정에서 DTW distance를 계산한다. DTW distance는 k-NNDD 모델에 사용할 거리의 종류 중 하나이다. 설비의 타입에 따라 적합한 거리의 종류를 결정할 수 있다.DTW distance is calculated during data preprocessing. DTW distance is one of the types of distance to be used in the k-NNDD model. Depending on the type of installation, the type of distance that is suitable can be determined.

k-NNDD 학습 과정에서, 계측정 군집화를 수행하고, 정상 모델을 설정하고, k와 Threshold를 결정한다. 계층적 군집화를 통해 주어진 학습 데이터에서 정상 모델을 분류할 수 있고, 데이터 유형 별로 시행착오 테스트를 통해 k(비교할 이웃 데이터 수)와 Threshold(이상 탐지 임계값)를 결정할 수 있다. In k-NNDD training, we perform metrology clustering, set up a normal model, and determine k and threshold. Hierarchical clustering can classify normal models from a given training data, and by trial and error testing for each data type, k (number of neighboring data to compare) and Threshold (anomaly detection threshold) can be determined.

k-NNDD 예측 과정에서, Novelty score를 계산하고, 이상을 탐지한다. 학습 단계에서 설정된 k개의 이웃 데이터로 Novelty score를 계산하여 Threshold를 초과할 경우 이상으로 탐지한다.In the k-NNDD prediction process, a Novelty score is calculated and anomalies are detected. The Novelty score is calculated from the k neighbor data set in the learning phase and detected as abnormal when the threshold is exceeded.

DTW(Dynamic Time Warping) 알고리즘은 두 개의 시계열 데이터 간에 유사도를 알아내기 위한 알고리즘 중 하나로 동적 패턴의 시변성을 반영한다. DTW 알고리즘의 일반적인 계산 복잡도는 O(N²)으로 대용량 트랜잭션에 효율적이다.Dynamic Time Warping (DTW) algorithm is one of the algorithms for finding similarity between two time series data and reflects the time-varying dynamic pattern. The general computational complexity of the DTW algorithm is O (N ² ), which is efficient for large transactions.

도 16은 DTW 알고리즘의 개념을 나타내는 도면이다.16 is a diagram illustrating the concept of a DTW algorithm.

DTW distance로 계층적 군집화(Hierarchical clustering)를 통해 주어진 학습 데이터에서 정상 모델을 분류할 수 있다.With DTW distance, hierarchical clustering can be used to classify normal models from given training data.

도 17은 계층적 군집화를 통해 새로운 관측치로부터 정상 관측치가 얻어지는 것을 나타낸 도면이다.FIG. 17 shows that normal observations are obtained from new observations through hierarchical clustering.

k-NNDD(k-Nearest Neighbor Data Description) 알고리즘은 One class classification 중 경계선을 통한 분류 방법(boundary method)으로 인접한 관측치 사이의 거리를 기반으로 이상 여부를 판단한다. K-FDC 방법론에서는 DTW distance를 기준 거리로 사용한다.The k-Nearest Neighbor Data Description (k-NNDD) algorithm is a boundary method of one class classification to determine whether an abnormality is based on the distance between adjacent observations. The K-FDC methodology uses DTW distance as the reference distance.

도 18은 비모수 추정 방법을 활용한 k-NNDD의 이상치 탐지 기법의 개념을 나타낸다.18 illustrates a concept of an outlier detection technique of k-NNDD using a nonparametric estimation method.

다음으로, 이상 분류 라벨링 과정을 설명한다. Next, the abnormal classification labeling process will be described.

이상 분류에 사용할 k-NNDD 모델과 RNN-LSTM 모델은 사전에 정의된 분류 class가 있어야 하는(Labeling) 지도 학습 (Supervised Learning) 방법이므로, 비지도 학습(Unsupervised Learning) 방법인 군집화(Clustering)를 통해 앞 단계(SVR, k-NNDD)에서 탐지한 이상 데이터의 분류 class를 생성한다. 최적의 군집 수는 군집화 평가지표(SSE, 실루엣 통계량 등)를 도식화하여 지표에 따른 Elbow point로 결정한다. 결정된 군집 수(K)로 군집화(Clustering)하여 1~K로 한 공정 사이클 데이터를 Labeling 한다. 군집화 방법은 한 공정 사이클 데이터의 DTW distance, 평균값, 최대값, 중앙값 등 사용하는 대표 값에 적합한 방법(계층적 군집화, 분리형 군집화 등)을 테스트를 통해 선정할 필요가 있다.The k-NNDD model and the RNN-LSTM model to be used for anomaly classification are supervised learning methods that must have a predefined classification class. Therefore, clustering, an unsupervised learning method, Create classification class of abnormal data detected in the previous step (SVR, k-NNDD). The optimal number of clusters is determined by the elbow point according to the index by plotting clustering evaluation indexes (SSE, silhouette statistics, etc.). Clustering with the determined number of clusters (K) to label the process cycle data from 1 to K. The clustering method needs to select a test method (hierarchical clustering, separate clustering, etc.) suitable for representative values to be used such as DTW distance, average value, maximum value, and median value of a process cycle data.

도 19는 SSE로 Elbow point를 선택하여 군집수를 결정하는 것과 DTW 거리를 기반으로 한 계층적 군집화 결과를 나타낸다. 19 shows the determination of the number of clusters by selecting the elbow point with the SSE and the hierarchical clustering results based on the DTW distance.

군집 수 결정에서, 군집 성능 평가 지표(SSE, 실루엣 통계량 등)를 도식화하여 최적의 군집 수를 선택한다.In determining the number of clusters, the cluster performance evaluation indicators (SSE, silhouette statistics, etc.) are plotted to select the optimal number of clusters.

도 20은 SSE와 실루엣 통계량의 개념을 나타낸다. 20 illustrates the concept of SSE and silhouette statistics.

도 21은 DTW distance를 기반으로 한 계층적 군집화 결과를 나타낸다.21 shows a hierarchical clustering result based on the DTW distance.

도 22는 기울기를 대표 값으로 k-means를 수행한 분리형 군집화 결과를 나타낸다. 22 shows the results of isolated clustering in which k-means were performed with a representative value of the slope.

도 23은 이상 분류 라벨링 결과(3 class)를 나타낸다. Fig. 23 shows abnormal classification labeling results (3 class).

다음으로, 이상 분류 과정을 설명한다. Next, the abnormal classification process will be described.

이상 분류 과정에서, k-NNDD(k-Nearest Neighbor Data Description) 모델을 적용한다. k-NNDD 모델은, 새로운 관측치가 있을 때, 라벨링된 이상 분류 모델에 대해 class 별로 k-NNDD를 적용하여 novelty score(d1/d2)값이 가장 작은 순서로 해당 class에 속할 우선순위를 부여한다.In the classification process, a k-Nearest Neighbor Data Description (k-NNDD) model is applied. When there is a new observation, the k-NNDD model applies k-NNDD for each class to the labeled abnormal classification model, and gives priority to belong to the class in the order of the lowest novelty score (d1 / d2).

도 24는 k-NNDD 모델을 통해 novelty score를 계산하는 것을 나타낸다.24 illustrates calculating the novelty score through the k-NNDD model.

도 25는 k-NNDD 모델을 이용한 이상 분류 과정을 나타낸다.25 shows an abnormal classification process using a k-NNDD model.

데이터 전처리 과정에서, DTW distance matrix를 계산한다. 즉, 이상 종류 별로 Labeling 된 모델 내 DTW distance matrix를 계산한다.In the data preprocessing, the DTW distance matrix is calculated. That is, the DTW distance matrix in the labeled model is calculated for each error type.

k-NNDD 학습 과정에서 k 값을 결정하는데, k 값(인접한 관측치의 개수)은 크기가 최소인 모델 크기의 예컨대 20%(최소 5 이상)를 기준으로 할 수 있다. In the k-NNDD learning process, the k value is determined, and the k value (the number of adjacent observations) may be based on, for example, 20% (minimum 5 or more) of the smallest model size.

k-NNDD 예측 과정에서, 학습 단계에서 설정된 k 개의 이웃 데이터로 Novelty score를 계산하여 가장 작은 값을 가지는 순서로 class의 우선순위를 부여한다.In the k-NNDD prediction process, the Novelty score is calculated from the k neighbor data set in the learning step, and the class priority is given in the order of having the smallest value.

k-NNDD 모델 분류 결과는, 새로운 관측치가 있을 때, 학습된 이상 분류 모델에 대해 class 별로 k-NNDD를 적용하여 novelty score(d1/d2)값이 가장 작은 순서로 해당 class에 속할 우선순위를 부여한다.The k-NNDD model classification result gives priority to belong to the class in the order of the lowest novelty score (d1 / d2) by applying k-NNDD for each class to the trained abnormal classification model when there are new observations. do.

도 26은 k-NNDD 모델 분류 결과의 예를 나타낸다.26 shows an example of a k-NNDD model classification result.

또한 이상 분류 과정에서, RNN-LSTM 모델을 적용한다. 즉, Deep Learning 모델 중 순차적인 정보를 처리하는데 적합한 RNN-LSTM(Recurrent Neural Network-Long Short Term Memory) 모델에 Softmax 함수를 조합하여 이상을 분류한다. In addition, in the classification process, the RNN-LSTM model is applied. That is, the error is classified by combining a Softmax function with a Recurrent Neural Network-Long Short Term Memory (RNN-LSTM) model suitable for processing sequential information among deep learning models.

도 27은 RNN-LSTM 모델의 개념을 나타낸다. 27 shows a concept of an RNN-LSTM model.

RNN-LSTM 모델에서, 라벨링된 데이터를 훈련/검증/테스트 셋으로 나누어 RNN-LSTM 모델을 훈련시키고 성능을 검증한 후 실시간으로 데이터를 분류한다. 이때 반복 학습을 통해 Hyper parameter를 조정하는 작업이 중요하다.In the RNN-LSTM model, the labeled data is divided into training / verification / test sets to train the RNN-LSTM model, verify its performance, and classify the data in real time. At this time, it is important to adjust Hyper parameters through iterative learning.

도 28은 RNN-LSTM 모델을 통해 이상을 분류하는 과정을 나타낸다.28 illustrates a process of classifying an abnormality through the RNN-LSTM model.

Hyper parameter 설정 과정에서, RNN-LSTM 구조를 형성하고, Training 방안을 구성한다. 여기서, 최적의 RNN-LSTM 구조를 결정하기 위하여 Cell stack size, hidden layer, put size 등의 parameter를 설정한다. 학습을 위한 parameter로는 batch size, Hidden layer, learning rate, Max iteration, drop out 등이 있다.In the process of hyper parameter setting, the RNN-LSTM structure is formed and the training method is configured. Here, parameters such as cell stack size, hidden layer, and put size are set to determine the optimal RNN-LSTM structure. Parameters for learning include batch size, hidden layer, learning rate, max iteration, drop out, etc.

RNN-LSTM 학습 과정에서, Training data로 학습을 수행하고, Validation Data로 과적합을 체크하고, 최적의 RNN-LSTM 구조를 확정한다. 이때 초기 구성한 Hyper parameter를 학습과 검증을 통해 조정하여 최적의 RNN-LSTM 구조를 확정한다.In the RNN-LSTM learning process, the training is performed with training data, the overfit is checked with validation data, and the optimal RNN-LSTM structure is determined. At this time, the optimally configured Hyper parameter is adjusted through learning and verification to determine the optimal RNN-LSTM structure.

RNN-LSTM 예측 과정에서, Test data로 모델을 검증하고, 실시간 이상 분류를 수행한다. 즉, Test 데이터를 바탕으로 본 모델의 성능을 검증한 후 실시간 데이터로 분류한다.During the RNN-LSTM prediction process, the model is verified with test data and real-time abnormal classification is performed. That is, after verifying the performance of this model based on the test data, it is classified into real-time data.

RNN-LSTM 모델의 성능은, 학습 횟수가 증가할수록 Cost는 감소하고 정확도는 증가한다. Cost는 학습을 통해 줄이고자 하는 오차의 전체 합 / Batch Size 이고, Accuracy는 예측 Class와 실제 Class와의 정확도이다. The performance of the RNN-LSTM model is that as the number of learning increases, the cost decreases and the accuracy increases. Cost is the total sum of the errors to be reduced through learning / batch size, and Accuracy is the accuracy of the prediction class and the actual class.

도 29는 RNN-LSTM 모델의 Accuracy와 Cost의 예를 보여준다.29 shows an example of the accuracy and the cost of the RNN-LSTM model.

또한 이상 분류 과정에서, MLP(Multilayer Perceptron) 모델을 적용한다. MLP 모델은 Regression weight를 입력으로 하는 다층신경망(Multilayer Perceptron) 모델을 학습시켜 이상을 분류한다.In addition, in the classification process, a MLP (Multilayer Perceptron) model is applied. MLP model We classify abnormalities by learning a multilayer perceptron model with input regression weights.

도 30은 MLP 모델의 다층신경망 모델을 나타낸다.30 shows a multilayer neural network model of the MLP model.

이상 분류 종합 과정에서, k-NNDD, RNN, MLP 모델로 분류한 결과에 사용자 정의 가중치(1:2:1 등)를 반영하여 최종적인 이상 분류 class를 확정한다. 이때 데이터 타입에 따라 적합한 분류 모델을 선정하는 테스트가 필요하다.In the error classification synthesis process, the final classification class is determined by reflecting the user-defined weight (1: 2: 1, etc.) in the results classified by the k-NNDD, RNN, and MLP models. At this time, a test for selecting a suitable classification model according to the data type is required.

도 31은 데이터셋에 따라 가중치를 반영하여 k-NNDD, RNN, MLP 모델을 선정하는 것의 예를 나타낸다.31 shows an example of selecting a k-NNDD, RNN, and MLP model by reflecting weights according to a data set.

다음으로, 이상 진단 과정을 설명한다. Next, the abnormal diagnosis process will be described.

이상 진단 과정에서는, MLP(Multilayer Perceptron) 모델을 적용한다. Regression weight를 입력으로 하는 다층신경망(Multilayer Perceptron) 모델을 학습시켜 추출된 weight로 이상 진단 지표 P_iL을 계산하여 이상의 원인이 되는 변수의 우선순위를 산출한다. In the abnormal diagnosis process, a multilayer perceptron (MLP) model is applied. By learning the multilayer perceptron model with input regression weight, the error diagnosis index P _iL is calculated from the extracted weight to calculate the priority of variables that cause the abnormality.

도 32는 MLP 모델을 통해 이상 진단 지표 P_iL을 계산하는 과정을 나타낸다. 32 illustrates a process of calculating the abnormal diagnosis index P _iL through the MLP model.

MLP 모델은 인공신경망을 이용한 이상 진단 기법으로 변수의 weight 값을 조합한 지표인 P_iL 값을 계산하여 해당 cycle이 특정 군집으로 분류되는데 가장 크게 기여한 변수를 찾아낸다. P_iL 값이 높을수록 해당 변수의 영향력이 크다고 볼 수 있으며 다른 군집 cycle의 패턴과 차이를 보인다.The MLP model is an anomaly diagnosis technique using an artificial neural network that calculates the P _iL value, which is an index that combines the weight values of variables, and finds the variables that contributed the most to the classification of a given cluster. The higher the P _iL value, the greater the influence of the variable, which is different from other cluster cycle patterns.

도 33은 MLP 모델을 이용한 이상 진단 과정을 나타낸다.33 shows an abnormal diagnosis process using the MLP model.

데이터 입력 정의 과정에서, 독립변수와 종속변수를 정의하고, 데이터 표준화를 수행하고, 선형 회귀분석의 계수(β)를 추출하고, β를 input으로 정의한다. 즉, 독립변수(센서 데이터 등)와 종속변수(Torque 값 등)를 정의하고 선형 회귀분석을 수행하여 추출된 계수(β)를 MLP 모델의 input으로 사용한다. In the data input definition process, independent and dependent variables are defined, data normalization is performed, coefficient (β) of linear regression analysis is extracted, and β is defined as input. In other words, the independent variables (sensor data, etc.) and dependent variables (Torque values, etc.) are defined and linear regression analysis is used to use the extracted coefficients (β) as input to the MLP model.

MLP 학습 과정에서, Training Data로 학습하고, Validation data로 모델을 검증하고, 최적의 MLP 구조를 결정하고, Test Data로 모델을 검증한다. 여기서, hidden layer, node의 수를 조정하여 최적의 MLP 구조를 결정하고 Test data로 검증한다.In the MLP learning process, we train with training data, validate the model with validation data, determine the optimal MLP structure, and verify the model with test data. Here, the optimal MLP structure is determined by adjusting the number of hidden layers and nodes and verified with test data.

MLP 진단 과정에서, 변수 별 weight로 P_iL을 계산하고, 데이터에 따라 P_iL을 해석하고, 영향력이 큰 독립변수를 선택한다. 즉, 이상 진단의 지표로 P_iL 값을 계산하여 데이터 타입에 따른 해석 방법을 결정한다. 이 해석 방법에 따라 가장 큰 영향력을 미친 독립 변수를 선택한다. In the MLP diagnosis process, P _iL is calculated by the weight of each variable, P _iL is interpreted according to the data, and an independent variable having a high influence is selected. That is, the P _iL value is calculated as an indicator of abnormality diagnosis to determine an analysis method according to the data type. This method of analysis selects the independent variables that have the greatest impact.

MLP(Multilayer Perceptron) 모델 적용 결과, P_iL지표를 통해 군집 별 영향력이 높은 변수를 정량적으로 판단할 수 있다. As a result of the application of the MLP (Multilayer Perceptron) model, it is possible to quantitatively determine the variables with high impact by clusters through the P _iL index.

도 34는 MLP 모델 적용 결과의 예를 나타낸다.34 shows an example of an MLP model application result.

본 발명의 실시예에 따르면, 설비 이상 진단을 자동화하여 원인이 되는 변수를 신속히 파악함으로써 MTTR(평균수리시간)을 감소시킬 수 있다. 또한, 설비 이상 패턴을 분류하고 예측함으로써 고장 전 사전 조치를 통해 MTBF(평균 고장 간격)을 증가시킬 수 있다. 또한, 설비의 이상이 발생하기 이전에 사전 조치를 함으로써 제품의 불량을 감소시킬 수 있다. According to an embodiment of the present invention, it is possible to reduce the MTTR (average repair time) by automating the diagnosis of equipment failure to quickly identify the causative variables. In addition, by classifying and predicting plant failure patterns, you can increase the MTBF (mean time between failures) by taking precautions before failure. In addition, it is possible to reduce product defects by taking precautions before facility failure occurs.

한편, 상술한 본 발명의 실시예들은 컴퓨터에서 실행될 수 있는 프로그램으로 작성가능하고, 컴퓨터로 읽을 수 있는 기록매체를 이용하여 상기 프로그램을 동작시키는 범용 디지털 컴퓨터에서 구현될 수 있다. 상기 컴퓨터로 읽을 수 있는 기록매체는 마그네틱 저장매체(예를 들면, 롬, 플로피 디스크, 하드 디스크 등), 광학적 판독 매체(예를 들면, 시디롬, 디브이디 등)와 같은 저장매체를 포함한다.Meanwhile, the above-described embodiments of the present invention can be written as a program that can be executed in a computer, and can be implemented in a general-purpose digital computer that operates the program using a computer-readable recording medium. The computer readable recording medium may include a storage medium such as a magnetic storage medium (eg, a ROM, a floppy disk, a hard disk, etc.) and an optical reading medium (eg, a CD-ROM, a DVD, etc.).

본 발명의 실시예들은 기능적인 블록 구성들 및 다양한 처리 단계들로 나타내어질 수 있다. 이러한 기능 블록들은 특정 기능들을 실행하는 다양한 개수의 하드웨어 또는/및 소프트웨어 구성들로 구현될 수 있다. 예를 들어, 실시예는 하나 이상의 마이크로프로세서들의 제어 또는 다른 제어 장치들에 의해서 다양한 기능들을 실행할 수 있는, 메모리, 프로세싱, 로직(logic), 룩 업 테이블(look-up table) 등과 같은 집적 회로 구성들을 채용할 수 있다. 본 발명에의 구성 요소들이 소프트웨어 프로그래밍 또는 소프트웨어 요소들로 실행될 수 있는 것과 유사하게, 실시예는 데이터 구조, 프로세스들, 루틴들 또는 다른 프로그래밍 구성들의 조합으로 구현되는 다양한 알고리즘을 포함하여, C, C++, 자바(Java), 어셈블러(assembler) 등과 같은 프로그래밍 또는 스크립팅 언어로 구현될 수 있다. 기능적인 측면들은 하나 이상의 프로세서들에서 실행되는 알고리즘으로 구현될 수 있다. 또한, 실시예는 전자적인 환경 설정, 신호 처리, 및/또는 데이터 처리 등을 위하여 종래 기술을 채용할 수 있다. "매커니즘", "요소", "수단", "구성"과 같은 용어는 넓게 사용될 수 있으며, 기계적이고 물리적인 구성들로서 한정되는 것은 아니다. 상기 용어는 프로세서 등과 연계하여 소프트웨어의 일련의 처리들(routines)의 의미를 포함할 수 있다.Embodiments of the present invention can be represented by functional block configurations and various processing steps. Such functional blocks may be implemented in various numbers of hardware or / and software configurations that perform particular functions. For example, an embodiment may comprise an integrated circuit configuration such as memory, processing, logic, look-up table, etc., that may execute various functions by the control of one or more microprocessors or other control devices. You can employ them. Similar to the components in the present invention may be implemented in software programming or software elements, embodiments include C, C ++, including various algorithms implemented in combinations of data structures, processes, routines or other programming constructs. It may be implemented in a programming or scripting language such as Java, assembler, or the like. Functional aspects may be implemented in algorithms running on one or more processors. In addition, embodiments may employ prior art for electronic configuration, signal processing, and / or data processing. Terms such as "mechanism", "element", "means", "configuration" can be used broadly and are not limited to mechanical and physical configurations. The term may include the meaning of a series of routines of software in conjunction with a processor or the like.

실시예에서 설명하는 특정 실행들은 일 실시예들로서, 어떠한 방법으로도 실시 예의 범위를 한정하는 것은 아니다. 명세서의 간결함을 위하여, 종래 전자적인 구성들, 제어 시스템들, 소프트웨어, 상기 시스템들의 다른 기능적인 측면들의 기재는 생략될 수 있다. 또한, 도면에 도시된 구성 요소들 간의 선들의 연결 또는 연결 부재들은 기능적인 연결 및/또는 물리적 또는 회로적 연결들을 예시적으로 나타낸 것으로서, 실제 장치에서는 대체 가능하거나 추가의 다양한 기능적인 연결, 물리적인 연결, 또는 회로 연결들로서 나타내어질 수 있다. 또한, "필수적인", "중요하게" 등과 같이 구체적인 언급이 없다면 본 발명의 적용을 위하여 반드시 필요한 구성 요소가 아닐 수 있다.Specific implementations described in the embodiments are examples, and do not limit the scope of the embodiments in any way. For brevity of description, descriptions of conventional electronic configurations, control systems, software, and other functional aspects of the systems may be omitted. In addition, the connection or connecting members of the lines between the components shown in the drawings by way of example shows a functional connection and / or physical or circuit connections, in the actual device replaceable or additional various functional connections, physical It may be represented as a connection, or circuit connections. In addition, unless otherwise specified, such as "essential", "important" may not be a necessary component for the application of the present invention.

이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. Those skilled in the art will appreciate that the present invention can be implemented in a modified form without departing from the essential features of the present invention. Therefore, the disclosed embodiments should be considered in descriptive sense only and not for purposes of limitation. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

Claims

As a machine learning based facility fault diagnosis method,
The above-mentioned abnormal diagnosis process using the MLP model, which calculates the priority of the variable causing the abnormality by calculating the abnormal diagnosis index P _iL using the extracted weight by learning a multilayer perceptron model that uses the regression weight as an input. Machine learning-based facility failure diagnosis method.

The method of claim 1,
The abnormal diagnosis process using the MLP model,
A data input definition process for defining independent and dependent variables, performing data normalization, extracting coefficient β of linear regression analysis, and defining β as input;
MLP learning process for learning with Training Data, validating the model with Validation data, determining the optimal MLP structure, and validating the model with Test Data; And
By weight and calculates the P _iL, according to the data analysis and the P _iL, diagnostic methods based on machine learning facility that includes more than the MLP diagnostic process to influence the selection of a large independent variable by variable.

The method of claim 2,
In the data input definition process, a machine learning-based facility fault diagnosis method using independent coefficients and dependent variables and performing linear regression analysis, uses the extracted coefficient β as an input of the MLP model.

The method of claim 2,
The MLP learning process is to determine the optimal MLP structure by adjusting the number of hidden layers, nodes, and verify with the test data, machine learning based facility fault diagnosis method.

The method of claim 2,
The MLP diagnosis process is to determine the analysis method according to the data type by calculating the P _iL value as an indicator of the fault diagnosis, machine learning based facility fault diagnosis method.