KR102378729B1

KR102378729B1 - Method and apparatus for determining optimized training scheme

Info

Publication number: KR102378729B1
Application number: KR1020210035435A
Authority: KR
Inventors: 장영민; 부이 반
Original assignee: 국민대학교산학협력단
Priority date: 2021-03-18
Filing date: 2021-03-18
Publication date: 2022-03-24
Also published as: KR102378729B9

Abstract

The present invention relates to a method and apparatus for determining an optimized training scheme. According to an embodiment of the present invention, the method for determining an optimized training scheme may comprise the steps of: loading an original data set related to a vibration signal generated by a machine; generating fake abnormal data using a generative adversarial network; generating a plurality of training data sets by changing the ratio of a part of the original data set and the fake abnormal data; extracting features by inputting the plurality of training data sets to a plurality of different feature extraction algorithms; generating a plurality of machine defect discrimination training models by training a plurality of training models implemented with different algorithms using the feature extraction result as training data; verifying the accuracy of the plurality of machine defect discrimination training models using other parts of the original data set; and determining an optimized training model among the plurality of machine defect discrimination training models, and an optimized ratio of the part of the original data set and the fake abnormal data according to each feature extraction algorithm, based on the result of verifying the accuracy of the machine defect discrimination training model. The present invention can prevent mechanical defects in advance.

Description

Method and apparatus for determining the optimal training method {METHOD AND APPARATUS FOR DETERMINING OPTIMIZED TRAINING SCHEME}

본 발명은 기계 결함 판별 학습모델의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 진동 데이터와 적대 신경망(GAN: generative adversarial network)을 이용하여 생성한 모사(fake) 데이터의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적의 학습모델을 결정하는 최적의 훈련 방식 판단 방법 및 장치에 관한 것이다.The present invention is based on the results of verifying the accuracy of the machine defect identification learning model, and the optimal ratio of original vibration data and fake data generated using a generative adversarial network (GAN) for each feature extraction algorithm and , relates to a method and apparatus for determining an optimal training method for determining an optimal learning model among a plurality of machine defect discrimination learning models.

최근 산업기술이 발달함에 따라 회전 기계들이 대형화, 고 정밀화되면서 이들 설비에서 고장이 발생하면 막대한 경제적 손실이나 인명에 피해를 줄 수 있어 이들 회전 기계에 대한 고장 감시가 매우 중요해졌다. With the recent development of industrial technology, as rotary machines become larger and more precise, failures in these facilities can cause enormous economic loss or damage to human life, so fault monitoring of these rotary machines has become very important.

이들 회전하는 구동부가 포함된 기계는 물리적인 마찰조건 하에 구동되므로 잦은 주기로 정비를 수행하여야만 고장 및 사고를 방지할 수 있다. 이와 같이 회전 기계는 잦은 정비를 요구하므로 유지보수 비용 및 시간이 많이 소요되는 문제점이 있다.Since the machine including these rotating driving parts is driven under physical friction conditions, it is necessary to perform maintenance at frequent intervals to prevent breakdowns and accidents. As such, the rotating machine requires frequent maintenance, so there is a problem that maintenance cost and time are required.

이러한 문제점으로 인해 고장 발생 가능성이 있을 때에만 정비를 수행하는 상태 기반 정비가 최근 주목을 받고 있다. 상태 기반 정비는 고장 발생 가능성이 있을 때에만 정비를 수행하므로 유지보수비용 및 시간 감소를 통해 운용비용 감소와 이익증대를 실현하여 준다.Due to these problems, condition-based maintenance, which performs maintenance only when there is a possibility of a failure, has recently attracted attention. Since condition-based maintenance performs maintenance only when there is a possibility of a failure, it reduces operating costs and increases profits by reducing maintenance costs and time.

이에 따라, 한국공개특허공보 제2020-0101507호는 기계학습을 이용한 회전기계 고장 진단 및 예측시스템에 대해 개시하기도 하였다.Accordingly, Korean Patent Application Laid-Open No. 2020-0101507 also disclosed a system for diagnosing and predicting a malfunction of a rotating machine using machine learning.

결함 진단이 적용되어 큰 효과를 기대할 수 있는 분야는 유지보수에 높은 비용이 필요한 시스템 등이 될 수 있으며, 대표적으로 고속철도와 풍력발전용 터빈 등의 기어박스 등을 꼽을 수 있다Areas where fault diagnosis can be applied to have great effects can be systems that require high maintenance costs, and gearboxes such as high-speed railways and wind turbines are representative.

기어박스는 기계에서 회전동력을 전달하기 위해 기본적으로 사용되는 요소이므로 이에 대해 여러 센서 신호를 통해 결함을 감지하고 진단하려는 연구가 활발하게 이루어지고 있다.Since the gearbox is a basic element used to transmit rotational power in a machine, research to detect and diagnose faults through various sensor signals is being actively conducted.

전술한 배경기술은 발명자가 본 발명의 도출을 위해 보유하고 있었거나, 본 발명의 도출 과정에서 습득한 기술 정보로서, 반드시 본 발명의 출원 전에 일반 공중에게 공개된 공지기술이라 할 수는 없다.The above-mentioned background art is technical information that the inventor possessed for the derivation of the present invention or acquired in the process of derivation of the present invention, and cannot necessarily be said to be a known technique disclosed to the general public prior to the filing of the present invention.

국내 공개특허공보 제10-2020-0101507호(2020.08.28)Domestic Patent Publication No. 10-2020-0101507 (2020.08.28)

본 발명의 일 과제는, 기계 결함 판별 학습모델의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 진동 데이터와 적대 신경망(GAN)을 이용하여 생성한 모사(fake) 데이터의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적의 학습모델을 결정하는데 있다.An object of the present invention is, based on the result of verifying the accuracy of the machine defect identification learning model, the optimal ratio of original vibration data and fake data generated using an adversarial neural network (GAN) for each feature extraction algorithm and , is to determine the optimal learning model among a plurality of machine defect discrimination learning models.

본 발명이 해결하고자 하는 과제는 이상에서 언급한 과제에 한정되지 않으며, 언급되지 않은 본 발명의 다른 과제 및 장점들은 하기의 설명에 의해서 이해될 수 있고, 본 발명의 실시 예에 의해보다 분명하게 이해될 것이다. 또한, 본 발명이 해결하고자 하는 과제 및 장점들은 특허 청구 범위에 나타낸 수단 및 그 조합에 의해 실현될 수 있음을 알 수 있을 것이다.The problem to be solved by the present invention is not limited to the above-mentioned problems, and other problems and advantages of the present invention not mentioned can be understood by the following description, and more clearly understood by the embodiments of the present invention will be In addition, it will be understood that the problems and advantages to be solved by the present invention can be realized by means and combinations thereof indicated in the claims.

본 발명의 일 실시 예에 따른 최적의 훈련 방식 판단 방법은, 기계에서 발생하는 진동신호와 관련하여, 레이블링된 정상 데이터 세트 및 레이블링된 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 로딩하는 단계와, 레이블링된 비정상 데이터 세트 중 적어도 일부를 생성적 적대 신경망(GAN: generative adversarial network)에 적용하여 모사(fake) 비정상 데이터 그룹을 생성하는 단계와, 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 변경하며 복수의 훈련 데이터 세트를 생성하는 단계와, 복수의 훈련 데이터 세트를 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출하는 단계와, 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출한 결과를 훈련 데이터로 하여 서로 다른 알고리즘으로 구현된 복수의 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 서로 다른 알고리즘으로 구현된 복수의 기계 결함 판별 학습모델을 생성하는 단계와, 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델의 정확도를 검증하는 단계와, 기계 결함 판별 학습모델의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적 학습모델을 결정하는 단계를 포함할 수 있다.A method for determining an optimal training method according to an embodiment of the present invention comprises the steps of: loading an original data set including a labeled normal data set and a labeled abnormal data set in relation to a vibration signal generated by a machine; generating a fake anomalous data group by applying at least a portion of the acquired anomaly data set to a generative adversarial network (GAN); generating a plurality of training data sets while changing Using the results as training data, a plurality of learning models implemented with different algorithms are trained, and a plurality of machine fault discrimination learning implemented with different algorithms to determine whether the machine is defective using the characteristics of the vibration signal generated by the machine. Creating a model, verifying the accuracy of a plurality of machine defect determination learning models using a second data group from the original data set, and verifying the accuracy of the machine defect determination learning model. The method may include determining an optimal ratio of the first data group and the simulated abnormal data group in the original data set for each extraction algorithm, and an optimal learning model among a plurality of machine defect discrimination learning models.

본 발명의 일 실시 예에 따른 최적의 훈련 방식 판단 장치는, 프로세서 및 프로세서와 동작 가능하게 연결되고 프로세서에서 수행되는 적어도 하나의 코드를 저장하는 메모리를 포함하고, 메모리는 프로세서를 통해 실행될 때, 프로세서가 기계에서 발생하는 진동신호와 관련하여, 레이블링된 정상 데이터 세트 및 레이블링된 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 로딩하고, 레이블링된 비정상 데이터 세트 중 적어도 일부를 생성적 적대 신경망(GAN: generative adversarial network)에 적용하여 모사(fake) 비정상 데이터 그룹을 생성하고, 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 변경하며 복수의 훈련 데이터 세트를 생성하고, 복수의 훈련 데이터 세트를 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출하고, 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출한 결과를 훈련 데이터로 하여 서로 다른 알고리즘으로 구현된 복수의 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 서로 다른 알고리즘으로 구현된 복수의 기계 결함 판별 학습모델을 생성하고, 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델의 정확도를 검증하고, 기계 결함 판별 학습모델의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적 학습모델을 결정하도록 야기하는 코드를 저장할 수 있다.An apparatus for determining an optimal training method according to an embodiment of the present invention includes a processor and a memory operably connected to the processor and storing at least one code executed by the processor, wherein the memory is executed through the processor, the processor With respect to the vibration signal generated by the machine, it loads an original data set including a labeled normal data set and a labeled anomalous data set, and generates at least a portion of the labeled anomalous data set in a generative adversarial network (GAN). network) to create a fake abnormal data group, change the ratio of the first data group and the simulated abnormal data group in the original data set to create a plurality of training data sets, and It is generated in the machine by training a plurality of learning models implemented with different algorithms by inputting them into a plurality of different feature extraction algorithms to extract features, and using the results of inputting features into a plurality of different feature extraction algorithms as training data. Generates a plurality of machine defect determination learning models implemented with different algorithms to determine whether a machine is defective by using the characteristics of the vibration signal, Based on the results of verifying the accuracy of and validating the accuracy of the machine defect discrimination learning model, the optimal ratio of the first data group and the simulated abnormal data group among the original data set for each feature extraction algorithm, and multiple machine defect discrimination learning It is possible to store the code that causes the model to determine the optimal learning model.

이 외에도, 본 발명을 구현하기 위한 다른 방법, 다른 시스템 및 상기 방법을 실행하기 위한 컴퓨터 프로그램이 저장된 컴퓨터로 판독 가능한 기록매체가 더 제공될 수 있다.In addition to this, another method for implementing the present invention, another system, and a computer-readable recording medium storing a computer program for executing the method may be further provided.

전술한 것 외의 다른 측면, 특징, 이점이 이하의 도면, 특허청구범위 및 발명의 상세한 설명으로부터 명확해질 것이다.Other aspects, features and advantages other than those described above will become apparent from the following drawings, claims, and detailed description of the invention.

본 발명에 의하면, 기계 결함 판별 학습모델의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 진동 데이터와 적대 신경망(GAN)을 이용하여 생성한 모사(fake) 데이터의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적의 학습모델을 결정할 수 있다.According to the present invention, based on the result of verifying the accuracy of the machine fault determination learning model, the optimal ratio of original vibration data and fake data generated using an adversarial neural network (GAN) for each feature extraction algorithm, It is possible to determine the optimal learning model among the machine fault discrimination learning models of

또한, 기계 결함을 검출하는 기계 결함 판별 학습모델의 검출 정확도가 개선됨으로써 기계 결함을 미리 방지할 수 있고, 기계 결함 검출 비용을 감소시킬 수 있다.In addition, by improving the detection accuracy of the machine defect discrimination learning model for detecting machine defects, it is possible to prevent machine defects in advance and reduce the cost of detecting machine defects.

또한, 생성적 적대 신경망(GAN: generative adversarial network)를 이용하여 훈련 데이터를 증대시켜 기계 결함 판별 훈련 프로세스의 성능을 향상시킬 수 있다.In addition, it is possible to increase the training data using a generative adversarial network (GAN) to improve the performance of the machine fault determination training process.

본 발명의 효과는 이상에서 언급된 것들에 한정되지 않으며, 언급되지 아니한 다른 효과들은 아래의 기재로부터 당업자에게 명확하게 이해될 수 있을 것이다.Effects of the present invention are not limited to those mentioned above, and other effects not mentioned will be clearly understood by those skilled in the art from the following description.

도 1 및 도 2는 본 실시 예에 따른 최적의 훈련 방식 판단 장치의 구성을 개략적으로 설명하기 위하여 도시한 블록도이다.
도 3은 본 실시 예에 따른 기계에서 출력되는 진동신호의 출력 방향을 설명하는 예시도이다.
도 4는 본 실시 예에 따른 정상 기계 및 비정상 기계에서 출력되는 진동신호를 도시한 예시도이다.
도 5는 도 1 중 제1 생성부의 구성을 개략적으로 설명하기 위하여 도시한 블록도이다.
도 6은 본 실시 예에 따른 정상 기계 및 비정상 기계에서 출력되는 진동신호를 주파수 도메인으로 변환한 결과를 도시한 도면이다.
도 7은 본 실시 예에 따른 주파수 도메인으로 변환한 정상 기계 및 비정상 기계에서 출력되는 진동신호로부터 통계적 특성값을 산출한 결과를 도시한 도면이다.
도 8은 다른 실시 예에 따른 최적의 훈련 방식 판단 장치의 구성을 개략적으로 설명하기 위하여 도시한 블록도이다.
도 9은 본 실시 예에 따른 최적의 훈련 방식 판단 방법을 설명하기 위한 흐름도이다.1 and 2 are block diagrams schematically illustrating the configuration of an apparatus for determining an optimal training method according to the present embodiment.
3 is an exemplary view for explaining an output direction of a vibration signal output from the machine according to the present embodiment.
4 is an exemplary view illustrating vibration signals output from the normal machine and the abnormal machine according to the present embodiment.
FIG. 5 is a block diagram schematically illustrating the configuration of the first generator in FIG. 1 .
6 is a diagram illustrating a result of converting a vibration signal output from a normal machine and an abnormal machine into a frequency domain according to the present embodiment.
7 is a view showing the result of calculating statistical characteristic values from vibration signals output from the normal machine and the abnormal machine converted to the frequency domain according to the present embodiment.
8 is a block diagram schematically illustrating a configuration of an apparatus for determining an optimal training method according to another exemplary embodiment.
9 is a flowchart illustrating a method for determining an optimal training method according to the present embodiment.

본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 설명되는 실시 예들을 참조하면 명확해질 것이다. 그러나 본 발명은 아래에서 제시되는 실시 예들로 한정되는 것이 아니라, 서로 다른 다양한 형태로 구현될 수 있고, 본 발명의 사상 및 기술 범위에 포함되는 모든 변환, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 아래에 제시되는 실시 예들은 본 발명의 개시가 완전하도록 하며, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이다. 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.Advantages and features of the present invention, and a method for achieving them will become apparent with reference to the detailed description in conjunction with the accompanying drawings. However, the present invention is not limited to the embodiments presented below, it can be implemented in a variety of different forms, and should be understood to include all transformations, equivalents, and substitutes included in the spirit and scope of the present invention. . The embodiments presented below are provided so that the disclosure of the present invention is complete, and to completely inform those of ordinary skill in the art to which the present invention pertains to the scope of the invention. In describing the present invention, if it is determined that a detailed description of a related known technology may obscure the gist of the present invention, the detailed description thereof will be omitted.

본 출원에서 사용한 용어는 단지 특정한 실시 예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다. 제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다.The terms used in the present application are only used to describe specific embodiments, and are not intended to limit the present invention. The singular expression includes the plural expression unless the context clearly dictates otherwise. In the present application, terms such as “comprise” or “have” are intended to designate that a feature, number, step, operation, component, part, or combination thereof described in the specification exists, but one or more other features It should be understood that this does not preclude the existence or addition of numbers, steps, operations, components, parts, or combinations thereof. Terms such as first, second, etc. may be used to describe various elements, but the elements should not be limited by the terms. The above terms are used only for the purpose of distinguishing one component from another.

또한, 본 출원에서 "부"는 프로세서 또는 회로와 같은 하드웨어 구성(hardware component), 및/또는 프로세서와 같은 하드웨어 구성에 의해 실행되는 소프트웨어 구성(software component)일 수 있다.Also, in the present application, “unit” may be a hardware component such as a processor or circuit, and/or a software component executed by a hardware component such as a processor.

이하, 본 발명에 따른 실시 예들을 첨부된 도면을 참조하여 상세히 설명하기로 하며, 첨부 도면을 참조하여 설명함에 있어, 동일하거나 대응하는 구성 요소는 동일한 도면번호를 부여하고 이에 대한 중복되는 설명은 생략하기로 한다.Hereinafter, embodiments according to the present invention will be described in detail with reference to the accompanying drawings. decide to do

도 1 및 도 2는 본 실시 예에 따른 최적의 훈련 방식 판단 장치의 구성을 개략적으로 설명하기 위하여 도시한 블록도이다. 도 1 및 도 2를 참조하면, 최적의 훈련 방식 판단 장치(100)는 저장부(110), 로딩부(120), 제1 생성부(130), 제2 생성부(140), 추출부(150), 제3 생성부(160), 검증부(170), 결정부(180) 및 제어부(190)를 포함할 수 있다.1 and 2 are block diagrams schematically illustrating the configuration of an apparatus for determining an optimal training method according to the present embodiment. 1 and 2 , the apparatus 100 for determining the optimal training method includes a storage unit 110 , a loading unit 120 , a first generation unit 130 , a second generation unit 140 , and an extraction unit ( 150 ), a third generation unit 160 , a verification unit 170 , a determination unit 180 , and a control unit 190 may be included.

저장부(110)는 기계(예를 들어, 회전 기계, 미도시)에서 발생하는 진동신호와 관련하여, 레이블링된 정상 데이터 세트 및 레이블링된 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 저장할 수 있다.The storage unit 110 may store an original data set including a labeled normal data set and a labeled abnormal data set in relation to a vibration signal generated by a machine (eg, a rotating machine, not shown).

로딩부(120)는 레이블링된 정상 데이터 세트 및 레이블링된 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 로딩할 수 있다. 로딩부(120)는 정상 기계(예를 들어, 정상 회전 기계, 미도시)에서 발생하는 제1 진동신호를 정상(normal) 상태로 레이블링한 정상 데이터 세트와, 비정상 기계(예를 들어, 비정상 회전 기계, 미도시)에서 발생하는 제2 진동신호를 비정상(broken) 상태로 레이블링한 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 로딩할 수 있다.The loading unit 120 may load the original data set including the labeled normal data set and the labeled abnormal data set. The loading unit 120 includes a normal data set in which a first vibration signal generated in a normal machine (eg, a normal rotating machine, not shown) is labeled as a normal state, and an abnormal machine (eg, abnormal rotation) An original data set including an abnormal data set in which a second vibration signal generated in a machine (not shown) is labeled as a broken state may be loaded.

본 실시 예에서, 로딩한다는 의미는, 정상 기계로부터 레이블링된 정상 데이터 세트를 입력 받고, 비정상 기계로부터 레이블링된 비정상 데이터 세트를 입력 받는 것으로 해석될 수 있다. 또한, 로딩한다는 의미는, 저장부(110)에 저장되어 있는 레이블링된 정상 데이터 세트 및 레이블링된 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 입력 받는 것으로 해석될 수도 있다.In this embodiment, loading may be interpreted as receiving a labeled normal data set from a normal machine and receiving a labeled abnormal data set from an abnormal machine. Also, the meaning of loading may be interpreted as receiving an original data set including the labeled normal data set and the labeled abnormal data set stored in the storage unit 110 .

본 실시 예에서, 정상 기계 또는 비정상 기계에서 발생하는 진동신호는 도 3에 도시된 바와 같이, 수평(x)방향의 진동신호(g_x vibration signal)와, 수직(y)방향의 진동신호(g_y vibration signal)와, 축(z)방향의 진동신호(g_z vibration signal)와, 수평(x)방향, 수직(y)방향 및 축(z)방향을 합친 대각선(T)방향의 진동신호(g_T vibration signal)를 포함할 수 있다.In this embodiment, the vibration signal generated in a normal machine or an abnormal machine is a vibration signal in the horizontal (x) direction (g_x vibration signal) and a vibration signal in the vertical (y) direction (g_y vibration) as shown in FIG. 3 . signal), a vibration signal in the axis (z) direction (g_z vibration signal), and a vibration signal in the diagonal (T) direction (g_T vibration signal) in the horizontal (x) direction, vertical (y) direction, and axis (z) direction. ) may be included.

도 4는 본 실시 예에 따른 정상 기계 및 비정상(고장) 기계에서 출력되는 진동신호를 도시한 예시도이다. 도 4를 참조하면, 도 4a는 정상(normal) 기계에서 발생한 수평(x)방향의 진동신호(g_x vibration signal)를 도시하고 있고, 도 4b는 비정상(고장, broken) 기계에서 발생한 수평(x)방향의 진동신호(g_x vibration signal)를 도시하고 있다. 특히 도 4b는 톱니(tooth)가 부러졌을 때 회전 기계에서 발생한 수평(x)방향의 진동신호(g_x vibration signal)를 도시하고 있다.4 is an exemplary diagram illustrating vibration signals output from a normal machine and an abnormal (failure) machine according to the present embodiment. Referring to FIG. 4, FIG. 4A shows a vibration signal (g_x) in the horizontal (x) direction generated in a normal machine, and FIG. 4B is a horizontal (x) generated in an abnormal (broken) machine. The vibration signal (g_x vibration signal) of the direction is shown. In particular, Figure 4b shows a vibration signal (g_x vibration signal) in the horizontal (x) direction generated in the rotating machine when the tooth (tooth) is broken.

제1 생성부(130)는 레이블링된 비정상 데이터 세트 중 적어도 일부를 생성적 적대 신경망(GAN: generative adversarial network)에 적용하여 모사(fake) 비정상 데이터 그룹을 생성할 수 있다.The first generator 130 may generate a fake abnormal data group by applying at least a portion of the labeled abnormal data set to a generative adversarial network (GAN).

여기서, 생성적 적대 신경망이란, 비지도 학습에 사용되는 인공지능 알고리즘으로, 생성기(generator)와 식별기(discriminator)로 구성되어 있어, 생성기가 모사(fake) 데이터를 생성하면, 식별기에서 실제(real) 데이터를 기초로 모사 데이터가 실제인지 모조인지 여부를 확률적으로 검토하는 과정을 반복 학습함으로써, 결국 생성기가 실제 데이터와 거의 유사한 모사 데이터를 생성하도록 구성하는 알고리즘을 의미할 수 있다.Here, the generative adversarial neural network is an artificial intelligence algorithm used for unsupervised learning, and is composed of a generator and a discriminator. By iteratively learning the process of probabilistically examining whether the simulated data is real or fake based on the data, it may mean an algorithm in which the generator is configured to generate simulated data that is almost similar to the real data.

도 5는 도 1 중 제1 생성부(130)의 구성을 개략적으로 설명하기 위하여 도시한 블록도이다. 도 6을 참조하면, 제1 생성부(130)는 생성기(131) 및 식별기(132)를 포함할 수 있다.FIG. 5 is a block diagram schematically illustrating the configuration of the first generator 130 of FIG. 1 . Referring to FIG. 6 , the first generator 130 may include a generator 131 and an identifier 132 .

생성기(131)는 소정의 입력값으로서 모사 비정상 데이터를 생성할 수 있다. 여기서, 입력값은 랜덤 값(random value)일 수 있다. 일 실시 예로 랜덤 값은 가우시안 분포를 이용하여 생성된 랜덤 노이즈일 수 있다.The generator 131 may generate simulated abnormal data as a predetermined input value. Here, the input value may be a random value. In an embodiment, the random value may be random noise generated using a Gaussian distribution.

식별기(132)는 생성기(131)가 생성한 모사 비정상 데이터 및 레이블링된 비정상 데이터 세트 중 적어도 일부의 비교를 통해 모사 비정상 데이터가 참인지 거짓인지 식별할 수 있다.The identifier 132 may identify whether the simulated anomalous data is true or false by comparing at least a portion of the simulated abnormal data generated by the generator 131 and the labeled abnormal data set.

본 실시 예에서 식별기(132)는 생성기(131)에 의해 생성된 훈련 데이터 즉, 모사 비정상 데이터를 사용하여 훈련되는 신경망일 수 있다. 또한 생성기(131)는 식별기(132)에 의해 거짓으로 판별된 모사 비정상 데이터를 훈련 데이터로 사용하여 훈련되는 신경망으로, 실제 비정상 데이터에 가까운 모사 비정상 데이터를 생성할 수 있다. 이로부터 생성적 적대 신경망은 두 신경망의 경쟁을 통하여 학습하고 결과물을 도출해 내는 알고리즘이라 할 수 있다.In this embodiment, the identifier 132 may be a neural network trained using the training data generated by the generator 131, that is, simulated abnormal data. In addition, the generator 131 is a neural network trained using the simulated abnormal data determined to be false by the identifier 132 as training data, and may generate simulated abnormal data close to the actual abnormal data. From this, the generative adversarial neural network can be said to be an algorithm that learns through competition between two neural networks and derives results.

생성기(131) 및 식별기(132)가 포함된 신경망은 통상적으로 밀집 처리(dense processing), 배치 정규화 처리(batch normalization processing), 활성화 처리(activation processing), 입력 변형 처리(input reshaping processing), 가우시안 드롭아웃 처리(gaussian dropout processing), 가우시안 노이즈 처리(gaussian noise processing), 2차원 합성곱(two-dimensional convolution), 및 2차원 업 샘플링(two-dimensional up sampling) 같은, 복수의 처리 레이어로 이루어지는 다수-레이어 네트워크에 의해 구현될 수 있다.A neural network including a generator 131 and an identifier 132 typically performs dense processing, batch normalization processing, activation processing, input reshaping processing, and Gaussian drop processing. Multi-layers consisting of multiple processing layers, such as out processing (gaussian dropout processing), Gaussian noise processing (gaussian noise processing), two-dimensional convolution, and two-dimensional up sampling (two-dimensional up sampling) It may be implemented by a layer network.

도 1로 돌아와서, 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 변경하며 복수의 훈련 데이터 세트를 생성할 수 있다. Returning to FIG. 1 , the second generator 140 may generate a plurality of training data sets by changing the ratio of the first data group and the simulated abnormal data group among the original data sets.

본 실시 예에서 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹을 100%로 하고 모사 비정상 데이터 그룹을 0%로 하는 제1 비율의 제1 훈련 데이터 세트를 생성할 수 있다. 본 실시 예에서 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹을 80%로 하고 모사 비정상 데이터 그룹을 20%로 하는 제2 비율의 제2 훈련 데이터 세트를 생성할 수 있다. 본 실시 예에서 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹을 60%로 하고 모사 비정상 데이터 그룹을 40%로 하는 제3 비율의 제3 훈련 데이터 세트를 생성할 수 있다. 본 실시 예에서 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹을 40%로 하고 모사 비정상 데이터 그룹을 60%로 하는 제4 비율의 제4 훈련 데이터 세트를 생성할 수 있다. 본 실시 예에서 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹을 20%로 하고 모사 비정상 데이터 그룹을 80%로 하는 제5 비율의 제5 훈련 데이터 세트를 생성할 수 있다. 본 실시 예에서 제2 생성부(140)는 오리지널 데이터 세트 중 제1 데이터 그룹을 0%로 하고 모사 비정상 데이터 그룹을 100%로 하는 제6 비율의 제6 훈련 데이터 세트를 생성할 수 있다.In this embodiment, the second generator 140 may generate a first training data set of a first ratio in which the first data group of the original data set is 100% and the simulated abnormal data group is 0%. In this embodiment, the second generator 140 may generate a second training data set having a second ratio in which the first data group among the original data sets is 80% and the simulated abnormal data group is 20%. In the present embodiment, the second generator 140 may generate a third training data set having a third ratio in which the first data group among the original data sets is 60% and the simulated abnormal data group is 40%. In the present embodiment, the second generator 140 may generate a fourth training data set having a fourth ratio in which the first data group among the original data sets is 40% and the simulated abnormal data group is 60%. In this embodiment, the second generator 140 may generate a fifth training data set with a fifth ratio in which the first data group among the original data sets is 20% and the simulated abnormal data group is 80%. In this embodiment, the second generator 140 may generate a sixth training data set with a sixth ratio in which the first data group among the original data sets is 0% and the simulated abnormal data group is 100%.

본 실시 예에서, 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 제1 비율 내지 제6 비율 6개로 한정하고 있으나, 이에 국한되지 않고 6개 이상 또는 6개 미만 등 다양한 실시예의 적용이 가능할 수 있다.In the present embodiment, the ratio of the first data group and the simulated abnormal data group in the original data set is limited to six first to sixth ratios, but the application of various embodiments such as six or more or less than six is not limited thereto. This may be possible.

추출부(150)는 복수의 훈련 데이터 세트를 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출할 수 있다. 본 실시 예에서 복수의 훈련 데이터 세트는 제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트를 포함할 수 있다.The extractor 150 may extract features by inputting a plurality of training data sets into a plurality of different feature extraction algorithms. In this embodiment, the plurality of training data sets may include a first training data set to a sixth training data set.

추출부(150)는 복수의 훈련 데이터 세트(제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트)에 제1 특징 추출 알고리즘을 적용하여 생성한 주파수도메인의 데이터 그룹을 제1 특징으로 추출할 수 있다. The extractor 150 may extract, as a first feature, a data group of a frequency domain generated by applying a first feature extraction algorithm to a plurality of training data sets (a first training data set to a sixth training data set).

여기서 제1 특징 추출 알고리즘은, 시간도메인의 진동신호를 주파수도메인의 진동신호로 변환시키는 FFT(fast fourier transform) 알고리즘을 포함할 수 있다.Here, the first feature extraction algorithm may include a fast Fourier transform (FFT) algorithm for converting a time domain vibration signal into a frequency domain vibration signal.

주파수도메인의 진동신호에는 기계의 정상 및 고장을 구별하는데 사용할 수 있는 보다 고유한 정보들이 포함되어 있기 때문에, 제1 특징 추출 알고리즘인 FFT 알고리즘을 이용하여 시간 도메인의 진동신호를 주파수 도메인의 진동신호로 변환할 수 있다.Since the vibration signal in the frequency domain contains more unique information that can be used to distinguish normal and failure of the machine, the first feature extraction algorithm, FFT algorithm, is used to convert the time domain vibration signal into a frequency domain vibration signal. can be converted

도 6는 본 실시 예에 따른 정상 기계 및 비정상 기계에서 출력되는 진동신호를 주파수 도메인으로 변환한 결과를 도시한 도면이다. 도 6을 참조하면, 도 4a는 정상 기계에서 발생한 시간도메인의 수평(x)방향 진동신호(g_x vibration signal)를 제1 특징 추출 알고리즘에 적용하여 주파수도메인의 수평(x)방향 진동신호(g_x FFT signal)를 추출한 예를 도시하고 있다. 도 4b는 비정상 기계에서 발생한 시간도메인의 수평(x)방향 진동신호(g_x vibration signal)를 제1 특징 추출 알고리즘에 적용하여 주파수도메인의 수평(x)방향 진동신호(g_x FFT signal)를 추출한 예를 도시하고 있다.6 is a diagram illustrating a result of converting a vibration signal output from a normal machine and an abnormal machine into a frequency domain according to the present embodiment. Referring to FIG. 6 , FIG. 4A shows a horizontal (x) direction vibration signal (g_x FFT) in the frequency domain by applying a time domain horizontal (x) vibration signal (g_x vibration signal) generated in a normal machine to the first feature extraction algorithm. An example of extracting signal) is shown. Figure 4b is an example of extracting the horizontal (x) direction vibration signal (g_x FFT signal) of the frequency domain by applying the horizontal (x) direction vibration signal (g_x vibration signal) of the time domain generated in the abnormal machine to the first feature extraction algorithm is showing

본 실시 예에서 추출부(150)는 제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트를 제1 특징 추출 알고리즘에 적용하여 6세트의 제1 특징을 추출할 수 있다.In this embodiment, the extractor 150 may extract six sets of first features by applying the first to sixth training data sets to the first feature extraction algorithm.

또한, 추출부(150)는 주파수도메인의 데이터 그룹(즉, FFT 변환된 제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트)에 제2 특징 추출 알고리즘을 적용하여 산출한 통계적 특성값(statistical analysis)을 제2 특징으로 추출할 수 있다.In addition, the extraction unit 150 is a statistical analysis calculated by applying the second feature extraction algorithm to the data group of the frequency domain (that is, the FFT-transformed first training data set to the sixth training data set) It can be extracted as the second feature.

여기서 제2 특징 추출 알고리즘은, 주파수도메인으로 변환된 수평방향의 진동신호, 수직방향의 진동신호, 깊이방향의 진동신호 및 대각선방향의 진동신호 각각의 평균값(mean), 중간(median)값, 최소(min)값, 최대(max)값, 첨도(kurtosis)값, 왜도(skewness)값, 표준편차(standard deviation)값과, 양진폭(peak-to-peak)값 중 하나 이상을 산출하는 알고리즘을 포함할 수 있다.Here, the second feature extraction algorithm is the average value, median value, minimum Algorithm to calculate at least one of (min) value, maximum (max) value, kurtosis value, skewness value, standard deviation value, and peak-to-peak value may include

여기서, 첨도는 확률분포의 모양이 뾰족한 정도를 나타내는 지표로 신호의 값들의 분포가 특정 값 근처에 몰려 뾰족한 형태를 이룰수록 첨도가 증가할 수 있다. 또한, 왜도는 확률 분포의 비대칭성을 나타내는 지표로 신호의 편중성(신호의 평균을 기준으로 신호 값들의 분포가 한쪽으로 몰리는 정도)이 증가할수록 왜도 또한 증가할 수 있다. 양진폭(peak-to-peak)은 신호의 전체 폭을 나타내는 지표로 신호에서 가장 작은 값과 가장 큰 값의 차이를 나타낼 수 있다.Here, the kurtosis is an index indicating the degree of sharpness of the probability distribution, and the kurtosis may increase as the distribution of signal values gathers near a specific value to form a sharp shape. In addition, skewness is an index indicating the asymmetry of the probability distribution, and as the signal bias (the degree to which the distribution of signal values converges to one side with respect to the average of the signal) increases, the skewness may also increase. The positive amplitude (peak-to-peak) is an index indicating the overall width of the signal and may indicate the difference between the smallest value and the largest value in the signal.

도 7은 본 실시 예에 따른 주파수 도메인으로 변환한 정상 기계 및 비정상 기계에서 출력되는 진동신호로부터 통계적 특성값을 산출한 결과를 도시한 도면이다. 도 7을 참조하면, 도 7a는 주파수도메인의 수평(x)방향 진동신호(g_x FFT signal)로부터 산출한 통계적 특성의 분포를 도시하고 있다. 도 7b는 주파수도메인의 수직(y)방향 진동신호(g_y FFT signal)로부터 산출한 통계적 특성의 분포를 도시하고 있다. 도 7c는 주파수도메인의 축(z)방향 진동신호(g_z FFT signal)로부터 산출한 통계적 특성의 분포를 도시하고 있다. 도 7d는 주파수도메인의 대각선(T)방향 진동신호(g_T FFT signal)로부터 산출한 통계적 특성의 분포를 도시하고 있다.7 is a view showing the result of calculating statistical characteristic values from vibration signals output from the normal machine and the abnormal machine converted to the frequency domain according to the present embodiment. Referring to FIG. 7 , FIG. 7A shows the distribution of statistical characteristics calculated from the horizontal (x) direction vibration signal (g_x FFT signal) in the frequency domain. 7B shows the distribution of statistical characteristics calculated from the vertical (y) direction vibration signal (g_y FFT signal) in the frequency domain. 7C shows the distribution of statistical characteristics calculated from a vibration signal (g_z FFT signal) in the axis (z) direction of the frequency domain. 7D shows the distribution of statistical characteristics calculated from a vibration signal (g_T FFT signal) in the diagonal (T) direction of the frequency domain.

본 실시 예에서 추출부(150)는 주파수도메인으로 변환된 제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트를 제2 특징 추출 알고리즘에 적용하여 6세트의 제2 특징을 추출할 수 있다. 여기서 제2 특징 추출 알고리즘이 출력하는 제2 특징은 최소 1개(평균값, 중간값, 최소값, 최대값, 첨도값, 왜도값, 표준편차값 및 양진폭값 중 하나) 내지 최대 8개(평균값, 중간값, 최소값, 최대값, 첨도값, 왜도값, 표준편차값 및 양진폭값 모두)를 포함할 수 있으며, 훈련 데이터 세트 별로 최소 6세트 내지 최대 48세트의 제2 특징을 추출할 수 있다.In this embodiment, the extractor 150 may extract six sets of second features by applying the first to sixth training data sets converted to the frequency domain to the second feature extraction algorithm. Here, the second feature output by the second feature extraction algorithm is at least 1 (one of the mean value, median value, minimum value, maximum value, kurtosis value, skewness value, standard deviation value, and positive amplitude value) to a maximum of 8 (average value). , median, minimum, maximum, kurtosis, skewness, standard deviation, and positive amplitude), and at least 6 to 48 sets of second features can be extracted for each training data set. there is.

추출부(150)는 복수의 훈련 데이터 세트(제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트)에 제3 특징 추출 알고리즘을 적용하여 생성한 차원이 축소된 데이터 그룹을 제3 특징으로 추출할 수 있다.The extraction unit 150 may extract a reduced-dimensional data group generated by applying a third feature extraction algorithm to a plurality of training data sets (the first training data set to the sixth training data set) as the third feature. .

여기서 제3 특징 추출 알고리즘은, 차원이 축소된 데이터를 생성하는 주성분 분석(PCA: principal component analysis) 알고리즘을 포함할 수 있다. Here, the third feature extraction algorithm may include a principal component analysis (PCA) algorithm that generates data with reduced dimensions.

주성분 분석은 다변량 데이터를 분석하기 위해 고차원의 데이터를 저차원의 데이터로 환원시키는 기법들 중 대표적인 알고리즘일 수 있다. 서로 연관 가능성이 있는 고차원 공간의 표본들을 선형 연관성이 없는 저차원 공간(주성분)의 표본으로 변환하기 위해 직교 변환을 사용할 수 있다. 이 알고리즘은 데이터의 차원을 줄여 주요 정보를 추출하고 요약하여, 자료의 변동성을 가장 많이 표현하는 처음 몇 개의 주성분을 선택하여 정보의 손실을 최소화할 수 있다. 따라서 주성분의 차원 수는 원래 표본의 차원 수보다 작거나 같을 수 있다. 특히, 주성분 분석의 특징은 데이터들의 분산을 최대한 보존하면서 서로 직교하는 새로운 기저(basis)들을 찾아, 고차원 공간의 표본들을 선형 연관성이 없는 저차원 공간으로 변환하는 것을 포함할 수 있다. 따라서, 주성분 분석은 데이터를 한 개의 축으로 사상시켰을 때, 그 분산이 가장 커지는 축을 첫 번째 주성분, 두 번째로 커지는 축을 두 번째 주성분으로 놓이도록 새로운 좌표계로 데이터를 선형 변환할 수 있다. 이와 같이 표본의 차이를 가장 잘 나타내는 성분들로 분해함으로써 여러 가지로 응용할 수 있다.Principal component analysis may be a representative algorithm among techniques for reducing high-dimensional data to low-dimensional data in order to analyze multivariate data. An orthogonal transformation can be used to transform samples in a high-dimensional space that are likely to be related to each other into samples in a low-dimensional space (principal components) that do not have linear correlation. This algorithm extracts and summarizes key information by reducing the dimension of the data, and minimizes the loss of information by selecting the first few principal components that express the most variability in the data. Therefore, the number of dimensions of the principal component can be less than or equal to the number of dimensions of the original sample. In particular, the characteristic of principal component analysis may include transforming samples in a high-dimensional space into a low-dimensional space without linear correlation by finding new basis orthogonal to each other while maximally preserving the variance of the data. Therefore, in principal component analysis, when data is mapped to one axis, the data can be linearly transformed into a new coordinate system so that the axis with the largest variance is the first principal component and the second axis is the second principal component. It can be applied in various ways by decomposing the sample difference into the components that best represent the difference.

본 실시 예에서 추출부(150)는 제1 훈련 데이터 세트 내지 제6 훈련 데이터 세트를 제3 특징 추출 알고리즘에 적용하여 6세트의 제3 특징을 추출할 수 있다.In this embodiment, the extractor 150 may extract six sets of third features by applying the first to sixth training data sets to the third feature extraction algorithm.

이로써 추출부(150)는 6세트의 제1 특징과, 최소 6세트 내지 최대 48세트의 제2 특징과, 6세트의 제3 특징을 합친 토탈(total) 최소 18세트 내지 최대 60세트의 서로 다른 특징을 추출할 수 있다.As a result, the extraction unit 150 generates a total of 6 sets of first features, a minimum of 6 sets to a maximum of 48 sets of second features, and a total of at least 18 sets to a maximum of 60 sets of different 3 sets of 6 sets. features can be extracted.

본 실시 예에서, 특징 추출 알고리즘을 3개로 한정하고 있으나, 이에 국한되지 않고 3개 이상 또는 3개 미만 등 다양한 실시예의 적용이 가능할 수 있다.In the present embodiment, the feature extraction algorithm is limited to three, but it is not limited thereto, and various embodiments such as three or more or less than three may be applied.

제3 생성부(160)는 추출부(150)에서 추출한 서로 다른 복수의 특징을 훈련 데이터로 하여 서로 다른 알고리즘으로 구현된 복수의 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 서로 다른 알고리즘으로 구현된 복수의 기계 결함 판별 학습모델을 생성할 수 있다.The third generation unit 160 trains a plurality of learning models implemented with different algorithms using a plurality of different features extracted by the extraction unit 150 as training data, and uses the characteristics of the vibration signal generated by the machine. It is possible to create a plurality of machine defect determination learning models implemented with different algorithms for determining whether a machine is defective.

제3 생성부(160)는 제1 특징 내지 제3 특징을 각각 훈련 데이터로 하여 제1 알고리즘으로 구현된 제1 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 복수의 제1 기계 결함 판별 학습모델을 생성할 수 있다. 여기서, 제1 학습모델을 훈련시키는 훈련 데이터는 토탈 최소 18세트 내지 최대 60세트의 훈련 데이터를 포함할 수 있고, 각 세트의 훈련 데이터로 제1 학습모델을 훈련시킬 수 있다. 또한, 복수의 제1 기계 결함 판별 학습모델은 각 세트 별로 토탈 최소 18개 내지 최대 60개의 제1 기계 결함 판별 학습모델을 포함할 수 있다.The third generator 160 trains the first learning model implemented with the first algorithm using the first to third features as training data, respectively, and uses the features of the vibration signal generated by the machine to determine whether the machine is defective. It is possible to generate a plurality of first machine defect determination learning models to determine. Here, the training data for training the first learning model may include a total of a minimum of 18 sets to a maximum of 60 sets of training data, and the first learning model may be trained with the training data of each set. In addition, the plurality of first machine defect determination learning models may include a total of at least 18 to a maximum of 60 first machine defect determination learning models for each set.

본 실시 예에서, 제1 학습모델은 인공신경망(ANN: artificial neural network)을 포함할 수 있다. 인공신경망은 기계학습과 생물학적 뇌 구조에 아이디어를 얻은 알고리즘으로, 시냅스들의 결합으로 네트워크를 형성한 뉴런들이 데이터 학습을 통해 시냅스의 결합 세기를 변화시켜 문제해결 능력을 갖추는 방법이라 할 수 있이다. 신경망은 모델링 도구로 구성된 통계 데이터들의 비선형 구조를 포함할 수 있다.In this embodiment, the first learning model may include an artificial neural network (ANN). An artificial neural network is an algorithm inspired by machine learning and biological brain structure, and it is a method for neurons that form a network by combining synapses to change the strength of synapses through data learning and to have problem-solving ability. The neural network may include a non-linear structure of statistical data composed of a modeling tool.

제3 생성부(160)는 제1 특징 내지 제3 특징을 각각 훈련 데이터로 하여 제2 알고리즘으로 구현된 제2 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 복수의 제2 기계 결함 판별 학습모델을 생성할 수 있다. 여기서, 제2 학습모델을 훈련시키는 훈련 데이터는 토탈 최소 18세트 내지 최대 60세트의 훈련 데이터를 포함할 수 있고, 각 세트의 훈련 데이터로 제2 학습모델을 훈련시킬 수 있다. 또한, 복수의 제2 기계 결함 판별 학습모델은 각 세트 별로 토탈 최소 18개 내지 최대 60개의 제2 기계 결함 판별 학습모델을 포함할 수 있다.The third generator 160 trains the second learning model implemented with the second algorithm using the first to third features as training data, respectively, and uses the features of the vibration signal generated by the machine to determine whether the machine is defective. It is possible to generate a plurality of second machine defect determination learning models to determine. Here, the training data for training the second learning model may include a total of minimum 18 sets to a maximum of 60 sets of training data, and the second learning model may be trained with each set of training data. In addition, the plurality of second machine defect determination learning models may include a total of at least 18 to a maximum of 60 second machine defect determination learning models for each set.

본 실시 예에서, 제2 학습모델은, K-평균 클러스터링(K-means clustering)을 포함할 수 있다. K-평균 클러스터링은 주어진 데이터를 k개의 클러스터로 묶는 알고리즘으로, 각 클러스터와 거리 차이의 분산을 최소화하는 방식으로 동작할 수 있다. 이 알고리즘은 자율 학습의 일종으로, 레이블이 달려 있지 않은 입력 데이터에 레이블을 달아주는 역할을 수행할 수 있다.In this embodiment, the second learning model may include K-means clustering. K-means clustering is an algorithm that groups given data into k clusters, and can operate in a way that minimizes the variance of each cluster and distance difference. This algorithm is a kind of self-learning, and can play the role of labeling unlabeled input data.

제3 생성부(160)는 제1 특징 내지 제3 특징을 각각 훈련 데이터로 하여 제3 알고리즘으로 구현된 제3 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 복수의 제3 기계 결함 판별 학습모델을 생성할 수 있다. 여기서, 제3 학습모델을 훈련시키는 훈련 데이터는 토탈 최소 18세트 내지 최대 60세트의 훈련 데이터를 포함할 수 있고, 각 세트의 훈련 데이터로 제3 학습모델을 훈련시킬 수 있다. 또한, 복수의 제3 기계 결함 판별 학습모델은 각 세트 별로 토탈 최소 18개 내지 최대 60개의 제3 기계 결함 판별 학습모델을 포함할 수 있다.The third generator 160 trains the third learning model implemented with the third algorithm using the first to third features as training data, respectively, and uses the features of the vibration signal generated by the machine to determine whether the machine is defective. It is possible to generate a plurality of third machine defect determination learning models to determine. Here, the training data for training the third learning model may include a total of minimum 18 sets to a maximum of 60 sets of training data, and the third learning model may be trained with each set of training data. In addition, the plurality of third machine defect determination learning models may include a total of at least 18 to a maximum of 60 third machine defect determination learning models for each set.

본 실시 예에서, 제3 학습모델은, 서포트 벡터 머신(SVM: support vector machine)을 포함할 수 있다. 서포트 벡터 머신은 입력 데이터를 고차원 공간으로 변화시켜 고차원 공간에서 선형으로 분리할 수 있는 최적의 경계(decision boundary)를 찾는 알고리즘이라 할 수 있다. 여기서 최적의 경계는 데이터의 군집(class)을 예측해 분류하는 역할을 할 수 있다. 이때, 서로 다른 데이터를 가장 마진으로 분류해야 어떤 데이터가 들어와도 잘 분류할 수 있다.In this embodiment, the third learning model may include a support vector machine (SVM). A support vector machine is an algorithm that transforms input data into a high-dimensional space and finds an optimal decision boundary that can be linearly separated in a high-dimensional space. Here, the optimal boundary can play a role in predicting and classifying data classes. At this time, different data must be classified with the most margin so that any data can be classified well.

본 실시 예에서, 학습모델의 개수를 3개로 한정하고 있으나, 이에 국한되지 않고 3개 이상 또는 3개 미만 등 다양한 실시예의 적용이 가능할 수 있다.In this embodiment, the number of learning models is limited to three, but the number of learning models is not limited thereto, and various embodiments such as three or more or less than three may be applied.

제3 생성부(160)는 제1 기계 결함 판별 학습모델로부터 토탈 최소 18개 내지 최대 60개의 기계 결함 여부에 대한 제1 분류값을 출력할 수 있고, 제2 기계 결함 판별 학습모델로부터 토탈 최소 18개 내지 최대 60개의 기계 결함 여부에 대한 제2 분류값을 출력할 수 있고, 제3 기계 결함 판별 학습모델로부터 토탈 최소 18개 내지 최대 60개의 기계 결함 여부에 대한 제3 분류값을 출력할 수 있다. 이로써 제3 생성부(160)는 토탈 최소 54개 내지 최대 180개의 기계 결함 여부에 대한 분류값을 출력할 수 있다.The third generation unit 160 may output a first classification value for whether a total of 18 to 60 machine defects in total from the first machine defect determination learning model, and a total minimum of 18 from the second machine defect determination learning model. It is possible to output a second classification value for whether or not there are up to 60 machine defects, and output a third classification value for a total of at least 18 to up to 60 machine defects from the third machine defect determination learning model. . Accordingly, the third generation unit 160 may output a classification value for whether a total of at least 54 to a maximum of 180 machine defects.

검증부(170)는 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델의 정확도를 검증할 수 있다.The verification unit 170 may verify the accuracy of the plurality of machine defect determination learning models by using the second data group among the original data sets.

여기서, 오리지널 데이터 세트 중 제1 데이터 그룹은, 100%의 오리지널 데이터 세트 중 70%의 오리지널 데이터 세트를 포함할 수 있다. 또한, 오리지널 데이터 세트 중 제2 데이터 그룹은, 100%의 오리지널 데이터 세트 중 30%의 오리지널 데이터 세트를 포함할 수 있으며, 검증을 위한 테스트 데이터일 수 있다.Here, the first data group among the original data sets may include 70% of the original data sets among 100% of the original data sets. In addition, the second data group of the original data set may include 30% of the original data set among 100% of the original data set, and may be test data for verification.

본 실시 예에서 정확도를 검증한다 함은, 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델 각각으로부터 출력되는 분류값과, 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델 각각으로부터 출력되는 분류값을 비교하여 일치하는 정도를 분석하는 것을 포함할 수 있다.In this embodiment, verifying the accuracy means using the first data group and the simulated abnormal data group in the original data set, the classification value output from each of a plurality of machine defect determination learning models, and the second data group in the original data set. It may include analyzing the degree of agreement by comparing the classification values output from each of a plurality of machine defect identification learning models using .

검증부(170)는 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 제1 기계 결함 판별 학습모델에 대한 제1 정확도를 검증할 수 있다. 검증부(170)는 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 제2 기계 결함 판별 학습모델에 대한 제2 정확도를 검증할 수 있다. 검증부(170)는 오리지널 데이터 세트 중 제3 데이터 그룹을 이용하여 복수의 제3 기계 결함 판별 학습모델에 대한 제3 정확도를 검증할 수 있다.The verification unit 170 may verify the first accuracy of the plurality of first machine defect determination learning models by using the second data group among the original data sets. The verification unit 170 may verify the second accuracy of the plurality of second machine defect determination learning models by using the second data group among the original data sets. The verification unit 170 may verify the third accuracy of the plurality of third machine defect determination learning models by using the third data group among the original data sets.

결정부(180)는 기계 결함 판별 학습모델들의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적 학습모델을 결정할 수 있다.The determiner 180 determines the optimal ratio of the first data group and the simulated abnormal data group in the original data set for each feature extraction algorithm, based on the result of verifying the accuracy of the machine defect determination learning models, and the plurality of machine defect determination learning models Among the models, the optimal learning model can be determined.

결정부(180)는 기계 결함 판별 학습모델들의 정확도를 검증한 결과로부터 미리 정해진 임계 정확도 이상을 도출하게 하는 훈련 데이터 세트 중 제1 데이터 그룹의 비율이 가장 낮은 훈련 데이터 세트에서의 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 최적 비율로 결정할 수 있다.The determination unit 180 includes a first data group in a training data set having the lowest ratio of a first data group among a training data set that derives more than a predetermined threshold accuracy from a result of verifying the accuracy of the machine fault determination learning models; The proportion of simulated anomalous data groups can be determined as an optimal proportion.

결정부(180)는 제1 기계 결함 판별 학습모델 내지 제3 기계 결함 판별 학습모델 중 가장 높은 정확도를 갖는 어느 한 기계 결함 판별 학습모델을 최적 학습모델로 결정할 수 있다.The determiner 180 may determine any one machine defect determination learning model having the highest accuracy among the first machine defect determination learning model to the third machine defect determination learning model as the optimal learning model.

결정부(180)는 제1 특징 추출 알고리즘 내지 제3 특징 추출 알고리즘 중 가장 높은 정확도를 갖는 기계 결함 판별 학습모델에 입력하는 훈련 데이터를 생성한 어느 한 특징 추출 알고리즘을 최적 특징 추출 알고리즘으로 결정할 수 있다.The determiner 180 may determine, as an optimal feature extraction algorithm, any one of the first feature extraction algorithms to the third feature extraction algorithms that generates training data input to the machine defect identification learning model having the highest accuracy. .

기계의 종류마다 효과 있는 특징 추출 알고리즘, 인공지능 모델이 다르고, 오리지널 데이터와 모사 비정상 데이터 그사이의 비율이 다를 수 있으므로, 기계마다 본 발명의 알고리즘을 통해 최적 방식 즉, 데이터 비율, 기계 결함 판별 학습모델 및 특징 추출 알고리즘을 선택(결정)할 수 있다.Effective feature extraction algorithms and artificial intelligence models are different for each type of machine, and the ratio between original data and simulated abnormal data may be different. and a feature extraction algorithm may be selected (determined).

여기서 선택은, 상술한 제1 비율 내지 제6 비율 중 어느 한 비율을 포함할 수 있고, 최소 54개 내지 최대 180개의 분류값을 출력하는 복수의 기계 결함 판별 학습모델 중 어느 한 기계 결함 판별 학습모델을 포함할 수 있고, 최소 18개 내지 최대 60개의 특징을 추출하는 복수의 특징 추출 알고리즘 중 어느 한 알고리즘을 선택할 수 있다.Here, the selection may include any one of the above-described first to sixth ratios, and any one of a plurality of machine defect determination learning models outputting a minimum of 54 to a maximum of 180 classification values. may include, and any one algorithm among a plurality of feature extraction algorithms for extracting a minimum of 18 to a maximum of 60 features may be selected.

오리지널 데이터와 모사 데이터의 비율에 있어서는 모사 데이터를 넣어 훈련 데이터의 사이즈를 커지게 하는 것과 모사 데이터의 추가로 인하여 훈련 데이터의 정확도가 떨어지는 것이 trade off 관계일 수 있다.In the ratio of the original data to the simulated data, increasing the size of the training data by adding the simulated data and decreasing the accuracy of the training data due to the addition of the simulated data may be a trade-off relationship.

본 발명의 최적의 훈련 방식 판단에 따르면, 훈련 데이터의 사이즈를 커지게 하면서도 훈련 데이터의 정확도를 떨어뜨리지 않는(혹은 일정 이상 떨어뜨리지 않는) 최대 모사 데이터 비율을 찾을 수 있다.According to the determination of the optimal training method of the present invention, it is possible to find the maximum simulated data ratio that does not decrease the accuracy of the training data while increasing the size of the training data (or does not drop more than a certain amount).

제어부(190) 최적의 훈련 방식 판단 장치(100) 전체의 동작을 제어할 수 있다. 제어부(190)는 프로세서(processor)와 같이 데이터를 처리할 수 있는 모든 종류의 장치를 포함할 수 있다. 여기서, '프로세서(processor)'는, 예를 들어 프로그램 내에 포함된 코드 또는 명령어로 표현된 기능을 수행하기 위해 물리적으로 구조화된 회로를 갖는, 하드웨어에 내장된 데이터 처리 장치를 의미할 수 있다. 이와 같이 하드웨어에 내장된 데이터 처리 장치의 일 예로써, 마이크로프로세서(microprocessor), 중앙처리장치(central processing unit: CPU), 프로세서 코어(processor core), 멀티프로세서(multiprocessor), ASIC(application-specific integrated circuit), FPGA(field programmable gate array) 등의 처리 장치를 망라할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다.The controller 190 may control the overall operation of the apparatus 100 for determining the optimal training method. The controller 190 may include all kinds of devices capable of processing data, such as a processor. Here, the 'processor' may refer to a data processing device embedded in hardware having a physically structured circuit to perform a function expressed by, for example, a code or an instruction included in a program. As an example of the data processing apparatus embedded in the hardware as described above, a microprocessor, a central processing unit (CPU), a processor core, a multiprocessor, an application-specific integrated (ASIC) circuit) and a processing device such as a field programmable gate array (FPGA), but the scope of the present invention is not limited thereto.

도 8은 다른 실시 예에 따른 최적의 훈련 방식 판단 장치의 구성을 개략적으로 설명하기 위하여 도시한 블록도이다. 이하의 설명에서 도 1 내지 도 7에 대한 설명과 중복되는 부분은 그 설명을 생략하기로 한다.8 is a block diagram schematically illustrating a configuration of an apparatus for determining an optimal training method according to another exemplary embodiment. In the following description, descriptions of parts overlapping with those of FIGS. 1 to 7 will be omitted.

도 8을 참조하면, 다른 실시 예에 따른 최적의 훈련 방식 판단 장치(100)는 프로세서(101) 및 메모리(102)를 포함할 수 있다.Referring to FIG. 8 , the apparatus 100 for determining an optimal training method according to another embodiment may include a processor 101 and a memory 102 .

본 실시 예에서 프로세서(101)는 도 1의 저장부(110), 로딩부(120), 제1 생성부(130), 제2 생성부(140), 추출부(150), 제3 생성부(160), 검증부(170), 결정부(180) 및 제어부(190)가 수행하는 기능을 처리할 수 있다. In the present embodiment, the processor 101 includes the storage unit 110 , the loading unit 120 , the first generation unit 130 , the second generation unit 140 , the extraction unit 150 , and the third generation unit of FIG. 1 . Functions performed by 160 , the verification unit 170 , the determiner 180 , and the control unit 190 may be processed.

이러한 프로세서(101)는 최적의 훈련 방식 판단 장치(100) 전체의 동작을 제어할 수 있다. 여기서, '프로세서(processor)'는, 예를 들어 프로그램 내에 포함된 코드 또는 명령어로 표현된 기능을 수행하기 위해 물리적으로 구조화된 회로를 갖는, 하드웨어에 내장된 데이터 처리 장치를 의미할 수 있다. 이와 같이 하드웨어에 내장된 데이터 처리 장치의 일 예로써, 마이크로프로세서(microprocessor), 중앙처리장치(central processing unit: CPU), 프로세서 코어(processor core), 멀티프로세서(multiprocessor), ASIC(application-specific integrated circuit), FPGA(field programmable gate array) 등의 처리 장치를 망라할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다.The processor 101 may control the overall operation of the apparatus 100 for determining the optimal training method. Here, the 'processor' may refer to a data processing device embedded in hardware having a physically structured circuit to perform a function expressed by, for example, a code or an instruction included in a program. As an example of the data processing apparatus embedded in the hardware as described above, a microprocessor, a central processing unit (CPU), a processor core, a multiprocessor, an application-specific integrated (ASIC) circuit) and a processing device such as a field programmable gate array (FPGA), but the scope of the present invention is not limited thereto.

메모리(102)는 프로세서(101)와 동작 가능하게 연결되고, 프로세서(101)에서 수행되는 동작과 연관하여 적어도 하나의 코드를 저장할 수 있다. The memory 102 may be operatively connected to the processor 101 and store at least one code in association with an operation performed by the processor 101 .

또한, 메모리(102)는 프로세서(101)가 처리하는 데이터를 일시적 또는 영구적으로 저장하는 기능을 수행할 수 있다. 여기서, 메모리(102)는 자기 저장 매체(magnetic storage media) 또는 플래시 저장 매체(flash storage media)를 포함할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다. 이러한 메모리(102)는 내장 메모리 및/또는 외장 메모리를 포함할 수 있으며, DRAM, SRAM, 또는 SDRAM 등과 같은 휘발성 메모리, OTPROM(one time programmable ROM), PROM, EPROM, EEPROM, mask ROM, flash ROM, NAND 플래시 메모리, 또는 NOR 플래시 메모리 등과 같은 비휘발성 메모리, SSD, CF(compact flash) 카드, SD 카드, Micro-SD 카드, Mini-SD 카드, Xd 카드, 또는 메모리 스틱(memory stick) 등과 같은 플래시 드라이브, 또는 HDD와 같은 저장 장치를 포함할 수 있다.In addition, the memory 102 may perform a function of temporarily or permanently storing data processed by the processor 101 . Here, the memory 102 may include a magnetic storage media or a flash storage media, but the scope of the present invention is not limited thereto. Such memory 102 may include internal memory and/or external memory, including volatile memory such as DRAM, SRAM, or SDRAM, one time programmable ROM (OTPROM), PROM, EPROM, EEPROM, mask ROM, flash ROM, NAND flash memory, or non-volatile memory such as NOR flash memory, SSD, compact flash (CF) card, SD card, Micro-SD card, Mini-SD card, Xd card, or flash drive such as a memory stick , or a storage device such as HDD.

도 9은 본 실시 예에 따른 최적의 훈련 방식 판단 방법을 설명하기 위한 흐름도이다. 이하의 설명에서 도 1 내지 도 8에 대한 설명과 중복되는 부분은 그 설명을 생략하기로 한다.9 is a flowchart illustrating a method for determining an optimal training method according to the present embodiment. In the following description, descriptions of parts overlapping with those of FIGS. 1 to 8 will be omitted.

도 9를 참조하면, S910단계에서, 최적의 훈련 방식 판단 장치(100)는 기계에서 발생하는 진동신호와 관련하여, 레이블링된 정상 데이터 세트 및 레이블링된 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 로딩할 수 있다. 본 실시 예에서 최적의 훈련 방식 판단 장치(100)는 정상 기계에서 발생하는 제1 진동신호를 정상(normal) 상태로 레이블링한 정상 데이터 세트와, 비정상 기계에서 발생하는 제2 진동신호를 비정상(broken) 상태로 레이블링한 비정상 데이터 세트를 포함하는 오리지널 데이터 세트를 로딩할 수 있다.Referring to FIG. 9 , in step S910 , the apparatus 100 for determining the optimal training method loads the original data set including the labeled normal data set and the labeled abnormal data set in relation to the vibration signal generated by the machine. can In this embodiment, the optimal training method determination apparatus 100 breaks the normal data set in which the first vibration signal generated in the normal machine is labeled as normal and the second vibration signal generated in the abnormal machine is broken. ), we can load the original data set containing the anomalous data set labeled with the state.

S920단계에서, 최적의 훈련 방식 판단 장치(100)는 레이블링된 비정상 데이터 세트 중 적어도 일부를 생성적 적대 신경망(GAN: generative adversarial network)에 적용하여 모사(fake) 비정상 데이터 그룹을 생성할 수 있다. 본 실시 예에서, 최적의 훈련 방식 판단 장치(100)는 가우시안 분포를 이용하여 생성된 랜덤 노이즈로부터 모사 비정상 데이터를 생성하고, 모사 비정상 데이터 및 레이블링된 비정상 데이터 세트 중 적어도 일부의 비교를 통해 모사 비정상 데이터가 참인지 거짓인지 식별할 수 있다.In step S920 , the apparatus 100 for determining the optimal training method may generate a fake abnormal data group by applying at least a portion of the labeled abnormal data set to a generative adversarial network (GAN). In this embodiment, the optimal training method determination apparatus 100 generates simulated abnormal data from random noise generated using a Gaussian distribution, and simulates abnormality through comparison of at least some of the simulated abnormal data and the labeled abnormal data set. It can identify whether data is true or false.

S930단계에서, 최적의 훈련 방식 판단 장치(100)는 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 변경하며 복수의 훈련 데이터 세트를 생성할 수 있다. 본 실시 예에서, 최적의 훈련 방식 판단 장치(100)는 제1 비율 내지 제6 비율로 변경하면서 훈련 데이터 세트를 생성할 수 있다. In operation S930, the optimal training method determining apparatus 100 may generate a plurality of training data sets by changing the ratio of the first data group and the simulated abnormal data group among the original data sets. In this embodiment, the apparatus 100 for determining the optimal training method may generate the training data set while changing the first to sixth ratios.

S940단계에서, 최적의 훈련 방식 판단 장치(100)는 복수의 훈련 데이터 세트를 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출할 수 있다.In operation S940 , the apparatus 100 for determining the optimal training method may extract a feature by inputting a plurality of training data sets into a plurality of different feature extraction algorithms.

본 실시 예에서, 최적의 훈련 방식 판단 장치(100)는 복수의 훈련 데이터 세트에 제1 특징 추출 알고리즘을 적용하여 생성한 주파수도메인의 데이터 그룹을 제1 특징으로 추출할 수 있다. 여기서, 제1 특징 추출 알고리즘은, 시간도메인의 진동신호를 주파수도메인의 진동신호로 변환시키는 FFT(fast fourier transform) 알고리즘을 포함할 수 있다.In the present embodiment, the apparatus 100 for determining the optimal training method may extract, as a first feature, a data group of a frequency domain generated by applying a first feature extraction algorithm to a plurality of training data sets. Here, the first feature extraction algorithm may include a fast Fourier transform (FFT) algorithm for converting a time domain vibration signal into a frequency domain vibration signal.

또한, 최적의 훈련 방식 판단 장치(100)는 주파수도메인의 데이터 그룹에 제2 특징 추출 알고리즘을 적용하여 산출한 통계적 특성값(statistical analysis)을 제2 특징으로 추출할 수 있다. 여기서, 제2 특징 추출 알고리즘은, 주파수도메인으로 변환된 수평방향의 진동신호, 수직방향의 진동신호, 깊이방향의 진동신호 및 대각선방향의 진동신호 각각의 평균값(mean), 중간(median)값, 최소(min)값, 최대(max)값, 첨도(kurtosis)값, 왜도(skewness)값, 표준편차(standard deviation)값과, 양진폭(peak-to-peak)값 중 하나 이상을 산출하는 알고리즘을 포함할 수 있다.Also, the apparatus 100 for determining the optimal training method may extract a statistical analysis calculated by applying the second feature extraction algorithm to the data group of the frequency domain as the second feature. Here, the second feature extraction algorithm is the average value, median value, To calculate one or more of the minimum (min) value, the maximum (max) value, the kurtosis value, the skewness value, the standard deviation value, and the peak-to-peak value Algorithms may be included.

또한, 최적의 훈련 방식 판단 장치(100)는 는 복수의 훈련 데이터 세트에 제3 특징 추출 알고리즘을 적용하여 생성한 차원이 축소된 데이터 그룹을 제3 특징으로 추출할 수 있다. 여기서, 제3 특징 추출 알고리즘은, 차원이 축소된 데이터를 생성하는 주성분 분석(Principal Component Analysis) 알고리즘을 포함할 수 있다.Also, the apparatus 100 for determining the optimal training method may extract, as a third feature, a data group with reduced dimensions generated by applying a third feature extraction algorithm to a plurality of training data sets. Here, the third feature extraction algorithm may include a principal component analysis algorithm that generates data with reduced dimensions.

본 실시 예에서 최적의 훈련 방식 판단 장치(100)는 제1 비율 내지 제6 비율의 데이터를 이용하여 최소 18개 내지 최대 60개의 특징을 추출할 수 있다.In the present embodiment, the apparatus 100 for determining the optimal training method may extract a minimum of 18 to a maximum of 60 features by using the data of the first to sixth ratios.

S950단계에서, 최적의 훈련 방식 판단 장치(100)는 서로 다른 복수의 특징 추출 알고리즘에 입력하여 특징을 추출한 결과를 훈련 데이터로 하여 서로 다른 알고리즘으로 구현된 복수의 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 서로 다른 알고리즘으로 구현된 복수의 기계 결함 판별 학습모델을 생성할 수 있다.In step S950, the apparatus 100 for determining the optimal training method trains a plurality of learning models implemented with different algorithms by inputting a plurality of different feature extraction algorithms and using the result of feature extraction as training data, and generating in the machine It is possible to generate a plurality of machine defect determination learning models implemented with different algorithms for determining whether a machine is defective by using the characteristics of the vibration signal.

본 실시 예에서, 최적의 훈련 방식 판단 장치(100)는 제1 특징 내지 제3 특징을 각각 훈련 데이터로 하여 제1 알고리즘으로 구현된 제1 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 복수의 제1 기계 결함 판별 학습모델을 생성할 수 있다. 여기서, 제1 학습모델을 훈련시키는 훈련 데이터는 토탈 최소 18세트 내지 최대 60세트의 훈련 데이터를 포함할 수 있고, 각 세트의 훈련 데이터로 제1 학습모델을 훈련시킬 수 있다. 또한, 복수의 제1 기계 결함 판별 학습모델은 각 세트 별로 토탈 최소 18개 내지 최대 60개의 제1 기계 결함 판별 학습모델을 포함할 수 있다.In this embodiment, the optimal training method determination apparatus 100 trains the first learning model implemented with the first algorithm by using the first to third characteristics as training data, respectively, and the characteristics of the vibration signal generated by the machine It is possible to generate a plurality of first machine defect determination learning models that determine whether the machine is defective by using . Here, the training data for training the first learning model may include a total of a minimum of 18 sets to a maximum of 60 sets of training data, and the first learning model may be trained with the training data of each set. In addition, the plurality of first machine defect determination learning models may include a total of at least 18 to a maximum of 60 first machine defect determination learning models for each set.

또한, 최적의 훈련 방식 판단 장치(100)는 제1 특징 내지 제3 특징을 각각 훈련 데이터로 하여 제2 알고리즘으로 구현된 제2 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 복수의 제2 기계 결함 판별 학습모델을 생성할 수 있다. 여기서, 제2 학습모델을 훈련시키는 훈련 데이터는 토탈 최소 18세트 내지 최대 60세트의 훈련 데이터를 포함할 수 있고, 각 세트의 훈련 데이터로 제2 학습모델을 훈련시킬 수 있다. 또한, 복수의 제2 기계 결함 판별 학습모델은 각 세트 별로 토탈 최소 18개 내지 최대 60개의 제2 기계 결함 판별 학습모델을 포함할 수 있다.In addition, the optimal training method determination apparatus 100 trains a second learning model implemented with a second algorithm using the first to third features as training data, respectively, and uses the features of the vibration signal generated by the machine. A plurality of second machine defect determination learning models for determining whether a machine is defective may be generated. Here, the training data for training the second learning model may include a total of minimum 18 sets to a maximum of 60 sets of training data, and the second learning model may be trained with each set of training data. In addition, the plurality of second machine defect determination learning models may include a total of at least 18 to a maximum of 60 second machine defect determination learning models for each set.

또한, 최적의 훈련 방식 판단 장치(100)는 제1 특징 내지 제3 특징을 각각 훈련 데이터로 하여 제3 알고리즘으로 구현된 제3 학습모델을 훈련시켜, 기계에서 발생하는 진동신호의 특징을 이용하여 기계의 결함여부를 판단하는 복수의 제3 기계 결함 판별 학습모델을 생성할 수 있다. 여기서, 제3 학습모델을 훈련시키는 훈련 데이터는 토탈 최소 18세트 내지 최대 60세트의 훈련 데이터를 포함할 수 있고, 각 세트의 훈련 데이터로 제3 학습모델을 훈련시킬 수 있다. 또한, 복수의 제3 기계 결함 판별 학습모델은 각 세트 별로 토탈 최소 18개 내지 최대 60개의 제3 기계 결함 판별 학습모델을 포함할 수 있다.In addition, the optimal training method determination apparatus 100 trains the third learning model implemented with the third algorithm using the first to third features as training data, respectively, and uses the features of the vibration signal generated by the machine. A plurality of third machine defect determination learning models for determining whether a machine is defective may be generated. Here, the training data for training the third learning model may include a total of minimum 18 sets to a maximum of 60 sets of training data, and the third learning model may be trained with each set of training data. In addition, the plurality of third machine defect determination learning models may include a total of at least 18 to a maximum of 60 third machine defect determination learning models for each set.

최적의 훈련 방식 판단 장치(100)는 제1 기계 결함 판별 학습모델로부터 토탈 최소 18개 내지 최대 60개의 기계 결함 여부에 대한 제1 분류값을 출력할 수 있고, 제2 기계 결함 판별 학습모델로부터 토탈 최소 18개 내지 최대 60개의 기계 결함 여부에 대한 제2 분류값을 출력할 수 있고, 제3 기계 결함 판별 학습모델로부터 토탈 최소 18개 내지 최대 60개의 기계 결함 여부에 대한 제3 분류값을 출력할 수 있다. 이로써 최적의 훈련 방식 판단 장치(100)는 토탈 최소 54개 내지 최대 180개의 기계 결함 여부에 대한 분류값을 출력할 수 있다.The optimal training method determination apparatus 100 may output a first classification value for whether a total of at least 18 to a maximum of 60 machine defects from the first machine defect determination learning model, and a total from the second machine defect determination learning model It is possible to output a second classification value for whether there are a minimum of 18 to a maximum of 60 machine defects, and a third classification value for a total of at least 18 to a maximum of 60 machine defects can be output from the third machine defect determination learning model. can Accordingly, the apparatus 100 for determining the optimal training method may output a classification value for a total of at least 54 and at most 180 machine defects.

S960단계에서, 최적의 훈련 방식 판단 장치(100)는 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델의 정확도를 검증할 수 있다. 본 실시 예에서, 최적의 훈련 방식 판단 장치(100)는 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 제1 기계 결함 판별 학습모델에 대한 제1 정확도를 검증하고, 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 제2 기계 결함 판별 학습모델에 대한 제2 정확도를 검증하고, 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 제3 기계 결함 판별 학습모델에 대한 제3 정확도를 검증할 수 있다.In operation S960 , the apparatus 100 for determining the optimal training method may verify the accuracy of the plurality of machine defect identification learning models by using the second data group among the original data sets. In this embodiment, the apparatus 100 for determining the optimal training method verifies the first accuracy of the plurality of first machine defect determination learning models by using the second data group among the original data sets, and the second among the original data sets Validating the second accuracy of the plurality of second machine defect determination learning models using the data group, and verifying the third accuracy of the plurality of third machine defect determination learning models using the second data group of the original data set can do.

여기서, 정확도를 검증한다 함은, 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델 각각으로부터 출력되는 분류값과, 오리지널 데이터 세트 중 제2 데이터 그룹을 이용하여 복수의 기계 결함 판별 학습모델 각각으로부터 출력되는 분류값을 비교하여 일치하는 정도를 분석하는 것을 포함할 수 있다.Here, verifying the accuracy means using the classification value output from each of a plurality of machine fault determination learning models using the first data group and the simulated abnormal data group among the original data set, and the second data group of the original data set to compare the classification values output from each of the plurality of machine defect determination learning models to analyze the degree of agreement.

S970단계에서, 최적의 훈련 방식 판단 장치(100)는 기계 결함 판별 학습모델들의 정확도를 검증한 결과를 기반으로 하여, 특징 추출 알고리즘 별로 오리지널 데이터 세트 중 제1 데이터 그룹과 모사 비정상 데이터 그룹의 최적 비율과, 복수의 기계 결함 판별 학습모델 중 최적 학습모델을 결정할 수 있다.In step S970, the optimal training method determination apparatus 100 determines the optimal ratio of the first data group and the simulated abnormal data group in the original data set for each feature extraction algorithm based on the result of verifying the accuracy of the machine fault determination learning models. And, it is possible to determine an optimal learning model among a plurality of machine defect discrimination learning models.

본 실시 예에서, 최적의 훈련 방식 판단 장치(100)는 기계 결함 판별 학습모델들의 정확도를 검증한 결과로부터 미리 정해진 임계 정확도 이상을 도출하게 하는 훈련 데이터 세트 중 제1 데이터 그룹의 비율이 가장 낮은 훈련 데이터 세트에서의 제1 데이터 그룹과 모사 비정상 데이터 그룹의 비율을 최적 비율로 결정할 수 있다. 최적의 훈련 방식 판단 장치(100)는 제1 기계 결함 판별 학습모델 내지 제3 기계 결함 판별 학습모델 중 가장 높은 정확도를 갖는 어느 한 기계 결함 판별 학습모델을 최적 학습모델로 결정할 수 있다. 최적의 훈련 방식 판단 장치(100)는 제1 특징 추출 알고리즘 내지 제3 특징 추출 알고리즘 중 가장 높은 정확도를 갖는 기계 결함 판별 학습모델에 입력하는 훈련 데이터를 생성한 어느 한 특징 추출 알고리즘을 최적 특징 추출 알고리즘으로 결정할 수 있다.In this embodiment, the optimal training method determination apparatus 100 is a training with the lowest ratio of the first data group among the training data sets that derive more than a predetermined threshold accuracy from the result of verifying the accuracy of the machine fault determination learning models. A ratio between the first data group and the simulated abnormal data group in the data set may be determined as an optimal ratio. The optimal training method determination apparatus 100 may determine any one machine defect determination learning model having the highest accuracy among the first machine defect determination learning model to the third machine defect determination learning model as the optimum learning model. The optimal training method determination apparatus 100 uses any one feature extraction algorithm that generates training data to be input to the machine fault determination learning model having the highest accuracy among the first feature extraction algorithm to the third feature extraction algorithm, and the optimal feature extraction algorithm. can be determined as

기계의 종류마다 효과 있는 특징 추출 알고리즘, 인공지능 모델이 다르고, 오리지널 데이터와 모사 비정상 데이터 그사이의 비율이 다를 수 있으므로, 기계마다 본 발명의 알고리즘을 통해 최적 방식 즉, 데이터 비율, 기계 결함 판별 학습모델 및 특징 추출 알고리즘을 선택(결정)할 수 있다. 여기서 선택은, 상술한 제1 비율 내지 제6 비율 중 어느 한 비율을 포함할 수 있고, 최소 54개 내지 최대 180개의 분류값을 출력하는 복수의 기계 결함 판별 학습모델 중 어느 한 기계 결함 판별 학습모델을 포함할 수 있고, 최소 18개 내지 최대 60개의 특징을 추출하는 복수의 특징 추출 알고리즘 중 어느 한 알고리즘을 선택할 수 있다. 오리지널 데이터와 모사 데이터의 비율에 있어서는 모사 데이터를 넣어 훈련 데이터의 사이즈를 커지게 하는 것과 모사 데이터의 추가로 인하여 훈련 데이터의 정확도가 떨어지는 것이 trade off 관계일 수 있다. 본 발명의 최적의 훈련 방식 판단에 따르면, 훈련 데이터의 사이즈를 커지게 하면서도 훈련 데이터의 정확도를 떨어뜨리지 않는(혹은 일정 이상 떨어뜨리지 않는) 최대 모사 데이터 비율을 찾을 수 있다.Effective feature extraction algorithms and artificial intelligence models are different for each type of machine, and the ratio between original data and simulated abnormal data may be different. and a feature extraction algorithm may be selected (determined). Here, the selection may include any one of the above-described first to sixth ratios, and any one of a plurality of machine defect determination learning models outputting a minimum of 54 to a maximum of 180 classification values. may include, and any one algorithm among a plurality of feature extraction algorithms for extracting a minimum of 18 to a maximum of 60 features may be selected. In the ratio of the original data to the simulated data, increasing the size of the training data by adding the simulated data and decreasing the accuracy of the training data due to the addition of the simulated data may be a trade-off relationship. According to the determination of the optimal training method of the present invention, it is possible to find the maximum simulated data ratio that does not decrease the accuracy of the training data while increasing the size of the training data (or does not drop more than a certain amount).

이상 설명된 본 발명에 따른 실시 예는 컴퓨터 상에서 다양한 구성요소를 통하여 실행될 수 있는 컴퓨터 프로그램의 형태로 구현될 수 있으며, 이와 같은 컴퓨터 프로그램은 컴퓨터로 판독 가능한 매체에 기록될 수 있다. 이때, 매체는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체, CD-ROM 및 DVD와 같은 광기록 매체, 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical medium), 및 ROM, RAM, 플래시 메모리 등과 같은, 프로그램 명령어를 저장하고 실행하도록 특별히 구성된 하드웨어 장치를 포함할 수 있다.The above-described embodiment according to the present invention may be implemented in the form of a computer program that can be executed through various components on a computer, and such a computer program may be recorded in a computer-readable medium. In this case, the medium includes a hard disk, a magnetic medium such as a floppy disk and a magnetic tape, an optical recording medium such as CD-ROM and DVD, a magneto-optical medium such as a floppy disk, and a ROM. , RAM, flash memory, and the like, and hardware devices specially configured to store and execute program instructions.

한편, 상기 컴퓨터 프로그램은 본 발명을 위하여 특별히 설계되고 구성된 것이거나 컴퓨터 소프트웨어 분야의 당업자에게 공지되어 사용 가능한 것일 수 있다. 컴퓨터 프로그램의 예에는, 컴파일러에 의하여 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용하여 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드도 포함될 수 있다.Meanwhile, the computer program may be specially designed and configured for the present invention, or may be known and used by those skilled in the computer software field. Examples of the computer program may include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.

본 발명의 명세서(특히 특허청구범위에서)에서 "상기"의 용어 및 이와 유사한 지시 용어의 사용은 단수 및 복수 모두에 해당하는 것일 수 있다. 또한, 본 발명에서 범위(range)를 기재한 경우 상기 범위에 속하는 개별적인 값을 적용한 발명을 포함하는 것으로서(이에 반하는 기재가 없다면), 발명의 상세한 설명에 상기 범위를 구성하는 각 개별적인 값을 기재한 것과 같다. In the specification of the present invention (especially in the claims), the use of the term "above" and similar referential terms may be used in both the singular and the plural. In addition, when a range is described in the present invention, each individual value constituting the range is described in the detailed description of the invention as including the invention to which individual values belonging to the range are applied (unless there is a description to the contrary). same as

본 발명에 따른 방법을 구성하는 단계들에 대하여 명백하게 순서를 기재하거나 반하는 기재가 없다면, 상기 단계들은 적당한 순서로 행해질 수 있다. 반드시 상기 단계들의 기재 순서에 따라 본 발명이 한정되는 것은 아니다. 본 발명에서 모든 예들 또는 예시적인 용어(예들 들어, 등등)의 사용은 단순히 본 발명을 상세히 설명하기 위한 것으로서 특허청구범위에 의해 한정되지 않는 이상 상기 예들 또는 예시적인 용어로 인해 본 발명의 범위가 한정되는 것은 아니다. 또한, 당업자는 다양한 수정, 조합 및 변경이 부가된 특허청구범위 또는 그 균등물의 범주 내에서 설계 조건 및 팩터에 따라 구성될 수 있음을 알 수 있다.The steps constituting the method according to the present invention may be performed in an appropriate order unless the order is explicitly stated or there is no description to the contrary. The present invention is not necessarily limited to the order in which the steps are described. The use of all examples or exemplary terms (eg, etc.) in the present invention is merely for the purpose of describing the present invention in detail, and the scope of the present invention is limited by the examples or exemplary terms unless limited by the claims. it is not going to be In addition, those skilled in the art will appreciate that various modifications, combinations, and changes may be made in accordance with design conditions and factors within the scope of the appended claims or their equivalents.

따라서, 본 발명의 사상은 상기 설명된 실시 예에 국한되어 정해져서는 아니 되며, 후술하는 특허청구범위뿐만 아니라 이 특허청구범위와 균등한 또는 이로부터 등가적으로 변경된 모든 범위는 본 발명의 사상의 범주에 속한다고 할 것이다.Therefore, the spirit of the present invention should not be limited to the above-described embodiments, and the scope of the spirit of the present invention is not limited to the scope of the scope of the present invention. will be said to belong to

100: 최적의 훈련 방식 판단 장치(100)
101: 프로세서 102: 메모리
110: 저장부 120: 로딩부
130: 제1 생성부 140: 제2 생성부
150: 추출부 160: 제3 생성부
170: 검증부 180: 결정부
190: 제어부100: optimal training method determination device 100
101: processor 102: memory
110: storage unit 120: loading unit
130: first generating unit 140: second generating unit
150: extraction unit 160: third generation unit
170: Verification unit 180: Determination unit
190: control unit

Claims

A method of searching for an optimal training scheme, performed by a processor of an optimal training scheme determining device, comprising:
loading an original data set comprising a labeled normal data set and a labeled abnormal data set in relation to a vibration signal generated by the machine;
generating a fake abnormal data group by applying at least a portion of the labeled abnormal data set to a generative adversarial network (GAN);
generating a plurality of training data sets while adding data of the simulated abnormal data group to a first data group among the original data sets and changing the ratio between the first data group and the simulated abnormal data group;
extracting features by inputting the plurality of training data sets into a plurality of different feature extraction algorithms;
A plurality of learning models implemented with different algorithms are trained using the result of extracting features by input to the plurality of different feature extraction algorithms as training data. generating a plurality of machine defect determination learning models implemented with different algorithms to determine ;
verifying the accuracy of the plurality of machine defect determination learning models by using a second data group among the original data sets; and
Based on the result of verifying the accuracy of the machine defect determination learning model, the optimal ratio of the first data group and the simulated abnormal data group in the original data set for each feature extraction algorithm, and the plurality of machine defect determination learning models Including the step of determining the optimal learning model,
The determining step is
The first data group and the simulated abnormal data in the training data set having the lowest ratio of the first data group among the training data sets for deriving more than a predetermined threshold accuracy from the results of verifying the accuracy of the machine fault determination learning models determining the proportion of the group;
determining any one machine defect determination learning model having the highest accuracy among the plurality of machine defect determination learning models; and
Comprising the step of determining any one feature extraction algorithm that generates training data to be input to the machine fault determination learning model having the highest accuracy among the plurality of feature extraction algorithms,
How to determine the optimal training method.

The method of claim 1,
The loading step is
labeling a first vibration signal generated in a normal machine as a normal state; and
Labeling the second vibration signal generated in the abnormal machine as a broken state,
How to determine the optimal training method.

3. The method of claim 2,
The vibration signal is
A horizontal vibration signal generated in the machine, a vertical vibration signal generated in the machine, a depth vibration signal generated in the machine, and horizontal, vertical and depth directions generated in the machine Including a vibration signal in the diagonal direction as a reference,
How to determine the optimal training method.

The method of claim 1,
The step of creating the fake abnormal data group comprises:
generating simulated anomalous data from random noise generated using a Gaussian distribution; and
identifying whether the simulated anomalous data is true or false through comparison of at least a portion of the simulated anomalous data and the labeled anomalous data set;
How to determine the optimal training method.

4. The method of claim 3,
The step of extracting the feature is
extracting, as a first feature, a data group of a frequency domain generated by applying a first feature extraction algorithm to the plurality of training data sets;
extracting, as a second feature, a statistical analysis calculated by applying a second feature extraction algorithm to the data group of the frequency domain; and
and extracting a reduced-dimensional data group generated by applying a third feature extraction algorithm to the plurality of training data sets as a third feature,
How to determine the optimal training method.

6. The method of claim 5,
The first feature extraction algorithm,
It includes a fast Fourier transform (FFT) algorithm that converts a time domain vibration signal into a frequency domain vibration signal,
The second feature extraction algorithm,
Mean value, median value, minimum value of each of the vibration signal in the horizontal direction, the vibration signal in the vertical direction, the vibration signal in the depth direction, and the vibration signal in the diagonal direction converted into the frequency domain An algorithm for calculating one or more of a value, a max value, a kurtosis value, a skewness value, a standard deviation value, and a peak-to-peak value; ,
The third feature extraction algorithm,
comprising a principal component analysis (PCA) algorithm that generates reduced-dimensional data;
How to determine the optimal training method.

6. The method of claim 5,
The step of generating the plurality of machine defect determination learning models,
A first learning model implemented with a first algorithm is trained using the first to third features as training data, respectively, and a plurality of first learning models for judging whether the machine is defective using the features of the vibration signal generated by the machine 1 generating a machine fault determination learning model;
A second learning model implemented as a second algorithm is trained using the first to third features as training data, respectively, and a plurality of second learning models that determine whether the machine is defective using the features of the vibration signal generated by the machine 2 generating a machine fault determination learning model; and
A third learning model implemented as a first algorithm is trained using the first to third features as training data, respectively, and a plurality of third learning models that determine whether the machine is defective using the features of the vibration signal generated by the machine 3 comprising the step of generating a machine fault determination learning model,
How to determine the optimal training method.

8. The method of claim 7,
The step of verifying the accuracy is
verifying first accuracy for the plurality of first machine defect determination learning models by using a second data group of the original data set;
verifying second accuracy of the plurality of second machine defect determination learning models by using a second data group among the original data sets; and
using a second data group of the original data set to verify a third accuracy for the plurality of third machine defect determination learning models,
How to determine the optimal training method.

delete

A computer-readable recording medium storing a computer program for executing the method of any one of claims 1 to 8 using a computer.

As a device for searching for an optimal training method,
processor; and
a memory operatively coupled to the processor and storing at least one code executed by the processor;
the memory, when executed by the processor, the processor loads an original data set comprising a labeled normal data set and a labeled abnormal data set in relation to a vibration signal generated in the machine;
generating a fake abnormal data group by applying at least a portion of the labeled abnormal data set to a generative adversarial network (GAN);
generating a plurality of training data sets by changing the ratio of the first data group to the simulated abnormal data group while adding data of the simulated abnormal data group to a first data group among the original data sets;
extracting features by inputting the plurality of training data sets into a plurality of different feature extraction algorithms;
A plurality of learning models implemented with different algorithms are trained using the result of extracting features by input to the plurality of different feature extraction algorithms as training data. Create a plurality of machine fault determination learning models implemented with different algorithms to determine
verifying the accuracy of the plurality of machine fault determination learning models by using a second data group among the original data sets;
Based on the result of verifying the accuracy of the machine defect determination learning model, the optimal ratio of the first data group and the simulated abnormal data group in the original data set for each feature extraction algorithm, and the plurality of machine defect determination learning models stores the code that causes to determine the optimal learning model,
The memory causes the processor to
When performing the determination, the first in the training data set in which the ratio of the first data group is the lowest among the training data sets for deriving more than a predetermined threshold accuracy from the result of verifying the accuracy of the machine fault determination learning models. determining the ratio of the data group to the simulated abnormal data group;
Determining any one machine defect determination learning model having the highest accuracy among the plurality of machine defect determination learning models,
Storing a code causing to determine one of the feature extraction algorithms that generated training data to be input to the machine fault determination learning model with the highest accuracy among the plurality of feature extraction algorithms,
Optimal training method judgment device.

12. The method of claim 11,
The memory causes the processor to
When loading the original data set, code that causes a first vibration signal generated by a normal machine to be labeled as a normal state and a second vibration signal generated from an abnormal machine to be labeled as a broken state. to save,
Optimal training method judgment device.

13. The method of claim 12,
The memory causes the processor to
The vibration signal includes a horizontal vibration signal generated in the machine, a vertical vibration signal generated in the machine, a depth direction vibration signal generated in the machine, and horizontal and vertical vibration signals generated in the machine. storing a code causing to include a vibration signal in a diagonal direction based on a direction and a depth direction,
Optimal training method judgment device.

12. The method of claim 11,
The memory causes the processor to
When generating the fake anomalous data group, simulated anomalous data is generated from random noise generated using a Gaussian distribution,
storing code that causes the comparison of at least a portion of the simulated anomalous data and the labeled anomalous data set to identify whether the simulated anomalous data is true or false;
Optimal training method judgment device.

14. The method of claim 13,
The memory causes the processor to
When extracting the feature, a data group of a frequency domain generated by applying a first feature extraction algorithm to the plurality of training data sets is extracted as a first feature,
extracting a statistical analysis calculated by applying a second feature extraction algorithm to the data group of the frequency domain as a second feature;
storing a code that causes a reduced-dimensional data group generated by applying a third feature extraction algorithm to the plurality of training data sets to be extracted as a third feature,
Optimal training method judgment device.

16. The method of claim 15,
The memory causes the processor to
The first feature extraction algorithm includes a fast Fourier transform (FFT) algorithm for converting a time domain vibration signal into a frequency domain vibration signal,
The second feature extraction algorithm includes an average value of each of the vibration signal in the horizontal direction, the vibration signal in the vertical direction, the vibration signal in the depth direction, and the vibration signal in the diagonal direction converted into the frequency domain. one of median value, min value, max value, kurtosis value, skewness value, standard deviation value, and peak-to-peak value an algorithm for calculating an anomaly;
wherein the third feature extraction algorithm stores code that causes it to include a principal component analysis (PCA) algorithm that generates reduced-dimensional data;
Optimal training method judgment device.

16. The method of claim 15,
The memory causes the processor to
When generating the plurality of machine defect discrimination learning models, the first learning model implemented by the first algorithm is trained using the first to third features as training data, respectively, and the characteristics of the vibration signal generated by the machine to generate a plurality of first machine defect determination learning models that determine whether the machine is defective,
A second learning model implemented as a second algorithm is trained using the first to third features as training data, respectively, and a plurality of second learning models that determine whether the machine is defective using the features of the vibration signal generated by the machine 2 Create a machine fault determination learning model,
A third learning model implemented as a first algorithm is trained using the first to third features as training data, respectively, and a plurality of third learning models that determine whether the machine is defective using the features of the vibration signal generated by the machine 3 to store the code that causes the machine fault determination learning model to be created,
Optimal training method judgment device.

18. The method of claim 17,
The memory causes the processor to
When verifying the accuracy, verifying the first accuracy for the plurality of first machine defect determination learning models by using a second data group among the original data set,
verifying the second accuracy of the plurality of second machine defect determination learning models by using a second data group among the original data sets;
storing code that causes to verify a third accuracy for the plurality of third machine fault determination learning models using a second data group of the original data set;
Optimal training method judgment device.

delete