KR20210077389A

KR20210077389A - Deep learning based abnormal symptom detection system using mechanical sound of mechanical equipment

Info

Publication number: KR20210077389A
Application number: KR1020190168976A
Authority: KR
Inventors: 최용호; 김태경; 김윤중
Original assignee: (주)유코아시스템
Priority date: 2019-12-17
Filing date: 2019-12-17
Publication date: 2021-06-25
Also published as: KR102272622B1

Abstract

The present invention relates to a deep learning-based abnormal symptom detection system through noise or vibration of mechanical equipment, which includes: a sensor for data collection; a feature extraction unit; a deep learning unit configured to perform deep learning through any one or a combination of two or more among artificial neural networks (ANN), deep neural networks (DNN), convolution neural networks (CNN), long short term memory recurrent neural networks (LSTM RNN), connectionist temporal classification (CTC), and attention mechanism; and an abnormality detection unit for detecting abnormal signs of mechanical equipment.

Description

Deep learning based abnormal symptom detection system using mechanical sound of mechanical equipment

본 발명은 기계설비의 소음 또는 진동을 통한 딥러닝기반 이상징후 감지시스템에 관한 것으로서 더욱 상세하게는 하나 이상의 설비에서 발생되는 기계음을 데이터 수집용 센서로 수집하여 특징추출 알고리즘을 적용하여 기계음에 대한 특징을 추출하고 학습된 딥러닝 모델을 통해 해당 기계음으로부터 추출된 특징 데이터가 정상 또는 이상인지 판단함에 따라 설치 및 유지보수 비용이 절감되는 기계설비의 기계음을 이용한 딥러닝기반 이상징후 감지시스템에 관한 것이다. The present invention relates to a system for detecting anomalies based on deep learning through noise or vibration of mechanical equipment, and more particularly, by collecting machine sounds generated from one or more facilities with a data collection sensor and applying a feature extraction algorithm to the characteristics of the machine sound It relates to a deep learning-based abnormal symptom detection system using machine sounds of machinery that reduces installation and maintenance costs by extracting and judging whether the feature data extracted from the machine sound is normal or abnormal through the learned deep learning model.

종래의 설비 진단기술은 설비 내부에서 발생하는 데이터를 수집하고 이를 활용하여 구동부의 열화 및/또는 고장을 모니터링 함으로써, 사전에 부품 고장 여부 확인, 부품 수급, 사전 조치 등을 통해 설비의 가동률의 저하가 최소가 되도록 CBM(Condition Based Management)을 수행하여 왔다. The conventional facility diagnosis technology collects data generated inside the facility and uses it to monitor the deterioration and/or failure of the driving unit. CBM (Condition Based Management) has been performed to minimize it.

예컨대, 모터 드라이버(driver)에서 수집할 수 있는 토크(Torque) 및 속도(Speed) 데이터, 설비의 내/외부 진동 데이터, 온도, 경사, 전류, 기압, 압력, 알람 데이터 등을 센서나 DAQ(Data Acquisition) 등으로 획득하여 활용함으로써, 설비의 고장 진단을 예측하고 있다. For example, torque and speed data that can be collected from a motor driver, internal/external vibration data of equipment, temperature, slope, current, barometric pressure, pressure, alarm data, etc. Acquisition), etc., are used to predict equipment failure diagnosis.

그러나 기존의 설비 진단기술은 몇 가지 애로 사항들이 있는데, 대표적인 애로 사항은 다음과 같다.However, there are several difficulties in the existing facility diagnosis technology, and the representative difficulties are as follows.

첫 번째는 일반적으로 설비로부터 시계열(time series) 데이터를 획득하게 되는데, 시간 도메인(Time domain) 상에서 획득된 시계열 데이터를 직접 정량화하여 설비를 진단하기 어렵다는 점이다. The first is that time series data is generally obtained from equipment, and it is difficult to diagnose equipment by directly quantifying the time series data obtained in the time domain.

왜냐하면, 구동부별 구조적인 특징에 따라서 각 부품의 고유 주파수에 해당하는 진동이 데이터에 포함되어 있기 때문이다. 따라서, 설비의 정량적인 진단을 하기 위해서 도메인을 시간에서 주파수(frequency)로 변경하여 진단을 수행하게 되는데, 주파수 특성을 보다 넓은 범위(range)에서 정확히 측정하기 위해서 샘플링 레이트(sampling rate)를 높이게 되고 이에 따라 데이터 사이즈가 증가하는 문제가 있다.This is because, according to the structural characteristics of each driving unit, the vibration corresponding to the natural frequency of each component is included in the data. Therefore, in order to perform quantitative diagnosis of equipment, the domain is changed from time to frequency and the diagnosis is performed. In order to accurately measure frequency characteristics in a wider range, the sampling rate is increased. Accordingly, there is a problem in that the data size increases.

두 번째는 설비별, 공정별, 모델별, 구동 축별 데이터의 특성이 모두 다르기 때문에, 효과적인 진단을 위해서는 각각의 데이터의 특성이 반영된 개별 진단 알고리즘이 필요하다. 그러나 그러한 개별 진단 알고리즘을 구축하는 데에는 많은 시간, 비용, 및 노력이 요구되고, 또한, 구축된 진단 알고리즘은 해당 데이터에만 적용되므로 효용성 측면에서 한계가 있다.Second, since the characteristics of data for each facility, process, model, and drive axis are all different, an individual diagnosis algorithm reflecting the characteristics of each data is required for effective diagnosis. However, a lot of time, money, and effort are required to construct such an individual diagnostic algorithm, and also, the built diagnostic algorithm is applied only to the corresponding data, so there is a limit in terms of utility.

세 번째는 데이터를 취득함에 있어, 대부분은 정상 데이터이고 간헐적으로 발생하는 고장 데이터의 양은 매우 적다. 그러나 설비의 진단에서, 보통 고장 데이터만을 이용하고, 대부분의 정상 데이터는 수집 직후에 버려진다. Third, when acquiring data, most of it is normal data, and the amount of intermittent failure data is very small. However, in the diagnosis of equipment, only failure data is usually used, and most of the normal data is discarded immediately after collection.

따라서, 정상 데이터의 활용 측면에서 문제가 있고, 또한 설비 진단의 신뢰성 측면에도 문제가 있을 수 있다.Therefore, there is a problem in terms of utilization of normal data, and there may also be problems in terms of reliability of equipment diagnosis.

본 발명은 상기의 문제점을 해결하기 위해 안출된 것으로서 하나 이상의 설비에서 발생되는 기계음 또는 진동을 데이터 수집용 센서로 수집하여 특징추출 알고리즘을 적용하여 기계음에 대한 특징을 추출하고 학습된 딥러닝 모델을 통해 해당 기계음으로부터 추출된 특징 데이터가 정상 또는 이상인지 판단함에 따라 설치 및 유지보수 비용이 절감되는 기계설비의 기계음을 이용한 딥러닝기반 이상징후 감지시스템을 제공함에 그 목적이 있다. The present invention has been devised to solve the above problems, and collects machine sounds or vibrations generated in one or more facilities with a data collection sensor, applies a feature extraction algorithm to extract features for machine sounds, and uses the learned deep learning model. The purpose of this is to provide a deep learning-based abnormal symptom detection system using the machine sound of mechanical equipment that reduces installation and maintenance costs by determining whether the feature data extracted from the machine sound is normal or abnormal.

본 발명은 상기의 목적을 달성하기 위해 아래와 같은 특징을 갖는다.The present invention has the following features to achieve the above object.

본 발명은 기계설비의 기계음을 이용한 딥러닝기반 이상징후 감지시스템에 있어서, 상기 딥러닝기반 이상징후 감지시스템은 하나 이상의 기계설비로부터 발생되는 기계음을 수집하는 데이터 수집용 센서와; 상기 데이터 수집용 센서로부터 수집된 기계음을 전달받아 해당 기계음의 특징을 추출하는 특징 추출부와; 기계음의 특징 데이터를 이용하며 CNN(Convolution Neural Network), LSTM RNN(Long Short Term Memory Recurrent Neural Network), Attention mechanism의 조합을 통해 심층학습을 수행하도록 구성된 딥러닝부와; 상기 특징 추출부로부터 추출된 특징 데이터를 전달받아 상기 딥러닝부를 통해 상기 기계설비에 대한 이상징후를 감지하는 이상감지부를 포함한다.The present invention provides a system for detecting anomalies based on deep learning using machine sounds of mechanical equipment, the system comprising: a data collection sensor for collecting machine sounds generated from one or more mechanical equipment; a feature extraction unit for receiving the machine sound collected from the data collection sensor and extracting features of the machine sound; a deep learning unit configured to perform deep learning using a combination of a convolutional neural network (CNN), a LSTM long short term memory recurrent neural network (RNN), and an attention mechanism using machine sound feature data; and an abnormality detection unit that receives the characteristic data extracted from the characteristic extraction unit and detects abnormal signs of the mechanical equipment through the deep learning unit.

여기서 상기 데이터 수집용 센서는 기계설비로부터 발생되는 기계음을 수집하는 마이크로 마이크이며, 상기 딥러닝기반 이상징후 감지시스템은 기계설비의 이상 또는 설정에 따라 유지보수가 수행된 경우 유지보수 이력정보를 제공하는 유지보수입력부가 포함되며, 상기 딥러닝부는 유지보수 수행 전, 후의 기계음 특징 데이터를 통해 심층학습을 수행한다. Here, the sensor for data collection is a micro-microphone that collects mechanical sounds generated from mechanical equipment, and the deep learning-based abnormal symptom detection system provides maintenance history information when maintenance is performed according to abnormalities or settings of mechanical equipment. A maintenance input unit is included, and the deep learning unit performs deep learning through machine sound feature data before and after performing maintenance.

또한 상기 특징 추출부는 상기 데이터 수집용 센서로부터 수집된 기계음을 전달받아 해당 기계음을 주파수 영역으로 변환하고 잡음 제거 및 멜 필터 뱅크(Mel Filter Bank)를 통해 특징을 추출한다. In addition, the feature extraction unit receives the machine sound collected from the data collection sensor, converts the machine sound into a frequency domain, and extracts features through noise removal and Mel Filter Bank.

아울러 상기 특징 추출부에서 특징 추출시 사용되는 파라메터는 Frame window size, Window shift rate, LLD(Low level descriptor) 또는 HSF(High-level Statisticla Functions) 중 어느 하나 이상이다. In addition, the parameter used in extracting the feature by the feature extraction unit is any one or more of a frame window size, a window shift rate, a low level descriptor (LLD), or a high-level statistical function (HSF).

또한 상기 딥러닝기반 이상징후 감지시스템은 1000개 이상의 기계음 데이터로 4-fold cross validation 기법으로 딥러닝부의 학습과정에서 반복적인 교차 검증을 수행한다. In addition, the deep learning-based anomaly detection system performs repeated cross-validation in the learning process of the deep learning unit using a 4-fold cross validation technique with more than 1000 machine sound data.

본 발명에 따르면 기계설비에서 발생하는 기계음을 데이터 수집하여 딥러닝부를 기계설비 이상징후를 감지할 수 있으며, 하나의 마이크로마이크로 여러 대의 설비의 기계음 수집 가능하고 중요 또는 고가의 부품, 일반 설비의 고장 예지를 기계음만으로 판단하여 센서 부착 및 유지보수 비용 절감할 수 있는 효과가 있다. According to the present invention, the deep learning unit can detect abnormal signs of mechanical equipment by collecting data from the machine sound generated by the machine, and it is possible to collect the machine sound of several facilities with one micromicro, and predict the failure of important or expensive parts and general facilities. It has the effect of reducing the cost of sensor attachment and maintenance by judging by only the mechanical sound.

도 1은 본 발명의 일실시예에 따른 딥러닝기반 이상징후 감지시스템을 개략적으로 도시한 도면이다.
도 2는 본 발명의 일실시예에 따른 본 발명의 일실시예에 따른 딥러닝기반 이상징후 감지시스템의 내부구성을 나타내는 블럭도이다.
도 3은 본 발명의 일실시예에 따른 특징 추출부를 통한 특징 추출과정을 개략적으로 나타내는 도면이다.
도 4는 본 발명의 일실시예에 따라 특징 추출부에서 MFCC를 구하는 과정을 나타내는 도면이다.
도 5는 본 발명의 일실시예에 따른 CLA 모델의 구조를 나타내는 도면이다.
도 6은 본 발명의 일실시예에 따라 4-fold cross validation 기법으로 딥러닝부의 학습과정에서 반복적인 교차 검증을 수행하는 모습을 나타내는 도면이다. 1 is a diagram schematically illustrating a deep learning-based anomaly detection system according to an embodiment of the present invention.
2 is a block diagram showing the internal configuration of a deep learning-based anomaly detection system according to an embodiment of the present invention.
3 is a diagram schematically illustrating a feature extraction process through a feature extraction unit according to an embodiment of the present invention.
4 is a diagram illustrating a process of obtaining an MFCC in a feature extraction unit according to an embodiment of the present invention.
5 is a diagram showing the structure of a CLA model according to an embodiment of the present invention.
6 is a diagram illustrating a state in which repeated cross-validation is performed in the learning process of the deep learning unit by a 4-fold cross validation technique according to an embodiment of the present invention.

본 발명과 본 발명의 동작상의 이점 및 본 발명의 실시에 의하여 달성되는 목적을 설명하기 위하여 이하에서는 본 발명의 바람직한 실시예를 예시하고 이를 참조하여 살펴본다.In order to explain the present invention, the operational advantages of the present invention, and the objects achieved by the practice of the present invention, preferred embodiments of the present invention are exemplified below and will be described with reference to them.

먼저, 본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로서, 본 발명을 한정하려는 의도가 아니며, 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함할 수 있다. 또한 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서 상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.First, the terms used in the present application are only used to describe specific embodiments, and are not intended to limit the present invention, and the singular expression may include a plural expression unless the context clearly indicates otherwise. Also in the present application, terms such as "comprise" or "have" are intended to designate that a feature, number, step, operation, component, part, or a combination thereof described in the specification exists, but one or more other It is to be understood that this does not preclude the possibility of addition or presence of features or numbers, steps, operations, components, parts, or combinations thereof.

본 발명을 설명함에 있어서, 관련된 공지 구성 또는 기능에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우에는 그 상세한 설명은 생략한다.In describing the present invention, if it is determined that a detailed description of a related known configuration or function may obscure the gist of the present invention, the detailed description thereof will be omitted.

도 1은 본 발명의 일실시예에 따른 딥러닝기반 이상징후 감지시스템을 개략적으로 도시한 도면이며, 도 2는 본 발명의 일실시예에 따른 본 발명의 일실시예에 따른 딥러닝기반 이상징후 감지시스템의 내부구성을 나타내는 블럭도이다.1 is a diagram schematically showing a deep learning-based anomaly detection system according to an embodiment of the present invention, and FIG. 2 is a deep learning-based abnormal symptom detection system according to an embodiment of the present invention according to an embodiment of the present invention. It is a block diagram showing the internal configuration of the sensing system.

도 3은 본 발명의 일실시예에 따른 특징 추출부를 통한 특징 추출과정을 개략적으로 나타내는 도면이며, 도 4는 본 발명의 일실시예에 따라 특징 추출부에서 MFCC를 구하는 과정을 나타내는 도면이고, 도 5는 본 발명의 일실시예에 따른 CLA 모델의 구조를 나타내는 도면이며, 도 6은 본 발명의 일실시예에 따라 4-fold cross validation 기법으로 딥러닝부의 학습과정에서 반복적인 교차 검증을 수행하는 모습을 나타내는 도면이다. 3 is a diagram schematically illustrating a feature extraction process through a feature extraction unit according to an embodiment of the present invention, and FIG. 4 is a diagram illustrating a process for obtaining an MFCC by the feature extraction unit according to an embodiment of the present invention, FIG. 5 is a diagram showing the structure of a CLA model according to an embodiment of the present invention, and FIG. 6 is a 4-fold cross validation technique according to an embodiment of the present invention. Repeated cross-validation is performed in the learning process of the deep learning unit. It is a drawing showing the appearance.

도면을 참조하면 본 발명의 일실시예에 따른 딥러닝기반 이상징후 감지시스템(100)은 하나 이상의 기계설비로부터 발생되는 기계음을 수집하는 데이터 수집용 센서(10)와, 상기 데이터 수집용 센서(10)로부터 수집된 기계음을 전달받아 해당 기계음의 특징을 추출하는 특징 추출부(20)와, 기계음의 특징 데이터를 이용하며 CNN(Convolution Neural Network), LSTM RNN(Long Short Term Memory Recurrent Neural Network), Attention mechanism의 조합을 통해 심층학습을 수행하도록 구성된 딥러닝부(30)와, 상기 특징 추출부(20)로부터 추출된 특징 데이터를 전달받아 상기 딥러닝부(30)를 통해 상기 기계설비에 대한 이상징후를 감지하는 이상감지부(40) 및 기계설비의 이상 또는 설정에 따라 유지보수가 수행된 경우 유지보수 이력정보를 제공하는 유지보수입력부(50)로 이루어진다. Referring to the drawings, a deep learning-based abnormal symptom detection system 100 according to an embodiment of the present invention includes a data collection sensor 10 for collecting machine sounds generated from one or more mechanical equipment, and the data collection sensor 10 ), a feature extraction unit 20 that extracts the characteristics of the machine sound by receiving the collected machine sound, and a Convolution Neural Network (CNN), Long Short Term Memory Recurrent Neural Network (LSTM RNN), Attention using the feature data of the machine sound. The deep learning unit 30 configured to perform deep learning through a combination of mechanisms, and receiving the feature data extracted from the feature extraction unit 20, abnormal symptoms for the machine equipment through the deep learning unit 30 It consists of an abnormality detection unit 40 that detects , and a maintenance input unit 50 that provides maintenance history information when maintenance is performed according to an abnormality or setting of mechanical equipment.

여기서 상기 데이터 수집용 센서(10)는 기계설비로부터 발생되는 기계음을 수집하도록 구비되는데, 바람직하게는 마이크로 마이크가 적용된다. Here, the sensor 10 for data collection is provided to collect mechanical sounds generated from mechanical equipment, and preferably, a micro-microphone is applied.

본 발명에 따른 데이터 수집용 센서(10)는 하나 또는 여러 대의 기계설비로부터 발생되는 기계음을 수집하는데, 필요에 따라 기계음이 아닌 진동음이나 진동 자체를 수집하는 센서가 적용될 수도 있다. The sensor 10 for data collection according to the present invention collects mechanical sounds generated from one or several mechanical equipment, and if necessary, a sensor that collects vibration sound or vibration itself rather than mechanical sound may be applied.

또한 상기 특징 추출부(20)는 상기 데이터 수집용 센서(10)로부터 수집된 기계음을 전달받아 해당 기계음의 특징을 추출하도록 구비되는데, 이러한 특징 추출부(20)에서 추출하는 특징 데이터는 다양한 형태로 형성될 수 있으나, 본 발명의 일실시예에 따라 도 3에 도시된 바와 같이 상기 데이터 수집용 센서(10)로부터 수집된 기계음을 전달받아 해당 기계음을 주파수 영역으로 변환하고 잡음 제거 및 멜 필터 뱅크(Mel Filter Bank) 등의 필터링을 통해 특징을 추출할 수 있다. In addition, the feature extracting unit 20 is provided to receive the machine sound collected from the data collection sensor 10 and extract the features of the corresponding machine sound, and the feature data extracted by the feature extracting unit 20 is in various forms. may be formed, but according to an embodiment of the present invention, as shown in FIG. 3 , the mechanical sound collected from the data collection sensor 10 is received and the corresponding mechanical sound is converted into a frequency domain, and the noise removal and Mel filter bank ( Mel Filter Bank), etc., can be used to extract features.

특징 추출부(20)에서 추출하는 특징은 딥러닝부(30)에서 심층 학습을 수행하거나 이상감지부(40)에서 이상을 감지하는데 유용한 성분을 기계음으로부터 추출하는 과정이다. The feature extracted by the feature extraction unit 20 is a process of performing deep learning in the deep learning unit 30 or extracting a component useful for detecting an abnormality in the abnormality detecting unit 40 from the machine sound.

상기 특징 추출부(20)에서 특징 추출시 사용되는 파라메터는 Frame window size, Window shift rate, LLD(Low level descriptor) 또는 HSF(High-level Statisticla Functions) 중 어느 하나 이상으로 구성될 수 있다. The parameter used for extracting the feature by the feature extractor 20 may include any one or more of a frame window size, a window shift rate, a low level descriptor (LLD), and a high-level statistical function (HSF).

여기서 LLD로는 mfcc, log energy, filter bank energy 등이 해당될 수 있고, HSF로는 mean, variance, max, min, mode, quant. 등이 해당될 수 있다. Here, as LLD, mfcc, log energy, filter bank energy, etc. may correspond, and as HSF, mean, variance, max, min, mode, quant. etc. may apply.

이러한 특징 데이터의 추출은 일반적으로 정보의 압축, 차원 감소 과정과 관련된다. 특징 데이터 추출의 주요 연구 분야는 다양한 잡음환경/화자/채널 변이에 강인한(robust) 특징, 시간적인 변화를 잘 표현하는 특징의 추출이다. The extraction of such feature data is generally related to information compression and dimensionality reduction processes. The main research area of feature data extraction is the extraction of features that are robust to various noise environments/speakers/channel variations and that express temporal changes well.

흔히 사용되는 특징 추출 과정에서 청각특성을 반영한 것으로는 달팽이관의 주파수 응답을 응용한 필터뱅크 분석, mel 또는 Bark 척도 단위의 중심주파수 배치, 주파수에 따른 대역폭의 증가, 프리엠퍼시스 필터 등이 사용된다. As a reflection of auditory characteristics in the commonly used feature extraction process, filter bank analysis applying the frequency response of the cochlea, center frequency arrangement in mel or Bark scale units, increase in bandwidth according to frequency, and pre-emphasis filter are used.

Robustness를 향상시키기 위한 방법으로 가장 널리 사용되는 것은 convolutive 채널의 영향을 줄이기 위한 cepstral mean subtraction(CMS)이다. 음성신호의 동적 특성을 반영하기 위하여 켑스트럼의 1차(delta), 2차 미분값을 사용한다. The most widely used method for improving robustness is cepstral mean subtraction (CMS) to reduce the influence of convolutive channels. In order to reflect the dynamic characteristics of the speech signal, the first (delta) and second derivative values of the cepstrum are used.

CMS 및 미분은 시간축 방향의 필터링으로 생각할 수 있으며 시간축 방향으로의 temporally uncorrelated 특징 벡터를 얻는 과정이다. 필터뱅크 계수로부터 켑스트럼을 얻는 과정은 필터뱅크 계수를 uncorrelated로 바꾸기 위한 orthogonal transform으로 생각할 수 있다. CMS and differentiation can be thought of as filtering in the time axis direction, and are processes of obtaining temporally uncorrelated feature vectors in the time axis direction. The process of obtaining the cepstrum from the filterbank coefficients can be thought of as an orthogonal transform to transform the filterbank coefficients into uncorrelated.

본 발명에 따른 특징 추출부(20)에서 추출하는 특징으로 Mel frequency cepstral coefficient(MFCC)가 있다.There is a Mel frequency cepstral coefficient (MFCC) as a feature extracted by the feature extraction unit 20 according to the present invention.

여기서 MFCC를 구하는 방법을 간단히 설명하면, 음성신호는 anti-aliasing filter를 거친 다음, A/D변환을 거쳐서 디지털 신호 x(n)로 변환된다. 디지털 음성신호는 고대역 통과 특성을 갖는 디지털 프리엠퍼시스 필터를 거친다. Here, the method of obtaining the MFCC is briefly described. The audio signal is converted into a digital signal x(n) through an anti-aliasing filter and then A/D conversion. A digital audio signal passes through a digital pre-emphasis filter having a high-pass characteristic.

이 필터를 사용하는 이유는 첫째로 인간의 외이/중이의 주파수 특성을 모델링하기 위하여 고대역 필터링을 한다. 이는 입술에서의 방사에 의하여 20 dB/decade로 감쇄되는 것을 보상하게 되어 음성으로부터 성도 특성만을 얻게 된다.The reason for using this filter is first, high-band filtering is performed to model the frequency characteristics of the human outer/middle ear. This compensates for the 20 dB/decade attenuation by radiation from the lips, so that only the vocal tract characteristics are obtained from the voice.

둘째 청각시스템이 1 kHz이상의 스펙트럼 영역에 대하여 민감하다는 사실을 어느 정도 보상하게 된다. PLP 특징추출에서는 인간 청각기관의 주파수 특성인 equal-loudness curve를 직접 모델링에 사용한다. 프리엠퍼시스 필터의 특성 H(z)는 다음과 같으며, a는 0.95-0.98 범위의 값을 사용한다.Second, it compensates somewhat for the fact that the auditory system is sensitive to the spectral region above 1 kHz. In PLP feature extraction, the equal-loudness curve, which is the frequency characteristic of the human auditory organ, is used for direct modeling. The characteristic H(z) of the pre-emphasis filter is as follows, and a value in the range of 0.95-0.98 is used.

프리엠퍼시스된 신호는 해밍 윈도우를 씌워서 블록 단위의 프레임으로 나누어진다. 이후부터의 처리는 모두 프레임 단위로 이루어진다. 프레임의 크기는 보통 20-30 ms이며 프레임 이동은 10 ms가 흔히 사용된다. 한 프레임의 음성신호는 FFT를 이용하여 주파수 영역으로 변환된다. 주파수 대역을 여러개의 필터뱅크로 나누고 각 뱅크에서의 에너지를 구한다. 밴드 에너지에 로그를 취한 후 discrete cosine transform (DCT)를 하면 최종적인 MFCC가 얻어진다. The pre-emphasized signal is divided into frames in block units by covering the Hamming window. All subsequent processing is performed on a frame-by-frame basis. The size of a frame is usually 20-30 ms, and 10 ms is commonly used for frame movement. A voice signal of one frame is converted into a frequency domain using FFT. Divide the frequency band into several filter banks and find the energy in each bank. After taking the logarithm of the band energy, the final MFCC is obtained by performing discrete cosine transform (DCT).

필터뱅크의 모양 및 중심주파수의 설정 방법은 귀의 청각적 특성(달팽이관에서의 주파수 특성)을 고려하여 결정된다. 도 4에서는 삼각형 모양의 필터를 사용하였으며 중심주파수는 1kHz까지는 선형적으로 위치하고 그 이상에서는 mel 척도로 분포하는 20개의 뱅크로 이루어져 있다. The shape of the filter bank and the method of setting the center frequency are determined in consideration of the auditory characteristics of the ear (frequency characteristics in the cochlea). In Fig. 4, a triangular-shaped filter is used, and the center frequency is linearly located up to 1 kHz and consists of 20 banks distributed on a mel scale above that.

한편 상기 딥러닝부(30)는 기계음의 특징 데이터를 이용하며 CNN(Convolution Neural Network), LSTM RNN(Long Short Term Memory Recurrent Neural Network), Attention mechanism의 조합을 통해 심층학습을 수행하도록 구성된다. Meanwhile, the deep learning unit 30 is configured to perform deep learning through a combination of a Convolution Neural Network (CNN), a Long Short Term Memory Recurrent Neural Network (RNN), and an Attention mechanism using the feature data of the machine sound.

최근에 센싱 기술과 컴퓨터 연산 능력이 향상되면서 양질의 데이터를 취득할 수 있게 되어 이를 바탕으로 딥러닝기반 고장진단 연구가 활발히 진행되고 있는데, 예를 들어, 진동신호를 기반으로 합성곱 신경망(Convolutional Neural Network, CNN)을 훈련시켜 전자기기의 고장진단을 시도한 연구, 회전체의 진동신호를 이미지화 한 후에 딥 빌리프 네트워크(Deep Belief Network, DBN)를 학습시켜 고장진단을 수행하는 연구 등이 있다. Recently, as sensing technology and computing power have improved, high-quality data can be acquired. Based on this, deep learning-based fault diagnosis research is being actively conducted. For example, convolutional neural networks based on vibration signals Network (CNN) training to try to diagnose electronic device failures, and research on performing fault diagnosis by learning Deep Belief Network (DBN) after imaging the vibration signal of a rotating body.

고장진단 기술에 자주 사용되고 있는 CNN의 경우, 이미지의 분석에 가장 많이 사용되는 네트워크이다. 그렇지만 일반적으로 CNN은 계층적이고, RNN은 순차적이라고 표현을 하고 자연어 처리와 같은 시퀀스 데이터를 사용하는 모델에서는 RNN이 우수하다고 여겨진다. 그러므로 시퀀스 데이터인 베어링으로부터 얻어진 신호를 분석하여 기계의 고장 유무를 판단하는 작업에는 RNN이 더 적합하다고 볼 수 있다.CNN, which is frequently used in troubleshooting technology, is the most used network for image analysis. However, in general, CNN is hierarchical, RNN is expressed as sequential, and RNN is considered to be superior in models that use sequence data such as natural language processing. Therefore, it can be seen that RNN is more suitable for the task of analyzing the signal obtained from the bearing, which is the sequence data, to determine whether there is a machine failure.

또한 주변 소음이 섞여 있을 수 있는 공장의 환경에서는 얻어진 신호의 처리에 더 장기적인 판단을 필요로 한다. Attention mechanism은 이러한 기능을 수행하도록 연구되고 있는 아키텍쳐이다. Also, in a factory environment where ambient noise may be mixed, the processing of the obtained signal requires longer-term judgment. Attention mechanism is an architecture that is being studied to perform this function.

어느 한 연구에서 Attention mechanism은 인코더와 디코더로 이루어져 있으며, 인코더에서는 LSTM의 히든 스테이트를 행렬로 만들고, 디코더에서는 히든 스테이트중 중요한 스테이트를 강조한다. In one study, the attention mechanism consists of an encoder and a decoder. In the encoder, the hidden states of the LSTM are made into a matrix, and in the decoder, important states among the hidden states are emphasized.

이러한 방법은 일정 거리이상의 중요한 출력벡터에 집중할 수 있게 만들어, neural machine translation (NMT)에서 훌륭한 성능을 보여주었다.This method made it possible to focus on important output vectors over a certain distance, showing excellent performance in neural machine translation (NMT).

그러므로 본 발명에서는 CNN, LSTM RNN, attention mechanism을 조합한 모델(CLA)을 제안한다. CLA모델은 국소적인 특징의 파악에 유리한 CNN, 순차적인 데이터 처리에 유리한 LSTM RNN, 중요한 정보에 집중할 수 있게 하는 어텐션 메커니즘의 장점을 이용하여 단순히 CNN을 사용한 모델에 비해 보다 우수한 성능을 유도할 수 있게 된다. Therefore, the present invention proposes a model (CLA) that combines CNN, LSTM RNN, and attention mechanism. The CLA model utilizes the advantages of CNN, which is advantageous for local feature identification, LSTM RNN, which is advantageous for sequential data processing, and the attention mechanism, which allows you to focus on important information, to induce better performance than a model using simple CNN. do.

본 발명에서 사용된 CLA 모델은 기계음을 특징 추출 과정에서 80차원의 mel spectrogram으로 변환하고 3초 단위의 300 프레임셋이 모델에 입력된다. 모델은 CNN레이어로 스펙트럼의 공간적(spatial)인 특성을 추출하고, LSTM레이어로 시간적(temporal)인 정보를 추출한후 Attention 레이어로 지역적인 특성을 집중하여 정보를 수집한다. The CLA model used in the present invention is converted into an 80-dimensional mel spectrogram during the feature extraction process of the machine sound, and 300 framesets in units of 3 seconds are input to the model. The model extracts the spatial characteristics of the spectrum with the CNN layer, the temporal information with the LSTM layer, and then collects the information by concentrating the local characteristics with the attention layer.

출력 레이어에서는 이상유무를 판단하기 위하여 0또는 1로 출력하도록 설계한다. CLA 모델의 일실시예는 도 5에 도시된 바와 같고, 각 레이어의 사양은 [표 1]에 나타나 있다. In the output layer, it is designed to output as 0 or 1 to determine whether there is an abnormality. An embodiment of the CLA model is as shown in FIG. 5, and the specification of each layer is shown in [Table 1].

본 CLA 모델은 입력이 들어오면 순서대로 CNN layer, LSTM layer, Attention Layer를 거친 후 최종 출력(각각 도 5의 (1), (2), (3), (4))을 계산한다. 최종 출력은 0~1 사이의 값으로 기계에 결함이 있는지에 대한 확률 값이다. This CLA model calculates the final output ((1), (2), (3), (4) of FIG. 5, respectively) after going through the CNN layer, LSTM layer, and attention layer in order when an input is received. The final output is a value between 0 and 1, which is a probability value of whether the machine is defective.

본 모델에 입력되는 특징은 80 mel-band spectrogram으로, 20ms의 사이즈의 윈도우를 10ms씩 이동시키며 생성하였다. 기계로부터 얻어진 오디오는 3초 길이이며, 16000Hz의 샘플링 레이트를 가지고 있다. 그러므로 최종적으로 생성된 스펙트로그램은 300프레임의 time step을 가지며, 각 프레임의 dimension은 80이다. The feature input to this model is an 80 mel-band spectrogram, which was created by moving a 20ms-sized window by 10ms. The audio from the machine is 3 seconds long and has a sampling rate of 16000 Hz. Therefore, the finally generated spectrogram has a time step of 300 frames, and the dimension of each frame is 80.

CNN layer는 Convolution 연산과 max pooling을 두번씩 반복하는 레이어이다. 본 모델에 사용된 CNN layer는 입력된 spectrogram를 분석하여 조금 더 고장 감지에 적합한 특징을 추출하기 위한 작업을 하는 Layer이다. CNN layer is a layer that repeats convolution operation and max pooling twice. The CNN layer used in this model is a layer that analyzes the input spectrogram and works to extract features more suitable for fault detection.

이는 convolution 연산과 max pooling을 통해 이루어지며, convolution 연산은 입력된 데이터의 국소적인 부분에서 연산을 통해 필요한 특징을 추출하는 과정이다. Max pooling 연산은 데이터의 차원을 축소하며 동시에 필요한 특징만을 남겨놓는 과정으로 이후에 진행될 LSTM및 어텐션 레이어의 계산 복잡도를 낮추는 역할을 한다.This is done through convolution operation and max pooling, and the convolution operation is a process of extracting necessary features through operation from a local part of the input data. The Max pooling operation reduces the dimension of data while leaving only necessary features, and serves to lower the computational complexity of the LSTM and attention layer that will be performed later.

LSTM layer에서 LSTM은 Recurrent Neural Network(RNN) 아키텍처의 한 종류로 일반적으로 RNN을 지칭할 경우, LSTM구조로 이루어져 있을 정도로 많이 사용되는 구조이다. In the LSTM layer, LSTM is a type of Recurrent Neural Network (RNN) architecture, and in general, when referring to RNN, it is a structure that is used so much that it consists of an LSTM structure.

본 CLA 모델에서는 CNN layer를 통해 얻어진 특징벡터에서 시간적 정보를 얻는데 사용된다. 이는 RNN이 가지는 시퀀스 정보 처리 능력을 사용하여 장거리의 정보를 처리하여야 알 수 있는 노이즈나 고장으로 인한 소리를 판별하기 위해서이다. In this CLA model, it is used to obtain temporal information from the feature vectors obtained through the CNN layer. This is to discriminate the noise caused by the failure or the noise that can only be known when long-distance information is processed using the sequence information processing capability of the RNN.

본 CLA 모델에서는 64개의 셀을 가지는 LSTM을 2층으로 사용하였으며, 순방향으로 데이터를 입력하여 사용하였다.In this CLA model, an LSTM having 64 cells was used as two layers, and data was input in the forward direction.

어텐션 레이어는 어텐션 메커니즘을 활용하여 만들어진 레이어로 CNN및 LSTM을 통하여 얻어진 정보 벡터에서 최종적인 판단을 하기위한 컨텍스트 벡터를 만드는 용도로 사용된다. The attention layer is a layer created by utilizing the attention mechanism and is used to create a context vector for making a final judgment from the information vector obtained through CNN and LSTM.

어텐션 메커니즘의 입력은 LSTM레이어의 출력 H=h₁, h₂, h₃, …, hT, hi=(p₁, p₂, …, p_C)이며 웨이트 파라메터 q=(q₁, q₂, q₃, …, q_C)를 사용하여 출력을 계산한다. 위의 표현식에서 T와 C는 각각 LSTM 입력의 Timestep 길이, LSTM의 셀 개수이며 어텐션 레이어의 출력 o는 아래와 같다.The input of the attention mechanism is the output of the LSTM layer H=h ₁ , h ₂ , h ₃ , … , hT, hi=(p ₁ , p ₂ , …, p _C ) and calculate the output using the weight parameter q=(q ₁ , q ₂ , q ₃ , …, q _{C ).} In the above expression, T and C are the timestep length of the LSTM input and the number of cells in the LSTM, respectively, and the output o of the attention layer is as follows.

위의 식에서 ·은 두 벡터간의 dot production연산을 나타내며, *은 스칼라 값과 벡터간의 곱셈을 나타낸다. 위의 식에서 얻어진 o는 C크기의 벡터이며 출력 레이어의 입력으로 사용된다.In the above equation, · represents the dot production operation between two vectors, and * represents the multiplication between a scalar value and a vector. The o obtained in the above equation is a vector of size C and is used as the input of the output layer.

출력 레이어는 한 개의 유닛으로 구성되어 있는 fully connected layer(FCL)이다. 입력된 벡터를 최종적인 확률 값으로 바꾸는 레이어이다. 활성함수로 sigmoid 함수를 사용하여 0~1 사이의 값으로 출력되며, 최종적인 판단에서는 0.5이상의 값일 경우, 입력된 오디오가 녹음된 기계에는 고장이 있다고 판단하며, 0.5 미만의 값일 경우, 입력된 오디오가 녹음된 기계에는 고장이 없다고 판단하였다.The output layer is a fully connected layer (FCL) composed of one unit. It is a layer that converts the input vector into the final probability value. It is output as a value between 0 and 1 using the sigmoid function as the activation function. In the final judgment, if the value is over 0.5, it is judged that there is a malfunction in the machine where the input audio was recorded. If the value is less than 0.5, the input audio It was judged that there was no malfunction in the recorded machine.

[실시예 1][Example 1]

본 실험에 사용된 데이터는 각각 CNC, CWRU, WP데이터셋이며, 추가적으로 노이즈 데이터셋이 사용되었다. The data used in this experiment are CNC, CWRU, and WP datasets, respectively, and a noise dataset was additionally used.

-CNC-CNC

CNC데이터 셋은 CNC기계로부터 녹음된 데이터의 모음이다. 전체 데이터 수는 177개이고 정상음의 개수는 91개, 이상음의 개수는 86개이다.A CNC data set is a collection of recorded data from a CNC machine. The total number of data is 177, the number of normal sounds is 91, and the number of abnormal sounds is 86.

-CWRU-CWRU

CWRU데이터셋은The CWRU dataset is

https://csegroups.case.edu/bearingdatacenter/pages/download-data-file에서 제공하는 Normal Baseline Data, 12k Drive End Bearing Fault Data, 48k Drive End Bearing Fault Data, Fan-End Bearing Fault Data 중 48000 sample per second로 기록된 48k drive end bearing fault data를 제외하고 사용하여 만들어진 데이터셋이다. 48000 samples per of Normal Baseline Data, 12k Drive End Bearing Fault Data, 48k Drive End Bearing Fault Data, and Fan-End Bearing Fault Data provided at https://csegroups.case.edu/bearingdatacenter/pages/download-data-file It is a data set created by excluding the 48k drive end bearing fault data recorded as the second.

해당 사이트에서 제공하는 데이터의 포맷은 Matlab에서 사용하기 위한 Matlab 포맷으로 되어 있으며, 모터의 fan과 drive end위치에서 얻어진 데이터가 들어있다. The format of data provided by this site is in Matlab format for use in Matlab, and includes data obtained from the position of the motor fan and drive end.

파이썬의 scipy 모듈에서 제공하는 함수를 사용하여 데이터 파일을 읽어 딕셔너리 구조로 구성할 수 있으며, 읽어진 딕셔너리의 구조는 다음과 같다.Using the function provided by the scipy module of Python, a data file can be read and composed into a dictionary structure. The structure of the read dictionary is as follows.

dict_keys(['__header__', '__version__', '__globals__', 'X097_DE_time', 'X097_FE_time', 'X097RPM'])dict_keys(['__header__', '__version__', '__globals__', 'X097_DE_time', 'X097_FE_time', 'X097RPM'])

데이터 딕셔너리의 header, version, globals 키에는 matlab 파일의 기본구조 및 생성날짜 등을 기록한 정보가 저장되어있으며, DE_time, FE_time 키에는 각각 모터의 drive end, fan 위치에서 기록된 샘플이 저장되어 있다. 본 실험에서는 DE_time키의 데이터를 사용하였다. The header, version, and globals keys of the data dictionary store the basic structure and creation date of the matlab file, and the DE_time and FE_time keys contain samples recorded at the drive end and fan positions of the motor, respectively. In this experiment, the data of the DE_time key was used.

총 데이터의 개수는 333개이며 정상음의 개수는 45개, 이상음의 개수는 288개이다.The total number of data is 333, the number of normal sounds is 45, and the number of abnormal sounds is 288.

-WP-WP

WP데이터셋은 물펌프의 소리를 녹음하여 만들어진 데이터셋으로, 정상소리는 물이 분사되는 소리이고 비정상소리는 물이 없어 분사되지 않고 분사기계만 돌아가는 소리이다. 전체 데이터의 수는 1124개이고 정상음의 개수는 907개이며 이상음의 개수는 217개이다.The WP dataset is a dataset created by recording the sound of a water pump. The normal sound is the sound of water being sprayed, and the abnormal sound is the sound of the spraying machine running without water being sprayed. The total number of data is 1124, the number of normal sounds is 907, and the number of abnormal sounds is 217.

-Noise-Noise

노이즈 데이터셋은 위의 세 데이터셋에 추가되어 데이터의 진단 난이도를 높이는데 사용되었으며, 데이터 셋은 freesound.org 사이트에서 다운받은 배경음 소리로 구성되어 있다.The noise dataset was added to the above three datasets and used to increase the diagnostic difficulty of the data, and the dataset consists of background sound downloaded from the freesound.org site.

배경음 소리는 총 7개의 파일로 이 중 4개의 파일은 음식점, 카페, 통화로부터 얻어진 사람들의 말소리이다. 나머지 3개의 파일은 길거리에서 차량의 소리를 녹음한 파일로 자동차 소리로 구성되어 있다. 각 파일은 약 2분에서 5분 사이의 길이를 갖고 있다.The background sound is a total of 7 files, of which 4 files are the speech sounds of people obtained from restaurants, cafes, and phone calls. The other three files are files that record the sound of a vehicle on the street, and it consists of the sound of a car. Each file is about 2 to 5 minutes long.

본 실험에서는 CNN만을 사용한 모델과 CLA모델 간의 성능을 비교한다. CNN모델은 4층의 CNN으로 구성되어 있고, CNN뒤에는 2층의 FCL이 사용되었다. 각 데이터는 랜덤하게 노이즈가 섞여진 후, CNN, CLA모델의 훈련 및 평가에 사용되었다. 이때 훈련과 평가용 데이터의 비율은 8대2로 설정되었으며, 100epoch번 반복되었다. In this experiment, we compare the performance between the model using only CNN and the CLA model. The CNN model consists of 4 layers of CNN, and 2 layers of FCL are used behind the CNN. Each data was randomly mixed with noise and then used for training and evaluation of CNN and CLA models. At this time, the ratio of training and evaluation data was set to 8:2, and it was repeated 100 epochs.

성능지표로는 아래와 같이 precision, recall, 그리고 F1-score가 사용되었으며, 다음과 같이 계산된다.As performance indicators, precision, recall, and F1-score were used as shown below, and are calculated as follows.

True Positive(TP) : 실제 True인 정답을 True라고 예측 (정답)True Positive (TP): Predicting the correct answer that is actually True as True (correct answer)

False Positive(FP) : 실제 False인 정답을 True라고 예측 (오답)False Positive (FP): Predicting a true false answer as true (wrong answer)

False Negative(FN) : 실제 True인 정답을 False라고 예측 (오답)False Negative (FN): Predict the correct answer that is true as False (wrong answer)

True Negative(TN) : 실제 False인 정답을 False라고 예측 (정답)True Negative (TN): Predicting a correct answer that is actually False as False (correct answer)

CLA모델과 CNN모델의 실험 결과를 [표 2]에 나타내었다. [Table 2] shows the experimental results of the CLA model and the CNN model.

위의 [표 2]에서 볼 수 있는 것과 같이 CLA모델은 CNN모델보다 좋은 성능을 보여주고 있다. 이는 어텐션 메커니즘 및 LSTM이 사용되지 않은 기존의 CNN모델의 경우, 노이즈가 섞여 있는 데이터의 분석에 약하기 때문으로 판단된다. As can be seen in [Table 2] above, the CLA model shows better performance than the CNN model. This is considered to be because the existing CNN model, which does not use the attention mechanism and LSTM, is weak in analyzing data mixed with noise.

어텐션 메커니즘과 LSTM이 가지는 장기적인 문맥을 파악하는 능력은 긴 시간의 오디오에 섞여 있는 노이즈를 구분하고, 고장의 유무를 판단하는데 중요한 특징만을 추출하는 능력을 가졌기 때문이다. The ability to understand the long-term context of the attention mechanism and LSTM is because it has the ability to distinguish noise mixed in audio for a long time and to extract only important features to determine the presence or absence of a malfunction.

이 때문에 CLA모델은 노이즈가 섞여있는 CNC, CWRU, WP데이터 셋에 대해서 모두 CNN보다 각각 2.81%, 0.73%, 3.64% 좋아진 성능을 얻을 수 있었다.For this reason, the CLA model was able to get 2.81%, 0.73%, and 3.64% better performance than CNN for CNC, CWRU, and WP data sets mixed with noise, respectively.

본 발명에서는 베어링 고장 및 기계의 이상 등 여러가지 기계의 고장 유무를 판단하는 CLA모델을 딥러닝부(30)에 적용한다. CLA모델은 CNN을 기본 베이스로 하여 장기적인 문맥을 파악할 수 있는 LSTM과 Attention Mechanism을 추가한 모델로, 기계의 고장 유무를 실시간으로 판단하는 환경에서 얻어진 오디오 녹음을 입력으로 사용한다. In the present invention, the CLA model for determining the presence or absence of failure of various machines such as bearing failure and machine abnormality is applied to the deep learning unit 30 . The CLA model is a model that adds LSTM and Attention Mechanism that can identify long-term contexts based on CNN. It uses audio recordings obtained from an environment that determines whether a machine is malfunctioning in real time as an input.

이 때, 기계의 주변상황에 따라 사람의 말소리나 다른 기계음과 같은 노이즈가 섞일 수 있는데 기존 CNN모델의 경우 CNN의 특성상 장기적인 문맥파악능력이 약하므로 노이즈에 대한 구별이 어려울 수 있다. 상기 CLA모델은 LSTM과 Attention mechanism의 장기적인 문맥파악능력을 활용하여 노이즈가 섞여 있는 환경에서도 좋은 성능을 기대할 수 있는 모델이다. At this time, noise such as human speech or other machine sounds may be mixed depending on the surrounding conditions of the machine. In the case of the existing CNN model, it may be difficult to distinguish the noise because the long-term ability to grasp the context is weak due to the nature of the CNN. The CLA model is a model that can expect good performance even in an environment in which noise is mixed by utilizing the long-term contextual ability of LSTM and attention mechanism.

CNC, CWRU, WP 데이터에 사람의 말소리와 교통소리를 노이즈로 섞어 데이터셋을 만든 후, 훈련과 평가를 진행하였을 때 CNN으로 구성된 모델에 비해 좋은 성능을 보여주었다. 이를 통해 LSTM과 attention mechanism이 기계의 진단에 불필요한 노이즈를 구분하는데 도움이 될 수 있는 것을 알 수 있다. After creating a dataset by mixing human speech and traffic sounds with CNC, CWRU, and WP data as noise, training and evaluation were performed, and it showed good performance compared to the CNN model. From this, it can be seen that LSTM and attention mechanism can help to distinguish unnecessary noise for machine diagnosis.

또한 본 발명에 따른 상기 딥러닝기반 이상징후 감지시스템(100)에는 기계설비의 이상 또는 설정에 따라 유지보수가 수행된 경우 유지보수 이력정보를 제공하는 유지보수입력부(50)가 포함되는데, 이러한 유지보수입력부(50)는 기계설비를 관리하는 관리자가 특징 추출부(20) 또는 딥러닝부(30)와 데이터 통신가능하도록 연결되는 단말기 등을 통해 해당 유지보수 이력정보를 입력하도록 구성될 수 있다. In addition, the deep learning-based abnormal symptom detection system 100 according to the present invention includes a maintenance input unit 50 that provides maintenance history information when maintenance is performed according to an abnormality or setting of mechanical equipment, such maintenance The maintenance input unit 50 may be configured so that an administrator who manages the mechanical equipment inputs the corresponding maintenance history information through a terminal connected to the feature extraction unit 20 or the deep learning unit 30 to enable data communication.

이는 본 발명에 따른 딥러닝부(30)가 심층 학습을 수행할 때 유지보수 이력정보의 전, 후 기계음 특징 데이터를 기초로 심층학습을 수행하는 것이 가장 효율적이기 때문이다. This is because, when the deep learning unit 30 according to the present invention performs deep learning, it is most efficient to perform deep learning based on machine sound characteristic data before and after maintenance history information.

이에 따라 데이터 수집용 센서(10)가 수집한 전체 기계음 데이터를 학습하는 것이 아닌 유지보수 이력정보를 통해 해당 유지보수의 전, 후 기계음 데이터를 중심으로 심층 학습이 이루어짐이 바람직하다. Accordingly, it is preferable that deep learning be performed mainly on machine sound data before and after the maintenance through maintenance history information, rather than learning the entire machine sound data collected by the sensor 10 for data collection.

한편 본 발명에 따른 딥러닝기반 이상징후 감지시스템(100)의 딥러닝부(30)는 도 6에 도시된 바와 같이 1000개 이상의 기계음 데이터를 4-fold cross validation 기법으로 학습과정에서 반복적인 교차 검증을 수행한다. On the other hand, the deep learning unit 30 of the deep learning-based anomaly detection system 100 according to the present invention repeatedly cross-validates 1000 or more machine sound data in a 4-fold cross validation technique in the learning process as shown in FIG. carry out

이러한 반복적인 교차 검증으로 딥러닝 모델 구조와 파라메타는 계속적으로 보완이 이루어져 본 발명에 따른 시스템의 신뢰도는 보다 향상될 수 있게 된다. With such repeated cross-validation, the deep learning model structure and parameters are continuously supplemented, so that the reliability of the system according to the present invention can be improved.

이와 같이 본 발명은 도면에 도시된 일실시예를 참고로 설명되었으나, 이는 예시적인 것에 불과하며, 본 기술 분야의 통상의 지식을 가진 자라면 이로부터 다양한 변형 및 균등한 타 실시예가 가능하다는 점을 이해할 것이다. As described above, the present invention has been described with reference to one embodiment shown in the drawings, which is merely exemplary, and those of ordinary skill in the art will appreciate that various modifications and equivalent other embodiments are possible therefrom. will understand

따라서, 본 발명의 진정한 기술적 보호범위는 첨부된 특허청구범위의 기술적 사상에 의해 정해져야 할 것이다.Accordingly, the true technical protection scope of the present invention should be defined by the technical spirit of the appended claims.

10 : 데이터 수집용 센서 20 : 특징 추출부
30 : 딥러닝부 40 : 이상감지부
50 : 유지보수입력부
100 : 딥러닝기반 이상징후 감지시스템10: sensor for data collection 20: feature extraction unit
30: deep learning unit 40: abnormality detection unit
50: maintenance input unit
100: Deep learning-based anomaly detection system

Claims

In the deep learning-based abnormal symptom detection system using the machine sound of mechanical equipment,
The deep learning-based anomaly detection system 100 is
a sensor 10 for collecting data for collecting mechanical sounds generated from one or more mechanical equipment;
a feature extracting unit 20 for receiving the machine sound collected from the data collection sensor 10 and extracting features of the corresponding machine sound;
A deep learning unit 30 configured to perform deep learning through a combination of a Convolution Neural Network (CNN), a Long Short Term Memory Recurrent Neural Network (LSTM RNN), and an Attention mechanism using machine sound feature data;
Machine equipment comprising an abnormality detection unit 40 that receives the feature data extracted from the feature extraction unit 20 and detects abnormal symptoms of the mechanical equipment through the deep learning unit 30 Deep learning-based anomaly detection system using machine sound.

According to claim 1,
The data collection sensor 10 is
Deep learning-based anomaly detection system using machine sound of mechanical equipment, characterized in that it is a micro-microphone that collects mechanical sounds generated from mechanical equipment.

According to claim 1,
The deep learning-based anomaly detection system 100 is
A maintenance input unit 50 that provides maintenance history information when maintenance is performed according to an abnormality or setting of mechanical equipment is further included,
The deep learning unit 30 is a deep learning-based abnormal symptom detection system using machine sound of machine equipment, characterized in that it performs deep learning through machine sound feature data before and after maintenance is performed.

According to claim 1,
The feature extraction unit 20
Using the mechanical sound of mechanical equipment, characterized in that receiving the machine sound collected from the data collection sensor 10, converting the machine sound into a frequency domain, and extracting features through noise removal and Mel Filter Bank Deep learning-based anomaly detection system.

5. The method of claim 4,
The parameters used for feature extraction in the feature extraction unit 20 are
A deep learning-based anomaly detection system using a machine sound of a mechanical facility, characterized in that it is at least one of frame window size, window shift rate, LLD (Low level descriptor) or HSF (High-level Statisticla Functions).

According to claim 1,
The deep learning-based anomaly detection system 100 is
Deep learning-based abnormal symptom detection system using machine sound of mechanical equipment, characterized in that it repeatedly cross-validates in the learning process of the deep learning unit 30 with a 4-fold cross validation technique with more than 1000 machine sound data.