WO2021137320A1

WO2021137320A1 - Apparatus for classifying abnormal data using artificial neural network

Info

Publication number: WO2021137320A1
Application number: PCT/KR2019/018788
Authority: WO
Inventors: 박재한; 김철운; 유재홍; 차길환
Original assignee: 주식회사 알고리고
Priority date: 2019-12-31
Filing date: 2019-12-31
Publication date: 2021-07-08

Abstract

The present invention relates to an apparatus and method for classifying abnormal data using an artificial neural network. To this end, the method may provide: a generation vector step of generating a generation vector by a generation module, which is a component of an abnormal data classification module trained to generate a normal vector, which is a multidimensional vector of normal time series distribution data on the basis of a latent variable; a classification target vector step of receiving a classification target vector, which is a multidimensional vector of classification target data that is a target for abnormal data classification; an abnormal data score output step of outputting an abnormal data score, which is a loss value on the basis of a difference between the generation vector and the classification target vector; and a latent variable adjustment step of adjusting a latent variable in a direction in which the abnormal data score becomes lower.

Description

Abnormal data classification device using artificial neural network

The present invention relates to an apparatus and method for classifying abnormal data using an artificial neural network.

ConvNet (Convolutional Neural Network) can be used to develop a model that classifies the posture of a seated user based on the time series matrix-type time series pressure distribution data generated from the smart seat composed of the mxn pressure sensor matrix. . 1 is a distribution diagram illustrating pressure distribution data, and FIG. 2 is a graph illustrating a change in pressure magnitude with time. As shown in FIGS. 1 and 2 , the pressure distribution data as shown in FIG. 1 may be generated from a plurality of pressure sensors configured in a matrix form, and each pressure sensor changes the pressure magnitude in a time series as shown in FIG. 2 with respect to the time dimension can be represented to be sensed. 3 is an exemplary diagram of time series pressure distribution data. As shown in FIG. 3 , the time series pressure distribution data may refer to a plurality of continuous pressure distribution data in the time dimension generated based on the time series pressure magnitude change generated by each pressure sensor as shown in FIG. 2 .

When a ConvNet learned based on pressure distribution data is used for posture classification, the corresponding ConvNet may be supervised learning based on pressure distribution data labeled or tagged with at least one posture. One of the important things in supervised learning to improve the accuracy of ConvNet is the quantity and quality of the data being trained. Therefore, it is clear that the posture classification accuracy of ConvNet improves as the ConvNet is trained by collecting as much high-quality data as possible. need.

However, the actual pressure distribution data input by users while using the service cannot guarantee the quality of data due to abnormal data, for example, the possibility of pressure distribution due to objects or pressure distribution due to abnormal use of users. Therefore, it is necessary to classify such abnormal data before using it for ConvNet training.

Since most of these abnormal data are generated in unexpected situations, it is very difficult to build a deep learning classification model through labeling with the existing general statistical approach or manual feature engineering.

Accordingly, an object of the present invention is to provide an apparatus and method for classifying abnormal data using an artificial neural network to classify abnormal data input by a user with high accuracy and use only normal data for learning of ConvNet.

Hereinafter, specific means for achieving the object of the present invention will be described.

An object of the present invention is to provide a memory module for storing a program code of an abnormal data classification module that receives time-series distribution data and outputs an abnormal data score for discriminating whether the data is abnormal; and a processing module for processing the program code of the abnormal data classification module, wherein the program code of the abnormal data classification module learns to generate a stationary vector that is a multidimensional vector of the time series distribution data that is normal based on a latent variable a generation vector step in which a generation module, which is a component of the abnormal data classification module, generates a generation vector; a classification target vector step of receiving a classification target vector that is a multidimensional vector of classification target data that is a target of abnormal data classification; an abnormal data score output step of outputting an abnormal data score that is a loss value based on a difference between the generated vector and the vector to be classified; and a latent variable adjustment step of adjusting the latent variable in a direction in which the abnormal data score is lowered. is configured to be performed on a computer, including, wherein the abnormal data classification module determines that the classification target data is the abnormal data when the abnormal data score lowered by the adjustment of the latent variable in the latent variable adjustment step is greater than or equal to a specific value It can be achieved by providing an apparatus for classifying abnormal data using an artificial neural network, characterized in that the classification is performed.

In addition, the memory module further comprises a minute change module for outputting minute change data representing a minute change of the time series distribution data or the classification target data, the processing module further processes the program code of the minute change module, , The program code of the fine change module, a receiving step of receiving the time series distribution data or the classification target data; a change amount data generating step of generating change amount data, which is data on the amount of change in the distribution of the received time series distribution data or the classification target data; a change amount distribution data generating step of converting the change amount data into matrix change amount distribution data; and a minute change data generation step of generating minute change data based on the change amount distribution data according to time; configured to be performed on a computer, wherein the stationary vector includes the minute change data of the time series distribution data and the classification target vector may be configured to include the fine change data of the classification target data.

In addition, the memory module further includes a spatial data module for outputting spatial data indicating a distribution change due to a dynamic movement of the time series distribution data or the classification target data, wherein the processing module includes a program code of the spatial data module further processing, wherein the program code of the spatial data module includes: a receiving step of receiving the time series distribution data or the classification target data; a spatial feature extraction step of inputting the time series distribution data or the classification target data into an embedding network composed of ConvNet and extracting spatial features of the distribution using a feature map; and a spatial data generation step of inputting the spatial feature into a Long-Short Term Memory (LSTM), embedding a sequence that is a temporal feature, and generating spatial data; configured to be performed on a computer, wherein the stationary vector is the time series distribution It may be configured to include the spatial data of data, and the classification target vector may be configured to include the spatial data of the classification target data.

Another object of the present invention is a generation vector step in which the generation vector module, a component of the abnormal data classification module, which receives time series distribution data and outputs an abnormal data score for discriminating whether the data is abnormal or not, generates the generation vector ; a classification target vector step in which the classification target vector module receives a classification target vector that is a multidimensional vector of classification target data that is an object of abnormal data classification; an abnormal data score output step of outputting, by the abnormal data score output module, an abnormal data score that is a loss value based on a difference between the generated vector and the classification target vector; and a latent variable adjustment step in which the latent variable adjustment module adjusts the latent variable in a direction in which the abnormal data score is lowered; the generating module is configured to be performed on a computer, including the time series that is normal based on the latent variable It is learned to generate a normal vector that is a multidimensional vector of distribution data, and the abnormal data classification module is configured to: When the abnormal data score lowered by the adjustment of the latent variable in the latent variable adjustment step is greater than or equal to a specific value, the classification target data is the It can be achieved by providing a method for classifying abnormal data using an artificial neural network, characterized in that the data is classified as abnormal data.

As described above, according to the present invention, there are the following effects.

First, according to an embodiment of the present invention, there is an effect of being able to classify and collect unexpected abnormal data when collecting data. Since abnormal data are most often generated in unexpected situations, it is very difficult to build a deep learning classification model through labeling with the existing general statistical approach or manual feature engineering.

Second, according to an embodiment of the present invention, there is an effect that unsupervised learning for classification, classification, and detection of abnormal data is possible without separate labeling or tagging for abnormal data.

Third, according to an embodiment of the present invention, the characteristics of minute changes that are processed as noise by a general deep learning system and vanishing by the micro change data module can be used for posture classification and abnormal data classification.

Fourth, according to an embodiment of the present invention, even when vibration or noise caused by an external environment occurs due to the combination of the spatial data module and the fine change data module, the characteristics of the posture change and the fine change due to the dynamic movement of the user The effect of being able to use it for posture classification and abnormal data classification is generated. When two data are simultaneously extracted using a general deep learning system, very ambiguous features are extracted and the classification accuracy is reduced.

The following drawings attached to the present specification illustrate preferred embodiments of the present invention, and serve to further understand the technical spirit of the present invention together with the detailed description of the present invention, so that the present invention is limited only to the matters described in those drawings should not be interpreted as

1 is a distribution diagram showing pressure distribution data;

2 is a graph showing the change in pressure magnitude with time;

3 is an exemplary diagram of time series pressure distribution data;

4 is a schematic diagram illustrating an apparatus for classifying abnormal data according to an embodiment of the present invention;

5 is a schematic diagram showing an example of pre-processing according to an embodiment of the present invention;

6 is a flowchart illustrating an example of a preprocessing flow according to an embodiment of the present invention;

7 is a flowchart illustrating a method of generating fine change data of the fine change data module 11 according to an embodiment of the present invention;

8 is a flowchart illustrating a spatial data generation method of the spatial data module 12 according to an embodiment of the present invention;

9 is a flowchart illustrating posture category classification of the posture classification module 13 according to an embodiment of the present invention;

10 is a schematic diagram showing a learning process of the abnormal data classification module 14 according to an embodiment of the present invention;

11 is a schematic diagram illustrating an abnormal data classification process of the abnormal data classification module 14 according to an embodiment of the present invention.

Hereinafter, with reference to the accompanying drawings, a person of ordinary skill in the art to which the present invention pertains will be described in detail an embodiment in which the present invention can be easily carried out. However, in the detailed description of the principle of operation of the preferred embodiment of the present invention, if it is determined that a detailed description of a related known function or configuration may unnecessarily obscure the gist of the present invention, the detailed description thereof will be omitted.

In addition, the same reference numerals are used throughout the drawings for parts having similar functions and functions. Throughout the specification, when it is said that a specific part is connected to another part, this includes not only a case in which it is directly connected, but also a case in which it is indirectly connected with another element interposed therebetween. In addition, the inclusion of specific components does not exclude other components unless otherwise stated, but means that other components may be further included.

For convenience of explanation, the invention is described based on a module for classifying a user's posture based on pressure distribution data, but the scope of the invention is not limited thereto, and a device for classifying a specific category based on time series distribution data However, in a device that classifies a specific category based on time-series distribution data, it may include a range including a feature for classifying abnormal data.

인공신경망을 이용한 비정상 데이터 구분 장치Abnormal data classification device using artificial neural network

With respect to the configuration of the apparatus for classifying abnormal data using an artificial neural network, FIG. 4 is a schematic diagram illustrating an apparatus for classifying abnormal data according to an embodiment of the present invention. As shown in FIG. 4 , the apparatus 1 for classifying abnormal data using an artificial neural network according to an embodiment of the present invention includes a preprocessing module 10 , a fine change data module 11 , a spatial data module 12 , and a posture. It may include a classification module 13 and an abnormal data classification module 14 .

The pre-processing module 10 receives a plurality of pressure sensor data, and performs time window setting, noise removal, normalization, and sensor deflection removal on the received pressure sensor data to generate time-series pressure distribution data as shown in FIG. 3 . to be. The pressure sensor data pre-processed by the pre-processing module 10 is processed as time-series pressure distribution data embedded as time-series data, so that the accuracy of classification by the classification module composed of an artificial neural network is improved. 5 is a schematic diagram illustrating an example of a preprocessing according to an embodiment of the present invention, and FIG. 6 is a flowchart illustrating an example of a preprocessing flow according to an embodiment of the present invention. 5 and 6, the pre-processing module 10 performs time window setting, noise removal, normalization and sensor deflection removal, etc. to output a multi-dimensional vector (pre-processing data) based on the received pressure sensor data. and may be configured to generate time-series pressure distribution data using a plurality of multidimensional vectors (pre-processed data) having a sequence.

The fine change data module 11 is a module that generates fine change data based on the time series pressure distribution data generated by the preprocessing module 10 . The minute change data module 11 according to an embodiment of the present invention generates change amount data, which is data on the amount of change in the pressure distribution, based on the time series pressure distribution data, and converts the change amount data into change amount distribution data in the form of a matrix, Fine change data is generated based on the change amount distribution data over time. Specifically, FIG. 7 is a flowchart illustrating a method of generating fine change data of the fine change data module 11 according to an embodiment of the present invention. As shown in FIG. 7 , the micro change data module 11 according to an embodiment of the present invention receives the preprocessed time series pressure distribution data (S) as input, and each position (i, j) and time ( t), the variation data (ΔS _ij ^t =S _ij ^t+1 - S _ij ^t ) may be calculated. Based on the change amount data, the change amount distribution data P, which is the distribution of the pressure distribution change, is generated through the histogram, and in this process, spatial information about the change disappears and only the distribution information about the change remains. The generation of fine change data based on the change amount distribution data P is the distance between P _t ₊₁ _{and P t (D t} =KL(P _t+1 | |P _t ) is calculated to generate time-dependent distribution data (D), and then the time-dependent distribution data (D) is sequentially used as an element of the output vector as an output vector (O), which is fine change data. ) is created and printed.

According to the fine change data module 11 according to an embodiment of the present invention, the distribution of the pressure distribution change is calculated, the spatial information disappears, and the overall distribution information of the fine change remains as the fine change data. The effect of preventing the expression of ambiguous information that may be propagated later is generated by the fine change data module 11 due to the mixture of the fine change feature and the spatial feature.

The spatial data module 12 is a module for generating spatial data indicating a change in posture due to a dynamic movement of a user based on the time series pressure distribution data generated by the preprocessing module 10 . With respect to the generation of spatial data, the spatial data module 12 generates spatial data by extracting spatial features of the pressure distribution and embedding the spatial features through processing for a plurality of sequences. Specifically, FIG. 8 is a flowchart illustrating a spatial data generation method of the spatial data module 12 according to an embodiment of the present invention. As shown in Fig. 8, the spatial data module 12 inputs the preprocessed time series pressure distribution data composed of a plurality of preprocessed data into the embedding network composed of ConvNet to extract spatial features of the pressure distribution using the Feature Map, , a sequence that is a temporal feature is embedded through Long-Short Term Memory (LSTM) and output as spatial data.

In this case, the spatial data module 12 according to an embodiment of the present invention may be configured to output the average distribution of the preprocessed data by using the output spatial data, which is the embedded vector, as an input of the verification network composed of ConvNet. According to this, it is possible to check whether the spatial data, which is an embedded vector, well contains the spatial characteristics of the pressure distribution, and the spatial data output by the spatial data module 12 is generated in the posture classification module 13 . There is an effect that not only has a spatial feature for category classification, but also includes a feature for classifying abnormal data. In addition, it is learned so that there is no information loss in the process of embedding through the verification network, so that various feature extractions are performed in the embedding process. Therefore, according to the verification network of the spatial data module 12, it is possible to prepare to extract new features of unexpected abnormal data. In addition, the embedded vector (output of the spatial data module 12) is used (shared) for different tasks in the 'verification network', 'posture classification module 13' and 'abnormal data classification module 14', respectively. can be This causes the embedded vector (output of the spatial data module 12) to be learned for multi-task (multiple similar tasks), so that more generalized feature extraction is performed well.

The posture classification module 13 is a module that receives the fine change data generated by the fine change data module 11 and the spatial data generated by the spatial data module 12, performs posture category classification, and generates posture classification data. Specifically, the posture classification module 13 according to an embodiment of the present invention is configured to extract features by embedding through non-linear calculations based on fine change data and spatial data, and to perform posture category classification through a linear algorithm. can 9 is a flowchart illustrating posture category classification of the posture classification module 13 according to an embodiment of the present invention. As shown in FIG. 9 , the posture classification module 13 according to an embodiment of the present invention includes a ConvNet (CNN, Convolutional Neural Network) that serves as a convolution layer and FFNN (Feed) that serves as a fully connected layer. -Forward Neural Network), fine change data and spatial data are input to CNN, nonlinear features are extracted through CNN, and embedded vectors are embedded in the form required for posture category classification based on spatiotemporal features The output and the vector embedded through the CNN becomes an input to the FFNN, and posture category classification is performed through the FFNN.

The abnormal data classification module 14 integrates the fine change data generated by the fine change data module 11, the spatial data generated by the spatial data module 12, and the posture classification data generated by the posture classification module 13. It is a module that receives a vector and classifies whether the plurality of pressure sensor data input to the abnormal data classification device 1 is abnormal data. Specifically, the abnormal data classification module 14 according to an embodiment of the present invention may include a generation module and a classification module, and the generation module receives random noise (Z) using the classification module to receive normal fine change data After learning the generation module to generate a normal vector, which is a multidimensional vector that integrates spatial data and posture classification data, the generation module uses the loss function output L(Loss) of the generation module as an abnormal data score to generate random noise ( Based on whether L is lowered to a specific value or less according to a change in Z), it is possible to distinguish whether data input to the abnormal data classification apparatus 1 is abnormal data.

The generation module of the abnormal data classification module 14 according to an embodiment of the present invention may be configured to generate a normal vector by being composed of an encoder and a decoder, and the encoder of the generation module is configured to classify normal fine change data, spatial data and posture It can be composed of a plurality of consecutive ConvNets that receive mxnx 3 standardized multidimensional vectors with integrated data and encode them into 1 x 1 xk latent variables, and the decoder of the pose transition module (4) has a 1 x 1 xk latent variable. It may consist of a plurality of consecutive networks that decode variables to output them as mxnx 3 multidimensional vectors. At this time, the generation module may be trained to input a multidimensional vector that is a normal vector and output a multidimensional vector close to the normal vector, and the generation module may be learned by the classification module for discriminating whether the multidimensional vector output by the generation module is a normal vector. have.

The classification module of the abnormal data classification module 14 according to an embodiment of the present invention may be configured to distinguish whether a multidimensional vector output by the generation module through a CONCAT function and a plurality of encoders is a normal vector.

The classification module of the abnormal data classification module 14 according to a modification of the present invention may be configured to be updated by the reinforcement learning module. At this time, the Agent of the reinforcement learning module becomes a classification module, the State becomes a multidimensional vector and a stationary vector output by the generation module, and the Action determines whether the multidimensional vector is a normal vector in the state of the state to determine the stationary vector probability. output, and Reward can be configured to be generated when the normal vector division of the division module is correct.

Regarding the learning of the abnormal data classification module 14, FIG. 10 is a schematic diagram illustrating a learning process of the abnormal data classification module 14 according to an embodiment of the present invention. As shown in FIG. 10 , in the generation module, the Loss Function may be configured to configure the division module and the MinMax game, and may be simultaneously learned. Equation 1 below is the loss function of the generation module and the division module.

In Equation 1 above, G denotes a generation module, D denotes a classification module, z denotes a random noise input as a latent variable, y denotes a normal vector that is a multidimensional vector that integrates normal fine change data, spatial data, and posture classification data; G(x) denotes a generative vector that is a generated multidimensional vector. Therefore, according to Equation 1, the loss function of the generating module and the classifying module is D when the generating module is not sufficiently trained and the classifying module perfectly distinguishes y and G(z) through the random noise z, which is a latent variable. It has a max value of 0 by (z,y)=1, D(z,G(z))=0, and after learning of the generating module, the classification module separates y and G(z) through random noise z If not, it has a min value of -log4 by D(z,y)=1/2 and D(z,G(z))=1/2. In other words, the generation vector G(z), which is a multidimensional vector generated by the generation module through random noise z by the above Loss function, and the normal vector y, which is a multidimensional vector integrating normal fine change data, spatial data, and posture classification data, are the same. In this case, the generation module has a global minimum, and the generation module and the classification module are learned in this direction. The generation module and the classification module have the effect of optimizing the generation module quickly due to the mutual antagonistic dependency.

In addition, according to an embodiment of the present invention, a normal vector and a generation module in which a division module is input for each zone to the Loss function of the generation module so that the generation of a multidimensional vector G(z) close to a normal vector proceeds more precisely for each zone It may further include a regional loss function for discriminating between normality and abnormality of the generation vector by comparison of the generation vectors generated by . A zone loss function according to an embodiment of the present invention may be configured as follows.

In Equation 2 above, L _BP (G,D) is the area loss function that is a loss function for each area, i is the i-th layer of the division module, T is the entire layer of the division module, and N _i is the area in the i-th layer It can mean the number of features. Accordingly, if the classification module accurately identifies the normal or abnormal in a specific region in a specific layer of the generated vector, ||D(z,y)-D(z,G(z))||=1, and the normal vector If a specific region in a specific layer of y and the generative vector G(z) is not separated by the division module, ||D(z,y)-D(z,G(z))||=0. Accordingly, the loss function of the abnormal data classification module 14 to which the above-described regional loss function is applied may be configured as follows.

In Equation 3 above, λ is a weight constant, L _BP may mean the regional loss function of Equation 2, and the generation module is learned in a direction to minimize the regional loss function. Accordingly, the generation module is learned so that a generation vector close to a normal vector is output more precisely.

In addition, in order to consider the sequence change order of the multidimensional vector, a loss function may be configured so that the generation module may consider the frame order. According to one embodiment of the present invention generation module generates vector at a certain time the latent variables in t z _t and the previous time, the t-1 generation vector of G time t to a (z _t-1) to the input data in the It can be configured to output G(z _{t ) which is} End, separated module to distinguish Fake sequence of _{(G (z t-1)} , G (z t)) and Real sequence of _{_{(y t-1, y t}} ) through _{_{(z t-1, z t}} ) The sequence loss function of the abnormal data classification module 14 for this purpose may be configured as shown in the following equation.

According to the sequence loss function as in Equation 4 above, the effect that the sequence of the generation vector output by the generation module becomes very similar to that of the normal vector is elaborated.

Regarding the abnormal data classification of the abnormal data classification module 14, FIG. 11 is a schematic diagram illustrating the abnormal data classification process of the abnormal data classification module 14 according to an embodiment of the present invention. 11 , the generation module receives the random noise z to generate a multidimensional vector (generated vector) close to a normal vector, and input user data (classification target data that is a target for discriminating whether or not abnormal data) is generated. Whether the input user data (classification target data) is normal or abnormal is determined based on the difference between the multidimensional vector (classification target vector) and the generated vector.

In addition, the generation module according to an embodiment of the present invention can be configured so that the parameter is fixed after learning, and the latent variable through Back Propagation so that the division loss function (L), which is the difference between G(z) and y, is reduced. and may be configured to adjust for random noise z. Equation 5 below relates to the division loss function (L) for the difference between G(z) and y, and Equation 6 relates to the control of random noise, which is a latent variable.

In Equations 5 and 6 above, L is the division loss function that is the difference between the generated vector generated close to the normal vector and the multidimensional vector (the target vector) of the user data (classification target data), G(z) is the generation vector, and z is Random noise as a latent variable, y denotes a multidimensional vector (classification target vector) of user data (classification target data), and η denotes a learning rate. According to this, if the multidimensional vector y (classification target vector) of the user data (classification target data) is normal data, the loss value of L is reduced by adjusting z to reduce the classification loss function L while the parameters of the generation module G are fixed. is lowered below a certain value. In addition, if the multidimensional vector y (classification target vector) of the user data (classification target data) is abnormal data, even if z is adjusted to reduce the division loss function L while the parameters of the generation module G are fixed, the loss value of L is not specific. does not drop below the value. That is, Loss L when y is abnormal data has a relatively higher value than L when y is normal data. Accordingly, there is an effect that classification of abnormal data, classification of abnormal data, and detection of abnormal data can be performed by using L as an anomaly score.

According to an embodiment of the present invention, there is an effect of being able to classify and collect unexpected abnormal data when collecting data. Since abnormal data are most often generated in unexpected situations, it is very difficult to build a deep learning classification model through labeling with the existing general statistical approach or manual feature engineering. In addition, there is an effect that unsupervised learning for the classification, classification, and detection of abnormal data is possible without separate labeling or tagging for abnormal data. In addition, when classification, classification, and detection of abnormal data are attempted based on other deep learning models and clustering algorithms, the distance between vectors embedded in abstract space has no choice but to be used as an anomaly score. In the existing method, there is no constraint that makes the normal data and the abnormal data far from the abstract space, and it is inaccurate to say that the degree of distance between the embedded vectors expresses the abnormality of the user data (data to be classified). Since the apparatus for classifying abnormal data according to an embodiment of the present invention includes a generation module G that effectively uses the feature extraction performance of deep learning and creates normal data at the same time, the meaning of the abnormality of the user data (classification target data) is anomaly score The effect expressed in

As described above, those skilled in the art to which the present invention pertains will be able to understand that the present invention may be embodied in other specific forms without changing the technical spirit or essential features thereof. Therefore, the above-described embodiments are to be understood in all respects as illustrative and not restrictive. The scope of the present invention is indicated by the following claims rather than the detailed description, and all changes or modifications derived from the meaning and scope of the claims and equivalent concepts should be construed as being included in the scope of the present invention.

The features and advantages described herein are not all inclusive, and many additional features and advantages will become apparent to those skilled in the art, particularly upon consideration of the drawings, the specification, and the claims. Moreover, it should be noted that the language used herein has been principally selected for readability and teaching purposes, and may not be chosen to delineate or limit the subject matter of the present invention.

The foregoing description of embodiments of the present invention has been presented for purposes of illustration. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Those skilled in the art will appreciate that many modifications and variations are possible in light of the above disclosure.

Therefore, the scope of the present invention is not limited by the detailed description, but by any claims of the application based thereon. Accordingly, the disclosure of the embodiments of the present invention is illustrative and not intended to limit the scope of the present invention as set forth in the following claims.

Claims

a memory module for storing a program code of an abnormal data classification module that receives time-series distribution data, determines whether the data is abnormal, and outputs an abnormal data score; and

a processing module for processing the program code of the abnormal data classification module;

including,

The program code of the abnormal data classification module is,

a generation vector generation step of generating a generation vector by a generation module, which is a component of the abnormal data classification module, trained to generate a stationary vector that is a multidimensional vector of the time-series distribution data that is normal based on a latent variable;

A classification target vector receiving step of receiving a classification target vector that is a multidimensional vector of the classification target data to be classified as abnormal data;

an abnormal data score output step of outputting an abnormal data score that is a loss value based on a difference between the generated vector and the vector to be classified; and

a latent variable adjustment step of adjusting the latent variable in a direction in which the abnormal data score is lowered;

configured to be performed on a computer, including

The memory module further includes a program code of a minute change module for outputting minute change data representing a minute change of the time series distribution data or the classification target data, and the processing module further executes the program code of the minute change module process,

The program code of the micro change module,

a receiving step of receiving the time series distribution data or the classification target data;

a change amount data generating step of generating change amount data, which is data on the amount of change in the distribution of the received time series distribution data or the classification target data;

a change amount distribution data generating step of converting the change amount data into matrix change amount distribution data; and

a minute change data generation step of generating minute change data based on the time-dependent distribution data;

configured to be performed on a computer, including

The abnormal data classification module, when the abnormal data score lowered by the adjustment of the latent variable in the latent variable adjustment step is equal to or greater than a specific value, the classification target data is classified as the abnormal data,

The normal vector is configured to include the fine change data of the time series distribution data, the classification target vector is configured to include the fine change data of the classification target data,

The generation module is performed by a classification module that is the abnormal data classification module,

The classification module is configured to be updated by the reinforcement learning module, the Agent of the reinforcement learning module becomes the classification module, State becomes the generated vector and the normal vector output by the generation module, and Action is the State In the situation of , it is to output the normal vector probability by distinguishing whether the generated vector is the normal vector, and Reward is configured to be generated when the normal vector division of the division module is correct,

Abnormal data classification device using artificial neural network.
According to claim 1,

The memory module further includes a program code of a spatial data module for outputting spatial data indicating a distribution change due to a dynamic movement of the time series distribution data or the classification target data, wherein the processing module is configured to further processing the program code,

The program code of the spatial data module,

a receiving step of receiving the time series distribution data or the classification target data;

a spatial feature extraction step of inputting the time series distribution data or the classification target data into an embedding network composed of ConvNet and extracting spatial features of the distribution using a feature map; and

a spatial data generation step of inputting the spatial feature into a Long-Short Term Memory (LSTM), embedding a sequence that is a temporal feature, and generating spatial data;

configured to be performed on a computer, including

The stationary vector is configured to include the spatial data of the time series distribution data, and the classification target vector is configured to include the spatial data of the classification target data,

Abnormal data classification device using artificial neural network.