KR102352954B1

KR102352954B1 - Real-time Abnormal Insider Event Detection on Enterprise Resource Planning Systems via Predictive Auto-regression Model

Info

Publication number: KR102352954B1
Application number: KR1020210070224A
Authority: KR
Inventors: 오현택; 유종민; 김민경; 김옥수; 정세훈
Original assignee: 주식회사 아미크; 한국과학기술원
Priority date: 2021-05-31
Filing date: 2021-05-31
Publication date: 2022-01-19

Abstract

Suggested are a system and method for detecting abnormal behavior of a user of a real-time enterprise information system based on predictive autoregression. In accordance with one embodiment, the method for detecting abnormal behavior of a user of a real-time enterprise information system based on predictive autoregression, which is performed by a computer system, includes the following steps of: using two consecutive session events in an enterprise information system as a present event and a future event; inputting the present event into a predictive autoregression model (PAM) to predict a future event to generate a prediction result; and comparing the prediction result to the actual future event to determine abnormal behavior of a user in real time.

Description

Real-time Abnormal Insider Event Detection on Enterprise Resource Planning Systems via Predictive Auto-regression Model}

아래의 실시예들은 기업정보시스템에 대한 사용자 이상행위 탐지 시스템 및 방법에 관한 것으로, 더욱 상세하게는 예측 자동 회귀 기반 실시간 기업정보시스템에 대한 사용자 이상행위 탐지 시스템 및 방법에 관한 것이다. The following embodiments relate to a system and method for detecting an abnormal user behavior for a corporate information system, and more particularly, to a system and method for detecting an abnormal user behavior for a predictive auto-regression-based real-time corporate information system.

기업정보시스템(Enterprise Resource Planning System, ERP 시스템)은 기업의 모든 부문에서 비즈니스 프로세스를 관리하는 데 도움이 되는 포괄적인 자원 관리 시스템이다. An Enterprise Resource Planning System (ERP system) is a comprehensive resource management system that helps manage business processes in all sectors of an enterprise.

도 1은 일반적인 ERP 시스템을 설명하기 위한 도면이다.1 is a view for explaining a general ERP system.

도 1을 참조하면, ERP 시스템(110)에서는 전반적인 비즈니스 프로세스(인력(101), 재무(102), 생산(103), 영업(104), 운영(105) 등)의 자원과 정보를 유기적으로 공유하여 운영 효율성을 개선(121, 122)할 수 있다. 즉, ERP 시스템(110)은 복잡한 기업 구조를 가진 대부분의 현대 기업에서 필수적인 비즈니스 허브 역할을 한다. 그러나 기업 내에서 중요하고 중요한 데이터를 처리하는 ERP 시스템(110)은 이러한 중요한 정보를 침해하는 기업 위협의 위험에 직면해 있다. 특히, 내부자의 기업 위협은 고려해야 할 가장 중요한 문제 중 하나로 간주되어 왔다. 내부자에는 현재 및 이전 직원뿐만 아니라 조직 시스템과 데이터에 대한 적절한 액세스 권한을 가진 계약자나 비즈니스 파트너도 포함된다. 일반적으로 내부자 위협은 외부 공격자에 의한 위협보다 기업에게 훨씬 더 위험하고 비용이 많이 든다. 내부자는 이미 시스템에 대한 합법적인 접근 권한을 가지고 있기 때문에 일반적으로 조직에서 중요한 데이터가 저장되는 위치를 알고 있다. 또한 내부자가 기업 자원에 접근하는 것이 일반적이라고 생각하기 쉽다. 한 보고서에 따르면, 보안 관리자의 약 80% 이상이 조직의 내부자 위협 효과가 '일부 효과적'이라고 응답했다. 내부자 위협을 감지하고 조직을 보호해야 할 필요성이 최근까지 제기되어 왔다.Referring to FIG. 1 , the ERP system 110 organically shares resources and information of the overall business process (people 101, finance 102, production 103, sales 104, operation 105, etc.) Thus, it is possible to improve the operational efficiency (121, 122). That is, the ERP system 110 serves as an essential business hub in most modern enterprises having a complex corporate structure. However, the ERP system 110 that processes important and important data within the enterprise faces the risk of enterprise threats that infringe such important information. In particular, insider corporate threats have been regarded as one of the most important issues to consider. Insiders include current and former employees, as well as contractors or business partners with appropriate access to organizational systems and data. In general, insider threats are much more risky and costly to businesses than threats posed by external attackers. Since insiders already have legitimate access to the system, they usually know where sensitive data is stored in the organization. It is also easy to assume that it is common for insiders to access corporate resources. According to one report, more than 80% of security managers say their organization's insider threat effectiveness is 'somewhat effective'. The need to detect insider threats and protect organizations has been raised until recently.

내부자 위협 예방의 필요성이 대두되자, 시스템 내부자의 비정상적인 활동을 감지하기 위한 역할 및 시나리오 기반 접근법이 제시되었다. 이러한 접근 방식은 사전 정의되거나 알려진 비정상적이거나 위협적인 행동을 식별하기 위한 확실한 성능을 얻었다. 그러나 예상치 못한 특이한 행동에 대처할 수 없다는 점에서 한계가 있다. 비정상적인 행동에 대해 미리 정의된 정보를 사용하는 것의 한계를 극복하기 위해, 그리고 대규모 데이터셋을 사용할 수 있게 되면서 다양한 딥러닝 기반 접근법이 제안되었다. 이러한 접근 방식은 입력 장치 또는 데이터 누출로 이어지는 네트워크 신호의 비정상적인 움직임을 분석하여 비정상적인 내부자 행동을 감지하려고 했다. 그러나 이러한 간접 접근법은 ERP 시스템(110)의 실제 비정상적인 내부자 행동과 관련이 없을 수 있는 허위 경보를 자주 만들 수 있다는 점에서 한계가 있다. 또한, 내부자 위협 감지가 실시간으로 수행되어야 할 필요성을 고려하지 않았다.As the need for insider threat prevention has emerged, role and scenario-based approaches have been proposed to detect anomalous activity inside the system. This approach has gained solid performance for identifying predefined or known anomalous or threatening behavior. However, there is a limitation in that it cannot cope with unexpected and unusual behavior. To overcome the limitations of using predefined information for abnormal behavior, and as large datasets become available, various deep learning-based approaches have been proposed. This approach sought to detect anomalous insider behavior by analyzing anomalous movement in input devices or network signals leading to data leaks. However, this indirect approach is limited in that it can frequently create false alarms that may not be related to the actual abnormal insider behavior of the ERP system 110 . It also did not take into account the need for insider threat detection to be performed in real time.

M. Du, F. Li, G. Zheng, and V. Srikumar, "Deeplog: Anomaly detection and diagnosis from system logs through deep learning," in Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, pp. 1285-1298, 2017. M. Du, F. Li, G. Zheng, and V. Srikumar, "Deeplog: Anomaly detection and diagnosis from system logs through deep learning," in Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, pp. 1285-1298, 2017. P. Wang, B. Xu, J. Xu, G. Tian, C.-L. Liu, and H. Hao, “Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification," Neurocomputing, vol. 174, pp. 806-814, 2016. P. Wang, B. Xu, J. Xu, G. Tian, C.-L. Liu, and H. Hao, “Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification,” Neurocomputing, vol. 174, pp. 806-814, 2016. J. Yu, Y. Lee, K. C. Yow, M. Jeon, and W. Pedrycz, “Abnormal event detection and localization via adversarial event prediction," IEEE Transactions on Neural Networks and Learning Systems, 2021. J. Yu, Y. Lee, K. C. Yow, M. Jeon, and W. Pedrycz, “Abnormal event detection and localization via adversarial event prediction,” IEEE Transactions on Neural Networks and Learning Systems, 2021. P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, P.-A. Manzagol, and L. Bottou, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion.," Journal of machine learning research, vol. 11, no. 12, 2010. P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, P.-A. Manzagol, and L. Bottou, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion.," Journal of machine learning research, vol. 11, no. 12, 2010. A. Sherstinsky, "Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network," Physica D: Nonlinear Phenomena, vol. 404, p. 132306, 2020. A. Sherstinsky, "Fundamentals of recurrent neural network (rnn) and long short-term memory (lstm) network," Physica D: Nonlinear Phenomena, vol. 404, p. 132306, 2020. J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, "Empirical evaluation of gated recurrent neural networks on sequence modeling," arXiv preprint arXiv:1412.3555, 2014. J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, “Empirical evaluation of gated recurrent neural networks on sequence modeling,” arXiv preprint arXiv:1412.3555, 2014. J. R. Hershey and P. A. Olsen, "Approximating the kullback leibler divergence between gaussian mixture models," in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'07, vol. 4, pp. IV-317, IEEE, 2007. J. R. Hershey and P. A. Olsen, "Approximating the kullback leibler divergence between gaussian mixture models," in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing-ICASSP'07, vol. 4, pp. IV-317, IEEE, 2007.

실시예들은 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템 및 방법에 관하여 기술하며, 보다 구체적으로 예측 자동 회귀 모델(Predictive Auto-regression Model, PAM)을 이용하여 실시간으로 기업정보시스템에 대한 사용자 이상행위를 탐지할 수 있는 기술을 제공한다. The embodiments describe a predictive automatic regression-based real-time enterprise information system user anomaly detection system and method, and more specifically, a predictive auto-regression model (PAM) for real-time user It provides technology that can detect abnormal behavior.

실시예들은 현재 이벤트를 예측 자동 회귀 모델(PAM)에 입력하여 미래 이벤트를 예측하고, 예측 결과와 실제 미래 이벤트를 비교함으로써 실시간으로 사용자 이상행위를 판단하는, 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템 및 방법을 제공하는데 있다. Embodiments predict a future event by inputting a current event into a predictive automatic regression model (PAM), and determine a user abnormality in real time by comparing the prediction result and the actual future event, predictive autoregression-based real-time enterprise information system user abnormality To provide a behavior detection system and method.

일 실시예에 따른 컴퓨터 시스템에 의해 수행되는 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법은, 기업정보시스템에서 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용하는 단계; 상기 현재 이벤트를 예측 자동 회귀 모델(Predictive Auto-regression Model, PAM)에 입력하여 미래 이벤트를 예측하여 예측 결과를 생성하는 단계; 및 상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단하는 단계를 포함하여 이루어질 수 있다. According to an embodiment, a method for detecting user abnormal behavior based on predictive autoregression based on real-time corporate information system performed by a computer system includes: using two consecutive session events in a corporate information system as a current event and a future event; generating a prediction result by predicting a future event by inputting the current event into a predictive auto-regression model (PAM); and comparing the prediction result with the actual future event to determine the user's abnormal behavior in real time.

상기 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용하는 단계는, 기업정보시스템에서 원시 시스템 로그를 dense 임베딩 벡터(Dense Embedding Vector, DEV) 또는 원핫 인코딩(one-hot encoding)으로 인코딩하는 단계를 포함하고, 상기 dense 임베딩 벡터 또는 원핫 인코딩(one-hot encoding)을 이용하여 상기 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용할 수 있다. In the step of using the two consecutive session events as a current event and a future event by using the two consecutive session events, the raw system log is encoded by a dense embedding vector (DEV) or one-hot encoding in the corporate information system. and using the two consecutive session events using the dense embedding vector or one-hot encoding to be used as a current event and a future event.

상기 예측 자동 회귀 모델(PAM)은, 입력되는 상기 현재 이벤트를 이용하여 미래 이벤트를 예측하도록 훈련될 수 있다. The predictive autoregression model (PAM) may be trained to predict a future event using the input current event.

상기 현재 이벤트를 예측 자동 회귀 모델(PAM)에 입력하여 미래 이벤트를 예측하여 예측 결과를 생성하는 단계는, 인코더를 이용하여 상기 현재 이벤트를 인코딩하여 잠재 특징을 추출하는 단계; 및 추출된 상기 잠재 특징에 예측 변수를 사용하여 상기 예측 결과를 생성하는 단계를 포함할 수 있다. The step of generating a prediction result by inputting the current event into a predictive autoregression model (PAM) to predict a future event may include: extracting a latent feature by encoding the current event using an encoder; and generating the prediction result by using a predictor variable for the extracted latent feature.

상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단하는 단계는, 평균 제곱 오차(MSE)를 기반으로 상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 산정된 오류가 기설정된 임계값 이상인 경우 사용자 이상행위로 판단될 수 있다. The step of comparing the prediction result and the actual future event to determine the user's abnormal behavior in real time may include comparing the prediction result with the actual future event based on a mean square error (MSE), and the calculated error is a preset threshold value. If it is abnormal, it may be judged as an abnormal behavior of the user.

상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단하는 단계는, 예측 오류 손실을 산정하여 상기 예측 결과와 실제 상기 미래 이벤트의 차이를 판단할 수 있다. Comparing the prediction result and the actual future event to determine the user abnormal behavior in real time may include calculating a prediction error loss to determine a difference between the prediction result and the actual future event.

상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단하는 단계는, 인코딩 및 디코딩 과정에서 생성되는 잠재 특징의 불확실성을 낮추기 위해 자동 회귀 손실을 적용하며, 상기 자동 회귀 손실은 실제 상기 미래 이벤트의 특징을 추출했을 때의 결과와 상기 현재 이벤트를 통한 예측을 위한 특징을 도출했을 때의 결과의 차이일 수 있다. The step of comparing the prediction result and the actual future event to determine the user anomaly in real time applies an auto-regression loss to lower the uncertainty of the latent feature generated in the encoding and decoding process, and the auto-regression loss is the actual It may be a difference between a result of extracting a feature of a future event and a result of extracting a feature for prediction through the current event.

다른 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템은, 기업정보시스템에서 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용하는 전처리부; 상기 현재 이벤트를 입력하여 미래 이벤트를 예측하여 예측 결과를 생성하는 예측 자동 회귀 모델(Predictive Auto-regression Model, PAM); 및 상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단하는 사용자 이상행위 판단부를 포함하여 이루어질 수 있다. A predictive automatic regression-based real-time enterprise information system user anomaly detection system according to another embodiment includes: a preprocessor that uses two consecutive session events in the enterprise information system as a current event and a future event; a predictive auto-regression model (PAM) for generating a prediction result by predicting a future event by inputting the current event; and a user abnormal behavior determination unit that compares the prediction result with the actual future event to determine the user abnormal behavior in real time.

상기 전처리부는, 기업정보시스템에서 원시 시스템 로그를 dense 임베딩 벡터(Dense Embedding Vector, DEV) 또는 원핫 인코딩(one-hot encoding)으로 인코딩하고, 상기 dense 임베딩 벡터 또는 원핫 인코딩(one-hot encoding)을 이용하여 상기 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용할 수 있다. The preprocessor encodes a raw system log in a corporate information system with a dense embedding vector (DEV) or one-hot encoding, and uses the dense embedding vector or one-hot encoding Thus, the two consecutive session events can be used as a current event and a future event.

상기 예측 자동 회귀 모델(PAM)은, 상기 현재 이벤트를 인코딩하여 잠재 특징을 추출하는 인코더; 및 추출된 상기 잠재 특징에 예측 변수를 사용하여 상기 예측 결과를 생성하는 예측기를 포함할 수 있다. The predictive autoregression model (PAM) may include an encoder for encoding the current event to extract latent features; and a predictor that generates the prediction result by using a predictor variable for the extracted latent feature.

상기 사용자 이상행위 판단부는, 평균 제곱 오차(MSE)를 기반으로 상기 예측 결과와 실제 상기 미래 이벤트를 비교하여 산정된 오류가 기설정된 임계값 이상인 경우 사용자 이상행위로 판단될 수 있다. The user abnormal behavior determination unit may determine the user abnormal behavior when an error calculated by comparing the prediction result and the actual future event based on a mean square error (MSE) is equal to or greater than a preset threshold.

상기 사용자 이상행위 판단부는, 예측 오류 손실을 산정하여 상기 예측 결과와 실제 상기 미래 이벤트의 차이를 판단할 수 있다. The user abnormal behavior determination unit may determine a difference between the prediction result and the actual future event by calculating a prediction error loss.

상기 사용자 이상행위 판단부는, 인코딩 및 디코딩 과정에서 생성되는 잠재 특징의 불확실성을 낮추기 위해 자동 회귀 손실을 적용하며, 상기 자동 회귀 손실은 실제 상기 미래 이벤트의 특징을 추출했을 때의 결과와 상기 현재 이벤트를 통한 예측을 위한 특징을 도출했을 때의 결과의 차이일 수 있다. The user anomaly determination unit applies an auto-regression loss to reduce uncertainty of latent features generated in the encoding and decoding process, and the auto-regression loss is a result of extracting the features of the future event and the current event. It may be the difference in the results when the features for prediction are derived through the

실시예들에 따르면 예측 자동 회귀 모델(Predictive Auto-regression Model, PAM)을 이용하여 실시간으로 기업정보시스템에 대한 사용자 이상행위를 탐지할 수 있는, 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템 및 방법을 제공할 수 있다.According to embodiments, a predictive auto-regression-based real-time enterprise information system user anomaly detection system capable of detecting a user anomaly with respect to an enterprise information system in real time using a predictive auto-regression model (PAM) and methods may be provided.

실시예들에 따르면 현재 이벤트를 예측 자동 회귀 모델(PAM)에 입력하여 미래 이벤트를 예측하고, 예측 결과와 실제 미래 이벤트를 비교함으로써 실시간으로 사용자 이상행위를 판단하는, 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템 및 방법을 제공할 수 있다. According to embodiments, predictive auto-regression-based real-time corporate information system that predicts future events by inputting current events into a predictive automatic regression model (PAM), and determines user abnormal behavior in real time by comparing the predicted results and actual future events A system and method for detecting user anomalies may be provided.

도 1은 일반적인 ERP 시스템을 설명하기 위한 도면이다.
도 2는 일 실시예에 따른 컴퓨터 시스템의 내부 구성의 일례를 설명하기 위한 블록도이다.
도 3은 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템을 나타내는 블록도이다.
도 4는 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법을 나타내는 흐름도이다.
도 5는 일 실시예에 따른 ERP 시스템에 대해 제안된 AIED을 나타내는 도면이다.
도 6는 일 실시예에 따른 PAM의 구조적 세부 사항을 나타내는 도면이다. 1 is a view for explaining a general ERP system.
2 is a block diagram illustrating an example of an internal configuration of a computer system according to an embodiment.
3 is a block diagram illustrating a system for detecting abnormal behavior of a user in a real-time enterprise information system based on predictive automatic regression according to an embodiment.
4 is a flowchart illustrating a method for detecting an abnormal behavior of a user in a real-time enterprise information system based on predictive auto-regression according to an embodiment.
5 is a diagram illustrating an AIED proposed for an ERP system according to an embodiment.
6 is a diagram illustrating structural details of a PAM according to an embodiment.

이하, 첨부된 도면을 참조하여 실시예들을 설명한다. 그러나, 기술되는 실시예들은 여러 가지 다른 형태로 변형될 수 있으며, 본 발명의 범위가 이하 설명되는 실시예들에 의하여 한정되는 것은 아니다. 또한, 여러 실시예들은 당해 기술분야에서 평균적인 지식을 가진 자에게 본 발명을 더욱 완전하게 설명하기 위해서 제공되는 것이다. 도면에서 요소들의 형상 및 크기 등은 보다 명확한 설명을 위해 과장될 수 있다.Hereinafter, embodiments will be described with reference to the accompanying drawings. However, the described embodiments may be modified in various other forms, and the scope of the present invention is not limited by the embodiments described below. In addition, various embodiments are provided in order to more completely explain the present invention to those of ordinary skill in the art. The shapes and sizes of elements in the drawings may be exaggerated for clearer description.

정보통신기술의 발전과 기업의 세계화로 많은 기업이 ERP(Enterprise Resource Planning) 시스템이라는 전기 자원 관리 시스템을 통해 운영되고 있다. ERP 시스템은 기업을 위해 효율적이고 중앙 집중화된 자원 관리를 가능하게 한다. 그러나 많은 기업 자원이 시스템에 의해 관리되고 있기 때문에 내부자의 위협적인 행동은 ERP 시스템 운영에서 가장 중요한 위험 중 하나이다. 내부자가 기업자원에 접근하는 것은 정상적인 이벤트로 간주되기 때문에 외부인의 위협에 비해 훨씬 은밀하고 치명적이다. 기존의 내부자 위협 감지 방법은 시스템 관리자가 수동으로 정의한 특정 이벤트를 감지하는 것을 목표로 한다. 이러한 접근 방식은 이벤트 패턴의 변화에 강력하지 않으며, 미리 정의된 사례가 제시되지 않은 경우에는 사용할 수 없다. With the development of information and communication technology and the globalization of companies, many companies are operating through an electric resource management system called an ERP (Enterprise Resource Planning) system. ERP systems enable efficient and centralized resource management for enterprises. However, since many corporate resources are managed by the system, the threatening behavior of insiders is one of the most important risks in ERP system operation. Since insider access to corporate resources is considered a normal event, it is far more stealthy and lethal than threats from outsiders. Existing insider threat detection methods aim to detect specific events manually defined by system administrators. This approach is not robust to changing event patterns, and cannot be used unless a predefined case is presented.

본 발명의 실시예에서는 예측 자동 회귀 모델(Predictive Auto-regression Model, PAM)을 사용하여 실시간으로 비정상적인 내부자 이벤트 감지 방법을 제시한다. 기존 접근 방식과 비교하여 제안된 방법은 일반적인 이벤트를 사용하여 예측 모델을 컴파일하고 예측 결과의 가능성이 임계값보다 낮을 때 위협을 식별할 수 있다. 실험은 일련의 ERP 시스템 로그로 정의된 이벤트를 포함하여 데이터셋을 사용하여 수행된다. 로그는 기업의 실제 상황에서 캡처(capture)된다. 결과는 제안된 방법이 ERP 시스템에서 비정상적인 이벤트를 성공적으로 식별할 수 있음을 보여준다.In an embodiment of the present invention, a method for detecting an abnormal insider event in real time using a predictive auto-regression model (PAM) is presented. Compared with the existing approaches, the proposed method can compile a predictive model using common events and identify threats when the probability of predictive outcomes is lower than a threshold. Experiments are performed using a dataset containing defined events as a series of ERP system logs. Logs are captured in the real world of the enterprise. The results show that the proposed method can successfully identify abnormal events in the ERP system.

본 발명의 실시예에서는 사전 정의된 비정상 또는 비정상적인 이벤트 사례 없이 기업 내부자의 비정상 이벤트를 식별할 수 있는 ERP 시스템에 대한 실시간으로 비정상 내부자 이벤트 감지 방법을 제시한다. 이 발명의 핵심 가설은 정상 이벤트에 의해서만 훈련된 모델이 정상 이벤트를 잘 예측할 수 있다는 것이다. 이 가설을 사용하여 제안된 방법은 정상 이벤트 샘플에 의해 훈련되며, 예측 결과가 해당 미래 이벤트와 일치하지 않을 때 비정상적인 이벤트가 식별된다. 이러한 방식으로, 특히 예측 결과를 사용하여 제안된 모델은 거의 실시간으로 비정상적인 사용자 이벤트에 대응할 수 있다. 이 방법론의 경우 정확한 예측 모델을 도출하는 것이 중요하다. An embodiment of the present invention proposes a method for detecting an abnormal insider event in real time for an ERP system that can identify an abnormal event of a company insider without a predefined case of abnormal or abnormal event. A key hypothesis of this invention is that a model trained only on normal events can predict normal events well. Using this hypothesis, the proposed method is trained on a sample of normal events, and an anomalous event is identified when the prediction result does not match the corresponding future event. In this way, in particular, using the prediction results, the proposed model can respond to anomalous user events in near real-time. For this methodology, it is important to derive an accurate predictive model.

예측 결과를 개선하기 위해, 실시예들은 보다 차별적인 예측 모델을 도출할 수 있는 예측 자동 회귀 모델(PAM)을 제안한다. 비정상적인 내부자 이벤트를 감지하는 데 제안된 방법의 효과를 입증하기 위해 실제 이벤트 데이터셋을 활용하는 방법을 평가한다. 데이터셋은 실제 기업에서 캡처한 ERP 시스템 로그로 구성된다. 실험 결과는 ERP 시스템에서 비정상적인 이벤트를 감지하는 데 제안된 방법의 효과를 보여준다.In order to improve the prediction result, the embodiments propose a predictive autoregressive model (PAM) capable of deriving a more differentiated predictive model. To demonstrate the effectiveness of the proposed method in detecting anomalous insider events, we evaluate how to utilize real event datasets. The dataset consists of ERP system logs captured by real companies. The experimental results show the effectiveness of the proposed method in detecting abnormal events in the ERP system.

본 발명의 주요 기여사항은 다음과 같다.The main contributions of the present invention are as follows.

장단기 메모리(LSTM)를 기반으로 한 새로운 실시간 이상 내부자 이벤트 감지 방법이 제안된다. 제안된 방법은 실시간 처리 속도로 비정상적인 내부자 이벤트에 대한 강력한 감지 성능을 제공한다.A new real-time anomaly insider event detection method based on long-term memory (LSTM) is proposed. The proposed method provides strong detection performance for abnormal insider events with real-time processing speed.

주어진 정상 이벤트를 사용하여 정확한 예측 모델을 도출하기 위해 예측 자동 회귀 모델(PAM)이 제안되며, 이는 제안된 차별적 확률 모델을 활용하여 정상 및 비정상 이벤트를 식별한다.To derive an accurate predictive model using a given normal event, a predictive autoregressive model (PAM) is proposed, which utilizes the proposed differential probabilistic model to identify normal and abnormal events.

위의 기여사항 외에도, 최적의 하이퍼 파라미터 설정을 찾기 위해 포괄적인 실험 결과를 제공한다. 또한, 기존의 시계열 예측 모델과 비정상적인 내부자 이벤트 감지(AIED)와 기업 위협 감지(ETD)를 위한 기존 최첨단 방법을 포함한 다양한 방법과의 성능 비교를 제공한다.In addition to the above contributions, we present comprehensive experimental results to find the optimal hyperparameter settings. It also provides a performance comparison of traditional time-series predictive models and a variety of methods, including existing state-of-the-art methods for anomalous insider event detection (AIED) and enterprise threat detection (ETD).

아래에서는 AIED 및 ETD에 대한 관련 연구를 소개하고, PAM의 전체 아키텍처를 포함하여 제안된 방법의 세부 사항을 설명한다. Below, we introduce relevant studies on AIED and ETD, and describe the details of the proposed method, including the overall architecture of PAM.

도 2는 일 실시예에 따른 컴퓨터 시스템의 내부 구성의 일례를 설명하기 위한 블록도이다. 예를 들어, 본 발명의 실시예들에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템이 도 2의 컴퓨터 시스템(장치)(200)을 통해 구현될 수 있다. 도 2에 도시한 바와 같이, 컴퓨터 시스템(200)은 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법을 실행하기 위한 구성요소로서 프로세서(210), 메모리(220), 영구 저장 장치(230), 버스(240), 입출력 인터페이스(250) 및 네트워크 인터페이스(260)를 포함할 수 있다.2 is a block diagram illustrating an example of an internal configuration of a computer system according to an embodiment. For example, the predictive automatic regression-based real-time enterprise information system user abnormal behavior detection system according to embodiments of the present invention may be implemented through the computer system (device) 200 of FIG. 2 . As shown in FIG. 2 , the computer system 200 includes a processor 210 , a memory 220 , and a permanent storage device 230 as components for executing a predictive automatic regression-based real-time enterprise information system user abnormal behavior detection method. , a bus 240 , an input/output interface 250 , and a network interface 260 may be included.

프로세서(210)는 명령어들의 임의의 시퀀스를 처리할 수 있는 임의의 장치를 포함하거나 그의 일부일 수 있다. 프로세서(210)는 예를 들어 컴퓨터 프로세서, 이동 장치 또는 다른 전자 장치 내의 프로세서 및/또는 디지털 프로세서를 포함할 수 있다. 프로세서(210)는 예를 들어, 서버 컴퓨팅 디바이스, 서버 컴퓨터, 일련의 서버 컴퓨터들, 서버 팜, 클라우드 컴퓨터, 컨텐츠 플랫폼, 이동 컴퓨팅 장치, 스마트폰, 태블릿, 셋톱 박스, 미디어 플레이어 등에 포함될 수 있다. 프로세서(210)는 버스(240)를 통해 메모리(220)에 접속될 수 있다.Processor 210 may include or be part of any apparatus capable of processing any sequence of instructions. Processor 210 may include, for example, a computer processor, a processor in a mobile device, or other electronic device and/or a digital processor. The processor 210 may be included in, for example, a server computing device, a server computer, a set of server computers, a server farm, a cloud computer, a content platform, a mobile computing device, a smartphone, a tablet, a set-top box, a media player, and the like. The processor 210 may be connected to the memory 220 via a bus 240 .

메모리(220)는 컴퓨터 시스템(200)에 의해 사용되거나 그에 의해 출력되는 정보를 저장하기 위한 휘발성 메모리, 영구, 가상 또는 기타 메모리를 포함할 수 있다. 메모리(220)는 예를 들어 랜덤 액세스 메모리(RAM: random access memory) 및/또는 동적 RAM(DRAM: dynamic RAM)을 포함할 수 있다. 메모리(220)는 컴퓨터 시스템(200)의 상태 정보와 같은 임의의 정보를 저장하는 데 사용될 수 있다. 메모리(220)는 예를 들어 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지를 위한 명령어들을 포함하는 컴퓨터 시스템(200)의 명령어들을 저장하는 데에도 사용될 수 있다. 컴퓨터 시스템(200)은 필요에 따라 또는 적절한 경우에 하나 이상의 프로세서(210)를 포함할 수 있다.Memory 220 may include volatile memory, persistent, virtual, or other memory for storing information used by or output by computer system 200 . The memory 220 may include, for example, random access memory (RAM) and/or dynamic RAM (DRAM). Memory 220 may be used to store any information, such as state information of computer system 200 . The memory 220 may also be used to store instructions of the computer system 200 including, for example, instructions for detecting a predictive auto-regression-based real-time corporate information system user anomaly. Computer system 200 may include one or more processors 210 as needed or appropriate.

버스(240)는 컴퓨터 시스템(200)의 다양한 컴포넌트들 사이의 상호작용을 가능하게 하는 통신 기반 구조를 포함할 수 있다. 버스(240)는 예를 들어 컴퓨터 시스템(200)의 컴포넌트들 사이에, 예를 들어 프로세서(210)와 메모리(220) 사이에 데이터를 운반할 수 있다. 버스(240)는 컴퓨터 시스템(200)의 컴포넌트들 간의 무선 및/또는 유선 통신 매체를 포함할 수 있으며, 병렬, 직렬 또는 다른 토폴로지 배열들을 포함할 수 있다.Bus 240 may include a communications infrastructure that enables interaction between various components of computer system 200 . Bus 240 may carry data between, for example, components of computer system 200 , such as between processor 210 and memory 220 . Bus 240 may include wireless and/or wired communication media between components of computer system 200 , and may include parallel, serial, or other topological arrangements.

영구 저장 장치(230)는 (예를 들어, 메모리(220)에 비해) 소정의 연장된 기간 동안 데이터를 저장하기 위해 컴퓨터 시스템(200)에 의해 사용되는 바와 같은 메모리 또는 다른 영구 저장 장치와 같은 컴포넌트들을 포함할 수 있다. 영구 저장 장치(230)는 컴퓨터 시스템(200) 내의 프로세서(210)에 의해 사용되는 바와 같은 비휘발성 메인 메모리를 포함할 수 있다. 영구 저장 장치(230)는 예를 들어 플래시 메모리, 하드 디스크, 광 디스크 또는 다른 컴퓨터 판독 가능 매체를 포함할 수 있다.Persistent storage 230 is a component, such as memory or other persistent storage, as used by computer system 200 to store data for an extended period of time (eg, compared to memory 220 ). may include Persistent storage 230 may include non-volatile main memory as used by processor 210 in computer system 200 . Persistent storage 230 may include, for example, flash memory, a hard disk, an optical disk, or other computer readable medium.

입출력 인터페이스(250)는 키보드, 마우스, 음성 명령 입력, 디스플레이 또는 다른 입력 또는 출력 장치에 대한 인터페이스들을 포함할 수 있다. 구성 명령들 및/또는 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지를 위한 정보가 입출력 인터페이스(250)를 통해 수신될 수 있다.Input/output interface 250 may include interfaces to a keyboard, mouse, voice command input, display, or other input or output device. Information for configuration commands and/or predictive automatic regression-based real-time enterprise information system user abnormal behavior detection may be received through the input/output interface 250 .

네트워크 인터페이스(260)는 근거리 네트워크 또는 인터넷과 같은 네트워크들에 대한 하나 이상의 인터페이스를 포함할 수 있다. 네트워크 인터페이스(260)는 유선 또는 무선 접속들에 대한 인터페이스들을 포함할 수 있다. 구성 명령들 및/또는 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지를 위한 정보는 네트워크 인터페이스(260)를 통해 수신될 수 있다.Network interface 260 may include one or more interfaces to networks such as a local area network or the Internet. Network interface 260 may include interfaces for wired or wireless connections. Information for configuration commands and/or predictive auto-regression-based real-time enterprise information system user abnormal behavior detection may be received through the network interface 260 .

또한, 다른 실시예들에서 컴퓨터 시스템(200)은 도 2의 구성요소들보다 더 많은 구성요소들을 포함할 수도 있다. 그러나, 대부분의 종래기술적 구성요소들을 명확하게 도시할 필요성은 없다. Also, in other embodiments, computer system 200 may include more components than those of FIG. 2 . However, there is no need to clearly show most of the prior art components.

도 3은 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템을 나타내는 블록도이다.3 is a block diagram illustrating a system for detecting abnormal behavior of a user in a real-time enterprise information system based on predictive automatic regression according to an embodiment.

도 3은 도 2의 일 실시예에 따른 컴퓨터 시스템(200)의 프로세서(210)가 포함할 수 있는 구성요소의 예를 도시한 도면이다. 여기서, 컴퓨터 시스템(200)의 프로세서(210)는 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템(200)을 포함할 수 있다. 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템(300)은 전처리부(310), 예측 자동 회귀 모델(320) 및 사용자 이상행위 탐지부(330)를 포함하여 이루어질 수 있다. 3 is a diagram illustrating an example of components that the processor 210 of the computer system 200 according to the embodiment of FIG. 2 may include. Here, the processor 210 of the computer system 200 may include the predictive automatic regression-based real-time enterprise information system user abnormal behavior detection system 200 according to an embodiment. The predictive automatic regression-based real-time enterprise information system user anomaly detection system 300 according to an embodiment may include a preprocessor 310 , a predictive automatic regression model 320 and a user anomaly detection unit 330 . .

프로세서(210) 및 프로세서(210)의 구성요소들은 도 4의 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법이 포함하는 단계들(S410 내지 S430)을 수행할 수 있다. 예를 들어, 프로세서(210) 및 프로세서(210)의 구성요소들은 메모리(220)가 포함하는 운영체제의 코드와 상술한 적어도 하나의 프로그램 코드에 따른 명령(instruction)을 실행하도록 구현될 수 있다. 여기서, 적어도 하나의 프로그램 코드는 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지를 처리하기 위해 구현된 프로그램의 코드에 대응될 수 있다.The processor 210 and the components of the processor 210 may perform the steps S410 to S430 included in the predictive automatic regression-based real-time enterprise information system user anomaly detection method of FIG. 4 . For example, the processor 210 and components of the processor 210 may be implemented to execute an operating system code included in the memory 220 and an instruction according to at least one program code described above. Here, the at least one program code may correspond to a code of a program implemented to process the prediction automatic regression-based real-time enterprise information system user abnormal behavior detection.

예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법은 도시된 순서대로 발생하지 않을 수 있으며, 단계들 중 일부가 생략되거나 추가의 과정이 더 포함될 수 있다.The predictive automatic regression-based real-time corporate information system user abnormal behavior detection method may not occur in the order shown, and some of the steps may be omitted or additional processes may be further included.

도 4는 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법을 나타내는 흐름도이다.4 is a flowchart illustrating a method for detecting an abnormal behavior of a user in a real-time enterprise information system based on predictive auto-regression according to an embodiment.

도 4를 참조하면, 일 실시예에 따른 컴퓨터 시스템에 의해 수행되는 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법은, 기업정보시스템에서 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용하는 단계(S410), 현재 이벤트를 예측 자동 회귀 모델(Predictive Auto-regression Model, PAM)에 입력하여 미래 이벤트를 예측하여 예측 결과를 생성하는 단계(S420) 및 예측 결과와 실제 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단하는 단계(S430)를 포함하여 이루어질 수 있다. Referring to FIG. 4 , the predictive auto-regression-based real-time enterprise information system user abnormal behavior detection method performed by a computer system according to an embodiment uses two consecutive session events in the enterprise information system to provide a current event and a future event (S410) of using the current event as a predictive auto-regression model (PAM) to predict the future event to generate a predicted result (S420) and comparing the predicted result with the actual future event to determine the user's abnormal behavior in real time (S430).

실시예들에 따르면 현재 이벤트를 예측 자동 회귀 모델(PAM)에 입력하여 미래 이벤트를 예측하고, 예측 결과와 실제 미래 이벤트를 비교함으로써 실시간으로 사용자 이상행위를 판단할 수 있다. According to embodiments, by inputting a current event into a predictive automatic regression model (PAM), predicting a future event, and comparing the predicted result with an actual future event, it is possible to determine an abnormal user behavior in real time.

일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 방법은 도 3에서 설명한 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템을 예를 들어 보다 상세히 설명할 수 있다. 일 실시예에 따른 예측 자동 회귀 기반 실시간 기업정보시스템 사용자 이상행위 탐지 시스템(300)은 전처리부(310), 예측 자동 회귀 모델(320) 및 사용자 이상행위 탐지부(330)를 포함하여 이루어질 수 있다. The predictive automatic regression-based real-time enterprise information system user anomaly detection method according to an embodiment may be described in more detail with an example of the predictive automatic regression-based real-time enterprise information system user anomaly detection system according to the embodiment described in FIG. have. The predictive automatic regression-based real-time enterprise information system user anomaly detection system 300 according to an embodiment may include a preprocessor 310 , a predictive automatic regression model 320 and a user anomaly detection unit 330 . .

단계(S410)에서, 전처리부(310)는 기업정보시스템에서 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용할 수 있다. 즉, 연속된 2개의 세션 이벤트를 사용하며, 앞의 세션 이벤트를 현재 이벤트로 사용하고, 뒤에 세션 이벤트를 미래 이벤트로 사용할 수 있다. In step S410, the preprocessor 310 may use two consecutive session events in the enterprise information system as a current event and a future event. That is, two consecutive session events may be used, the previous session event may be used as a current event, and the subsequent session event may be used as a future event.

또한, 전처리부(310)는 기업정보시스템에서 원시 시스템 로그를 dense 임베딩 벡터(Dense Embedding Vector, DEV)로 인코딩하고, dense 임베딩 벡터를 이용하여 연속된 2개의 세션 이벤트를 사용하여 현재 이벤트 및 미래 이벤트로 사용할 수 있다. In addition, the preprocessor 310 encodes the raw system log into a dense embedding vector (DEV) in the corporate information system, and uses two consecutive session events using the dense embedding vector to create a current event and a future event. can be used as

단계(S420)에서, 예측 자동 회귀 모델(320)은 현재 이벤트를 예측 자동 회귀 모델(PAM)에 입력하여 미래 이벤트를 예측하여 예측 결과를 생성할 수 있다. 예측 자동 회귀 모델(320)은 입력되는 현재 이벤트를 이용하여 미래 이벤트를 예측하도록 훈련될 수 있다. In step S420 , the predictive autoregression model 320 may input a current event to the predictive autoregression model (PAM) to predict a future event to generate a prediction result. The predictive autoregression model 320 may be trained to predict future events using input current events.

예측 자동 회귀 모델(320)은 인코더 및 예측기를 포함하여 이루어질 수 있다. 인코더는 현재 이벤트를 인코딩하여 잠재 특징을 추출하고, 예측기는 추출된 잠재 특징에 예측 변수를 사용하여 예측 결과를 생성할 수 있다.The predictive autoregression model 320 may include an encoder and a predictor. An encoder may encode a current event to extract a latent feature, and a predictor may generate a prediction result by using a predictor variable in the extracted latent feature.

단계(S430)에서, 사용자 이상행위 탐지부(330)는 예측 결과와 실제 미래 이벤트를 비교하여 실시간으로 사용자 이상행위를 판단할 수 있다. 사용자 이상행위 탐지부(330)는 평균 제곱 오차(MSE)를 기반으로 예측 결과와 실제 미래 이벤트를 비교하여 산정된 오류가 기설정된 임계값 이상인 경우 사용자 이상행위로 판단될 수 있다. In step S430, the user abnormal behavior detection unit 330 may determine the user abnormal behavior in real time by comparing the prediction result with the actual future event. The user anomaly detection unit 330 may determine a user anomaly when an error calculated by comparing a prediction result and an actual future event based on the mean square error (MSE) is equal to or greater than a preset threshold.

사용자 이상행위 탐지부(330)는 예측 오류 손실을 산정하여 예측 결과와 실제 미래 이벤트의 차이를 판단할 수 있다. 또한, 사용자 이상행위 탐지부(330)는 인코딩 및 디코딩 과정에서 생성되는 잠재 특징의 불확실성을 낮추기 위해 자동 회귀 손실을 적용하며, 자동 회귀 손실은 실제 미래 이벤트의 특징을 추출했을 때의 결과와 현재 이벤트를 통한 예측을 위한 특징을 도출했을 때의 결과의 차이일 수 있다. The user anomaly detection unit 330 may determine the difference between the prediction result and the actual future event by calculating the prediction error loss. In addition, the user anomaly detection unit 330 applies an auto-regression loss to reduce the uncertainty of the latent feature generated in the encoding and decoding process, and the auto-regression loss is the result of extracting the features of the actual future event and the current event. It may be the difference in the result when the features for prediction are derived through

실시예들은 이벤트 예측 기반 실시간으로 기업정보시스템 사용자 이상행위 탐지 방법을 제공하며, 주어진 SAL 로그를 기반으로 예측 자동 회귀 모델은 이전 이벤트에 대한 다음 이벤트의 예측 정확도를 높이는 측면으로 학습할 수 있다. 이에 따라 이벤트 예측 모델을 구축할 수 있다. 실시예들은 실시간으로 발생하는 이벤트에 대해 다음 이벤트를 예측하고 예측된 이벤트와 실제 발생한 이벤트의 오류를 계산할 수 있다. 그 결과는 매 입력되는 이벤트에 대한 이상의 정도를 시각화 가능하다.The embodiments provide an event prediction-based real-time corporate information system user anomaly detection method, and a predictive autoregression model based on a given SAL log can learn in terms of increasing the prediction accuracy of the next event with respect to the previous event. Accordingly, an event prediction model can be built. Embodiments may predict a next event with respect to an event occurring in real time, and may calculate an error between the predicted event and the actual event. As a result, it is possible to visualize the degree of anomaly for every input event.

일반적으로 ERP 시스템 로그는 영숫자 조합으로 표시된다. 그러나 이러한 유형의 표현은 계산 모델의 입력으로 받아들여질 수 없다. 따라서 시스템 로그를 딥러닝 모델에 적용할 수 있는 신호로 변경해야 한다. 원시 시스템 로그를 변경하기 위해 원핫 인코딩은 분류된 로그를 신호로 변경하는 가장 간단한 방법으로 간주할 수 있다. Du(비특허문헌 1)는 간단하고 효율적이므로 컴퓨팅 시스템의 원시 시스템 로그를 변환하기 위해 원핫 인코딩을 활용한다. 불행히도, 단일 핫 벡터 인코딩의 방법론적 단순성 때문에, 새로운 클래스에 대한 새로운 차원을 확장할 때 차원의 저주(curse of dimensionality)로 간주되는 경우가 있다. 그것의 방법론적 특성이 원핫 벡터를 사용하는 클래스를 나타내기 때문에 인코딩 공간의 크기는 신호 클래스의 수에 선형적으로 비례한다. 따라서, 계산 비용 측면에서 비효율적이다.Typically, ERP system logs are displayed in alphanumeric combinations. However, this type of representation cannot be accepted as input to a computational model. Therefore, it is necessary to change the system log into a signal that can be applied to the deep learning model. To change the raw system log, one-hot encoding can be considered as the simplest way to change the classified log into a signal. Since Du (Non-Patent Document 1) is simple and efficient, it utilizes one-hot encoding to transform the raw system log of a computing system. Unfortunately, because of the methodological simplicity of single hot vector encoding, it is sometimes considered a curse of dimensionality when extending a new dimension for a new class. The size of the encoding space is linearly proportional to the number of signal classes because its methodological properties indicate classes using one-hot vectors. Therefore, it is inefficient in terms of computational cost.

이러한 문제를 극복하기 위해 원시 시스템 로그를 사전 처리하기 위해 dense 벡터 임베딩(비특허문헌 2)을 사용할 수 있다. 본 실시예에서 데이터 사전 처리 작업은 다음과 같이 수행된다. 처음에 원시 ERP 시스템 로그는 양의 정수 h = {h_t}t=1:n에 매핑되며, 여기서 h_t와 n은 각각 단일 사용자 세션에 대한 t 번째 로그와 로그 시퀀스의 길이를 나타낸다. 매핑된 정수는 dense 벡터 x = {x_t}t=1:n ∈ X(여기서, x_t∈ R^m)로 인코딩되며, 여기서 m은 신경망 기반 임베딩 네트워크를 사용하는 dense 임베딩 방법에 대해 미리 정의된 차원이다. 인코딩 작업이 끝나면 벡터를 사용하여 예측 모델을 학습한다. To overcome this problem, dense vector embedding (Non-Patent Document 2) can be used to preprocess the raw system log. In this embodiment, the data pre-processing operation is performed as follows. Initially, the raw ERP system log _{maps to a positive integer h = {h t} }t=1:n, where h _t and n represent the length of the t-th log and log sequence, respectively, for a single user session. The mapped integer is encoded as a dense vector x = {x _t }t=1:n ∈ X (where x _t ∈ R ^m ), where m is the predefined for dense embedding method using neural network-based embedding networks. is a dimension After encoding, the vector is used to train a predictive model.

예를 들어 원시 ERP 시스템 로그(복잡한 코드로 표시)는 정수 s = {s_t}t=1:n으로 대체되며, 여기서 s_t와 n은 각각 시간 t에서 단일 로그와 행위의 길이를 나타내며, 신경망을 기반으로 하는 임베딩 네트워크를 사용하여 dense 벡터 x = {x_t}t=1:n으로 다시 변환된다.For example, the raw ERP system log (represented by complex code) is _{replaced by the integer s = {s t} }t=1:n, where s _t and n represent the length of a single log and action at time t, respectively, and the neural network It is transformed back into a dense vector x = {x _t }t=1:n using an embedding network based on .

원핫 인코딩 대신 임베딩 계층을 사용하면 몇 가지 이점이 있다. 첫째, 임의의 입력 신호 사이의 의미적 관계를 나타낼 수 있는 훈련 가능한 임베딩 계층이 활용되며, 입력 신호를 직교 벡터 공간으로 매핑한다(원핫 인코딩으로는 불가능하다). 둘째, dense 임베딩은 원시 신호를 특정 벡터 공간에 임베딩하는 차원 유연성을 갖는다. N-분류 신호가 주어질 때, 원핫 벡터 공간의 차원은 적어도 N보다 크다. 반면에, dense 임베딩은 매핑된 벡터들 사이에 직교성을 강요하지 않기 때문에 N보다 더 작은 차원을 가진 벡터 공간에 신호를 매핑할 수 있다. 한편, 여기에서는 원시 시스템 로그를 사전 처리하기 위해 dense 벡터 임베딩을 이용하는 것으로 설명하고 있으나, 이에 제한되지 않으며, 시계열 데이터를 딥러닝에 적용할 수 있도록 인코딩하는 다른 방법(예컨대, 원핫 인코딩(one-hot encoding))을 사용하는 것이 가능하다.There are several advantages to using an embedding layer instead of one-hot encoding. First, a trainable embedding layer that can represent semantic relationships between arbitrary input signals is utilized, and maps the input signals into an orthogonal vector space (which is not possible with one-hot encoding). Second, dense embeddings have dimensional flexibility to embed raw signals into a specific vector space. Given an N-classification signal, the dimension of the one-hot vector space is at least greater than N. On the other hand, since dense embeddings do not enforce orthogonality between mapped vectors, signals can be mapped into a vector space with dimensions smaller than N. Meanwhile, although it is described here that dense vector embedding is used to pre-process the raw system log, it is not limited thereto, and another method of encoding time series data to be applicable to deep learning (eg, one-hot encoding) encoding)) can be used.

다양한 이상 감지 방법(비특허문헌 3) 또는 특이치 감지 접근법과 유사하게, 실시예들에 따른 방법은 정상적인 이벤트를 캡처하기 위한 모델을 도출하는 것을 목표로 하고, 그런 다음 비정상적인 샘플이 제공될 때 더 큰 오류 또는 더 낮은 가능성을 예상한다. Similar to various anomaly detection methods (Non-Patent Document 3) or outlier detection approaches, the method according to embodiments aims to derive a model for capturing a normal event, and then further Expect a large error or a lower probability.

도 5는 일 실시예에 따른 ERP 시스템에 대해 제안된 AIED을 나타내는 도면이다. 5 is a diagram illustrating an AIED proposed for an ERP system according to an embodiment.

도 5를 참조하면, ERP 시스템(500)에 대해 주어진 이벤트 샘플 x(501, 502, 503)에서 AIED는 다음 식과 같이 계산 오류 또는 시스템 관리자가 수동으로 설정한 임계값(τ)과 비교(508, 509)하여 수행될 수 있다.Referring to FIG. 5 , in a given event sample x (501, 502, 503) for the ERP system 500, AIED is compared with a calculation error or a threshold value (τ) manually set by the system administrator as shown in the following equation (508, 509) can be performed.

여기서, F는 주어진 정규 사건 샘플 X^nor에 대한 판별 모델 또는 확률 모델에 의해 정의될 수 있다. 모델 유형(판별 모델 또는 확률 모델 등)에 따라 결과는 오류 또는 가능성으로 이해될 수 있다. 이 방법론으로서, 주어진 정상 이벤트의 모든 특징을 포함할 수 있는 모델을 구축하는 것이 중요하다.Here, F can be defined by a discriminant model or a probabilistic model for a given normal event sample X ^{nor .} Depending on the type of model (such as a discriminant model or a probabilistic model), the results can be understood as errors or probabilities. As a methodology, it is important to build a model that can include all features of a given normal event.

여기에서는 재구성 설정보다는 예측 설정을 사용하는데, 이는 주어진 데이터셋에서 확률 분포를 컴파일하는 가장 일반적인 방법이다(비특허문헌 4). 모델을 도출하기 위해 예측 설정을 활용할 수 있는 이점이 있다. 재구성 세트에는 제한된 입력과 출력이 필요하지만, 예측 설정은 입력과 출력 사이의 시간 간격을 변경하여 각 입력에 해당하는 출력을 결정할 수 있기 때문에 입력과 출력의 상관 관계를 학습하는 데 더 유연하다. 이 방법론적 특성은 입력 샘플의 보다 다양한 패턴을 제공함으로써 모델의 일반화 성능을 향상시킬 수 있다.Here, a prediction setting is used rather than a reconstruction setting, which is the most common method of compiling a probability distribution in a given dataset (Non-Patent Document 4). There are advantages to using predictive settings to derive models. Although the reconstruction set requires limited inputs and outputs, the prediction setup is more flexible in learning the correlation of inputs and outputs because it can determine the output corresponding to each input by changing the time interval between the input and output. This methodological property can improve the generalization performance of the model by providing a more diverse pattern of input samples.

제안된 방법은 처음에 현재 이벤트(502)를 인코더 f^enc(504)를 이용하여 인코딩하여 잠재 특징(feature, 505)을 추출하고 예측 변수 f^pre(506)을 사용하여 예측 결과(507)를 생성할 수 있다. 현재 이벤트(502)에 대한 인코딩 프로세스는 다음 식과 같이 표시될 수 있다.The proposed method initially encodes a current event 502 using an encoder f ^enc (504) to extract a latent feature (feature, 505) and generates a prediction result (507) using a predictor ^{variable f pre (506).} can do. The encoding process for the current event 502 may be expressed as the following equation.

[수학식 1][Equation 1]

여기서, f^enc는 제안 방법의 인코더 모듈(504)이다. x^C와 z는 각각 현재 이벤트(502) 샘플과 추출된 잠재 특징(505)을 나타낸다. 제안된 방법은 잠재 특징 z(505)를 사용하여 예측 변수 f^pre(506)를 통해 다음 식과 같이 미래 이벤트(507)를 예측할 수 있다.Here, f ^enc is the encoder module 504 of the proposed method. x ^C and z represent the current event 502 sample and the extracted latent feature 505, respectively. The proposed method can predict the future event 507 using the latent feature z (505) through the predictor variable f ^pre (506) as shown in the following equation.

[수학식 2][Equation 2]

여기서, f^pre는 PAM의 예측 변수(506)를 나타내고

는 예측 결과(507)를 나타낸다. where f ^pre denotes the predictor variable 506 of PAM and

denotes the prediction result 507 .

반복 신경망(RNN) 구조를 인코더 f^enc(504)와 예측 변수 f^pre(506)에 활용하여 순차 데이터를 처리하며, 시계열 데이터를 처리하는 데 일반적으로 사용되는 접근 방식이다(비특허문헌 3). 인코더 f^enc(504)와 예측 변수 f^pre(506)는 LSTM 셀을 사용하여 컴파일될 수 있다(비특허문헌 5).A recurrent neural network (RNN) structure is used for the encoder f ^enc (504) and the predictor f ^pre (506) to process sequential data, and is an approach commonly used to process time series data (Non-Patent Document 3). The encoder f ^enc (504) and the predictor f ^pre (506) can be compiled using an LSTM cell (Non-Patent Document 5).

LSTM은 입력 게이트 i, foget 게이트 g, 출력 게이트 o로 구성된다. t 번째 시간 단계에서 입력 게이트 i_t는 t 시간에서 현재 내부 상태 s_t에 대한 새로운 입력의 영향을 나타내는 가중치를 결정한다. 입력 게이트의 프로세스는 다음 식과 같이 표시될 수 있다.The LSTM consists of an input gate i, a foget gate g, and an output gate o. At the t-th time step, the input gate i _t determines the weight representing the influence of the new input on the current internal state s _{t at time t.} The process of the input gate can be expressed as the following equation.

[수학식 3][Equation 3]

여기서, σ(·)는 입력을 0과 1 사이의 값으로 매핑하는 시그모이드 함수이다. 값이 1에 가까우면 입력 특징

가 더 중요해진다. Wis와 Wih는 상태 s_t-1및 은닉 상태 α_t-1과 관련된 가중치 파라미터이다. W_ix와 b_i는 가중치 행렬과 공간 특징

에 대한 bias이다.Here, σ(·) is a sigmoid function that maps the input to a value between 0 and 1. Values close to 1 indicate input features

becomes more important Wis and Wih are weight parameters associated _{with state s t-1} and hidden state α _t-1. W _ix and b _i are weight matrices and spatial features

is the bias for

[수학식 4][Equation 4]

여기서, W_gs, W_gh, W_gx는 각각 s_t-1, α_t-1,

에 대한 가중치 행렬을 나타낸다. _bg는 바이어스를 나타낸다. 이러한 입력을 활용하여 각 LSTM 셀의 s_t-1, α_t-1,

, g_t, s_t를 다음 식과 같이 업데이트한다.where W _gs , W _gh , and W _gx are s _t-1 , α _t-1 ,

Represents the weight matrix for . _b g indicates bias. Utilizing these inputs, the s _t-1 , α _t-1 ,

, g _t , and s _t are updated as follows.

[수학식 5][Equation 5]

여기서, 는 요소별 곱을 나타내고, W_sh와 W_sx는 각각 히든 상태 α_t-1과 공간 특징

와 관련된 가중치 행렬을 나타낸다. Here, denotes the product _{of each element, and W sh} and W _sx are the hidden state α _t-1 and spatial features, respectively.

Represents the weight matrix associated with .

출력 게이트 o_t는 현재 상태가 미래 상태에 미치는 영향을 결정하며 다음 식과 같이 정의된다.The output gate o _t determines the effect of the present state on the future state and is defined as the following equation.

[수학식 6][Equation 6]

여기서 W_cx, W_ch, W_cs는 각각 s_t, h_t-1,

에 해당하는 가중치 파라미터를 나타낸다. b_c는 출력 게이트의 bias를 나타낸다. LSTM의 은닉 상태는 다음 식과 같이 계산된다.where W _cx , W _ch , and W _cs are s _t , h _t-1 ,

Indicates the weight parameter corresponding to . b _c represents the bias of the output gate. The hidden state of the LSTM is calculated as follows.

[수학식 7][Equation 7]

여기서, ω는 ReLu 활성화 함수이다. LSTM에 사용되는 RNN 구조는 게이트 반복 유닛(GRU) 구조(비특허문헌 6)로 대체될 수 있다. 실시예들은 실험에 대한 메모리 셀 유형에 따라 AIED에 대한 ablation 연구를 제공할 것이다.Here, ω is the ReLu activation function. The RNN structure used in the LSTM may be replaced with a gate repeating unit (GRU) structure (Non-Patent Document 6). Examples will provide an ablation study for AIED depending on the memory cell type for the experiment.

인코더와 예측 변수를 사용하여 이벤트 예측이 오버되면 예측 오류를 사전 정의된 임계값 β와 비교하여 비정상적인 이벤트를 감지한다. 도 5에서와 같이 예측 결과와 해당 미래 이벤트를 비교함으로써 예측 오류를 얻는다. 오류는 평균 제곱 오차(MSE)를 기반으로 공식화되며, 이 프로세스는 다음 식과 같이 표시된다.Using the encoder and predictor variables, when the event prediction is over, the prediction error is compared to a predefined threshold β to detect an anomalous event. As shown in FIG. 5 , a prediction error is obtained by comparing the prediction result with the corresponding future event. The error is formulated based on the mean squared error (MSE), and this process is expressed as

[수학식 8][Equation 8]

여기서, x^F와

는 각각 제안된 방법에 의한 미래 사건과 예측 결과를 나타낸다. lx는 x^F의 시간 길이를 나타낸다.where, x ^F and

represents future events and prediction results by the proposed method, respectively. lx represents the time length of x ^{F .}

따라서 오류가 있는 AIED 워크플로우는 다음 식과 같이 정의된다.Therefore, the AIED workflow with errors is defined as:

[수학식 9][Equation 9]

여기서, τ는 이벤트 이상 임계값이며, 수동으로 결정한다. 계산된 오류와 수동으로 정의된 임계값 τ를 비교하여 이상 이벤트가 감지되므로 AIED의 성능은 임계값에 따라 달라진다. 한편, 여기에서는 평균 제곱 오차(MSE)를 기반으로 예측결과의 오류를 판단하고 있으나, 이에 제한되지 않으며, 일반화된 오차율 함수를 적용할 수 있다.Here, τ is the event anomaly threshold, which is determined manually. Anomalous events are detected by comparing the calculated error with a manually defined threshold τ, so the performance of the AIED depends on the threshold. Meanwhile, here, the error of the prediction result is determined based on the mean square error (MSE), but the present invention is not limited thereto, and a generalized error rate function may be applied.

주어진 정상 이벤트 데이터셋에서 강력한 확률 모델을 얻기 위해, 실시예들은 예측 방법을 사용한다. 정확한 AIED의 전제 조건은 현재 이벤트(502) x^C 및 해당 미래 이벤트 x^F와 관련된 확률 p(x^F|x^C)를 계산하기 위한 최적의 예측 모델을 도출하는 것이다. 도 5에서 제시한 방법의 워크플로우와 같이, 인코더 f^enc는 처음에 현재 사건 x^C를 잠재 특징으로 f^enc: x^C → z와 같이 매핑한 다음, 예측 변수 f^pre는 예측 결과를 f^pre : z →

로 출력한다. 위의 주석을 사용하여 p(x^F|x^C)를 다음 식과 같이 p(z|x^C)와 p(x^F|z,x^C)로 재구성할 수 있다.To obtain a robust probabilistic model from a given normal event dataset, embodiments use a predictive method. A prerequisite for accurate AIED is to derive an optimal predictive model for calculating the probability p(x ^F |x ^C ^{) associated with the current event 502 x C} and the corresponding future event x ^{F .} As in the workflow of the method presented in Fig. 5, the encoder f ^enc first maps the current event x ^C as a latent feature as f ^enc : x ^C → z, and then the predictor f ^pre converts the prediction result to f ^pre : z →

output as Using the above annotation, p(x ^F |x ^C ) can be reconstructed into p(z|x ^C ) and p(x ^F |z,x ^{C ) as follows:}

[수학식 10][Equation 10]

여기서, p(x^C)와 p(z)는 각각 x^C와 z의 사전 확률을 나타낸다. p(x^C)는 현재 이벤트(502)로 정의된 주어진 이벤트 샘플에 의해 도출될 수 있다. 따라서 최적의 p(z|x^C), 최적의 p(x^F|z) 및 p(z)를 도출하려면 p(x^F|x^C)가 필요하며, 제안된 예측 자동 회귀 모델(PAM)의 관점에서 이 문제를 다룬다.Here, p(x ^C ) and p(z) represent the prior probabilities of ^{x C and z, respectively.} p(x ^C ) may be derived by a given event sample defined as the current event 502 . Therefore, to derive optimal p(z|x ^C ), optimal p(x ^F |z) and p(z), p(x ^F |x ^C ) is required, and the We deal with this issue from a point of view.

도 6는 일 실시예에 따른 PAM의 구조적 세부 사항을 나타내는 도면이다. 6 is a diagram illustrating structural details of a PAM according to an embodiment.

도 6를 참조하면, PAM은 예측 오류 손실 L_P와 자동 회귀 손실 L_A를 계산하기 위한 두 개의 주요 파이프라인으로 구성되며, 이것들은 모두 인코더(610)와 예측기(620)의 예측 변수 아래에서 작동한다. 여기서, 예측 오류 손실 L_P은 예측 값(621)과 실제 값(631)이 얼마나 차이 나는지를 판단할 수 있다. 입력/출력 예측 오류만 판단하는 경우 인코딩 및 디코딩 과정에서 생성되는 잠재 특징인 z의 불확실성이 늘어나게 된다. z의 불확실성을 낮추기 위해서 미래 이벤트의 형상이 되도록 하는 자동 회귀 손실 L_A을 제공할 수 있다. 실제 미래 이벤트(631)의 특징을 추출했을 때의 결과(

)와 현재 이벤트(611)를 통한 예측을 위한 특징을 도출했을 때의 결과(z)가 최대한 같을수록 예측 결과가 좋게 된다.Referring to FIG. 6 , PAM consists of _{two main pipelines for calculating prediction error loss L P} and autoregression loss L _A , both of which operate under the predictor variables of encoder 610 and predictor 620 . do. Here, the prediction error loss L _P may determine how much the predicted value 621 and the actual value 631 are different. If only the input/output prediction error is determined, the uncertainty of z, which is a latent feature generated in the encoding and decoding process, increases. In order to lower the uncertainty of z, we can provide an _{autoregressive loss L A that allows the shape of future events.} The result of extracting the features of the actual future event (631)

) and the result z when the feature for prediction is derived through the current event 611 is the same as possible, the better the prediction result.

사전 처리된 이벤트 x_1:2k가 주어지면(여기서, n = 2k) 현재 이벤트 x_1:k([수학식 1]의 x^C)(611)와 미래 이벤트 x_k+1:2k([수학식 8]의 x^F)(621)로 나뉜다. 그런 다음 현재 이벤트(611)는 인코더(610) f^enc([수학식 1])를 통해 잠재 특징(z)으로 인코딩된다. 잠재 특징(z)은 예측 결과

_k+1:2k([수학식 2])를 생성하기 위해 예측 변수에 적용될 수 있다.Given a preprocessed event x _1:2k (where n = 2k), the current event x _1:k ^{(x C in} [Equation 1])(611) and the future event x _k+1:2k ([Equation 1] 8] in x ^F ) (621). Then, the current event 611 is encoded as a latent feature z through the ^{encoder 610 f enc ([Equation 1]).} The latent feature (z) is the predicted result

It can be applied to the predictor variable to generate _{k+1:2k ([Equation 2]).}

[수학식 11][Equation 11]

L_P를 최소화하면 기본적으로 적절한 p(z|x^C)와 p(x^F|z)를 찾을 수 있다.By minimizing L _{P ,} we can basically find suitable p(z|x ^C ) and p(x ^F |z).

그러나, 예측 오차 손실만을 사용하는 것은 잠재 공간에 대한 추상적인 표현을 고려할 수 없기 때문에 전역화된 특징을 학습하는 데 충분하지 않을 수 있다. 방법론적으로, 예측 오차 손실은 가능한 가깝게 추상화되거나 일반화되지 않는 샘플 공간의 오류를 최소화한다. 이 오류 최소화 프로세스 동안 추상화된 특징은 원래 입력보다 더 전역화된 특징인 잠재 공간에 매핑된다. 그러나, 이전 연구에 따르면, 잠재 특징 공간이 임의로 구성되는 경우, 무작위성으로 인해 입력 샘플의 재구성에 불확실성이 발생할 수 있다. 결과적으로, 불확실성은 예측 성능을 저하시키고, 잘 재구성되지 않은 결과는 AIED 성능을 저하시킬 수 있다.However, using only the prediction error loss may not be sufficient for learning globalized features because abstract representations of the latent space cannot be considered. Methodologically, the prediction error loss minimizes errors in the sample space that are not abstracted or generalized as closely as possible. During this error minimization process, abstracted features are mapped into latent space, which are more globalized features than the original input. However, according to previous studies, when the latent feature space is constructed arbitrarily, uncertainty may arise in the reconstruction of the input sample due to randomness. Consequently, uncertainty can degrade prediction performance, and poorly reconstructed results can degrade AIED performance.

따라서, 실시예들은 불확실성을 줄이기 위해 잠재 특징에 자동 회귀 손실 L_A를 적용할 수 있다. 실시예들은 z로 표기된 예측에 사용되는 잠재 특징의 분포를 실제 미래 이벤트 x_k+1:2k에 임베딩된 실제 분포에 가깝게 밀어 넣는다. 예측 변수의 역방향 작동은 주어진 미래 이벤트에서 잠재 특징을 취하는 인코더의 역할로 작용한다. 예측 변수 f^pre의 역처리는 다음 식과 같이 표시될 수 있다. Accordingly, the embodiments can be applied to automatically return loss L _A potential feature to reduce the uncertainty. Embodiments push the distribution of latent features used for prediction, denoted by z, close to the actual distribution embedded in the actual _{future event x k+1:2k .} The reverse operation of the predictor acts as an encoder to take a latent feature in a given future event. The reverse processing of the predictor variable f ^pre can be expressed as the following equation.

[수학식 12][Equation 12]

여기서, f^-pre는 예측 변수 f^pre의 역방향 작동을 나타내며,

는 훈련 단계에서 주어진 미래 이벤트에서 추출한 잠재 특징이다. 이 단계에서는 새 모델이 필요하지 않으므로 모델의 복잡도에 영향을 미치지 않는다. 역방향 작동은 도 6에 도시된 바와 같이 원래 예측 모델 f^pre의 가중치 공유를 기반으로 수행된다.where f ^-pre denotes the reverse action of the predictor f ^{pre ,}

is a latent feature extracted from a given future event in the training phase. No new model is required at this stage, so it does not affect the complexity of the model. The backward operation is performed based on weight sharing of the ^{original prediction model f pre} , as shown in FIG. 6 .

두 잠재 특징의 분포 간 차이를 최소화하면서 Kullback-Leibler(KL) 발산을 사용하여 차이를 측정한다(비특허문헌 7). 주어진 z와

에서, 잠재 특징에 대한 자동 회귀 손실은 다음 식과 같이 정의된다.While minimizing the difference between the distributions of two latent features, the difference is measured using Kullback-Leibler (KL) divergence (Non-Patent Document 7). given z and

In , the autoregression loss for the latent feature is defined as

[수학식 13][Equation 13]

여기서, D_KL은 K_L 발산 모듈을 나타내고, z_i와

_j는 각각 잠재 특징 z와

에서 i 번째 및 j 번째 요소를 나타낸다. K_L 분산의 최소값은 0이며, 이는 두 분포가 정확히 같다는 것을 의미한다. where D _KL represents the K _L divergence module, z _i and

_j is the latent feature z and

represents the i-th and j-th elements in . The minimum value of the K _L variance is 0, which means that the two distributions are exactly equal.

따라서 총 손실 함수는 다음과 같이 예측 오차 손실과 자동 회귀 손실의 조합으로 정의된다.Therefore, the total loss function is defined as the combination of the prediction error loss and the autoregression loss as follows.

[수학식 14][Equation 14]

여기서, λ는 두 손실 사이의 균형 가중치이다. 본 발명에서는 0.1을 최상의 성능을 위한 균형 가중치의 값으로 설정하였다.where λ is the balance weight between the two losses. In the present invention, 0.1 is set as the value of the balance weight for the best performance.

Apaydin 등에 따르면, LSTM을 사용하는 PAM의 모델 복잡도는 W = n_c Х n_c Х 3 + n_c Х n_o + n_c Х 2인 O(2W)이다. 여기서, n_c, n_i, n_o는 각각 LSTM 셀, 입력 단위 및 출력 단위의 수를 나타낸다. 또한, 신경망을 통한 dense 벡터 임베딩의 복잡성은 O(N²)이며, 여기서 N은 신경망을 통한 dense 벡터 임베딩에 사용되는 은닉 계층의 차원이다. 따라서 제안된 방법의 모델 복잡성은 O(2W + N²)이다. 그러나 모델 복잡도는 인코더와 예측기에 통합된 메모리 셀의 수를 늘리거나 줄일 수 있기 때문에 입력 이벤트 샘플의 시간 길이에 따라 변경될 수 있다. 이론적 모델 복잡도와 더불어 실제 실행 속도는 작업에 대한 실시간 애플리케이션으로서 처리 속도를 입증하는 데 필수적이다. 실험에서 실시예들에 따른 방법의 실시간 처리 능력이 타당함을 보여준다.According to Apaydin et al., the model complexity of PAM using LSTM is O(2W) _{with W = n c} Х n _c Х 3 + n _c Х n _o + n _{c Х 2.} Here, n _c , n _i , and n _o represent the number of LSTM cells, input units, and output units, respectively. Also, the complexity of dense vector embedding via neural networks is O(N ² ), where N is the dimension of the hidden layer used for dense vector embedding via neural networks. Therefore, the model complexity of the proposed method is O(2W + N ² ). However, the model complexity can change with the length of time of the input event samples as it can increase or decrease the number of memory cells integrated into the encoder and predictor. In addition to the theoretical model complexity, the actual execution speed is essential to demonstrate processing speed as a real-time application to the task. The experiment shows that the real-time processing capability of the method according to the embodiments is reasonable.

이상에서 설명된 장치는 하드웨어 구성요소, 소프트웨어 구성요소, 및/또는 하드웨어 구성요소 및 소프트웨어 구성요소의 조합으로 구현될 수 있다. 예를 들어, 실시예들에서 설명된 장치 및 구성요소는, 예를 들어, 프로세서, 컨트롤러, ALU(arithmetic logic unit), 디지털 신호 프로세서(digital signal processor), 마이크로컴퓨터, FPA(field programmable array), PLU(programmable logic unit), 마이크로프로세서, 또는 명령(instruction)을 실행하고 응답할 수 있는 다른 어떠한 장치와 같이, 하나 이상의 범용 컴퓨터 또는 특수 목적 컴퓨터를 이용하여 구현될 수 있다. 처리 장치는 운영 체제(OS) 및 상기 운영 체제 상에서 수행되는 하나 이상의 소프트웨어 애플리케이션을 수행할 수 있다. 또한, 처리 장치는 소프트웨어의 실행에 응답하여, 데이터를 접근, 저장, 조작, 처리 및 생성할 수도 있다. 이해의 편의를 위하여, 처리 장치는 하나가 사용되는 것으로 설명된 경우도 있지만, 해당 기술분야에서 통상의 지식을 가진 자는, 처리 장치가 복수 개의 처리 요소(processing element) 및/또는 복수 유형의 처리 요소를 포함할 수 있음을 알 수 있다. 예를 들어, 처리 장치는 복수 개의 프로세서 또는 하나의 프로세서 및 하나의 컨트롤러를 포함할 수 있다. 또한, 병렬 프로세서(parallel processor)와 같은, 다른 처리 구성(processing configuration)도 가능하다.The device described above may be implemented as a hardware component, a software component, and/or a combination of the hardware component and the software component. For example, devices and components described in the embodiments may include, for example, a processor, a controller, an arithmetic logic unit (ALU), a digital signal processor, a microcomputer, a field programmable array (FPA), It may be implemented using one or more general purpose or special purpose computers, such as a programmable logic unit (PLU), microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. A processing device may also access, store, manipulate, process, and generate data in response to execution of the software. For convenience of understanding, although one processing device is sometimes described as being used, one of ordinary skill in the art will recognize that the processing device includes a plurality of processing elements and/or a plurality of types of processing elements. It can be seen that can include For example, the processing device may include a plurality of processors or one processor and one controller. Other processing configurations are also possible, such as parallel processors.

소프트웨어는 컴퓨터 프로그램(computer program), 코드(code), 명령(instruction), 또는 이들 중 하나 이상의 조합을 포함할 수 있으며, 원하는 대로 동작하도록 처리 장치를 구성하거나 독립적으로 또는 결합적으로(collectively) 처리 장치를 명령할 수 있다. 소프트웨어 및/또는 데이터는, 처리 장치에 의하여 해석되거나 처리 장치에 명령 또는 데이터를 제공하기 위하여, 어떤 유형의 기계, 구성요소(component), 물리적 장치, 가상 장치(virtual equipment), 컴퓨터 저장 매체 또는 장치에 구체화(embody)될 수 있다. 소프트웨어는 네트워크로 연결된 컴퓨터 시스템 상에 분산되어서, 분산된 방법으로 저장되거나 실행될 수도 있다. 소프트웨어 및 데이터는 하나 이상의 컴퓨터 판독 가능 기록 매체에 저장될 수 있다.Software may comprise a computer program, code, instructions, or a combination of one or more thereof, which configures a processing device to operate as desired or is independently or collectively processed You can command the device. The software and/or data may be any kind of machine, component, physical device, virtual equipment, computer storage medium or apparatus, to be interpreted by or to provide instructions or data to the processing device. may be embodied in The software may be distributed over networked computer systems and stored or executed in a distributed manner. Software and data may be stored in one or more computer-readable recording media.

실시예에 따른 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 실시예를 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. The method according to the embodiment may be implemented in the form of program instructions that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, etc. alone or in combination. The program instructions recorded on the medium may be specially designed and configured for the embodiment, or may be known and available to those skilled in the art of computer software. Examples of the computer-readable recording medium include magnetic media such as hard disks, floppy disks and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floppy disks. - includes magneto-optical media, and hardware devices specially configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like. Examples of program instructions include not only machine language codes such as those generated by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like.

이상과 같이 실시예들이 비록 한정된 실시예와 도면에 의해 설명되었으나, 해당 기술분야에서 통상의 지식을 가진 자라면 상기의 기재로부터 다양한 수정 및 변형이 가능하다. 예를 들어, 설명된 기술들이 설명된 방법과 다른 순서로 수행되거나, 및/또는 설명된 시스템, 구조, 장치, 회로 등의 구성요소들이 설명된 방법과 다른 형태로 결합 또는 조합되거나, 다른 구성요소 또는 균등물에 의하여 대치되거나 치환되더라도 적절한 결과가 달성될 수 있다.As described above, although the embodiments have been described with reference to the limited embodiments and drawings, various modifications and variations are possible from the above description by those skilled in the art. For example, the described techniques are performed in an order different from the described method, and/or the described components of the system, structure, apparatus, circuit, etc. are combined or combined in a different form than the described method, or other components Or substituted or substituted by equivalents may achieve an appropriate result.

그러므로, 다른 구현들, 다른 실시예들 및 특허청구범위와 균등한 것들도 후술하는 특허청구범위의 범위에 속한다.Therefore, other implementations, other embodiments, and equivalents to the claims are also within the scope of the following claims.

Claims

In the predictive automatic regression-based real-time enterprise information system user abnormal behavior detection method performed by a computer system,
using two consecutive session events in the corporate information system as current events and future events;
generating a prediction result by predicting a future event by inputting the current event into a predictive auto-regression model (PAM); and
Comparing the prediction result and the actual future event to determine the user's abnormal behavior in real time
including,
The step of generating a prediction result by inputting the current event into a predictive automatic regression model (PAM) to predict a future event,
extracting latent features by encoding the current event using an encoder; and
generating the prediction result by using a predictor variable for the extracted latent feature
Including, a user anomaly detection method.

In the predictive automatic regression-based real-time enterprise information system user abnormal behavior detection method performed by a computer system,
using two consecutive session events in the corporate information system as current events and future events;
generating a prediction result by predicting a future event by inputting the current event into a predictive auto-regression model (PAM); and
Comparing the prediction result and the actual future event to determine the user's abnormal behavior in real time
including,
The step of using the two consecutive session events as a current event and a future event is,
Encoding the raw system log by dense embedding vector (DEV) or one-hot encoding in the enterprise information system
including,
Using the two consecutive session events using the dense embedding vector or one-hot encoding as a current event and a future event
Characterized in, the user anomaly detection method.

According to claim 1,
The predictive autoregression model (PAM) is
being trained to predict future events using the input current event
Characterized in, the user anomaly detection method.

delete

According to claim 1,
The step of comparing the prediction result with the actual future event to determine the user's abnormal behavior in real time,
If the error calculated by comparing the prediction result with the actual future event is greater than or equal to a preset threshold, it is determined as a user abnormal behavior
Characterized in, the user anomaly detection method.

According to claim 1,
The step of comparing the prediction result with the actual future event to determine the user's abnormal behavior in real time,
determining the difference between the prediction result and the actual future event by calculating the prediction error loss;
Characterized in, the user anomaly detection method.

According to claim 1,
The step of comparing the prediction result with the actual future event to determine the user's abnormal behavior in real time,
Auto-regression loss is applied to reduce uncertainty of latent features generated during encoding and decoding, and the auto-regression loss is the result of extracting the actual features of the future event and deriving features for prediction through the current event What is the difference in the result when
Characterized in, the user anomaly detection method.

In the predictive automatic regression-based real-time corporate information system user abnormal behavior detection system,
A preprocessor that uses two consecutive session events in the corporate information system as current and future events;
a predictive auto-regression model (PAM) for generating a prediction result by predicting a future event by inputting the current event; and
A user abnormal behavior determination unit that compares the prediction result with the actual future event to determine the user abnormal behavior in real time
including,
The predictive autoregression model (PAM) is
an encoder for encoding the current event to extract latent features; and
A predictor that generates the prediction result by using a predictor variable for the extracted latent feature
Including, user anomaly detection system.

In the predictive automatic regression-based real-time corporate information system user abnormal behavior detection system,
A preprocessor that uses two consecutive session events in the corporate information system as current and future events;
a predictive auto-regression model (PAM) for generating a prediction result by predicting a future event by inputting the current event; and
A user abnormal behavior determination unit that compares the prediction result with the actual future event to determine the user abnormal behavior in real time
including,
The preprocessor is
In the enterprise information system, the raw system log is encoded with a dense embedding vector (DEV) or one-hot encoding, and the continuous Using 2 session events as current event and future event
Characterized in, the user anomaly detection system.

9. The method of claim 8,
The predictive autoregression model (PAM) is
being trained to predict future events using the input current event
Characterized in, the user anomaly detection system.

delete

9. The method of claim 8,
The user abnormal behavior determination unit,
If the error calculated by comparing the prediction result with the actual future event is greater than or equal to a preset threshold, it is determined as a user abnormal behavior
Characterized in, the user anomaly detection system.

9. The method of claim 8,
The user abnormal behavior determination unit,
determining the difference between the prediction result and the actual future event by calculating the prediction error loss;
Characterized in, the user anomaly detection system.

9. The method of claim 8,
The user abnormal behavior determination unit,
Auto-regression loss is applied to reduce uncertainty of latent features generated during encoding and decoding, and the auto-regression loss is the result of extracting the actual features of the future event and deriving features for prediction through the current event What is the difference in the result when
Characterized in, the user anomaly detection system.