KR20190120729A

KR20190120729A - System Modeling Method by Machine Learning using Big data

Info

Publication number: KR20190120729A
Application number: KR1020190089769A
Authority: KR
Inventors: 양영진; 유호동; 김탁곤
Original assignee: (주) 아인스에스엔씨
Priority date: 2019-07-24
Filing date: 2019-07-24
Publication date: 2019-10-24
Also published as: KR102238424B1

Abstract

The present invention relates to a system modelling method having a big data machine learning embedded therein, and the purpose of the present invention is to build a hypothetical model, calculate parameter values verified by using big data obtained by a real system by means of machine learning, and apply the parameter values to the hypothetical model. The system modelling method, which completes a simulation model for a target system through machine learning for big data obtained by operations and observations of the target system on a hypothetical model defined through knowledge acquisition of the target system, comprises the steps of: defining a hypothetical model for a target system by identifying obtainable information related to the target system; learning information required for the hypothetical model by using machine learning algorithms from big data practically obtained from the target system in a function block provided by the hypothetical model; and completing a simulation model for the target system by applying information learned and verified through a machine learning process to the hypothetical model.

Description

System Modeling Method by Machine Learning using Big Data

본 발명은 물리적인 복잡계(대상시스템)을 모델링하고 시뮬레이션하는 기술에 관한 것으로, 보다 구체적으로는 빅데이터 개념을 적용하고 인공지능 기술을 적용하여 대상시스템을 모델링하는 모델을 고도화하는 기술에 관한 것이다.The present invention relates to a technique for modeling and simulating a physical complex system (target system), and more particularly, to a technique for upgrading a model for modeling a target system by applying a big data concept and applying artificial intelligence technology.

산업 현장에서 생산성, 경제성, 안전성 등을 향상시키고자 하는 요구가 확산되고 있다. 최근에는 사물인터넷(IoT), 빅데이터, 인공지능, 사이버 물리 시스템(CPS, Cyber Physical Systems) 등의 기술이 널리 이용되고 있으며, 이들 기술들이 집약된 기술 트렌드로서 디지털 트윈(Digital Twin) 기술이 주목받고 있다.There is a growing demand to improve productivity, economics and safety in industrial sites. Recently, technologies such as the Internet of Things (IoT), big data, artificial intelligence, cyber physical systems (CPS) are widely used, and digital twin technology is attracting attention as a technology trend in which these technologies are concentrated. I am getting it.

디지털 트윈은 물리적 객체(자산, 프로세스 및 시스템 등)들에 대한 디지털 복제(쌍둥이)로서, 수명주기 전체에 걸쳐 대상 객체 요소들의 속성/상태를 유지하며 이들이 어떻게 작동하는 지에 대한 동적 성질을 묘사하는 가상의 모델이라 할 수 있다.Digital twins are digital clones (twins) of physical objects (such as assets, processes, and systems) that maintain virtual attributes that describe the dynamic nature of how they work and maintain the properties / states of the target object elements throughout their lifecycle. It can be said that the model of.

컴퓨팅 시스템에서 구현된 디지털 트윈은 대상 객체(물리적 자산)와 연동하여 현실 상황을 반영하면서 현실에서 발생할 수 있는 상황을 예측하거나 운영을 최적화하는 조건을 알려주는 등 산업 현장에서 다양한 용도로 활용되면서 산업의 경쟁력을 강화하는 수단으로 인식되고 있다.Digital twins implemented in computing systems are used for various purposes in the industrial field, such as linking with target objects (physical assets) to reflect the real situation and informing the conditions for predicting what may happen in the real world or optimizing operations. It is recognized as a means to strengthen competitiveness.

사물인터넷(IoT) 기술과 디지털 트윈 기술은 밀접한 관련이 있다. 사물인터넷 플랫폼 기술의 고도화는 운용 중인 시스템의 센서 데이터를 실시간으로 수집한 후 기계학습/인공지능 기반의 예측, 고장 진단, 최적화, 예지 진단 등 스마트 서비스를 가능하게 한다. 디지털 트윈 기술은 산업 현장에서 이용되는 것을 전제로 시작되었으나, 최근에는 사이버 시티 등 그 적용되는 영역을 더욱 다양한 스마트 서비스로 확대하고자 하는 노력이 시도되고 있다. Internet of Things (IoT) technology and digital twin technology are closely related. The advancement of IoT platform technology enables the collection of real-time sensor data and enables smart services such as machine learning / AI based prediction, fault diagnosis, optimization, and predictive diagnosis. Digital twin technology has been started on the premise of being used in industrial sites, but recently, efforts have been made to expand the applied areas such as cyber city to more various smart services.

이러한 노력의 일환으로서, 미국공개특허 제2017/0286572호 "Digital twin of twinned physical system"에서, 실제 센서의 디지털 트윈을 이용하여 실제 센서의 동작을 시뮬레이션하고, 실세 센서가 작동하지 않게 되었을 때, 디지털 트윈 모델 내의 실제 센서에 대응하는 가상 센서로부터 데이터를 수집하여 실제 센서의 동작을 대체하는 기술이 제안되었다.As part of this effort, U.S. Patent Application Publication No. 2017/0286572 "Digital twin of twinned physical system" uses a digital twin of a real sensor to simulate the motion of a real sensor, and when the actual sensor becomes inoperative, A technique has been proposed to replace the operation of a real sensor by collecting data from a virtual sensor corresponding to a real sensor in a twin model.

이러한 디지털 트윈 모델은 컴퓨팅 시스템과 그 내부의 프로세서에 의하여 외부의 데이터를 학습하는 과정을 수행하고, 학습에 의하여 내부의 모델이 구체적인 파라미터 및 판단 기준을 확보한 후에, 새로운 입력이 주어질 때 모델이 학습 결과를 이용하여 새로운 입력에 대한 출력을 추론하는 과정을 수행한다. The digital twin model performs a process of learning external data by a computing system and a processor therein, and after the internal model acquires specific parameters and criteria by learning, the model is trained when a new input is given. Use the result to infer the output for the new input.

일반적으로 실 세계의 시스템에 대한 동작/성능 분석이나 예측을 수행하기 위해 해당 시스템에 대한 추상화된 모델을 만들고 실행하여 동작/성능 등 관심 측면의 척도를 측정/관찰하게 된다. 이러한 모델을 통한 대상 시스템의 분석/예측 결과가 신뢰성을 확보하기 위해서는 해당 시스템을 얼마나 잘(정확하게) 모델링 하는지가 중요한 관건이 된다. In general, in order to perform an operation / performance analysis or prediction of a real-world system, an abstracted model of the system is created and executed to measure / observe a measure of an aspect of interest such as an operation / performance. In order to ensure reliability of the analysis / prediction of the target system through such a model, it is important to accurately model the system.

모델링의 한 방법으로 대상 시스템이 포함하는 물리적인 법칙이나 동작 규칙 등의 지식을 활용하여 추상화 모델을 만드는 방법이 있는데, 이는 제어된 입력과 해당 출력 사이의 인과관계를 모델로 표현할 수 있는 모델링 및 시뮬레이션(M&S) 기반의 방법이다. 이 방법은 모델링 대상이 되는 시스템에 대한 자세한 정보들이 가용해야 한다는 제한사항이 있다. 또한 모델링 및 시뮬레이션(M&S) 기반의 방식으로 모델을 구축하는 경우에 그 모델의 유효성을 보장하기 위해서 실제 시스템을 얼마나 잘 반영하는지에 대한 모델실증 과정이 필수적이다. 실 세계 시스템으로부터 데이터 획득이 어려울 경우에 모델의 유효성을 확인할 수 없기 때문에 그 모델 기반의 분석/예측 결과에 대한 신뢰성을 보장할 수 없는 문제가 생길 수 있다. One method of modeling is to create an abstraction model using knowledge of physical laws or behavioral rules included in the target system. This modeling and simulation can be used to express the causal relationship between the controlled input and the corresponding output. (M & S) based method. This method has a limitation that detailed information about the system to be modeled is available. In addition, when building a model in a modeling and simulation (M & S) -based manner, a model demonstration process of how well it reflects the actual system is essential to ensure the validity of the model. If data is difficult to obtain from a real-world system, the validity of the model cannot be verified, which can lead to problems that cannot guarantee the reliability of the model-based analysis / prediction results.

시스템의 예측 및 분석을 위한 다른 모델링 방법으로 대상 시스템의 운용 및 관측을 통해 획득된 많은 데이터들을 분석함으로써 해당 시스템이 내포하고 있는 규칙/패턴/함수를 도출하는 형태의 데이터 모델링 방법이 있다. 데이터 모델링의 대표적인 방법이라 할 수 있는 기계학습 기반의 모델은 한 세트의 데이터와 다른 데이터 세트 간의 상관관계를 나타내는 방법으로, 빅 데이터 시대에 들어서며 수 많은 데이터를 활용하여 좀 더 효과적인 기계학습이 가능케 되었다. 그러나 기계학습을 통해 구축된 데이터 모델은 해당 시스템이 미래에도 아무런 변경이 없이 운용된다는 전제하에 예측이 가능할 수 있지만, 시스템의 구성 또는 운용 법칙 등이 변경되면 이전에 학습된 모델로 미래를 예측할 수 없다는 한계점이 있다.Another modeling method for predicting and analyzing a system is a data modeling method of deriving a rule / pattern / function included in a corresponding system by analyzing a large amount of data acquired through operation and observation of a target system. Machine learning-based models, which are representative methods of data modeling, represent the correlation between one set of data and another set of data, and in the age of big data, more efficient machine learning has become possible by utilizing a large number of data. . However, the data model constructed through machine learning can be predicted under the assumption that the system will operate without any change in the future, but if the system configuration or the rules of operation are changed, the model cannot be predicted with the previously learned model. There are limitations.

다각화 된 사회를 예측하는 수단으로 '빅 데이터'라는 용어가 널리 보급되고 있다. 빅 데이터는 일반적인 소프트웨어 도구가 허용 가능한 경과 시간 내에 수집, 관리 및 처리 할 수 있는 능력을 뛰어 넘는 크기의 데이터 셋을 의미한다. 이러한 대용량 데이터는 기존의 제한된 데이터보다 더 많은 통찰력을 제공하기 때문에 과학, 공학, 국방, 경영, 의학, 정치 등 다양한 분야의 연구에서 많은 관심을 받고 있다. 이러한 이유로 빅 데이터를 활용한 모델링이 빅 데이터 시대에 필수적이고 중요한 이슈가 되고 있다.The term 'big data' is widely used as a means of predicting diversified societies. Big data refers to datasets of a size that go beyond the ability of common software tools to collect, manage, and process them within acceptable elapsed time. This large amount of data is attracting much attention from various fields such as science, engineering, defense, management, medicine, and politics because it provides more insight than existing limited data. For this reason, modeling using big data has become an essential and important issue in the big data era.

빅 데이터를 이용한 모델링은 데이터의 상관관계를 나타내는 데 중점을 둔 데이터 모델링이라고 정의할 수 있다. 이러한 접근법은 데이터 마이닝(data mining)과 기계학습(machine learning)의 두 가지 유형으로 분류되어 연구되고 있다. 도 1은 두 가지 방식으로 데이터 모델링을 수행하는 방법을 보여준다. Modeling using big data can be defined as data modeling that focuses on the correlation of data. This approach is classified into two types: data mining and machine learning. 1 shows a method of performing data modeling in two ways.

데이터 마이닝은 도 1의 왼쪽과 같이 데이터 모델링의 유용한 방법 중에 하나이다. 데이터 마이닝 기술을 사용함으로써 시스템 결과를 예측하고자 하는 사용자는 한 차원에서 데이터의 패턴이나 속성을 분석할 수 있을 뿐 아니라, 데이터 패턴에서 데이터의 분포함수를 식별할 수 있다. Data mining is one of the useful methods of data modeling as shown on the left side of FIG. By using data mining techniques, users who want to predict system results can not only analyze patterns or attributes of data in one dimension, but also identify distribution functions of data in data patterns.

그런 다음 적합성 테스트(GOF; Goodness of Fitness) 같은 방법을 사용하여 실 세계의 데이터로 분포함수를 검증한 후 "난수 생성" 모델을 얻을 수 있다. 최종적으로 얻어진 모델은 미래의 데이터 패턴을 예측하는 과정에서 활용될 수 있다. You can then use a method such as Goodness of Fitness (GOF) to verify the distribution function with real-world data and obtain a "random number generation" model. The resulting model can be used to predict future data patterns.

한편, 기계학습은 도 1의 우측에서와 같이 데이터 모델링의 또 다른 수단이 될 수 있다. 인공신경망(ANN) 및 유전자 알고리즘(GE)과 같은 기계학습 알고리즘을 사용하여 사용자는 한 세트의 데이터(d1)와 다른 세트의 데이터(dn) 사이의 연관을 매핑할 수 있다. On the other hand, machine learning may be another means of data modeling as shown in the right side of FIG. Using machine learning algorithms, such as artificial neural networks (ANN) and genetic algorithms (GE), a user can map the association between one set of data d1 and another set of data dn.

데이터 마이닝 과정과 마찬가지로 RMSE (Root-Mean-Square Error)와 같은 일반 성능 지수를 사용하여 실제 맵 데이터의 유효성 검사 프로세스를 통과함으로써 데이터 모델을 획득할 수 있다. 그 후 주어진 데이터 세트 "d1"을 사용하여 데이터 세트 "dn"의 미래 값을 예측할 수 있다.As with the data mining process, a general performance index such as root-mean-square error (RME) can be used to obtain a data model by passing the validation process of the actual map data. The given data set "d1" can then be used to predict the future value of the data set "dn".

이러한 데이터 모델링은 획득, 모델링, 검증 및 예측과 같은 일련의 과정으로 진행된다. 데이터 모델링은 대상 시스템의 미래 행동을 예측하기 위해 과학, 공학, 경제, 산업 등 여러 분야에서 폭넓게 사용되어 왔으며, 일부 연구자들은 충분한 정보가 주어지면 상관관계가 강력하고 유익한 예측을 내리기에 충분하다고 주장했다. This data modeling is a series of processes such as acquisition, modeling, verification and prediction. Data modeling has been widely used in many fields, including science, engineering, economics, and industry, to predict future behavior of target systems, and some researchers have argued that, given sufficient information, correlations are sufficient to make strong and informative predictions. .

그러나 이러한 기대와는 달리, 데이터 모델링이 언제나 강력한 모델링 방식이 되는 것을 의미하지 않는다. 이 방법은 몇 가지 한계를 가지는데, 대표적인 한계 중 하나는 제어 입력과 해당 출력 간의 인과관계를 나타내는 것이 아니라 데이터 간의 상관관계를 설명할 수 있다는 것이다. But contrary to these expectations, data modeling doesn't always mean it's a powerful modeling approach. This method has several limitations. One of the typical limitations is that it does not indicate the causal relationship between the control input and its output, but rather the correlation between the data.

데이터 모델을 통해서는 시스템의 돌발 상황과 변화하는 상황에 대처할 수 없다. 즉, 모델을 학습한 후에 시스템의 구성요소 또는 구조/행위가 변경되면 데이터 모델을 이용한 정확한 예측이 불가능하다. Data models can't cope with system outbreaks and changing conditions. In other words, if the components or structure / behavior of the system is changed after the model is trained, accurate prediction using the data model is impossible.

또 다른 한계는 예기치 않은 이벤트에 대처할 수 없다는 것이다. 실제 시스템에서는 시스템의 복잡성과 불확실성으로 인해 예기치 않은 이벤트가 발생할 수 있다. 이들은 일반적으로 우리가 획득할 수 있는 데이터 세트에 포함되지 않는데, 이러한 이벤트가 발생하면 원래 데이터 세트에 기초한 데이터 모델은 예상치 못한 이벤트를 정확하게 예측할 수 없다. Another limitation is that you cannot cope with unexpected events. In a real system, the complexity and uncertainty of the system can cause unexpected events. They are generally not included in the data set we can obtain, and when these events occur, the data model based on the original data set cannot accurately predict unexpected events.

또한 이와 비슷하게 대상 시스템으로부터 획득된 데이터의 양에 따라 예측 결과에 영향을 받는다는 한계가 있다.Similarly, there is a limitation that the prediction result is influenced by the amount of data obtained from the target system.

이러한 데이터 모델링의 한계를 극복하기 위해 시스템 과학 기반의 시뮬레이션 모델링이 필요하다. 시뮬레이션 모델링은 시뮬레이션 필드에서 일반적으로 사용되는 이론 기반 모델링 방식으로 정의될 수 있으며, 모델을 구축하기 위해 대상 시스템에 존재하는 물리적 또는 운영 법칙이 사용된다. To overcome this limitation of data modeling, simulation modeling based on system science is required. Simulation modeling can be defined by the theory-based modeling methods commonly used in simulation fields, and the physical or operational laws that exist in the target system are used to build the model.

이를 통해 데이터 모델링과 달리 제어 입력과 대응 출력 세트 사이의 인과관계를 명확하게 나타내는 것이 가능하다. 그럼에도 불구하고 시뮬레이션 모델링만으로도 빅 데이터 시대에 복잡한 시스템의 모델링을 위한 완벽한 해결책이 될 수 없다. This makes it possible to clearly represent the causal relationship between the control input and the corresponding output set, unlike data modeling. Nevertheless, simulation modeling alone is not the perfect solution for modeling complex systems in the big data era.

예를 들어, 시스템에 대한 지식을 충분히 얻는 것이 어려울 때, 목적을 만족시키는 모델링 및 시뮬레이션(M&S) 모델을 완전하게 구축하는 것은 불가능하다. 시뮬레이션 모델링 접근법은 대상 시스템에 대한 사전 지식을 기반으로 하며, 그 완성은 시스템에 대해 얼마나 많이 이해할 수 있는지에 달려 있기 때문이다. 정확한 시뮬레이션 모델링을 위해 대상 시스템에 대한 광범위한 물리적 및 운영 지식이 필요하다. For example, when it is difficult to get enough knowledge of the system, it is impossible to completely build a modeling and simulation (M & S) model that satisfies the purpose. The simulation modeling approach is based on prior knowledge of the target system, and its completion depends on how much you can understand the system. Accurate simulation modeling requires extensive physical and operational knowledge of the target system.

또한 시뮬레이션 모델이 만들어진 후에 모델검증을 통한 모델의 유효성을 확인하는 과정이 필요한데, 실제 시스템에서 검증용 데이터가 없거나 획득이 어려울 경우에는 모델의 유효성 확인이 곤란하게 된다. In addition, it is necessary to verify the validity of the model through model verification after the simulation model is created. If there is no verification data in the real system or it is difficult to obtain the model, it is difficult to verify the validity of the model.

이와 같이 두 가지 모델링 방식에는 한계점이 명확하게 존재한다. These two modeling approaches clearly have limitations.

도 2는 이러한 한계점을 간단한 예를 통해 설명하고 있다. 도 2의 상단에 나타난 x1, x2, x3, x4, y를 입출력으로 가지는 시스템을 모델링 한다고 했을 때, 시뮬레이션 모델링 방법을 사용한 경우는 오른쪽 도안과 같다. 시스템 모델은 대상 시스템에 대한 수학적인 지식을 통해서 구축 될 수 있지만, 모델을 통해 출력되는 출력 값이 실제 시스템에서 획득된 데이터를 통해서 실증되는 경우에 유효하게 된다. 2 illustrates this limitation through a simple example. Assuming that a system having x1, x2, x3, x4, y as input / output shown in the upper part of FIG. 2 is modeled, the simulation modeling method is the same as the right figure. The system model can be constructed through mathematical knowledge of the target system, but it is valid when the output value output through the model is demonstrated through the data obtained from the actual system.

반대로 왼쪽 도안과 같이 시스템을 운용하여 얻은 x1, x2, x3, x4, y에 대한 데이터를 통해 기계학습을 할 경우에는 x1, x2, x3, x4 값을 입력했을 때 y를 출력하는 정확도 높은 모델을 얻을 수 있지만 도 2의 상단의 시스템 운용 법칙을 변경하는 경우(예를 들어, 도 2의 상단의 우측의 시스템 운용 법칙("X", 곱하기)을 나누기로 변경하는 경우) 기존에 획득된 빅데이터를 통해 나온 기계학습 모델에 의해서는 시스템 운용 법칙이 변경된 연구대상 시스템의 출력 y를 적절히 예측하지 못한다는 한계가 있다. On the contrary, in the case of machine learning through the data of x1, x2, x3, x4, y obtained by operating the system as shown in the figure on the left, a highly accurate model that outputs y when x1, x2, x3, x4 values are input Can be obtained, but when changing the system operating law of the upper part of FIG. 2 (for example, changing the system operating law ("X", multiplication) of the right side of the upper part of FIG. 2 by dividing). The machine learning model, which is derived from, has a limitation that the law of system operation does not properly predict the output y of the system under study.

미국공개특허 제2017/0286572호 "Digital twin of twinned physical system" (2017년 10월 5일)United States Patent Application Publication No. 2017/0286572 "Digital twin of twinned physical system" (October 5, 2017)

상기에 기재한 종래기술의 문제점을 극복하기 위하여, 보다 강건한 분석/예측 지원이 가능하도록 두 모델링 방식의 장점을 상호보완적으로 활용하여 각 접근방법의 한계점을 극복할 수 있는 상호협력적인 방법이 필요하다. 따라서, 본 발명은 종래기술의 문제점을 개선하기 위하여, 가설적 모델을 세우고, 실제 시스템에서 획득한 빅 데이터(Big data)를 기계학습(Machine Learning)을 이용하여 검증된 파라미터 값들을 산출하며, 이를 가설적 모델(Gray Box)에 적용하도록 하는 빅 데이터 기계학습이 내장된 시스템 모델링방법을 제공하는데 그 목적이 있다.In order to overcome the problems of the prior art described above, a mutually cooperative method is needed to overcome the limitations of each approach by complementarily utilizing the advantages of the two modeling methods to enable more robust analysis / prediction support. Do. Therefore, in order to improve the problems of the prior art, the present invention establishes a hypothetical model, calculates the parameter values verified by using machine learning on big data obtained from an actual system, The purpose is to provide a system modeling method with built-in big data machine learning that can be applied to a hypothetical model (Gray Box).

본 발명의 목적을 달성하기 위한 빅 데이터 기계학습이 내장된 시스템 모델링과정은 대상시스템의 지식획득을 통해 정의된 가설적모델에 상기 대상시스템의 운영 및 관측으로 획득된 빅데이터에 대한 기계학습을 통해 대상시스템에 대한 시뮬레이션 모델을 완성하도록 하는 시스템 모델링방법으로서, 상기 대상시스템과 관련된 획득 가능한 정보를 파악하여 대상시스템에 대한 가설적모델을 정의하는 제1과정; 상기 가설적모델에서 제공하는 기능블록에 상응하여 상기 대상시스템에서 실제 획득된 빅데이터를 기계학습 알고리즘을 통해 가설적모델에 필요한 정보를 학습하는 제2과정; 상기 기계학습 과정을 통해 학습 및 검증된 정보들을 가설적 모델에 적용하여 대상시스템에 대한 시뮬레이션모델을 완성하는 제3과정;을 포함하여 이루어진 것을 특징으로 한다.In order to achieve the object of the present invention, a system modeling process in which big data machine learning is embedded is performed through machine learning about big data obtained by operating and observing the target system in a hypothetical model defined by acquiring knowledge of a target system. A system modeling method for completing a simulation model for a target system, the system modeling method comprising: a first step of defining a hypothetical model for a target system by identifying obtainable information related to the target system; A second step of learning the information necessary for the hypothetical model through a machine learning algorithm on the big data actually acquired in the target system corresponding to the functional block provided by the hypothetical model; And a third process of completing the simulation model for the target system by applying the information learned and verified through the machine learning process to the hypothetical model.

여기서, 상기 제2과정은 가설적모델 내부의 다수의 기능블록은 각 기능블록 별로 기계학습을 수행하여 검증된 파라미터값을 산출하여 이를 상기 가설적모델에 필요한 정보로 제공하는 것을 특징으로 한다.Here, the second process is characterized in that the plurality of functional blocks in the hypothetical model perform machine learning for each functional block to calculate verified parameter values and provide them as necessary information for the hypothetical model.

상기 파라미터값은 변수값, 확률, 함수 또는 그래프를 포함하며, 이는 상기 기능블록 안에 들어가는 것을 특징으로 한다.The parameter value includes a variable value, a probability, a function or a graph, which is characterized by falling into the function block.

또한, 상기 가설적모델의 한가지 형태로서 세포 오토마타(Cellular Automata)모델을 사용하고, 상기 빅 데이타를 이용하여 기계학습을 이용하여 가설적 모델에 필요한 셀(cell)의 상태천이 함수가 학습되고, 이것을 세포 오토마타 모델에 제공하는 것을 특징으로 한다.In addition, as a form of the hypothetical model, a cell automata model is used, and the state transition function of a cell required for the hypothetical model is learned using machine learning using the big data. It is characterized by providing a cell automata model.

본 발명에 따른 빅 데이터 기계학습이 내장된 시스템 모델링방법은 대상 시스템의 운영 및 관측을 통해 획득된 실제 빅데이터를 이용한 기계학습 내용을 내장함으로써, 모델 자체가 검증된 모델이 되며, 각각의 방법만을 사용하여 관심대상 시스템을 분석 또는 예측할 때 직면할 수 있는 한계점들을 극복할 수 있다.The system modeling method in which the big data machine learning is built according to the present invention includes the machine learning contents using the actual big data obtained through the operation and observation of the target system, thereby making the model itself a verified model. Can be used to overcome limitations that may be encountered when analyzing or predicting a system of interest.

즉, 기계학습을 시스템모델에 내장 시킴으로써, 대상 시스템에 대해 요구되는 사전지식의 정도를 경감시키고 모델 실증의 효과를 달성하는 한편, 기계학습만으로 할 수 없었던 시스템 구조/규칙 변경에 따른 시스템 행위 분석 및 예측이 가능해진다. In other words, by embedding machine learning into the system model, the degree of prior knowledge required for the target system can be reduced and the effect of model demonstration can be achieved. Prediction is possible.

또한 본 발명은 산불, 교통, 질병 등 지형 그리드를 기반으로 하는 모든 시스템에 적용이 용이하다.In addition, the present invention is easy to apply to any system based on the terrain grid, such as forest fires, traffic, disease.

또한, 빅 데이터와 기계학습을 통해 세포 오토마타 모델의 상태천이 함수를 학습하기 때문에 다음과 같은 이점을 갖는다. 첫째는 공간변이(space variant)에 대한 특징으로, 상태천이 함수가 전체 셀에 대해서 일괄적으로 표현되는 것이 아니고 개별 셀에 따라 다르게 표현될 수 있다. 예를 들어 위치에 따라 달라지는 지형의 특징을 셀 별로 반영할 수 있다. In addition, because the state transition function of the cell automata model is learned through big data and machine learning, it has the following advantages. The first is a feature of the space variant. The state transition function is not collectively expressed for the entire cell but may be expressed differently for each individual cell. For example, it is possible to reflect the feature of the terrain that varies depending on the location for each cell.

둘째는 시간변이(time variant)에 대한 특징으로, 상태천이 함수가 오전, 오후 등의 시간에 따라 달라질 수 있는 특징을 반영할 수 있다. The second is a characteristic of a time variant, and may reflect a characteristic in which the state transition function may vary depending on the time of the morning or the afternoon.

셋째는 확률(stochastic)에 대한 특징으로, 예를 들어 교통 시뮬레이션에서 운전자의 행위가 결정적인 것이 아니라 확률적으로 나타날 수 있기 때문에 빅 데이터와 기계학습을 통해 확률적인(stochastic) 특징을 반영하는 것이 중요하다.The third is stochastic. It is important to reflect stochastic through big data and machine learning, for example, because the driver's behavior can be probabilistic rather than deterministic in traffic simulation. .

또한, 본 발명은 시스템의 비선형(nonlinear) 특징들을 반영할 수 있다. 즉 실제 세계의 많은 시스템들은 입출력 사이에 중첩의 정리가 성립이 되지 않는 비선형적인 특징을 갖기 때문에 상태천이 함수가 비선형(nonlinear) 특징을 고려할 수 있다는 것은 큰 이점이 된다. 마지막으로 실제 시스템으로부터 얻어진 빅 데이터를 통해 기상 조건 등 추가적인 정보를 표현할 수 있다는 효과가 있다.In addition, the present invention may reflect the nonlinear features of the system. In other words, many systems in the real world have a non-linear characteristic that does not constitute a theorem between overlapping input and output, so that the state transition function can consider nonlinear features. Finally, the big data obtained from the actual system can be used to express additional information such as weather conditions.

도 1은 종래기술에 따른 데이터 마이닝과 기계학습의 두 가지 방식으로 데이터 모델링을 수행하는 과정을 설명하기 위한 도이고,
도 2는 도 1의 두 가지 모델링 방식의 한계점을 설명하기 위한 도이고,
도 3은 본 발명을 구현하기 위한 가설적 모델에 빅 데이터 기계학습 적용을 통한 모델링 구성도이고,
도 4는 본 발명의 실시 예에 따른 빅 데이터 기계학습이 내장된 시스템 모델링 과정의 전체 흐름도이고,
도 5는 도 3의 구체적인 실시 예에 대하여 설명하기 위한 구성도이고,
도 6은 도 5에서 가설적모델에 적용된 세포 오토마타의 정의와 상태 갱신의 방식을 설명하기 위한 도이고,
도 7은 도 6의 세포 오토마타에서 각 세포의 상태천이 함수에 대해 설명하기 위한 도면이다.1 is a view for explaining a process of performing data modeling in two ways, data mining and machine learning according to the prior art,
2 is a view for explaining the limitations of the two modeling method of FIG.
3 is a schematic diagram of modeling through applying big data machine learning to a hypothetical model for implementing the present invention.
4 is an overall flowchart of a system modeling process in which big data machine learning is embedded according to an embodiment of the present invention;
5 is a configuration diagram for describing a specific embodiment of FIG. 3;
FIG. 6 is a diagram for explaining the definition of cell automata and state update method applied to the hypothetical model in FIG.
7 is a view for explaining the state transition function of each cell in the cell automata of FIG.

상술한 본 발명의 특징 및 효과는 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해질 것이며, 그에 따라 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자가 본 발명의 기술적 사상을 용이하게 실시할 수 있을 것이다. The above-described features and effects of the present invention will become more apparent from the following detailed description taken in conjunction with the accompanying drawings, and thus, those skilled in the art to which the present invention pertains may easily implement the technical idea of the present invention. Could be.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 형태를 가질 수 있는 바, 특정 실시 예들을 예시하고 본문에 상세하게 설명하고자 한다. As the inventive concept allows for various changes and numerous embodiments, particular embodiments will be illustrated and described in detail in the text.

그러나, 이는 본 발명을 특정한 제시형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. However, this is not intended to limit the present invention to a particular presentation, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention.

본 출원에서 사용한 용어는 단지 특정한 실시 예들을 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다.The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention.

본 발명은 실제 시스템에서 획득한 빅 데이터(Big data)를 모델에 맞도록 기계학습(Machine Learning) 시키므로, 검증된 파라미터 값들을 산출하고, 이를 가설적 모델(Gray Box)에 적용하도록 하는 빅 데이터 기계학습이 내장된 시스템 모델링방법에 관한 것이다. According to the present invention, since machine learning is performed to fit big data obtained from an actual system to a model, a large data machine that calculates verified parameter values and applies it to a hypothetical model (Gray Box). A system modeling method with embedded learning.

본 발명의 바람직한 실시 예에 따른 구성 및 구체적인 작용에 대하여 첨부된 도 3 내지 도 6을 참조하여 상세히 설명하면 다음과 같다. 이때 선행기술인 미국공개특허 제2017/0286572호 "Digital twin of twinned physical system"에도 기재된 것과 같이, 본 발명의 모델링 및 시뮬레이션은 컴퓨팅 시스템과 그 내부의 프로세서에 의하여 수행되며, 이러한 사실은 본 출원 명세서의 기재로부터 당업자라면 자명하게 파악하는 데에 전혀 어려움이 없을 것이다.Referring to the configuration and specific operation according to the preferred embodiment of the present invention with reference to the accompanying drawings 3 to 6 as follows. At this time, as described in the prior art US Patent Publication No. 2017/0286572 "Digital twin of twinned physical system", the modeling and simulation of the present invention is performed by a computing system and a processor therein, which is Those skilled in the art will have no difficulty in clearly understanding the description.

먼저, 본 발명은 모델링 및 시뮬레이션 기반의 모델링에 기계학습 방법을 적용한 협력적 모델링 방법으로, 본 발명의 대상시스템에서 실제 획득된 빅 데이터를 이용한 기계학습 방법을 활용하여, 가설적 모델 구체화에 필요한 동작/기능 함수, 파라미터 등을 예측하도록 한다.First, the present invention is a cooperative modeling method applying a machine learning method to modeling and simulation-based modeling. An operation required for materializing a hypothetical model using a machine learning method using big data actually obtained in the target system of the present invention. Predict function functions, parameters, etc.

도 3 및 도 4는 본 발명의 실시 예에 따른 빅 데이터 기계학습이 내장된 시스템 구성도 및 모델링과정의 흐름도로서, 이를 참조하면 모델링하기 위한 대상시스템(100)의 모델링 및 시뮬레이션의 목적에 따른 사용자의 모델링 요구사항을 분석하여야 한다. (S110)3 and 4 are flowcharts of a system configuration and modeling process in which big data machine learning is embedded according to an embodiment of the present invention. Referring to this, a user according to the purpose of modeling and simulation of a target system 100 for modeling The modeling requirements of the system should be analyzed. (S110)

모델링은 상기 대상시스템(100) 전체를 완벽하게 표현하는 것이 아니라, 미리 설정된 목적에 맞도록 시스템을 추상화 하는 과정이기 때문에 목적에 맞는 시스템 모델링이 중요하다Modeling is not a complete representation of the entire target system 100, but a process of abstracting the system to meet a predetermined purpose is important to model the system according to the purpose

시스템 모델링을 위하여 먼저, 대상 시스템과 관련한 획득 가능한 도메인 지식/경험 및 이론 등을 파악하여 해당 시스템에 대한 가설적모델(Gray box model)(110)을 정의한다. (S130)In order to model the system, first, a grasp of domain knowledge / experience, theory, etc. related to a target system is identified, and a gray box model 110 for the corresponding system is defined. (S130)

상기 가설적모델(110) 자체로는 시뮬레이션이 불가능하기 때문에 모델 완성에 필요한 동작 함수, 파라미터 등의 정보를 확보하여 완전한 모델을 구성하는 과정이 필요하게 된다. Since the hypothetical model 110 itself cannot be simulated, a process of constructing a complete model by obtaining information such as operation functions and parameters required for model completion is required.

모델 완성에 필요한 정보들은 모델링 대상시스템(100)을 실제로 운용/관측함으로써 빅데이터(120)를 획득하고, 획득된 빅데이터(120)에 대해 기계학습을 통해 가설적모델(110)에 필요한 정보를 획득한다. (S150)The information required for model completion is obtained by acquiring the big data 120 by actually operating / observing the modeling target system 100, and obtaining the information necessary for the hypothetical model 110 through machine learning on the acquired big data 120. Acquire. (S150)

즉, 실제 시스템의 운용 및 관측을 통해서 획득된 빅데이터(120)를 인공신경망 등의 기계학습 알고리즘을 이용하여 가설적 모델(110)에 필요한 정보들을 학습할 수 있다.That is, the big data 120 acquired through the operation and observation of the actual system may learn information necessary for the hypothetical model 110 using a machine learning algorithm such as an artificial neural network.

실 데이터에 기반하여 학습 및 검증된 정보들을 가설적 모델에 적용하여 대상 시스템에 대한 시스템모델 (White box model)(130) 을 완성한다. (S170)The white box model 130 for the target system is completed by applying the learned and verified information to the hypothetical model based on the actual data. (S170)

최종적으로, 완성된 시스템모델(130)의 시뮬레이션을 통해 관심대상 시스템에 대한 동작/성능 분석 및 예측을 수행한다. (S190)Finally, the simulation of the completed system model 130 performs the operation / performance analysis and prediction for the system of interest. (S190)

도 5는 도 3을 구체적인 실시 예로 설명하기 위한 도로서, 먼저 대상 시스템(100)의 지식 획득을 통해 대상시스템(100)의 가설적모델(110)을 세우게 되는데, 상기 대상시스템(100)의 구조적인 특징을 가설적모델(110)로 구할 수 있지만, "m, n, a, b, c, d, e, k"에 해당하는 계수들, 즉 실제 값들은 알 수 없다. FIG. 5 is a diagram for explaining a specific example of FIG. 3. First, a hypothetical model 110 of the target system 100 is established by obtaining knowledge of the target system 100. The structure of the target system 100 is illustrated in FIG. Can be obtained by the hypothetical model 110, but the coefficients corresponding to "m, n, a, b, c, d, e, k", that is, the actual values are unknown.

따라서, 가설적모델(110) 내부의 다수의 기능블록은 각 기능블록 별로 빅데이터를 이용한 기계학습을 수행하여 검증된 파라미터값을 산출하게 되고, 산출된 파라미터값은 상기 가설적모델(110)에 필요한 정보로 제공한다. Therefore, a plurality of functional blocks in the hypothetical model 110 calculates a verified parameter value by performing machine learning using big data for each functional block, and the calculated parameter values are stored in the hypothetical model 110. Provide the necessary information.

보다 상세하게는, 상기 계수들은 대상시스템(100)의 실제 운용 및 관측을 통해 획득된 빅데이터(120)(x1, x2, x3, x4, g1, g2, y)를 통해서 학습될 수 있는데, 인공신경망과 같은 학습 방법을 통해 각 기능블록 별로 기계학습을 수행한다. 즉, g()는 "g1, g2, y"를 이용하고, g1()은 "x1, x2, g1"을 이용하며, g2()는 "x3, x4, g2"를 이용하여 각각 학습될 수 있다. More specifically, the coefficients may be learned through big data 120 (x1, x2, x3, x4, g1, g2, y) obtained through actual operation and observation of the target system 100. Machine learning is performed for each functional block through a learning method such as neural network. That is, g () uses "g1, g2, y", g1 () uses "x1, x2, g1", and g2 () can be learned using "x3, x4, g2", respectively. have.

여기서, 상기 파라미터 값들은 함수뿐만 아니라, 변수값, 확률 또는 그래프를 포함하며, 이는 각 기능블록 안에 들어가게 된다.Here, the parameter values include not only functions, but also variable values, probabilities or graphs, which fall within each functional block.

이와 같이 학습을 통해 상기 검증된 파라미터(m, n, a, b, c, d, e, k) 값 들이 산출되며, 이를 가설적 모델(110)에 적용하면 시스템 모델(130)이 완성되며, 완성된 시스템모델(130)의 시뮬레이션을 통해 대상시스템(100)에 대한 동작/성능 분석 및 예측을 수행하게 된다.In this way, the values of the verified parameters (m, n, a, b, c, d, e, k) are calculated through learning, and when applied to the hypothetical model 110, the system model 130 is completed. Through the simulation of the completed system model 130, the operation / performance analysis and prediction for the target system 100 is performed.

도 6은 도 5에서 가설적모델에 적용된 세포 오토마타의 정의와 상태 갱신의 방식을 설명하기 위한 도로서, 상기 가설적모델(110)의 한 실시 예로서, 상기 가설적모델(110)은 세포 오토마타(Cellular Automata) 모델(111)을 통해 만들어지며, 상기 빅데이타(120)를 이용하여 기계학습을 통해 가설적 모델에 필요한 셀(cell)의 상태천이 함수가 학습되고, 이것을 세포 오토마타 모델(111)에 제공하게 된다.FIG. 6 is a diagram for explaining the definition of a cell automata applied to a hypothetical model and a method of state update in FIG. 5. As an example of the hypothetical model 110, the hypothetical model 110 is a cell automata. (Cellular Automata) model 111, the state transition function of the cell (cell) required for the hypothetical model is learned through the machine learning using the big data 120, this is the cell automata model 111 To be provided.

보다 상세히 설명하면, 상기 대상시스템(100)에서 수집된 빅데이터(120)를 이용하여 인공신경망 모델링 등과 같은 기계학습을 사용하여 가설적모델(110)에 필요한 세포(Cell)의 상태천이 함수(Transition Function)가 학습되고, 이것을 가설적모델인 세포 오토마타 모델(111)에 제공한다. In more detail, the state transition function of cells required for the hypothetical model 110 using machine learning such as artificial neural network modeling using the big data 120 collected by the target system 100 Function) is learned and provided to the cellular automata model 111, which is a hypothetical model.

즉, 학습된 상태천이 함수들과 세포 오토마타 모델(111)이 합쳐지면 시스템모델(130)(White box)이 완성된다. 이러한 상호협력적인 접근 방법에서는 빅 데이터를 통한 기계학습을 통해 시뮬레이션 모델의 실증 문제가 해결되고, 시뮬레이션 모델링을 통한 가설적 모델을 통해 시스템 구조 및 규칙 변경에 대한 문제가 해결되는 향상된 모델링이 가능하다는 것을 알 수 있다.That is, when the trained state transition functions and the cell automata model 111 are combined, the system model 130 (white box) is completed. In this collaborative approach, machine learning through big data solves the empirical problem of the simulation model, and hypothetical model through the simulation modeling enables improved modeling that solves the problem of system structure and rule change. Able to know.

여기서, 상기 세포 오토마타 모델(111)은 수학, 물리학, 복잡계, 생물학, 미세구조 등의 모델링에서 다루는 이산 모형으로, 규칙적인 격자 형태로 배열된 셀(세포)들에서 정의된다. Here, the cell automata model 111 is a discrete model dealing with modeling of mathematics, physics, complex system, biology, microstructure, etc., and is defined in cells (cells) arranged in a regular lattice form.

각 셀들은 유한한 수의 상태를 가질 수 있으며, 격자는 유한한 수의 차원으로 정의된다. 각 셀에 대하여, 이웃들이라 부르는 셀들은 그 셀에 대한 관계로 정의하는데, 예를 들어 이웃을 하나의 셀에 대해 모든 방향으로 한 칸씩 떨어져 있는 셀들이라고 정의하면 된다. Each cell can have a finite number of states, and a grid is defined by a finite number of dimensions. For each cell, cells called neighbors are defined as relationships to the cell, for example, neighbors are cells that are spaced one space apart in one direction from one cell.

시간 t=0 일 때 각 셀의 상태를 지정해놓고 이를 초기 상태라고 한다. 새로운 세대는 상태천이 함수에 의해 이전 세대로부터 만들어지는데, 이 함수는 각 셀과 그 이웃들의 상태에 따라 그 셀의 새로운 상태가 지정되는, 즉 셀들의 행동 규칙을 정하는 수학적인 함수이다. When time t = 0, the state of each cell is designated and this is called the initial state. The new generation is created from the previous generation by a state transition function, which is a mathematical function that specifies the cell's new state according to the state of each cell and its neighbors, ie the rules of behavior of the cells.

일반적으로 상기 행동 규칙은 각 셀에 대해 동일하고 시간에 따라 변하지 않으며 각 세대의 모든 셀에 동시에 적용된다. In general, the behavior rule is the same for each cell and does not change over time and applies simultaneously to all cells of each generation.

상기 세포 오토마타 및 상태천이 함수에 대하여 도 6을 참조하여 상세히 설명하면, Cell(i, j)는 위치 (i, j)에 있는 셀로, 상태 s(i, j)를 가지며, N은 셀 (i, j)에서의 이웃 세포 패턴을, T는 상태천이 함수, 즉 에이전트의 행동 규칙을 나타낸다. 각 셀의 상태는 교통, 수질오염, 화재확산 등 문제의 종류에 따라 정의될 수 있다.The cell automata and state transition function are described in detail with reference to FIG. 6, where Cell (i, j) is a cell at position (i, j), and has state s (i, j), where N is cell (i , the neighboring cell pattern in j), where T represents the state transition function, i. The state of each cell can be defined according to the type of problem such as traffic, water pollution, fire spread.

도 7은 세포 오토마타에서 각 세포의 상태천이 함수에 대해 설명한 도로서, 상기 상태천이 함수가 일반적인 모델과 어떻게 다른지 알 수 있다.7 is a diagram illustrating a state transition function of each cell in a cell automata, and it can be seen how the state transition function is different from a general model.

일반적으로 이상적인 상태천이 함수는 이웃 셀들의 상태를 반영하여 상태천이가 일어나지만, 본 발명에서는 이웃 셀들의 상태뿐만 아니라 지형정보, 외부 요인(날씨, 온도 등의 기상 조건 등)을 입력으로 받아 상태천이가 이루어진다.In general, the ideal state transition function is a state transition occurs by reflecting the state of the neighboring cells, but in the present invention, as well as the state of the neighboring cells, the state transition is received by receiving the topographic information and external factors (weather conditions such as weather and temperature). Is done.

즉, 이상적인 모델에서 지형의 특성을 반영하고 나아가 실시간 기상정보까지 반영함으로써 다음 셀에 대한 더욱 정확한 예측이 이루어지는 것을 알 수 있다.In other words, it can be seen that more accurate prediction of the next cell is achieved by reflecting the characteristics of the terrain in the ideal model and further reflecting real-time weather information.

이때 상태천이 함수는 기존의 방식처럼 각 셀에 대해 동일하거나 시간에 따라 변하지 않는 것이 아니라 셀의 위치 변화와 시간의 변화에 따라 달라지며, 또한 상태천이 규칙이 결정적이지 않고 불확실성을 띄게 된다. 상태천이 함수를 만약 기존처럼 도메인 지식을 이용하여 구할 경우에는 실증(validation)이 필요하지만, 빅 데이터의 기계학습을 통해 구해지면 실제 데이터에 기반하기 때문에 실증의 문제가 해결될 수 있다.At this time, the state transition function is not the same or does not change with time for each cell as in the conventional method, but depends on the change of the position of the cell and the change of time, and the state transition rule is not deterministic and has uncertainty. If the state transition function is obtained using domain knowledge as before, validation is required, but if it is obtained through machine learning of big data, the problem of validation can be solved because it is based on real data.

이상과 같이, 본 발명의 실시 예에 따른 빅데이터 기계학습이 내장된 시스템 모델링방법은 비록 한정된 실시 예와 도면에 의해 설명되었으나 이 실시 예에 의해 한정되지 않으며, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 본 발명의 기술 사상과 아래에 기재될 특허청구범위의 균등범위 내에서 다양한 수정 및 변형 가능함은 물론이다.As described above, the system modeling method in which the big data machine learning is embedded according to an embodiment of the present invention is not limited by this embodiment, although it has been described with reference to the limited embodiments and the drawings. It is apparent that various modifications and changes can be made by those skilled in the art within the scope of the technical spirit of the present invention and the claims to be described below.

100 : 대상시스템 110 : 가설적모델
111 : 세포오토마타 모델 120 : 빅데이터
130 : 시스템모델100: target system 110: hypothetical model
111: Cell Automata Model 120: Big Data
130: System Model

Claims

In a computing system implementing a model for a target system,
The computing system
A hypothetical model defined based on obtainable structural information related to the target system through acquiring knowledge about the target system and including a plurality of functional blocks therein; And
At least one processor; Including,
The plurality of functional blocks
At least one first functional block configured to output first data and first data of the big data obtained through actual operation and observation of the target system based on the structural information; And
At least one second functional block configured to output third data as input and second data among the big data based on the structural information;
Including,
At least one or more of the at least one first functional block and the at least one or more second functional blocks are machine learning functional blocks capable of machine learning,
The at least one processor
The first machine learning function such that machine learning is performed by a first machine learning function block included in the at least one first function block by inputting the first data of the big data and outputting the second data. The second machine learning to control a block or to perform the machine learning by a second machine learning function block included in the at least one second function block by inputting the second data and outputting the third data. A computing system that implements a model for a target system that controls functional blocks.

The method of claim 1,
And the hypothetical model and the plurality of functional blocks are defined based on obtainable domain knowledge, experience, and theory relating to the target system.

The method of claim 1,
The at least one processor
When the machine learning is performed by the first machine learning function block, the verified first parameter value is obtained and the first parameter value is used as information identifying the first machine learning function block, or the second machine learning function is performed. And when the machine learning is performed by the block, obtaining the verified second parameter value and using the second parameter value as information identifying the second machine learning function block.

The method of claim 3,
The first parameter value or the second parameter value includes a variable value, probability, function or graph input to the first machine learning function block or the second machine learning function block, to implement a model for a target system. Computing system.

The method of claim 1,
The first data is data represented by input of the hypothetical model based on the structural information, the second data is data represented by internal variables of the hypothetical model based on the structural information, and the third data. And data is data represented by the output of the hypothetical model based on the structural information.

In a computing system implementing a model for a target system,
The computing system
A hypothetical model defined based on obtainable structural information related to the target system through acquiring knowledge about the target system and including a plurality of functional blocks therein; And
At least one processor; Including,
The plurality of functional blocks
At least one first functional block based on the structural information, the first data being the first data among the big data obtained through actual operation and observation of the target system; And
At least one second functional block configured to output second data as input and second data among the big data based on the structural information;
Including,
At least one or more of the at least one or more first-stage functional blocks and the at least one or more two-stage functional blocks are machine learning functional blocks capable of machine learning,
The at least one processor
Receive new input data for the target system,
Inputting the new input data into the hypothetical model and controlling the hypothetical model such that an inference process of the hypothetical model is executed,
And provide the output of the hypothetical model as a result of the inference of the hypothetical model with respect to the output of the target system with respect to the new input data.