KR102053243B1

KR102053243B1 - Adaptive Knowledge Base Construction Method and System

Info

Publication number: KR102053243B1
Application number: KR1020170019873A
Authority: KR
Inventors: 김말희; 강현중; 권순현; 이용준; 김귀훈; 표철식
Original assignee: 한국전자통신연구원
Priority date: 2016-08-03
Filing date: 2017-02-14
Publication date: 2019-12-06
Also published as: KR20180015561A

Abstract

본 발명의 일면에 따른 적응형 지식 베이스 구축 장치는 입력된 제1 데이터 집합을 토대로 학습 모델을 생성하고, 생성된 학습 모델에 따라 제1 데이터 집합에 대한 연관 관계를 분석하는 기계 학습 엔진; 학습된 모델 및 알고리즘을 토대로 규칙을 생성하는 규칙 생성기; 및 생성된 규칙과 제2 데이터 집합을 이용하여 시맨틱 규칙을 생성하는 시맨틱 규칙 생성기;를 포함한다.According to an aspect of the present invention, there is provided an adaptive knowledge base construction apparatus comprising: a machine learning engine that generates a learning model based on an input first data set, and analyzes an association relationship with the first data set according to the generated learning model; A rule generator for generating a rule based on the learned model and algorithm; And a semantic rule generator for generating semantic rules using the generated rules and the second data set.

Description

Adaptive Knowledge Base Construction Method and System

본 발명은 적응형 지식 베이스 구축 방법 및 그 시스템에 관한 것으로, 구체적으로 기계 학습을 통하여 생성된 학습 결과를 규칙으로 변환하고, 규칙으로 변환된 지식을 시맨틱 기술을 이용하여 지식 베이스를 구축하는 적응형 지식 베이스 구축 방법 및 그 시스템에 관한 것이다.The present invention relates to a method and system for constructing an adaptive knowledge base. In particular, the present invention converts a learning result generated through machine learning into a rule, and builds a knowledge base by using semantic techniques. Knowledge base construction method and system thereof.

최근 기계 학습 및 시맨틱 기술에 대한 연구가 활발하게 진행되고 있다. 기계 학습은 주어진 목적에 따라 데이터 기반 학습을 진행하고, 학습된 결과를 토대로 새로운 환경에 필요한 정보를 예측하는 기술이다. 통상적으로 학습 방법은 트리 기반 분석 방법과 연관 기반 분석 방법으로 분류될 수 있다. 트리 기반 분석 방법은 학습 결과 구축된 트리의 노드 정보를 이용하여 규칙(rule)을 생성하고, 연관 기반 분석 방법은 학습 데이터의 패턴을 분석하여 연관 규칙(rule)을 생성한다.Recently, research on machine learning and semantic technologies has been actively conducted. Machine learning is a technology that performs data-based learning according to a given purpose, and predicts the information required for a new environment based on the learned results. In general, a learning method may be classified into a tree-based analysis method and an association-based analysis method. A tree-based analysis method generates a rule using node information of a tree constructed as a learning result, and an association-based analysis method generates an association rule by analyzing a pattern of training data.

시맨틱 기술은 도메인 지식을 가진 설계자에 의하여 규칙(rule)을 생성하고, 생성된 규칙(rule)을 이용하여, 지식을 확장-추론-재활용하여 지식 베이스를 구축하는 기술을 말한다.Semantic technology refers to a technique for creating a rule by a designer having domain knowledge, and using the generated rule, to build a knowledge base by extending-inferring-reusing knowledge.

기계 학습을 이용하여 규칙을 생성하고, 규칙을 이용하여 요청에 따른 결과를 생성할 때, 생성 주기가 있어, 학습 결과를 재활용하지 못하는 문제가 있다. 시맨틱 기술의 경우, 도메인 지식을 갖는 사람이 개입하여 규칙을 변경하기 전에는 사전에 형성된 규칙 기반 지식을 토대로 확장 및 추론의 결과로서 지식 베이스가 구축되는데, 주변 환경에 의하여 실제 환경이 동적으로 변할 수 있어, 주변 환경에 따라 적응적(adaptive)으로 규칙을 변경하고, 변경된 규칙이 반영된 지식 베이스를 구축할 필요가 있다. When generating a rule using machine learning, and generating a result according to a request using the rule, there is a generation cycle, there is a problem that can not recycle the learning results. In the case of semantic technology, the knowledge base is built as a result of expansion and inference based on pre-formed rule-based knowledge before a person with domain knowledge intervenes to change the rules. Therefore, it is necessary to adapt the rules adaptively according to the surrounding environment and to build a knowledge base reflecting the changed rules.

본 발명은 전술한 문제를 해결하기 위하여, 적응형 지식 베이스 구축 방법 및 그 시스템을 제공하는 것을 그 목적으로 한다. 구체적으로 본 발명은 기계 학습 및 시맨틱 기술을 이용하여 사람의 개입 없이 동적으로 변화하는 환경을 반영하는 적응형 지식 베이스를 구축하는 방법 및 그 시스템을 제공하는 것이다.An object of the present invention is to provide an adaptive knowledge base construction method and system for solving the above problems. Specifically, the present invention provides a method and system for constructing an adaptive knowledge base that reflects a dynamically changing environment without human intervention using machine learning and semantic techniques.

본 발명의 목적은 이상에서 언급한 목적으로 제한되지 않으며, 언급되지 않은 또 다른 목적들은 아래의 기재로부터 당업자에게 명확하게 이해될 수 있을 것이다.The object of the present invention is not limited to the above-mentioned object, and other objects that are not mentioned will be clearly understood by those skilled in the art from the following description.

전술한 목적을 달성하기 위한 본 발명의 일면에 따른 적응형 지식 베이스 구축 장치는 입력된 제1 데이터 집합을 토대로 학습 모델을 생성하고, 생성된 학습 모델에 따라 제1 데이터 집합에 대한 연관 관계를 분석하는 기계 학습 엔진; 학습된 모델 및 알고리즘을 토대로 규칙을 생성하는 규칙 생성기; 및 생성된 규칙과 제2 데이터 집합을 이용하여 시맨틱 규칙을 생성하는 시맨틱 규칙 생성기;를 포함한다.According to an aspect of the present invention, an adaptive knowledge base construction apparatus generates a learning model based on an input first data set, and analyzes an association relationship with the first data set according to the generated learning model. Machine learning engine; A rule generator for generating a rule based on the learned model and algorithm; And a semantic rule generator for generating semantic rules using the generated rules and the second data set.

본 발명의 다른 일면에 따른 적응형 지식 베이스 구축 시스템은, 적응형 지식 베이스 구축 모듈을 제공하기 위한 프로그램이 저장된 메모리; 및 상기 프로그램을 실행시키는 프로세서를 포함하되, 상기 프로세서는 상기 프로그램을 실행시킴에 따라, 입력된 제1 데이터셋에 대한 학습 모델을 생성하고, 생성된 학습 모델에 따라 제1 데이터셋의 연관관계 분석 결과를 출력하고, 학습된 모델과, 학습된 알고리즘과 연관관계 분석 결과를 토대로 기계 학습 규칙을 생성하고, 생성된 기계 학습 규칙을 이용하여 시맨틱 규칙을 생성하는 것을 특징으로 한다.According to another aspect of the present invention, there is provided an adaptive knowledge base construction system comprising: a memory in which a program for providing an adaptive knowledge base construction module is stored; And a processor for executing the program, wherein the processor generates a learning model for the input first dataset as the program is executed, and analyzes the correlation of the first dataset according to the generated learning model. Outputting a result, generating a machine learning rule based on the learned model, the learned algorithm and the correlation analysis result, and generating semantic rules using the generated machine learning rule.

본 발명의 다른 일면에 따른 적응형 지식 베이스 구축 방법은 (1) 입력된 제1 데이터 집합을 기계 학습하는 단계; (2) 학습된 결과를 토대로 규칙을 생성하는 단계; (3) 생성된 규칙과 제2 데이터 집합을 이용하여 시맨틱 규칙을 생성하는 단계; 및 (4) 생성된 시맨틱 규칙을 저장하여 지식 베이스를 구축하는 단계;를 포함한다.According to another aspect of the present invention, an adaptive knowledge base construction method includes (1) machine learning an input first data set; (2) generating a rule based on the learned result; (3) generating a semantic rule using the generated rule and the second data set; And (4) storing the generated semantic rules to build a knowledge base.

본 발명에 따르면, 기계 학습 기술과 시맨틱 기술을 결합하여 지식 베이스를 구축하여 인간의 개입을 배제하고, 최적의 분석 및 높은 효율성을 달성하는 효과가 있다.According to the present invention, by combining a machine learning technology and semantic technology to build a knowledge base to exclude human intervention, it is effective to achieve optimal analysis and high efficiency.

본 발명은 IoT 기술, 빅데이터 기술과 연관한 분석, 상황 인식 서비스와 관련된 지능적 서비스 산업 분야에 적용할 수 있고, 기계 학습을 이용한 분석 기술 분야에서 플랫폼으로 활용될 수 있다.The present invention can be applied to the field of intelligent services related to IoT technology, analysis related to big data technology, and context awareness service, and can be used as a platform in the field of analysis technology using machine learning.

도 1은 본 발명에 따른 적응형 지식 베이스 구축 방법이 구현되는 컴퓨터 시스템의 구성을 설명하기 위한 예시도.
도 2는 본 발명의 일실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도.
도 3은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도.
도 4는 본 발명의 부분 실시예에 따른 트리 학습 모델 기반 적응형 지식 베이스 구축 방법을 설명하기 위한 예시도.
도 5는 본 발명의 부분 실시예에 따른 Apriori 알고리즘을 적용한 적응형 지식 베이스 구축 방법을 설명하기 위한 예시도.
도 6은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도.
도 7은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도.
도 8은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 방법의 절차 흐름도.1 is an exemplary diagram for explaining the configuration of a computer system in which the adaptive knowledge base building method according to the present invention is implemented.
2 is a block diagram of an adaptive knowledge base construction system according to an embodiment of the present invention.
3 is a block diagram of an adaptive knowledge base building system according to another embodiment of the present invention.
4 is an exemplary diagram for explaining a tree learning model based adaptive knowledge base construction method according to an embodiment of the present invention.
5 is an exemplary diagram for explaining an adaptive knowledge base construction method applying the Apriori algorithm according to an embodiment of the present invention.
6 is a block diagram of an adaptive knowledge base building system according to another embodiment of the present invention.
7 is a block diagram of an adaptive knowledge base building system according to another embodiment of the present invention.
8 is a process flow diagram of an adaptive knowledge base building method according to another embodiment of the present invention.

본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시예들을 참조하면 명확해질 것이다. 그러나 본 발명은 이하에서 개시되는 실시예들에 한정되는 것이 아니라 서로 다른 다양한 형태로 구현될 것이며, 단지 본 실시예들은 본 발명의 개시가 완전하도록 하며, 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명은 청구항의 범주에 의해 정의될 뿐이다. 한편, 본 명세서에서 사용된 용어는 실시예들을 설명하기 위한 것이며 본 발명을 제한하고자 하는 것은 아니다. 본 명세서에서, 단수형은 문구에서 특별히 언급하지 않는 한 복수형도 포함한다. 명세서에서 사용되는 "포함한다(comprises)" 및/또는 "포함하는(comprising)"은 언급된 구성소자, 단계, 동작 및/또는 소자는 하나 이상의 다른 구성소자, 단계, 동작 및/또는 소자의 존재 또는 추가를 배제하지 않는다.Advantages and features of the present invention, and methods for achieving them will become apparent with reference to the embodiments described below in detail in conjunction with the accompanying drawings. However, the present invention is not limited to the embodiments disclosed below, but will be implemented in various forms, and only the present embodiments are intended to complete the disclosure of the present invention, and the general knowledge in the art to which the present invention pertains. It is provided to fully convey the scope of the invention to those skilled in the art, and the present invention is defined only by the scope of the claims. Meanwhile, the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. In this specification, the singular also includes the plural unless specifically stated otherwise in the phrase. As used herein, “comprises” and / or “comprising” refers to a component, step, operation and / or device that is present in one or more other components, steps, operations and / or elements. Or does not exclude additions.

이하, 본 발명의 바람직한 실시예에 대하여 첨부한 도면을 참조하여 상세히 설명하기로 한다. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명에 따른 적응형 지식 베이스 구축 방법이 구현되는 컴퓨터 시스템의 구성을 설명하기 위한 예시도이다.1 is an exemplary diagram for explaining a configuration of a computer system in which an adaptive knowledge base construction method according to the present invention is implemented.

한편, 본 발명의 실시예에 따른 적응형 지식 베이스 구축 방법은 컴퓨터 시스템에서 구현되거나, 또는 기록매체에 기록될 수 있다. 도 1에 도시된 바와 같이, 컴퓨터 시스템은 적어도 하나 이상의 프로세서(110)와, 메모리(120)와, 사용자 입력 장치(150)와, 데이터 통신 버스(130)와, 사용자 출력 장치(160)와, 저장소(140)를 포함할 수 있다. 전술한 각각의 구성 요소는 데이터 통신 버스(130)를 통해 데이터 통신을 한다.Meanwhile, the adaptive knowledge base construction method according to an embodiment of the present invention may be implemented in a computer system or recorded in a recording medium. As shown in FIG. 1, a computer system includes at least one processor 110, a memory 120, a user input device 150, a data communication bus 130, a user output device 160, It may include a reservoir 140. Each of the above components communicates with each other via the data communication bus 130.

컴퓨터 시스템은 네트워크(180)에 연결된 네트워크 인터페이스(170)를 더 포함할 수 있다. 상기 프로세서(110)는 중앙처리 장치(central processing unit (CPU))이거나, 혹은 메모리(120) 및/또는 저장소(140)에 저장된 명령어를 처리하는 반도체 장치일 수 있다. The computer system can further include a network interface 170 coupled to the network 180. The processor 110 may be a central processing unit (CPU) or a semiconductor device that processes instructions stored in the memory 120 and / or the storage 140.

상기 메모리(120) 및 상기 저장소(140)는 다양한 형태의 휘발성 혹은 비휘발성 저장매체를 포함할 수 있다. 예컨대, 상기 메모리(120)는 ROM(123) 및 RAM(126)을 포함할 수 있다.The memory 120 and the storage 140 may include various types of volatile or nonvolatile storage media. For example, the memory 120 may include a ROM 123 and a RAM 126.

따라서, 본 발명의 실시예에 따른 적응형 지식 베이스 구축 방법은 컴퓨터에서 실행 가능한 방법으로 구현될 수 있다. 본 발명의 실시예에 따른 적응형 지식 베이스 구축 방법이 컴퓨터 장치에서 수행될 때, 컴퓨터로 판독 가능한 명령어들이 본 발명에 따른 운영 방법을 수행할 수 있다.Therefore, the adaptive knowledge base construction method according to the embodiment of the present invention may be implemented in a computer executable method. When the adaptive knowledge base construction method according to an embodiment of the present invention is performed in a computer device, computer readable instructions may perform the operation method according to the present invention.

한편, 상술한 본 발명에 따른 적응형 지식 베이스 구축 방법은 컴퓨터로 읽을 수 있는 기록매체에 컴퓨터가 읽을 수 있는 코드로서 구현되는 것이 가능하다. 컴퓨터가 읽을 수 있는 기록 매체로는 컴퓨터 시스템에 의하여 해독될 수 있는 데이터가 저장된 모든 종류의 기록 매체를 포함한다. 예를 들어, ROM(Read Only Memory), RAM(Random Access Memory), 자기 테이프, 자기 디스크, 플래시 메모리, 광 데이터 저장장치 등이 있을 수 있다. 또한, 컴퓨터로 판독 가능한 기록매체는 컴퓨터 통신망으로 연결된 컴퓨터 시스템에 분산되어, 분산방식으로 읽을 수 있는 코드로서 저장되고 실행될 수 있다.Meanwhile, the above-described adaptive knowledge base construction method according to the present invention can be implemented as computer readable codes on a computer readable recording medium. Computer-readable recording media include all kinds of recording media having data stored thereon that can be decrypted by a computer system. For example, there may be a read only memory (ROM), a random access memory (RAM), a magnetic tape, a magnetic disk, a flash memory, an optical data storage device, and the like. The computer readable recording medium can also be distributed over computer systems connected over a computer network, stored and executed as readable code in a distributed fashion.

도 2는 본 발명의 일실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도를 나타낸다.2 is a block diagram of an adaptive knowledge base construction system according to an embodiment of the present invention.

본 발명에 따른 적응형 지식 베이스 구축 시스템은 기계 학습 엔진; 규칙 생성기; 시맨틱 규칙 엔진; 지식 베이스;를 포함한다. 바람직하게는, 도메인 전문가의 규칙 모델러를 더 포함할 수 있다.An adaptive knowledge base building system according to the present invention comprises a machine learning engine; Rule generator; Semantic rule engine; Knowledge base; Preferably, it may further include a rule modeler of a domain expert.

본 발명에 따른 적응형 지식 베이스 구축 시스템은 시맨틱 모델링에 기반을 둔 지식 베이스 구축 환경과 기계 학습에 기반을 두어 학습된 규칙(rule) 기반을 두어 지식 베이스 구축 환경이 병행하여 운영하거나, 각 지식 베이스 구축 환경이 개별적으로 운영될 수 있다. 시맨틱 모델링에 기반을 둔 경우와, 기계 학습에 기반을 둔 경우는 기능적으로 분리될 수 있는 것으로, 전체 시스템을 고려하여, 분산 시스템으로 구축될 수 있다. The adaptive knowledge base construction system according to the present invention is based on a knowledge base construction environment based on semantic modeling and a rule base learned based on machine learning to operate a knowledge base construction environment in parallel, or to each knowledge base. The build environment can be operated individually. The case based on semantic modeling and the case based on machine learning can be functionally separated, and can be constructed as a distributed system in consideration of the entire system.

기계 학습 엔진(210)은 제1 데이터셋(260)에 대한 최적의 학습 모델을 생성하고, 생성된 학습 모델을 이용하여 제1 데이터셋을 학습하고, 학습된 결과를 토대로 데이터셋에 대한 연관관계 분석 결과를 학습된 모델 결과로서 출력한다. The machine learning engine 210 generates an optimal learning model for the first dataset 260, trains the first dataset using the generated learning model, and relates to the dataset based on the learned results. The analysis results are output as learned model results.

규칙 생성기(220)는 출력된 학습된 모델 결과를 이용하여 학습된 모델 및 학습된 알고리즘을 규칙화하여 규칙을 생성하고, 분석된 데이터 연관 관계에 대한 규칙도 생성한다. 생성된 규칙을 시맨틱 규칙 엔진(230)에 전달하거나, 생성된 규칙으로부터 시맨틱 규칙을 생성하여 시맨틱 규칙 엔진(230)에 전달하거나, 생성된 규칙으로부터 비정형 데이터를 생성하여 시맨틱 규칙 엔진(230)에 전달할 수 있다. 통상적으로 규칙 생성기(220)는 일반화된 규칙을 생성한다. 다만, 시맨틱 규칙 엔진의 API를 규칙 생성기(220)에 적용하여 직접 시맨틱 규칙을 생성할 수 있고, 이때, 시맨틱 규칙 엔진(230)의 역할을 생성한 시맨틱 규칙을 지식 베이스에 저장할지 여부 및 기존의 시맨틱 규칙에 확장 적용 여부를 결정하게 된다. The rule generator 220 generates a rule by regularizing the learned model and the learned algorithm using the output learned model result, and also generates a rule for the analyzed data correlation. The generated rule is passed to the semantic rule engine 230, or a semantic rule is generated from the generated rule and passed to the semantic rule engine 230, or the unstructured data is generated from the generated rule and passed to the semantic rule engine 230. Can be. Typically rule generator 220 generates a generalized rule. However, the semantic rule can be generated directly by applying the API of the semantic rule engine to the rule generator 220. At this time, whether to store the semantic rule that has generated the role of the semantic rule engine 230 in the knowledge base and existing You decide whether to apply the extension to semantic rules.

본 발명에 따른 적응형 지식 베이스 구축 시스템은 분산 처리 시스템으로 구축될 수 있고, 규칙 생성기(220) 및 시맨틱 규칙 엔진(230)은 분산 처리 시스템 구성 방법에 따라 별도의 서버에 설치될 수 있다. 규칙 생성기(220)에서 시맨틱 규칙을 생성하는지는 통상 각 서버의 로드에 따라 적절하게 선택할 수 있다. The adaptive knowledge base building system according to the present invention may be built as a distributed processing system, and the rule generator 220 and the semantic rule engine 230 may be installed in separate servers according to the distributed processing system configuration method. Whether the rule generator 220 generates the semantic rule can usually be appropriately selected according to the load of each server.

시맨틱 규칙 엔진(230)은 확장 엔진(inference engine)이라고도 하며, 시맨틱 규칙을 확장하는 역할을 한다. 시맨틱 규칙 엔진(230)은 규칙 생성기(220)로부터 전달받은 규칙 및 비정형 데이터를 토대로 시맨틱 규칙을 생성하고, 생성된 시맨틱 규칙을 지식 베이스에 저장한다. 또한, 시맨틱 규칙 엔진(230)은 규칙 생성기(220)로부터 전달받은 시맨틱 규칙을 지식 베이스에 저장한 것인지 저장하지 않을지를 결정한다.The semantic rule engine 230 is also called an inference engine, and serves to extend the semantic rules. The semantic rule engine 230 generates semantic rules based on the rules and the unstructured data received from the rule generator 220 and stores the generated semantic rules in the knowledge base. In addition, the semantic rule engine 230 determines whether or not the semantic rule received from the rule generator 220 is stored in the knowledge base.

또한, 시맨틱 규칙 엔진(230)은 제2 데이터셋(270)을 입력받고, 구축된 지식 베이스에 저장된 시맨틱 규칙을 확장할 수 있다. 제2 데이터셋(270)은 수작업으로 입력된 새로운 규칙 및 새로운 시맨틱 규칙을 포함할 수 있다. In addition, the semantic rule engine 230 may receive the second dataset 270 and expand the semantic rule stored in the built knowledge base. The second dataset 270 may include a new rule and a new semantic rule manually input.

도 3은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도를 나타낸다.3 is a block diagram of an adaptive knowledge base building system according to another embodiment of the present invention.

본 발명에 따른 적응형 지식 베이스 구축 방법은 (1) 제1 데이터셋에 대한 학습 모델을 생성하는 단계(S310); (2) 생성된 학습 모델을 이용하여 학습된 모델 결과를 생성 하는 단계; (3) 학습된 모델 결과를 이용하여 규칙을 생성하는 단계(S330); (4) 생성된 규칙을 이용하여 시맨틱 규칙을 생성하는 단계(S340); (5) 생성된 시맨틱 규칙을 저장하여 지식 베이스를 구축하는 단계(S350);를 포함한다.The adaptive knowledge base construction method according to the present invention comprises the steps of: (1) generating a learning model for a first dataset (S310); (2) generating trained model results using the generated learning model; (3) generating a rule using the learned model results (S330); (4) generating a semantic rule using the generated rule (S340); (5) storing the generated semantic rules to build a knowledge base (S350).

상기 (1) 단계에서는 기계 학습 엔진(210)이 제1 데이터 집합을 이용하여 학습하여 최적의 학습 모델을 생성한다. In step (1), the machine learning engine 210 learns using the first data set to generate an optimal learning model.

상기 (2) 단계에서는 기계 학습 엔진(210)은 최적의 분석 모델을 학습하고, 추론 단계에서 입력으로 제공되는 제1 데이터 집합에 대한 연관 관계 분석 결과를 출력한다.In the step (2), the machine learning engine 210 learns an optimal analysis model and outputs the correlation analysis result for the first data set provided as an input in the inference step.

상기 (3) 단계에서는 규칙 생성기(220)가 학습된 모델과 학습된 알고리즘을 규칙화하고, 학습된 모델과 학습된 알고리즘을 이용하여 분석된 데이터의 연관 관계도 규칙화한다.In step (3), the rule generator 220 rules the learned model and the learned algorithm, and also rules the correlation between the analyzed data using the learned model and the learned algorithm.

상기 (4) 단계에서는 상기 규칙 생성기(220)에 의해 생성된 일반적인 규칙을 시맨틱 규칙으로 변환한다. In the step (4), the general rule generated by the rule generator 220 is converted into a semantic rule.

상기 (5) 단계에서는 변환된 시맨틱 규칙을 기존에 저장된 시맨틱 규칙과 병합하여, 지식 베이스를 확장한다.In step (5), the transformed semantic rule is merged with previously stored semantic rules to expand the knowledge base.

상기 (1) 단계에서의 기계 학습을 이용하여 규칙을 생성하는 방법은 알고리즘에 따라 상이하다. 본 발명은 기계 학습을 통하여 규칙 생성에 필요한 지식을 획득하여, 획득된 지식도 규칙으로 다시 생성하고, 시맨틱 기술을 적용하여 최종적으로 시맨틱 규칙으로 변환하여 통합하는 방법을 제공한다. 상기 (1) 단계에서 사용되는 기계 학습 방법은 도 4 및 도 5에서 다시 설명하도록 한다. 다만, 이는 예시적인 것이며, 다른 알고리즘 또는 모델에 의하여 규칙을 생성할 수 있고, 발명의 범위를 제한하고자 하는 것이 아니다.The method for generating a rule using the machine learning in the step (1) is different according to the algorithm. The present invention provides a method of acquiring the knowledge required for rule generation through machine learning, regenerating the acquired knowledge into a rule, and finally converting the semantic rule into a semantic rule by applying semantic technology. The machine learning method used in step (1) will be described again with reference to FIGS. 4 and 5. However, this is merely exemplary, and rules may be generated by other algorithms or models, and are not intended to limit the scope of the invention.

도 4는 본 발명의 부분 실시예에 따른 트리 학습 모델 기반 적응형 지식 베이스 구축 방법을 설명하기 위한 예시도이다.4 is an exemplary diagram for describing a tree learning model-based adaptive knowledge base construction method according to an exemplary embodiment of the present invention.

도 4(a) 은 트리 기반의 규칙 생성 방법을 설명하기 위한 예시도이다.4A is an exemplary diagram for explaining a tree-based rule generation method.

도 4(a)는 입력 데이터로서 A, C가 존재하고, 학습 결과 B, C-1, C-2가 생성되는 것이다. 입력 데이터 A, C에 대하여 도 4(a)와 같이 트리 형태의 데이터 구조를 획득할 수 있다. 각 노드는 데이터가 입력되거나, 출력을 생성할 수 있다. 도 4(a)는 가장 간단한 예시를 제시하는 것이지만, 이진 트리 형태로 더 복잡하고 다양한 트리를 생성할 수 있으며, 각 노드에 대하여 규칙적으로 제어문을 생성하는 것도 가능하다. 4 (a) shows A and C as input data, and learning results B, C-1, and C-2 are generated. For the input data A and C, a tree data structure may be obtained as shown in FIG. Each node can enter data or generate output. Although FIG. 4 (a) shows the simplest example, more complex and various trees can be generated in the form of a binary tree, and control statements can be generated regularly for each node.

예컨대, 도 4(a)에서 생성된 규칙은 다음과 같다. A가 역치(threshold)보다 큰 경우 "1"을 출력한다. 그렇지 않다면, C가 yes라면 "2"를 출력한다. C가 "no"이면 "3"을 출력한다.For example, the rule generated in FIG. 4 (a) is as follows. If A is greater than the threshold, output "1". Otherwise, if C is yes, print "2". If C is "no", print "3".

도 4(a)의 순서도를 도 4(b)에 나타내었다. 트리 구조의 장점은 재귀적으로 알고리즘을 구성할 수 있다는 점이다. 도 4(b)는 도 4(a)를 설명하기 위하여 제시된 순서도에 불과하지만, 트리 구조를 이용하여, 단순한 규칙을 반복적으로 적용하여 입력되는 제1 데이터셋에 대한 복잡한 기계 학습 모델을 생성할 수 있는 장점이 있다. The flowchart of FIG. 4 (a) is shown in FIG. 4 (b). The advantage of the tree structure is that you can construct algorithms recursively. Although FIG. 4 (b) is only a flow chart presented to explain FIG. 4 (a), it is possible to generate a complex machine learning model for an input first dataset by repeatedly applying simple rules using a tree structure. There is an advantage.

도 5는 본 발명의 부분 실시예에 따른 Apriori 알고리즘을 적용한 적응형 지식 베이스 구축 방법을 설명하기 위한 예시도이다. 구체적으로 도 5는 Apriori 알고리즘을 토대로 연관 분석 결과(Association Analysis result)를 규칙(rule)로 생성하는 방법을 설명하기 위한 예시도이다.5 is an exemplary diagram for describing a method of constructing an adaptive knowledge base to which an Apriori algorithm according to an exemplary embodiment of the present invention is applied. In detail, FIG. 5 is an exemplary diagram for describing a method of generating an association analysis result as a rule based on an Apriori algorithm.

Apriori 알고리즘은 기계 학습 방법으로 널리 알려진 방법이나, 시맨틱 규칙을 생성하는데 적용하여도 좋은 성능을 낼 수 있다. The Apriori algorithm is widely known as a machine learning method, but can be applied to generate semantic rules.

도 5(a)에 나타난 숫자는 하기 표 1에 따른 품목(item)에 해당하는 것을 의미하고, A, B, C, D, E는 하나의 거래(transaction)를 의미하고, 하기 표 2는 각 거래에서 구입한 품목을 나타낸다.The numbers shown in FIG. 5 (a) mean items corresponding to Table 1 below, A, B, C, D, and E denote one transaction, and Table 2 below Represents an item purchased in a transaction.

품목 명칭Item name 품목 아이디Item ID soy milksoy milk 1One lettucelettuce 22 diapersdiapers 33 winewine 44 chardchard 55 orange juiceorange juice 66

거래 IDDeal ID 품목 리스트Item list 품목 아이디 리스트Item ID list AA Soy milk, lettuceSoy milk, lettuce 1, 21, 2 BB Lettuce, diapersLettuce, diapers 2, 32, 3 CC Soy milk, wineSoy milk, wine 1, 41, 4 DD Lettuce, soy milk, diapers, wineLettuce, soy milk, diapers, wine 2, 1, 3, 42, 1, 3, 4 EE Lettuce, soy milk, diapersLettuce, soy milk, diapers 2, 1, 32, 1, 3

상기 표 2를 참고하여 도 5(a)의 시맨틱 웹을 형성할 수 있다. 형성된 시맨틱 웹은 시맨틱 규칙으로 규칙화할 수 있다.Referring to Table 2, the semantic web of Figure 5 (a) can be formed. The formed semantic web may be ordered by semantic rules.

예컨대, 가장 아래에 있는 박스를 분석하면, 상기 거래 ID 중 1을 포함하는 것은 총 4개, 2를 포함하는 것은 총 4개, 3을 포함하는 것은 총 4개, 4를 포함하는 것은 총 3개인 것을 알 수 있다. 하기 표는 이를 정리한 것이다.For example, if the bottom box is analyzed, a total of 4 includes 1, 4 includes 2, 4 includes 3, 4 includes 3, and 3 includes 4 It can be seen that. The following table summarizes this.

품목 집합Item assembly 지지도Support {1}{One} 44 {2}{2} 44 {3}{3} 33 {4}{4} 22

이때, 최소 지지도를 4로 설정하면, {1}, {2}은 빈발 품목 집합으로 분류되고, {3}, {4}은 비빈발 품목 집합으로 분류된다. 도 5(a)에서는 빈발 품목 집합을 반전 시켜 표시하였다. In this case, if the minimum support is set to 4, {1} and {2} are classified as a frequent item set, and {3} and {4} are classified as a non-frequent item set. In FIG. 5 (a), the frequent item set is inverted.

마찬가지로, 원소의 개수가 2개인 품목 집합에 대하여 지지도를 연산할 수 있고, 이를 하기 표에 나타내었다.Similarly, support can be calculated for a set of items with two elements, which are shown in the table below.

품목 집합Item assembly 지지도Support {1,2}{1,2} 33 {1,3}{1,3} 22 {1,4}{1,4} 22 {2,3}{2,3} 33 {2,4}{2,4} 1One {3,4}{3,4} 1One

이때, 최소 지지도를 3으로 설정하면, {1,2}, {2,3}은 빈발 품목 집합이 되고, {1,3}, {1,4}, {2,4}, {3,4}는 비빈발 품목 집합으로 분류된다. 도 5(a)에서는 빈발 품목 집합을 반전 시켜 표시하였다. In this case, if the minimum support is set to 3, {1,2} and {2,3} become frequent item sets, and {1,3}, {1,4}, {2,4}, and {3,4 } Is classified as a set of non-frequent items. In FIG. 5 (a), the frequent item set is inverted.

마찬가지로, 원소의 개수가 3개인 품목 집합에 대하여 지지도를 연산할 수 있고, 이를 하기 표에 나타내었다.Similarly, support can be calculated for an item set of three elements, which are shown in the table below.

품목 집합Item assembly 지지도Support {1,2,3}{1,2,3} 22 {1,2,4}{1,2,4} 1One {1,3,4}{1,3,4} 1One {2,3,4}{2,3,4} 1One

이때, 최소 지지도를 2로 설정하면, {1,2,3}은 빈발 품목 집합이되고, {1,3}, {1,2,4}, {1,3,4}, {2,3,4}는 비빈발 품목 집합으로 분류된다. 도 5(a)에서는 빈발 품목 집합을 반전 시켜 표시하였다. At this time, if the minimum support is set to 2, {1,2,3} becomes a frequent item set, and {1,3}, {1,2,4}, {1,3,4}, {2,3 , And 4} are classified as non-frequent item sets. In FIG. 5 (a), the frequent item set is inverted.

마찬가지로, 원소의 개수가 4개인 품목 집합에 대하여 지지도를 연산하는 경우, {1,2,3,4}는 지지도가 1이므로, 최소 지지도가 1이라면, 빈발 품목 집합이 되고, 최소 지지도가 2라면, 비빈발 품목 집합이 된다. 도 5(a)에서는 최소 지지도가 1인 경우를 상정하였다.Similarly, when calculating the support for a set of 4 items, {1,2,3,4} has 1 support, so if the minimum support is 1, it is a frequent item set, and if the minimum support is 2, In other words, it becomes a set of non-frequent items. In FIG. 5 (a), the case where the minimum support degree is 1 is assumed.

통상 Apriori 알고리즘에서는 비빈발 품목을 가지치기(pruning)하고, 진행하여 연산 속도를 증가시킬 수 있는 기계 학습 방법이다. In general, the Apriori algorithm is a machine learning method that can prun non-frequent items and proceed to increase computation speed.

도 5(b)는 본 발명의 부분 실시예에 따른 Apriori 알고리즘을 적용하는 방법을 설명하기 위한 예시도이다. 구체적으로 도 5(b)는 도 5(a)에서 최소 지지도 미만의 노드에 대하여 가지치기(pruning)를 한 결과를 나타낸다. 5B is an exemplary diagram for describing a method of applying an Apriori algorithm according to an exemplary embodiment of the present invention. Specifically, FIG. 5 (b) shows the result of pruning the nodes below the minimum support in FIG. 5 (a).

전체 트리에서 모든 연산을 다 할 경우, 연산 횟수는 품목의 개수에 따라 지수적으로 증가하는데, 이를 컴퓨터가 연산 속도를 따라갈 수 없는 문제가 있다. 최소 지지도 이하의 노드에 대하여 가치지기(pruning)하고, 최소 지지도 이상의 노드에 대하여만 연산을 계속할 경우, 적은 횟수의 연산에 의하여, 노드 간의 의미관련 규칙(semantic rule)을 생성할 수 있다. 도 5(b)에서 점선으로 표시한 화살표는 가리치기 될 노드를 알려주고 있다.When all the operations are performed in the entire tree, the number of operations increases exponentially with the number of items, which causes a problem that the computer cannot keep up with the operation speed. If pruning is performed for nodes below the minimum support and the operation continues only for nodes above the minimum support, semantic rules between nodes can be generated by a small number of operations. Arrows indicated by dotted lines in FIG. 5 (b) indicate nodes to be pointed out.

도 6은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도를 나타낸다.6 is a block diagram of an adaptive knowledge base building system according to another embodiment of the present invention.

본 발명에 따른 적응형 지식 베이스 구축 시스템은 분산 처리 시스템으로 구성될 수 있다. The adaptive knowledge base building system according to the present invention may be configured as a distributed processing system.

본 발명에 따른 적응형 지식 베이스 구축 시스템은 기계 학습 규칙 생성 서버(610); 시맨틱 규칙 생성 서버(620); 및 규칙 모델러 서버(630);를 포함한다.An adaptive knowledge base building system according to the present invention includes a machine learning rule generation server 610; Semantic rule generation server 620; And a rule modeler server 630.

기계 학습 규칙 생성 서버(610)는 기계 학습 엔진(210); 및 규칙 생성기(220)를 포함한다. 기계 학습 규칙 생성 서버(610)는 반복적 재학습(repeatitive learning)을 통하여 동적 환경 문맥(context)을 반영한다. Machine learning rule generation server 610 includes a machine learning engine 210; And rule generator 220. The machine learning rule generation server 610 reflects the dynamic environment context through iterative relearning.

시맨틱 규칙 생성 서버(620)는 기계 학습 규칙으로부터 생성된 시맨틱 규칙을 이용하여 제2 데이터셋에 대한 시맨틱 추론을 통하여 지식 베이스를 확장한다. 또한, 기계 학습 엔진이 제2 데이터셋을 입력 데이터로 하여 예측한 기계 학습 규칙을 시맨틱 규칙 엔진에 전달하고, 시맨틱 추론을 통하여 지식 베이스를 확장할 수도 있다. 이에 관한 설명은 도 7 및 도 8에서 후술한다. The semantic rule generation server 620 extends the knowledge base through semantic inference on the second dataset using semantic rules generated from machine learning rules. In addition, the machine learning engine may transmit the machine learning rule predicted using the second data set as input data to the semantic rule engine, and expand the knowledge base through semantic inference. Description thereof will be described later with reference to FIGS. 7 and 8.

규칙 생성기(220)가 생성된 규칙을 시맨틱 규칙 엔진(240)이 시맨틱 규칙으로 변환하는데, 시맨틱 규칙 엔진은 새롭게 입력된 제2 데이터셋을 이용하여 시맨틱 규칙을 생성할 수 있다. 이때, 새롭게 입력된 제2 데이터셋에 대한 시맨틱 추론을 이용하여 지식 베이스를 확장할 수 있다. The rule generator 220 converts the generated rule into a semantic rule by the semantic rule engine 240. The semantic rule engine may generate a semantic rule by using a newly input second data set. In this case, the knowledge base may be extended by using semantic inference of the newly input second data set.

상기 시맨틱 규칙은 RDF 트리플 트리 구조 또는 OWL 구조 중 어느 하나일 수 있다. The semantic rule may be either an RDF triple tree structure or an OWL structure.

시맨틱 규칙 엔진(230)은 규칙 모델러(240)로부터 규칙 모델을 전달받아, 생성된 시맨틱 규칙을 수정할 수 있고, 기계 학습 규칙을 시맨틱 규칙으로 변환하는 방법을 변경할 수 있다. The semantic rule engine 230 may receive the rule model from the rule modeler 240, modify the generated semantic rule, and change a method of converting the machine learning rule into the semantic rule.

규칙 모델러 서버(630)은 도메인 전문가에 의하여 입력된 규칙 모델을 시맨틱 규칙 엔진에 전송하는 규칙 모델러를 포함한다.The rule modeler server 630 includes a rule modeler for transmitting the rule model input by the domain expert to the semantic rule engine.

규칙 모델러는 도메인 전문가가 손쉽게 규칙 모델을 입력할 수 있도록 편리한 사용자 인터페이스(User Interface)를 제공한다. 기계 학습 규칙 및 시맨틱 규칙을 손쉽게 생성할 수 있는 UI를 제공하고, 기계 학습 규칙 및 시맨틱 규칙을 연결하는 UI도 함께 제공한다. 또한 규칙 모델러는 기계 학습 엔진이 생성한 기계 학습 규칙 및 시맨틱 규칙 사이의 연결 관계를 지식 베이스에 저장하기 전에 열람하고, 수정할 수 있는 사용자 인터페이스를 제공한다. Rule Modeler provides a convenient user interface for domain experts to easily enter rule models. It provides a UI for easily creating machine learning rules and semantic rules, and also provides a UI that connects machine learning rules and semantic rules. The rule modeler also provides a user interface for viewing and modifying the relationship between machine learning rules and semantic rules generated by the machine learning engine before storing them in the knowledge base.

도 7은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 시스템의 구성도를 나타낸다.7 is a block diagram of an adaptive knowledge base building system according to another embodiment of the present invention.

도 6과 달리 새롭게 입력된 제2 데이터셋은 시맨틱 규칙 엔진이 아니라 기계 학습 엔진에 직접 입력될 수도 있다. 시맨틱 규칙 엔진에 기계 학습 모듈을 포함시켜 도 6과 같이 처리하는 것도 가능하나, 통상 오랜 시간을 필요로 하는 기계 학습 과정은 단축 시키기 위하여, 제2 데이터 집합을 제1 데이터 집합과 함께 기계 학습 엔진에 모두 입력하여, 기계 학습 과정에서 한번에 처리할 경우, 컴퓨터 자원을 효율적으로 사용하는 것이 가능하다. 본 발명에 따른 적응형 지식 베이스 구축 시스템은 분산 처리 시스템으로 구축할 수 있고, 통상 기계 학습 엔진은 다수의 서버에 병렬형 시스템으로 구축되고, 시맨틱 규칙 엔진은 소수의 서버에서 설치되는 경우, 제2 데이터셋을 기계 학습 엔진에 입력하는 방법이 자원을 효율적으로 분배할 수 있을 것이다. Unlike FIG. 6, the newly input second dataset may be directly input into the machine learning engine instead of the semantic rule engine. It is also possible to include the machine learning module in the semantic rule engine and process it as shown in FIG. 6, but in order to shorten the machine learning process that normally requires a long time, the second data set is combined with the first data set in the machine learning engine. When inputted and processed at a time in the machine learning process, it is possible to efficiently use computer resources. The adaptive knowledge base building system according to the present invention can be built as a distributed processing system. In general, the machine learning engine is built as a parallel system on a plurality of servers, and the semantic rule engine is installed on a few servers. Inputting datasets into machine learning engines can distribute resources efficiently.

본 발명의 부분 실시예에 따른 적응형 지식 베이스 구축 시스템은, 제1 데이터셋에 대한 학습 모델을 생성하고, 생성된 학습 모델을 이용하여 제1 데이터셋의 연관 관계를 분석하고, 제2 데이터셋에 대한 시맨틱 추론 모델을 생성하고, 생성된 시맨틱 추론 모델을 이용하여 제2 데이터셋을 이용하여 추론하고 예측 결과를 생성하는 기계 학습 엔진; 상기 기계 학습 엔진이 분석한 학습된 모델 결과로부터 기계 학습 규칙을 생성하고, 상기 추론된 예측 결과로부터 기계 학습 규칙을 생성하는 규칙 생성기; 규칙 생성기로부터 전달받은 기계 학습 규칙을 시맨틱 규칙으로 변환하고, 변환된 시맨틱 규칙을 저장하여 지식 베이스를 구축하는 시맨틱 규칙 엔진;을 포함한다. An adaptive knowledge base building system according to an embodiment of the present invention generates a learning model for a first dataset, analyzes an association of the first dataset using the generated learning model, and analyzes a second dataset. A machine learning engine for generating a semantic inference model for, inferring using a second dataset using the generated semantic inference model, and generating a prediction result; A rule generator for generating machine learning rules from the learned model results analyzed by the machine learning engine and generating machine learning rules from the inferred prediction results; And a semantic rule engine that converts the machine learning rules received from the rule generator into semantic rules and stores the converted semantic rules to build a knowledge base.

바람직하게는 도메인 전문가에 의하여 시맨틱 규칙 생성 방법을 변경하는 규칙 모델러를 더 포함할 수 있다.Preferably, the domain expert may further include a rule modeler for changing the semantic rule generation method.

필요에 따라 시맨틱 규칙뿐만 아니라, 기계 학습 규칙도 확장될 수 있지만, 기계 학습 규칙 자체는 결정적인 특성이 있어서, 실제 환경이 동적으로 변화하는 환경에서 적응적으로 기계 학습 규칙이 변경될 필요성이 있지만, 기계 학습 엔진을 변경하는 것은 효율적이지 못한 문제가 있다. 규칙 모델러는 기계 학습 규칙을 확장하는 대신, 기계 학습 규칙을 시맨틱 규칙으로 변환하는 방법을 변경함으로써, 동적으로 변화하는 환경에서 적절한 변환 방법을 선택할 수 있도록 도움을 준다. 도메인 전문가는 본 발명에 따른 적응형 지식 베이스 구축 시스템의 구조를 알고 있는 자는 아니나, 시맨틱 규칙에 대한 정보에 대하여 해박한 지식을 가진 자이므로, 생성된 시맨틱 규칙을 지식 베이스에 곧바로 저장하는 대신 도메인 전문가에 의하여 채택 여부를 선택하게 할 수 있고, 이를 토대로 규칙 모델러가 작동할 수 있다. 규칙 모델러는 별도의 학습 과정을 통하여 기계 학습 규칙을 시맨틱 규칙으로 변환하는 방법을 결정하여 시맨틱 규칙 엔진에 전달하고, 시맨틱 규칙 엔진은 규칙 모델러로부터 전달받은 변환 방법을 이용하여 지식 베이스를 구축할 수 있다.Not only semantic rules, but also machine learning rules can be extended as needed, machine learning rules themselves are decisive, so machine learning rules need to be changed adaptively in environments where the actual environment is dynamically changing. There is a problem with changing the learning engine. Instead of extending the machine learning rules, the rule modeler changes the way that machine learning rules are translated into semantic rules, helping to choose the appropriate transformation method in a dynamically changing environment. The domain expert is not a person who knows the structure of the adaptive knowledge base construction system according to the present invention, but a person who has a thorough knowledge about the information on the semantic rules, so that the domain expert does not store the generated semantic rules directly in the knowledge base. It allows you to choose whether or not to adopt, and on this basis the rule modeler can work. The rule modeler determines how to convert machine learning rules into semantic rules through a separate learning process, and delivers them to the semantic rule engine. The semantic rule engine can build a knowledge base using the transformation method received from the rule modeler. .

시맨틱 규칙 엔진은 규칙 생성기로부터 전달받은 기계 학습 규칙과, 제2 데이터셋과, 규칙 모델러로부터 전달 받은 규칙 모델을 토대로 지식 베이스를 구축한다. 이러한 과정은 곧바로 진행되는 것이 아니라, 임시로 각각의 데이터를 저장하고, 적절한 주기로 지식 베이스를 구축하게 된다. 시맨틱 규칙 엔진은 본 발명의 특성에 따라 제1 데이터셋의 입력 순서, 제2 데이터셋의 입력 순서, 규칙 모델이 입력되는 순서 모두에 의하여 구축되는 지식 베이스도 상이해지는 특성이 있다. The semantic rule engine builds a knowledge base based on the machine learning rules passed from the rule generator, the second dataset, and the rule models passed from the rule modeler. This process does not proceed immediately, but temporarily stores each data and builds a knowledge base at appropriate intervals. The semantic rule engine has a characteristic that a knowledge base constructed by all of an input order of a first data set, an input order of a second data set, and an order of inputting a rule model is different according to the characteristics of the present invention.

그러나, 현실적으로 시간에 따라 입력되는 데이터가 달라질 경우, 그 분석 또한 상이해지는 것이 통상적인 것이다. 예컨대, IoT(Internet of Thing) 환경 하에서, 집안의 보일러를 작동하는 경우, 보일러가 작동되는 시간에 따라 기계 학습 내용은 상이해질 수 있다. 즉, 본 발명에 따른 적응형 지식 베이스 구축 시스템은 동적으로 변환하는 환경에서 더욱 강인한 특성이 있다.In reality, however, if the input data varies over time, the analysis is also common. For example, when operating a boiler in a house under an Internet of Thing (IoT) environment, the machine learning contents may be different according to the time when the boiler is operated. In other words, the adaptive knowledge base construction system according to the present invention has more robust characteristics in an environment for dynamically converting.

도 8은 본 발명의 다른 실시예에 따른 적응형 지식 베이스 구축 방법의 절차 흐름도를 나타낸다.8 is a flowchart illustrating a method of building an adaptive knowledge base according to another embodiment of the present invention.

본 발명의 부분 실시예에 따른 적응형 지식 베이스 구축 방법은, 제2 데이터셋에 대한 시맨틱 추론 모델을 생성하는 단계; 제2 데이터셋에 생성된 시맨틱 추론 모델을 적용한 추론을 하고 예측 결과를 생성하는 단계; 상기 추론된 예측 결과로부터 기계 학습 규칙을 생성하는 단계; 기계 학습 규칙을 시맨틱 규칙으로 변환하는 단계; 변환된 시맨틱 규칙을 저장하여 지식 베이스를 구축하는 단계;를 포함한다. An adaptive knowledge base construction method, in accordance with some embodiments of the invention, includes generating a semantic inference model for a second dataset; Inferring the semantic inference model generated in the second dataset and generating a prediction result; Generating a machine learning rule from the inferred prediction result; Converting machine learning rules to semantic rules; And storing the transformed semantic rules to build a knowledge base.

도 3의 설명과 달리 도 8의 경우, 제1 데이터셋을 입력 데이터로 하여 분석한 지식 베이스가 이미 구축된 경우, 제2 데이터셋을 시맨틱 규칙 엔진이 아닌 기계 학습 엔진에 입력하여 시맨틱 추론이 이루어진다는 점에서 차이가 있다. 제2 데이터셋이 시맨틱 규칙 엔진에 직접 입력될 경우, 구축된 지식 베이스를 이용하여 시맨틱 추론이 이루어질 수 있는 장점이 있으나, 통상 기계 학습 엔진은 고사양의 서버를 요하므로, 기계 학습 엔진의 컴퓨팅 파워가 통상 더 좋은 경우가 많다. 기계 학습 엔진에 제2 데이터셋을 입력하는 경우, 제2 데이터셋을 포함하여 학습 모델을 적용하기 쉽고, 고사양의 서버를 이용하여 빠르게 제2 데이터셋에 대한 추론을 진행할 수 있는 장점이 있다. 미리 추론에 이용된 데이터는 시맨틱 규칙 엔진에 다시 입력되어 구축된 지식 베이스를 이용하여 확인하고, 지식 베이스를 확장할 수 있다. In contrast to the description of FIG. 3, in case of FIG. 8, when a knowledge base analyzed by using the first data set as input data is already constructed, the semantic inference is performed by inputting the second data set into the machine learning engine instead of the semantic rule engine. There is a difference in that. When the second dataset is directly input to the semantic rule engine, semantic inference can be performed using the constructed knowledge base. However, since the machine learning engine generally requires a high-end server, the computing power of the machine learning engine is increased. Often it is better. In the case of inputting the second dataset into the machine learning engine, it is easy to apply the learning model including the second dataset, and there is an advantage of rapidly inferring the second dataset using a high specification server. The data used for the reasoning in advance can be confirmed by using the knowledge base built by being input back into the semantic rule engine and expanding the knowledge base.

이상, 본 발명의 구성에 대하여 첨부 도면을 참조하여 상세히 설명하였으나, 이는 예시에 불과한 것으로서, 본 발명이 속하는 기술 분야에 통상의 지식을 가진 자라면 본 발명의 기술적 사상의 범위 내에서 다양한 변형과 변경이 가능함은 물론이다. 따라서 본 발명의 보호 범위는 전술한 실시예에 국한되어서는 아니 되며 이하의 특허청구범위의 기재에 의하여 정해져야 할 것이다.In the above, the configuration of the present invention has been described in detail with reference to the accompanying drawings, which are merely examples, and those skilled in the art to which the present invention pertains have various modifications and changes within the scope of the technical idea of the present invention. Of course this is possible. Therefore, the protection scope of the present invention should not be limited to the above-described embodiment but should be defined by the description of the claims below.

100: 컴퓨터 시스템
110: 프로세서
120: 메모리
123: ROM
126: RAM
130: 데이터 통신 버스
140: 저장소
150: 사용자 입력 장치
160: 사용자 출력 장치
170: 네트워크 인터페이스
180: 네트워크
210: 기계 학습 엔진
220: 규칙 생성기
230: 시맨틱 규칙 엔진
240: 규칙 모델러
250: 도메인 전문가
260: 제1 데이터셋
270: 제2 데이터셋
280: 지식 베이스
S310: 학습 모델 생성 단계
S320: 학습된 모델 결과 생성 단계
S330: 기계 학습 규칙 생성 단계;
S340: 시맨틱 규칙 생성 단계;
S350: 지식 베이스 구축 단계;
S360: 제2 데이터셋을 이용한 시맨틱 추론 단계;
S370: 지식 베이스 확장 단계;
610: 기계 학습 규칙 생성 서버
620: 시맨틱 규칙 생성 서버
630: 규칙 모델러 서버
S810: 예측 모델 생성 단계;
S820: 예측 결과 생성 단계;
S830: 예측 결과의 규칙 생성 단계;
S840: 시맨틱 규칙 변환 단계;
S850: 지식 베이스 확장 단계;100: computer system
110: processor
120: memory
123: ROM
126: RAM
130: data communication bus
140: storage
150: user input device
160: user output device
170: network interface
180: network
210: machine learning engine
220: rule generator
230: semantic rule engine
240: rule modeler
250: domain expert
260: first data set
270: second dataset
280: Knowledge Base
S310: Learning Model Generation Steps
S320: Generate trained model results
S330: generating machine learning rule;
S340: generating semantic rule;
S350: building a knowledge base;
S360: Semantic inference using the second dataset;
S370: expanding the knowledge base;
610: Machine learning rule generation server
620: semantic rule generation server
630: Rule Modeler server
S810: predictive model generation step;
S820: generating a prediction result;
S830: generating rules of the prediction result;
S840: Semantic Rule Conversion Step;
S850: expanding the knowledge base;

Claims

A machine learning engine that generates a learning model based on the input first data set and analyzes an association relationship with the first data set according to the generated learning model;
A rule generator for generating a rule based on the learned model and algorithm; And
A semantic rule generator for generating semantic rules using the generated rules and the second data set;
Adaptive knowledge base building system comprising a.

The method of claim 1,
The semantic rule generator,
Generating rules by any of tree-based rule generation algorithms and apriori algorithms
Adaptive knowledge base building system.

The method of claim 1,
Further comprising a rule modeler for changing the semantic rule generation method by the domain expert
Adaptive knowledge base building system.

The method of claim 1,
The semantic rule generator,
Periodically expanding the knowledge base using temporarily stored machine learning rules, the second dataset, and rule models
Adaptive knowledge base building system.

In the adaptive knowledge base construction system,
A memory storing a program for providing an adaptive knowledge base building module; And
Including a processor for executing the program,
As the processor executes the program,
Generate a learning model for the input first dataset, output the correlation analysis result of the first data set according to the generated training model, and machine learning based on the learned model, the learned algorithm and the correlation analysis result. Generating a rule, and generating semantic rules using the generated machine learning rule
Adaptive knowledge base building system.

The method of claim 5,
The processor,
Generating semantic rules using rules generated by either RDF or OWL and the second data set
Adaptive knowledge base building system.

The method of claim 5,
The processor,
Generating a learning model and an inference model for the first dataset and the second dataset
Adaptive knowledge base building system.

The method of claim 5,
The processor,
Generating semantic rules using a rule model input by a domain expert
Adaptive knowledge base building system.

(1) machine learning an input first data set by a machine learning engine;
(2) the rule generator generating a rule based on the learned result;
(3) generating a semantic rule using the rule generated by the rule generator and the second data set; And
(4) constructing a knowledge base by storing the generated semantic rules by the semantic rule engine;
Adaptive knowledge base construction method comprising a.

The method of claim 9,
Providing, by a rule modeler, a model for generating a rule by a domain expert to generate the semantic rule;
Adaptive knowledge base construction method further comprising.

The method of claim 9,
Generating the semantic rule,
The rule generator generating a semantic rule using a rule generated by one of RDF and OWL and a second data set
Adaptive knowledge base construction method.

The method of claim 9,
Generating the semantic rule,
The rule generator generating a rule by any one of a tree-based rule generation algorithm and an apriori algorithm
Adaptive knowledge base construction method.

The method of claim 12,
The apriori algorithm,
The rule generator pruning the segmentation stage of each node according to the degree of support
How to build an adaptive knowledge base.

An adaptive knowledge base construction method for constructing a knowledge base using a machine learning engine and a semantic rule engine,
Generating a semantic inference model for the second dataset;
Inferring the semantic inference model generated in the second dataset and generating a prediction result;
Generating a machine learning rule from the inferred prediction result;
Converting machine learning rules to semantic rules; And
Storing the transformed semantic rules to expand the knowledge base;
Adaptive knowledge base construction method comprising a.

The method of claim 14,
Converting the machine learning rule into semantic rule,
Rules model provided by domain experts to determine how to translate machine learning rules into semantic rules
Adaptive knowledge base construction method.

The method of claim 14,
Converting the machine learning rule into semantic rule,
Generating semantic rules by either a tree structure model or an apriori algorithm
Adaptive knowledge base construction method.