KR20200000541A

KR20200000541A - Method for training artificial neurel network

Info

Publication number: KR20200000541A
Application number: KR1020180072497A
Authority: KR
Inventors: 송기영; 강형신
Original assignee: 주식회사 수아랩
Priority date: 2018-06-25
Filing date: 2018-06-25
Publication date: 2020-01-03
Also published as: WO2020005049A1

Abstract

Disclosed is an information processing method, which is performed by at least one processor. The information processing method, which is performed by at least one processor, according to an embodiment of the present invention comprises the steps of: receiving first data; generating second data based on the first data by using a first computational model including a plurality of sub-models connected in series, wherein the sub-models include at least one layer for changing a dimension; and generating a cost function based on at least two pieces of the first data, the second data, and a plurality of sub-data generated by the plurality of sub-models.

Description

Learning method of artificial neural network {METHOD FOR TRAINING ARTIFICIAL NEUREL NETWORK}

본 개시는 인공 신경망에 관한 것으로, 보다 구체적으로 인공 신경망의 훈련 방법 및 훈련된 신경망을 통한 데이터 분류에 관한 것이다. TECHNICAL FIELD This disclosure relates to artificial neural networks, and more particularly, to a method of training artificial neural networks and data classification through trained neural networks.

패턴 인식(pattern recognition)은 머신 러닝(machine learning)의 일 분야로서, 패턴 및 데이터의 규칙성을 인식하는 학문을 의미한다. 패턴 인식 기술은 감독 학습(supervised learning) 및 비감독 학습(unsupervised learning) 방법을 포함한다. 감독 학습 방법은 이미 패턴 인식의 결과가 결정되어 있는 데이터("트레이닝" 데이터 라고 한다)를 이용하여 알고리즘이 패턴 인식을 학습하는 방법을 의미한다.Pattern recognition is a field of machine learning, and refers to the science of recognizing the regularity of patterns and data. Pattern recognition techniques include supervised learning and unsupervised learning methods. Supervised learning means a method in which an algorithm learns pattern recognition using data for which the result of pattern recognition has already been determined (called "training" data).

패턴 인식 기술을 구현하기 위해 다양한 방법이 제시되고 있으며, 미국 등록 특허 8,873,856 는 이러한 패턴 인식 기술의 일 예를 도시한다.Various methods have been proposed to implement a pattern recognition technique, and US Patent 8,873,856 shows an example of such a pattern recognition technique.

그러나 패턴 인식 알고리즘의 종류와 방법에 무관하게, 상기 트레이닝 데이터가 다양하고 개수가 많을수록 양질의 학습이 수행되는 바, 트레이닝 데이터를 광범위하게 다량 확보하기 위한 기술이 요구된다. 특히, 상기 데이터가 제조과정에서 발생하는 상이한 패턴 상 결함인 경우, 동일 제조 과정 하에 발생할 수 있는 결함은 상이한 패턴 상에서도 유사한 모양을 띠는 경향이 있음에도 불구하고, 상이한 패턴 별로 결함 데이터를 별개의 작업을 통해 획득해야하는 불편함이 존재한다. However, regardless of the type and method of the pattern recognition algorithm, the more various and the more the training data, the higher quality learning is performed, so a technique for securing a large amount of training data is required. In particular, when the data are different pattern defects occurring in the manufacturing process, defects that may occur under the same manufacturing process tend to have a similar shape on different patterns, but the defect data may be processed separately for different patterns. There is discomfort that must be obtained through.

이에 따라, 확보하기 용이한 정상 데이터만으로 구성된 트레이닝 데이터로 다양한 패턴 상의 결함을 효율적으로 검출할 수 있는 기술에 대한 당업계의 요구가 존재한다.Accordingly, there is a need in the art for a technology capable of efficiently detecting defects on various patterns with training data composed of only normal data that is easy to secure.

미국특허 등록번호 US9,141,916B1United States Patent Registration No. US9,141,916B1 미국특허 공개번호 US2008/0222135A1United States Patent Publication No. US2008 / 0222135A1

본 개시는 전술한 배경기술에 대응하여 안출된 것으로 인공 신경망의 훈련 방법 및 훈련된 신경망을 통한 데이터 분류를 제공하기 위한 것이다.The present disclosure is devised in response to the above-described background, and is to provide a method for training an artificial neural network and data classification through the trained neural network.

전술한 과제를 해결하기 위한 본 개시의 일 실시예들에 따라, 하나 이상의 프로세서에서 수행되는 정보 처리 방법이 개시된다. 상기 정보 처리 방법은: 프로세서에 의해, 제1 데이터를 수신하는 단계; 프로세서에 의해, 직렬로 연결된 복수의 서브 모델을 포함하는 제1 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 2 데이터를 생성하는 단계-상기 서브 모델은 차원을 변경하기 위한 적어도 하나의 레이어를 포함함-; 프로세서에 의해, 상기 제 1 데이터, 상기 제 2 데이터, 및 상기 복수의 서브 모델에 의해 생성된 복수의 서브 데이터 중 적어도 둘에 기초한 비용함수(cost function)를 생성하는 단계; 를 포함할 수 있다.In accordance with one embodiment of the present disclosure for solving the above problems, an information processing method performed in one or more processors is disclosed. The information processing method comprises: receiving, by a processor, first data; Generating, by a processor, second data based on the first data using a first computational model comprising a plurality of submodels connected in series, wherein the submodel includes at least one layer for changing dimensions box-; Generating, by a processor, a cost function based on at least two of the first data, the second data, and a plurality of sub-data generated by the plurality of sub-models; It may include.

대안적인 실시예에서, 상기 제 2 데이터를 생성하는 단계는: 프로세서에 의해, 제 1 서브 모델에 포함되는 제 1 감소 레이어를 이용하여 상기 제 1 데이터의 차원을 감소시키고, 상기 제 1 서브 모델에 포함되는 제 1 복구 레이어를 이용하여 상기 제 1 데이터의 감소된 차원을 복구함으로써 제 1 서브 데이터를 생성하는 단계; 및 프로세서에 의해, 제 2 서브 모델에 포함되는 제 2 감소 레이어를 이용하여 상기 제 1 서브 데이터의 차원을 감소시키고, 상기 제 2 서브 모델에 포함되는 제 2 복구 레이어를 이용하여 상기 제 1 서브 데이터의 감소된 차원을 복구함으로써 상기 제 2 데이터를 생성하는 단계; 를 포함할 수 있다.In an alternative embodiment, generating the second data comprises: reducing, by a processor, the dimension of the first data using a first reduction layer included in a first submodel, Generating first sub data by recovering the reduced dimension of the first data using an included first recovery layer; And reduce, by a processor, the dimension of the first sub data using a second reduction layer included in a second sub-model, and use the first sub data using a second recovery layer included in the second sub-model. Generating the second data by recovering a reduced dimension of the; It may include.

대안적인 실시예에서, 상기 정보 처리 방법은 프로세서에 의해, 상기 비용함수에 기초하여 상기 복수의 서브 모델 중 적어도 하나를 트레이닝시키는 단계; 를 포함할 수 있다.In an alternative embodiment, the information processing method further comprises: training, by a processor, at least one of the plurality of submodels based on the cost function; It may include.

대안적인 실시예에서, 상기 비용함수는: In an alternate embodiment, the cost function is:

상기 제 1 데이터, 상기 제 2 데이터, 및 상기 복수의 서브 데이터 중 적어도 둘의 관계에 기초한 항(term)을 포함할 수 있다.And a term based on a relationship between at least two of the first data, the second data, and the plurality of sub-data.

대안적인 실시예에서, 상기 비용함수는: 상기 제 1 데이터와 상기 제 2 데이터의 관계에 기초한 항, 및 상기 제 1 데이터와 상기 제 1 서브 데이터의 관계에 기초한 항을 포함할 수 있다.In an alternative embodiment, the cost function may include: a term based on the relationship between the first data and the second data, and a term based on the relationship between the first data and the first sub data.

대안적인 실시예에서, 상기 제 1 데이터는: 정상 데이터로 이루어진 트레이닝 데이터(traing data)를 포함할 수 있다.In an alternative embodiment, the first data may comprise: training data consisting of normal data.

대안적인 실시예에서, 상기 제 1 데이터는: 이미지 데이터를 포함할 수 있다.In an alternative embodiment, the first data may comprise: image data.

대안적인 실시예에서, 상기 복수의 서브 모델 중 적어도 하나는 뉴럴 네트워크(neural networks)를 포함할 수 있다.In alternative embodiments, at least one of the plurality of submodels may comprise neural networks.

대안적인 실시예에서, 상기 뉴럴 네트워크는: 오토 인코더(auto-encoder) 또는 GAN(Generative Adversarial Networks)를 포함할 수 있다.In an alternative embodiment, the neural network may comprise an auto-encoder or GAN (Generative Adversarial Networks).

대안적인 실시예에서, 상기 정보 처리 방법은 프로세서에 의해, 상기 제 1 연산 모델의 적어도 일부를 공유하는 제 2 연산 모델을 생성하는 단계; 를 더 포함할 수 있다.In an alternative embodiment, the information processing method comprises: generating, by a processor, a second computational model that shares at least a portion of the first computational model; It may further include.

대안적인 실시예에서, 상기 제 2 연산 모델은: 제 1 감소 레이어 및 제 2 복구 레이어의 적어도 일부를 공유할 수 있다.In an alternate embodiment, the second computational model may: share at least a portion of the first reduction layer and the second recovery layer.

본 개시의 다른 일 실시예에 따라, 하나 이상의 프로세서에서 수행되는 정보 처리 방법이 개시된다. 상기 정보 처리 방법은: 프로세서에 의해, 제 1 데이터를 수신하는 단계; 프로세서에 의해, 직렬로 연결된 복수의 서브 모델을 포함하는 제 1 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 2 데이터를 생성하는 단계-상기 서브 모델은 차원을 변경하는 적어도 하나의 레이어를 포함함-; 프로세서에 의해, 상기 제 1 데이터 및 상기 제 2 데이터에 기초하여 테스트 결과 정보를 생성하는 단계; 를 포함할 수 있다.In accordance with another embodiment of the present disclosure, an information processing method performed in one or more processors is disclosed. The information processing method comprises: receiving, by a processor, first data; Generating, by a processor, second data based on the first data using a first computational model comprising a plurality of submodels connected in series, wherein the submodel includes at least one layer that changes dimensions -; Generating, by a processor, test result information based on the first data and the second data; It may include.

본 개시의 또 다른 일 실시예에 따라, 하나 이상의 프로세서에서 수행되는 정보 처리 방법이 개시된다. 상기 정보 처리 방법은: 프로세서에 의해, 제 1 데이터를 수신하는 단계; 프로세서에 의해, 제 1 연산 모델의 적어도 일부를 공유하는 제 2 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 3 데이터를 생성하는 단계; 프로세서에 의해, 상기 제 1 데이터 및 상기 제 3 데이터에 기초하여 테스트 결과 정보를 생성하는 단계; 를 포함할 수 있다.In accordance with another embodiment of the present disclosure, an information processing method performed in one or more processors is disclosed. The information processing method comprises: receiving, by a processor, first data; Generating, by a processor, third data based on the first data using a second computational model that shares at least a portion of the first computational model; Generating, by a processor, test result information based on the first data and the third data; It may include.

본 개시의 또 다른 일 실시예에 따라, 컴퓨팅 장치가 개시된다. 상기 컴퓨팅 장치는: 하나 이상의 프로세서; 및 상기 하나 이상의 프로세서에서 실행가능한 명령들을 저장하는 메모리; 를 포함하고, 상기 하나 이상의 프로세서는, 제 1 데이터를 수신하는 단계; 직렬로 연결된 복수의 서브 모델을 포함하는 제 1 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 2 데이터를 생성하는 단계-상기 서브 모델은 차원을 변경하는 적어도 하나의 레이어를 포함함-; 및 상기 제 1 데이터, 상기 제 2 데이터, 및 상기 복수의 서브 모델에 의해 생성된 복수의 서브 데이터에 적어도 부분적으로 기초한 비용함수(cost function)를 생성하는 단계; 을 수행할 수 있다.In accordance with yet another embodiment of the present disclosure, a computing device is disclosed. The computing device comprises: one or more processors; And a memory storing instructions executable in the one or more processors; Wherein the one or more processors comprise: receiving first data; Generating second data based on the first data using a first computational model comprising a plurality of sub-models connected in series, the sub-model comprising at least one layer that changes dimensions; And generating a cost function based at least in part on the first data, the second data, and the plurality of sub-data generated by the plurality of sub-models; Can be performed.

본 개시의 또 다른 일 실시예에 따라, 컴퓨팅 장치가 개시된다. 상기 컴퓨팅 장치는: 하나 이상의 프로세서; 및 상기 하나 이상의 프로세서에서 실행가능한 명령들을 저장하는 메모리; 를 포함하고, 상기 하나 이상의 프로세서는, 제 1 데이터를 수신하는 단계; 직렬로 연결된 복수의 서브 모델을 포함하는 제 1 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 2 데이터를 생성하는 단계-상기 서브 모델은 차원을 변경하는 적어도 하나의 레이어를 포함함-; 및 상기 제 1 데이터 및 상기 제 2 데이터에 기초하여 테스트 결과 정보를 생성하는 단계; 를 수행할 수 있다.In accordance with yet another embodiment of the present disclosure, a computing device is disclosed. The computing device comprises: one or more processors; And a memory storing instructions executable in the one or more processors; Wherein the one or more processors comprise: receiving first data; Generating second data based on the first data using a first computational model comprising a plurality of sub-models connected in series, the sub-model comprising at least one layer that changes dimensions; Generating test result information based on the first data and the second data; Can be performed.

본 개시의 또 다른 일 실시예에 따라, 컴퓨팅 장치가 개시된다. 컴퓨팅 장치는: 하나 이상의 프로세서; 및 상기 하나 이상의 프로세서에서 실행가능한 명령들을 저장하는 메모리; 를 포함하고, 상기 하나 이상의 프로세서는, 제 1 데이터를 수신하는 단계; 제 1 연산 모델의 적어도 일부를 공유하는 제 2 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 3 데이터를 생성하는 단계; 및 상기 제 1 데이터 및 상기 제 3 데이터에 기초하여 테스트 결과 정보를 생성하는 단계; 를 수행할 수 있다.In accordance with yet another embodiment of the present disclosure, a computing device is disclosed. The computing device includes: one or more processors; And a memory storing instructions executable in the one or more processors; Wherein the one or more processors comprise: receiving first data; Generating third data based on the first data using a second computational model that shares at least a portion of a first computational model; Generating test result information based on the first data and the third data; Can be performed.

본 개시에서 얻을 수 있는 기술적 해결 수단은 이상에서 언급한 해결 수단들로 제한되지 않으며, 언급하지 않은 또 다른 해결 수단들은 아래의 기재로부터 본 개시가 속하는 기술분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.The technical solutions obtained in the present disclosure are not limited to the above-mentioned solutions, and other solutions not mentioned above are clearly described to those skilled in the art from the following description. It can be understood.

본 개시는 전술한 배경기술에 대응하여 안출된 것으로 인공 신경망의 훈련 방법 및 훈련된 신경망을 통한 데이터 분류를 제공할 수 있다.The present disclosure devised in response to the above-described background art can provide a training method for artificial neural networks and data classification through trained neural networks.

다양한 양상들이 이제 도면들을 참조로 기재되며, 여기서 유사한 참조 번호들은 총괄적으로 유사한 구성요소들을 지칭하는데 이용된다. 이하의 실시예에서, 설명 목적을 위해, 다수의 특정 세부사항들이 하나 이상의 양상들의 총체적 이해를 제공하기 위해 제시된다. 그러나, 그러한 양상(들)이 이러한 구체적인 세부사항들 없이 실시될 수 있음은 명백할 것이다. 다른 예시들에서, 공지의 구조들 및 장치들이 하나 이상의 양상들의 기재를 용이하게 하기 위해 블록도 형태로 도시된다.
도 1 은 본 개시의 일 실시예에 따라 정보 처리 방법을 수행하는 컴퓨팅 장치의 블록 구성도를 도시한다.
도 2는 본 개시의 일 실시예 따라 인공 뉴럴 네트워크의 일부를 나타낸 개략도이다.
도 3은 본 개시의 일 실시예에 따라 본 발명의 정보 처리 방법을 구현하기 위한 연산 모델의 개략도이다.
도 4는 본 개시의 일 실시예에 따라 제 1 연결 모델을 트레이닝를 수행하기 위한 정보 처리 방법의 단계를 설명하기 위한 순서도이다.
도 5는 본 개시의 일 실시예에 따라 제 1 연결 모델을 이용하여 테스트를 수행하기 위한 정보 처리 방법의 단계를 설명하기 위한 순서도이다.
도 6은 본 개시의 일 실시예에 따라 제 2 연결 모델을 이용하여 테스트를 수행하기 위한 정보 처리 방법의 단계를 설명하기 위한 순서도이다.
도 7은 본 개시의 일 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도를 도시한다.Various aspects are now described with reference to the drawings, wherein like reference numerals are used to refer to like components throughout. In the following examples, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of one or more aspects. However, it will be apparent that such aspect (s) may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to facilitate describing one or more aspects.
1 illustrates a block diagram of a computing device performing an information processing method according to an embodiment of the present disclosure.
2 is a schematic diagram illustrating a portion of an artificial neural network in accordance with an embodiment of the present disclosure.
3 is a schematic diagram of a computational model for implementing the information processing method of the present invention in accordance with one embodiment of the present disclosure.
FIG. 4 is a flowchart for describing steps of an information processing method for training a first connection model according to an embodiment of the present disclosure.
FIG. 5 is a flowchart for describing steps of an information processing method for performing a test using a first connection model according to an embodiment of the present disclosure.
FIG. 6 is a flowchart for describing steps of an information processing method for performing a test using a second connection model according to an embodiment of the present disclosure.
7 shows a brief general schematic diagram of an example computing environment in which embodiments of the present disclosure may be implemented.

다양한 실시예들이 이제 도면을 참조하여 설명된다. 본 명세서에서, 다양한 설명들이 본 개시의 이해를 제공하기 위해서 제시된다. 그러나 이러한 실시예들은 이러한 구체적인 설명 없이도 실행될 수 있음이 명백하다. 다른 예들에서, 공지된 구조 및 장치들은 실시예들의 설명을 용이하게 하기 위해서 블록 다이어그램 형태로 제공된다.Various embodiments are now described with reference to the drawings. In this specification, various descriptions are presented to provide an understanding of the present disclosure. It is evident, however, that such embodiments may be practiced without these specific details. In other instances, well-known structures and devices are provided in block diagram form in order to facilitate describing the embodiments.

본 명세서에서 사용되는 용어 "컴포넌트", "모듈", "시스템" 등은 컴퓨터-관련 엔티티, 하드웨어, 펌웨어, 소프트웨어, 소프트웨어 및 하드웨어의 조합, 또는 소프트웨어의 실행을 지칭한다. 예를 들어, 컴포넌트는 프로세서상에서 실행되는 처리과정(procedure), 프로세서, 객체, 실행 스레드, 프로그램, 및/또는 컴퓨터일 수 있지만, 이들로 제한되는 것은 아니다. 예를 들어, 컴퓨팅 장치에서 실행되는 애플리케이션 및 컴퓨팅 장치 모두 컴포넌트일 수 있다. 하나 이상의 컴포넌트는 프로세서 및/또는 실행 스레드 내에 상주할 수 있고, 일 컴포넌트는 하나의 컴퓨터 내에 로컬화될 수 있고, 또는 2개 이상의 컴퓨터들 사이에 분배될 수 있다. 또한, 이러한 컴포넌트들은 그 내부에 저장된 다양한 데이터 구조들을 갖는 다양한 컴퓨터 판독가능한 매체로부터 실행할 수 있다. 컴포넌트들은 예를 들어 하나 이상의 데이터 패킷들을 갖는 신호(예를 들면, 로컬 시스템, 분산 시스템에서 다른 컴포넌트와 상호작용하는 하나의 컴포넌트로부터의 데이터 및/또는 신호를 통해 다른 시스템과 인터넷과 같은 네트워크를 통해 전송되는 데이터)에 따라 로컬 및/또는 원격 처리들을 통해 통신할 수 있다. As used herein, the terms “component”, “module”, “system” and the like refer to computer-related entities, hardware, firmware, software, a combination of software and hardware, or the execution of software. For example, a component may be, but is not limited to being, a process running on a processor, a processor, an object, a thread of execution, a program, and / or a computer. For example, both an application running on a computing device and the computing device can be a component. One or more components can reside within a processor and / or thread of execution, and a component can be localized within one computer or distributed between two or more computers. In addition, these components can execute from various computer readable media having various data structures stored thereon. The components may, for example, via a network such as the Internet and other systems via signals with one or more data packets (e.g., data and / or signals from one component interacting with other components in a local system, distributed system). Data transmitted) may be communicated via local and / or remote processes.

더불어, "포함한다" 및/또는 "포함하는"이라는 용어는, 해당 특징 및/또는 구성요소가 존재함을 의미하지만, 하나 이상의 다른 특징, 구성요소 및/또는 이들의 그룹의 존재 또는 추가를 배제하지 않는 것으로 이해되어야 한다. 또한, 달리 특정되지 않거나 단수 형태를 지시하는 것으로 문맥상 명확하지 않은 경우에, 본 명세서와 청구범위에서 단수는 일반적으로 "하나 또는 그 이상"을 의미하는 것으로 해석되어야 한다.In addition, the terms "comprises" and / or "comprising" mean that such features and / or components are present, but exclude the presence or addition of one or more other features, components, and / or groups thereof. It should be understood that it does not. Also, unless otherwise specified or in the context of indicating a singular form, the singular in the specification and claims should generally be interpreted as meaning "one or more."

제시된 실시예들에 대한 설명은 본 개시의 기술 분야에서 통상의 지식을 가진 자가 본 개시를 이용하거나 또는 실시할 수 있도록 제공된다. 이러한 실시예들에 대한 다양한 변형들은 본 개시의 기술 분야에서 통상의 지식을 가진 자에게 명백할 것이며, 여기에 정의된 일반적인 원리들은 본 개시의 범위를 벗어남이 없이 다른 실시예들에 적용될 수 있다. 그리하여, 본 개시는 여기에 제시된 실시예들로 한정되는 것이 아니라, 여기에 제시된 원리들 및 신규한 특징들과 일관되는 최광의의 범위에서 해석되어야 할 것이다.The description of the presented embodiments is provided to enable any person skilled in the art to make or use the present disclosure. Various modifications to these embodiments will be apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the scope of the present disclosure. Thus, the present disclosure should not be limited to the embodiments presented herein but should be construed in the broadest scope consistent with the principles and novel features presented herein.

도 1 은 본 개시의 일 실시예에 따라 컴퓨팅 장치의 블록 구성도를 도시한다.1 illustrates a block diagram of a computing device in accordance with one embodiment of the present disclosure.

본 개시의 일 실시예에 따른 컴퓨팅 장치(100)는 프로세서(110) 및 메모리(120)를 포함할 수 있다. 도 1에 도시된 블록 구성도는 컴퓨팅 장치를 간소화한 구성으로 표현한 것으로서, 본 개시는 이에 제한되지 않고, 구동에 필요한 추가적인 구성 요소들을 포함할 수 있다. Computing device 100 according to an embodiment of the present disclosure may include a processor 110 and a memory 120. The block diagram shown in FIG. 1 is a simplified configuration of a computing device, and the present disclosure is not limited thereto and may include additional components required for driving.

프로세서(110)는 하나 이상의 코어로 구성될 수 있으며, 컴퓨팅 장치의 중앙 처리 장치(CPU: central processing unit), 범용 그래픽 처리 장치(GPGPU: general purpose graphics processing unit), 텐서 처리 장치(TPU: tensor processing unit) 등의 데이터 분석, 머신 러닝을 위한 프로세서를 포함할 수 있으며, 전술한 프로세서의 종류는 예시일 뿐이며 본 개시는 이에 제한되지 않는다. 프로세서(110)는 메모리(130)에 저장된 컴퓨터 프로그램을 판독하여 본 개시의 일실시예에 따른 정보 처리 방법을 수행할 수 있다. 본 개시의 일실시예에 따라 프로세서(110)는 신경망의 학습을 위한 계산을 수행할 수 있다. 본 명세서에서, 학습 또는 트레이닝은 동일한 의미로 사용될 수 있다. 프로세서(110)는 머신 러닝(machine learning)에서 학습을 위한 입력 데이터의 처리, 입력 데이터에서의 피쳐(feature) 추출, 오류 계산, 역전파(backpropagation)를 이용한 신경망의 가중치 업데이트 등의 신경망의 학습을 위한 계산을 수행할 수 있다. 프로세서(110)의CPU, GPGPU, 및 TPU 중 적어도 하나가 연산 모델의 학습을 처리할 수 있다. 예를 들어, CPU 와 GPGPU가 함께 네트워크 함수의 학습, 연산 모델을 이용한 데이터 분류를 처리할 수 있다. 또한, 본 개시의 일 실시예에서 복수의 컴퓨팅 장치의 프로세서를 함께 사용하여 연산 모델의 학습, 연산 모델을 이용한 데이터 분류를 처리할 수 있다. 또한, 본 개시의 일 실시예에 따른 컴퓨팅 장치(100)에서 수행되는 컴퓨터 프로그램은 CPU, GPGPU 또는 TPU에 의해 실행가능한 프로그램일 수 있다. The processor 110 may include one or more cores, and may include a central processing unit (CPU), a general purpose graphics processing unit (GPGPU), and a tensor processing unit (TPU) of a computing device. and a processor for data analysis, machine learning, and the like, and the above-described types of processors are merely examples and the present disclosure is not limited thereto. The processor 110 may read a computer program stored in the memory 130 to perform an information processing method according to an embodiment of the present disclosure. According to an embodiment of the present disclosure, the processor 110 may perform calculation for learning a neural network. In this specification, learning or training may be used in the same sense. The processor 110 performs neural network learning such as processing input data for learning in machine learning, feature extraction from input data, error calculation, and weight update of neural networks using backpropagation. To perform the calculation. At least one of the CPU, the GPGPU, and the TPU of the processor 110 may process the learning of the computational model. For example, the CPU and GPGPU can work together to learn network functions and classify data using computational models. In addition, in an embodiment of the present disclosure, a processor of a plurality of computing devices may be used together to process learning of a calculation model and data classification using the calculation model. In addition, the computer program executed in the computing device 100 according to an embodiment of the present disclosure may be a program executable by a CPU, a GPGPU, or a TPU.

본 개시의 일 실시예에서 컴퓨팅 장치(100)는 CPU, GPGPU, 및 TPU 중 적어도 하나를 이용하여 연산 모델을 분산하여 처리할 수 있다. 또한 본 개시의 일 실시예에서 컴퓨팅 장치(100)는 다른 컴퓨팅 장치와 함께 연산 모델을 분산하여 처리할 수 있다.In an embodiment of the present disclosure, the computing device 100 may distribute and process a computational model using at least one of a CPU, a GPGPU, and a TPU. In addition, in an embodiment of the present disclosure, the computing device 100 may distribute and process a computational model along with other computing devices.

메모리(110)는 본 개시의 일 실시예에 따른 정보 처리 방법을 수행하기 위한 컴퓨터 프로그램을 저장할 수 있으며, 저장된 컴퓨터 프로그램은 프로세서(110)에 의하여 판독되어 구동될 수 있다.The memory 110 may store a computer program for performing the information processing method according to an embodiment of the present disclosure, and the stored computer program may be read and driven by the processor 110.

도 2는 본 개시의 일 실시예 따라 인공 신경망의 일부를 나타낸 개략도이다.2 is a schematic diagram illustrating a portion of an artificial neural network according to an embodiment of the present disclosure.

본 명세서에 걸쳐, 연산 모델, 신경망, 네트워크 함수, 뉴럴 네트워크(neural network)는 동일한 의미로 사용될 수 있다. 신경망은 일반적으로 "노드"라 지칭될 수 있는 상호 연결된 계산 단위들의 집합으로 구성될 수 있다. 이러한 "노드"들은 "뉴런(neuron)"들로 지칭될 수도 있다. 신경망은 적어도 하나 이상의 노드들을 포함하여 구성된다. 신경망들을 구성하는 노드(또는 뉴런)들은 하나 이상의“링크”에 의해 상호 연결될 수 있다.Throughout this specification, computational models, neural networks, network functions, neural networks may be used in the same sense. A neural network may consist of a set of interconnected computing units, which may generally be referred to as "nodes." Such "nodes" may be referred to as "neurons". The neural network comprises at least one node. Nodes (or neurons) that make up neural networks may be interconnected by one or more “links”.

신경망 내에서, 링크를 통해 연결된 하나 이상의 노드들은 상대적으로 입력 노드 및 출력 노드의 관계를 형성할 수 있다. 입력 노드 및 출력 노드의 개념은 상대적인 것으로서, 하나의 노드에 대하여 출력 노드 관계에 있는 임의의 노드는 다른 노드와의 관계에서 입력 노드 관계에 있을 수 있으며, 그 역도 성립할 수 있다. 상술한 바와 같이, 입력 노드 대 출력 노드 관계는 링크를 중심으로 생성될 수 있다. 하나의 입력 노드에 하나 이상의 출력 노드가 링크를 통해 연결될 수 있으며, 그 역도 성립할 수 있다. Within a neural network, one or more nodes connected via a link may form a relationship of input node and output node relatively. The concept of an input node and an output node is relative; any node in an output node relationship to one node may be in an input node relationship in relation to another node, and vice versa. As mentioned above, the input node to output node relationship can be created around the link. One or more output nodes can be connected to a single input node via a link, and vice versa.

하나의 링크를 통해 연결된 입력 노드 및 출력 노드 관계에서, 출력 노드는 입력 노드에 입력된 데이터에 기초하여 그 값이 결정될 수 있다. 여기서 입력 노드와 출력 노드를 상호 연결하는 노드는 가중치(weight)를 가질 수 있다. 가중치는 가변적일 수 있으며, 신경망이 원하는 기능을 수행하기 위해, 사용자 또는 알고리즘에 의해 가변될 수 있다. 예를 들어, 하나의 출력 노드에 하나 이상의 입력 노드가 각각의 링크에 의해 상호 연결된 경우, 출력 노드는 상기 출력 노드와 연결된 입력 노드들에 입력된 값들 및 각각의 입력 노드들에 대응하는 링크에 설정된 가중치에 기초하여 출력 노드 값을 결정할 수 있다.In an input node and output node relationship connected via one link, the output node may be determined based on data input to the input node. Here, the node interconnecting the input node and the output node may have a weight. The weight may be variable, and may be varied by the user or algorithm in order for the neural network to perform the desired function. For example, if one or more input nodes are interconnected by each link to one output node, the output node is set to the values input to the input nodes associated with the output node and to the links corresponding to the respective input nodes. The output node value may be determined based on the weight.

상술한 바와 같이, 신경망은 하나 이상의 노드들이 하나 이상의 링크를 통해 상호 연결되어 신경망 내에서 입력 노드 및 출력 노드 관계를 형성한다. 신경망 내에서 노드들과 링크들의 개수 및 노드들과 링크들 사이의 연관관계, 링크들 각각에 부여된 가중치의 값에 따라, 신경망의 특성이 결정될 수 있다. 예를 들어, 동일한 개수의 노드 및 링크들이 존재하고, 링크들 사이의 가중치 값이 상이한 두 신경망이 존재하는 경우, 두 개의 신경망들은 서로 상이한 것으로 인식될 수 있다.As discussed above, a neural network is formed by interconnecting one or more nodes through one or more links to form an input node and an output node relationship within the neural network. The characteristics of the neural network may be determined according to the number of nodes and links in the neural network, the relationship between the nodes and the links, and the value of the weight assigned to each of the links. For example, if there are the same number of nodes and links, and there are two neural networks with different weight values between the links, the two neural networks may be recognized as different from each other.

신경망은 하나 이상의 노드들을 포함하여 구성될 수 있다. 신경망을 구성하는 노드들 중 일부는, 최초 입력 노드로부터의 거리들에 기초하여, 하나의 레이어(layer)를 구성할 수 있다, 예를 들어, 최초 입력 노드로부터 거리가 n인 노드들의 집합은, n 레이어를 구성할 수 있다. 최초 입력 노드로부터 거리는, 최초 입력 노드로부터 해당 노드까지 도달하기 위해 거쳐야 하는 링크들의 최소 개수에 의해 정의될 수 있다. 그러나, 이러한 레이어의 정의는 설명을 위한 임의적인 것으로서, 신경망 내에서 레이어의 차수는 상술한 것과 상이한 방법으로 정의될 수 있다. 예를 들어, 노드들의 레이어는 최종 출력 노드로부터 거리에 의해 정의될 수도 있다.The neural network may comprise one or more nodes. Some of the nodes that make up the neural network may construct one layer based on distances from the original input node, for example, a set of nodes with a distance n from the original input node, You can configure n layers. The distance from the original input node may be defined by the minimum number of links that must pass to reach the node from the original input node. However, the definition of such a layer is arbitrary for explanation, and the order of the layer in the neural network may be defined in a manner different from that described above. For example, a layer of nodes may be defined by distance from the final output node.

최초 입력 노드는 신경망 내의 노드들 중 다른 노드들과의 관계에서 링크를 거치지 않고 데이터가 직접 입력되는 하나 이상의 노드들을 의미할 수 있다. 또는, 신경망 네트워크 내에서, 링크를 기준으로 한 노드 간의 관계에 있어서, 링크로 연결된 다른 입력 노드들 가지지 않는 노드들을 의미할 수 있다. 이와 유사하게, 최종 출력 노드는 신경망 내의 노드들 중 다른 노드들과의 관계에서, 출력 노드를 가지지 않는 하나 이상의 노드들을 의미할 수 있다. 또한, 히든 노드는 최초 입력 노드 및 최후 출력 노드가 아닌 신경망을 구성하는 노드들을 의미할 수 있다. 본 개시의 일 실시예에 따른 신경망은 입력 레이어의 노드의 개수가 출력 레이어의 노드의 개수와 동일할 수 있으며, 입력 레이어에서 히든 레이어로 진행됨에 따라 노드의 수가 감소하다가 다시 증가하는 형태의 신경망일 수 있다. 또한, 본 개시의 다른 일 실시예에 따른 신경망은 입력 레이어의 노드의 개수가 출력 레이어의 노드의 개수 보다 적을 수 있으며, 입력 레이어에서 히든 레이어로 진행됨에 따라 노드의 수가 감소하는 형태의 신경망일 수 있다. 또한, 본 개시의 또 다른 일 실시예에 따른 신경망은 입력 레이어의 노드의 개수가 출력 레이어의 노드의 개수보다 많을 수 있으며, 입력 레이어에서 히든 레이어로 진행됨에 따라 노드의 수가 증가하는 형태의 신경망일 수 있다. 본 개시의 또 다른 일 실시예에 따른 신경망은 상술한 신경망들의 조합된 형태의 신경망일 수 있다.The initial input node may refer to one or more nodes into which data is directly input without going through a link in relation to other nodes of the neural network. Alternatively, in a neural network network, in a relationship between nodes based on a link, it may mean nodes having no other input nodes connected by a link. Similarly, the final output node may refer to one or more nodes that do not have an output node in relation to other nodes of the nodes in the neural network. Also, the hidden node may refer to nodes constituting a neural network other than the first input node and the last output node. The neural network according to an embodiment of the present disclosure may have the same number of nodes in the input layer as the number of nodes in the output layer, and the number of nodes decreases and then increases again as the node progresses from the input layer to the hidden layer. Can be. In addition, the neural network according to another embodiment of the present disclosure may be a neural network in which the number of nodes in the input layer is smaller than the number of nodes in the output layer and the number of nodes decreases as the input layer progresses from the input layer to the hidden layer. have. In addition, the neural network according to another embodiment of the present disclosure may have a larger number of nodes in the input layer than the number of nodes in the output layer, and the number of nodes increases as the number of nodes increases from the input layer to the hidden layer. Can be. The neural network according to another embodiment of the present disclosure may be a neural network in a combined form of the neural networks described above.

딥 뉴럴 네트워크(DNN: deep neural network, 심층신경망)는 입력레이어와 출력 레이어 외에 복수의 히든 레이어를 포함하는 신경망을 의미할 수 있다. 딥 뉴럴 네트워크를 이용하면 데이터의 잠재적인 구조(latent structures)를 파악할 수 있다. 즉, 사진, 글, 비디오, 음성, 음악의 잠재적인 구조(예를 들어, 어떤 물체가 사진에 잇는지, 글의 내용과 감정이 무엇인지, 음성의 내용과 감정이 무엇인지 등)를 파악할 수 있다. 딥 뉴럴 네트워크는 컨벌루셔널 뉴럴 네트워크(CNN: convolutional neural network), 리커런트 뉴럴 네트워크(RNN: recurrentneural network), 오토 인코더(auto encoder), GAN(Generative Adversarial Networks), 제한 볼츠만 머신(RBM: restricted boltzmann machine), 심층 신뢰 네트워크(DBN: deep belief network), Q 네트워크, U 네트워크, 샴 네트워크 등을 포함할 수 있다.A deep neural network (DNN) may refer to a neural network including a plurality of hidden layers in addition to an input layer and an output layer. Deep neural networks can be used to identify potential structures in your data. In other words, you can identify the potential structures of photos, texts, videos, voices, and music (e.g., what objects are in the picture, what are the content and feelings of the text, what are the contents and feelings of the voice, etc.) . Deep neural networks include convolutional neural networks (CNNs), recurrentneural networks (RNNs), auto encoders, Generic Adversarial Networks (GAN), restricted boltzmann (RBM) machine), deep belief network (DBN), Q network, U network, Siamese network and the like.

뉴럴 네트워크는 교사 학습(supervised learning), 비교사 학습(unsupervised learning), 및 반교사학습(semi supervised learning) 중 적어도 하나의 방식으로 학습될 수 있다. 뉴럴 네트워크의 학습은 출력의 오류를 최소화하기 위한 것이다. 뉴럴 네트워크의 학습에서 반복적으로 학습 데이터를 뉴럴 네트워크에 입력시키고 학습 데이터에 대한 뉴럴 네트워크의 출력과 타겟의 에러를 계산하고, 에러를 줄이기 위한 방향으로 뉴럴 네트워크의 에러를 뉴럴 네트워크의 출력 레이어에서부터 입력 레이어 방향으로 역전파(backpropagation)하여 뉴럴 네트워크의 각 노드의 가중치를 업데이트 하는 과정이다. 교사 학습의 경우 각각의 학습 데이터에 정답이 라벨링되어있는 학습 데이터를 사용하며(즉, 라벨링된 학습 데이터), 비교사 학습의 경우는 각각의 학습 데이터에 정답이 라벨링되어 있지 않을 수 있다. 즉, 예를 들어 데이터 분류에 관한 교사 학습의 경우의 학습 데이터는 학습 데이터 각각에 카테고리가 라벨링 된 데이터 일 수 있다. 라벨링된 학습 데이터가 뉴럴 네트워크에 입력되고, 뉴럴 네트워크의 출력(카테고리)과 학습 데이터의 라벨이 비교함으로써 오류(error)가 계산될 수 있다. 다른 예로, 데이터 분류에 관한 비교사 학습의 경우 입력인 학습 데이터가 뉴럴 네트워크 출력과 비교됨으로써 오류가 계산될 수 있다. 계산된 오류는 뉴럴 네트워크에서 역방향(즉, 출력 레이어에서 입력 레이어 방향)으로 역전파 되며, 역전파에 따라 뉴럴 네트워크의 각 레이어의 각 노드들의 연결 가중치가 업데이트 될 수 있다. 업데이트 되는 각 노드의 연결 가중치는 학습률(learing rate)에 따라 변화량이 결정될 수 있다. 입력 데이터에 대한 뉴럴 네트워크의 계산과 에러의 역전파는 학습 사이클(epoch)을 구성할 수 있다. 학습률은 뉴럴 네트워크의 학습 사이클의 반복 횟수에 따라 상이하게 적용될 수 있다. 예를 들어, 뉴럴 네트워크의 학습 초기에는 높은 학습률을 사용하여 뉴럴 네트워크가 빠르게 일정 수준의 성능을 확보하도록 하여 효율성을 높이고, 학습 후기에는 낮은 학습률을 사용하여 정확도를 높일 수 있다.The neural network may be trained in at least one of supervised learning, unsupervised learning, and semi supervised learning. Training of neural networks is intended to minimize errors in the output. In the neural network learning, the neural network error is inputted from the output layer of the neural network in order to repeatedly input the training data into the neural network, calculate the error of the neural network output and target for the training data, and reduce the error. The process of updating the weight of each node of the neural network by backpropagation in the direction. In the case of teacher learning, the learning data using the correct answer is labeled in each learning data (that is, the labeled learning data), and in the case of the comparative learning, the correct answer may not be labeled in each learning data. That is, for example, the learning data in the case of teacher learning regarding data classification may be data in which a category is labeled in each of the learning data. The labeled training data is input to the neural network, and an error can be calculated by comparing the label of the training data with the output (category) of the neural network. As another example, in the case of non-comparative learning on data classification, an error may be calculated by comparing learning data as an input with a neural network output. The calculated error is propagated backward in the neural network (ie, the direction from the output layer to the input layer), and the connection weight of each node of each layer of the neural network may be updated according to the reverse propagation. The connection weight of each node to be updated may be determined according to a learning rate. The computation of the neural network for the input data and the backpropagation of the errors can constitute a learning cycle. The learning rate may be applied differently according to the number of repetitions of the learning cycle of the neural network. For example, in the early stages of neural network learning, high learning rates can be used to allow the neural network to quickly achieve a certain level of performance, increasing efficiency, and lower learning rates for later learning.

뉴럴 네트워크의 학습에서 일반적으로 학습 데이터는 실제 데이터(즉, 학습된 뉴럴 네트워크를 이용하여 처리하고자 하는 데이터)의 부분집합일 수 있으며, 따라서, 학습 데이터에 대한 오류는 감소하나 실제 데이터에 대해서는 오류가 증가하는 학습 사이클이 존재할 수 있다. 과적합(overfitting)은 이와 같이 학습 데이터에 과하게 학습하여 실제 데이터에 대한 오류가 증가하는 현상이다. 예를 들어, 노란색 고양이를 보여 고양이를 학습한 뉴럴 네트워크가 노란색 이외의 고양이를 보고는 고양이임을 인식하지 못하는 현상이 과적합의 일종일 수 있다. 과적합은 머신러닝 알고리즘의 오류를 증가시키는 원인으로 작용할 수 있다. 이러한 과적합을 막기 위하여 다양한 최적화 방법이 사용될 수 있다. 과적합을 막기 위해서는 학습 데이터를 증가시키거나, 레귤라이제이션(regulaization), 학습의 과정에서 네트워크의 노드 일부를 생략하는 드롭아웃(dropout) 등의 방법이 적용될 수 있다.In the learning of neural networks, training data can generally be a subset of the actual data (ie, the data to be processed using the trained neural network), thus reducing the error for the training data but not for the actual data. There may be an increasing learning cycle. Overfitting is a phenomenon in which errors on the actual data are increased by excessively learning the training data. For example, a neural network that learns a cat by showing a yellow cat may not recognize that the cat is a cat other than a yellow cat. Overfitting can be a cause of increased errors in machine learning algorithms. Various optimization methods can be used to prevent such overfitting. In order to prevent overfitting, a method of increasing learning data, regulating, or dropping out of a node of a network in the course of learning may be applied.

머신러닝 및 딥러닝 기본구조에 대한 구체적인 내용에 대한 설명은 본 출원에서 전체가 참조로서 통합되는 미국 특허번호 US9,141,916B1 (출원일: 2013년05월14일)에서 구체적으로 논의된다.Details of the machine learning and deep learning frameworks are discussed in detail in US Pat. No. US 9,141,916B1 (filed May 14, 2013), which is hereby incorporated by reference in its entirety.

또한, 비교사 학습 방법에 대한 구체적인 내용에 대한 설명은 본 출원에서 전체가 참조로서 통합되는 미국특허 공개번호 US2008/0222135A1 (출원일: 2007년09월11일)에서 구체적으로 논의된다.In addition, a description of the details of the method for learning the comparative company is specifically discussed in US Patent Publication No. US2008 / 0222135A1 (filed September 11, 2007), which is hereby incorporated by reference in its entirety.

도 3은 본 개시의 일 실시예에 따라 본 발명의 정보 처리 방법을 구현하기 위한 연산 모델의 개략도이다.3 is a schematic diagram of a computational model for implementing the information processing method of the present invention in accordance with one embodiment of the present disclosure.

본 개시의 정보 처리 방법은 컴퓨팅 장치(100)의 하나 이상의 프로세서(110)에 의하여 수행될 수 있다. 본 개시의 컴퓨팅 장치(100)의 하나 이상의 프로세서(110)는 본 개시의 연산 모델의 연산 프로세스를 수행할 수 있다. The information processing method of the present disclosure may be performed by one or more processors 110 of the computing device 100. One or more processors 110 of computing device 100 of the present disclosure may perform a computational process of the computational model of the present disclosure.

본 개시의 정보 처리 방법의 모든 연산 프로세스(즉, 연산 모델의 학습, 피쳐 추출, 피쳐 비교 등)은 컴퓨팅 장치(100)의 프로세서에 의해 수행될 수 있다. 즉, 연산 모델에서 데이터를 처리한다는 표현은 컴퓨팅 장치(100)의 프로세서(110)가 연산 모델을 실행하여 데이터를 처리하는 과정을 의미할 수 있다. All computational processes (ie, learning of computational models, feature extraction, feature comparison, etc.) of the information processing method of the present disclosure may be performed by a processor of the computing device 100. That is, the expression processing data in the computational model may mean a process in which the processor 110 of the computing device 100 processes the data by executing the computational model.

본 개시의 연산 모델(200, 300)은 데이터의 분류를 위해 사용될 수 있다. 예를 들어, 본 개시의 연산 모델(200, 300)은 어노말리 디택션(Anomaly Detection)에 사용될 수 있다.The computational models 200, 300 of the present disclosure can be used for classification of data. For example, the computational models 200 and 300 of the present disclosure can be used for anomaly detection.

어노말리 데이터는 데이터의 정상 패턴에서 벗어나는 비정상 데이터를 의미할 수 있다. 데이터는 비정형 패턴을 가질 수 있으며 어노말리 데이터는 이러한 비정형 패턴에서 벗어나는 데이터를 의미할 수 있다. 예를 들어, 생산 공정에서 생산 제품의 이미지에 관한 데이터는 정상 제품이라는 비정형 패턴을 가질 수 있으며, 어노말리 데이터는 정상 제품이라는 비정형 패턴에서 벗어난 데이터(즉, 예를 들어 불량 제품의 이미지 등)일 수 있다. 본 개시의 정상 데이터, 비정형 패턴, 어노말리 데이터에 관한 기재는 예시일 뿐이며 본 개시는 이에 제한되지 않는다.The anomaly data may refer to abnormal data that deviates from a normal pattern of data. The data may have an atypical pattern and the anomaly data may refer to data that deviates from this atypical pattern. For example, in a production process, data about an image of a product may have an atypical pattern called a normal product, and anomaly data may be data that deviates from the atypical pattern of a normal product (ie, an image of a bad product, for example). Can be. The description of the normal data, the atypical pattern, and the anomaly data of the present disclosure is merely an example and the present disclosure is not limited thereto.

프로세서(110)는 제 1 데이터를 수신할 수 있다. 예를 들어, 프로세서(110)는 메모리(120) 또는 다른 저장매체에 저장된 제 1 데이터를 수신할 수 있다. 제 1 연산 모델(200)은 제 1 데이터를 입력으로 사용할 수 있다. 즉, 프로세서(110)는 제 1 데이터를 제 1 연산 모델를 이용하여 처리할 수 있다. 이 경우에, 프로세서(110)는 제 1 연산 모델을 통해 제 1 데이터에 기초한 제 2 데이터를 출력으로 생성할 수 있다. The processor 110 may receive the first data. For example, the processor 110 may receive first data stored in the memory 120 or another storage medium. The first computational model 200 may use the first data as an input. In other words, the processor 110 may process the first data using the first calculation model. In this case, the processor 110 may generate, as an output, second data based on the first data through the first calculation model.

제 1 데이터는 연산 모델(200, 300)의 트레이닝 데이터일 수 있다. 또한, 제 1 데이터는 연산 모델(200, 300)을 통해 분류되기 위한 데이터일 수 있다. 예를 들면, 제 1 데이터는 어노말리(anomaly)의 판단의 대상이 되는 데이터일 수 있다. The first data may be training data of the computational models 200 and 300. In addition, the first data may be data to be classified through the calculation models 200 and 300. For example, the first data may be data that is subject to anomaly determination.

본 개시의 제 1 데이터는 정상 데이터로 이루어진 트레이닝 데이터(training data)를 포함할 수 있다. 본 개시의 프로세서(110)는 정상 데이터로 이루어진 트레이닝 데이터를 이용하여 연산 모델을 트레이닝시킬 수 있다. The first data of the present disclosure may include training data consisting of normal data. The processor 110 of the present disclosure may train a computational model using training data consisting of normal data.

구체적으로 설명하면, 본 개시의 제 1 연산 모델에 포함되는 서브 모델은 차원을 변경하기 위한 적어도 하나의 레이어(예를 들어, 감소 레이어 및 복구 레이어)를 포함할 수 있다. 이 경우에, 입력 데이터가 차원이 변경되어 다시 복구되는 과정에서 오류가 발생할 수 있다. 프로세서(110)는 하나의 입력 데이터로 상술한 과정을 복수 회 수행하여 발생한 오류들을 이용하여 제 1연산 모델을 학습시킬 수 있다. 구체적으로, 도 2를 참조하면, 프로세서(110)는 입력인 제 1 데이터에서 피쳐를 추출하여 다시 복구한 제 1 서브 데이터 및 제 1 서브 데이터에서 피쳐를 추출하여 다시 복구한 제 2 데이터를 이용하여 제 1 연산 모델을 트레이닝할 수 있다. 따라서, 본 개시의 정보 처리 방법은 하나의 입력 데이터를 이용하여 복수의 출력 데이터를 생성할 수 있다. 또한, 생성된 복수의 출력 데이터는 트레이닝 데이터로 학습에 이용될 수 있어, 본 개시의 정보 처리 방법은 트레이닝 데이터가 적은 경우에도 연산 모델의 효율적인 트레이닝이 가능하다.Specifically, the submodel included in the first computational model of the present disclosure may include at least one layer (eg, a reduction layer and a recovery layer) for changing the dimension. In this case, an error may occur while the input data is changed in dimension and recovered again. The processor 110 may train the first operation model using errors generated by performing the above-described process a plurality of times with one input data. Specifically, referring to FIG. 2, the processor 110 may use the first sub data extracted from the first data, which is the input data, and the recovered data, and the second data extracted from the first sub data, and the second data recovered. The first computational model can be trained. Thus, the information processing method of the present disclosure can generate a plurality of output data using one input data. In addition, the generated plurality of output data may be used for training as training data, and the information processing method of the present disclosure enables efficient training of a computational model even when the training data is small.

본 개시의 제 1 연산 모델(200)은 직렬로 연결된 복수의 서브 모델을 포함할 수 있다. 상술한 바와 같이, 제 1 연산에 포함되는 서브 모델은 차원을 변경하는 적어도 하나의 레이어를 포함할 수 있다. 따라서, 본 개시의 서브 모델은 입력 데이터의 차원의 변경하고 다시 복구하는 과정에서 오차를 발생시킬 수 있고, 이러한 오차는 학습 및 테스트에 이용될 수 있다.The first computational model 200 of the present disclosure may include a plurality of submodels connected in series. As described above, the submodel included in the first operation may include at least one layer for changing the dimension. Accordingly, the submodel of the present disclosure may generate an error in the process of changing and reconstructing the dimension of the input data, which may be used for training and testing.

제 1 연산 모델(200)에 포함되는 서브 모델은 다양한 뉴럴 네트워크를 포함할 수 있다. 예를 들어, 서브 모델은 컨벌루셔널 뉴럴 네트워크(CNN: convolutional neural network), 리커런트 뉴럴 네트워크(RNN: recurrentneural network), 오토 인코더(auto encoder), GAN(Generative Adversarial Networks), 제한 볼츠만 머신(RBM: restricted boltzmann machine), 심층 신뢰 네트워크(DBN: deep belief network), Q 네트워크, U 네트워크, 샴 네트워크 등을 포함할 수 있다. The submodel included in the first computational model 200 may include various neural networks. For example, the submodels include convolutional neural networks (CNNs), recurrent neural networks (RNNs), auto encoders, autonomous encoders (GANs), restrictive Boltzmann machines (RBMs). This may include restricted boltzmann machines, deep belief networks (DBNs), Q networks, U networks, and Siamese networks.

구체적으로 예를 들면, 서브 모델은 프로세서(110)에 의해 입력으로부터 피쳐를 추출한 후, 다시 피쳐로부터 입력과 동일한 출력을 생성하기 위한 모델일 수 있다. 일 실시예로, 서브 모델은 입력된 데이터의 차원을 축소하는 인코더 및 데이터의 차원을 다시 복구하는 디코더를 포함하는 오토 인코더(auto encoder)일 수 있다. 또한, 다른 일 실시예로, 서브 모델은 생성자(Generator) 및 구분자(discriminator)를 포함하는 GAN(Generative Adversarial Networks)일 수 있다. 이 경우에, 생성자는 오토 인코더의 인코더의 역할을 수행할 수 있고, 구분자는 오토 인코더의 디코더의 역할을 수행할 수 있다. 다만, 이에 한정되지 않고, 제 1 연산 모델(200)은 다양한 뉴럴 네트워크를 포함할 수 있다.Specifically, for example, the submodel may be a model for extracting a feature from an input by the processor 110 and then again generating an output identical to the input from the feature. In one embodiment, the submodel may be an auto encoder including an encoder for reducing the dimension of the input data and a decoder for restoring the dimension of the data again. In another embodiment, the submodel may be GAN (Generative Adversarial Networks) including a generator and a separator. In this case, the generator may play the role of the encoder of the auto encoder and the delimiter may play the role of the decoder of the auto encoder. However, the present invention is not limited thereto, and the first operation model 200 may include various neural networks.

복수의 서브 모델들은 서로 직렬로 연결될 수 있다. 여기서 직렬은 서브 모델의 출력의 적어도 일부가 다른 서브 모델의 입력이 되거나, 서브 모델의 일부가 다른 서브 모델과 중첩될 수 있음을 의미할 수 있다. 예를 들어, 도 2를 참조하면, 프로세서(110)는 제 1 서브 모델(210, 220)은 제 1 데이터(100)를 입력으로 처리하여 제 1 서브 데이터(600)를 생성할 수 있다. 제 2 서브 모델(230, 240)은 제 1 서브 모델(210, 220)의 출력인 제 1 서브 데이터(600)를 입력으로 처리하여 제 2 서브 데이터를 출력할 수 있다. 이 경우에, 제 2 서브 모델이 제 1 연산 모델 직렬 구조에서 마지막 단이므로, 제 2 서브 데이터는 제 1 연산 모델(200)의 출력인 제 2 데이터가 될 수 있다. 다만, 이에 한정되지 않고, 제 1 연산 모델(200)은 다양한 개수의 서브 모델을 포함할 수 있고, 다양한 방법으로 연결될 수 있다.The plurality of submodels may be connected in series with each other. Here, serial may mean that at least a part of the output of the submodel may be an input of another submodel, or a part of the submodel may overlap with another submodel. For example, referring to FIG. 2, the processor 110 may generate the first sub data 600 by processing the first data 100 as an input. The second sub-models 230 and 240 may output the second sub-data by processing the first sub-data 600 that is the output of the first sub-models 210 and 220 as an input. In this case, since the second sub-model is the last stage in the first computational model serial structure, the second sub-data may be the second data that is the output of the first computational model 200. However, the present disclosure is not limited thereto, and the first operation model 200 may include various numbers of submodels and may be connected in various ways.

본 개시의 서브 모델은 차원을 변경하기 위한 적어도 하나의 레이어를 포함할 수 있다. 예를 들어, 서브 모델은 감소 레이어 및 복구 레이어 중 적어도 하나를 포함할 수 있다.The submodel of the present disclosure may include at least one layer for changing the dimension. For example, the submodel may include at least one of a reduction layer and a recovery layer.

본 개시의 프로세서(110)는 감소 레이어(reduction layers)를 이용하여 입력된 데이터의 차원을 감소시킬 수 있다. 예를 들어, 도 2 를 참조하면, 프로세서(110)는 제 1 서브 모델(200)에 포함된 제 1 감소 레이어를 이용하여 제 1 데이터의 차원을 감소시킬 수 있다. 또한, 프로세서(110)는 제 2 서브 모델(300)에 포함된 제 2 감소 레이어를 이용하여 제 1 서브 데이터의 차원을 감소시킬 수 있다. 감소 레이어는 단일한 레이어로 구성될 수 있고, 또한 복수의 레이어로 구성될 수도 있다.The processor 110 of the present disclosure may reduce the dimension of the input data by using reduction layers. For example, referring to FIG. 2, the processor 110 may reduce the dimension of the first data by using the first reduction layer included in the first submodel 200. In addition, the processor 110 may reduce the dimension of the first sub data by using the second reduction layer included in the second sub model 300. The reduction layer may consist of a single layer or may consist of a plurality of layers.

본 개시의 프로세서(110)는 감소 레이어(reduction layers)를 이용하여 입력된 데이터의 감소된 차원을 복구시킬 수 있다. 예를 들어, 도 2 를 참조하면, 프로세서(110)는 제 1 서브 모델(200)에 포함된 제 1 복구 레이어를 이용하여 제 1 데이터의 감소된 차원을 복구시킬 수 있다. 또한, 프로세서(110)는 제 2 서브 모델(300)에 포함된 제 2 복구 레이어를 이용하여 제 1 서브 데이터의 차원을 감소시킬 수 있다. 복구 레이어는 단일한 레이어로 구성될 수 있고, 또한 복수의 레이어로 구성될 수도 있다.The processor 110 of the present disclosure may recover reduced dimensions of the input data using reduction layers. For example, referring to FIG. 2, the processor 110 may restore the reduced dimension of the first data using the first recovery layer included in the first submodel 200. In addition, the processor 110 may reduce the dimension of the first sub data using the second recovery layer included in the second sub model 300. The recovery layer may consist of a single layer or may consist of a plurality of layers.

본 개시의 프로세서(110)는 제 1 데이터, 제 2 데이터, 및 상기 복수의 서브 모델에 의해 생성된 복수의 서브 데이터에 적어도 부분적으로 기초한 비용함수(cost function)를 생성할 수 있다.The processor 110 of the present disclosure may generate a cost function based at least in part on the first data, the second data, and the plurality of sub data generated by the plurality of sub models.

프로세서(110)는 비용함수를 생성하여 비용함수의 값이 작아지는 방향으로 제 1 연산 모델(200)을 트레이닝시킬 수 있다. 본 명세서에 걸쳐, 비용 함수(cost function)는 손실 함수(Loss Function)은 동일한 의미로 사용될 수 있다. The processor 110 may generate a cost function to train the first computational model 200 in a direction in which the value of the cost function decreases. Throughout this specification, a cost function may be used in the same sense as a loss function.

프로세서(110)는 다양한 종류의 비용함수를 사용할 수 있다. 예를 들어, 프로세서(110)는 평균 제곱 오차(MSE: Mean Squared Error), 교차 엔트로피 오차(CEE:Cross Entropy Error) 및 IoU(Intersection over union) 중 적어도 하나를 비용함수로 사용할 수 있다. 다만, 이에 한정되지 않고, 프로세서(110)는 다양한 종류의 비용함수를 사용할 수 있다. 예를 들어, 프로세서(110)는 오류들의 평균값을 비용함수로 사용하거나, 오류의 합을 비용함수로 사용할 수도 있다.The processor 110 may use various kinds of cost functions. For example, the processor 110 may use at least one of mean squared error (MSE), cross entropy error (CEE), and intersection over union (IOU) as a cost function. However, the present invention is not limited thereto, and the processor 110 may use various types of cost functions. For example, the processor 110 may use the average value of the errors as a cost function, or use the sum of the errors as the cost function.

프로세서(110)는 다양한 데이터 간의 관계를 이용하여 비용함수를 생성할 수 있다. 여기서 데이터 간의 관계는 데이터 간의 오류(또는 유사도)를 의미할 수 있다. 예를 들면, 프로세서(110)는 복수의 데이터(예를 들어, 이미지)의 피쳐를 비교하여 오류값(또는 유사값)을 생성 있다. 여기서 프로세서(110)는 두 피쳐의 수학적 거리를 비교하는 함수를 계산할 수도 있으며, 뉴럴 네트워크로 구성되어 두 피쳐의 유사를 판단하는 구성을 이용할 수도 있다. 프로세서(110)는 임의의 데이터 비교 알고리즘으로 데이터에서 추출된 피쳐를 서로 비교하여 오류값을 생성할 수 있으며 본 개시는 비교 알고리즘에 제한되지 않는다. 프로세서(110)는 딥 뉴럴 네트워크 구조를 이용할 수 있으며, 그 중에서 디컨벌루셔널 뉴럴 네트워크 구조를 이용할 수 있다. 다만 이에 한정되지 않고, 데이터 간의 관계는 데이터 간의 차, 평균, 합 등 다양한 의미를 포함할 수 있다.The processor 110 may generate a cost function using a relationship between various data. Here, the relationship between the data may mean an error (or similarity) between the data. For example, the processor 110 may compare the features of the plurality of data (eg, images) to generate an error value (or similar value). Here, the processor 110 may calculate a function for comparing the mathematical distance between two features, or may be configured as a neural network to use a configuration for determining similarity between the two features. The processor 110 may generate an error value by comparing the features extracted from the data with any data comparison algorithm, and the present disclosure is not limited to the comparison algorithm. The processor 110 may use a deep neural network structure, and among them, may use a deconvolutional neural network structure. However, the present invention is not limited thereto, and the relationship between data may include various meanings such as a difference, an average, and a sum between data.

예를 들어, 프로세서(110)는 입력 데이터인 제 1 데이터(100)와 제 1 연산 모델(200)의 출력데이터인 제 2 데이터(400)의 관계를 이용하여 비용함수를 생성할 수 있다. 또한, 프로세서(110)는 제 1 데이터와 제 1 서브 모델(210, 220)의 출력데이터인 제 1 서브 데이터를 이용하여 비용함수를 생성할 수 있다. 다만, 이에 한정되지 않는다. For example, the processor 110 may generate a cost function using a relationship between the first data 100 that is input data and the second data 400 that is output data of the first operation model 200. In addition, the processor 110 may generate a cost function using the first data and the first sub data which are output data of the first sub models 210 and 220. However, the present invention is not limited thereto.

또한, 프로세서(110)는 비용함수들의 합을 비용함수로 사용할 수 있다. 예를 들어, 도 3을 참조하면, 프로세서(110)는 제 1 데이터와 제 2 데이터의 관계에 기초한 비용함수인 항(term)과 제 1 데이터와 제 1 서브 데이터에 기초한 비용함수인 항(term)의 합을 비용함수로 이용할 수 있다. In addition, the processor 110 may use the sum of the cost functions as the cost function. For example, referring to FIG. 3, the processor 110 may include a term that is a cost function based on the relationship between the first data and the second data, and a term that is a cost function based on the first data and the first sub data. ) Can be used as a cost function.

제 1 서브 모델에 포함되는 서브 모델의 개수가 증가될수록, 비용함수에 포함될 수 있는 항의 종류가 증가될 수 있다. 예를 들어, 제 1 연산 모델(200)이 제 1 내지 제 5 서브 모델을 포함하는 경우에, 제 1 데이터, 제 2 데이터 및 제 1 내지 제 4 서브 데이터 중 적어도 두개의 관계에 기초한 항이 비용함수에 포함될 수 있다. 다만 이에 한정되지 않는다.As the number of sub-models included in the first sub-model increases, the types of terms that may be included in the cost function may increase. For example, when the first computational model 200 includes the first to fifth submodels, the term based on at least two relationships among the first data, the second data, and the first to fourth subdata is a cost function. Can be included. However, the present invention is not limited thereto.

본 개시의 프로세서(110)는 비용함수에 포함되는 항에 가중치를 달리 설정할 수 있다. 예를 들어, 프로세서(110)는 제 1 데이터와 제 2 데이터의 관계에 기초한 항에 가중치로 2를 곱하고, 제 1 데이터와 제 1 서브 데이터에 기초한 항(term)에 가중치로 1을 곱한 비용함수를 생성할 수 있다. 이 경우에, 제 1 데이터와 제 2 데이터의 관계에 기초한 항이 비용함수의 값에 미치는 영향이 커질 수 있다. 다만, 이에 한정되지 않고, 가중치는 다양하게 설정될 수 있다. The processor 110 of the present disclosure may set different weights to terms included in the cost function. For example, the processor 110 may multiply a term based on the relationship between the first data and the second data by 2 and a cost function of multiplying the term based on the first data and the first sub data by 1 by the weight. Can be generated. In this case, the effect of the term based on the relationship between the first data and the second data on the value of the cost function can be large. However, the present invention is not limited thereto, and the weight may be variously set.

또한, 프로세서(110)는 트레이닝 시점에 따라 가중치를 가변적으로 설정할 수 있다. 예를 들어, 프로세서(110)는 모든 항의 가중치를 동일하게 설정하여 제 1 연산 모델을 트레이닝시킨 후, 제 1 데이터와 제 2 데이터의 관계에 기초한 항에 가중치로 2를 곱한 비용함수를 이용하여 제 1 연산 모델을 트레이닝시킬 수 있다. 다만, 이에 한정되지 않고, 프로세서(110)는 다양한 시점에 가중치를 다양하게 설정할 수 있다. In addition, the processor 110 may variably set a weight according to a training time point. For example, the processor 110 sets the weights of all terms equally to train the first computational model, and then uses the cost function of multiplying the term based on the relationship between the first data and the second data by the weight to 2 1 You can train a computational model. However, the present invention is not limited thereto, and the processor 110 may variously set weights at various time points.

본 개시의 프로세서(110)는 비용함수에 기초하여 복수의 서브 모델 중 적어도 하나를 트레이닝시킬 수 있다. 예를 들어, 프로세서(110)는 비용함수에 기초하여 제 1 서브 모델 및 제 2 서브 모델을 트레이닝시킬 수 있다. 또한, 프로세서(110)는 비용함수에 기초하여 제 1 서브 모델 또는 제 2 서브 모델만을 트레이닝시킬 수 있다.The processor 110 of the present disclosure may train at least one of the plurality of submodels based on the cost function. For example, the processor 110 may train the first submodel and the second submodel based on the cost function. In addition, the processor 110 may train only the first sub-model or the second sub-model based on the cost function.

또한, 프로세서(110)는 트레이닝 시점에 따라 트레이닝되는 서브모델을 가변적으로 설정할 수 있다. 예를 들어, 프로세서(110)는 제 1 서브 모델 및 제 2 서브 모델을 트레이닝 시킨 후, 제 1 서브 모델을 제외하고 제 2 서브 모델만을 추가적으로 트레이닝시킬 수 있다. 또한, 프로세서(110)는 서브 모델의 일부분만을 훈련시킬 수 있다. 예를 들어, 프로세서(110)는 제 1 서브 모델의 제 1 감소 레이어 및 제 2 서브 모델의 제 2 복구 레이어만을 트레이닝시킬 수 있다. 다만, 이에 한정되지 않고 프로세서(110)는 비용함수에 기초하여 다양한 방법으로 복수의 서브 모델을 트레이닝시킬 수 있다. In addition, the processor 110 may variably set a submodel to be trained according to a training time point. For example, the processor 110 may train the first submodel and the second submodel, and then additionally train only the second submodel except for the first submodel. In addition, the processor 110 may train only a portion of the submodel. For example, the processor 110 may train only the first reduced layer of the first submodel and the second recovery layer of the second submodel. However, the present invention is not limited thereto, and the processor 110 may train the plurality of submodels in various ways based on the cost function.

본 개시의 프로세서(110)는 제 1 연산 모델(200)의 적어도 일부를 공유하는 제 2 연산 모델(300)을 생성할 수 있다. 여기서 공유란 뉴럴 네트워크의 구조(비 제한적인 예로서, 레이어의 수, 노드의 수 등), 가중치 등을 동일하게(또는 유사하게) 사용하는 것을 의미할 수 있다. 예를 들어, 도 3을 참조하면 프로세서(110)는 제 1 연산 모델(200)의 제 1 감소 레이어(210) 및 제 2 복구 레이어(240)의 구조 및 가중치를 공유하는 제 2 연산 모델(300)을 생성할 수 있다. 이 경우에, 도 3에서 도시되는 바와 같이, 제 2 연산 모델(300)은 제 1 감소 레이어(210)를 공유하는 제 3 감소 레이어(310)의 출력이 제 2 복구 레이어(240)를 공유하는 제 3 복구 레이어(320)의 입력으로 인가되는 구조를 가질 수도 있다. 다만, 이에 한정되지 않고, 프로세서(110)는 제 1 연산 모델(200)의 다양한 부분을 공유하는 제 2 연산 모델을 생성할 수 있다.The processor 110 of the present disclosure may generate a second computational model 300 that shares at least a portion of the first computational model 200. Here, the sharing may mean using the same (or similar) structure of the neural network (a non-limiting example, the number of layers, the number of nodes, etc.), weights, and the like. For example, referring to FIG. 3, the processor 110 shares a structure and weights of the first reduction layer 210 and the second recovery layer 240 of the first calculation model 200 with the second calculation model 300. ) Can be created. In this case, as shown in FIG. 3, the second computational model 300 is characterized in that the output of the third reduction layer 310, which shares the first reduction layer 210, shares the second recovery layer 240. It may have a structure applied as an input of the third recovery layer 320. However, the present disclosure is not limited thereto, and the processor 110 may generate a second computational model that shares various parts of the first computational model 200.

본 개시의 프로세서(110)는 제 2 연산 모델을 이용하여 제 1 데이터에 기초한 제 3 데이터(500)를 생성할 수 있다. 이 경우에, 프로세서(110)는 임의의 알고리즘을 이용하여 제 1 데이터 및 상기 제 3 데이터에 기초한 테스트 결과 정보를 생성할 수 있다. 예를 들어, 프로세서(110)는 제 2 연산 모델의 입력 데이터인 제 1데이터와 출력 데이터인 제 3 데이터(500)를 비교하여 스코어링함으로써 테스트를 수행할 수 있다. 여기서 테스트 결과 정보는 데이터 분류를 위한 정보일 수 있고, 구체적으로 테스트 결과 정보는 제 1 데이터의 어노말리 여부를 판단하는 정보일 수 있다. 또한, 테스트 결과 정보는 입력데이터의 압축 효율을 평가하기 위한 정보일 수 있다. 본 개시의 정보 처리 방법은 트레이닝된 제 1 연산 모델의 적어도 일부를 공유하는 제 2 연산 모델을 사용하여 테스트 수행 속도를 향상 시킬 수도 있다. 즉, 본 개시의 정보 처리 방법은 연산 시간을 줄이기 위해 제 1 연산 모델에 비해 적은 수의 서브 모델(또는, 레이어)을 포함하는 제 2 연산 모델을 사용함으로써, 테스트의 정확도를 유지하면서 테스트 수행 속도를 향상시킬 수 있다. The processor 110 of the present disclosure may generate the third data 500 based on the first data using the second operation model. In this case, the processor 110 may generate test result information based on the first data and the third data using any algorithm. For example, the processor 110 may perform a test by comparing and scoring first data that is input data of the second operation model and third data 500 that are output data. Herein, the test result information may be information for data classification, and specifically, the test result information may be information for determining whether the first data is anomaly. In addition, the test result information may be information for evaluating the compression efficiency of the input data. The information processing method of the present disclosure may improve test execution speed by using a second computational model that shares at least a portion of the trained first computational model. That is, the information processing method of the present disclosure uses a second computational model including fewer submodels (or layers) than the first computational model to reduce computation time, thereby maintaining test accuracy while maintaining test accuracy. Can improve.

본 개시의 프로세서(110)는 제 1 연산 모델의 제 1 데이터 및 제 2 데이터에 기초하여 테스트 결과 정보를 생성할 수 있다. 자세히 설명하면, 프로세서(110)는 제 1 연산 모델의 입력 데이터인 제 1데이터와 출력 데이터인 제 2 데이터를 이용하여 테스트를 수행할 수 있다. 여기서 테스트 결과 정보는 데이터 분류를 위한 정보일 수 있고, 구체적으로 테스트 결과 정보는 제 1 데이터의 어노말리 여부를 판단하는 정보일 수 있다. 또한, 테스트 결과 정보는 입력데이터의 압축 효율을 평가하기 위한 정보일 수 있다. 다만, 이에 한정되지 않고 테스트 결과 정보는 다양한 정보를 포함할 수 있다. The processor 110 of the present disclosure may generate test result information based on the first data and the second data of the first calculation model. In detail, the processor 110 may perform a test using first data, which is input data of the first operation model, and second data, which is output data. Herein, the test result information may be information for data classification, and specifically, the test result information may be information for determining whether the first data is anomaly. In addition, the test result information may be information for evaluating the compression efficiency of the input data. However, the present invention is not limited thereto, and the test result information may include various information.

추가적인 실시예로, 본 개시의 프로세서(110)는 제 1 데이터, 제 2 데이터, 제 3 데이터 및 복수의 서브 데이터 중 적어도 두개에 기초하여 테스트 결과 정보를 생성할 수 있다. 자세히 설명하면, 본 개시의 연산 모델 및 서브 모델은 각각 입력 데이터와 동일한 출력데이터를 생성하도록 트레이닝될 수 있다. 따라서, 본 개시의 연산 모델의 출력 데이터 및 서브 모델의 출력 데이터는 테스트를 수행하는데 사용될 수 있다. 예를 들어, 프로세서(110)는 제 1 데이터 및 서브 모델의 출력 데이터(예를 들어, 제 1 서브 데이터)에 기초하여 테스트 결과를 생성할 수 있다. 다른 예로, 프로세서(110)는 적어도 두개의 서브 모델의 출력 데이터에 기초하여 테스트 결과를 생성할 수 있다. 예를 들어, 제 1 연산 모델이 3개의 서브 모델을 포함하는 경우에, 프로세서(110)는 제 1 서브 모델의 제 1 서브 데이터와 제 2 서브 모델의 제 2 서브 데이터에 기초하여 테스트 결과 정보를 생성할 수 있다.In an additional embodiment, the processor 110 of the present disclosure may generate test result information based on at least two of the first data, the second data, the third data, and the plurality of sub data. In detail, the computational model and submodel of the present disclosure may each be trained to produce output data identical to the input data. Thus, the output data of the computational model of the present disclosure and the output data of the submodel can be used to perform the test. For example, the processor 110 may generate a test result based on the first data and output data (eg, the first sub data) of the sub model. As another example, the processor 110 may generate a test result based on output data of at least two submodels. For example, when the first computational model includes three submodels, the processor 110 may generate test result information based on the first sub-data of the first sub-model and the second sub-data of the second sub-model. Can be generated.

다른 추가적인 실시예로, 본 개시의 프로세서(110)는 제 1 데이터, 제 2 데이터, 제 3 데이터 및 복수의 서브 데이터 중 적어도 두개에 기초하여 복수의 테스트 결과 정보를 생성하여 테스트를 수행할 수 있다. 자세히 설명하면, 본 개시의 서브 모델은 독립적인 예측 모델로 동작할 수 있으므로, 서브 모델에 의해 생성된 서브 데이터는 연산 모델에 의해 생성된 데이터와 동등한 자격으로 테스트에 활용될 수 있다. 예를 들어, 프로세서(110)는 제 1 데이터와 제 1 서브 모델에 의해 생성된 제 1 서브 데이터를 비교하고, 제 1 데이터와 제 1 연산 모델에 의해 생성된 제 2 데이터를 비교하여 생성된 복수의 테스트 결과 정보를 이용하여, 제 1 데이터의 어노말리를 결정할 수 있다. 또 다른 예로, 제 1 연산 모델이 4개의 서브 모델을 포함하는 경우에, 4개의 서브 모델에 의해 생성된 4개의 서브 데이터를 제 1 데이터와 각각 비교하여 생성된 4개의 테스트 결과 정보를 이용하여, 어노말리를 결정한 테스트 결정 정보가 일정한 비율(예를 들어, 75%, 즉 4개의 테스트 결과 정보 중 3개가 어노말리를 나타내는 결과 정보인 경우 등) 이상인 경우에 제 1 데이터의 어노말리를 결정할 수도 있다. 이 경우에, 복수의 테스트 결과 정보 중에 어느 하나는 가중치를 부여받을 수 있다. 예를 들어, 제 1 데이터와 제 2 데이터를 비교하여 생성된 테스트 결과는 제 1 데이터와 서브 데이터를 비교하여 생성된 테스트 결과에 비해 2배의 가중치를 부여받을 수 있다. 본 개시에 따르면, 본 개시의 연산 모델은 연산 모델에 포함된 서브 모델을 예측 모델로 조합하여 앙상블 모델(ensemble model)을 구성하는 효과를 가질 수 있다. 다만 이에 한정되지 않고, 본 개시의 프로세서(110)는 다양한 방법으로 테스트를 수행할 수 있다. In another additional embodiment, the processor 110 of the present disclosure may perform a test by generating a plurality of test result information based on at least two of the first data, the second data, the third data, and the plurality of sub data. . In detail, since the sub-model of the present disclosure can operate as an independent prediction model, the sub-data generated by the sub-model can be utilized for testing with the same qualification as the data generated by the computational model. For example, the processor 110 compares the first data with the first sub data generated by the first sub-model and compares the first data with the second data generated by the first computational model. The anomaly of the first data may be determined using the test result information. As another example, when the first computational model includes four sub-models, by using four test result information generated by comparing the four sub-data generated by the four sub-models with the first data, respectively, Anomaly of the first data may be determined when the test decision information that determines the anomaly is equal to or greater than a predetermined ratio (for example, when 75%, that is, three of the four test result information is result information indicating anomaly, etc.). . In this case, any one of the plurality of test result information may be weighted. For example, a test result generated by comparing the first data and the second data may be weighted twice as much as a test result generated by comparing the first data and the sub data. According to the present disclosure, the computational model of the present disclosure may have an effect of constructing an ensemble model by combining sub-models included in the computational model into a prediction model. However, the present invention is not limited thereto, and the processor 110 of the present disclosure may perform a test in various ways.

프로세서(110)는 다양한 데이터 간의 관계를 이용하여 테스트 결과 정보를 생성할 수 있다. 여기서 데이터 간의 관계는 데이터 간의 오류(또는 유사도)를 의미할 수 있다. 예를 들면, 프로세서(110)는 복수의 데이터(예를 들어, 이미지)의 피쳐를 비교하여 오류값(또는 유사도값)을 생성할 수 있다. 일 실시예로, 본 개시의 프로세서(110)는 오류값(또는 유사도값)이 일정한 기준(예를 들어, 90%)을 만족하지 못하는 경우에 제 1 데이터가 어노말리라는 테스트 결과 정보를 생성할 수 있다. 여기서 프로세서(110)는 두 피쳐의 수학적 거리를 비교하는 함수를 이용할 수 있으며, 뉴럴 네트워크로 구성되어 두 피쳐의 유사를 판단하는 구성을 이용할 수도 있다. 프로세서(110)는 임의의 데이터 비교 알고리즘으로 데이터에서 추출된 피쳐를 서로 비교하여 오류값(또는 유사도값)을 생성할 수 있으며 본 개시는 비교 알고리즘에 제한되지 않는다. 프로세서(110)는 뉴럴 네트워크 구조를 이용할 수 있으며 그중 디컨벌루셔널 뉴럴 네트워크 구조를 이용할 수 있다. 다만 이에 한정되지 않는다. The processor 110 may generate test result information by using a relationship between various data. Here, the relationship between the data may mean an error (or similarity) between the data. For example, the processor 110 may generate an error value (or similarity value) by comparing features of a plurality of data (eg, images). In one embodiment, the processor 110 of the present disclosure may generate test result information indicating that the first data is anomalous when an error value (or similarity value) does not satisfy a predetermined criterion (eg, 90%). Can be. In this case, the processor 110 may use a function for comparing the mathematical distance between two features, and may be configured as a neural network to determine the similarity of the two features. The processor 110 may generate an error value (or similarity value) by comparing the features extracted from the data with each other using an arbitrary data comparison algorithm, and the present disclosure is not limited to the comparison algorithm. The processor 110 may use a neural network structure, of which the deconvolutional neural network structure may be used. However, the present invention is not limited thereto.

도 4는 본 개시의 일 실시예에 따라 제 1 연결 모델을 트레이닝을 수행하기 위한 정보 처리 방법의 단계를 설명하기 위한 순서도이다.FIG. 4 is a flowchart for describing steps of an information processing method for training a first connection model according to an embodiment of the present disclosure.

본 개시의 일 실시예에 따라 제 1 연결 모델을 트레이닝를 하기 위한 정보 처리 방법은 다음과 같은 단계에 의해 구현될 수 있다.According to an embodiment of the present disclosure, an information processing method for training a first connection model may be implemented by the following steps.

본 개시의 일 실시예에 따라 하나 이상의 프로세서에서 수행되는 정보 처리 방법에 있어서, 상기 정보 처리 방법은: 프로세서에 의해, 제 1 데이터를 수신하는 단계(s410); 프로세서에 의해, 직렬로 연결된 복수의 서브 모델을 포함하는 제 1 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 2 데이터를 생성하는 단계(s420)-상기 서브 모델은 차원을 변경하기 위한 적어도 하나의 레이어를 포함함-; 프로세서에 의해, 상기 제 1 데이터, 상기 제 2 데이터, 및 상기 복수의 서브 모델에 의해 생성된 복수의 서브 데이터 중 적어도 둘에 기초한 비용함수(cost function)를 생성하는 단계(s430); 프로세서에 의해, 상기 비용함수에 기초하여 상기 복수의 서브 모델 중 적어도 하나를 트레이닝시키는 단계(s440); 를 포함할 수 있다.An information processing method performed in one or more processors according to an embodiment of the present disclosure, the information processing method comprising: receiving, by a processor, first data (s410); Generating, by a processor, second data based on the first data using a first computational model comprising a plurality of submodels connected in series (s420), wherein the submodel is at least one for changing dimensions Including a layer; Generating (s430), by a processor, a cost function based on at least two of the first data, the second data, and the plurality of sub-data generated by the plurality of sub-models; Training, by a processor, at least one of the plurality of submodels based on the cost function (s440); It may include.

대안적인 실시예에서, 상기 제 2 데이터를 생성하는 단계(s420)는: 프로세서에 의해, 제 1 서브 모델에 포함되는 제 1 감소 레이어를 이용하여 상기 제 1 데이터의 차원을 감소시키고, 상기 제 1 서브 모델에 포함되는 제 1 복구 레이어를 이용하여 상기 제 1 데이터의 감소된 차원을 복구함으로써 제 1 서브 데이터를 생성하는 단계; 및 프로세서에 의해, 제 2 서브 모델에 포함되는 제 2 감소 레이어를 이용하여 상기 제 1 서브 데이터의 차원을 감소시키고, 상기 제 2 서브 모델에 포함되는 제 2 복구 레이어를 이용하여 상기 제 1 서브 데이터의 감소된 차원을 복구함으로써 상기 제 2 데이터를 생성하는 단계; 를 포함할 수 있다.In an alternative embodiment, the step of generating the second data (s420) comprises: reducing, by a processor, the dimension of the first data using a first reduction layer included in a first submodel, Generating first sub data by recovering the reduced dimension of the first data using the first recovery layer included in the sub model; And reduce, by a processor, the dimension of the first sub data using a second reduction layer included in a second sub-model, and use the first sub data using a second recovery layer included in the second sub-model. Generating the second data by recovering a reduced dimension of the; It may include.

대안적인 실시예에서, 상기 제 1 데이터는: 정상 데이터로 이루어진 트레이닝 데이터(training data)를 포함할 수 있다.In an alternative embodiment, the first data may comprise: training data consisting of normal data.

도 5는 본 개시의 일 실시예에 따라 제 1 연결 모델을 이용하여 테스트를 수행하기 위한 정보 처리 방법의 단계를 설명하기 위한 순서도이다.FIG. 5 is a flowchart for describing steps of an information processing method for performing a test using a first connection model according to an embodiment of the present disclosure.

본 개시의 일 실시예에 따라 제 1 연결 모델을 이용하여 테스트를 수행하기 위한 정보 처리 방법은 다음과 같은 단계에 의해 구현될 수 있다.According to an embodiment of the present disclosure, an information processing method for performing a test using a first connection model may be implemented by the following steps.

본 개시의 일 실시예에 따라, 하나 이상의 프로세서에서 수행되는 정보 처리 방법에 있어서, 상기 정보 처리 방법은: 프로세서에 의해, 제 1 데이터를 수신하는 단계(s510); 프로세서에 의해, 직렬로 연결된 복수의 서브 모델을 포함하는 제 1 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 2 데이터를 생성하는 단계(s520)-상기 서브 모델은 차원을 변경하는 적어도 하나의 레이어를 포함함-; 프로세서에 의해, 상기 제 1 데이터 및 상기 제 2 데이터에 기초하여 테스트 결과 정보를 생성하는 단계(s530); 를 포함할 수 있다.According to one embodiment of the present disclosure, an information processing method performed in one or more processors, the information processing method comprising: receiving, by a processor, first data (s510); Generating, by the processor, second data based on the first data using a first operation model including a plurality of sub-models connected in series (s520), wherein the sub-model has at least one layer for changing dimensions Comprising; Generating, by a processor, test result information based on the first data and the second data (s530); It may include.

도 6은 본 개시의 일 실시예에 따라 제 2 연결 모델을 이용하여 테스트를 수행하기 위한 정보 처리 방법의 단계를 설명하기 위한 순서도이다.6 is a flowchart for explaining steps of an information processing method for performing a test using a second connection model according to an embodiment of the present disclosure.

본 개시의 일 실시예에 따라 제 2 연결 모델을 이용하여 테스트를 수행하기 위한 정보 처리 방법은 다음과 같은 단계에 의해 구현될 수 있다.According to an embodiment of the present disclosure, an information processing method for performing a test using a second connection model may be implemented by the following steps.

하나 이상의 프로세서에서 수행되는 정보 처리 방법에 있어서, 상기 정보처리 방법은: 프로세서에 의해, 제 1 데이터를 수신하는 단계(s610); 프로세서에 의해, 제 1 연산 모델의 적어도 일부를 공유하는 제 2 연산 모델을 이용하여 상기 제 1 데이터에 기초한 제 3 데이터를 생성하는 단계(s620); 프로세서에 의해, 상기 제 1 데이터 및 상기 제 3 데이터에 기초하여 테스트 결과 정보를 생성하는 단계(s630); 를 포함할 수 있다.An information processing method performed in one or more processors, the information processing method comprising: receiving, by a processor, first data (s610); Generating, by a processor, third data based on the first data using a second computational model that shares at least a portion of the first computational model (s620); Generating, by a processor, test result information based on the first data and the third data (s630); It may include.

도 7은 본 개시의 일 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도를 도시한다.7 shows a brief, general schematic diagram of an example computing environment in which embodiments of the present disclosure may be implemented.

본 개시가 일반적으로 하나 이상의 컴퓨터 상에서 실행될 수 있는 컴퓨터 실행가능 명령어와 관련하여 전술되었지만, 당업자라면 본 개시가 기타 프로그램 모듈들과 결합되어 및/또는 하드웨어와 소프트웨어의 조합으로서 구현될 수 있다는 것을 잘 알 것이다.Although the present disclosure has been described above generally with respect to computer executable instructions that may be executed on one or more computers, those skilled in the art will appreciate that the present disclosure may be implemented in combination with other program modules and / or as a combination of hardware and software. will be.

일반적으로, 프로그램 모듈은 특정의 태스크를 수행하거나 특정의 추상 데이터 유형을 구현하는 루틴, 프로그램, 컴포넌트, 데이터 구조, 기타 등등을 포함한다. 또한, 당업자라면 본 개시의 방법이 단일-프로세서 또는 멀티프로세서 컴퓨터 시스템, 미니컴퓨터, 메인프레임 컴퓨터는 물론 퍼스널 컴퓨터, 핸드헬드 컴퓨팅 장치, 마이크로프로세서-기반 또는 프로그램가능 가전 제품, 기타 등등(이들 각각은 하나 이상의 연관된 장치와 연결되어 동작할 수 있음)을 비롯한 다른 컴퓨터 시스템 구성으로 실시될 수 있다는 것을 잘 알 것이다.Generally, program modules include routines, programs, components, data structures, etc. that perform particular tasks or implement particular abstract data types. In addition, those skilled in the art will appreciate that the methods of the present disclosure may include uniprocessor or multiprocessor computer systems, minicomputers, mainframe computers as well as personal computers, handheld computing devices, microprocessor-based or programmable consumer electronics, And other computer system configurations, including one or more associated devices, which may operate in conjunction with one or more associated devices.

본 개시의 설명된 실시예들은 또한 어떤 태스크들이 통신 네트워크를 통해 연결되어 있는 원격 처리 장치들에 의해 수행되는 분산 컴퓨팅 환경에서 실시될 수 있다. 분산 컴퓨팅 환경에서, 프로그램 모듈은 로컬 및 원격 메모리 저장 장치 둘다에 위치할 수 있다.The described embodiments of the present disclosure can also be practiced in distributed computing environments where certain tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.

컴퓨터는 통상적으로 다양한 컴퓨터 판독가능 매체를 포함한다. 컴퓨터에 의해 액세스 가능한 매체는 그 어떤 것이든지 컴퓨터 판독가능 매체가 될 수 있고, 이러한 컴퓨터 판독가능 매체는 휘발성 및 비휘발성 매체, 일시적(transitory) 및 비일시적(non-transitory) 매체, 이동식 및 비-이동식 매체를 포함한다. 제한이 아닌 예로서, 컴퓨터 판독가능 매체는 컴퓨터 판독가능 저장 매체 및 컴퓨터 판독가능 전송 매체를 포함할 수 있다. 컴퓨터 판독가능 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보를 저장하는 임의의 방법 또는 기술로 구현되는 휘발성 및 비휘발성 매체, 일시적 및 비-일시적 매체, 이동식 및 비이동식 매체를 포함한다. 컴퓨터 저장 매체는 RAM, ROM, EEPROM, 플래시 메모리 또는 기타 메모리 기술, CD-ROM, DVD(digital video disk) 또는 기타 광 디스크 저장 장치, 자기 카세트, 자기 테이프, 자기 디스크 저장 장치 또는 기타 자기 저장 장치, 또는 컴퓨터에 의해 액세스될 수 있고 원하는 정보를 저장하는 데 사용될 수 있는 임의의 기타 매체를 포함하지만, 이에 한정되지 않는다.Computers typically include a variety of computer readable media. Any medium that can be accessed by a computer can be a computer readable medium, which can be volatile and nonvolatile media, transitory and non-transitory media, removable and non-transitory media. Removable media. By way of example, and not limitation, computer readable media may comprise computer readable storage media and computer readable transmission media. Computer-readable storage media are volatile and nonvolatile media, temporary and non-transitory media, removable and non-removable implemented in any method or technology for storing information such as computer readable instructions, data structures, program modules or other data. Media. Computer storage media may include RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROMs, digital video disks or other optical disk storage devices, magnetic cassettes, magnetic tapes, magnetic disk storage devices or other magnetic storage devices, Or any other medium that can be accessed by a computer and used to store desired information.

컴퓨터 판독가능 전송 매체는 통상적으로 반송파(carrier wave) 또는 기타 전송 메커니즘(transport mechanism)과 같은 피변조 데이터 신호(modulated data signal)에 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터등을 구현하고 모든 정보 전달 매체를 포함한다. 피변조 데이터 신호라는 용어는 신호 내에 정보를 인코딩하도록 그 신호의 특성들 중 하나 이상을 설정 또는 변경시킨 신호를 의미한다. 제한이 아닌 예로서, 컴퓨터 판독가능 전송 매체는 유선 네트워크 또는 직접 배선 접속(direct-wired connection)과 같은 유선 매체, 그리고 음향, RF, 적외선, 기타 무선 매체와 같은 무선 매체를 포함한다. 상술된 매체들 중 임의의 것의 조합도 역시 컴퓨터 판독가능 전송 매체의 범위 안에 포함되는 것으로 한다.Computer-readable transmission media typically embody computer readable instructions, data structures, program modules or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and the like. Includes all information delivery media. The term modulated data signal means a signal that has one or more of its characteristics set or changed to encode information in the signal. By way of example, and not limitation, computer readable transmission media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared, or other wireless media. Combinations of any of the above should also be included within the scope of computer readable transmission media.

컴퓨터(1002)를 포함하는 본 개시의 여러가지 측면들을 구현하는 예시적인 환경(1000)이 나타내어져 있으며, 컴퓨터(1002)는 처리 장치(1004), 시스템 메모리(1006) 및 시스템 버스(1008)를 포함한다. 시스템 버스(1008)는 시스템 메모리(1006)(이에 한정되지 않음)를 비롯한 시스템 컴포넌트들을 처리 장치(1004)에 연결시킨다. 처리 장치(1004)는 다양한 상용 프로세서들 중 임의의 프로세서일 수 있다. 듀얼 프로세서 및 기타 멀티프로세서 아키텍처도 역시 처리 장치(1004)로서 이용될 수 있다.An example environment 1000 is illustrated that implements various aspects of the present disclosure, including a computer 1002, which includes a processing device 1004, a system memory 1006, and a system bus 1008. do. System bus 1008 couples system components, including but not limited to system memory 1006, to processing device 1004. Processing apparatus 1004 may be any of a variety of commercial processors. Dual processor and other multiprocessor architectures may also be used as the processing unit 1004.

시스템 버스(1008)는 메모리 버스, 주변장치 버스, 및 다양한 상용 버스 아키텍처 중 임의의 것을 사용하는 로컬 버스에 추가적으로 상호 연결될 수 있는 몇가지 유형의 버스 구조 중 임의의 것일 수 있다. 시스템 메모리(1006)는 판독 전용 메모리(ROM)(1010) 및 랜덤 액세스 메모리(RAM)(1012)를 포함한다. 기본 입/출력 시스템(BIOS)은 ROM, EPROM, EEPROM 등의 비휘발성 메모리(1010)에 저장되며, 이 BIOS는 시동 중과 같은 때에 컴퓨터(1002) 내의 구성요소들 간에 정보를 전송하는 일을 돕는 기본적인 루틴을 포함한다. RAM(1012)은 또한 데이터를 캐싱하기 위한 정적 RAM 등의 고속 RAM을 포함할 수 있다.System bus 1008 may be any of several types of bus structures that may be further interconnected to a memory bus, a peripheral bus, and a local bus using any of a variety of commercial bus architectures. System memory 1006 includes read only memory (ROM) 1010 and random access memory (RAM) 1012. Basic Input / Output System (BIOS) is stored in non-volatile memory 1010, such as ROM, EPROM, EEPROM, etc., and this BIOS is a basic aid to transfer information between components within computer 1002, such as during startup. Contains routines. RAM 1012 may also include high speed RAM, such as static RAM for caching data.

컴퓨터(1002)는 또한 내장형 하드 디스크 드라이브(HDD)(1014)(예를 들어, EIDE, SATA)이 내장형 하드 디스크 드라이브(1014)는 또한 적당한 섀시(도시 생략) 내에서 외장형 용도로 구성될 수 있음), 자기 플로피 디스크 드라이브(FDD)(1016)(예를 들어, 이동식 디스켓(1018)으로부터 판독을 하거나 그에 기록을 하기 위한 것임), 및 광 디스크 드라이브(1020)(예를 들어, CD-ROM 디스크(1022)를 판독하거나 DVD 등의 기타 고용량 광 매체로부터 판독을 하거나 그에 기록을 하기 위한 것임)를 포함한다. 하드 디스크 드라이브(1014), 자기 디스크 드라이브(1016) 및 광 디스크 드라이브(1020)는 각각 하드 디스크 드라이브 인터페이스(1024), 자기 디스크 드라이브 인터페이스(1026) 및 광 드라이브 인터페이스(1028)에 의해 시스템 버스(1008)에 연결될 수 있다. 외장형 드라이브 구현을 위한 인터페이스(1024)는 USB(Universal Serial Bus) 및 IEEE 1394 인터페이스 기술 중 적어도 하나 또는 그 둘 다를 포함한다.Computer 1002 also includes an internal hard disk drive (HDD) 1014 (e.g., EIDE, SATA) and the internal hard disk drive 1014 may also be configured for external use within a suitable chassis (not shown). ), A magnetic floppy disk drive (FDD) 1016 (eg, for reading from or writing to a removable diskette 1018), and an optical disk drive 1020 (eg, a CD-ROM disk 1022 for reading from or reading from or writing to other high capacity optical media such as DVD). The hard disk drive 1014, the magnetic disk drive 1016, and the optical disk drive 1020 are connected to the system bus 1008 by the hard disk drive interface 1024, the magnetic disk drive interface 1026, and the optical drive interface 1028, respectively. ) Can be connected. The interface 1024 for external drive implementation includes at least one or both of Universal Serial Bus (USB) and IEEE 1394 interface technologies.

이들 드라이브 및 그와 연관된 컴퓨터 판독가능 매체는 데이터, 데이터 구조, 컴퓨터 실행가능 명령어, 기타 등등의 비휘발성 저장을 제공한다. 컴퓨터(1002)의 경우, 드라이브 및 매체는 임의의 데이터를 적당한 디지털 형식으로 저장하는 것에 대응한다. 상기에서의 컴퓨터 판독가능 매체에 대한 설명이 HDD, 이동식 자기 디스크, 및 CD 또는 DVD 등의 이동식 광 매체를 언급하고 있지만, 당업자라면 집 드라이브(zip drive), 자기 카세트, 플래쉬 메모리 카드, 카트리지, 기타 등등의 컴퓨터에 의해 판독가능한 다른 유형의 매체도 역시 예시적인 운영 환경에서 사용될 수 있으며 또 임의의 이러한 매체가 본 개시의 방법들을 수행하기 위한 컴퓨터 실행가능 명령어를 포함할 수 있다는 것을 잘 알 것이다.These drives and their associated computer readable media provide nonvolatile storage of data, data structures, computer executable instructions, and the like. In the case of computer 1002, drives and media correspond to storing any data in a suitable digital format. Although the above description of computer readable media refers to HDDs, removable magnetic disks, and removable optical media such as CDs or DVDs, those skilled in the art will appreciate zip drives, magnetic cassettes, flash memory cards, cartridges, and the like. Other types of media readable by the computer, etc. may also be used in the exemplary operating environment and it will be appreciated that any such media may include computer executable instructions for performing the methods of the present disclosure.

운영 체제(1030), 하나 이상의 애플리케이션 프로그램(1032), 기타 프로그램 모듈(1034) 및 프로그램 데이터(1036)를 비롯한 다수의 프로그램 모듈이 드라이브 및 RAM(1012)에 저장될 수 있다. 운영 체제, 애플리케이션, 모듈 및/또는 데이터의 전부 또는 그 일부분이 또한 RAM(1012)에 캐싱될 수 있다. 본 개시가 여러가지 상업적으로 이용 가능한 운영 체제 또는 운영 체제들의 조합에서 구현될 수 있다는 것을 잘 알 것이다.Multiple program modules may be stored in the drive and RAM 1012, including operating system 1030, one or more application programs 1032, other program modules 1034, and program data 1036. All or a portion of the operating system, applications, modules and / or data may also be cached in RAM 1012. It will be appreciated that the present disclosure may be implemented in various commercially available operating systems or combinations of operating systems.

사용자는 하나 이상의 유선/무선 입력 장치, 예를 들어, 키보드(1038) 및 마우스(1040) 등의 포인팅 장치를 통해 컴퓨터(1002)에 명령 및 정보를 입력할 수 있다. 기타 입력 장치(도시 생략)로는 마이크, IR 리모콘, 조이스틱, 게임 패드, 스타일러스 펜, 터치 스크린, 기타 등등이 있을 수 있다. 이들 및 기타 입력 장치가 종종 시스템 버스(1008)에 연결되어 있는 입력 장치 인터페이스(1042)를 통해 처리 장치(1004)에 연결되지만, 병렬 포트, IEEE 1394 직렬 포트, 게임 포트, USB 포트, IR 인터페이스, 기타 등등의 기타 인터페이스에 의해 연결될 수 있다.A user may enter commands and information into the computer 1002 via one or more wired / wireless input devices, such as a keyboard 1038 and a mouse 1040. Other input devices (not shown) may include a microphone, IR remote control, joystick, game pad, stylus pen, touch screen, and the like. These and other input devices are often connected to the processing unit 1004 through an input device interface 1042 that is connected to the system bus 1008, but the parallel port, IEEE 1394 serial port, game port, USB port, IR interface, Etc. can be connected by other interfaces.

모니터(1044) 또는 다른 유형의 디스플레이 장치도 역시 비디오 어댑터(1046) 등의 인터페이스를 통해 시스템 버스(1008)에 연결된다. 모니터(1044)에 부가하여, 컴퓨터는 일반적으로 스피커, 프린터, 기타 등등의 기타 주변 출력 장치(도시 생략)를 포함한다.A monitor 1044 or other type of display device is also connected to the system bus 1008 via an interface such as a video adapter 1046. In addition to the monitor 1044, the computer generally includes other peripheral output devices (not shown) such as speakers, printers, and the like.

컴퓨터(1002)는 유선 및/또는 무선 통신을 통한 원격 컴퓨터(들)(1048) 등의 하나 이상의 원격 컴퓨터로의 논리적 연결을 사용하여 네트워크화된 환경에서 동작할 수 있다. 원격 컴퓨터(들)(1048)는 워크스테이션, 컴퓨팅 디바이스 컴퓨터, 라우터, 퍼스널 컴퓨터, 휴대용 컴퓨터, 마이크로프로세서-기반 오락 기기, 피어 장치 또는 기타 통상의 네트워크 노드일 수 있으며, 일반적으로 컴퓨터(1002)에 대해 기술된 구성요소들 중 다수 또는 그 전부를 포함하지만, 간략함을 위해, 메모리 저장 장치(1050)만이 도시되어 있다. 도시되어 있는 논리적 연결은 근거리 통신망(LAN)(1052) 및/또는 더 큰 네트워크, 예를 들어, 원거리 통신망(WAN)(1054)에의 유선/무선 연결을 포함한다. 이러한 LAN 및 WAN 네트워킹 환경은 사무실 및 회사에서 일반적인 것이며, 인트라넷 등의 전사적 컴퓨터 네트워크(enterprise-wide computer network)를 용이하게 해주며, 이들 모두는 전세계 컴퓨터 네트워크, 예를 들어, 인터넷에 연결될 수 있다.Computer 1002 may operate in a networked environment using logical connections to one or more remote computers, such as remote computer (s) 1048, via wired and / or wireless communications. Remote computer (s) 1048 may be a workstation, computing device computer, router, personal computer, portable computer, microprocessor-based entertainment device, peer device, or other conventional network node, and typically Although many or all of the components described above are included, for simplicity, only memory storage 1050 is shown. The logical connections shown include wired / wireless connections to a local area network (LAN) 1052 and / or a larger network, such as a wide area network (WAN) 1054. Such LAN and WAN networking environments are commonplace in offices and businesses, facilitating enterprise-wide computer networks such as intranets, all of which may be connected to worldwide computer networks, such as the Internet.

LAN 네트워킹 환경에서 사용될 때, 컴퓨터(1002)는 유선 및/또는 무선 통신 네트워크 인터페이스 또는 어댑터(1056)를 통해 로컬 네트워크(1052)에 연결된다. 어댑터(1056)는 LAN(1052)에의 유선 또는 무선 통신을 용이하게 해줄 수 있으며, 이 LAN(1052)은 또한 무선 어댑터(1056)와 통신하기 위해 그에 설치되어 있는 무선 액세스 포인트를 포함하고 있다. WAN 네트워킹 환경에서 사용될 때, 컴퓨터(1002)는 모뎀(1058)을 포함할 수 있거나, WAN(1054) 상의 통신 컴퓨팅 디바이스에 연결되거나, 또는 인터넷을 통하는 등, WAN(1054)을 통해 통신을 설정하는 기타 수단을 갖는다. 내장형 또는 외장형 및 유선 또는 무선 장치일 수 있는 모뎀(1058)은 직렬 포트 인터페이스(1042)를 통해 시스템 버스(1008)에 연결된다. 네트워크화된 환경에서, 컴퓨터(1002)에 대해 설명된 프로그램 모듈들 또는 그의 일부분이 원격 메모리/저장 장치(1050)에 저장될 수 있다. 도시된 네트워크 연결이 예시적인 것이며 컴퓨터들 사이에 통신 링크를 설정하는 기타 수단이 사용될 수 있다는 것을 잘 알 것이다.When used in a LAN networking environment, the computer 1002 is connected to the local network 1052 via a wired and / or wireless communication network interface or adapter 1056. The adapter 1056 may facilitate wired or wireless communication to the LAN 1052, which also includes a wireless access point installed therein for communicating with the wireless adapter 1056. When used in a WAN networking environment, the computer 1002 may include a modem 1058, connect to a communications computing device on the WAN 1054, or establish communications over the WAN 1054, such as via the Internet. Other means. The modem 1058, which may be an internal or external and wired or wireless device, is connected to the system bus 1008 through the serial port interface 1042. In a networked environment, program modules or portions thereof described with respect to computer 1002 may be stored in remote memory / storage device 1050. It will be appreciated that the network connections shown are exemplary and other means of establishing a communications link between the computers can be used.

컴퓨터(1002)는 무선 통신으로 배치되어 동작하는 임의의 무선 장치 또는 개체, 예를 들어, 프린터, 스캐너, 데스크톱 및/또는 휴대용 컴퓨터, PDA(portable data assistant), 통신 위성, 무선 검출가능 태그와 연관된 임의의 장비 또는 장소, 및 전화와 통신을 하는 동작을 한다. 이것은 적어도 Wi-Fi 및 블루투스 무선 기술을 포함한다. 따라서, 통신은 종래의 네트워크에서와 같이 미리 정의된 구조이거나 단순하게 적어도 2개의 장치 사이의 애드혹 통신(ad hoc communication)일 수 있다.Computer 1002 is associated with any wireless device or entity disposed and operating in wireless communication, such as a printer, scanner, desktop and / or portable computer, portable data assistant, communications satellite, wireless detectable tag. Communicate with any equipment or location and telephone. This includes at least Wi-Fi and Bluetooth wireless technology. Thus, the communication can be a predefined structure as in a conventional network or simply an ad hoc communication between at least two devices.

Wi-Fi(Wireless Fidelity)는 유선 없이도 인터넷 등으로의 연결을 가능하게 해준다. Wi-Fi는 이러한 장치, 예를 들어, 컴퓨터가 실내에서 및 실외에서, 즉 기지국의 통화권 내의 아무 곳에서나 데이터를 전송 및 수신할 수 있게 해주는 셀 전화와 같은 무선 기술이다. Wi-Fi 네트워크는 안전하고 신뢰성 있으며 고속인 무선 연결을 제공하기 위해 IEEE 802.11(a,b,g, 기타)이라고 하는 무선 기술을 사용한다. 컴퓨터를 서로에, 인터넷에 및 유선 네트워크(IEEE 802.3 또는 이더넷을 사용함)에 연결시키기 위해 Wi-Fi가 사용될 수 있다. Wi-Fi 네트워크는 비인가 2.4 및 5 GHz 무선 대역에서, 예를 들어, 11Mbps(802.11a) 또는 54 Mbps(802.11b) 데이터 레이트로 동작하거나, 양 대역(듀얼 대역)을 포함하는 제품에서 동작할 수 있다.Wireless Fidelity (Wi-Fi) allows you to connect to the Internet without wires. Wi-Fi is a wireless technology such as a cell phone that allows such a device, for example, a computer, to transmit and receive data indoors and outdoors, i. Wi-Fi networks use a wireless technology called IEEE 802.11 (a, b, g, etc.) to provide secure, reliable, high-speed wireless connections. Wi-Fi may be used to connect computers to each other, to the Internet, and to a wired network (using IEEE 802.3 or Ethernet). Wi-Fi networks can operate in unlicensed 2.4 and 5 GHz wireless bands, for example, at 11 Mbps (802.11a) or 54 Mbps (802.11b) data rates, or in products that include both bands (dual band). have.

본 개시의 기술 분야에서 통상의 지식을 가진 자는 정보 및 신호들이 임의의 다양한 상이한 기술들 및 기법들을 이용하여 표현될 수 있다는 것을 이해할 것이다. 예를 들어, 위의 설명에서 참조될 수 있는 데이터, 지시들, 명령들, 정보, 신호들, 비트들, 심볼들 및 칩들은 전압들, 전류들, 전자기파들, 자기장들 또는 입자들, 광학장들 또는 입자들, 또는 이들의 임의의 결합에 의해 표현될 수 있다.Those skilled in the art will appreciate that information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, instructions, information, signals, bits, symbols, and chips that may be referenced in the above description may include voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields. Or particles, or any combination thereof.

본 개시의 기술 분야에서 통상의 지식을 가진 자는 여기에 개시된 실시예들과 관련하여 설명된 다양한 예시적인 논리 블록들, 모듈들, 프로세서들, 수단들, 회로들 및 알고리즘 단계들이 전자 하드웨어, (편의를 위해, 여기에서 "소프트웨어"로 지칭되는) 다양한 형태들의 프로그램 또는 설계 코드 또는 이들 모두의 결합에 의해 구현될 수 있다는 것을 이해할 것이다. 하드웨어 및 소프트웨어의 이러한 상호 호환성을 명확하게 설명하기 위해, 다양한 예시적인 컴포넌트들, 블록들, 모듈들, 회로들 및 단계들이 이들의 기능과 관련하여 위에서 일반적으로 설명되었다. 이러한 기능이 하드웨어 또는 소프트웨어로서 구현되는지 여부는 특정한 애플리케이션 및 전체 시스템에 대하여 부과되는 설계 제약들에 따라 좌우된다. 본 개시의 기술 분야에서 통상의 지식을 가진 자는 각각의 특정한 애플리케이션에 대하여 다양한 방식들로 설명된 기능을 구현할 수 있으나, 이러한 구현 결정들은 본 개시의 범위를 벗어나는 것으로 해석되어서는 안 될 것이다.One of ordinary skill in the art of the disclosure will appreciate that the various illustrative logical blocks, modules, processors, means, circuits and algorithm steps described in connection with the embodiments disclosed herein may be implemented in electronic hardware, It will be appreciated that for purposes of the present invention, various forms of program or design code, or combinations thereof, may be implemented. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends on the particular application and design constraints imposed on the overall system. One skilled in the art of the present disclosure may implement the described functionality in various ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.

여기서 제시된 다양한 실시예들은 방법, 장치, 또는 표준 프로그래밍 및/또는 엔지니어링 기술을 사용한 제조 물품(article)으로 구현될 수 있다. 용어 "제조 물품"은 임의의 컴퓨터-판독가능 장치로부터 액세스 가능한 컴퓨터 프로그램, 캐리어, 또는 매체(media)를 포함한다. 예를 들어, 컴퓨터-판독가능 매체는 자기 저장 장치(예를 들면, 하드 디스크, 플로피 디스크, 자기 스트립, 등), 광학 디스크(예를 들면, CD, DVD, 등), 스마트 카드, 및 플래쉬 메모리 장치(예를 들면, EEPROM, 카드, 스틱, 키 드라이브, 등)를 포함하지만, 이들로 제한되는 것은 아니다. 또한, 여기서 제시되는 다양한 저장 매체는 정보를 저장하기 위한 하나 이상의 장치 및/또는 다른 기계-판독가능한 매체를 포함한다. The various embodiments presented herein may be embodied in a method, apparatus, or article of manufacture using standard programming and / or engineering techniques. The term "article of manufacture" includes a computer program, carrier, or media accessible from any computer-readable device. For example, computer-readable media may include magnetic storage devices (eg, hard disks, floppy disks, magnetic strips, etc.), optical discs (eg, CDs, DVDs, etc.), smart cards, and flash memory. Devices, such as, but not limited to, EEPROM, cards, sticks, key drives, and the like. In addition, various storage media presented herein include one or more devices and / or other machine-readable media for storing information.

제시된 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조는 예시적인 접근들의 일례임을 이해하도록 한다. 설계 우선순위들에 기반하여, 본 개시의 범위 내에서 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조가 재배열될 수 있다는 것을 이해하도록 한다. 첨부된 방법 청구항들은 샘플 순서로 다양한 단계들의 엘리먼트들을 제공하지만 제시된 특정한 순서 또는 계층 구조에 한정되는 것을 의미하지는 않는다.It is to be understood that the specific order or hierarchy of steps in the processes presented is an example of exemplary approaches. Based upon design priorities, it is understood that the specific order or hierarchy of steps in the processes may be rearranged within the scope of the present disclosure. The accompanying method claims present elements of the various steps in a sample order, but are not meant to be limited to the specific order or hierarchy presented.

제시된 실시예들에 대한 설명은 임의의 본 개시의 기술 분야에서 통상의 지식을 가진 자가 본 개시를 이용하거나 또는 실시할 수 있도록 제공된다. 이러한 실시예들에 대한 다양한 변형들은 본 개시의 기술 분야에서 통상의 지식을 가진 자에게 명백할 것이며, 여기에 정의된 일반적인 원리들은 본 개시의 범위를 벗어남이 없이 다른 실시예들에 적용될 수 있다. 그리하여, 본 개시는 여기에 제시된 실시예들로 한정되는 것이 아니라, 여기에 제시된 원리들 및 신규한 특징들과 일관되는 최광의의 범위에서 해석되어야 할 것이다.The description of the presented embodiments is provided to enable any person skilled in the art to make or use the present disclosure. Various modifications to these embodiments will be apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the scope of the present disclosure. Thus, the present disclosure should not be limited to the embodiments presented herein but should be construed in the broadest scope consistent with the principles and novel features presented herein.

Claims

In the information processing method performed in one or more processors,
Receiving, by the processor, the first data;
Generating, by a processor, second data based on the first data using a first computational model comprising a plurality of submodels connected in series, the submodel including at least one layer for changing dimensions box-;
Generating, by a processor, a cost function based on at least two of the first data, the second data, and a plurality of sub-data generated by the plurality of sub-models;
Including,
Information processing method.

The method of claim 1,
Generating the second data includes:
Reduce, by a processor, the dimension of the first data using a first reduction layer included in a first submodel, and reduce the first data using a first recovery layer included in the first submodel Generating first sub data by recovering the dimension;
Reduce, by a processor, the dimension of the first sub data by using a second reduction layer included in a second sub-model, and by using a second recovery layer included in the second sub model by Generating the second data by recovering the reduced dimension;
Including,
Information processing method.

The method of claim 1,
Training, by a processor, at least one of the plurality of submodels based on the cost function;
Including,
Information processing method

The method of claim 1,
The cost function is:
A term based on a relationship of at least two of the first data, the second data, and the plurality of sub data;
Information processing method

The method of claim 2,
The cost function is:
A term based on the relationship between the first data and the second data, and a term based on the relationship between the first data and the first sub data,
Information processing method.

The method of claim 1,
The first data is:
Including training data consisting of normal data,
Information processing method

The method of claim 1,
The first data is:
Containing image data,
Information processing method

The method of claim 1,
At least one of the plurality of submodels includes neural networks,
Information processing method.

The method of claim 8,
The neural network is:
Including auto-encoder or GAN (Generative Adversarial Networks),
Information processing method.

The method of claim 1,
Generating, by a processor, a second computational model that shares at least a portion of the first computational model;
Further comprising,
Information processing method.

The method of claim 10,
The second computational model is:
Sharing at least a portion of the first reduction layer and the second recovery layer,
Information processing method.

In the information processing method performed in one or more processors,
Receiving, by the processor, the first data;
Generating, by a processor, second data based on the first data using a first computational model comprising a plurality of submodels connected in series, wherein the submodel includes at least one layer that changes dimensions -;
Generating, by a processor, test result information based on the first data and the second data;
Information processing method.

In the information processing method performed in one or more processors
Receiving, by the processor, the first data;
Generating, by a processor, third data based on the first data using a second computational model that shares at least a portion of the first computational model;
Generating, by a processor, test result information based on the first data and the third data;
Including,
Information processing method.

As a computing device,
One or more processors; And
A memory that stores instructions executable by the one or more processors;
Including,
The one or more processors,
Receiving first data;
Generating second data based on the first data using a first computational model comprising a plurality of sub-models connected in series, the sub-model comprising at least one layer that changes dimensions; And
Generating a cost function based at least in part on the first data, the second data, and the plurality of sub-data generated by the plurality of sub-models;
To do,
Computing device.

As a computing device,
One or more processors; And
A memory that stores instructions executable by the one or more processors;
Including,
The one or more processors,
Receiving first data;
Generating second data based on the first data using a first computational model comprising a plurality of sub-models connected in series, the sub-model comprising at least one layer that changes dimensions; And
Generating test result information based on the first data and the second data;
To do,
Computing device.

As a computing device,
One or more processors; And
A memory that stores instructions executable by the one or more processors;
Including,
The one or more processors,
Receiving first data;
Generating third data based on the first data using a second computational model that shares at least a portion of a first computational model; And
Generating test result information based on the first data and the third data;
To do,
Computing device.