KR102348368B1

KR102348368B1 - Device, method, system and computer readable storage medium for generating training data of machine learing model and generating fake image using machine learning model

Info

Publication number: KR102348368B1
Application number: KR1020200060788A
Authority: KR
Inventors: 서덕진
Original assignee: 주식회사 누아
Priority date: 2020-05-21
Filing date: 2020-05-21
Publication date: 2022-01-11
Also published as: KR20210144049A

Abstract

머신러닝 기반의 유사 이미지 생성을 수행하는 머신러닝 모델의 학습 데이터 생성 방법이 설명된다. 예시적인 학습 데이터 생성 방법 방법은 이미지 소스를 획득하는 단계; 이미지 소스를 분석하여 이미지 소스 내의 학습 구간을 설정하는 단계; 설정된 학습 구간으로부터 레이블 이미지를 추출하는 단계; 설정된 학습 구간 및 레이블 이미지에 기초하여, 적어도 하나의 인접 이미지를 추출하는 단계; 및 레이블 이미지 및 적어도 하나의 인접 이미지에 기초하여, 머신러닝 모델에 입력하기 위한 학습 데이터를 생성하는 단계를 포함할 수 있다.A method of generating training data for a machine learning model that performs machine learning-based similar image generation is described. An exemplary method of generating training data includes: acquiring an image source; analyzing the image source to set a learning section in the image source; extracting a label image from the set learning section; extracting at least one adjacent image based on the set learning section and the label image; and based on the label image and the at least one adjacent image, generating training data for input to the machine learning model.

Description

DEVICE, METHOD, SYSTEM AND COMPUTER READABLE STORAGE MEDIUM FOR GENERATING TRAINING DATA OF MACHINE LEARING MODEL AND GENERATING FAKE IMAGE USING MACHINE LEARNING MODEL}

본 개시는 머신러닝 모델의 학습 데이터 생성과 머신러닝 모델을 이용한 유사 이미지 생성을 위한 장치, 방법, 시스템 및 컴퓨터 판독가능 저장 매체에 관한 것이다.The present disclosure relates to an apparatus, method, system, and computer-readable storage medium for generating training data of a machine learning model and generating a similar image using the machine learning model.

여기에서 달리 언급하지 않으면 본 섹션에서 기술되는 내용은 본 출원에서의 청구범위의 선행 기술이 아니며, 본 섹션에 기재하였다는 이유로 선행 기술로 인정되어서는 안 된다.Unless otherwise stated herein, the subject matter described in this section is not prior art to the claims in this application, and should not be admitted as prior art on the grounds that it is recited in this section.

스마트폰과 같은 개인용 통신 장치의 사용이 일반화되고 통신기술의 발전에 의해 전송속도가 증가함에 따라 이용가능한 온라인 콘텐츠가 폭발적으로 증가하고 있다. 텍스트 위주의 종래 온라인 콘텐츠와 다르게, 최근 온라인 콘텐츠 대부분은 이미지 또는 복수의 이미지로 구성되는 비디오 데이터를 포함하고 있다. 이와 같이, 온라인 상의 이미지의 수가 극적으로 증가함에 따라, 이러한 이미지의 이용에 따른 저작권 침해 문제가 뒤따른다.As the use of personal communication devices, such as smart phones, is generalized and transmission speed increases due to the development of communication technology, available online contents are explosively increasing. Unlike the text-oriented conventional online content, most of the recent online content includes an image or video data composed of a plurality of images. As such, as the number of images online increases dramatically, the problem of copyright infringement with the use of such images follows.

최근 머신러닝의 기법의 발전에 따라, 머신러닝 기법을 이용하여 이미지를 생성하는 것이 고려된다. 머신러닝 기법을 이용하여 이미지를 생성할 경우 사용자는 직접 사진을 찍거나 제작하지 않아도 되므로, 비용 및 시간적으로 이점을 얻을 수 있다. 또한, 머신러닝 기법을 적절히 이용할 경우, 저작권 침해의 문제를 피하는 것도 가능할 수 있다. With the recent development of machine learning techniques, it is considered to generate images using machine learning techniques. Using machine learning techniques to generate images provides cost and time benefits as users do not have to take or create photos themselves. It may also be possible to avoid the problem of copyright infringement if machine learning techniques are used appropriately.

"Generative Adversarial Nets, Ian J. Goodfellow 외"(이하, "선행기술문헌 1")에서는 적대적인(adversarial) 과정을 통하여, 머신러닝을 이용한 이미지 생성모델을 평가하는 모델을 제시하고 있다. 이러한 모델에 따르면, 머신러닝 기법에 따라 생성망에서 유사 이미지를 생성하고, 생성된 유사 이미지와 실제 이미지를 비교하여 생성된 유사 이미지가 실제 이미지인지 여부를 평가하여 피드백함으로써, 생성망의 성능을 향상시킨다. "StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation, 최윤제 외"(이하, "선행기술문헌 2")에서는 선행기술문헌 1을 기본 구조로 하여, 성별, 나이, 표정 등의 다양한 도메인을 반영한 유사 이미지를 생성할 수 있는 모델을 제시한다."Generative Adversarial Nets, Ian J. Goodfellow et al." (hereinafter, "Prior Art Document 1") proposes a model for evaluating an image generation model using machine learning through an adversarial process. According to this model, the performance of the generative network is improved by generating a similar image in the generative network according to the machine learning technique, comparing the generated similar image with the real image, and evaluating whether the generated similar image is a real image and feedback. make it In "StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation, Choi, et al." (hereinafter, "Prior Art Document 2"), using Prior Art Document 1 as a basic structure, We present a model that can generate similar images reflecting various domains.

그러나, 이러한 머신러닝 기반의 이미지 생성 기법을 이용함에 있어서, 해당 머신러닝 모델이 충분한 성능을 발휘하도록 하기 위해서는 매우 많은 양의 학습 데이터, 즉 이미지를 해당 머신러닝 모델에 입력하여 학습을 수행시켜야 한다. 이는 머신 러닝 모델을 구축함에 있어서 필수적이지만, 매우 번거로운 작업이며 많은 시간과 비용이 소요될 수 있다.However, in using this machine learning-based image generation technique, in order for the machine learning model to exhibit sufficient performance, it is necessary to input a very large amount of training data, ie, images, to the machine learning model to perform learning. This is essential for building machine learning models, but it is a very cumbersome task and can take a lot of time and money.

본 개시는 위와 같은 문제점들을 해결하기 위한 것으로서, 머신러닝 모델의 학습 데이터 생성과 머신러닝 모델을 이용한 유사 이미지 생성을 위한 장치, 방법 및 컴퓨터 판독가능 저장 매체를 제시한다.The present disclosure is intended to solve the above problems, and provides an apparatus, a method, and a computer-readable storage medium for generating training data of a machine learning model and generating a similar image using a machine learning model.

본 개시의 일부 실시예에서, 학습 데이터 생성 방법이 설명된다. 예시적인 학습 데이터 생성 방법은 컴퓨팅 장치의 제어 하에서 수행되고, 머신러닝 기반의 유사 이미지 생성을 수행하는 머신러닝 모델의 학습 데이터를 생성할 수 있다. 학습 데이터 생성 방법은 이미지 소스를 획득하는 단계; 이미지 소스를 분석하여 이미지 소스 내의 학습 구간을 설정하는 단계; 설정된 학습 구간으로부터 레이블 이미지를 추출하는 단계; 설정된 학습 구간 및 레이블 이미지에 기초하여, 적어도 하나의 인접 이미지를 추출하는 단계; 및 레이블 이미지 및 적어도 하나의 인접 이미지에 기초하여, 머신러닝 모델에 입력하기 위한 학습 데이터를 생성하는 단계를 포함할 수 있다. In some embodiments of the present disclosure, a method of generating training data is described. The exemplary method of generating training data may be performed under the control of a computing device and may generate training data of a machine learning model that performs machine learning-based similar image generation. The training data generation method includes: acquiring an image source; analyzing the image source to set a learning section in the image source; extracting a label image from the set learning section; extracting at least one adjacent image based on the set learning section and the label image; and based on the label image and the at least one adjacent image, generating training data for input to the machine learning model.

일부 예시에서, 머신러닝 모델은 적대 신경망(Generative Adversarial Network)을 포함할 수 있다. 머신러닝 모델은 적어도 하나의 인접 이미지를 유사 이미지의 생성에 이용하고, 레이블 이미지를 생성된 유사 이미지의 판별에 사용되는 실제 이미지(real image)로서 이용할 수 있다. In some examples, the machine learning model may include a Generative Adversarial Network. The machine learning model may use at least one adjacent image to generate a similar image, and use the label image as a real image used to determine the generated similar image.

일부 다른 실시예에서, 유사 이미지 생성 장치가 설명된다. 예시적인 유사 이미지 생성 장치는 이미지 소스 관리부, 학습 구간 설정부, 레이블 이미지 추출부, 인접 이미지 추출부 및 유사 이미지 생성부를 포함할 수 있다. 이미지 소스 관리부는 이미지 소스를 획득할 수 있다. 학습 구간 설정부는 이미지 소스를 분석하여, 이미지 소스 내의 학습 구간을 설정할 수 있다. 레이블 이미지 추출부는 설정된 학습 구간으로부터 레이블 이미지를 추출할 수 있다. 인접 이미지 추출부는 설정된 학습 구간 및 레이블 이미지에 기초하여, 적어도 하나의 인접 이미지를 추출할 수 있다. 유사 이미지 생성부는 머신러닝 기반의 이미지 생성을 수행하는 머신러닝 모델을 포함하고, 머신러닝 모델을 이용하여 유사 이미지를 생성하고, 레이블 이미지 및 적어도 하나의 인접 이미지를 이용하여 머신러닝 모델을 학습시킬 수 있다.In some other embodiments, a similar image generating apparatus is described. The exemplary similar image generating apparatus may include an image source manager, a learning section setting unit, a label image extracting unit, an adjacent image extracting unit, and a similar image generating unit. The image source manager may acquire an image source. The learning section setting unit may analyze the image source and set the learning section in the image source. The label image extractor may extract a label image from the set learning section. The adjacent image extractor may extract at least one adjacent image based on the set learning section and the label image. The similar image generator includes a machine learning model that generates a machine learning-based image, generates a similar image using the machine learning model, and trains the machine learning model using the label image and at least one adjacent image. have.

일부 또 다른 실시예에서, 학습 데이터 생성 프로그램을 저장하는 컴퓨터 판독가능 저장 매체가 설명된다. 학습 데이터 생성 프로그램은 머신러닝 기반의 이미지 생성을 수행하는 머신러닝 모델의 학습 데이터를 생성할 수 있다. 예시적인 학습 데이터 생성 프로그램은 컴퓨팅 장치에 의해 실행되는 경우, 이미지 소스를 획득하는 동작; 이미지 소스를 분석하여 이미지 소스 내의 학습 구간을 설정하는 동작; 설정된 학습 구간으로부터 레이블 이미지를 추출하는 동작; 설정된 학습 구간 및 레이블 이미지에 기초하여, 적어도 하나의 인접 이미지를 추출하는 동작; 및 레이블 이미지 및 적어도 하나의 인접 이미지에 기초하여, 머신러닝 모델에 입력하기 위한 학습 데이터를 생성하는 동작을 수행하도록 하는 하나 이상의 명령어를 포함할 수 있다. In some still other embodiments, a computer-readable storage medium storing a program for generating learning data is described. The training data generation program may generate training data of a machine learning model that performs machine learning-based image generation. An exemplary learning data generating program, when executed by a computing device, includes: obtaining an image source; analyzing the image source to set a learning section in the image source; extracting a label image from the set learning section; extracting at least one adjacent image based on the set learning section and the label image; and one or more instructions to perform an operation of generating training data for input to the machine learning model based on the label image and the at least one neighboring image.

이상의 간단한 요약 및 효과에 관한 설명은 단순히 예시적인 것으로서 본 개시에서 의도한 기술적 사항을 제한하기 위한 것이 아니다. 이하의 상세한 설명과 첨부된 도면을 참조함으로써, 전술한 예시적인 실시예들과 기술적 특징들에 더하여, 추가적인 실시예와 기술적 특징들이 이해될 수 있을 것이다.The above brief summary and description of effects are merely exemplary and are not intended to limit the technical matters intended in the present disclosure. In addition to the above-described exemplary embodiments and technical features, additional embodiments and technical features may be understood by referring to the following detailed description and accompanying drawings.

본 개시의 전술한 특징들 및 기타 특징들은, 첨부 도면을 참조하여 이하의 설명으로부터 충분히 분명해질 것이다. 이러한 도면들은 본 개시에 따르는 단지 몇 가지의 실시예만을 도시한 것이고, 따라서 그 범위를 제한하는 것으로 간주되어서는 안되는 것을 이해하면서, 본 개시는 첨부된 도면의 사용을 통하여, 더 구체적이고 상세하게 기술될 것이다.
도 1은 본 개시의 적어도 일부 실시예에 따른, 유사 이미지 생성 장치를 도시하는 블록도이다.
도 2는 도 1의 유사 이미지 생성부의 일 예시를 도시하는 블록도이다.
도 3은 본 개시의 일부 실시예에 따라, 학습 데이터의 생성에 이용되는 레이블 이미지 및 인접 이미지가 추출되는 예시를 도시한다.
도 4는 본 개시의 적어도 일부 실시예에 따라, 학습 데이터를 생성하는 예시적인 프로세스를 도시한다.
도 5은 본 개시의 적어도 일부 실시예에 따라, 유사 이미지 생성에 관한 머신러닝 모델의 학습 데이터를 생성하는 데 이용될 수 있는 컴퓨터 프로그램 제품을 도시한다.
도 6은 본 개시의 적어도 일부 실시예에 따라 배열되는 컴퓨팅 장치의 예시적인 블록도이다.The foregoing and other features of the present disclosure will become sufficiently apparent from the following description with reference to the accompanying drawings. With the understanding that these drawings illustrate only a few embodiments in accordance with the present disclosure, and therefore should not be considered limiting of their scope, the present disclosure is described in greater detail and detail through the use of the accompanying drawings. will be
1 is a block diagram illustrating an apparatus for generating a similar image, according to at least some embodiments of the present disclosure.
FIG. 2 is a block diagram illustrating an example of the similar image generator of FIG. 1 .
3 illustrates an example in which a label image and an adjacent image used to generate training data are extracted, according to some embodiments of the present disclosure.
4 depicts an example process for generating training data, in accordance with at least some embodiments of the present disclosure.
5 illustrates a computer program product that may be used to generate training data of a machine learning model for generating similar images, in accordance with at least some embodiments of the present disclosure.
6 is an exemplary block diagram of a computing device arranged in accordance with at least some embodiments of the present disclosure.

이하, 첨부한 도면을 참조하여 본 개시가 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본원의 구현예 및 실시예를 상세히 설명한다. 그러나, 본원은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 구현예 및 실시예에 한정되지 않는다. Hereinafter, embodiments and examples of the present application will be described in detail with reference to the accompanying drawings so that those of ordinary skill in the art to which the present disclosure pertains can easily carry out. However, the present application may be embodied in many different forms and is not limited to the embodiments and examples described herein.

본 개시는 일반적으로, 유사 이미지를 생성하는 머신러닝 모델에 이용되는 학습 데이터를 생성하기 위한 방법, 장치 및 컴퓨터 판독가능 저장 매체 및 이에 저장된 프로그램에 관한 것이다.The present disclosure relates generally to a method, apparatus, and a computer-readable storage medium and a program stored thereon for generating training data used in a machine learning model that generates a similar image.

도 1은 본 개시의 적어도 일부 실시예에 따른, 유사 이미지 생성 장치(100)를 도시하는 블록도이다. 유사 이미지 생성 장치(100)는 예컨대, 데스크 탑 컴퓨터, 랩탑 컴퓨터, 스마트폰, 태블릿 컴퓨터, 휴대 전화기, 개인 휴대용 단말기(PDA), 특정 용도 장치 또는 상기 기능 중 임의의 것을 포함하는 융합 장치와 같은 단일 컴퓨팅 장치; 또는 서버 팜(server farm), 분산형 네트워크, 클라우드 컴퓨팅 구성과 같이 복수의 컴퓨팅 장치와 같은 복수 컴퓨팅 구성일 수 있다. 따라서, 유사 이미지 생성 장치(100)는 개인용 단말 장치로부터 요청을 수신하고 그러한 요청에 응답하여 출력을 전송하는 서버로서 구현되거나 개인용 단말 장치에서 구현되는 것도 가능하다.1 is a block diagram illustrating a similar image generating apparatus 100 according to at least some embodiments of the present disclosure. The image-like device 100 may be a single device such as, for example, a desktop computer, a laptop computer, a smartphone, a tablet computer, a mobile phone, a personal digital assistant (PDA), a special purpose device, or a fusion device comprising any of the above functions. computing device; Alternatively, it may be a multiple computing configuration, such as a plurality of computing devices, such as a server farm, a distributed network, or a cloud computing configuration. Accordingly, the similar image generating apparatus 100 may be implemented as a server that receives a request from a personal terminal device and transmits an output in response to the request, or may be implemented in a personal terminal device.

유사 이미지 생성 장치(100)는 이미지 소스 관리부(110), 학습 구간 설정부(120), 레이블 이미지 추출부(130), 인접 이미지 추출부(140) 및 유사 이미지 생성부(150)를 포함한다. 추가적이거나 선택적인 실시예에서, 유사 이미지 생성 장치(100)는 이미지 전처리부(160) 및 메타 데이터 추출부(170) 중 적어도 하나를 더 포함할 수 있다. The similar image generating apparatus 100 includes an image source manager 110 , a learning section setting unit 120 , a label image extracting unit 130 , an adjacent image extracting unit 140 , and a similar image generating unit 150 . In an additional or optional embodiment, the similar image generating apparatus 100 may further include at least one of an image preprocessor 160 and a metadata extractor 170 .

유사 이미지 생성 장치(100)는 본 개시에 따른 하나 이상의 프로그램을 내부 메모리에 저장할 수 있으며, 이러한 하나 이상의 프로그램은 하나 이상의 프로세서, 예컨대, 중앙 처리 유닛(CPU) 또는 그래픽 처리 유닛(GPU) 등의 실행 및 처리에 의해, 이하에서 설명되는 유사 이미지 생성 장치(100)의 각 컴포넌트들(110, 120, 130, 140 및 150) 및 선택/추가적인 컴포넌트들(160 및 170)의 동작 중 적어도 일부를 수행할 수 있다. 이러한 컴포넌트들(110, 120, 130, 140 및 150)과 선택적인 컴포넌트들(160 및 170)은 개별적인 컴포넌트로 도시되었으나, 개시된 대상의 범위를 벗어나지 않으면서, 추가적인 컴포넌트로 분리될 수 있거나 더 적은 컴포넌트로 조합되거나 제거될 수 있다. 또한, 도 2에서는 컴포넌트가 소프트웨어에 의해 구현되는 것으로 도시되지만, 당업자라면 컴포넌트의 각각의 기능 및/또는 동작이 하드웨어, 소프트웨어, 펌웨어 또는 그들의 임의의 조합을 통하여 개별적으로 및/또는 종합적으로 구현될 수 있음을 이해할 것이다.The similar image generating apparatus 100 may store one or more programs according to the present disclosure in an internal memory, and the one or more programs may be executed by one or more processors, for example, a central processing unit (CPU) or a graphics processing unit (GPU). and the processing to perform at least some of the operations of the respective components 110 , 120 , 130 , 140 and 150 and the selection/additional components 160 and 170 of the similar image generating apparatus 100 described below. can Although these components 110 , 120 , 130 , 140 and 150 and optional components 160 and 170 are shown as separate components, they may be separated into additional components or fewer components without departing from the scope of the disclosed subject matter. can be combined or eliminated. In addition, although the component is illustrated as being implemented by software in FIG. 2 , each function and/or operation of the component may be individually and/or collectively implemented through hardware, software, firmware, or any combination thereof, to those skilled in the art. will understand that

이미지 소스 관리부(110)는 이미지 소스를 획득하도록 구성된다. 일부 예시에서, 이미지 소스 관리부(110)는 사용자에 의해 미리 정해진 방식에 의해 이미지 소스를 수집할 수 있다. 이미지 소스의 수집은 오픈소스 저장소의 네트워크 주소를 통한 획득, 미리 정해진 저장소로부터의 수신, 웹크롤링 등 다양한 방식이 이용될 수 있다. 일부 다른 예시에서, 이미지 소스 관리부(110)는 유사 이미지 생성 장치(100)의 사용자로부터 이미지 소스를 직접 입력받을 수 있다. 일부 또 다른 예시에서, 이미지 소스 관리부(110)는 유사 이미지 생성 장치(100)와 연관된 외부 컴퓨팅 장치로부터 또는 사용자에 의해 입력된 네트워크의 주소로부터 이미지 소스를 수신할 수 있다. 획득된 이미지 소스는 유사 이미지 생성 장치(100)의 내부 메모리에 저장될 수 있다. 이미지 소스는 이하에서 상세히 설명되는 바와 같이, 복수의 이미지를 추출할 수 있는 다양한 유형의 이미지 데이터일 수 있다. 이미지 소스는 예컨대, 복수의 프레임 이미지를 가지는 비디오 데이터, 360도 이미지와 같은 파노라마 이미지, 360도 비디오 데이터 등을 포함할 수 있다. 추가적으로, 이미지 소스 관리부(110)는 획득된 이미지 소스를 소정의 기법을 이용하여 분류하고 저장할 수 있다.The image source management unit 110 is configured to acquire an image source. In some examples, the image source manager 110 may collect the image source in a manner predetermined by the user. Various methods, such as acquisition through a network address of an open source repository, reception from a predetermined repository, and web crawling, may be used for the collection of the image source. In some other examples, the image source manager 110 may directly receive an image source from a user of the similar image generating apparatus 100 . In some other examples, the image source manager 110 may receive the image source from an external computing device associated with the similar image generating apparatus 100 or from an address of a network input by a user. The acquired image source may be stored in an internal memory of the similar image generating apparatus 100 . The image source may be various types of image data from which a plurality of images may be extracted, as described in detail below. The image source may include, for example, video data having a plurality of frame images, a panoramic image such as a 360 degree image, 360 degree video data, and the like. Additionally, the image source manager 110 may classify and store the acquired image source using a predetermined technique.

일부 선택적인 예시에서, 이미지 소스 관리부(110)는 사용자로부터 이미지 소스를 입력받을 수 있으며, 메타 데이터 추출부(170)는 입력된 이미지 소스로부터 메타 데이터를 추출하고, 이미지 소스 관리부(110)는 메타 데이터에 기초하여, 추가적인 이미지 소스의 획득을 수행할 수 있다. 이러한 예시에서, 이미지 소스 관리부(110)는 메타 데이터에 기초하여, 이미지 소스를 분류 및 저장할 수 있다.In some optional examples, the image source management unit 110 may receive an image source input from a user, the metadata extraction unit 170 extracts metadata from the input image source, and the image source management unit 110 receives the metadata. Based on the data, acquisition of additional image sources may be performed. In this example, the image source manager 110 may classify and store the image source based on the metadata.

학습 구간 설정부(120)는 이미지 소스 관리부(110)에 의해 획득된 이미지 소스를 분석하여 이미지 소스 내의 학습 구간을 설정할 수 있다. 학습 구간 설정부(120)는 이미지 소스 내에서 학습 가능한 구간을 도출하기 위하여, 다양한 분석을 수행할 수 있으며, 분석의 결과에 기초하여 학습 구간을 설정할 수 있다. 학습 구간 설정부(120)는 예컨대, 이미지의 소스의 색상 변화율의 측정, 이미지 복잡도의 측정 및 이미지 소스 내 오브젝트에 대한 구조 유사도 측정 중 적어도 하나를 수행할 수 있다.The learning section setting unit 120 may set a learning section in the image source by analyzing the image source obtained by the image source management unit 110 . The learning section setting unit 120 may perform various analyzes in order to derive a learnable section within the image source, and may set the learning section based on the result of the analysis. The learning section setting unit 120 may perform, for example, at least one of measuring a color change rate of an image source, measuring image complexity, and measuring a structural similarity of an object in the image source.

일부 예시에서, 학습 구간 설정부(120)는 이미지 소스 내에서 색상 변화율(Color Changing Ratio)을 측정할 수 있다. 이미지 소스가 비디오 데이터인 경우, 학습 구간 설정부(120)는 비디오 데이터 내 복수의 프레임들 간의 색상 변화율을 측정할 수 있다. 이미지 소스가 파노라마 이미지인 경우, 학습 구간 설정부(120)는 소정 사이즈의 프레임을 이용하여, 파노라마 이미지로부터 개별 이미지를 추출하고 추출된 이미지들 사이의 색상 변화율을 측정할 수 있다. 학습 구간 설정부(120)는 측정된 색상 변화율이 미리 정해진 범위 내에 있는 구간을 설정할 수 있으며, 이러한 구간을 학습 구간으로 결정할 수 있다.In some examples, the learning interval setting unit 120 may measure a color changing ratio in the image source. When the image source is video data, the learning interval setting unit 120 may measure a color change rate between a plurality of frames in the video data. When the image source is a panoramic image, the learning section setting unit 120 may extract an individual image from the panoramic image using a frame of a predetermined size and measure a color change rate between the extracted images. The learning section setting unit 120 may set a section in which the measured color change rate is within a predetermined range, and may determine this section as the learning section.

일부 다른 예시에서, 학습 구간 설정부(120)는 이미지 소스 내에서 이미지 복잡도(Image Complexity)를 측정할 수 있다. 이미지 소스 내에 소정의 오브젝트, 예컨대, 특정 랜드 마크가 포함되는 경우, 해당 오브젝트를 포함하는 이미지는 해당 오브젝트를 포함하지 않는 이미지와 이미지 복잡도가 상이할 수 있다. 이러한 예시에서, 학습 구간 설정부(120)는 해당 오브젝트가 포함되지 않는 구간, 예컨대, 복잡도가 소정의 범위를 벗어나는 구간을 설정될 학습 구간에서 제외할 수 있다. In some other examples, the learning interval setting unit 120 may measure image complexity in the image source. When a predetermined object, for example, a specific landmark is included in the image source, the image including the object may have different image complexity from the image not including the object. In this example, the learning section setting unit 120 may exclude a section in which a corresponding object is not included, for example, a section having a complexity outside a predetermined range from the learning section to be set.

일부 또 다른 예시에서, 학습 구간 설정부(120)는 이미지 소스 내에서 구조 유사도(Structural Similarity)를 측정할 수 있다. 이미지 소스 내에는 장애 요소가 있을 수 있다. 예컨대, 경복궁과 같은 특정 랜드 마크에 대한 비디오 데이터에는 행인과 같이, 해당 랜드 마크를 지나가는 행인 또는 차량과 같은 장애 요소가 있을 수 있다. 이러한 장애 요소는 사용자 또는 미리 저장된 데이터에 의해 미리 정의될 수 있으며, 학습 구간 설정부(120)는 미리 정의된 장애 요소와 유사한 구조도를 가지는 오브젝트를 이미지 소스로부터 검출할 수 있으며, 장애 요소가 포함되는 구간은 학습 구간에서 제외할 수 있다.In some other examples, the learning interval setting unit 120 may measure structural similarity within the image source. There may be obstacles within the image source. For example, in video data for a specific landmark such as Gyeongbokgung, there may be obstacles such as pedestrians or vehicles passing the landmark. These obstacles may be predefined by the user or pre-stored data, and the learning section setting unit 120 may detect an object having a structure similar to the predefined obstacles from the image source, and the obstacles are included. Sections can be excluded from the learning section.

레이블 이미지 추출부(130)는 학습 구간 설정부(120)에 의하여 설정된 학습 구간으로부터 레이블 이미지를 추출할 수 있다. 레이블 이미지 추출부(130)는 이미지 소스에 대하여 설정된 학습 구간 내에서, 이미지 소스로부터 레이블 이미지를 결정할 수 있다. 레이블 이미지는 이미지 소스 내에 포함된 주된 오브젝트를 포함하는 이미지로서, 대표적인 이미지를 지칭할 수 있다. 일부 예시에서, 레이블 이미지 추출부(130)는 학습 구간 내에서 임의의 방식으로 레이블 이미지를 추출할 수 있다. 일부 다른 예시에서, 레이블 이미지 추출부(130)는 설정된 학습 구간의 중심에 위치한 이미지, 즉 중심 이미지를 레이블 이미지로서 추출할 수 있다. 이미지 소스가 비디오 데이터인 경우, 레이블 이미지 추출부(130)는 중심 프레임의 이미지를 레이블 이미지로서 결정할 수 있다. 일부 또 다른 예시에서, 레이블 이미지 추출부(130)는 학습 구간 내에서 추출되는 복수의 이미지 중에서 하나 이상의 특성을 만족하거나 하나 이상의 특성 범위 내에 들어가는 이미지를 레이블 이미지로서 결정할 수 있다. The label image extractor 130 may extract a label image from the learning section set by the learning section setting unit 120 . The label image extractor 130 may determine a label image from the image source within a learning section set for the image source. The label image is an image including a main object included in an image source, and may refer to a representative image. In some examples, the label image extraction unit 130 may extract the label image in an arbitrary manner within the learning section. In some other examples, the label image extraction unit 130 may extract an image located at the center of the set learning section, that is, the center image as the label image. When the image source is video data, the label image extractor 130 may determine the image of the center frame as the label image. In some other examples, the label image extractor 130 may determine, as the label image, an image that satisfies one or more characteristics or falls within one or more characteristic ranges among a plurality of images extracted within the learning section.

인접 이미지 추출부(140)는 학습 구간 설정부(120)에 의해 설정된 학습 구간 및 레이블 이미지 추출부(130)에 의해 추출된 레이블 이미지에 기초하여 적어도 하나의 인접 이미지를 추출할 수 있다. 일부 예시에서, 인접 이미지 추출부(140)는 레이블 이미지를 학습 구간의 기준 이미지로서 이용하여, 레이블 이미지로부터 소정의 간격이 떨어진 이미지를 인접 이미지로서 추출할 수 있다. 일 예시에서, 인접 이미지 추출부(140)는 소정의 간격으로 떨어진 2 개의 이미지를 인접 이미지로서 추출할 수 있다. 일부 다른 예시에서, 인접 이미지 추출부(140)는 학습 구간 내에서 임의의 이미지를 추출할 수 있으며, 추출된 이미지와 레이블 이미지 간의 구조 유사도를 측정하고, 측정된 구조 유사도가 미리 정해진 범위(예컨대, 구조 유사도가 0.9 이상)인 경우, 해당 이미지를 인접 이미지로서 추출할 수 있다. The adjacent image extractor 140 may extract at least one adjacent image based on the learning section set by the learning section setting unit 120 and the label image extracted by the label image extractor 130 . In some examples, the adjacent image extraction unit 140 may extract an image separated by a predetermined distance from the label image as the adjacent image by using the label image as a reference image of the learning section. In one example, the adjacent image extraction unit 140 may extract two images separated by a predetermined interval as adjacent images. In some other examples, the adjacent image extraction unit 140 may extract an arbitrary image within the learning section, measure the structural similarity between the extracted image and the label image, and the measured structural similarity is within a predetermined range (eg, If the structural similarity is 0.9 or more), the corresponding image may be extracted as an adjacent image.

유사 이미지 생성부(150)는 머신러닝 기반의 유사 이미지 생성을 수행하는 머신러닝 모델을 포함할 수 있다. 유사 이미지 생성부(150)는 해당 머신러닝 모델의 학습을 위한 학습 데이터를 생성할 수 있다. 따라서, 유사 이미지 생성부(150)는 레이블 이미지 추출부(130)에 의해 추출된 레이블 이미지와 인접 이미지 추출부(140)에 의해 추출된 적어도 하나의 인접 이미지에 기초하여, 학습 데이터를 생성할 수 있다. 유사 이미지 생성부(150)의 머신러닝 모델은 예컨대, 생성적 적대 신경망(Generative Adversarial Network; GAN), Star GAN, 룰 베이스 시스템(rule-based system) 일 수도 있고, 신경망 베이스 시스템(neural network-based system)(예: 합성곱 신경망(convolutional neural network(CNN)), 피드포워드 신경망(feedforward neural network(FNN)), 순환 신경망(recurrent neural network(RNN)) 등과 같은 다양한 유형의 인공지능 네트워크를 포함할 수 있다. 일부 예시에서, 머신러닝 모델은 적어도 하나의 인접 이미지를 유사 이미지의 생성에 이용하고, 레이블 이미지를 생성된 유사 이미지의 판별에 사용되는 실제 이미지(real image)로서 이용할 수 있다. GAN의 예시에 대해서는 도 2에서 보다 구체적으로 설명한다. 유사 이미지 생성부(150)는 생성된 학습 데이터를 이용하여, 머신러닝 모델의 학습을 수행할 수 있다.The similar image generator 150 may include a machine learning model that generates a similar image based on machine learning. The similarity image generator 150 may generate training data for learning the corresponding machine learning model. Accordingly, the similar image generating unit 150 may generate training data based on the label image extracted by the label image extracting unit 130 and at least one neighboring image extracted by the neighboring image extracting unit 140 . have. The machine learning model of the similar image generator 150 may be, for example, a generative adversarial network (GAN), a star GAN, a rule-based system, or a neural network-based system. system) (e.g. convolutional neural networks (CNN)), feedforward neural networks (FNNs), recurrent neural networks (RNNs), etc. In some examples, the machine learning model may use at least one adjacent image to generate a similar image, and use the label image as a real image used to determine the generated similar image. An example will be described in more detail with reference to Fig. 2. The similar image generator 150 may use the generated training data to learn the machine learning model.

이와 같이, 레이블 이미지 추출부(130)에 의해 추출된 레이블 이미지와 인접 이미지 추출부(140)에 의해 추출된 적어도 하나의 인접 이미지는 유사 이미지 생성부(150)의 입력으로서 이용될 수 있다. 추가적이고 선택적인 일부 예시에서, 이미지 전처리부(160)는 레이블 이미지 및/또는 적어도 하나의 인접 이미지를 이용하여 전처리를 수행하여 추가적인 학습 데이터를 위한 이미지를 생성할 수 있다. 일부 예시에서, 이미지 전처리부(160)는 레이블 이미지 및/또는 적어도 하나의 인접 이미지에 의도적인 노이즈를 추가함으로써, 생성된 이미지에 의해 유사 이미지 생성부의 학습에서 과적합(overfitting)이 발생하는 것을 방지할 수 있다. 일부 예시에서, 이미지 전처리부(160)는 학습을 위한 추가 데이터를 생성하기 위하여, OpenCV(Open Source Computer Vision)을 이용하여 이미지의 외곽선(Contour)의 추출 및 유사도 비교를 수행할 수 있다. In this way, the label image extracted by the label image extraction unit 130 and the at least one adjacent image extracted by the adjacent image extraction unit 140 may be used as an input of the similar image generation unit 150 . In some additional and optional examples, the image preprocessor 160 may generate an image for additional training data by performing preprocessing using the label image and/or at least one adjacent image. In some examples, the image preprocessor 160 adds intentional noise to the label image and/or at least one adjacent image, thereby preventing overfitting from occurring in the training of the similar image generator by the generated image. can do. In some examples, the image preprocessor 160 may extract an image contour and compare the similarity by using Open Source Computer Vision (OpenCV) to generate additional data for learning.

추가적이고 선택적인 일부 예시에서, 메타 데이터 추출부(170)는 이미지 소스, 레이블 이미지 및 적어도 하나의 인접 이미지 중 적어도 하나에 기초하여, 이미지의 메타 데이터를 추출할 수 있다. 이러한 메타 데이터는 예컨대, 이미지 내 오브젝트의 명칭, 이미지의 주제, 이미지 상의 날씨, 이미지 상의 시각, 오브젝트의 색 분포 등과 같은 이미지의 컨텍스트(context)에 관한 데이터를 포함할 수 있다. 이러한 메타 데이터는 유사 이미지 생성부(150)에 입력되는 레이블 이미지와 적어도 하나의 인접 이미지에 더하여, 학습 데이터로서 활용될 수 있다. 예컨대, 유사 이미지 생성부(150)는 레이블 이미지 및 적어도 하나의 인접 이미지뿐만 아니라, 메터데이터 추출부(170)에 의해 추출된 메타 데이터에 기초하여, 유사 이미지를 생성에 관한 학습을 수행할 수 있다.In some additional and optional examples, the metadata extraction unit 170 may extract metadata of an image based on at least one of an image source, a label image, and at least one adjacent image. Such metadata may include, for example, data about the context of the image, such as the name of an object in the image, the subject of the image, weather on the image, time on the image, color distribution of the object, and the like. Such metadata may be used as learning data in addition to the label image and at least one adjacent image input to the similar image generating unit 150 . For example, the similarity image generating unit 150 may learn to generate the similarity image based on the label image and at least one adjacent image, as well as the metadata extracted by the metadata extracting unit 170 . .

이와 같이, 본 개시에 따르면, 복수의 이미지를 추출할 수 있는 이미지 소스, 예컨대, 비디오 데이터, 소정 사이즈의 이미지 등을 공급하는 것만으로 머신러닝 모델의 학습에 이용되는 데이터를 자동적으로 생성하여, 머신러닝 모델에 공급할 수 있으므로, 수동적으로 학습 데이터를 제공해야 하는 경우에 비하여, 머신러닝 모델의 학습의 편의를 도모할 수 있는 한편, 머신러닝 모델의 학습의 효율 및 속도도 향상시킬 수 있다.As described above, according to the present disclosure, data used for learning a machine learning model is automatically generated just by supplying an image source from which a plurality of images can be extracted, for example, video data, an image of a predetermined size, etc. Since it can be supplied to the learning model, it is possible to promote the convenience of learning the machine learning model compared to the case where learning data needs to be provided passively, while improving the efficiency and speed of learning the machine learning model.

도 2는 도 1의 유사 이미지 생성부(150)의 일 예시를 도시하는 블록도이다. 유사 이미지 생성부(150)는 도 2에 도시된 바와 같이, 생성적 적대 신경망(GAN)을 포함하고 있으나, 본 개시에 따른 실시예는 이러한 이미지 생성 모델에 제한되지 않음이 이해될 것이다. 도 2에 도시된 바에 따르면, 유사 이미지 생성부(150)는 유사 이미지 생성망(210), 유사 이미지 판별망(220)(Fake Image Discriminate Network), 피드백부(230) 및 출력부(240)를 포함한다.FIG. 2 is a block diagram illustrating an example of the similar image generator 150 of FIG. 1 . As shown in FIG. 2 , the similar image generator 150 includes a generative adversarial neural network (GAN), but it will be understood that the embodiment according to the present disclosure is not limited to such an image generation model. As shown in FIG. 2 , the similar image generating unit 150 includes a similar image generating network 210 , a fake image discriminate network 220 , a feedback unit 230 , and an output unit 240 . include

유사 이미지 생성부(150)는 도 1에 관하여 설명된 바와 같이, 레이블 이미지와 적어도 하나의 인접 이미지에 기초하여, 학습 데이터를 생성한다. 일부 예시에서, 유사 이미지 생성망(210)은 적어도 하나의 인접 이미지를 이용하여, 유사 이미지를 생성할 수 있다. 유사 이미지 판별망(220)은 유사 이미지 생성망(210)에 의해 생성된 이미지가 유사 이미지(fake image)인지 실제 이미지(real image)인지 판별한다. 일부 예시에서, 유사 이미지 판별망(220)은 레이블 이미지를 실제 이미지로서 이용하여, 생성된 이미지가 유사 이미지인지 실제 이미지인지를 결정한다. 피드백부(230)는 유사 이미지 판별망(230)의 결정의 결과에 기초하여, 유사 이미지 생성망(210)과 유사 이미지 판별망(220)에 피드백을 제공할 수 있다. 이와 같은 방식으로, 유사 이미지 생성부(150) 내 머신 러닝 모델은 학습을 수행할 수 있다. 출력부(240)는 소정의 요청 또는 조건에 의해, 유사 이미지 생성망(210)에 의해 생성된 유사 이미지를 출력할 수 있다.As described with reference to FIG. 1 , the similar image generator 150 generates training data based on the label image and at least one adjacent image. In some examples, the similar image generating network 210 may generate a similar image by using at least one adjacent image. The similar image determining network 220 determines whether the image generated by the similar image generating network 210 is a fake image or a real image. In some examples, the similar image determination network 220 uses the label image as an actual image to determine whether the generated image is a similar image or an actual image. The feedback unit 230 may provide feedback to the similarity image generating network 210 and the similarity image determining network 220 based on a result of the determination of the similarity image determining network 230 . In this way, the machine learning model in the similar image generating unit 150 may perform learning. The output unit 240 may output a similar image generated by the similar image generating network 210 according to a predetermined request or condition.

도 3은 본 개시의 일부 실시예에 따라, 학습 데이터의 생성에 이용되는 레이블 이미지 및 인접 이미지가 추출되는 예시를 도시한다. 본 개시의 일부 실시예에 따르면, 도 3에 도시된 바와 같은 학습 구간(310)이 설정될 수 있다. 이미지 소스가 비디오 데이터인 경우, 소정 시간 구간의 복수의 프레임 이미지가 학습 구간에 포함될 수 있다. 이미지 소스가 파노라마 이미지와 같은 단일 이미지인 경우, 후속하여 추출될 레이블 이미지와 인접 이미지보다 큰 소정의 사이즈의 공간적 범위가 학습 구간(310)으로서 설정될 수 있다.3 illustrates an example in which a label image and an adjacent image used to generate training data are extracted, according to some embodiments of the present disclosure. According to some embodiments of the present disclosure, a learning section 310 as shown in FIG. 3 may be set. When the image source is video data, a plurality of frame images of a predetermined time period may be included in the learning period. When the image source is a single image such as a panoramic image, a spatial range of a predetermined size larger than a label image to be subsequently extracted and an adjacent image may be set as the learning section 310 .

본 개시의 일부 실시예에 따르면, 학습 구간(310)으로부터 레이블 이미지(320)가 추출될 수 있다. 일부 예시에서, 레이블 이미지(320)는 학습 구간(310)의 중심 이미지일 수 있다. 일부 다른 예시에서, 레이블 이미지(320)는 소정의 조건을 만족하는 이미지일 수 있다.According to some embodiments of the present disclosure, the label image 320 may be extracted from the learning section 310 . In some examples, the label image 320 may be a central image of the learning section 310 . In some other examples, the label image 320 may be an image satisfying a predetermined condition.

레이블 이미지(320)가 추출된 이후, 본 개시의 일부 실시예에 따르면, 레이블 이미지(320)를 중심으로 적어도 하나의 인접 이미지(330)가 추출될 수 있다. 일부 예시에서, 레이블 이미지(320)로부터 소정의 간격을 가지는 이미지가 인접 이미지(330)로서 선택될 수 있다. 일부 다른 예시에서, 레이블 이미지(320)와 소정 범위 내의 차이점을 가지는 이미지가 인접 이미지(330)로서 선택될 수 있다. After the label image 320 is extracted, according to some embodiments of the present disclosure, at least one adjacent image 330 may be extracted based on the label image 320 . In some examples, an image having a predetermined distance from the label image 320 may be selected as the adjacent image 330 . In some other examples, an image having a difference within a predetermined range from the label image 320 may be selected as the adjacent image 330 .

도 4는 본 개시의 적어도 일부 실시예에 따라, 학습 데이터를 생성하는 예시적인 프로세스(400)를 도시한다. 예컨대, 프로세스(400)는 도 1의 유사 이미지 생성 장치(100)와 같은 컴퓨팅 장치의 제어 하에서 수행될 수 있다. 도 4에 도시된 프로세스(400)는 블록(410, 420, 430, 440 및/또는 450)에 의하여 예시된 바와 같은 하나 이상의 동작, 기능 또는 작용을 포함할 수 있다. 도 4에 예시된 개략적인 동작들은 예시로서만 제공되고, 개시된 실시에의 본질에서 벗어나지 않으면서, 동작들 중 일부가 선택적일 수 있거나, 더 적은 동작으로 조합될 수 있거나, 추가적인 동작으로 확장될 수 있다. 프로세스(400)는 이미지 소스를 획득하는 블록(410)에서 시작할 수 있다. 4 depicts an example process 400 of generating training data, in accordance with at least some embodiments of the present disclosure. For example, the process 400 may be performed under the control of a computing device, such as the similar image generating device 100 of FIG. 1 . The process 400 illustrated in FIG. 4 may include one or more acts, functions, or acts as illustrated by blocks 410 , 420 , 430 , 440 and/or 450 . The schematic operations illustrated in FIG. 4 are provided by way of example only, and some of the operations may be optional, may be combined into fewer operations, or expanded into additional operations without departing from the essence of the disclosed embodiment. have. Process 400 may begin at block 410 of obtaining an image source.

블록(410)에서, 컴퓨팅 장치는 이미지 소스를 획득할 수 있다. 일부 예시에서, 일부 예시에서 컴퓨팅 장치는 미리 정해진 방식에 의해 이미지 소스를 수집할 수 있다. 일부 다른 예시에서, 컴퓨팅 장치는 사용자로부터 이미지 소스를 직접 입력받을 수 있다. 일부 또 다른 예시에서, 컴퓨팅 장치는 외부 컴퓨팅 장치로부터 또는 사용자에 의해 입력된 네트워크의 주소로부터 이미지 소스를 수신할 수 있다. 이미지 소스는 예컨대, 복수의 프레임 이미지를 가지는 비디오 데이터, 360도 이미지와 같은 파노라마 이미지, 360도 비디오 데이터 등을 포함할 수 있다. 프로세스(400)는 블록(410)으로부터 이미지 소스 내 학습 구간을 설정하는 블록(420)으로 이어질 수 있다.At block 410 , the computing device may obtain an image source. In some examples, the computing device may in some instances collect the image source in a predetermined manner. In some other examples, the computing device may directly receive an image source from a user. In some other examples, the computing device may receive the image source from an external computing device or from an address in the network entered by the user. The image source may include, for example, video data having a plurality of frame images, a panoramic image such as a 360 degree image, 360 degree video data, and the like. Process 400 may continue from block 410 to block 420 of establishing a learning interval within the image source.

블록(420)에서, 컴퓨팅 장치는 블록(410)에서 획득된 이미지 소스를 분석하여 이미지 소스 내의 학습 구간을 설정할 수 있다. 컴퓨팅 장치는 이미지 소스 내에서 학습 가능한 구간을 도출하기 위하여, 다양한 분석을 수행할 수 있다. 컴퓨팅 장치는 예컨대, 이미지의 소스의 색상 변화율의 측정, 이미지 복잡도의 측정 및 이미지 소스 내 오브젝트에 대한 구조 유사도 측정 중 적어도 하나를 수행할 수 있다. 일부 예시에서, 컴퓨팅 장치는 이미지 소스 내에서 색상 변화율을 측정할 수 있으며, 측정된 색상 변화율이 미리 정해진 범위 내에 있는 구간을 설정할 수 있다. 일부 다른 예시에서, 컴퓨팅 장치는 이미지 소스 내에서 이미지 복잡도를 측정할 수 있으며, 복잡도가 소정의 범위 내인 구간을 학습 구간으로서 설정할 수 있다. 일부 또 다른 예시에서, 컴퓨팅 장치는 이미지 소스 내에서 구조 유사도를 측정할 수 있다. 이미지 소스 내에는 미리 정의된 장애 요소가 있을 수 있으며, 컴퓨팅 장치는 이러한 장애 요소가 포함되지 않도록 학습 구간으로 설정할 수 있다. 프로세스(400)는 블록(420)에서 레이블 이미지를 추출하는 블록(430)으로 이어질 수 있다.In block 420 , the computing device may analyze the image source obtained in block 410 to set a learning section in the image source. The computing device may perform various analyzes in order to derive a learnable section within the image source. The computing device may, for example, perform at least one of measuring a color change rate of a source of an image, measuring an image complexity, and measuring a structural similarity of an object in the image source. In some examples, the computing device may measure a color change rate in the image source, and set a section in which the measured color change rate is within a predetermined range. In some other examples, the computing device may measure image complexity in the image source, and may set a section in which the complexity is within a predetermined range as the learning section. In some other examples, the computing device may measure structural similarity within the image source. There may be predefined obstacles in the image source, and the computing device may set the learning interval so that these obstacles are not included. Process 400 may continue at block 420 to block 430 where the label image is extracted.

블록(430)에서, 컴퓨팅 장치는 블록(420)에서 설정된 학습 구간으로부터 레이블 이미지를 추출할 수 있다. 컴퓨팅 장치는 이미지 소스에 대하여 설정된 학습 구간 내에서, 이미지 소스로부터 레이블 이미지를 결정할 수 있다. 레이블 이미지는 이미지 소스 내에 포함된 주된 오브젝트를 포함하는 이미지로서, 대표적인 이미지를 지칭할 수 있다. 일부 예시에서, 컴퓨팅 장치는 임의의 방식으로 레이블 이미지를 추출할 수 있다. 일부 다른 예시에서, 컴퓨팅 장치는 학습 구간의 중심에 위치한 중심 이미지를 레이블 이미지로서 추출할 수 있다. 일부 또 다른 예시에서, 컴퓨팅 장치는 학습 구간 내에서 추출되는 복수의 이미지 중에서 소정의 조건을 만족하는 이미지를 레이블 이미지로서 결정할 수 있다. 프로세스(400)는 블록(430)으로부터 적어도 하나의 인접 이미지를 추출하는 블록(440)으로 이어질 수 있다.In block 430 , the computing device may extract a label image from the learning section set in block 420 . The computing device may determine a label image from the image source within a learning interval set for the image source. The label image is an image including a main object included in an image source, and may refer to a representative image. In some examples, the computing device may extract the label image in any manner. In some other examples, the computing device may extract a central image located at the center of the learning section as a label image. In some other examples, the computing device may determine, as a label image, an image satisfying a predetermined condition among a plurality of images extracted within the learning section. Process 400 may continue from block 430 to block 440 where at least one adjacent image is extracted.

블록(440)에서, 컴퓨팅 장치는 블록(420)에서 설정된 학습 구간 및 블록(430)에서 추출된 레이블 이미지에 기초하여 적어도 하나의 인접 이미지를 추출할 수 있다. 일부 예시에서, 컴퓨팅 장치는 레이블 이미지를 학습 구간의 기준 이미지로서 이용하여, 레이블 이미지로부터 소정의 간격이 떨어진 이미지를 인접 이미지로서 추출할 수 있다. 일부 다른 예시에서, 컴퓨팅 장치는 레이블 이미지와의 구조 유사도가 소정의 값(예컨대, 0.9) 이상인 이미지를 인접 이미지로서 추출할 수 있다. 프로세스(400)는 블록(440)으로부터 학습 데이터를 생성하는 블록(450)으로 이어질 수 있다.In block 440 , the computing device may extract at least one adjacent image based on the learning section set in block 420 and the label image extracted in block 430 . In some examples, the computing device may use the label image as a reference image of the learning section to extract an image separated by a predetermined distance from the label image as an adjacent image. In some other examples, the computing device may extract an image having a structural similarity with the label image of a predetermined value (eg, 0.9) or more as an adjacent image. Process 400 may continue from block 440 to block 450 generating training data.

블록(450)에서, 컴퓨팅 장치는 블록(430)에서 추출된 레이블 이미지 및 블록(440)에서 추출된 적어도 하나의 인접 이미지에 기초하여, 유사 이미지 생성을 수행하는 머신러닝 모델의 학습 데이터를 생성할 수 있다. 일부 예시에서, 머신러닝 모델은 컴퓨팅 장치에 저장될 수 있다. 머신러닝 모델은 예컨대, GAN, Star GAN 등을 포함하는 다양한 유형의 인공지능 네트워크를 포함할 있다. 머신러닝 모델은 생성된 학습 데이터를 이용하여, 학습을 수행할 수 있다. 일부 예시에서, 머신러닝 모델, 즉, 컴퓨팅 장치는 적어도 하나의 인접 이미지를 유사 이미지의 생성에 이용하고, 레이블 이미지를 생성된 유사 이미지의 판별에 사용되는 실제 이미지(real image)로서 이용할 수 있다.In block 450 , the computing device generates training data of a machine learning model that performs similar image generation based on the label image extracted in block 430 and the at least one adjacent image extracted in block 440 . can In some examples, the machine learning model may be stored on a computing device. The machine learning model may include various types of artificial intelligence networks including, for example, GANs, Star GANs, and the like. The machine learning model may perform learning using the generated training data. In some examples, the machine learning model, ie, the computing device, may use at least one adjacent image to generate a similar image, and use the label image as a real image used to determine the generated similar image.

도 5는 본 개시의 적어도 일부 실시예에 따라, 개시의 적어도 일부 실시예에 따라, 유사 이미지 생성에 관한 머신러닝 모델의 학습 데이터를 생성하는 데 이용될 수 있는 컴퓨터 프로그램 제품(500)을 도시한다. 예시적인 컴퓨터 프로그램 제품의 예시적인 실시예는 신호 베어링 매체(510)를 이용하여 제공된다. 일부 실시예에서, 하나 이상의 컴퓨터 프로그램 제품(500)의 신호 베어링 매체(510)는 컴퓨터 판독가능 매체(530) 및/또는 기록 가능 매체(540)를 포함할 수 있다.5 illustrates a computer program product 500 that may be used to generate training data of a machine learning model for generating similar images, in accordance with at least some embodiments of the disclosure, and in accordance with at least some embodiments of the disclosure. . An exemplary embodiment of an exemplary computer program product is provided using a signal bearing medium 510 . In some embodiments, the signal bearing medium 510 of the one or more computer program products 500 may include a computer readable medium 530 and/or a recordable medium 540 .

신호 베어링 매체(510)에 포함된 명령어(520)는 유사 이미지 생성 장치(100)와 같은 컴퓨팅 장치에 의해 실행될 수 있다. 따라서, 컴퓨팅 장치는 머신러닝 기반의 이미지 생성을 수행하는 머신러닝 모델의 학습 데이터를 생성하는 명령어(520)를 포함할 수 있다. 명령어(520)가 실행되면, 이미지 소스를 획득하기 위한 하나 이상의 명령어; 이미지 소스를 분석하여 이미지 소스 내의 학습 구간을 설정하기 위한 하나 이상의 명령어; 설정된 학습 구간으로부터 레이블 이미지를 추출하기 위한 하나 이상의 명령어; 설정된 학습 구간 및 레이블 이미지에 기초하여, 적어도 하나의 인접 이미지를 추출하기 위한 하나 이상의 명령어; 또는 레이블 이미지 및 적어도 하나의 인접 이미지에 기초하여, 머신러닝 모델에 입력하기 위한 학습 데이터를 생성하기 위한 하나 이상의 명령어 중 적어도 하나를 포함한다.The instructions 520 contained in the signal bearing medium 510 may be executed by a computing device, such as the similar image generating device 100 . Accordingly, the computing device may include the instruction 520 for generating training data of a machine learning model that performs machine learning-based image generation. When instructions 520 are executed, one or more instructions for obtaining an image source; one or more instructions for analyzing the image source to set a learning interval in the image source; one or more commands for extracting a label image from the set learning section; one or more instructions for extracting at least one adjacent image based on the set learning section and the label image; or one or more instructions for generating, based on the label image and the at least one adjacent image, training data for input into the machine learning model.

도 6은 본 개시의 적어도 일부 실시예에 따라 배열되는 컴퓨팅 장치(600)의 예시적인 실시예의 블록도이다. 일 예시적인 기본적인 구성(602)에서, 컴퓨팅 장치(600)는 하나 이상의 프로세서(604) 및 시스템 메모리(606)를 포함할 수 있다. 메모리 버스(608)는 프로세서(604) 및 시스템 메모리(606) 사이에서 통신하기 위하여 이용될 수 있다.6 is a block diagram of an example embodiment of a computing device 600 arranged in accordance with at least some embodiments of the present disclosure. In one exemplary basic configuration 602 , computing device 600 may include one or more processors 604 and system memory 606 . A memory bus 608 may be used to communicate between the processor 604 and the system memory 606 .

요구되는 구성에 따라, 프로세서(604)는 마이크로 프로세서(μP), 마이크로 컨트롤러(μC), 디지털 신호 프로세서(DSP), 또는 그들의 임의의 조합을 포함하는 임의의 유형일 수 있으나, 이에 제한되지는 않는다. 프로세서(604)는 레벨 1 캐시(610), 레벨 2 캐시(612)와 같은 하나 이상의 레벨(level)의 캐시(cache), 프로세서 코어(614), 및 레지스터(616)를 포함할 수 있다. 프로세서 코어(614)는 산술 논리 연산장치(arithmetic logic unit; ALU), 부동 소수점 장치(floating point unit; FPU), 디지털 신호 처리 코어(DSP Core), 또는 그들의 임의의 조합을 포함할 수 있다. 메모리 컨트롤러(618)는 또한 프로세서(604)와 함께 사용될 수 있거나, 또는 일부 구현예에서 메모리 컨트롤러(618)는 프로세서(604)의 내적인 일부일 수 있다.Depending on the configuration required, the processor 604 may be of any type including, but not limited to, a microprocessor (μP), a microcontroller (μC), a digital signal processor (DSP), or any combination thereof. The processor 604 may include one or more levels of cache, such as a level 1 cache 610 , a level 2 cache 612 , a processor core 614 , and registers 616 . The processor core 614 may include an arithmetic logic unit (ALU), a floating point unit (FPU), a digital signal processing core (DSP Core), or any combination thereof. Memory controller 618 may also be used with processor 604 , or in some implementations memory controller 618 may be an internal part of processor 604 .

요구되는 구성에 따라, 시스템 메모리(606)는 (RAM과 같은) 휘발성 메모리, (ROM, 플래시 메모리 등과 같은) 비휘발성 메모리, 또는 그들의 임의의 조합을 포함하는 임의의 유형일 수 있으나, 이에 제한되지는 않는다. 시스템 메모리(606)는 운영 체제(620), 하나 이상의 어플리케이션(application)(622), 및 프로그램 데이터(624)를 포함할 수 있다. 어플리케이션(622)은 도 1의 유사 이미지 생성 장치(100) 및/또는 도 4의 프로세스(400)에 관하여 기술된 기능 블록 및/또는 작용을 포함하여 여기에서 기술된 바와 같은 기능을 수행하도록 배열된 알고리즘(626)을 포함할 수 있다. 프로그램 데이터(624)는, 알고리즘 (626)과 함께 이용하기 위한 데이터(628), 예컨대, 정적 네트워크 환경 등에 해당하는 데이터를 포함할 수 있다. 일부 실시예에서, 어플리케이션(622)은 최적의 전송 환경을 결정하기 위한 구현이 여기에서 설명된 바와 같이 제공될 수 있도록 운영 체제(620) 상에서 프로그램 데이터(624)로 동작하도록 배열될 수 있다. 예컨대, 유사 이미지 생성 장치(100)은 컴퓨팅 장치(600)의 전부 또는 일부를 포함할 수 있으며 최적의 전송 환경을 결정하는 구현이 여기에서 설명된 바와 같이 제공될 수 있도록 어플리케이션(622)의 전부 또는 일부를 수행하는 것이 가능할 수 있다. 이러한 기술된 기본적인 구성은 도 6에서 파선(602) 내의 그 구성요소들에 의해 도시된다.Depending on the configuration required, system memory 606 can be of any type including, but not limited to, volatile memory (such as RAM), non-volatile memory (such as ROM, flash memory, etc.), or any combination thereof. does not System memory 606 may include an operating system 620 , one or more applications 622 , and program data 624 . Application 622 is arranged to perform functions as described herein, including functional blocks and/or actions described with respect to process 400 of FIG. 4 and/or apparatus 100 for generating similar images of FIG. 1 . algorithm 626 . Program data 624 may include data 628 for use with algorithm 626 , eg, data corresponding to a static network environment, or the like. In some embodiments, application 622 may be arranged to operate with program data 624 on operating system 620 such that an implementation for determining an optimal transfer environment may be provided as described herein. For example, the similar image generating device 100 may include all or part of the computing device 600 and may include all or part of the application 622 , such that an implementation determining an optimal transmission environment may be provided as described herein. It may be possible to do some. This described basic configuration is illustrated by its components within dashed line 602 in FIG. 6 .

컴퓨팅 장치(600)는 기본적인 구성(602) 및 임의의 요구되는 장치 및 인터페이스(interface) 사이의 통신을 용이하게 하도록 추가적인 특징 또는 기능, 및 추가적인 인터페이스를 가질 수 있다. 예를 들어, 버스/인터페이스 컨트롤러(630)는 저장부 인터페이스 버스(634)를 통한 기본적인 구성(602) 및 하나 이상의 데이터 저장 장치(632) 사이의 통신을 용이하게 하도록 사용될 수 있다. 데이터 저장 장치(632)는 이동식 저장 장치(636), 고정식 저장 장치(638), 또는 그 조합일 수 있다. 이동식 저장 장치 및 고정식 저장 장치의 예를 몇 가지 들자면, 플렉서블 디스크 드라이브(flexible disk drive) 및 하드 디스크 드라이브(HDD)와 같은 자기 디스크 장치, 컴팩트 디스크(CD) 드라이브 또는 디지털 다목적 디스크(DVD) 드라이브와 같은 광 디스크 드라이브, 고체 상태 드라이브(SSD), 및 테이프 드라이브 등을 포함한다. 예시적인 컴퓨터 저장 매체는 컴퓨터 판독가능 명령, 데이터 구조, 프로그램 모듈(program module), 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술에서 구현되는 휘발성 및 비휘발성의 이동식 및 고정식 매체를 포함할 수 있다.Computing device 600 may have additional features or functionality, and additional interfaces, to facilitate communication between basic configuration 602 and any desired devices and interfaces. For example, bus/interface controller 630 may be used to facilitate communication between basic configuration 602 and one or more data storage devices 632 over storage interface bus 634 . The data storage device 632 may be a removable storage device 636 , a fixed storage device 638 , or a combination thereof. Magnetic disk devices such as flexible disk drives and hard disk drives (HDDs), compact disk (CD) drives, or digital versatile disk (DVD) drives, to name a few examples of removable and fixed storage devices. such as optical disk drives, solid state drives (SSDs), and tape drives. Exemplary computer storage media includes volatile and nonvolatile removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules, or other data. can do.

시스템 메모리(606), 이동식 저장 장치(636) 및 고정식 저장 장치(638)는 모두 컴퓨터 저장 매체의 예이다. 컴퓨터 저장 매체는 RAM, ROM, EEPROM, 플래시 메모리 또는 기타 메모리 기술, CD-ROM, 디지털 다목적 디스크(DVD) 또는 기타 광 저장 장치, 자기 카세트, 자기 테이프, 자기 디스크 저장 장치 또는 기타 자기 저장 장치, 또는 요구되는 정보를 저장하도록 사용될 수 있고, 컴퓨팅 장치(700)에 의해 액세스될 수 있는 임의의 기타 매체를 포함하나, 이에 제한되지는 않는다. 임의의 그러한 컴퓨터 저장 매체는 컴퓨팅 장치(600)의 일부일 수 있다.System memory 606 , removable storage 636 , and non-removable storage 638 are all examples of computer storage media. Computer storage media may include RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical storage device, magnetic cassette, magnetic tape, magnetic disk storage device or other magnetic storage device, or including, but not limited to, any other medium that may be used to store the required information and that may be accessed by the computing device 700 . Any such computer storage media may be part of computing device 600 .

컴퓨팅 장치(600)는 또한 버스/인터페이스 컨트롤러(642)를 통한 다양한 인터페이스 장치(예컨대, 출력 인터페이스, 주변 인터페이스, 및 통신 인터페이스)로부터 기본적인 구성(602)으로의 통신을 용이하게 하기 위한 인터페이스 버스(642)를 포함할 수 있다. 예시적인 출력 장치(642)는 그래픽 처리 유닛(648) 및 오디오 처리 유닛(650)을 포함할 수 있으며, 이는 하나 이상의 A/V 포트(652)를 통하여 디스플레이 또는 스피커와 같은 다양한 외부 장치로 통신하도록 구성될 수 있다. 또한, 도 6에서 그래픽 처리 유닛(648)은 출력 장치의 컴포넌트로서 도시되었지만, 프로세서(604)에 더하여, 일부 구현예에 따른 연산을 수행하는 것이 가능하다. 본 개시에 따른 일부 실시예에 따르면, 그래픽 처리 유닛(648)은 예컨대, 신경망의 학습, 신경망에 따른 처리 등과 같은 다양한 신경망 관련 연산을 수행하도록 구성될 수 있다. 예시적인 주변 인터페이스(644)는 직렬 인터페이스 컨트롤러(654) 또는 병렬 인터페이스 컨트롤러(656)를 포함할 수 있으며, 이는 하나 이상의 I/O 포트(658)를 통하여 입력 장치(예컨대, 키보드, 마우스, 펜, 음성 입력 장치, 터치 입력 장치 등) 또는 기타 주변 장치(예컨대, 프린터, 스캐너 등)와 같은 외부 장치와 통신하도록 구성될 수 있다. 예시적인 통신 장치(646)는 네트워크 컨트롤러(660)를 포함하며, 이는 하나 이상의 통신 포트(664)를 통한 네트워크 통신 상에서의 하나 이상의 다른 컴퓨팅 장치(762)와의 통신을 용이하게 하도록 배열될 수 있다.Computing device 600 also provides an interface bus 642 for facilitating communication from various interface devices (eg, output interfaces, peripheral interfaces, and communication interfaces) to basic configuration 602 via bus/interface controller 642 . ) may be included. Exemplary output device 642 may include a graphics processing unit 648 and an audio processing unit 650 , which may be configured to communicate via one or more A/V ports 652 to various external devices, such as displays or speakers. can be configured. Also, although graphics processing unit 648 is shown as a component of an output device in FIG. 6 , in addition to processor 604 , it is possible to perform operations according to some implementations. According to some embodiments according to the present disclosure, the graphic processing unit 648 may be configured to perform various neural network-related operations, such as, for example, learning of a neural network, processing according to the neural network, and the like. Exemplary peripheral interface 644 may include a serial interface controller 654 or a parallel interface controller 656 , which may include input devices (eg, keyboard, mouse, pen, It may be configured to communicate with an external device, such as a voice input device, a touch input device, etc.) or other peripheral device (eg, a printer, scanner, etc.). The exemplary communication device 646 includes a network controller 660 , which may be arranged to facilitate communication with one or more other computing devices 762 over network communications via one or more communication ports 664 .

컴퓨팅 장치(600)는 또한 랩탑 컴퓨터나 랩탑이 아닌 컴퓨터 구성 둘 다를 포함하는 개인 컴퓨터로 구현될 수 있다. 또한, 컴퓨팅 장치(600)는 무선 기지국 또는 기타 무선 시스템 또는 장치의 일부로서 구현될 수 있다.Computing device 600 may also be implemented as a personal computer including both laptop computer and non-laptop computer configurations. Additionally, computing device 600 may be implemented as part of a wireless base station or other wireless system or device.

전술한 본원의 설명은 예시를 위한 것이며, 본원이 속하는 기술분야의 통상의 지식을 가진 자는 본원의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로, 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수도 있다.The foregoing description of the present application is for illustration, and those of ordinary skill in the art to which the present application pertains will understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present application. Therefore, it should be understood that the embodiments described above are illustrative in all respects and not restrictive. For example, each component described as a single type may be implemented in a distributed manner, and likewise components described as distributed may also be implemented in a combined form.

이상, 본 개시에서 청구하고자 하는 대상에 대해 구체적으로 살펴 보았다. 본 개시에서 청구된 대상은 앞서 기술한 특정 구현예로 그 범위가 제한되지 않는다. 예컨대, 어떤 구현예에서는 장치 또는 장치의 조합 상에서 동작 가능하게 사용되는 하드웨어의 형태일 수 있으며, 다른 구현예에서는 소프트웨어 및/또는 펌웨어의 형태로 구현될 수 있고, 또 다른 구현예에서는 신호 베어링 매체, 저장 매체와 같은 하나 이상의 물품을 포함할 수 있다. 여기서, CD-ROM, 컴퓨터 디스크, 플래시 메모리 등과 같은 저장 매체는, 예컨대 컴퓨팅 시스템, 컴퓨팅 플랫폼 또는 기타 시스템과 같은 컴퓨팅 장치에 의하여 실행될 때 앞서 설명한 구현예에 따라 해당 프로세서의 실행을 야기시킬 수 있는 명령을 저장할 수 있다. 이러한 컴퓨팅 장치는 하나 이상의 처리 유닛 또는 프로세서, 디스플레이, 키보드 및/또는 마우스와 같은 하나 이상의 입/출력 장치, 및 정적 랜덤 액세스 메모리, 동적 랜덤 액세스 메모리, 플래시 메모리 및/또는 하드 드라이브와 같은 하나 이상의 메모리를 포함할 수 있다.Above, the object to be claimed in the present disclosure has been described in detail. The claimed subject matter in this disclosure is not limited in scope to the specific implementations described above. For example, in some implementations it may be in the form of hardware used operatively on a device or combination of devices, in other implementations it may be implemented in the form of software and/or firmware, and in still other implementations it may be in the form of a signal bearing medium; may include one or more articles, such as storage media. Herein, a storage medium such as a CD-ROM, a computer disk, a flash memory, etc. is an instruction that, when executed by a computing device such as a computing system, a computing platform, or other systems, may cause the processor to execute according to the embodiments described above. can be saved. Such computing devices may include one or more processing units or processors, one or more input/output devices such as a display, keyboard and/or mouse, and one or more memory such as static random access memory, dynamic random access memory, flash memory and/or hard drives. may include

전술한 상세한 설명에서는 블록도, 흐름도 및/또는 기타 예시를 통해 장치 및/또는 프로세스의 다양한 실시예를 설명하였다. 그러한 블록도, 흐름도, 및/또는 기타 예시는 하나 이상의 기능 및/또는 동작을 포함하게 되며, 당업자라면 블록도, 흐름도 및/또는 기타 예시 내의 각각의 기능 및/또는 동작이 하드웨어, 소프트웨어, 펌웨어, 또는 이들의 임의의 조합에 의해 개별적으로 혹은 집합적으로 구현될 수 있다는 점을 이해할 수 있을 것이다. 일 실시예에서, 본 개시에 기재된 대상의 몇몇 부분은 ASIC(Application Specific Integrated Circuit), FPGA(Field Programmable Gate Array), DSP(Digital Signal Processor) 또는 다른 집적의 형태를 통해 구현될 수 있다. 이와 달리, 본 개시의 실시예의 일부 양상은 하나 이상의 컴퓨터 상에 실행되는 하나 이상의 컴퓨터 프로그램(예를 들어, 하나 이상의 컴퓨터 시스템 상에 실행되는 하나 이상의 프로그램), 하나 이상의 프로세서 상에서 실행되는 하나 이상의 프로그램(예를 들어, 하나 이상의 마이크로프로세서 상에서 실행되는 하나 이상의 프로그램), 펌웨어 또는 이들의 실질적으로 임의의 조합으로써 전체적으로 또는 부분적으로 균등하게 집적 회로에서 구현될 수도 있으며, 소프트웨어 및/또는 펌웨어를 위한 코드의 작성 및/또는 회로의 설계는 본 개시에 비추어 당업자의 기술 범위 내에 속하는 것이다. 또한, 당업자라면, 본 개시의 대상의 매커니즘들이 다양한 형태의 프로그램 제품으로 분배될 수 있음을 이해할 것이며, 본 개시의 대상의 예시는 분배를 실제로 수행하는데 사용되는 신호 베어링 매체의 특정 유형과 무관하게 적용됨을 이해할 것이다. In the foregoing detailed description, various embodiments of apparatus and/or processes have been described by way of block diagrams, flow diagrams, and/or other examples. Such block diagrams, flow diagrams, and/or other examples will include one or more functions and/or operations, and those skilled in the art will recognize that each function and/or operation within the block diagrams, flow diagrams, and/or other examples may be implemented in hardware, software, firmware, Or it will be understood that they may be implemented individually or collectively by any combination thereof. In one embodiment, some portions of the subject matter described in the present disclosure may be implemented through an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), or other form of integration. In contrast, some aspects of embodiments of the present disclosure include one or more computer programs running on one or more computers (eg, one or more programs running on one or more computer systems), one or more programs running on one or more processors ( writing code for software and/or firmware, which may be equally implemented in whole or in part as, for example, one or more programs running on one or more microprocessors), firmware, or substantially any combination thereof; and/or the design of the circuit is within the skill of those skilled in the art in light of the present disclosure. In addition, those skilled in the art will understand that the mechanisms of the subject matter of this disclosure may be distributed in various forms of program products, and the examples of the subject matter of this disclosure apply irrespective of the particular type of signal bearing medium used to actually perform the distribution. will understand

특정 예시적 기법이 다양한 방법 및 시스템을 이용하여 여기에서 기술되고 도시되었으나, 당업자라면, 청구된 대상에서 벗어남이 없이, 다양한 기타의 수정 또는 등가물로의 치환 가능성을 이해할 수 있다. 추가적으로, 여기에 기술된 중심 개념으로부터 벗어남이 없이 특정 상황을 청구된 대상의 교시로 적응시키도록 많은 수정이 이루어질 수 있다. 따라서, 청구된 대상이 개시된 특정 예시로 제한되지 않으나, 그러한 청구된 대상은 또한 첨부된 청구범위 및 그 균등의 범위 내에 들어가는 모든 구현예를 포함할 수 있음이 의도된다.While specific exemplary techniques have been described and illustrated herein using various methods and systems, those skilled in the art will understand the possibility of various other modifications or equivalents permutations without departing from the claimed subject matter. Additionally, many modifications may be made to adapt a particular situation to the teachings of the claimed subject matter without departing from the central concepts described herein. Accordingly, it is intended that claimed subject matter not be limited to the particular examples disclosed, but that such claimed subject matter may also include all embodiments falling within the scope of the appended claims and their equivalents.

본 개시의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위, 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본원의 범위에 포함되는 것으로 해석되어야 한다.The scope of the present disclosure is indicated by the following claims rather than the above detailed description, and all changes or modifications derived from the meaning and scope of the claims and their equivalent concepts should be construed as being included in the scope of the present application. do.

Claims

A method for generating training data that is performed under the control of a computing device and generates training data of a machine learning model that performs machine learning-based similar image generation, comprising:
acquiring an image source;
analyzing the image source to set a learning section in the image source;
extracting a label image from the set learning section;
extracting at least one adjacent image based on the set learning section and the label image; and
generating training data for input to the machine learning model based on the label image and the at least one adjacent image;
A method of generating training data comprising a.

According to claim 1,
Wherein the machine learning model includes a generative adversarial network (Generative Adversarial Network).

According to claim 1,
The machine learning model uses the at least one adjacent image to generate a similar image, and uses the label image as a real image used to determine the generated similar image.

According to claim 1,
The step of setting the learning section is
a measuring step of performing at least one of measuring a color change rate of the image source, measuring image complexity, and measuring a structural similarity of an object in the image source; and
Including the step of setting a section in the image source so that a result of the measurement step satisfies a predetermined condition, the learning data generating method.

According to claim 1,
The step of extracting the label image is
The method of generating learning data, comprising extracting a central image as the label image within the learning section in the image source.

According to claim 1,
The step of extracting the at least one neighboring image includes extracting the corresponding neighboring image from the learning section so that a structural similarity between each of the at least one neighboring image and the label image is within a predetermined range. Way.

A similar image generating device comprising:
an image source management unit for acquiring an image source;
a learning section setting unit that analyzes the image source and sets a learning section in the image source;
a label image extraction unit for extracting a label image from the set learning section;
an adjacent image extractor configured to extract at least one adjacent image based on the set learning section and the label image; and
A similarity comprising a machine learning model for generating a machine learning-based image, generating a similar image using the machine learning model, and learning the machine learning model using the label image and the at least one adjacent image image generator
A similar image generating device comprising a.

8. The method of claim 7,
The learning section setting unit is configured to perform at least one of measuring a color change rate of the image source, measuring image complexity, and measuring a structural similarity of an object in the image source.

8. The method of claim 7,
and the adjacent image extracting unit extracts the adjacent image from the learning section so that a structural similarity between each of the at least one adjacent image and the label image is within a predetermined range.

8. The method of claim 7,
A metadata extraction unit for extracting metadata from the image source, the label image, and the at least one adjacent image
further comprising,
The similar image generating unit further uses the metadata to train a machine learning model.

A computer-readable storage medium storing a training data generation program that generates training data of a machine learning model that performs machine learning-based image generation, wherein the training data generation program is executed by a computing device,
acquiring an image source;
analyzing the image source and setting a learning section in the image source;
extracting a label image from the set learning section;
extracting at least one adjacent image based on the set learning section and the label image; and
generating training data for input to the machine learning model based on the label image and the at least one adjacent image
A computer-readable storage medium comprising one or more instructions to cause