KR20240030176A

KR20240030176A - Gan-based artificial intelligence high-quality video compression system

Info

Publication number: KR20240030176A
Application number: KR1020220108870A
Authority: KR
Inventors: 정소원
Original assignee: 주식회사 사이버네틱스이미징시스템즈
Priority date: 2022-08-30
Filing date: 2022-08-30
Publication date: 2024-03-07
Also published as: KR102645760B1

Abstract

GAN 기반 인공기능 고품질 이미지 압축 시스템이 개시된다. 본 발명의 일실시례에 따른 GAN 기반 인공기능 고품질 이미지 압축 시스템은 입력된 원본 이미지를 기설정된 형식의 데이터로 압축하는 데이터압축부; 상기 데이터압축부에서 압축된 데이터를 저장하는 데이터저장부; 상기 데이터저장부에 저장된 상기 데이터를 호출하고, 고해상도의 복원 이미지를 생성하는 데이터복원부; 및 상기 데이터압축부 및 상기 데이터복원부 중 적어도 어느 하나를 학습시키는 데이터학습부;를 포함한다.A GAN-based artificial function high-quality image compression system is disclosed. A GAN-based artificial function high-quality image compression system according to an embodiment of the present invention includes a data compression unit that compresses the input original image into data in a preset format; a data storage unit that stores data compressed in the data compression unit; a data restoration unit that retrieves the data stored in the data storage unit and generates a high-resolution restored image; and a data learning unit that trains at least one of the data compression unit and the data restoration unit.

Description

GAN-based artificial intelligence high-quality image compression system {GAN-BASED ARTIFICIAL INTELLIGENCE HIGH-QUALITY VIDEO COMPRESSION SYSTEM}

본 발명은 GAN 네트워크를 이용한 이미지 압축 시스템에 관한 것으로, 보다 상세하게는 원본 이미지를 잠재 공간에서의 벡터(Latent Vector)로 변환하여 저장한 후, 변환 이미지를 GAN의 생성자 네트워크에 입력하여 원본 이미지를 반환하는 시스템에 관한 것이다.The present invention relates to an image compression system using a GAN network. More specifically, the original image is converted into a vector (Latent Vector) in the latent space and stored, and then the converted image is input into the GAN's generator network to convert the original image. It's about the return system.

인공지능 시스템은 인간 수준의 지능을 구현하는 컴퓨터 시스템이며, 기존 규칙 기반 스마트 시스템과 달리 기계가 스스로 학습하고 판단하며 똑똑해지는 시스템이다. 인공지능 시스템은 사용할수록 인식률이 향상되고, 사용자 취향을 보다 정확하게 이해할 수 있게 되어, 기존 규칙 기반 스마트 시스템은 점차 딥러닝 시스템으로 대체되고 있다. An artificial intelligence system is a computer system that implements human-level intelligence, and unlike existing rule-based smart systems, it is a system in which machines learn, make decisions, and become smarter on their own. As artificial intelligence systems are used, the recognition rate improves and users' preferences can be more accurately understood, and existing rule-based smart systems are gradually being replaced by deep learning systems.

인공지능 기술은 기계학습(딥러닝) 및 기계학습을 이용한 요소 기술들로 구성된다. 기계학습은 입력 데이터들의 특징을 스스로 분류/학습하는 알고리즘 기술이며, 요소 기술은 딥러닝 등의 기계학습 알고리즘을 이용하여 인간두뇌의 인지, 판단 등의 기능을 모사하는 기술로서, 언어적 이해, 시각적 이해, 추론/예측, 지식 표현, 동작 제어 등의 기술 분야로 구성된다.Artificial intelligence technology consists of machine learning (deep learning) and element technologies using machine learning. Machine learning is an algorithmic technology that classifies/learns the characteristics of input data on its own, and elemental technology is a technology that uses machine learning algorithms such as deep learning to mimic functions such as recognition and judgment of the human brain, including linguistic understanding and visual It consists of technical areas such as understanding, reasoning/prediction, knowledge expression, and motion control.

인공지능 기술이 응용되는 다양한 분야는 다음과 같다. 언어적 이해는 인간의 언어/문자를 인지하고 응용/처리하는 기술로서, 자연어 처리, 기계 번역, 대화 시스템, 질의 응답, 음성 인식/합성 등을 포함한다. 시각적 이해는 사물을 인간의 시각처럼 인식하여 처리하는 기술로서, 객체 인식, 객체 추적, 영상 검색, 사람 인식, 장면 이해, 공간 이해, 영상 개선 등을 포함한다. 추론/예측은 정보를 판단하여 논리적으로 추론하고 예측하는 기술로서, 지식/확률 기반 추론, 최적화 예측, 선호 기반 계획, 추천 등을 포함한다. 지식 표현은 인간의 경험정보를 지식데이터로 자동화 처리하는 기술로서, 지식 구축(데이터 생성/분류), 지식 관리(데이터 이용) 등을 포함한다. 동작 제어는 차량의 자율 주행, 로봇의 움직임을 제어하는 기술로서, 움직임 제어(항법, 충돌, 주행), 조작 제어(행동 제어) 등을 포함한다.The various fields where artificial intelligence technology is applied are as follows. Linguistic understanding is a technology that recognizes and applies/processes human language/characters and includes natural language processing, machine translation, conversation systems, question and answer, and voice recognition/synthesis. Visual understanding is a technology that recognizes and processes objects like human vision, and includes object recognition, object tracking, image search, person recognition, scene understanding, spatial understanding, and image improvement. Inference/prediction is a technology that judges information to make logical inferences and predictions, and includes knowledge/probability-based reasoning, optimization prediction, preference-based planning, and recommendations. Knowledge expression is a technology that automatically processes human experience information into knowledge data, and includes knowledge construction (data creation/classification) and knowledge management (data use). Motion control is a technology that controls the autonomous driving of vehicles and the movement of robots, and includes motion control (navigation, collision, driving), operation control (behavior control), etc.

한편, 멀티 미디어 기술 및 컴퓨터 기술이 발전함에 따라, 딥러닝 기법을 이용한 이미지 생성 기술이 개발되고 있으나, 학습 속도가 현저히 느리고 연산 복잡도가 높으며, 생성된 이미지의 해상도가 낮다는 문제점이 있다.Meanwhile, as multimedia technology and computer technology develop, image generation technology using deep learning techniques is being developed, but there are problems in that the learning speed is significantly slow, computational complexity is high, and the resolution of the generated image is low.

이와 관련하여, 종래기술인 한국공개특허공보 제10-2018-0004989호는 신경망회로와 병렬처리 프로세서를 결합하여, 기존의 이미지를 분석하고, 대체 이미지로 변환하는 과정을 신속하게 수행할 수 있는 딥러닝 기반의 이미지 처리 기술 및 그 방법에 대하여 개시하고 있고, 한국등록특허공보 제10-2132690호는 얼굴 인식 패턴 기술을 적용하여 가장 복원 성능이 좋은 고해상도 영상 복원 알고리즘과 얼굴 인식 패턴 기술을 선택하여 초고해상도 영상을 복원하고, 모니터링 시스템에 적용 시 고해상도 복원 영상이 원본 입력 영상과 유사하며, 고해상도의 얼굴 인식 정확도가 원본 영상의 얼굴 인식 정확도에 더 가깝게 되는 초고해상도 영상 복원 시스템에 대하여 개시하고 있다. In this regard, the prior art, Korean Patent Publication No. 10-2018-0004989, is a deep learning technology that combines a neural network circuit and a parallel processing processor to quickly perform the process of analyzing existing images and converting them into replacement images. Korea Patent Publication No. 10-2132690 discloses a high-resolution image restoration algorithm with the best restoration performance by applying face recognition pattern technology and a face recognition pattern technology to select ultra-high resolution image processing technology and method. We are disclosing a super-resolution image restoration system in which, when restoring an image and applying it to a monitoring system, the high-resolution restored image is similar to the original input image, and the high-resolution face recognition accuracy is closer to that of the original image.

그러나, 종래의 고해상도 이미지 복원 시스템은 연산 처리 과정이 복잡하고, 딥 러닝 기술을 적용하더라도 학습 데이터로 사용되는 데이터의 양이 방대하여 많은 양의 메모리가 필요하다는 단점이 있다. However, the conventional high-resolution image restoration system has a disadvantage in that the computational processing process is complicated, and even when deep learning technology is applied, the amount of data used as learning data is large and requires a large amount of memory.

따라서, 이미지 데이터를 압축시켜 저장하고, 사용자의 요청에 따라 압축된 데이터를 복원하더라도 원본 데이터와 거의 유사한 수준의 복원 데이터를 생성할 수 있는 GAN 기반 인공지능 고품질 이미지 압축 시스템에 관한 연구가 필요하다.Therefore, research is needed on a GAN-based artificial intelligence high-quality image compression system that can compress and store image data and generate restored data at a level almost similar to the original data even if the compressed data is restored at the user's request.

본 발명은 GAN 기반 인공지능 시스템을 적용하여 이미지 데이터의 특징 정보를 추출하여 압축하고, 추출된 특징 정보에 기초하여 압축된 이미지를 복원함으로써, 데이터의 처리, 저장 및 관리가 효율적으로 수행 가능한 GAN 기반 인공지능 고품질 이미지 압축 시스템을 제공하는 데 그 목적이 있다.The present invention applies a GAN-based artificial intelligence system to extract and compress feature information of image data, and restores the compressed image based on the extracted feature information, thereby enabling efficient processing, storage, and management of data. The purpose is to provide an artificial intelligence high-quality image compression system.

또한, 생성자 및 판별자로 구성된 GAN 알고리즘을 적용함으로써, 두 개의 네트워크가 경쟁하며 학습하여 원본 데이터의 복잡하고 다양한 특징을 유연하게 학습할 수 있어 원본 데이터에 근사한 복원 데이터를 생성할 수 있는 GAN 기반 인공지능 고품질 이미지 압축 시스템을 제공하는 데 그 목적이 있다. In addition, by applying the GAN algorithm consisting of a generator and a discriminator, two networks compete and learn to flexibly learn complex and diverse features of the original data, which is a GAN-based artificial intelligence that can generate restored data that approximates the original data. The purpose is to provide a high-quality image compression system.

본 발명이 해결하고자 하는 과제들은 이상에서 언급한 과제로 제한되지 않으며, 여기에 언급되지 않은 본 발명이 해결하려는 또 다른 과제들은 아래의 기재로부터 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에게 명확하게 이해될 수 있을 것이다.The problems to be solved by the present invention are not limited to the problems mentioned above, and other problems to be solved by the present invention that are not mentioned herein can be explained to those skilled in the art from the description below. You will be able to understand it clearly.

본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 시스템은 입력된 원본 이미지를 기설정된 형식의 데이터로 압축하는 데이터압축부, 데이터압축부에서 압축된 데이터를 저장하는 데이터저장부, 데이터저장부에 저장된 데이터를 호출하고, 고해상도의 복원 이미지를 생성하는 데이터복원부 및 데이터압축부 및 데이터복원부 중 적어도 어느 하나를 학습시키는 데이터학습부를 포함한다.The GAN-based artificial intelligence high-quality image compression system according to an embodiment of the present invention includes a data compression unit that compresses the input original image into data in a preset format, a data storage unit that stores the data compressed in the data compression unit, and a data storage unit. It includes a data restoration unit that calls the data stored in the unit and generates a high-resolution restored image, and a data learning unit that trains at least one of the data compression unit and the data restoration unit.

또한, 데이터학습부는, 원본 이미지를 피쳐맵 데이터로 압축하는 인코더 및 피쳐맵 데이터로부터 고해상도의 복원 이미지를 생성하는 디코더를 포함하는 생성자 및 생성자에서 생성된 복원 이미지와 원본 이미지를 비교하여 복원 이미지를 구별하는 판별자를 포함하되, 인코더는, 원본 이미지에 포함된 적어도 하나의 특징 정보에 대응하는 확률 변수를 추출하고, 확률 변수의 조합으로 구성되는 피쳐맵 데이터를 생성하며, 디코더는, 인코더를 통해 생성된 상기 피쳐맵 데이터에 기설정된 네트워크 레이어를 적용하여 복원 이미지를 생성하되, 네트워크 레이어는 기설정된 크기의 행렬로 구성된 필터를 포함하고, 필터에 의해 기설정된 다차원 공간 내 적어도 어느 한 위치에 대응하는 피쳐맵 데이터의 반응 데이터가 산출되며, 반응 데이터의 분포에 기초하여 복원 이미지가 생성되는 것을 특징으로 한다.In addition, the data learning unit distinguishes the restored image by comparing the original image with the restored image generated by the generator and a generator that includes an encoder that compresses the original image into feature map data and a decoder that generates a high-resolution restored image from the feature map data. Includes a discriminator, wherein the encoder extracts a random variable corresponding to at least one feature information included in the original image and generates feature map data consisting of a combination of the random variables, and the decoder generates the generated through the encoder. A restored image is generated by applying a preset network layer to the feature map data, wherein the network layer includes a filter composed of a matrix of a preset size, and a feature map corresponding to at least one position in a multidimensional space preset by the filter. Response data of the data is calculated, and a restored image is generated based on the distribution of the response data.

또한, 데이터 학습부는, [수학식 1]에 기초하여 생성자 및 판별자를 학습시키되, 판별자는, [수학식 1]의 우변이 최대값을 갖도록 학습하고, 생성자는, [수학식 1]의 우변이 최소값을 갖도록 학습하는 것을 특징으로 한다.In addition, the data learning unit learns a generator and a discriminator based on [Equation 1], where the discriminator learns so that the right side of [Equation 1] has the maximum value, and the generator learns the right side of [Equation 1] It is characterized by learning to have a minimum value.

[수학식 1][Equation 1]

(상기 D(x)는 상기 판별자의 전달함수이고, 상기 G(x)는 상기 생성자의 전달함수이고, 상기 x~pdata(x)는 실제 데이터에 대한 확률분포에서 샘플링한 데이터를 의미함)(The D(x) is the transfer function of the discriminator, the G(x) is the transfer function of the generator, and the x~pdata(x) means data sampled from the probability distribution for actual data.)

또한, 데이터 학습부는, 판별자가 생성자에서 생성된 복원 이미지와 원본 이미지를 비교하여 복원 이미지를 구별할 수 없는 경우, 학습을 종료하는 것을 특징으로 한다.In addition, the data learning unit is characterized in that learning is terminated when the discriminator cannot distinguish the restored image by comparing the restored image generated by the generator with the original image.

또한, 데이터학습부는, 생성자의 학습이 종료된 경우, 인코더의 원본 이미지 압축 과정 및 디코더의 복원 이미지 생성 과정을 저장하고, 인코더의 원본 이미지 압축 과정을 데이터압축부에 적용하며, 디코더의 복원 이미지 생성 과정을 데이터복원부에 적용하는 것을 특징으로 한다.In addition, when the learning of the generator is completed, the data learning unit stores the encoder's original image compression process and the decoder's restored image generation process, applies the encoder's original image compression process to the data compression unit, and generates the decoder's restored image. The process is characterized by being applied to the data restoration department.

본 발명의 GAN 기반 인공지능 고품질 이미지 압축 시스템은 GAN 기반 인공지능 시스템을 적용하여 이미지 데이터의 특징 정보를 추출하여 압축하고, 추출된 특징 정보에 기초하여 압축된 이미지를 복원함으로써, 데이터의 처리, 저장 및 관리가 효율적으로 수행 가능한 효과를 가진다.The GAN-based artificial intelligence high-quality image compression system of the present invention extracts and compresses feature information of image data by applying a GAN-based artificial intelligence system, and restores the compressed image based on the extracted feature information to process and store data. and management can be carried out efficiently.

또한, 생성자 및 판별자로 구성된 GAN 알고리즘을 적용함으로써, 두 개의 네트워크가 경쟁하며 학습하여 원본 데이터의 복잡하고 다양한 특징을 유연하게 학습할 수 있어 원본 데이터에 근사한 복원 데이터를 생성할 수 있는 효과를 가진다.In addition, by applying the GAN algorithm consisting of a generator and a discriminator, two networks can compete and learn to flexibly learn complex and diverse features of the original data, which has the effect of generating restored data that is close to the original data.

도 1은 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 시스템의 구성도이다.
도 2 및 도3은 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 시스템의 데이터학습부를 설명하기 위한 도면이다.
도 4는 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 과정을 설명하기 위한 도면이다.Figure 1 is a configuration diagram of a GAN-based artificial intelligence high-quality image compression system according to an embodiment of the present invention.
Figures 2 and 3 are diagrams for explaining the data learning unit of the GAN-based artificial intelligence high-quality image compression system according to an embodiment of the present invention.
Figure 4 is a diagram illustrating the GAN-based artificial intelligence high-quality image compression process according to an embodiment of the present invention.

이상과 같은 본 발명에 대한 해결하고자 하는 과제, 과제의 해결 수단, 발명의 효과를 포함한 구체적인 사항들은 다음에 기재할 실시례 및 도면들에 포함되어 있다. 본 발명의 이점 및 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 실시례들을 참조하면 명확해질 것이다. Specific details, including the problem to be solved by the present invention, the means for solving the problem, and the effect of the invention, are included in the examples and drawings described below. The advantages and features of the present invention and methods for achieving them will become clear by referring to the embodiments described in detail below along with the accompanying drawings.

이하, 첨부된 도면을 참조하여 본 발명을 보다 상세히 설명하기로 한다.Hereinafter, the present invention will be described in more detail with reference to the attached drawings.

도 1은 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 시스템의 구성도이고, 도 2 및 도3은 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 시스템의 데이터학습부를 설명하기 위한 도면이고, 도 4는 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 과정을 설명하기 위한 도면이다.Figure 1 is a configuration diagram of a GAN-based artificial intelligence high-quality image compression system according to an embodiment of the present invention, and Figures 2 and 3 show the data learning unit of the GAN-based artificial intelligence high-quality image compression system according to an embodiment of the present invention. This is a diagram for explanation, and Figure 4 is a diagram for explaining the GAN-based artificial intelligence high-quality image compression process according to an embodiment of the present invention.

<실시례 1><Example 1>

도 1을 참고하면, 본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 시스템(100)은 데이터압축부(110), 데이터저장부(120), 데이터복원(130) 및 데이터학습부(140)로 구성되되, 상기 데이터압축부(110)는 입력된 원본 이미지를 기설정된 형식의 데이터로 압축하고, 상기 데이터저장부(120)는 상기 데이터압축부(110)에서 압축된 데이터를 저장하고, 상기 데이터복원(130)는 상기 데이터저장부(120)에 저장된 압축 데이터 중 적어도 어느 하나를 호출하고, 상기 압축 데이터에 대한 고해상도의 복원 이미지를 생성하고, 상기 데이터학습부(140)는 상기 데이터압축부(110) 및 상기 데이터복원부(130) 중 적어도 어느 하나를 학습시킬 수 있다.Referring to Figure 1, the GAN-based artificial intelligence high-quality image compression system 100 according to an embodiment of the present invention includes a data compression unit 110, a data storage unit 120, a data restoration 130, and a data learning unit ( 140), wherein the data compression unit 110 compresses the input original image into data in a preset format, and the data storage unit 120 stores the data compressed in the data compression unit 110. , the data restoration 130 calls at least one of the compressed data stored in the data storage unit 120 and generates a high-resolution restored image for the compressed data, and the data learning unit 140 stores the data. At least one of the compression unit 110 and the data restoration unit 130 can be trained.

보다 상세하게는, 도 2 및 도 3에 도시된 바와 같이 상기 데이터학습부(140)는 상기 원본 이미지를 상기 피쳐맵 데이터로 압축하는 인코더 및 상기 피쳐맵 데이터로부터 고해상도의 복원 이미지를 생성하는 디코더를 포함하는 생성자(141) 및 상기 생성자(141)에서 생성된 복원 이미지와 상기 원본 이미지를 비교하여 상기 복원 이미지를 구별하는 판별자(142)를 포함할 수 있다.More specifically, as shown in FIGS. 2 and 3, the data learning unit 140 includes an encoder that compresses the original image into the feature map data and a decoder that generates a high-resolution restored image from the feature map data. It may include a generator 141 and a discriminator 142 that compares the restored image generated by the generator 141 with the original image to distinguish the restored image.

이때, 상기 인코더는 상기 원본 이미지에 포함된 적어도 하나의 특징 정보에 대응하는 확률 변수를 추출하고, 상기 확률 변수의 조합으로 구성되는 피쳐맵 데이터를 생성할 수 있다.At this time, the encoder may extract a random variable corresponding to at least one feature information included in the original image and generate feature map data composed of a combination of the random variables.

일례로, 상기 원본 이미지가 인물 이미지인 경우, 상기 특징 정보는 눈썹 두께, 눈의 길이, 입의 크기 등을 포함할 수 있으며, 상기 원본 이미지가 병리 이미지인 경우, 표피세포, 핵의 크기, 결, 병변 부위 등의 다양한 요소를 포함할 수 있다.For example, if the original image is a human image, the feature information may include eyebrow thickness, eye length, mouth size, etc., and if the original image is a pathology image, the size and texture of epidermal cells, nuclei, etc. , lesion area, etc. can be included.

또한, 상기 원본 이미지는 통계적인 평균값을 가지며, 이러한 평균값의 군집을 개별 특징으로 구분할 수 있다. 따라서 상기 원본 이미지는 상기 개별 특징에 대응하는 확률 변수의 조합으로 표현될 수 있으며, 상기 인코더는 상기 원본 이미지에 포함된 특징 정보를 추출하고, 상기 특징 정보에 대응하는 확률 변수의 조합으로 구성되는 피쳐맵 데이터를 생성할 수 있다.Additionally, the original image has a statistical average value, and clusters of these average values can be distinguished as individual features. Therefore, the original image can be expressed as a combination of random variables corresponding to the individual features, and the encoder extracts feature information included in the original image and creates a feature composed of a combination of random variables corresponding to the feature information. Map data can be created.

이때, 상기 피쳐맵 데이터는 기지정된 다차원 특징 공간 상에 한 점으로 표현될 수 있다.At this time, the feature map data may be expressed as one point on a predetermined multidimensional feature space.

한편, 상기 디코더는, 상기 인코더를 통해 생성된 상기 피쳐맵 데이터에 기설정된 네트워크 레이어를 적용하여 상기 복원 이미지를 생성할 수 있다. 이때, 상기 네트워크 레이어는 기설정된 크기의 행렬로 구성된 필터를 포함할 수 있다. Meanwhile, the decoder may generate the restored image by applying a preset network layer to the feature map data generated through the encoder. At this time, the network layer may include a filter composed of a matrix of a preset size.

따라서, 상기 디코더는 상기 피쳐맵 데이터 형태로 압축된 이미지에 상기 필터를 적용하여 주요 특징 정보를 추출하고, 상기 추출된 주요 특징 정보에 기초하여 복원 이미지를 생성하게 된다.Accordingly, the decoder extracts key feature information by applying the filter to the image compressed in the form of feature map data, and generates a restored image based on the extracted key feature information.

상기 피쳐맵 데이터에 상기 필터를 적용하면, 기설정된 다차원 공간 내 적어도 어느 한 위치에 대응하는 상기 피쳐맵 데이터의 반응 데이터가 산출되며, 상기 반응 데이터의 분포에 기초하여 상기 복원 이미지가 생성될 수 있다.When the filter is applied to the feature map data, response data of the feature map data corresponding to at least one location in a preset multidimensional space is calculated, and the restored image can be generated based on the distribution of the response data. .

한편, 상기 데이터학습부(140)는, 상기 원본 이미지와 상기 복원 이미지를 비교하여 오차(손실)를 산출하되, 상기 오차(손실)는 상기 복원 이미지가 상기 원본 이미지일 확률로 산출되며, 상기 판별자(142)가 상기 원본 이미지와 구별할 수 없는 상기 복원 이미지를 상기 생성자(141)가 생성하도록 상기 오차(손실)에 기초하여 상기 네트워크 레이어 및 상기 필터의 가중치를 업데이트 할 수 있다.Meanwhile, the data learning unit 140 calculates an error (loss) by comparing the original image and the restored image, and the error (loss) is calculated as the probability that the restored image is the original image, and the determination The generator 141 may update the weights of the network layer and the filter based on the error (loss) so that the generator 141 generates the restored image that the ruler 142 cannot distinguish from the original image.

또한, 상기 네트워크 레이어 및 상기 필터의 가중치가 업데이트 됨에 따라, 상기 생성자(141)는 상기 복원 이미지를 생성하는 과정을 학습하고, 상기 판별자(142)는 상기 복원 이미지와 상기 원본 이미지를 구분하는 과정을 학습하되, 하기 [수학식 1]에 기초하여 상기 이미지생성부(130)는 우변이 최소값을 갖도록 학습하고, 상기 이미지판별부(140)는 상기 우변이 최대값을 갖도록 학습할 수 있다.In addition, as the weights of the network layer and the filter are updated, the generator 141 learns the process of generating the restored image, and the discriminator 142 distinguishes the restored image from the original image. However, based on the following [Equation 1], the image generator 130 learns the right side to have the minimum value, and the image determination unit 140 learns the right side to have the maximum value.

[수학식 1][Equation 1]

즉, 상기 생성자(141)는 상기 판별자(142)로 하여금 상기 생성자(141)에서 생성된 복원 이미지가 원본 이미지인지 정확하게 구별하지 못하도록 상기 원본 이미지와 유사한 상기 복원 이미지를 생성할 수 있다.That is, the generator 141 may generate the restored image similar to the original image so that the discriminator 142 cannot accurately distinguish whether the restored image generated by the generator 141 is the original image.

보다 상세하게는, 상기 [수학식 1]의 우변에서 첫번째 항은 상기 생성자(141)의 전달함수인 G(x)를 포함하지 않으므로 생략될 수 있고, 두번째 항이 최솟값을 갖기 위해서는 log(1-D(G(x))가 최소가 되어야 하므로, D(G(x))는 1이 되어야 한다.More specifically, the first term on the right side of [Equation 1] can be omitted because it does not include G(x), which is the transfer function of the generator 141, and in order for the second term to have the minimum value, log(1-D Since (G(x)) must be minimal, D(G(x)) must be 1.

따라서, 상기 생성자(141)는 상기 판별자(142)가 상기 생성자(141)에서 생성된 복원 이미지를 원본 이미지라고 판단할 수 있을 정도로 상기 원본 이미지와 유사한 복원 이미지를 생성하도록 학습할 수 있다.Accordingly, the generator 141 can learn to generate a restored image similar to the original image to the extent that the discriminator 142 can determine that the restored image generated by the generator 141 is the original image.

한편, 상기 판별자(142)는 상기 [수학식 1]의 우변에서 첫번째 항 및 두번째 항이 모두 최대값을 갖도록 학습할 수 있다.Meanwhile, the discriminator 142 can learn so that both the first and second terms on the right side of [Equation 1] have maximum values.

보다 상세하게는, 첫번째 항인 log D(x)가 1이 되어야 하고, 두번째 항인 log(1-D(G(x)) 또한 1이 되어야 하므로, D(G(x))는 0이 되어야 한다. More specifically, the first term, log D(x), must be 1, and the second term, log(1-D(G(x))) must also be 1, so D(G(x)) must be 0.

다시 말해, 상기 판별자(142)는 보다 정확하게 원본 이미지와 복원 이미지를 구별할 수 있어야 하며, 원본 이미지에 대해서는 1을 출력해야 하고, 복원 이미지에 대해서는 0을 출력하도록 학습할 수 있다.In other words, the discriminator 142 must be able to more accurately distinguish between the original image and the restored image, and can learn to output 1 for the original image and 0 for the restored image.

따라서, 상기 생성자(141)는 상기 판별자(142)를 속일 수 있을 만큼 원본 이미지와 유사한 복원 이미지를 생성하도록 학습하고, 상기 판별자(142)는 상기 생성자(141)에 의해 생성된 복원 이미지를 정확하게 구별하도록 학습함으로써, 상기 생성자(141) 및 상기 판별자(142)에 의해서 상기 원본 이미지와 거의 흡사한 고해상도 복원 이미지를 생성할 수 있게 된다. Accordingly, the generator 141 learns to generate a restored image similar to the original image enough to deceive the discriminator 142, and the discriminator 142 uses the restored image generated by the generator 141. By learning to accurately distinguish, it is possible to generate a high-resolution restored image that is almost similar to the original image by the generator 141 and the discriminator 142.

한편, 상기 데이터 학습부(140)는, 상기 판별자(142)가 상기 생성자(141)에서 생성된 복원 이미지와 상기 원본 이미지를 비교하여 상기 복원 이미지를 구별할 수 없는 경우, 학습을 종료할 수 있다.Meanwhile, the data learning unit 140 may end learning when the discriminator 142 cannot distinguish the restored image by comparing the restored image generated by the generator 141 with the original image. there is.

또한, 상기 데이터학습부(140)는, 상기 생성자(141)의 학습이 종료된 경우, 상기 인코더의 원본 이미지 압축 과정 및 상기 디코더의 복원 이미지 생성 과정을 저장하고, 상기 인코더의 원본 이미지 압축 과정을 상기 데이터압축부(110)에 적용하며, 상기 디코더의 복원 이미지 생성 과정을 상기 데이터복원부(130)에 적용할 수 있다.In addition, when learning of the generator 141 is completed, the data learning unit 140 stores the original image compression process of the encoder and the restored image generation process of the decoder, and performs the original image compression process of the encoder. It is applied to the data compression unit 110, and the restored image generation process of the decoder can be applied to the data restoration unit 130.

즉, 상기 데이터압축부(110) 및 상기 데이터복원부(130)는 상기 데이터학습부(140)에서 학습이 완료된 알고리즘을 적용하여 보다 신속하게 데이터를 압축하고 복원할 수 있게 된다. That is, the data compression unit 110 and the data restoration unit 130 can compress and restore data more quickly by applying the algorithm learned in the data learning unit 140.

일례로, 상기 생성자(141)의 학습이 종료된 경우, 상기 생성자(141)에서 적용된 상기 네트워크 레이어 및 상기 필터의 가중치가 결정되며, 상기 네트워크 레이어 및 상기 필터의 가중치가 결정된 경우, 상기 생성자(141)는 더 이상의 상기 네트워크 레이어 및 상기 필터의 가중치를 업데이트를 위한 학습 과정을 거치지 않고 한 번에 상기 원본 이미지에 근사한 상기 복원 이미지를 생성할 수 있다.For example, when learning of the generator 141 is completed, the weights of the network layer and the filter applied by the generator 141 are determined, and when the weights of the network layer and the filter are determined, the generator 141 ) can generate the restored image that approximates the original image at once without going through any more learning process to update the weights of the network layer and the filter.

도 4를 참고하면, GAN 기반 인공지능 고품질 이미지 압축 시스템(100)은 GAN 알고리즘을 적용하여 상기 데이터학습부(140)의 상기 생성자(141) 및 상기 판별자(142)를 학습시키고, 상기 생성자(141) 및 상기 판별자(142)의 학습이 완료된 경우, 상기 생성자(141)의 알고리즘을 상기 데이터압축부(110) 및 상기 데이터복원부(130)에 적용하여 보다 신속하게 원본 이미지(10)를 압축하고, 고해상도의 복원 이미지(20)를 생성할 수 있다.Referring to Figure 4, the GAN-based artificial intelligence high-quality image compression system 100 applies the GAN algorithm to learn the generator 141 and the discriminator 142 of the data learning unit 140, and the generator ( 141) and when the learning of the discriminator 142 is completed, the algorithm of the generator 141 is applied to the data compression unit 110 and the data restoration unit 130 to more quickly retrieve the original image 10. It is possible to compress and generate a high-resolution restored image 20.

<실시례 2><Example 2>

본 발명의 일실시례에 따른 GAN 기반 인공지능 고품질 이미지 압축 방법은 원본 이미지가 입력되는 제1 단계, 상기 원본 이미지에 포함된 적어도 어느 하나의 특징 정보에 대응하는 확률 변수를 추출하고 특징 정보 데이터를 생성하는 제2 단계, 상기 특징 정보 데이터를 기지정된 다차원 특징 공간 상에 하나의 점으로 디스플레이되는 피쳐맵 데이터로 변환하는 제3 단계, 상기 피쳐맵 데이터로부터 고해상도의 복원 이미지를 생성하는 제4 단계 및 상기 복원 이미지와 상기 원본 이미지를 비교하여 상기 복원 이미지를 판별하는 제5 단계를 포함할 수 있다.The GAN-based artificial intelligence high-quality image compression method according to an embodiment of the present invention includes a first step in which an original image is input, extracting a random variable corresponding to at least one feature information included in the original image, and extracting the feature information data. A second step of generating, a third step of converting the feature information data into feature map data displayed as a single point on a predetermined multidimensional feature space, a fourth step of generating a high-resolution restored image from the feature map data, and It may include a fifth step of comparing the restored image with the original image to determine the restored image.

또한, 상기 제4 단계는, 상기 피쳐맵 데이터에 기지정된 네트워크 레이어를 적용하는 제4-1 단계, 상기 다차원 특징 공간 상에 디스플레이된 피쳐맵 데이터의 반응 데이터를 산출하는 제4-2 단계 및 상기 반응 데이터의 분포에 기초하여 상기 복원 이미지를 생성하는 제4-3 단계를 포함할 수 있다.In addition, the fourth step includes a step 4-1 of applying a predetermined network layer to the feature map data, a step 4-2 of calculating response data of the feature map data displayed on the multidimensional feature space, and the step 4-2 of applying a predetermined network layer to the feature map data. It may include a 4-3 step of generating the restored image based on the distribution of response data.

따라서, 상기 GAN 기반 인공지능 고품질 이미지 압축 방법에 의해 대용량의 고해상도 원본 이미지는 상기 원본 이미지에 포함된 특징 정보에 대응되는 확률변수 및 그 조합에 의해 면적은 줄어들고 깊이가 깊어진 피쳐맵 형태로 압축됨으로써, 데이터의 보관 및 전송이 용이한 효과를 가진다.Therefore, by using the GAN-based artificial intelligence high-quality image compression method, a large-capacity, high-resolution original image is compressed into a feature map with a reduced area and a deepened depth by random variables and their combinations corresponding to the feature information included in the original image, It has the effect of making it easy to store and transmit data.

상기와 같은 본 발명의 효과에 따르면, GAN 기반 인공지능 시스템을 적용하여 이미지 데이터의 특징 정보를 추출하여 압축하고, 추출된 특징 정보에 기초하여 압축된 이미지를 복원함으로써, 데이터의 처리, 저장 및 관리가 효율적으로 수행 가능한 GAN 기반 인공지능 고품질 이미지 압축 시스템 및 그 방법이 제공될 수 있다.According to the effect of the present invention as described above, the feature information of image data is extracted and compressed by applying a GAN-based artificial intelligence system, and the compressed image is restored based on the extracted feature information to process, store, and manage the data. A GAN-based artificial intelligence high-quality image compression system and method that can perform efficiently can be provided.

또한, 생성자 및 판별자로 구성된 GAN 알고리즘을 적용함으로써, 두 개의 네트워크가 경쟁하며 학습하여 원본 데이터의 복잡하고 다양한 특징을 유연하게 학습할 수 있어 원본 데이터에 근사한 복원 데이터를 생성할 수 있는 GAN 기반 인공지능 고품질 이미지 압축 시스템 및 그 방법이 제공될 수 있다.In addition, by applying the GAN algorithm consisting of a generator and a discriminator, two networks compete and learn to flexibly learn complex and diverse features of the original data, which is a GAN-based artificial intelligence that can generate restored data that approximates the original data. A high-quality image compression system and method may be provided.

또한, 생성자의 가중치를 업데이트 하는 오차값에 영향을 주는 판별자가 지속적으로 변화함으로써, 다양한 평가 기준에 대해 학습할 수 있는 GAN 기반 인공지능 고품질 이미지 압축 시스템을 제공하는 데 그 목적이 있다.In addition, the purpose is to provide a GAN-based artificial intelligence high-quality image compression system that can learn about various evaluation criteria by continuously changing the discriminator that affects the error value that updates the generator's weight.

또한, 생성자는 다양한 관점의 평가 기준을 만족하는 복원 이미지를 생성함으로써, 원본 데이터를 더욱 잘 근사하는 복원 데이터를 생성하는 강건한 인공지능 엔진을 구축할 수 있는 GAN 기반 인공지능 고품질 이미지 압축 시스템을 제공하는 데 그 목적이 있다.In addition, the generator provides a GAN-based artificial intelligence high-quality image compression system that can build a robust artificial intelligence engine that generates restored data that better approximates the original data by generating restored images that satisfy evaluation criteria from various perspectives. There is a purpose.

또한, 본 발명의 일실시례에 따른, GAN 기반 인공지능 고품질 이미지 압축 방법은 다양한 컴퓨터로 구현되는 동작을 수행하기 위한 프로그램 명령을 포함하는 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다.Additionally, the GAN-based artificial intelligence high-quality image compression method according to an embodiment of the present invention may be recorded on a computer-readable medium containing program instructions for performing various computer-implemented operations. The computer-readable medium may include program instructions, data files, data structures, etc., singly or in combination. The medium may have program instructions specifically designed and configured for the present invention, or may be known and available to those skilled in the art of computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. -Includes optical media (magneto-optical media) and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, etc. Examples of program instructions include machine language code, such as that produced by a compiler, as well as high-level language code that can be executed by a computer using an interpreter, etc.

이상과 같이 본 발명의 일실시례는 비록 한정된 실시례와 도면에 의해 설명되었으나, 본 발명의 일실시례는 상기 설명된 실시례에 한정되는 것은 아니며, 이는 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다. 따라서 본 발명의 일실시례는 아래에 기재된 특허청구범위에 의해서만 파악되어야 하고, 이의 균등 또는 등가적 변형 모두는 본 발명 사상의 범주에 속한다고 할 것이다.As described above, although one embodiment of the present invention has been described with limited examples and drawings, one embodiment of the present invention is not limited to the above-described embodiment, which is based on common knowledge in the field to which the present invention pertains. Anyone who has the knowledge can make various modifications and variations from this description. Accordingly, one embodiment of the present invention should be understood only by the scope of the claims set forth below, and all equivalent or equivalent modifications thereof shall fall within the scope of the spirit of the present invention.

110 : 데이터압축부
120 : 데이터저장부
130 : 데이터복원부
140 : 데이터학습부 141 : 생성자
142 : 판별자110: data compression unit
120: data storage unit
130: Data restoration department
140: Data learning unit 141: Constructor
142: Discriminator

Claims

a data compression unit that compresses the input original image into data in a preset format;
a data storage unit that stores data compressed in the data compression unit;
a data restoration unit that retrieves the data stored in the data storage unit and generates a high-resolution restored image; and
a data learning unit that trains at least one of the data compression unit and the data restoration unit;
A GAN-based artificial function high-quality image compression system including.

According to paragraph 1,
The data learning department,
a generator including an encoder that compresses the original image into feature map data and a decoder that generates a high-resolution restored image from the feature map data; and
A discriminator that compares the restored image generated by the generator with the original image to distinguish the restored image,

The encoder is,
Extracting a random variable corresponding to at least one feature information included in the original image and generating feature map data composed of a combination of the random variables,

The decoder is,
The restored image is generated by applying a preset network layer to the feature map data generated through the encoder,
The network layer includes a filter composed of a matrix of a preset size,
Response data of the feature map data corresponding to at least one position in a preset multidimensional space is calculated by the filter,
A GAN-based artificial intelligence high-quality image compression system, characterized in that the restored image is generated based on the distribution of the response data.

According to paragraph 2,
The data learning unit,
The generator and the discriminator are learned based on [Equation 1] below,

[Equation 1]

(The D(x) is the transfer function of the discriminator, the G(x) is the transfer function of the generator, and the x~pdata(x) means data sampled from the probability distribution for actual data.)

The discriminator is,
Learn so that the right side of [Equation 1] has the maximum value,
The constructor is:
A GAN-based artificial intelligence high-quality image compression system characterized by learning so that the right side of [Equation 1] has the minimum value.

According to paragraph 2,
The data learning unit,
A GAN-based artificial intelligence high-quality image compression system, characterized in that learning is terminated when the discriminator cannot distinguish the restored image by comparing the restored image generated by the generator with the original image.

According to clause 4,
The data learning department,
When learning of the generator is completed, the encoder's original image compression process and the decoder's restored image generation process are stored, the encoder's original image compression process is applied to the data compression unit, and the decoder's restored image is generated. A GAN-based artificial intelligence high-quality image compression system characterized by applying the process to the data restoration unit.