WO2022080681A1 - Image inpainting method and device considering peripheral region - Google Patents
Image inpainting method and device considering peripheral region
- Publication number
- WO2022080681A1 (PCT/KR2021/012505)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- image
- processor
- pixels
- information
- neural network
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/77—Retouching; Inpainting; Scratch removal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/60—Image enhancement or restoration using machine learning, e.g. neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Definitions
- the present invention relates to an image inpainting method and device in consideration of a surrounding area, and more particularly, to an image inpainting method using an artificial neural network.
- Image inpainting technology originated in the restoration of damaged works of art.
- image inpainting technology is used for various purposes, such as restoring damaged parts of images including paintings, photos, and videos, or removing unwanted parts from images.
- the image inpainting technique is a technique that corrects pixels of the background image region adjacent to the boundary of the region to be restored (pixels in the peripheral region) and fills the region to be restored using the corrected pixels.
- the background image area may mean an area of the original image other than the area to be restored.
- image inpainting is typically performed by a professional image editor, and even for a professional, the editing work required for inpainting demands a high degree of concentration. Therefore, it is difficult for ordinary people to apply image inpainting to images taken in their daily life.
- An object of the present invention to solve the above problems is to provide a method for training an artificial neural network for image inpainting by updating the weight vectors of the artificial neural networks of a device performing image inpainting.
- Another object of the present invention to solve the above problems is to provide a method of restoring an original image using objects in the area surrounding a removed region.
- an operating method for learning an image inpainting model may include: generating a first image by adding a separate object to an original image; classifying objects included in the first image based on information on each pixel of the first image using a first artificial neural network for object classification; removing the separate object from the first image; setting, in the first image, a boundary between a second object and a third object in a first area corresponding to the removed separate object; restoring the first area based on pixel information of the second object and the third object using a second artificial neural network for image restoration, and generating a restored image; calculating a difference value between pixels of the original image and the restored image; and updating a weight vector of the second artificial neural network for image restoration based on the difference value between the pixels of the original image and the restored image.
- the setting of the boundary between the second object and the third object in the first area may be performed based on at least one of pixel information of each of the second object and the third object and attribute information of each of the second object and the third object.
- after the restored image is generated, the method may include analyzing pattern information of the second object based on pixel information of at least a partial area of the second object, and overlapping the pattern of the second object on at least a partial area of the first area divided by the boundary.
- after the restored image is generated, the method may include obtaining a separate second image from a moving picture including the first image, obtaining information on a first pixel group that is a set of pixels corresponding to the first area in the second image, and overlapping the first pixel group on the first area of the first image.
- the method may include correcting the first pixel group, and the overlapping of the pixels may include overlapping the corrected pixels on the first area.
- the method may include tracking, in the second image, the first object located in a second area, obtaining information on a second pixel group that is a set of pixels corresponding to the second area in the first image, and overlapping the second pixel group on the second area of the second image.
- the method may further include updating a weight vector of the first artificial neural network for classifying the objects based on the difference value between the pixels of the original image and the restored image, wherein the weight vector of the first artificial neural network may be updated based on a difference value between information on preset objects of the first image and the objects classified based on the information on each pixel of the first image.
- the weight vector of the second artificial neural network may be updated based on a difference value between information on pixels in the area of the first image corresponding to the preset first object and information on pixels in the area corresponding to the restored first object.
- according to the present invention, an image inpainting service can be provided to a user more easily by restoring the original image using objects in the area surrounding the removed region.
- FIG. 1 is a block diagram showing a first embodiment of the structure of a terminal.
- FIG. 2 is a flowchart illustrating an embodiment of an operation of a device for image inpainting.
- FIG. 3 is a flowchart illustrating a first embodiment of an operation of a processor for image restoration among operations for image inpainting.
- FIG. 4 is a flowchart illustrating a second embodiment of an operation of a processor for image restoration among operations for image inpainting.
- FIG. 5 is a conceptual diagram illustrating an embodiment of an image correction process according to each operation process for image inpainting.
- FIG. 6 is a flowchart illustrating an embodiment of an operation of a device for inpainting an image of a moving picture.
- FIG. 7 is a conceptual diagram illustrating an embodiment of an image correction process according to each operation process for image inpainting of a moving picture.
- FIG. 8 is a flowchart illustrating an embodiment of an artificial neural network learning operation of a device for image inpainting.
- FIG. 9 is a conceptual diagram illustrating an embodiment of a process of performing an artificial neural network learning operation of a device for image inpainting.
- FIG. 10 is a flowchart illustrating an embodiment of an operation based on an artificial neural network among operations for image inpainting.
- FIG. 11 is a conceptual diagram illustrating an embodiment of an object tracking operation among operations for image inpainting.
- the terms first, second, etc. may be used to describe various elements, but the elements should not be limited by these terms. The terms are used only to distinguish one component from another. For example, without departing from the scope of the present invention, a first component may be referred to as a second component, and similarly, a second component may also be referred to as a first component. The term "and/or" includes any combination of a plurality of related listed items or any one of the plurality of related listed items.
- FIG. 1 is a block diagram showing a first embodiment of the structure of a terminal.
- the terminal may include a camera 110 , a processor 120 , a display 130 , and a memory 140 .
- the camera 110 may include an image sensor 111 , a buffer 112 , a preprocessing module 113 , a resizer 114 , and a controller 115 .
- the camera 110 may acquire an image of the external area.
- the camera 110 may store the raw data generated by the image sensor 111 in the buffer 112 of the camera.
- the raw data may be processed by the controller 115 or the processor 120 in the camera 110 .
- the processed data may be transmitted to the display 130 or the encoder 123.
- the raw data may be processed and then stored in the buffer 112, and may be transferred from the buffer 112 to the display 130 or the encoder 123.
- the image acquired by the camera 110 may be a portion of an equirectangular (ERP) image, a panoramic image, a circular fisheye image, a spherical image, or a three-dimensional image.
- the image sensor 111 may collect raw data by detecting light incident from the outside.
- the image sensor 111 may include, for example, at least one of a charge coupled device (CCD), a complementary metal oxide semiconductor (CMOS) image sensor, and an infrared (IR) optical sensor 150 .
- the image sensor 111 may be controlled by the controller 115 .
- the pre-processing module 113 may convert the raw data obtained by the image sensor 111 into a color space form.
- the color space may be one of a YUV color space, a red green blue (RGB) color space, and a red green blue alpha (RGBA) color space.
- the preprocessing module 113 may transmit the data converted into the color space form to the buffer 112 or the processor 120 .
- the pre-processing module 113 may correct an error or distortion of an image included in the received raw data. In addition, the pre-processing module 113 may adjust the color or size of the image included in the raw data.
- the preprocessing module 113 may perform, for example, at least one of bad pixel correction (BPC), lens shading (LS) correction, demosaicing, white balance (WB), gamma correction, color space conversion (CSC), hue/saturation/contrast (HSC) enhancement, size conversion, filtering, and image analysis.
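- as a non-limiting illustration (added here for clarity and not part of the original disclosure), the following Python sketch shows what a very small subset of such a preprocessing pipeline could look like: a gray-world white balance, a gamma correction, and an RGB-to-YUV color space conversion. The function name, the gain rule, the gamma value, and the BT.601 conversion matrix are assumptions for illustration; the preprocessing module 113 of the disclosure is not limited to these operations.

```python
import numpy as np

def preprocess(raw_rgb: np.ndarray) -> np.ndarray:
    """Toy ISP-style pipeline: white balance, gamma correction, RGB->YUV.

    `raw_rgb` is assumed to be an HxWx3 float array in [0, 1] that has
    already been demosaiced; gains and gamma are illustrative values only.
    """
    # Gray-world white balance: scale each channel toward the global mean.
    channel_means = raw_rgb.reshape(-1, 3).mean(axis=0)
    gains = channel_means.mean() / np.maximum(channel_means, 1e-6)
    balanced = np.clip(raw_rgb * gains, 0.0, 1.0)

    # Gamma correction (sRGB-like exponent).
    gamma_corrected = balanced ** (1.0 / 2.2)

    # Color space conversion to YUV (BT.601 coefficients).
    m = np.array([[ 0.299,  0.587,  0.114],
                  [-0.147, -0.289,  0.436],
                  [ 0.615, -0.515, -0.100]])
    return gamma_corrected @ m.T

if __name__ == "__main__":
    yuv = preprocess(np.random.rand(8, 8, 3))
    print(yuv.shape)
```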
- the processor 120 of the terminal may include a management module 121 , an image processing module 122 , and an encoder 123 .
- the management module 121, the image processing module 122, and the encoder 123 may be hardware modules included in the processor 120, or may be software modules executable by the processor 120. Referring to FIG. 1, the management module 121, the image processing module 122, and the encoder 123 are illustrated as being included in the processor 120, but are not limited thereto. Some of the management module 121, the image processing module 122, and the encoder 123 may be implemented as modules separate from the processor 120.
- the management module 121 may control the camera 110 included in the terminal.
- the management module 121 may control initialization of the camera 110 , a power input mode of the camera 110 , and an operation of the camera 110 .
- the management module 121 may control an image processing operation of the buffer 112 included in the camera 110 , captured image processing, image size, and the like.
- the management module 121 may control the camera 110 so that auto focus, auto exposure, resolution, bit rate, frame rate, camera power mode, VBI, zoom, gamma, or white balance is adjusted.
- the management module 121 may transmit the acquired image to the image processing module 122 and control the image processing module 122 to perform processing.
- the management module 121 may transmit the acquired image to the encoder 123 .
- the management module 121 may control the encoder 123 to encode the obtained image.
- the image processing module 122 may acquire an image from the management module 121 .
- the image processing module 122 may perform a processing operation on the acquired image. Specifically, the image processing module 122 may perform noise reduction, filtering, image synthesis, color correction, color conversion, image transformation, 3D modeling, image drawing, augmented reality (AR)/virtual reality (VR) processing, dynamic range adjustment, perspective adjustment, shearing, resizing, edge extraction, region of interest (ROI) determination, image matching, and/or image segmentation on the acquired image.
- the image processing module 122 may perform processing such as synthesizing a plurality of images, generating a stereoscopic image, or generating a depth-based panoramic image.
- FIG. 2 is a flowchart illustrating an embodiment of an operation of a device for image inpainting.
- the processor of the device may perform operations for image inpainting.
- the processor may include an artificial neural network (ANN), and may perform operations for image inpainting based on artificial intelligence using the artificial neural network.
- the processor may classify objects included in the first image.
- the processor may classify objects included in the first image by using an artificial neural network. For example, the processor may classify a plurality of objects included in the first image based on information on pixels included in the first image.
- the processor may reflect color information of pixels and the like in classifying the objects included in the first image. When the difference between the color information values of adjacent pixels is less than or equal to a preset value, the processor may determine that the adjacent pixels are pixels obtained from the image of the same and/or a similar object.
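- the color-difference criterion above can be pictured with the following Python sketch (an editorial illustration, not part of the original disclosure): a simple region-growing pass that groups 4-adjacent pixels whose Euclidean color distance is below a preset threshold. The function name, the adjacency, the distance metric, and the threshold value are assumptions standing in for whatever criterion the artificial neural network actually learns.

```python
import numpy as np
from collections import deque

def group_similar_pixels(image: np.ndarray, threshold: float = 10.0) -> np.ndarray:
    """Label adjacent pixels whose color difference is below `threshold`.

    `image` is an HxWx3 float array; returns an HxW integer label map.
    A plain flood fill stands in for the learned classification.
    """
    h, w, _ = image.shape
    labels = np.full((h, w), -1, dtype=int)
    current = 0
    for sy in range(h):
        for sx in range(w):
            if labels[sy, sx] != -1:
                continue
            labels[sy, sx] = current
            queue = deque([(sy, sx)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and labels[ny, nx] == -1:
                        # Group the neighbour if its color difference is small enough.
                        if np.linalg.norm(image[y, x] - image[ny, nx]) <= threshold:
                            labels[ny, nx] = current
                            queue.append((ny, nx))
            current += 1
    return labels
```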
- the processor may determine a first object that is a correction target object (or a separate object) from among the objects of the first image.
- the processor may analyze each of the objects based on color information and shape information of the objects included in the first image, and may determine the first object to be corrected based on the analysis result of each of the objects.
- the processor may determine a first object to be corrected from among the objects of the first image based on input information from the user.
- the processor may remove the first object determined as the object to be corrected from the first image.
- the processor may set a boundary between the second object and the third object in a region corresponding to the first object removed from the first image.
- the processor may set a boundary between the second object and the third object in a region corresponding to the first object removed from the first image by using the artificial neural network.
- the processor may set a boundary between the second object and the third object in the area corresponding to the first object based on pixel information of each of the second object and the third object.
- the processor may set a boundary between the second object and the third object in the area corresponding to the first object based on at least one piece of attribute information of each of the second object and the third object. That is, the processor may divide the region corresponding to the first object of the first image into a region (a) associated with the second object and a region (b) associated with the third object, and may set a boundary between region (a) and region (b).
- the processor may restore an area corresponding to the first object based on the boundary based on pixel information of the second object and the third object.
- the processor may reconstruct a region corresponding to the first object based on pixel information of the second object and the third object by using the artificial neural network. That is, the processor may reconstruct region (a) of the first image based on pixel information of the second object, and may reconstruct region (b) of the first image based on pixel information of the third object. Accordingly, as a result of image inpainting by the processor, the processor may acquire a restored first image.
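- a minimal sketch of this boundary-based restoration, assuming the second and third objects are given as binary masks, is shown below (an editorial illustration, not the disclosed neural-network implementation): each pixel of the removed area is assigned to whichever neighbouring object it is closer to, and is then filled with that object's mean color as a crude stand-in for the learned synthesis.

```python
import numpy as np

def restore_hole(image, hole_mask, obj2_mask, obj3_mask):
    """Fill the removed region by splitting it between two neighbouring objects.

    `image` is HxWx3, the masks are HxW boolean arrays, and both object
    masks are assumed to be non-empty. Nearest-object assignment stands in
    for the learned boundary; mean-color fill stands in for the synthesis.
    """
    filled = image.copy()
    ys2, xs2 = np.nonzero(obj2_mask)
    ys3, xs3 = np.nonzero(obj3_mask)
    mean2 = image[obj2_mask].mean(axis=0)
    mean3 = image[obj3_mask].mean(axis=0)
    for y, x in zip(*np.nonzero(hole_mask)):
        d2 = np.min((ys2 - y) ** 2 + (xs2 - x) ** 2)   # distance to second object
        d3 = np.min((ys3 - y) ** 2 + (xs3 - x) ** 2)   # distance to third object
        filled[y, x] = mean2 if d2 <= d3 else mean3
    return filled
```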
- FIG. 3 is a flowchart illustrating a first embodiment of an operation of a processor for image restoration among operations for image inpainting.
- the processor of the device may perform operations for image inpainting.
- the processor may include an artificial neural network, and may perform operations for image inpainting based on artificial intelligence using the artificial neural network.
- the processor may be in a state in which the operations of steps S210 to S240 of FIG. 2 are performed.
- the processor may analyze the pattern information of the second object.
- the processor may analyze the pattern information of the second object by using the artificial neural network.
- the processor may analyze the second object based on color information of pixels included in the second object, shape information of the second object, and the like, and may set an additional boundary line inside the second object based on the analysis result.
- the processor may acquire pattern information of the inside of the second object based on the information on the additional boundary line set inside the second object.
- the processor may analyze attribute information of the second object.
- the processor may analyze attribute information of the second object by using the artificial neural network.
- the processor may analyze the second object based on color information and shape information of the second object, and may determine attribute information of the second object based on the analysis result.
- the processor may label the attribute information on the second object.
- the processor may additionally label the attribute information in detailed areas of the second object.
- the processor may overlap the pattern of the second object on an area corresponding to the first object of the first image.
- the processor may overlap the pattern of the second object in an area corresponding to the first object by using the artificial neural network. Accordingly, as a result of image inpainting by the processor, a restored first image may be obtained.
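- the pattern-overlap step can be illustrated with the short Python sketch below (added for clarity, not part of the original disclosure): a patch representing the analysed pattern of the second object is tiled over the target region. The patch extraction and the tiling rule are assumptions; the disclosure leaves the pattern analysis to the artificial neural network.

```python
import numpy as np

def overlap_pattern(image, region_mask, patch):
    """Tile a pattern patch, sampled from the second object, over a region.

    `image` is HxWx3, `region_mask` is an HxW boolean array for the target
    part of the removed region, and `patch` is a small hxwx3 array holding
    the analysed pattern.
    """
    out = image.copy()
    ph, pw = patch.shape[:2]
    for y, x in zip(*np.nonzero(region_mask)):
        out[y, x] = patch[y % ph, x % pw]   # wrap the patch across the region
    return out
```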
- FIG. 4 is a flowchart illustrating a second embodiment of an operation of a processor for image restoration among operations for image inpainting.
- the processor of the device may perform operations for image inpainting.
- the processor may include an artificial neural network, and may perform operations for image inpainting based on artificial intelligence using the artificial neural network.
- the processor may be in a state in which the operations of steps S210 to S240 of FIG. 2 are performed.
- the processor may acquire a second image.
- the processor may acquire a second image similar to the first image, which is an image inpainting target, by using the artificial neural network.
- the processor may analyze the separate images based on the information on the pixels of the first image to obtain a second image having a similarity greater than or equal to a preset value among the separate images.
- the processor may acquire a second image similar to the first image from among the separate images stored in the memory of the device.
- the processor may acquire a second image similar to the first image from among separate images stored in an external database.
- the processor may acquire a first pixel group corresponding to an area corresponding to the first object in the second image.
- the processor may obtain a first pixel group corresponding to a region corresponding to the first object of the first image in the second image by using the artificial neural network.
- the processor may calculate location information of the region corresponding to the first object in the first image, and may obtain the first pixel group from the second image by applying the calculated location information to the second image.
- the processor may calculate information on pixels adjacent to an area corresponding to the first object in the first image, and obtain a first pixel group from the second image based on the calculated information on the pixels.
- the processor may correct the pixels of the first pixel group.
- the processor may optimize the pixels of the first pixel group to the first image by correcting the pixels of the first pixel group using the artificial neural network.
- the processor may compare color information of the pixels of the first group with color information of pixels adjacent to the first object in the first image.
- when the color information of the pixels of the first group does not match the color information of the pixels adjacent to the first object in the first image, the processor may correct the colors of the pixels of the first group based on the color information of the adjacent pixels.
- the processor may compare the shape information of the first group with the shape information of the first object of the first image. When the shape information of the first group does not match the shape of the first object of the first image, the processor may correct the shape of the first group based on the shape of the first object of the first image.
- the processor may overlap the pixels of the first pixel group in an area corresponding to the first object.
- the processor may correct the first image by using the artificial neural network to overlap the pixels of the corrected first pixel group, obtained from the second image, on the first image. Accordingly, as a result of image inpainting by the processor, a restored first image may be obtained.
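- a hedged sketch of this second-image-based restoration is given below (an editorial illustration under simplifying assumptions, not the disclosed implementation): the pixel group is taken from the same coordinates of the similar second image, shifted toward the color statistics of the pixels bordering the removed area, and then overlapped onto the first image. The mean-shift color correction and the 4-neighbour border extraction are assumptions for illustration.

```python
import numpy as np

def transplant_pixel_group(first_image, second_image, hole_mask):
    """Copy the hole-area pixels from a similar image and color-match them.

    Both images are HxWx3 float arrays in [0, 1]; `hole_mask` is HxW boolean.
    Location information is applied directly (same coordinates in both images).
    """
    # First pixel group: pixels of the second image at the hole location.
    pixel_group = second_image[hole_mask]

    # Pixels adjacent to the hole in the first image (one-pixel dilation ring).
    dilated = hole_mask.copy()
    dilated[1:, :] |= hole_mask[:-1, :]
    dilated[:-1, :] |= hole_mask[1:, :]
    dilated[:, 1:] |= hole_mask[:, :-1]
    dilated[:, :-1] |= hole_mask[:, 1:]
    border = dilated & ~hole_mask

    # Shift the group's colors toward the border statistics, then overlap.
    corrected = pixel_group + (first_image[border].mean(axis=0) - pixel_group.mean(axis=0))
    restored = first_image.copy()
    restored[hole_mask] = np.clip(corrected, 0.0, 1.0)
    return restored
```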
- FIG. 5 is a conceptual diagram illustrating an embodiment of an image correction process according to each operation process for image inpainting.
- the processor of the device may perform operations for image inpainting.
- the processor may apply an image inpainting technique to the first image of FIG. 5(a) to acquire a restored first image as shown in FIG. 5(c).
- Specific operations in which the processor acquires a reconstructed image by applying the image inpainting technique may be as follows.
- the processor may distinguish objects 510 , 520 , and 530 included in the first image.
- in distinguishing the objects, the processor may reflect color information of pixels and the like.
- the processor may determine a first object 510 that is a correction target object from among the objects of the first image.
- the processor may remove the first object 510 determined as the object to be corrected from the first image.
- the image from which the first object is removed from the first image may be as shown in FIG. 5(b) .
- the processor may set a boundary between the second object and the third object in a region corresponding to the first object removed from the first image. Accordingly, an area corresponding to the removed first object may be divided by a boundary to correspond to the object. Specifically, the processor may set a boundary between the second object 520 and the third object 530 in an area corresponding to the first object based on pixel information of each of the second object and the third object.
- the processor may reconstruct an area corresponding to the first object 510 based on the pixel information of the second object and the third object based on the boundary.
- the processor may restore a partial region 512 of the region corresponding to the first object 510 based on the pixel information of the second object, and may restore the remaining region 513 based on the pixel information of the third object.
- the processor may obtain the first pixel group for the partial region 512 from a second image, which is an image separate from the first image, and may overlap the pixels of the first pixel group on the partial region 512 of the region corresponding to the first object.
- the processor may overlap the pattern of the third object in a partial region 513 of the regions corresponding to the first object.
- the restored first image may be as shown in FIG. 5( c ).
- FIG. 6 is a flowchart illustrating an embodiment of an operation of a device for inpainting an image of a moving picture.
- the processor of the device may perform operations for image inpainting of a moving picture including a first image and a second image.
- the processor may include an artificial neural network, and may perform operations for image inpainting of a moving image based on artificial intelligence using the artificial neural network.
- the processor may be in a state in which the operations of steps S210 to S240 of FIG. 2 are performed.
- the processor may track the first object in the second image.
- the processor may track the first object in the second image by using the artificial neural network.
- the processor may determine, among the objects of the second image, an object having a degree of similarity to the first object in the first image equal to or greater than a preset value as the first object. That is, the processor may track an object having a similar color and shape in the second image by using the color and shape information of the first object.
- the processor may acquire, from the first image, a second pixel group corresponding to an area corresponding to the first object included in the second image.
- the processor may acquire a second pixel group corresponding to an area corresponding to the first object included in the second image by using the artificial neural network.
- the processor may calculate location information of the region corresponding to the first object in the second image, and may obtain the second pixel group from the first image by applying the calculated location information to the first image.
- the processor may calculate information on pixels adjacent to an area corresponding to the first object in the second image, and obtain a second pixel group from the first image based on the calculated information on the pixels.
- the processor may correct the pixels of the second pixel group.
- the processor may optimize the pixels of the second pixel group to the second image by correcting the pixels of the second pixel group using the artificial neural network.
- the processor may compare color information of the pixels of the second group with color information of pixels adjacent to the first object in the second image.
- when the color information of the pixels of the second group does not match the color information of the pixels adjacent to the first object in the second image, the processor may correct the colors of the pixels of the second group based on the color information of the adjacent pixels.
- the processor may compare the shape information of the second group with the shape information of the first object of the second image. When the shape information of the second group does not match the shape of the first object of the second image, the processor may correct the shape of the second group based on the shape of the first object of the second image.
- the processor may overlap the pixels of the second pixel group in an area corresponding to the first object.
- the processor may correct the second image by using the artificial neural network to overlap the pixels of the corrected second pixel group, obtained from the first image, on the second image. Accordingly, as a result of image inpainting by the processor, a moving picture including the restored first image and the restored second image may be obtained.
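- the similarity-based tracking step can be illustrated with the following Python sketch (added for clarity, not part of the original disclosure): the first object is located in the new frame by normalised cross-correlation against a template, and a match is accepted only when the similarity is greater than or equal to a preset value. The correlation measure and the threshold are assumptions standing in for the color-and-shape similarity learned by the artificial neural network.

```python
import numpy as np

def track_object(frame, template, threshold=0.9):
    """Locate an object in a new frame by normalised cross-correlation.

    `frame` and `template` are float arrays (grayscale or HxWx3). Returns the
    top-left corner of the best match, or None if the best similarity is
    below `threshold` (the "preset value" in the description).
    """
    fh, fw = frame.shape[:2]
    th, tw = template.shape[:2]
    t = template - template.mean()
    best_score, best_pos = -1.0, None
    for y in range(fh - th + 1):
        for x in range(fw - tw + 1):
            window = frame[y:y + th, x:x + tw]
            w = window - window.mean()
            denom = np.sqrt((w ** 2).sum() * (t ** 2).sum()) + 1e-8
            score = float((w * t).sum() / denom)
            if score > best_score:
                best_score, best_pos = score, (y, x)
    return best_pos if best_score >= threshold else None
```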
- FIG. 7 is a conceptual diagram illustrating an embodiment of an image correction process according to each operation process for image inpainting of a moving picture.
- the processor of the device may apply an image inpainting technique to a moving picture including the first image 710 and the second image 720, and may perform operations for generating a moving picture including the restored first image 730 and the restored second image 740.
- An operation for generating a moving picture including the restored images may be as follows.
- the processor may distinguish objects included in the first image 710 .
- the processor may determine a first object 711 that is a correction target object from among the objects of the first image 710 .
- the processor may track the first object 721 in the second image 720. Specifically, the processor may determine, among the objects of the second image 720, an object whose degree of similarity with the first object 711 in the first image 710 is equal to or greater than a preset value as the first object 721. The processor may distinguish the objects included in the second image 720 while tracking the first object 721 in the second image 720.
- the processor may obtain, from the second image 720 , a first pixel group 722 corresponding to an area corresponding to the first object 711 included in the first image 710 .
- the processor may obtain, from the first image 710 , a second pixel group 712 corresponding to an area corresponding to the first object 721 included in the second image 720 .
- the processor may further correct the pixels of the first pixel group and the second pixel group. For example, the processor may further correct the pixels of the first pixel group and the second pixel group based on a result of comparing the first image 710 and the second image 720, that is, based on the degree of change of each of the objects between the first image 710 and the second image 720.
- the processor may overlap the pixels of the first pixel group 722 in an area corresponding to the first object 711 in the first image 710 .
- the processor may overlap the pixels of the second pixel group 712 in an area corresponding to the first object 721 in the second image 720.
- the processor may generate a moving picture including the restored first image 730 and the restored second image 740 .
- FIG. 8 is a flowchart illustrating an embodiment of an artificial neural network learning operation of a device for image inpainting.
- a processor performing operations for image inpainting of a moving picture may include an artificial neural network and may perform operations for learning of the artificial neural network.
- the processor may be in a state in which the operations of steps S210 to S240 of FIG. 2 are performed.
- the processor may insert a first object into an existing first image and perform an operation of restoring the first image into which the first object is inserted at least once.
- the processor may perform an operation for learning the artificial neural network based on a comparison result between the existing first image and the restored first image obtained as a result of the restoration operation, and specific operations may be as follows.
- the processor may compare the first image with the first image restored as a result of image inpainting.
- the processor may compare information on objects included in the first image with information on objects included in the restored first image.
- the processor may compare information on each of the pixels of the first image with information on each of the pixels of the reconstructed first image.
- the information of the pixels compared by the processor may include color information of the pixel and the like.
- the processor may calculate a difference value between the object classification result of the first image and the object classification result of the restored first image.
- the processor may calculate a difference value between the objects included in the first image and the objects included in the restored first image, and the calculated difference value may be a value such as a difference between ranges of the objects.
- the processor may calculate a difference value between the first image and the first region restoration result of the restored first image.
- the processor may calculate a difference value between the restoration result of the region corresponding to the first object and the first image.
- the calculated difference value may be a difference value between information on pixels included in the region corresponding to the first object and information on a boundary line for distinguishing other objects in the region corresponding to the first object.
- the processor may update the weight vector of the artificial neural network based on the difference value between the first image and the reconstructed first image.
- the processor may update the weight vector of the artificial neural network for object classification based on the difference value between the first image and the restored first image.
- the processor may update the weight vector of the artificial neural network for image restoration based on the difference value between the first image and the restored first image.
- the processor may update the weight vector of the artificial neural network by applying the backpropagation method. Specifically, the processor may backpropagate a difference value between the objects included in the first image and the objects included in the restored first image to the artificial neural network to update the weight vector of the artificial neural network for object classification. In addition, the processor may backpropagate a difference value between the first image and the first region restoration result of the restored first image to the artificial neural network, and update the weight vector of the artificial neural network for image restoration.
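- the weight-update step can be pictured with the toy Python sketch below (an editorial illustration, not the disclosed network): a linear "restoration" model predicts hole pixels from context features, the pixel difference between the original and restored values is computed, and the weights are updated by gradient descent, which is the essence of backpropagating the difference value. The linear model, the feature layout, and the learning rate are assumptions; the disclosure uses artificial neural networks for classification and restoration.

```python
import numpy as np

def training_step(features, target_pixels, weights, lr=1e-3):
    """One toy gradient step for a linear 'restoration' model.

    `features` (N x D) are context pixels around each hole pixel and
    `target_pixels` (N x C) are the corresponding original pixel values.
    The weight matrix is updated from the mean squared pixel difference.
    """
    restored = features @ weights                      # forward pass
    diff = restored - target_pixels                    # pixel difference value
    loss = float((diff ** 2).mean())
    grad = features.T @ diff * (2.0 / diff.size)       # dLoss / dWeights
    weights -= lr * grad                               # gradient-descent weight update
    return loss, weights

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.normal(size=(27, 3)) * 0.01                # 3x3x3 neighbourhood -> RGB
    feats, targets = rng.random((64, 27)), rng.random((64, 3))
    for _ in range(5):
        loss, w = training_step(feats, targets, w)
    print(round(loss, 4))
```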
- FIG. 9 is a conceptual diagram illustrating an embodiment of a process of performing an artificial neural network learning operation of a device for image inpainting.
- a processor performing operations for image inpainting may include an artificial neural network, and may perform operations for learning the artificial neural network.
- to perform operations for learning the artificial neural network, the processor may insert the first object 921 into the existing first image 910 and may perform the operation of applying the image inpainting technique to the first image 920 into which the first object 921 is inserted at least once.
- the processor may perform an operation for learning the artificial neural network based on a comparison result between the existing first image 910 and the restored first image 930 obtained as a result of the restoration operation.
- the first image may be an image previously stored in a database or memory.
- the processor may compare the first image 910 with the reconstructed first image 930 .
- the processor may compare information on objects included in the first image 910 with information on objects included in the restored first image 930 .
- the processor may compare information on each of the pixels of the first image 910 with information on each of the pixels of the reconstructed first image 930 .
- the processor may calculate a difference value between the object classification result of the first image 910 and the object classification result of the restored first image 930.
- the processor may calculate a difference value between the first image 910 and the restoration result of the first region of the restored first image 930.
- the processor may update the weight vector of the artificial neural network based on a difference value between the first image 910 and the reconstructed first image 930 .
- the processor backpropagates the difference value between the objects included in the first image 910 and the objects included in the restored first image 930 to the artificial neural network to update the weight vector of the artificial neural network for object classification.
- the processor back-propagates the difference value between the first image 910 and the restoration result of the first region 921 of the restored first image 930 to the artificial neural network to update the weight vector of the artificial neural network for image restoration.
- FIG. 10 is a flowchart illustrating an embodiment of an operation based on an artificial neural network among operations for image inpainting.
- referring to FIG. 10, the processes performed in the first artificial neural network and the second artificial neural network are respectively shown.
- objects included in the first image may be classified based on information on each pixel of the first image using the first artificial neural network.
- the second artificial neural network may be used to restore the area corresponding to the removed object (the first area).
- the first area may be restored based on pixel information of the second object and the third object.
- the second object and the third object may correspond to areas surrounding the first area.
- FIG. 11 is a conceptual diagram illustrating an embodiment of an object tracking operation among operations for image inpainting.
- both two-way tracking and one-way tracking are possible. For example, as shown in the upper part of FIG. 11, tracking from the left and tracking from the right may be indicated. In the case of one-way tracking, tracking may be performed simultaneously with the segmentation task, thereby improving the learning efficiency of the deep learning network.
- the boundary between the second object and the third object used to restore the first area may be set according to the area occupied by the second object and the third object in the entire image. This is a simple approach chosen for convenience and may be used when restoring an area smaller than a predetermined size.
- when the image area occupied by the second object in the entire image is larger than that of the third object, the image area corresponding to the second object in the restored first area may be larger than the image area corresponding to the third object.
- the ratio of the area occupied by the second object to the area occupied by the third object in the entire image and the ratio of the image area corresponding to the second object to the image area corresponding to the third object in the restored first area may be the same.
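- one possible reading of this area-ratio rule is sketched below in Python (an editorial illustration, not part of the original disclosure): the pixels of the first area are divided between the second and third objects so that the split matches the ratio of the two objects' areas in the entire image. The top-to-bottom sweep used to realise the split is an assumption made for illustration.

```python
import numpy as np

def split_hole_by_area_ratio(hole_mask, obj2_mask, obj3_mask):
    """Assign hole pixels to two objects in proportion to their image areas.

    The hole is swept top to bottom; pixels go to the second object until its
    share (area2 / (area2 + area3)) is reached, and the rest go to the third
    object. Returns two boolean masks covering the hole.
    """
    area2, area3 = int(obj2_mask.sum()), int(obj3_mask.sum())
    share2 = area2 / max(area2 + area3, 1)
    ys, xs = np.nonzero(hole_mask)
    order = np.argsort(ys, kind="stable")              # sweep from the top
    cutoff = int(round(share2 * len(order)))
    assign2 = np.zeros_like(hole_mask, dtype=bool)
    assign3 = np.zeros_like(hole_mask, dtype=bool)
    assign2[ys[order[:cutoff]], xs[order[:cutoff]]] = True
    assign3[ys[order[cutoff:]], xs[order[cutoff:]]] = True
    return assign2, assign3
```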
- the methods according to the present invention may be implemented in the form of program instructions that can be executed by various computer means and recorded in a computer-readable medium.
- the computer-readable medium may include program instructions, data files, data structures, etc. alone or in combination.
- the program instructions recorded on the computer-readable medium may be specially designed and configured for the present invention, or may be known and available to those skilled in the art of computer software.
- Examples of computer-readable media include hardware devices specially configured to store and carry out program instructions, such as ROM, RAM, flash memory, and the like.
- Examples of program instructions include not only machine language codes such as those generated by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like.
- the hardware device described above may be configured to operate as at least one software module to perform the operations of the present invention, and vice versa.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
Abstract
Description
Claims (7)
- 1. An operating method for learning an image inpainting model, the method comprising: generating a first image by adding a separate object to an original image; classifying objects included in the first image based on information on each pixel of the first image using a first artificial neural network for object classification; removing the separate object from the first image; setting, in the first image, a boundary between a second object and a third object in a first area corresponding to the removed separate object; restoring the first area based on pixel information of the second object and the third object using a second artificial neural network for image restoration, and generating a restored image; calculating a difference value between pixels of the original image and the restored image; and updating a weight vector of the second artificial neural network for image restoration based on the difference value between the pixels of the original image and the restored image, wherein the weight vector of the second artificial neural network is updated based on a difference value between information on pixels in an area of the first image corresponding to the preset first object and information on pixels in an area corresponding to the restored first object.
- 2. The method according to claim 1, wherein the setting of the boundary between the second object and the third object in the first area is performed based on at least one of pixel information of each of the second object and the third object and attribute information of each of the second object and the third object.
- 3. The method according to claim 1, further comprising, after the generating of the restored image: analyzing pattern information of the second object based on pixel information of at least a partial area of the second object; and overlapping the pattern of the second object on at least a partial area of the first area divided by the boundary.
- 4. The method according to claim 1, further comprising, after the generating of the restored image: obtaining a separate second image from a moving picture including the first image; obtaining, from the second image, information on a first pixel group that is a set of pixels corresponding to the first area; and overlapping the first pixel group on the first area of the first image.
- 5. The method according to claim 4, further comprising correcting the first pixel group, wherein the overlapping of the pixels comprises overlapping the corrected pixels on the first area.
- 6. The method according to claim 4, further comprising: tracking, in the second image, the first object located in a second area; obtaining, from the first image, information on a second pixel group that is a set of pixels corresponding to the second area; and overlapping the second pixel group on the second area of the second image.
- 7. The method according to claim 1, further comprising updating a weight vector of the first artificial neural network for classifying the objects based on the difference value between the pixels of the original image and the restored image, wherein the weight vector of the first artificial neural network is updated based on a difference value between information on preset objects of the first image and the objects classified based on the information on each pixel of the first image.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020200134397A KR102389304B1 (en) | 2020-10-16 | 2020-10-16 | Method and device for image inpainting considering the surrounding information |
KR10-2020-0134397 | 2020-10-16 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022080681A1 true WO2022080681A1 (en) | 2022-04-21 |
Family
ID=81208271
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2021/012505 WO2022080681A1 (en) | 2020-10-16 | 2021-09-14 | Image inpainting method and device considering peripheral region |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR102389304B1 (en) |
WO (1) | WO2022080681A1 (en) |
- 2020-10-16 KR KR1020200134397A patent/KR102389304B1/en active IP Right Grant
- 2021-09-14 WO PCT/KR2021/012505 patent/WO2022080681A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110103644A1 (en) * | 2009-10-30 | 2011-05-05 | Zoran Corporation | Method and apparatus for image detection with undesired object removal |
KR20150009184A (en) * | 2013-07-16 | 2015-01-26 | 삼성전자주식회사 | Apparatus and method for processing an image having a camera device |
KR20190078543A (en) * | 2017-12-26 | 2019-07-04 | 삼성전자주식회사 | Image acqusition device and controlling method thereof |
Non-Patent Citations (2)
Title |
---|
AHN, WOO JIN : "A Study on inpainting background images", PROCEEDINGS OF DOMESTIC CONFERENCE OF INSTITUTE OF CONTROL, ROBOTICS AND SYSTEMS (ICROS 2019); MAY 16-18, 2019, 1 May 2019 (2019-05-01) - 18 May 2019 (2019-05-18), Korea, pages 381 - 382, XP009535836 * |
YANG CHAO; LU XIN; LIN ZHE; SHECHTMAN ELI; WANG OLIVER; LI HAO: "High-Resolution Image Inpainting Using Multi-scale Neural Patch Synthesis", 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), IEEE COMPUTER SOCIETY, US, 21 July 2017 (2017-07-21), US , pages 4076 - 4084, XP033249760, ISSN: 1063-6919, DOI: 10.1109/CVPR.2017.434 * |
Also Published As
Publication number | Publication date |
---|---|
KR102389304B1 (en) | 2022-04-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022108061A1 (en) | Method and device for joint denoising and demosaicing using neural network | |
WO2010038941A2 (en) | Apparatus and method of obtaining high resolution image | |
WO2017007096A1 (en) | Image capturing apparatus and method of operating the same | |
WO2013103184A1 (en) | Apparatus and method for improving image using color channels | |
WO2010101434A2 (en) | Apparatus and method of generating panoramic image and computer-readable recording medium storing program for executing the method | |
US20130242127A1 (en) | Image creating device and image creating method | |
WO2020111426A1 (en) | Method and system of presenting moving images or videos corresponding to still images | |
WO2009151292A2 (en) | Image conversion method and apparatus | |
WO2020085874A1 (en) | Method and apparatus for dynamic image capturing based on motion information in image | |
WO2015102203A1 (en) | Image processing apparatus, image processing method, and computer-readable recording medium | |
EP3776377A1 (en) | Method and system for dnn based imaging | |
US5644359A (en) | Image input apparatus having a white balance correction means and a method of inputting an image using the image input apparatus | |
WO2013118955A1 (en) | Apparatus and method for depth map correction, and apparatus and method for stereoscopic image conversion using same | |
WO2021241804A1 (en) | Multi-flow-based frame interpolation device and method | |
WO2020040521A1 (en) | Method for synthesizing intermediate view of light field, system for synthesizing intermediate view of light field, and method for compressing light field | |
WO2019035551A1 (en) | Apparatus for composing objects using depth map and method for the same | |
WO2022080680A1 (en) | Artificial-intelligence-based image inpainting method and device | |
EP3164992A1 (en) | Method and apparatus for image capturing and simultaneous depth extraction | |
WO2022080681A1 (en) | Image inpainting method and device considering peripheral region | |
WO2021137555A1 (en) | Electronic device comprising image sensor and method of operation thereof | |
WO2020055196A1 (en) | Apparatus and methods for generating high dynamic range media, based on multi-stage compensation of motion | |
WO2020189953A1 (en) | Camera analyzing images on basis of artificial intelligence, and operating method therefor | |
WO2023219466A1 (en) | Methods and systems for enhancing low light frame in a multi camera system | |
WO2019151808A1 (en) | Electronic device for compressing image by using compression attribute generated in image acquisition procedure using image sensor, and operating method thereof | |
WO2023080667A1 (en) | Surveillance camera wdr image processing through ai-based object recognition |
Legal Events
Code | Title | Description |
---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 21880329; Country of ref document: EP; Kind code of ref document: A1 |
NENP | Non-entry into the national phase | Ref country code: DE |
122 | Ep: pct application non-entry in european phase | Ref document number: 21880329; Country of ref document: EP; Kind code of ref document: A1 |
32PN | Ep: public notification in the ep bulletin as address of the addressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 20.10.2023) |