WO2021106961A1 - Image generation device - Google Patents

Image generation device

Info

Publication number
WO2021106961A1
WO2021106961A1 (PCT/JP2020/043904)
Authority
WO
WIPO (PCT)
Prior art keywords
image data
unit
trained model
learning
image
Prior art date
Application number
PCT/JP2020/043904
Other languages
French (fr)
Japanese (ja)
Inventor
Kensuke Yokota (横田 謙祐)
Naoki Sugiura (杉浦 直樹)
Original Assignee
Koito Manufacturing Co., Ltd. (株式会社小糸製作所)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koito Manufacturing Co., Ltd. (株式会社小糸製作所)
Priority to JP2021561466A priority Critical patent/JPWO2021106961A1/ja
Publication of WO2021106961A1 publication Critical patent/WO2021106961A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00 General purpose image data processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/50 Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects

Definitions

  • The present invention relates to an image generation device.
  • Patent Document 1 discloses an image generation device that generates intermediate image data by a morphing process.
  • In the morphing process, the difficulty of predicting the new image data generated from the first image data and the second image data cannot be set. There is therefore a concern that the new image data can easily be predicted from the first image data and the second image data. Further, since the intermediate image data lies between the first image data and the second image data, there is a concern that the new image data merely recalls the first image data or the second image data in appearance. Accordingly, it is desired to set the difficulty of prediction and then generate image data having a new design that is not easy to predict from the first image data and the second image data.
  • An object of the present invention is therefore to provide an image generation device capable of setting the difficulty of prediction and then generating image data having a new design that is not easy to predict from a plurality of image data.
  • The image generation device of the present invention includes: a recording unit that records a plurality of first image data and a plurality of second image data; a setting unit that sets a first weight for the first domain of the first image data and a second weight for the second domain of the second image data; a learning unit that generates, from the plurality of first image data and the plurality of second image data, a trained model corresponding to the first weight and the second weight for each pair of values of the first weight and the second weight; a trained model storage unit that stores the plurality of trained models; a trained model selection unit that selects one trained model from the plurality of trained models stored in the trained model storage unit; a test image data input unit that inputs test image data; and an image data generation unit that generates new image data from the input test image data using the trained model selected by the trained model selection unit.
  • The values of the first weight and the second weight, which indicate the difficulty of prediction, can be set, a trained model is generated for each pair of weight values, and new image data is generated from the test image data using one of the plurality of trained models.
  • The degree of conversion of the test image data changes according to which trained model, corresponding to particular values of the first weight and the second weight, is used.
  • The new image data can therefore be image data that is difficult to predict from the first image data and the second image data, and can have a new design. The image generation device of the present invention can thus set the difficulty of prediction and then generate new image data having a new design that is not easy to predict from the plurality of first and second image data.
  • The learning unit may generate a trained model according to the CycleGAN method.
  • The learning unit may generate each trained model by performing the calculation used in the CycleGAN method for the learning count set for that trained model.
  • the image generator of the present invention may further include an output unit that outputs new image data.
  • According to the present invention, it is possible to provide an image generation device capable of setting the difficulty of prediction and then generating image data having a new design that is not easy to predict from a plurality of image data.
  • FIG. 1 is a block diagram of an image generator according to an embodiment of the present invention.
  • FIG. 2 is a diagram for explaining the third term in the formula of the loss function of the learning unit.
  • FIG. 3 is a flowchart showing the trained model generation steps.
  • FIG. 4 is a flowchart showing the learning process in the trained model generation step.
  • FIG. 5 is a flowchart showing an image generation step.
  • FIG. 1 is a block diagram of the image generation device 10 according to the present embodiment.
  • The image generation device 10 generates a trained model from a plurality of image data according to the CycleGAN method, a variant of the generative adversarial network (GAN) method, and generates new image data using the generated trained model.
  • In FIG. 1, an example of generating a trained model from the domains of two sets of image data is shown.
  • a domain indicates a feature in image data.
  • The image generation device 10 includes a recording unit 21, a first image data input unit 23, a second image data input unit 25, a weight input unit 27, a learning count input unit 29, a control unit 40 including a learning unit 41 and an image data generation unit 47, a trained model storage unit 51, a test image data input unit 53, a trained model selection unit 55, and an image output unit 57.
  • each block of the image generation device 10 may be configured by hardware, may be configured by software, or may be configured by a combination of hardware and software.
  • the recording unit 21 records a plurality of first image data and a plurality of second image data.
  • Each first image data is data whose appearance, when output as an image (for example, a still image), is similar to that of the other first image data.
  • The first image data has been described, but the same applies to each second image data.
  • In the present embodiment, each first image data is image data showing a cat's eyes;
  • each first image data is thus data classified into the same category, namely cat's eyes.
  • Each second image data is image data showing a vehicle headlight;
  • each second image data is thus data classified into the same category, namely vehicle headlights, which is a category different from that of the first image data.
  • the plurality of first image data has a first domain
  • the plurality of second image data has a second domain.
  • the first image data is image data indicating the eyes of a cat
  • the first domain indicates, for example, the size and shape of the eyes.
  • the second image data is image data indicating a vehicle headlight
  • the second domain indicates, for example, the size and shape of the vehicle headlight.
  • the recording unit 21 records training image data and test image data, which will be described later.
  • the recording unit 21 is, for example, a memory.
  • the first image data input unit 23 inputs to the learning unit 41 an instruction to cause the learning unit 41 of the control unit 40 to read a plurality of first image data recorded in the recording unit 21.
  • the second image data input unit 25 inputs to the learning unit 41 an instruction to cause the learning unit 41 to read a plurality of second image data recorded in the recording unit 21.
  • The weight input unit 27 inputs, to the setting unit 43 (described later) of the learning unit 41, the weight λA of the first domain of the first image data and the weight λB of the second domain of the second image data, both used during learning by the learning unit 41.
  • The weights λA and λB indicate the difficulty of prediction described later.
  • The values of the weights λA and λB can be set as appropriate by the user.
  • The learning count input unit 29 inputs the learning count of the learning unit 41, described later, to the setting unit 43.
  • The first image data input unit 23, the second image data input unit 25, the weight input unit 27, and the learning count input unit 29 are input devices such as a keyboard and a mouse.
  • the control unit 40 includes a CPU (Central Processing Unit) and a memory.
  • the control unit 40 comprehensively controls the operation of the image generation device 10 by reading and executing the control program recorded in the memory by the CPU.
  • the learning unit 41 has a setting unit 43.
  • The setting unit 43 sets the weights λA and λB input from the weight input unit 27, and inputs the set weights λA and λB to the generation unit 45 and the identification unit 46 (both described later) of the learning unit 41. Further, the setting unit 43 sets the learning count input from the learning count input unit 29, and inputs the set learning count to the generation unit 45 and the identification unit 46.
  • the learning unit 41 has a generation unit 45 and an identification unit 46, and the generation unit 45 and the identification unit 46 form a neural network in machine learning.
  • the fake image data and the training image data used in the generation unit 45 and the identification unit 46 will be described.
  • The learning unit 41 consists mainly of the generation unit 45 and the identification unit 46.
  • the fake image data is fake data obtained by converting a certain image data so as to approximate it to the training image data.
  • the training image data is real data that is a basis for improving the accuracy of the fake image data in order to approximate the fake image data to the training image data.
  • the approximation here indicates the appearance when the image data is output as an image (for example, a still image).
  • When the training image data is the second image data, the fake image data is fake second image data obtained by converting the first image data so as to approximate it to the second image data serving as the training image data.
  • Conversely, when the training image data is the first image data, the fake image data is fake first image data obtained by converting the second image data so as to approximate it to the first image data serving as the training image data.
  • the generation unit 45 reads the above-mentioned image data from the recording unit 21, converts the image data, and generates fake image data from the image data.
  • the fake image data is input to the identification unit 46.
  • the identification unit 46 discriminates between the fake image data input from the generation unit 45 and the training image data read from the recording unit 21.
  • the identification unit 46 calculates information regarding the deviation between the fake image data and the training image data, and outputs the information to the generation unit 45.
  • The generation unit 45 then reads, from the recording unit 21, image data different from the image data read previously, converts this other image data based on the information from the identification unit 46, and generates fake image data different from the previous fake image data. The other fake image data is input to the identification unit 46, and the identification unit 46 discriminates between the other fake image data and the training image data.
  • The generation unit 45 and the identification unit 46 alternately compete in this way, and as a result both deepen their learning.
  • Eventually, the generation unit 45 can generate fake image data that is close to the training image data.
  • When the learning is complete, the identification unit 46 no longer outputs the information to the generation unit 45, and the generation unit 45 no longer generates fake image data.
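The alternation described above can be illustrated with a deliberately tiny numerical sketch. These are not the patent's networks: the "generation unit" here is a single scalar parameter, and the "identification" step simply reports the deviation of the fake samples from the real data, standing in for the information regarding the deviation.

```python
import numpy as np

rng = np.random.default_rng(0)
REAL_MEAN = 5.0  # stands in for the training image data

def generate(theta, n=256):
    # "Generation unit": fake samples scattered around the parameter theta.
    return theta + 0.1 * rng.standard_normal(n)

def discriminate(fake):
    # "Identification unit": information regarding the deviation from the real data.
    return fake.mean() - REAL_MEAN

theta = 0.0
for _ in range(200):            # alternate: generate, discriminate, feed back
    fake = generate(theta)
    theta -= 0.1 * discriminate(fake)
# theta has now moved close to REAL_MEAN, i.e. the generator produces
# fake data close to the training data.
```

The feedback loop is the point: each round, the generator uses the discriminator's output to reduce the gap to the real data.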
  • In the CycleGAN method, the generation unit 45 and the identification unit 46 generate and identify both the first image data and the second image data, and this point will be described below.
  • The CycleGAN method is represented by the loss function of the following equation (1).
  • In equation (1), a special loss function called the cycle loss, which is the third term, is added to the adversarial loss functions of the first and second terms.
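The equation itself did not survive in this text. A standard CycleGAN objective consistent with the term-by-term description that follows would read (a reconstruction, not the patent's exact notation):

```latex
\mathcal{L}(G, F, D_X, D_Y)
  = \mathcal{L}_{\mathrm{GAN}}(G, D_Y, X, Y)
  + \mathcal{L}_{\mathrm{GAN}}(F, D_X, Y, X)
  + \mathcal{L}_{\mathrm{cyc}}(G, F)
\tag{1}
```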
  • The first term of equation (1) is the loss function for converting the first image data into fake image data approximating the second image data.
  • X denotes the first image data,
  • Y denotes the second image data,
  • G denotes the generation unit 45 that generates fake image data from the first image data, and
  • D_Y denotes the identification unit 46 that discriminates the fake image data from the training image data.
  • The second term of equation (1) is the loss function for converting the second image data into fake image data approximating the first image data.
  • Y denotes the second image data,
  • X denotes the first image data,
  • F denotes the generation unit 45 that generates fake image data from the second image data, and
  • D_X denotes the identification unit 46 that discriminates the fake image data from the training image data.
  • In the third term, the weights λA and λB set by the setting unit 43 are multiplied in as coefficients.
  • The third term of equation (1) is represented by the following equation (2).
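Equation (2) is likewise missing from this text. A cycle-consistency loss with per-direction weights λA and λB, matching the description of FIG. 2 that follows, would read (a reconstruction):

```latex
\mathcal{L}_{\mathrm{cyc}}(G, F)
  = \lambda_A\, \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\left[\lVert F(G(x)) - x \rVert_1\right]
  + \lambda_B\, \mathbb{E}_{y \sim p_{\mathrm{data}}(y)}\left[\lVert G(F(y)) - y \rVert_1\right]
\tag{2}
```

Here F(G(x)) corresponds to the restored first image data rec_X, and G(F(y)) to the restored second image data rec_Y.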
  • the equation (2) will be described below with reference to FIG.
  • The generation unit 45 reduces the difference between the first image data real_X and the restored first image data rec_X in proportion to the weight λA, thereby suppressing excessive conversion from the first image data real_X to the fake image data fake_Y.
  • Since the weight λA is multiplied in, the smaller λA is, the weaker this suppression becomes, the greater the degree of conversion of the first image data real_X becomes, and the more dynamically the first image data real_X is converted. The fake image data fake_Y generated by the conversion then no longer closely resembles the second image data serving as the training image data, and becomes new image data having a new design that is not easy to predict from the first image data real_X.
  • In other words, the smaller the weight λA, the more difficult the prediction, and the fake image data fake_Y becomes image data that is more difficult to predict from the first image data real_X.
  • The identification unit 46, denoted D_Y in FIG. 2, discriminates between the second image data, which is the training image data, and the fake image data fake_Y generated by the generation unit 45.
  • Similarly, the generation unit 45 reduces the difference between the second image data real_Y and the restored second image data rec_Y in proportion to the weight λB, thereby suppressing excessive conversion from the second image data real_Y to the fake image data fake_X.
  • Since the weight λB is multiplied in, the smaller λB is, the weaker this suppression becomes, the greater the degree of conversion of the second image data real_Y becomes, and the more dynamically the second image data real_Y is converted. The fake image data fake_X generated by the conversion then no longer closely resembles the first image data serving as the training image data, and becomes new image data having a new design that is not easy to predict from the second image data real_Y.
  • In other words, the smaller the weight λB, the more difficult the prediction, and the fake image data fake_X becomes image data that is more difficult to predict from the second image data real_Y.
  • The identification unit 46, denoted D_X in FIG. 2, discriminates between the first image data, which is the training image data, and the fake image data fake_X generated by the generation unit 45.
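Numerically, the effect of the weights on the cycle loss can be sketched as follows. The 2x2 arrays and their values are hypothetical; rec_x and rec_y play the roles of the restored image data in FIG. 2.

```python
import numpy as np

# Toy 2x2 "image data" (hypothetical values).
real_x = np.array([[0.2, 0.8], [0.4, 0.6]])   # first image data real_X
rec_x  = np.array([[0.3, 0.7], [0.5, 0.5]])   # restored first image data rec_X
real_y = np.array([[0.9, 0.1], [0.7, 0.3]])   # second image data real_Y
rec_y  = np.array([[0.8, 0.2], [0.6, 0.4]])   # restored second image data rec_Y

def cycle_loss(lambda_a, lambda_b):
    # L1 reconstruction differences, weighted by lambda_a and lambda_b.
    return (lambda_a * np.abs(rec_x - real_x).mean()
            + lambda_b * np.abs(rec_y - real_y).mean())

# With a small lambda_a, the difference between real_X and rec_X is penalized
# less, so the conversion of the first image data is suppressed less strongly.
strong = cycle_loss(lambda_a=10.0, lambda_b=10.0)
weak_a = cycle_loss(lambda_a=1.0,  lambda_b=10.0)
```

The smaller λA contributes less penalty for the same reconstruction error, which is exactly the weakened suppression described above.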
  • One trained model for given weights λA and λB is constructed by the learning unit 41 performing the learning process, a calculation using the loss function of equation (1), for the learning count set by the setting unit 43.
  • A trained model is constructed for each pair of values of the weights λA and λB.
  • In the present embodiment, first, second, and third trained models are constructed.
  • The first trained model is constructed with a weight λA1 and a weight λB1 smaller than λA1;
  • the second trained model is constructed with a weight λA2 and a weight λB2 equal to λA2;
  • the third trained model is constructed with a weight λA3 and a weight λB3 larger than λA3.
  • the trained model storage unit 51 stores each trained model constructed as described above as independent data.
  • the trained model is input to the trained model storage unit 51 each time the trained model is constructed as one model by the learning unit 41.
  • the trained model storage unit 51 is, for example, a memory.
  • the test image data input unit 53 inputs to the image data generation unit 47 an instruction to cause the image data generation unit 47 to read the test image data recorded in the recording unit 21.
  • the test image data is an image used when the image data generation unit 47 generates the image data.
  • the test image data is, for example, image data showing a cat's eyes such as the first image data, or image data showing a vehicle headlight such as the second image data.
  • the trained model selection unit 55 selects a trained model from the trained model storage unit 51, and inputs an instruction to the image data generation unit 47 to read the selected trained model into the image data generation unit 47.
  • test image data input unit 53 and the trained model selection unit 55 are devices for input such as a keyboard and a mouse.
  • the image data generation unit 47 accesses the trained model storage unit 51 according to the instruction from the trained model selection unit 55, and reads the trained model selected by the trained model selection unit 55 from the trained model storage unit 51. Next, the image data generation unit 47 generates new image data from the test image data using the read learned model. The generated new image data is input to the image output unit 57.
  • the image output unit 57 is, for example, a monitor.
  • the image output unit 57 outputs new image data generated by the image data generation unit 47 as an image.
  • the operation of the image generation device 10 includes a trained model generation step and an image generation step as main steps.
  • FIG. 3 is a flowchart showing the trained model generation steps.
  • Step S1: The first image data input unit 23 inputs an instruction to cause the learning unit 41 to read the plurality of first image data, and the learning unit 41 reads the plurality of first image data from the recording unit 21. Further, the second image data input unit 25 inputs an instruction to cause the learning unit 41 to read the plurality of second image data, and the learning unit 41 reads the plurality of second image data from the recording unit 21.
  • the process proceeds to step S2.
  • Step S2: The weight input unit 27 inputs the weights λA1 and λB1 to the setting unit 43, and the setting unit 43 sets λA1 and λB1 as the weights λA and λB.
  • The set weights λA1 and λB1 are input to the generation unit 45 and the identification unit 46, and the process proceeds to step S3.
  • Step S3: The learning count input unit 29 inputs the learning count of the learning unit 41 to the setting unit 43, and the setting unit 43 sets the input learning count.
  • The set learning count is input to the generation unit 45 and the identification unit 46, and the process proceeds to step S4.
  • The learning count is set to, for example, 100.
  • Step S4: The learning unit 41 checks the current learning count. If the count is less than 100, the process proceeds to step S5; otherwise, the process proceeds to step S7.
  • When the trained model generation step starts and the process first reaches step S4, the learning count is 0.
  • Step S5 In this step, the learning unit 41 shifts to the learning process described later. When the learning process is completed, the process proceeds to step S6.
  • Step S6 In this step, the learning unit 41 adds one to the current number of learnings, and the process returns to step S4.
  • Step S7: The first trained model, corresponding to the weights λA1 and λB1 set in step S2, is completed after 100 rounds of the learning process, and the completed first trained model is stored in the trained model storage unit 51.
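The control flow of steps S1 to S7, together with the per-image loop of FIG. 4 below, can be sketched as follows. The function and parameter names are hypothetical, and learning_step stands in for the calculation using equation (1).

```python
# Hypothetical sketch of the trained model generation step (FIGS. 3 and 4).
def generate_trained_model(first_images, second_images,
                           lambda_a, lambda_b, learning_count, learning_step):
    for _ in range(learning_count):                      # S4 to S6: repeat, e.g., 100 times
        # Learning process (S11 to S16): iterate over the paired image data.
        for x, y in zip(first_images, second_images):    # order i = 1 .. 14,000
            learning_step(x, y, lambda_a, lambda_b)      # S14: equation (1) calculation
    return {"lambda_a": lambda_a, "lambda_b": lambda_b}  # completed model (S7)

# Toy usage: 3 image pairs, learning count 2, a stub learning step.
calls = []
model = generate_trained_model(
    first_images=["x1", "x2", "x3"], second_images=["y1", "y2", "y3"],
    lambda_a=10.0, lambda_b=5.0, learning_count=2,
    learning_step=lambda x, y, a, b: calls.append((x, y)))
```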
  • FIG. 4 is a flowchart showing the learning process of the learning unit 41.
  • Step S11: The learning unit 41 assigns an order i to each of the first image data and each of the second image data read from the recording unit 21 in step S1. Since there are 14,000 first image data and 14,000 second image data, the order i runs from 1 to 14,000. When the order has been assigned, the process proceeds to step S12.
  • Step S12: The learning unit 41 checks the order i of the first image data and second image data to be learned. If the order i is 14,000 or less, the process proceeds to step S13. Otherwise, the learning process is regarded as having been performed on all the first image data and all the second image data with the weights λA1 and λB1 set in step S2, and the process proceeds to step S6.
  • Step S13: The learning unit 41 acquires the i-th first image data and the i-th second image data, and the process proceeds to step S14.
  • At the start of the learning process, i is set to 1.
  • Step S14: The learning unit 41 performs, for the i-th first image data and the i-th second image data, the calculation using the loss function of equation (1), and the process proceeds to step S15.
  • Step S15: Learning is performed in the learning unit 41, and the process proceeds to step S16.
  • Step S16: The learning unit 41 adds one to the current order i, and the process returns to step S12.
  • In this way, the learning process, a calculation using the loss function of equation (1), is performed 100 times on the 1st through 14,000th first image data and second image data.
  • As a result, the first trained model corresponding to the weights λA1 and λB1 is completed.
  • The second trained model, corresponding to the weights λA2 and λB2, is generated by 100 rounds of the learning process in the same manner as the first trained model.
  • Likewise, the third trained model, corresponding to the weights λA3 and λB3, is generated by 100 rounds of the learning process.
  • In this way, a trained model is generated for each pair of values of the weights λA and λB set in step S2, and each generated trained model is stored in the trained model storage unit 51.
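Generating and storing one model per weight pair can be sketched like this. The names are hypothetical; train stands in for the whole trained model generation step, and a dict plays the role of the trained model storage unit 51, keeping each model as independent data.

```python
# One trained model per (lambda_a, lambda_b) pair, kept as independent entries.
def build_model_storage(weight_pairs, train):
    storage = {}
    for lam_a, lam_b in weight_pairs:
        storage[(lam_a, lam_b)] = train(lam_a, lam_b)
    return storage

# e.g. (λA1, λB1) with λB1 < λA1, (λA2, λB2) equal, (λA3, λB3) with λA3 < λB3
storage = build_model_storage(
    [(10.0, 5.0), (10.0, 10.0), (5.0, 10.0)],
    train=lambda a, b: {"lambda_a": a, "lambda_b": b})
```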
  • each trained model is stored in the trained model storage unit 51, the process in the trained model generation step ends.
  • FIG. 5 is a flowchart showing an image generation step.
  • the image generation step is performed after a plurality of trained models have been constructed by the trained model generation step.
  • Step S21 the test image data is input from the recording unit 21 to the image data generation unit 47 by the test image data input unit 53. Further, the trained model selected by the trained model selection unit 55 is input from the trained model storage unit 51 to the image data generation unit 47.
  • the test image data is image data indicating the cat's eyes such as the first image data, and is data classified into the same category as the first image data.
  • the training image data is the second image data.
  • Step S22 the image data generation unit 47 checks the trained model input to the image data generation unit 47. If the input trained model is the first trained model, the process proceeds to step S23. If the input trained model is the second trained model, the process proceeds to step S24. If the input trained model is the third trained model, the process proceeds to step S25.
  • Step S23 the image data generation unit 47 generates new image data from the test image data using the first trained model.
  • In the first trained model, the weight λA1 is larger than the weight λB1. The conversion of the test image data is therefore strongly suppressed, and the new image data is closer to the first domain than to the second domain, while still being image data having a new design that is not easy to predict from the test image data.
  • The new image data generated in this step is image data showing a vehicle headlight that most closely resembles a cat's eyes. The new image data is then input to the image output unit 57, and the process proceeds to step S26.
  • Step S24 the image data generation unit 47 generates new image data from the test image data using the second trained model.
  • In the second trained model, since the weight λA2 is equal to the weight λB2, the new image data is intermediate image data between a cat's eyes and a vehicle headlight. The new image data is then input to the image output unit 57, and the process proceeds to step S26.
  • Step S25 the image data generation unit 47 generates new image data from the test image data using the third trained model.
  • In the third trained model, the weight λA3 is smaller than the weight λB3. The degree of conversion of the test image data therefore becomes larger, and the new image data is closer to the second domain than to the first domain, being image data having a new design that is difficult to predict from the test image data.
  • The new image data generated in this step is image data showing a vehicle headlight that approximates a cat's eyes. The new image data is then input to the image output unit 57, and the process proceeds to step S26.
  • Step S26 the image output unit 57 outputs new image data as an image, and the process in the image generation step ends.
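The image generation step (S21 to S26) reduces to selecting one stored model and applying it to the test image data. A minimal sketch with hypothetical names:

```python
# Hypothetical sketch of the image generation step (FIG. 5).
def image_generation_step(storage, selected_key, test_image, output):
    model = storage[selected_key]     # S21/S22: read the selected trained model
    new_image = model(test_image)     # S23 to S25: generate new image data
    output(new_image)                 # S26: output via the image output unit 57
    return new_image

# Toy usage with a stand-in model that just tags its input.
outputs = []
result = image_generation_step(
    storage={"first": lambda img: ("converted", img)},
    selected_key="first",
    test_image="cat_eyes.png",
    output=outputs.append)
```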
  • As described above, the image generation device 10 of the present embodiment includes a recording unit 21 that records a plurality of first image data and a plurality of second image data, and a setting unit 43 that sets a first weight for the first domain of the first image data and a second weight for the second domain of the second image data.
  • The image generation device 10 further includes a learning unit 41 that generates, from the plurality of first image data and the plurality of second image data, a trained model corresponding to the first weight and the second weight for each pair of weight values, a trained model storage unit 51 that stores the plurality of trained models, and a trained model selection unit 55 that selects one trained model from the plurality of trained models stored in the trained model storage unit 51.
  • The image generation device 10 further includes a test image data input unit 53 for inputting test image data, and an image data generation unit 47 that generates new image data from the input test image data using the trained model selected by the trained model selection unit 55.
  • The values of the weights λA and λB, which indicate the difficulty of prediction, can thus be set, a trained model is generated for each pair of values of λA and λB, and new image data is generated from the test image data using one of the plurality of trained models.
  • The degree of conversion of the test image data changes according to which trained model, corresponding to particular values of λA and λB, is used.
  • The new image data can therefore be image data that is difficult to predict from the first image data and the second image data serving as training image data, and can be given a new design. The image generation device 10 of the present embodiment can thus set the difficulty of prediction and then generate new image data having a new design that is not easy to predict from the plurality of first and second image data.
  • In the present embodiment, the setting unit 43 sets a plurality of weights λA and a plurality of weights λB,
  • and the learning unit 41 generates a plurality of trained models corresponding to the plurality of weights λA and the plurality of weights λB.
  • The image data generation unit 47 generates new image data using one trained model from among the plurality of trained models. By generating a plurality of trained models, a greater variety of new image data can be generated than when only one trained model is generated.
  • the image data generation unit 47 may be the generation unit 45 learned in the learning unit 41.
  • the learning unit 41 may provide the generation unit 45 learned in the learning process to the image data generation unit 47.
  • The learning count set in step S3 may be set individually for each trained model to be constructed. For example, the learning count used when constructing the first trained model may be equal to, greater than, or less than the learning count used when constructing the other trained models. The larger the learning count of a trained model, the more easily new image data having a new design that is not easy to predict can be generated from the plurality of first and second image data when that model is used. Conversely, the smaller the learning count, the faster the trained model can be generated. Further, although the learning unit 41 generates three trained models in the present embodiment, the number is not limited to this, and at least one trained model may be generated.
  • the learning unit 41 generates a trained model according to the Cycle GAN method, but is not limited to this.
  • the setting unit 43 sets the weights input from the weight input unit 27 as the weights λA and λB, but is not limited to this.
  • the setting unit 43 may instead set weights preset in the memory of the control unit 40 as the weights λA and λB.
  • the setting unit 43 sets the value input from the learning count input unit 29 as the number of training iterations, but is not limited to this.
  • the setting unit 43 may instead set a value preset in the memory of the control unit 40 as the number of training iterations.
  • each first image data is described as image data showing the eyes of a cat, but it is not limited to this, and image data showing the eyes of other animals may be used.
  • an image generation device is provided that can, after setting the difficulty of prediction, generate image data having a new design quality that is not easy to predict from a plurality of image data; the image generation device can be used in the field of image generation and the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Image Analysis (AREA)

Abstract

An image generation device (10) comprises: a storage unit (21); a setting unit (43) that sets a first weight of a first domain of first image data and a second weight of a second domain of second image data; a learning unit (41) that, for each pair of first-weight and second-weight values, generates, from a plurality of pieces of first image data and a plurality of pieces of second image data, a trained model corresponding to the first weight and the second weight; a trained model storage unit (51); and a trained model selection unit (55). The image generation device (10) further comprises a test image data input unit (53) and an image data generation unit (47) that uses the selected trained model to generate new image data from test image data.

Description

Image generation device
The present invention relates to an image generation device.
In recent years, image processing technology has advanced, and morphing is one image processing technique attracting attention. Morphing generates intermediate image data that connects first image data, which is a source image, and second image data, which is a target image, so that the first image data changes smoothly into the second image data. Patent Document 1 discloses an image generation device that generates intermediate image data by morphing.
Japanese Unexamined Patent Publication No. 2001-076177
When the intermediate image data generated by morphing is used as new image data, the morphing process does not set the difficulty of predicting the new image data generated from the first image data and the second image data. There is therefore a concern that the new image data can be easily predicted from the first image data and the second image data. Further, because the intermediate image data lies between the first image data and the second image data, the new image data is likely to visually evoke the first image data or the second image data. Accordingly, there is a demand for generating, after setting the difficulty of prediction, image data having a new design quality that is not easy to predict from the first image data and the second image data.
An object of the present invention is therefore to provide an image generation device that can, after setting the difficulty of prediction, generate image data having a new design quality that is not easy to predict from a plurality of image data.
To solve the above problem, the image generation device of the present invention comprises: a recording unit that records a plurality of first image data and a plurality of second image data; a setting unit that sets a first weight for a first domain of the first image data and a second weight for a second domain of the second image data; a learning unit that generates, from the plurality of first image data and the plurality of second image data, a trained model corresponding to the first weight and the second weight, for each pair of first-weight and second-weight values; a trained model storage unit that stores the plurality of trained models; a trained model selection unit that selects one trained model from the plurality of trained models stored in the trained model storage unit; a test image data input unit that inputs test image data; and an image data generation unit that generates new image data from the test image data input from the test image data input unit, using the trained model selected by the trained model selection unit.
In the image generation device of the present invention, the values of the first weight and the second weight, which indicate the difficulty of prediction, can be set; a trained model is generated for each pair of first-weight and second-weight values; and new image data is generated from test image data using one trained model selected from the plurality of trained models. When the new image data is generated, the degree to which the test image data is converted varies with the trained model corresponding to the first-weight and second-weight values. As the degree of conversion varies, the new image data can become image data that is difficult to predict from the first image data and the second image data, and can be given a new design quality. Therefore, the image generation device of the present invention can, after setting the difficulty of prediction, generate new image data having a new design quality that is not easy to predict from the plurality of first and second image data.
In the image generation device of the present invention, the learning unit may generate the trained model according to the Cycle GAN method.
In the image generation device of the present invention, the learning unit may perform the calculation used in the Cycle GAN method for the number of training iterations set for each trained model, thereby generating each trained model. With this configuration, the larger the number of iterations, the more easily the resulting trained model generates, from the plurality of first and second image data, new image data having a new design quality that is not easy to predict. Conversely, the smaller the number of iterations, the faster the trained model can be generated.
The image generation device of the present invention may further comprise an output unit that outputs the new image data.
As described above, the present invention provides an image generation device that can, after setting the difficulty of prediction, generate image data having a new design quality that is not easy to predict from a plurality of image data.
FIG. 1 is a block diagram of an image generation device according to an embodiment of the present invention.
FIG. 2 is a diagram for explaining the third term of the loss function of the learning unit.
FIG. 3 is a flowchart showing the trained model generation step.
FIG. 4 is a flowchart showing the learning process in the trained model generation step.
FIG. 5 is a flowchart showing the image generation step.
Hereinafter, preferred embodiments of the image generation device according to the present invention will be described in detail with reference to the drawings. The embodiments illustrated below are intended to facilitate understanding of the present invention and are not intended to limit its interpretation. The present invention can be modified and improved without departing from its spirit, and components of the embodiments illustrated below may be combined as appropriate. For ease of understanding, parts of each figure may be exaggerated.
FIG. 1 is a block diagram of the image generation device 10 of the present embodiment. The image generation device 10 generates a trained model from a plurality of image data according to the Cycle GAN method, a variant of the generative adversarial network (GAN) approach, and generates new image data using the generated trained model. The image generation device 10 of the present embodiment illustrates an example in which a trained model is generated from the respective domains of two kinds of image data. A domain denotes a feature of image data.
The image generation device 10 comprises a recording unit 21, a first image data input unit 23, a second image data input unit 25, a weight input unit 27, a learning count input unit 29, a control unit 40 including a learning unit 41 and an image data generation unit 47, a trained model storage unit 51, a test image data input unit 53, a trained model selection unit 55, and an image output unit 57. Each block of the image generation device 10 may be implemented in hardware, in software, or in a combination of the two.
The recording unit 21 records a plurality of first image data and a plurality of second image data. The pieces of first image data look similar when output as images (for example, still images), although they are not necessarily identical; the same applies to the pieces of second image data. For example, when each piece of first image data is an image of a cat's eye, the pieces of first image data all belong to the same category, namely cat's eyes. Likewise, when each piece of second image data is an image of a vehicle headlamp, the pieces of second image data all belong to the same category, namely vehicle headlamps, which differs from the category of the first image data. The plurality of first image data has a first domain, and the plurality of second image data has a second domain. When the first image data shows a cat's eye, the first domain indicates, for example, the size and shape of the eye; when the second image data shows a vehicle headlamp, the second domain indicates, for example, the size and shape of the headlamp. In the image generation device 10 of the present embodiment, for example, there are 14,000 pieces of first image data and 14,000 pieces of second image data. The recording unit 21 also records training image data and test image data, described later. The recording unit 21 is, for example, a memory.
The first image data input unit 23 inputs, to the learning unit 41 of the control unit 40, an instruction that causes the learning unit 41 to read the plurality of first image data recorded in the recording unit 21.
The second image data input unit 25 inputs, to the learning unit 41, an instruction that causes the learning unit 41 to read the plurality of second image data recorded in the recording unit 21.
The weight input unit 27 inputs, to the setting unit 43 (described later) of the learning unit 41, the weight λA of the first domain of the first image data and the weight λB of the second domain of the second image data, both used during learning by the learning unit 41. The weights λA and λB indicate the difficulty of prediction, described later, and their values can be set as appropriate by the user.
The learning count input unit 29 inputs the number of training iterations of the learning unit 41, described later, to the setting unit 43.
The first image data input unit 23, the second image data input unit 25, the weight input unit 27, and the learning count input unit 29 are input devices such as a keyboard and a mouse.
The control unit 40 comprises a CPU (Central Processing Unit) and a memory. The CPU reads and executes a control program recorded in the memory, whereby the control unit 40 comprehensively controls the operation of the image generation device 10.
Next, the learning unit 41 of the control unit 40 will be described.
The learning unit 41 has a setting unit 43.
The setting unit 43 sets the weights λA and λB input from the weight input unit 27 and supplies them to the generation unit 45 and the identification unit 46 (both described later) of the learning unit 41. The setting unit 43 also sets the number of training iterations input from the learning count input unit 29 and supplies it to the generation unit 45 and the identification unit 46.
The learning unit 41 also has a generation unit 45 and an identification unit 46, which together form a neural network for machine learning.
Next, the fake image data and the training image data used by the generation unit 45 and the identification unit 46 will be described. In the following, "learning unit 41" mainly refers to both the generation unit 45 and the identification unit 46.
Fake image data is "fake" data obtained by converting a given piece of image data so as to approximate the training image data. Training image data is "real" data that serves as the reference for improving the accuracy of the fake image data so that it approximates the training image data. "Approximate" here refers to the appearance when the image data is output as an image (for example, a still image). For example, when the given image data is first image data, the training image data is second image data, and the fake image data is fake second image data obtained by converting the first image data so as to approximate the second image data. Conversely, when the given image data is second image data, the training image data is first image data, and the fake image data is fake first image data obtained by converting the second image data so as to approximate the first image data.
The generation unit 45 reads the given image data from the recording unit 21, converts it, and thereby generates fake image data. The fake image data is input to the identification unit 46.
The identification unit 46 discriminates between the fake image data input from the generation unit 45 and the training image data read from the recording unit 21.
When the fake image data does not approximate the training image data, the identification unit 46 calculates information on the discrepancy between the fake image data and the training image data and outputs this information to the generation unit 45. The generation unit 45 then reads, from the recording unit 21, a piece of image data different from the one previously read and, based on the information from the identification unit 46, converts it to a different degree to generate another piece of fake image data. This fake image data is again input to the identification unit 46, which discriminates between it and the training image data. By alternately repeating the generation by the generation unit 45 and the discrimination by the identification unit 46, the two units compete, and as a result both deepen their learning. As the learning deepens, the generation unit 45 becomes able to generate fake image data that approximates the training image data. Once the fake image data approximates the training image data, the identification unit 46 no longer outputs information to the generation unit 45, and the generation unit 45 stops generating fake image data. In the learning unit 41 of the present embodiment, the Cycle GAN method is used in this process of alternating competition, so the generation unit 45 and the identification unit 46 perform generation and discrimination both when the image data is first image data and when it is second image data, as described below.
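The alternating generate/discriminate loop described above can be sketched as follows. This is a minimal illustrative stand-in, not the patent's implementation: the "generation unit" here is a single scalar parameter, the "identification unit" merely reports the mean discrepancy between fake and real data, and real neural networks (units 45 and 46) are assumed in their place.

```python
import numpy as np

# Toy stand-ins for the generation unit 45 and identification unit 46.
rng = np.random.default_rng(0)
training_data = rng.normal(loc=5.0, scale=1.0, size=1000)  # "real" training image data

gen_param = 0.0  # the generator's only parameter; unit 45 is really a neural network

def generate(param, n=1000):
    # Generation unit: produces "fake" data from its current parameter.
    return rng.normal(loc=param, scale=1.0, size=n)

def discrepancy(fake, real):
    # Identification unit: reports how far the fake data is from the real data.
    return float(np.mean(real) - np.mean(fake))

# Alternate generation and discrimination until the fake data is "close enough",
# at which point the identification unit stops feeding back information.
for _ in range(200):
    fake = generate(gen_param)
    info = discrepancy(fake, training_data)
    if abs(info) < 0.05:
        break                   # fake approximates real: no more feedback
    gen_param += 0.5 * info     # generator updates using the feedback
```

After the loop, `gen_param` has been pulled toward the mean of the training data, illustrating how the generator's output drifts toward the training distribution through repeated feedback.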
The Cycle GAN method is represented by the following loss function, equation (1). In equation (1), a special loss function called the "cycle loss", the third term, is added to the adversarial loss functions of the first and second terms.

  L(G, F, D_X, D_Y) = L_GAN(G, D_Y, X, Y) + L_GAN(F, D_X, Y, X) + L_cyc(G, F)   (1)
The first term of equation (1) is the loss function for converting the first image data into fake image data approximating the second image data. In the first term, X denotes the first image data, Y denotes the second image data, G denotes the generation unit 45 that generates fake image data from the first image data, and D_Y denotes the identification unit 46 that discriminates between the training image data and the fake image data.
The second term of equation (1) is the loss function for converting the second image data into fake image data approximating the first image data. In the second term, Y denotes the second image data, X denotes the first image data, F denotes the generation unit 45 that generates fake image data from the second image data, and D_X denotes the identification unit 46 that discriminates between the training image data and the fake image data.
The weights λA and λB set by the setting unit 43 are multiplied as coefficients into the third term of equation (1). The third term of equation (1) is represented by the following equation (2).

  L_cyc(G, F) = λA · E_x[‖F(G(x)) − x‖_1] + λB · E_y[‖G(F(y)) − y‖_1]   (2)
Equation (2) will now be explained with reference to FIG. 2.
In the first term of equation (2), the generation unit 45 denoted G in FIG. 2 acts as a first main body and generates, from the first image data real_X, fake image data fake_Y approximating the second image data, which is the training image data; the generation unit 45 denoted F in FIG. 2 acts as a first restoration unit and restores the fake image data fake_Y to first image data rec_X. In the first term of equation (2), the generation unit 45 uses the weight λA to reduce the difference between the first image data real_X and the restored first image data rec_X, thereby suppressing excessive conversion of the first image data real_X into the fake image data fake_Y. Because the first term of equation (2) is multiplied by λA, the smaller λA is, the weaker this suppression becomes, the greater the degree of conversion of real_X becomes, and the more dynamically real_X is converted. The fake image data fake_Y generated by the conversion then resembles the second image data (the training image data) even less, and becomes new image data having a new design quality that is difficult to predict from real_X. Conversely, the larger λA is, the stronger the suppression becomes, and real_X is converted less dynamically than when λA is small. The fake image data fake_Y generated by the conversion then does not resemble the second image data, and becomes new image data having a new design quality that is not easy to predict from real_X. In short, the smaller λA is, the higher the difficulty of prediction becomes, and fake_Y becomes image data that is even more difficult to predict from real_X; with a larger λA, fake_Y is still image data that is difficult to predict from real_X, though less so. The difference between the first image data real_X and the restored first image data rec_X is a reconstruction error called the cycle-consistency loss. The identification unit 46 denoted D_Y in FIG. 2 discriminates between the fake image data fake_Y generated by the generation unit 45 and the second image data, which is the training image data.
In the second term of equation (2), the generation unit 45 denoted F in FIG. 2 acts as a second main body and generates, from the second image data real_Y, fake image data fake_X approximating the first image data, which is the training image data; the generation unit 45 denoted G in FIG. 2 acts as a second restoration unit and restores the fake image data fake_X to second image data rec_Y. In the second term of equation (2), the generation unit 45 uses the weight λB to reduce the difference between the second image data real_Y and the restored second image data rec_Y, thereby suppressing excessive conversion of the second image data real_Y into the fake image data fake_X. Because the second term of equation (2) is multiplied by λB, the smaller λB is, the weaker this suppression becomes, the greater the degree of conversion of real_Y becomes, and the more dynamically real_Y is converted. The fake image data fake_X generated by the conversion then resembles the first image data (the training image data) even less, and becomes new image data having a new design quality that is difficult to predict from real_Y. Conversely, the larger λB is, the stronger the suppression becomes, and real_Y is converted less dynamically than when λB is small. The fake image data fake_X generated by the conversion then does not resemble the first image data, and becomes new image data having a new design quality that is not easy to predict from real_Y. In short, the smaller λB is, the higher the difficulty of prediction becomes, and fake_X becomes image data that is even more difficult to predict from real_Y; with a larger λB, fake_X is still image data that is difficult to predict from real_Y, though less so. The difference between the second image data real_Y and the restored second image data rec_Y is a reconstruction error called the cycle-consistency loss. The identification unit 46 denoted D_X in FIG. 2 discriminates between the fake image data fake_X generated by the generation unit 45 and the first image data, which is the training image data.
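Equations (1) and (2) above can be written out directly for array-valued toy generators and discriminators. This is an illustrative sketch under stated assumptions, not the patent's code: the adversarial terms use the standard GAN cross-entropy form, and the cycle term is the λA/λB-weighted L1 reconstruction error.

```python
import numpy as np

def gan_loss(D, real, fake):
    # Adversarial term: E[log D(real)] + E[log(1 - D(fake))]
    return np.mean(np.log(D(real))) + np.mean(np.log(1.0 - D(fake)))

def cycle_loss(G, F, X, Y, lam_A, lam_B):
    # Equation (2): lam_A * E[|F(G(x)) - x|_1] + lam_B * E[|G(F(y)) - y|_1]
    return (lam_A * np.mean(np.abs(F(G(X)) - X))
            + lam_B * np.mean(np.abs(G(F(Y)) - Y)))

def total_loss(G, F, D_X, D_Y, X, Y, lam_A, lam_B):
    # Equation (1): two adversarial terms plus the cycle-consistency term
    return (gan_loss(D_Y, Y, G(X))
            + gan_loss(D_X, X, F(Y))
            + cycle_loss(G, F, X, Y, lam_A, lam_B))

# Example: identity "generators" reconstruct perfectly, so the cycle term is 0
# regardless of lam_A and lam_B, and only the adversarial terms remain.
X = np.ones((4, 8))                      # stands in for first image data
Y = np.zeros((4, 8))                     # stands in for second image data
ident = lambda a: a                      # G and F as identity maps
half = lambda a: np.full_like(a, 0.5)    # undecided discriminator: D(.) = 0.5
loss = total_loss(ident, ident, half, half, X, Y, lam_A=10.0, lam_B=5.0)
```

With perfect reconstruction the λA and λB terms vanish; raising either weight only matters once F(G(x)) or G(F(y)) deviates from its input, which is exactly the suppression described above.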
In the image generation device 10 of the present embodiment, the learning unit 41 performs the learning process, namely the calculation using the loss function of equation (1), for the number of training iterations set by the setting unit 43, thereby constructing one trained model for the weights λA and λB. Such a trained model is constructed for each pair of λA and λB values.
 In the image generation device 10 of the present embodiment, an example is shown in which first, second, and third trained models are constructed. The first trained model is constructed with a weight λA1 and a weight λB1 smaller than λA1, the second trained model is constructed with a weight λA2 and a weight λB2 equal to λA2, and the third trained model is constructed with a weight λA3 and a weight λB3 larger than λA3.
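The three settings just described can be summarized as a small lookup table; the numeric values below are purely illustrative, since the text fixes only the orderings λA1 > λB1, λA2 = λB2, and λA3 < λB3.

```python
# Illustrative weight settings for the three trained models; only the
# orderings between lam_a and lam_b reflect the embodiment.
MODEL_WEIGHTS = {
    "first":  {"lam_a": 10.0, "lam_b": 1.0},   # λA1 > λB1
    "second": {"lam_a": 5.0,  "lam_b": 5.0},   # λA2 = λB2
    "third":  {"lam_a": 1.0,  "lam_b": 10.0},  # λA3 < λB3
}
```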
 Here, returning to FIG. 1, the description of each block of the image generation device 10 is continued.
 The trained model storage unit 51 stores each trained model constructed as described above as independent data. Each trained model is input to the trained model storage unit 51 every time it is constructed as one model by the learning unit 41. The trained model storage unit 51 is, for example, a memory.
 The test image data input unit 53 inputs, to the image data generation unit 47, an instruction to read the test image data recorded in the recording unit 21. The test image data is the image used when the image data generation unit 47 generates image data. The test image data is, for example, image data showing a cat's eye, like the first image data, or image data showing a vehicle headlight, like the second image data.
 The trained model selection unit 55 selects a trained model from the trained model storage unit 51 and inputs, to the image data generation unit 47, an instruction to read the selected trained model.
 The test image data input unit 53 and the trained model selection unit 55 are input devices such as a keyboard and a mouse.
 The image data generation unit 47 accesses the trained model storage unit 51 in response to the instruction from the trained model selection unit 55 and reads the selected trained model from the trained model storage unit 51. Next, the image data generation unit 47 generates new image data from the test image data using the read trained model. The generated new image data is input to the image output unit 57.
 The image output unit 57 is, for example, a monitor. The image output unit 57 outputs the new image data generated by the image data generation unit 47 as an image.
 Next, the operation of the image generation device 10 will be described. The operation of the image generation device 10 includes, as its main steps, a trained model generation step and an image generation step.
 FIG. 3 is a flowchart showing the trained model generation step.
 (Step S1)
 In this step, the first image data input unit 23 inputs, to the learning unit 41, an instruction to read the plurality of first image data, and the learning unit 41 reads the plurality of first image data from the recording unit 21. Similarly, the second image data input unit 25 inputs, to the learning unit 41, an instruction to read the plurality of second image data, and the learning unit 41 reads the plurality of second image data from the recording unit 21. When the learning unit 41 has read the plurality of first image data and the plurality of second image data, the process proceeds to step S2.
 (Step S2)
 In this step, the weight input unit 27 inputs the weights λA1 and λB1 to the setting unit 43, and the setting unit 43 sets them as the weights λA and λB. The set weights λA1 and λB1 are input to the generation unit 45 and the identification unit 46, and the process proceeds to step S3.
 (Step S3)
 In this step, the learning count input unit 29 inputs the learning count of the learning unit 41 to the setting unit 43, and the setting unit 43 sets the input value as the learning count. The set learning count is input to the generation unit 45 and the identification unit 46, and the process proceeds to step S4. Here, the learning count is set to, for example, 100.
 (Step S4)
 In this step, the learning unit 41 checks the current learning count. If the learning count is less than 100, the process proceeds to step S5; otherwise, the process proceeds to step S7. When the trained model generation step has just started and the process reaches step S4 for the first time, the learning count is 0.
 (Step S5)
 In this step, the learning unit 41 performs the learning process described later. When the learning process is completed, the process proceeds to step S6.
 (Step S6)
 In this step, the learning unit 41 increments the current learning count by one, and the process returns to step S4.
 (Step S7)
 At this step, the first trained model corresponding to the weights λA1 and λB1 set in step S2 has been completed by 100 iterations of the learning process, and the completed first trained model is stored in the trained model storage unit 51.
 Next, the learning process performed by the learning unit 41 in step S5 will be described. FIG. 4 is a flowchart showing the learning process of the learning unit 41.
 (Step S11)
 In this step, the learning unit 41 assigns an index i to each of the first image data and each of the second image data read from the recording unit 21 in step S1. As described above, since there are 14,000 first image data and 14,000 second image data, the index i ranges from 1 to 14,000. When the indices have been assigned, the process proceeds to step S12.
 (Step S12)
 In this step, the learning unit 41 checks the index i of the first image data and the second image data to be processed next. If the index i is less than 14,000, the process proceeds to step S13. If the index i is not less than 14,000, the learning process has been performed on all the first image data and all the second image data with the weights λA1 and λB1 set in step S2, and the process proceeds to step S6.
 (Step S13)
 In this step, the learning unit 41 acquires the i-th first image data and the i-th second image data, and the process proceeds to step S14. When the learning process has just started and the process reaches step S13 for the first time, i is set to 1.
 (Step S14)
 In this step, the learning unit 41 performs, on the i-th first image data and the i-th second image data, the calculation using the loss function expressed by equation (1), and the process proceeds to step S15.
 (Step S15)
 In this step, learning is performed in the learning unit 41, and the process proceeds to step S16.
 (Step S16)
 In this step, the learning unit 41 increments the current index i by one, and the process returns to step S12.
 In the processes shown in FIGS. 3 and 4, with the weights λA and λB set to λA1 and λB1, the learning process, that is, the calculation using the loss function expressed by equation (1) on the 1st through 14,000th first image data and second image data, is performed 100 times. When the learning process has been performed 100 times, the first trained model corresponding to the weights λA1 and λB1 is complete.
 After the first trained model is completed, when the weights λA and λB are set to λA2 and λB2 in step S2, the second trained model corresponding to λA2 and λB2 is generated by 100 iterations of the learning process, in the same manner as the first trained model. Likewise, when the weights λA and λB are set to λA3 and λB3, the third trained model corresponding to λA3 and λB3 is generated by 100 iterations of the learning process. A trained model is thus generated for each pair of values of the weights λA and λB set in step S2, and each generated trained model is stored in the trained model storage unit 51. When all the trained models have been stored in the trained model storage unit 51, the trained model generation step ends.
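The two flowcharts combine into the following training-loop skeleton, with one call per (λA, λB) pair; `learning_step` is merely a placeholder for the loss evaluation and update of steps S14 and S15, and all names are illustrative rather than taken from the patent.

```python
def learning_step(model, x, y):
    """Placeholder for one evaluation of the equation-(1) loss and one update
    of the generation unit 45 / identification unit 46 (steps S14 and S15)."""
    model["updates"] += 1

def train_model(first_images, second_images, lam_a, lam_b, learning_count=100):
    """Skeleton of the trained-model generation step (FIGS. 3 and 4)."""
    model = {"lam_a": lam_a, "lam_b": lam_b, "updates": 0}
    for _ in range(learning_count):                   # S4-S6: repeat learning_count times
        for i in range(len(first_images)):            # S11-S12: index i over all pairs
            x, y = first_images[i], second_images[i]  # S13: acquire the i-th pair
            learning_step(model, x, y)                # S14-S15: loss calculation + learning
    return model                                      # S7: the completed trained model
```

Calling `train_model` once for each weight pair and storing the results yields the per-(λA, λB) models held by the trained model storage unit 51.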
 Next, the image generation step will be described with reference to FIG. 5. FIG. 5 is a flowchart showing the image generation step. The image generation step is performed after the plurality of trained models have been constructed in the trained model generation step.
 (Step S21)
 In this step, the test image data is input from the recording unit 21 to the image data generation unit 47 by the test image data input unit 53. The trained model selected by the trained model selection unit 55 is also input from the trained model storage unit 51 to the image data generation unit 47. Here, the test image data is image data showing a cat's eye, like the first image data, and belongs to the same category as the first image data. The training image data is the second image data.
 (Step S22)
 In this step, the image data generation unit 47 checks the trained model that has been input to it. If the input trained model is the first trained model, the process proceeds to step S23; if it is the second trained model, to step S24; and if it is the third trained model, to step S25.
 (Step S23)
 In this step, the image data generation unit 47 generates new image data from the test image data using the first trained model. In the first trained model, the weight λA1 is larger than the weight λB1. The degree of conversion of the test image data is therefore large, and the new image data is closer to the first domain than to the second domain and has a new design that is not easy to predict from the test image data. The new image data generated in this step is image data showing a vehicle headlight that most closely resembles a cat's eye. The new image data is then input to the image output unit 57, and the process proceeds to step S26.
 (Step S24)
 In this step, the image data generation unit 47 generates new image data from the test image data using the second trained model. In the second trained model, the weight λA2 is equal to the weight λB2, so the new image data is intermediate between a cat's eye and a vehicle headlight. The new image data is then input to the image output unit 57, and the process proceeds to step S26.
 (Step S25)
 In this step, the image data generation unit 47 generates new image data from the test image data using the third trained model. In the third trained model, the weight λA3 is smaller than the weight λB3. The degree of conversion of the test image data is therefore larger, and the new image data is closer to the second domain than to the first domain and has a new design that is difficult to predict from the test image data. The new image data generated in this step is image data showing a vehicle headlight that resembles a cat's eye. The new image data is then input to the image output unit 57, and the process proceeds to step S26.
 (Step S26)
 In this step, the image output unit 57 outputs the new image data as an image, and the image generation step ends.
 As described above, the image generation device 10 of the present embodiment includes a recording unit 21 that records a plurality of first image data and a plurality of second image data, and a setting unit 43 that sets a first weight for the first domain of the first image data and a second weight for the second domain of the second image data. The image generation device 10 further includes a learning unit 41 that generates, from the plurality of first image data and the plurality of second image data, a trained model corresponding to the first weight and the second weight for each pair of values of the first weight and the second weight, a trained model storage unit 51 that stores the plurality of trained models, and a trained model selection unit 55 that selects one trained model from the plurality of trained models stored in the trained model storage unit 51. The image generation device 10 further includes a test image data input unit 53 that inputs test image data, and an image data generation unit 47 that generates new image data from the test image data input from the test image data input unit 53, using the trained model selected by the trained model selection unit 55.
 In the image generation device 10 of the present embodiment, the values of the weights λA and λB, which indicate the difficulty of prediction, can be set; a trained model is generated for each pair of values of λA and λB, and new image data is generated from the test image data using one of the plurality of trained models. When the new image data is generated, the degree of conversion of the test image data changes according to the trained model corresponding to the values of λA and λB. When the degree of conversion changes, the new image data can become image data that is difficult to predict from the first image data and the second image data, which are the training image data, and can have a new design. Therefore, the image generation device 10 of the present embodiment can, after the difficulty of prediction has been set, generate new image data having a new design that is not easy to predict from the plurality of first and second image data.
 Further, in the image generation device 10 of the present embodiment, the setting unit 43 sets a plurality of weights λA and a plurality of weights λB, and the learning unit 41 generates a plurality of trained models corresponding to them. The image data generation unit 47 then generates new image data using one of the plurality of trained models. Because a plurality of trained models are generated, a greater variety of new image data can be generated than when only one trained model is generated.
 The present invention has been described above by taking the above embodiment as an example, but the present invention is not limited thereto.
 The image data generation unit 47 may be the generation unit 45 trained by the learning unit 41. Alternatively, the learning unit 41 may provide the generation unit 45 trained in the learning process to the image data generation unit 47.
 The learning count set in step S3 may be set individually for each trained model to be constructed. For example, the learning count for constructing the first trained model may be the same as, greater than, or smaller than the learning count for constructing the other trained models. The larger the learning count of the model used, the more easily new image data having a new design that is not easy to predict from the plurality of first and second image data can be generated; the smaller the learning count, the faster a trained model can be generated. Although the learning unit 41 generates three trained models, it is not limited to this and may generate at least one trained model.
 The learning unit 41 generates the trained models according to the Cycle GAN method, but is not limited to this.
 The setting unit 43 sets the weights input from the weight input unit 27 as the weights λA and λB, but is not limited to this. For example, the setting unit 43 may set weights stored in advance in the memory of the control unit 40 as the weights λA and λB. Similarly, the setting unit 43 sets the value input from the learning count input unit 29 as the learning count, but is not limited to this. For example, the setting unit 43 may set a value stored in advance in the memory of the control unit 40 as the learning count.
 Each first image data has been described as image data showing a cat's eye, but is not limited to this and may be, for example, image data showing the eyes of another animal.
 According to the present invention, an image generation device is provided that can, after the difficulty of prediction has been set, generate image data having a new design that is not easy to predict from a plurality of image data. The image generation device can be used in the field of image generation and the like.

Claims (4)

  1.  An image generation device comprising:
     a recording unit that records a plurality of first image data and a plurality of second image data;
     a setting unit that sets a first weight for a first domain of the first image data and a second weight for a second domain of the second image data;
     a learning unit that generates, from the plurality of first image data and the plurality of second image data, a trained model corresponding to the first weight and the second weight for each pair of values of the first weight and the second weight;
     a trained model storage unit that stores a plurality of the trained models;
     a trained model selection unit that selects one trained model from the plurality of trained models stored in the trained model storage unit;
     a test image data input unit that inputs test image data; and
     an image data generation unit that generates new image data from the test image data input from the test image data input unit, using the trained model selected by the trained model selection unit.
  2.  The image generation device according to claim 1, wherein the learning unit generates the trained models according to a Cycle GAN method.
  3.  The image generation device according to claim 2, wherein the learning unit generates each trained model by performing the calculation used in the Cycle GAN method the number of learning times set for that trained model.
  4.  The image generation device according to any one of claims 1 to 3, further comprising an output unit that outputs the new image data.
PCT/JP2020/043904 2019-11-27 2020-11-25 Image generation device WO2021106961A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2021561466A JPWO2021106961A1 (en) 2019-11-27 2020-11-25

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2019-214201 2019-11-27
JP2019214201 2019-11-27

Publications (1)

Publication Number Publication Date
WO2021106961A1 true WO2021106961A1 (en) 2021-06-03


Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019093126A * 2017-11-24 2019-06-20 Canon Medical Systems Corporation Medical data processor, magnetic resonance imaging device and learned model creation method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SUGIURA, AKIHIKO,: "A simple test of pediatric depression using facial expressions cognition", vol. 20, no. 6, June 2009 (2009-06-01), pages 8 - 31 *
