CN111144563A - Method for training a generative adversarial network based on dual distance loss - Google Patents

Method for training a generative adversarial network based on dual distance loss

Info

Publication number
CN111144563A
Authority
CN
China
Prior art keywords
training
dual distance
distance loss
generator
distribution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201911345589.8A
Other languages
Chinese (zh)
Inventor
宋艳枝
彭程
王昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hefei Liman Information Technology Co ltd
Original Assignee
Hefei Liman Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hefei Liman Information Technology Co., Ltd.
Priority to CN201911345589.8A
Publication of CN111144563A
Withdrawn

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/02: Neural networks
    • G06N3/08: Learning methods
    • G06N3/082: Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06N: COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00: Computing arrangements based on biological models
    • G06N3/02: Neural networks
    • G06N3/04: Architecture, e.g. interconnection topology
    • G06N3/045: Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a method for training a generative adversarial network based on dual distance loss, and relates to the technical field of deep learning neural networks. The invention comprises the following steps. Step S1: acquire a data set of the target distribution and preprocess it. Step S2: set the structures and parameters of the generator and discriminator neural networks and the learning rate used during training. Step S3: calculate a dual distance loss function from the parameters of the neural networks, and train the generator to generate the true distribution by stochastic gradient descent based on the dual distance loss function. The invention obtains more accurate results for the same number of iteration steps, improves training quality, and is more cost-effective.

Description

Method for training a generative adversarial network based on dual distance loss
Technical Field
The invention belongs to the technical field of deep learning neural networks, and in particular relates to a method for training a generative adversarial network based on dual distance loss.
Background
A generative adversarial network is a type of neural network in which a discriminator and a generator are trained alternately, each against the other, in order to sample from a complex probability distribution, for example to generate images, text, or speech.
When the initial generator and discriminator are random, it is difficult to determine whether training on the given data will make them converge to an ideal solution. Although it can be shown that, under some strong assumptions, the generator and discriminator converge to a local Nash equilibrium, many generative adversarial network algorithms do not converge globally.
Disclosure of Invention
The invention aims to provide a method for training a generative adversarial network based on dual distance loss. The method preprocesses a data set drawn from the target distribution; sets the structures and parameters of the generator and discriminator networks and the learning rate used during training; calculates a dual distance loss function from the parameters of the neural networks; and trains the generator to generate the true distribution by stochastic gradient descent based on the dual distance loss function. This addresses the problems that existing generative adversarial network algorithms cannot achieve global convergence and that their training results are not sufficiently accurate.
In order to solve the above technical problems, the invention is realized by the following technical scheme:
The invention relates to a method for training a generative adversarial network based on dual distance loss, comprising the following steps:
Step S1: acquire a data set of the target distribution and preprocess it;
Step S2: set the structures and parameters of the generator and discriminator neural networks and the learning rate used during training;
Step S3: calculate a dual distance loss function from the parameters of the neural networks, and train the generator to generate the true distribution by stochastic gradient descent based on the dual distance loss function.
Preferably, after step S3, the empirical dual distance between the target true distribution and the generated distribution is calculated:
[formula rendered as an image in the original]
where x_i is a sample point of the true distribution, z_i is a sample point of the Gaussian distribution, m and n are positive integers, f is the discriminator, g is the generator, and 𝓕 and 𝓖 denote the spaces of discriminators and generators, respectively.
Preferably, the generative adversarial network training method calculates perturbation points and then uses them to determine the dual distance loss function and the optimization direction, comprising the following steps:
Initialization step: process the target data set; given a discriminator f_0 and a generator g_0 in the initial state, set the parameters γ ∈ (0, 2) and k = 0, and fix two positive integers m and n;
Random selection of data points: select m points from the target data set, denoted {x_1, ..., x_m}, and select n points from the specified Gaussian noise, denoted {z_1, ..., z_n};
Perturbation point calculation step: for the given generator g_k and discriminator f_k, consider the objective [image in the original] and calculate the perturbation points f̂_k and ĝ_k;
Optimization direction calculation step: consider the subgradients of the function [image in the original], denoted [images in the original], and take the corresponding optimization directions d_k;
Update step: compute the updated discriminator and generator [images in the original] with step size τ_k = γE_k/‖d_k‖².
Preferably, the generative adversarial network training method satisfies, with probability at least 1 − 3δ:
[generalization bound, rendered as an image in the original]
the invention has the following beneficial effects:
the invention carries out preprocessing by acquiring a data set of target distribution, sets the structures, parameters and learning rate in the training process of a generator and a discriminator network, calculates a dual distance loss function according to the parameters of an applied neural network, trains the generator to generate real distribution by adopting a random gradient descent method based on the dual distance loss function, obtains more accurate results under the condition of the same iteration step number, can improve the training quality and has better cost benefit.
Of course, it is not necessary for any product in which the invention is practiced to achieve all of the above-described advantages at the same time.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below are only some embodiments of the present invention; those skilled in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of the generative adversarial network training method provided by the present invention.
Fig. 2 compares the images generated by the generative adversarial network training method provided by the embodiment of the present invention and by the WGAN-GP method after 20000 iterations on the CIFAR10 data set.
Fig. 3 compares the Inception Score obtained on the CIFAR10 data set by the generative adversarial network training method provided by the embodiment of the present invention and by the WGAN-GP method.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to figs. 1-3, the present invention is a method for training a generative adversarial network based on dual distance loss, comprising:
Step S1: acquire a data set of the target distribution and preprocess it;
Step S2: set the structures and parameters of the generator and discriminator neural networks and the learning rate used during training;
Step S3: calculate a dual distance loss function from the parameters of the neural networks, and train the generator to generate the true distribution by stochastic gradient descent based on the dual distance loss function;
The generative adversarial training method based on the dual distance loss function obtains more accurate results for the same number of iteration steps, improves training quality, and has better cost effectiveness and generalization performance.
After step S3, the empirical dual distance between the target true distribution and the generated distribution must be calculated:
[formula rendered as an image in the original]
where x_i is a sample point of the true distribution, z_i is a sample point of the Gaussian distribution, m and n are positive integers, f is the discriminator, g is the generator, and 𝓕 and 𝓖 denote the spaces of discriminators and generators, respectively.
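The empirical dual distance formula itself appears only as an image in the source. Assuming it takes the duality-gap form (the supremum of an empirical objective M over discriminators in 𝓕 minus the infimum over generators in 𝓖), with a Wasserstein-style M and small finite candidate families chosen here purely for illustration, it can be sketched as:

```python
import numpy as np

rng = np.random.default_rng(0)
xs = rng.normal(3.0, 1.0, 200)   # x_i: samples from the "true" distribution
zs = rng.normal(0.0, 1.0, 200)   # z_i: samples from the Gaussian distribution

def M(f, g):
    # Illustrative empirical objective (Wasserstein-style, an assumption):
    # M(f, g) = (1/m) sum_i f(x_i) - (1/n) sum_i f(g(z_i))
    return float(np.mean(f(xs)) - np.mean(f(g(zs))))

# Finite stand-ins for the discriminator space F and the generator space G.
F_cand = [lambda x, w=w: w * (x - 1.5) for w in (-1.0, 0.0, 0.5, 1.0)]
G_cand = [lambda z, b=b: z + b for b in (0.0, 1.0, 2.0, 3.0)]

def empirical_dual_distance(f, g):
    best_f = max(M(fc, g) for fc in F_cand)   # sup over discriminators
    best_g = min(M(f, gc) for gc in G_cand)   # inf over generators
    return best_f - best_g                    # >= 0 whenever f in F, g in G

dg = empirical_dual_distance(F_cand[2], G_cand[1])
```

For any f ∈ 𝓕 and g ∈ 𝓖 the gap is non-negative, and it vanishes exactly at an equilibrium of the restricted game, which is consistent with the stopping rule E_k = 0 used in the algorithm of this patent.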
Specifically, given two spaces 𝓕 and 𝓖 such that the discriminator and generator satisfy f ∈ 𝓕 and g ∈ 𝓖, a convex function φ, a true data distribution p_data, and a Gaussian distribution p_z, the dual distance loss DG(f*, g*) at (f*, g*) is
[formula rendered as an image in the original]
where the loss function of the generative adversarial network is
[image in the original]
The empirical dual distance loss satisfies
[image in the original]
where the empirical loss function of the generative adversarial network is:
[image in the original]
if the true sample X and the Gaussian distribution sample Z are bounded and the boundary is bounded by BxAnd BzRepresents;
Figure BDA00023332211900000513
so that
Figure BDA00023332211900000514
And
Figure BDA00023332211900000515
is provided with
Figure BDA00023332211900000516
Wherein L isfIs the Lipschitz constant, L, of the arbiter network fgA liphoz constant for the generator network; then with a probability of at least 1-3 δ there is equation (1):
Figure BDA0002333221190000061
at this time, a specific process of obtaining formula (1) is given, which may include:
the equation of formula (1) is simplified to the left as:
Figure BDA0002333221190000062
the McDiarmid inequality condition is:
Figure BDA0002333221190000063
wherein X ═ { X ═ X1,x2,...,xi,...,xn},X′={x1,x2,...,x′i,...,xn},ρφIs the Liphoz constant of φ.
Using the McDiarmid inequality, at least
Figure BDA0002333221190000064
The probability of (d) is given by equation (2):
Figure BDA0002333221190000065
again using the McDiarmid inequality, at least
Figure BDA0002333221190000071
The probability of (c) is given by equation (3):
Figure BDA0002333221190000072
where e is ∈1,∈2,...,∈n) And P (∈ C)i=1)=P(∈i-1) 0.5. Therefore, the probability of at least 1- δ is given by equation (4):
[image in the original]
Similarly, with probability at least 1 − δ, equations (5) and (6) hold:
[images in the original]
Thus, with probability at least 1 − 3δ, equation (7) holds:
[image in the original]
Since both the discriminator f and the generator g are neural networks, they can be written in the form of equations (8) and (9):
f = a_H(M_H(a_{H-1}(M_{H-1}(...a_1(M_1(·))...)))) (8);
g = b_{H′}(N_{H′}(b_{H′-1}(N_{H′-1}(...b_1(N_1(·))...)))) (9);
where a_i and b_i are activation functions and M_i and N_i are matrices; in the experiments the activation function is ReLU, so each a_i and b_i has Lipschitz constant at most 1; it is assumed that ‖M_i‖ ≤ B_i and ‖N_i‖ ≤ B′_i; d_f and d_g denote the widths of the discriminator and generator networks.
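Writing f and g as compositions of matrices M_i, N_i and 1-Lipschitz activations, as in equations (8) and (9), is what makes the Lipschitz constants L_f and L_g controllable: the constant of the composition is at most the product of the spectral norms of the matrices. A small numerical check of that product bound (the layer shapes are arbitrary illustrative choices, and the outer activation is omitted; with a 1-Lipschitz outer activation the same bound holds):

```python
import numpy as np

rng = np.random.default_rng(1)

# Three toy weight matrices M_1, M_2, M_3 (shapes chosen arbitrarily).
Ms = [rng.normal(size=(8, 4)), rng.normal(size=(8, 8)), rng.normal(size=(1, 8))]
relu = lambda v: np.maximum(v, 0.0)   # ReLU is 1-Lipschitz

def net(x):
    # f = M_3(a_2(M_2(a_1(M_1(x))))) -- a depth-3 instance of equation (8).
    h = x
    for M in Ms[:-1]:
        h = relu(M @ h)
    return float(Ms[-1] @ h)

# Composition bound: L_f <= product of spectral norms ||M_i||_2.
bound = float(np.prod([np.linalg.norm(M, 2) for M in Ms]))

# An empirical Lipschitz estimate over random pairs never exceeds the bound.
est = 0.0
for _ in range(200):
    x, y = rng.normal(size=4), rng.normal(size=4)
    est = max(est, abs(net(x) - net(y)) / float(np.linalg.norm(x - y)))
```

This product of norms is exactly the quantity that the assumptions ‖M_i‖ ≤ B_i and ‖N_i‖ ≤ B′_i control in the generalization bound.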
Under the above assumptions, formula (10) holds:
[image in the original]
Let [image in the original]; its covering number [image in the original] satisfies formula (11):
[image in the original]
Since [image in the original], formula (12) holds:
[image in the original]
From the relationship between the Rademacher complexity and the covering number, formula (13) is obtained:
[image in the original]
Similarly, formulas (14) and (15) are obtained:
[images in the original]
Assuming m ≥ n, and combining formulas (7), (13), (14) and (15), the generalization error bound based on the dual distance loss, namely formula (1), is obtained.
Specifically, in the method for generative adversarial training based on dual distance loss according to the embodiment of the present invention, after setting the structure and parameters of the neural networks and providing a data set conforming to the target distribution, solving the dual distance loss function by a gradient descent method may include:
Initialization step: process the target data set; given a discriminator f_0 and a generator g_0 in the initial state, set the parameters γ ∈ (0, 2) and k = 0, and fix two positive integers m and n;
Random selection of data points: select m points from the target data set, denoted {x_1, ..., x_m}, and select n points from the specified Gaussian noise, denoted {z_1, ..., z_n};
Perturbation point calculation: for the given generator g_k and discriminator f_k, consider the objective [image in the original] and calculate the perturbation points f̂_k and ĝ_k;
Dual distance calculation: calculate the dual distance loss function E_k [image in the original]; if E_k = 0, the algorithm stops and outputs the generator g_k;
Optimization direction calculation: consider the subgradients of the function [image in the original], denoted [images in the original], and take the corresponding optimization directions d_k;
Update step: compute the updated discriminator and generator [images in the original] with step size τ_k = γE_k/‖d_k‖².
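The update step uses a Polyak-type step size τ_k = γE_k/‖d_k‖², driven by the current dual distance E_k and a subgradient direction d_k. The sketch below applies exactly this rule to a toy bilinear game M(u, v) = u·v on [-1, 1]², for which the dual gap E(u, v) = |u| + |v| and its subgradient are available in closed form; the toy objective, the starting point, and γ = 1 are illustrative assumptions, not the patent's networks.

```python
import numpy as np

gamma = 1.0                      # gamma in (0, 2), as in the initialization step
u, v = 0.8, -0.6                 # toy discriminator / generator parameters

def dual_gap(u, v):
    # For M(u, v) = u*v on [-1, 1]^2:
    #   max_{u'} M(u', v) = |v| and min_{v'} M(u, v') = -|u|,
    # so E(u, v) = |u| + |v|, which is 0 exactly at the equilibrium (0, 0).
    return abs(u) + abs(v)

gaps = []
for k in range(100):
    E = dual_gap(u, v)
    gaps.append(E)
    if E < 1e-9:                              # E_k = 0: stop, output generator
        break
    d = np.array([np.sign(u), np.sign(v)])    # subgradient d_k of E at (u, v)
    tau = gamma * E / float(np.dot(d, d))     # tau_k = gamma * E_k / ||d_k||^2
    u, v = u - tau * d[0], v - tau * d[1]     # update step
```

Because the step size is proportional to the remaining gap E_k, the iterates take large steps far from equilibrium and vanishingly small ones near it; on this toy game the gap collapses to zero in a few iterations.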
The method for generative adversarial training based on the dual distance loss function provided by the embodiment of the present invention successfully generates the data distribution on MNIST and CIFAR10. The results show that, compared with the conventional generative adversarial training method, it obtains more accurate results for the same number of iteration steps, improves training quality, and is more cost-effective.
Since Gaussian noise must be generated at each training step, the method provided by this embodiment appears more complicated in the setting of the loss function, but it does not increase the generalization error and therefore has the same generalization performance as the conventional generative adversarial training method.
It should be noted that, in the above system embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.

Claims (4)

1. A method for training a generative adversarial network based on dual distance loss, characterized by comprising the following steps:
step S1: acquiring a data set of the target distribution and preprocessing it;
step S2: setting the structures and parameters of the generator and discriminator neural networks and the learning rate used during training;
step S3: calculating a dual distance loss function from the parameters of the neural networks, and training the generator to generate the true distribution by stochastic gradient descent based on the dual distance loss function.
2. The method for training a generative adversarial network based on dual distance loss according to claim 1, characterized in that, after step S3, the empirical dual distance between the target true distribution and the generated distribution is calculated:
[formula rendered as an image in the original]
where x_i is a sample point of the true distribution, z_i is a sample point of the Gaussian distribution, m and n are positive integers, f is the discriminator, g is the generator, and 𝓕 and 𝓖 denote the spaces of discriminators and generators, respectively.
3. The method for training a generative adversarial network based on dual distance loss according to claim 1, characterized in that the method calculates perturbation points and then uses them to determine the dual distance loss function and the optimization direction, comprising the following steps:
an initialization step: processing the target data set; given a discriminator f_0 and a generator g_0 in the initial state, setting the parameters γ ∈ (0, 2) and k = 0, and fixing two positive integers m and n;
random selection of data points: selecting m points from the target data set, denoted {x_1, ..., x_m}, and selecting n points from the specified Gaussian noise, denoted {z_1, ..., z_n};
a perturbation point calculation step: for the given generator g_k and discriminator f_k, considering the objective [image in the original] and calculating the perturbation points f̂_k and ĝ_k;
an optimization direction calculation step: considering the subgradients of the function [image in the original], denoted [images in the original], and taking the corresponding optimization directions d_k;
an updating step: computing the updated discriminator and generator [images in the original] with step size τ_k = γE_k/‖d_k‖².
4. The method for training a generative adversarial network based on dual distance loss according to claim 1, characterized in that the method satisfies, with probability at least 1 − 3δ:
[formula rendered as an image in the original]
CN201911345589.8A 2019-12-24 2019-12-24 Method for training generation countermeasure network based on dual distance loss Withdrawn CN111144563A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911345589.8A CN111144563A (en) 2019-12-24 2019-12-24 Method for training generation countermeasure network based on dual distance loss

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911345589.8A CN111144563A (en) 2019-12-24 2019-12-24 Method for training generation countermeasure network based on dual distance loss

Publications (1)

Publication Number Publication Date
CN111144563A true CN111144563A (en) 2020-05-12

Family

ID=70519634

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911345589.8A Withdrawn CN111144563A (en) 2019-12-24 2019-12-24 Method for training generation countermeasure network based on dual distance loss

Country Status (1)

Country Link
CN (1) CN111144563A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112488309A (en) * 2020-12-21 2021-03-12 清华大学深圳国际研究生院 Training method and system of deep neural network based on critical damping momentum
CN112488309B (en) * 2020-12-21 2023-10-20 清华大学深圳国际研究生院 Training method and system of deep neural network based on critical damping momentum
CN112668239A (en) * 2020-12-30 2021-04-16 山东交通学院 Hybrid power truck fleet experience teaching method based on counterstudy
CN112668239B (en) * 2020-12-30 2022-11-15 山东交通学院 Hybrid power truck fleet experience teaching method based on counterstudy
CN112766489A (en) * 2021-01-12 2021-05-07 合肥黎曼信息科技有限公司 Method for training generation countermeasure network based on dual distance loss
CN115205738A (en) * 2022-07-05 2022-10-18 广州和达水务科技股份有限公司 Emergency drainage method and system applied to urban inland inundation
CN115205738B (en) * 2022-07-05 2023-08-01 广州和达水务科技股份有限公司 Emergency drainage method and system applied to urban inland inundation

Similar Documents

Publication Publication Date Title
CN111144563A (en) Method for training generation countermeasure network based on dual distance loss
Thapa et al. Splitfed: When federated learning meets split learning
Zhang et al. Deep learning with elastic averaging SGD
CN107330516B (en) Model parameter training method, device and system
US20230036702A1 (en) Federated mixture models
Yang et al. Condensenet v2: Sparse feature reactivation for deep networks
Xie et al. BAYHENN: Combining Bayesian deep learning and homomorphic encryption for secure DNN inference
CN109165735B (en) Method for generating sample picture based on generation of confrontation network and adaptive proportion
CN111737743A (en) Deep learning differential privacy protection method
Bonettini et al. A variable metric forward-backward method with extrapolation
CN107403466A (en) Ultra-large unstrctured grid generation method based on overall situation encryption
Tang et al. Skfac: Training neural networks with faster kronecker-factored approximate curvature
Zhou et al. Hybrid social spider optimization algorithm with differential mutation operator for the job-shop scheduling problem.
Alabdulatif et al. Fully homomorphic based privacy-preserving distributed expectation maximization on cloud
CN115204416A (en) Heterogeneous client-oriented joint learning method based on hierarchical sampling optimization
Kang et al. Time efficient offloading optimization in automotive multi-access edge computing networks using mean-field games
Gauthier et al. Personalized graph federated learning with differential privacy
CN113807330B (en) Three-dimensional sight estimation method and device for resource-constrained scene
US11080365B2 (en) Solving lattice problems using annealing
CN116629376A (en) Federal learning aggregation method and system based on no data distillation
Nakano Hybrid algorithm of ensemble transform and importance sampling for assimilation of non-Gaussian observations
Valle et al. Parallel approach for ensemble learning with locally coupled neural networks
CN112766489A (en) Method for training generation countermeasure network based on dual distance loss
Schaurecker et al. Super-resolving Dark Matter Halos using Generative Deep Learning
CN116227578A (en) Unsupervised domain adaptation method for passive domain data

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20200512