WO2022057691A1 - Gaussian distribution data adjustment method based on improved gan - Google Patents
Gaussian distribution data adjustment method based on improved GAN
- Publication number
- WO2022057691A1 · PCT/CN2021/117026 · CN2021117026W
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- network
- gaussian distribution
- loss
- generated
- Prior art date
Images
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/047—Probabilistic or stochastic networks
- G06N3/08—Learning methods
Definitions
- The invention relates to a Gaussian distribution data adjustment method based on an improved GAN network and belongs to the technical field of machine learning.
- A Generative Adversarial Network (GAN) is a deep learning model and one of the most promising methods in recent years for unsupervised learning on complex distributions. The model produces fairly good outputs through adversarial learning between (at least) two modules in the framework: a generative model and a discriminative model.
- The technical problem to be solved by the present invention is to provide a Gaussian distribution data adjustment method based on an improved GAN network.
- The generated data output by the GAN network conforms to the Gaussian distribution while its characteristics remain consistent with those of the real data.
- The technical solution adopted in the present invention is a Gaussian distribution data adjustment method based on an improved GAN network, comprising the following steps:
- step S01), natural random numbers, manually set values, or data produced by other systems are transmitted to the data generation network, which outputs generated data according to the input;
- step S02), the generated data of step S01 and the real data against which its similarity is measured are transmitted to the data discrimination network, which obtains the loss Loss1 from the similarity between the generated data and the real data;
- step S03), the n data generated in batches by the data generation network are transmitted to the Gaussian distribution deviation judgment network, which uses the central limit theorem to estimate the distribution of the n generated data and computes the distance between that distribution and the standard Gaussian distribution, yielding the loss Loss2;
- step S04), the weighted sum of the loss Loss2 and the loss Loss1 serves as the overall loss of the data generation network; this overall loss is used to train the generation network, and the trained data generation network adjusts the input data to conform to the Gaussian distribution.
- The data generation network, the data discrimination network, and the Gaussian distribution deviation judgment network are trained or evaluated in sequence; while one of them is being trained or evaluated, the others remain unchanged.
- In step S03, the mean square error between the distribution of the n batch-generated data and the standard Gaussian distribution is computed as the loss Loss2.
- Denoting the overall loss of the data generation network as Loss, Loss = Loss2 + λ·Loss1, where λ is an adjustable weighting coefficient.
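The weighted combination above can be sketched in a few lines. This is an illustrative sketch, not code from the patent; the value λ = 0.5 below is an arbitrary choice, since the patent leaves λ adjustable:

```python
def overall_loss(loss1: float, loss2: float, lam: float = 0.5) -> float:
    """Overall generator loss per the claims: Loss = Loss2 + lam * Loss1.

    lam (the patent's λ) is an adjustable weighting coefficient;
    the default 0.5 is an arbitrary illustrative choice.
    """
    return loss2 + lam * loss1
```

For example, with λ = 0.5, Loss1 = 2.0 and Loss2 = 1.0 give an overall Loss of 2.0.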
- The present invention performs Gaussian distribution data adjustment with an improved GAN network, so that the randomly generated data follows a specified distribution no matter how the original real data is distributed.
- The GAN network changes only the distribution of the data, not the data itself, so the generated data remains consistent with the real data in its characteristics.
- The present invention is described using the Gaussian distribution as an example; other statistical distributions can likewise be used for directional normalization, so as to meet the expected requirements of data generation.
- Figure 1 is a flow chart of the method.
- This embodiment discloses a Gaussian distribution data adjustment method based on an improved GAN network, as shown in FIG. 1 , including the following steps:
- step S01), the data AZi is transmitted to the data generation network, which performs the corresponding computation on the input data AZi and outputs the generated data;
- the data AZi represents a natural random number; it can also be a manually set value or data generated by other systems, such as data or images collected by a transmitter;
- step S02), the generated data of step S01 and the real data BZi against which its similarity is measured are transmitted to the data discrimination network, which obtains the loss Loss1 from the similarity between the generated data and the real data;
- step S03), the n data generated in batches by the data generation network are transmitted to the Gaussian distribution deviation judgment network, which uses the central limit theorem to estimate the distribution of the n generated data and computes the distance between that distribution and the standard Gaussian distribution, yielding the loss Loss2;
- The central limit theorem shows that when n is large, the sum of n independent random variables approximately obeys the normal distribution N(nμ, nσ²); the target normal distribution is therefore N(nμ, nσ²). The key problem this method must solve is making the approximation to the target normal distribution as close as possible for a given n.
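The central-limit behaviour invoked here is easy to verify numerically: summing n iid draws from any finite-variance distribution yields sums whose mean and variance approach nμ and nσ². A small numpy check (illustrative, not from the patent) using Uniform(0, 1), where μ = 0.5 and σ² = 1/12:

```python
import numpy as np

def batch_sum_stats(n: int, num_batches: int, seed: int = 0):
    """Draw num_batches batches of n iid Uniform(0,1) values and return the
    empirical mean and variance of the per-batch sums. By the central limit
    theorem these approach n*mu = n/2 and n*sigma^2 = n/12 as n grows."""
    rng = np.random.default_rng(seed)
    sums = rng.uniform(0.0, 1.0, size=(num_batches, n)).sum(axis=1)
    return sums.mean(), sums.var()
```

For n = 100 the sums cluster around a mean of 50 with a variance near 100/12 ≈ 8.33, matching N(nμ, nσ²).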
- step S04), the weighted sum of the loss Loss2 and the loss Loss1 serves as the overall loss of the data generation network; this overall loss is used to train the generation network, and the trained data generation network adjusts the input data to conform to the Gaussian distribution.
- The weighted summation of the loss Loss2 and the loss Loss1 is realized by a gating unit: specifically, the weighting coefficient λ and the loss Loss1 are multiplied by an AND gate, and the loss Loss2 and λ·Loss1 are added by an OR gate.
- The data generation network, the data discrimination network, and the Gaussian distribution deviation judgment network are trained or evaluated in sequence; during one network's training or evaluation, the others remain unchanged.
- In step S03, the mean square error between the distribution of the n batch-generated data and the standard Gaussian distribution is computed as the loss Loss2.
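One plausible reading of this mean-square-error loss, sketched under the assumption that "distribution" means a binned empirical density compared against the target Gaussian pdf (the patent does not spell out the binning scheme, so the bin count and range below are illustrative assumptions):

```python
import numpy as np

def gaussian_deviation_loss(samples, mu: float = 0.0, sigma: float = 1.0,
                            bins: int = 20) -> float:
    """Sketch of Loss2: mean square error between the binned empirical
    density of a generated batch and the target Gaussian pdf N(mu, sigma^2).
    The 20-bin histogram over [mu - 4*sigma, mu + 4*sigma] is an assumption,
    not a detail taken from the patent."""
    lo, hi = mu - 4.0 * sigma, mu + 4.0 * sigma
    density, edges = np.histogram(samples, bins=bins, range=(lo, hi), density=True)
    centers = 0.5 * (edges[:-1] + edges[1:])
    # target Gaussian pdf evaluated at the bin centers
    target = np.exp(-((centers - mu) ** 2) / (2.0 * sigma ** 2)) \
        / (sigma * np.sqrt(2.0 * np.pi))
    return float(np.mean((density - target) ** 2))
```

A batch that is already Gaussian scores a far smaller Loss2 than, say, a uniform batch, which is exactly the gradient signal the generation network needs.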
- Loss1 is the output of the data discrimination network, i.e. the feature distance between the generated data and the real data (such as the Euclidean distance or cosine distance);
- The training is divided into two alternating stages. When training the discrimination network, the parameters of the generation network are held fixed; batch-generated data serve as negative samples and real data as positive samples. When training the generation network, the discrimination network uses the already-trained model to judge the feature distance between the generated data and the real data (i.e. Loss1).
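The generator stage of this alternation can be sketched with a toy linear "generator". This is an illustrative stand-in, not the patent's architecture: the generator is g(z) = a·z + b, the frozen-discriminator term Loss1 is omitted, and the distribution loss is a moment-matching distance to a target Gaussian N(2, 0.5²) standing in for Loss2:

```python
import numpy as np

def train_toy_generator(mu_t: float = 2.0, sigma_t: float = 0.5,
                        steps: int = 500, lr: float = 0.05, seed: int = 0):
    """Toy sketch of the generator training stage: fit g(z) = a*z + b so that
    its outputs match the target Gaussian N(mu_t, sigma_t^2). The loss is
    (mean - mu_t)^2 + (std - sigma_t)^2, a moment-matching stand-in for the
    patent's distribution-distance Loss2; Loss1 is omitted for brevity."""
    rng = np.random.default_rng(seed)
    a, b = 1.0, 0.0
    for _ in range(steps):
        z = rng.normal(0.0, 1.0, 256)   # noise batch fed to the generator
        x = a * z + b                    # batch of generated data
        m, s = x.mean(), x.std()
        # hand-derived gradients: d mean/db = 1; d std/da = std(z) = s/a (a > 0)
        grad_b = 2.0 * (m - mu_t)
        grad_a = 2.0 * (m - mu_t) * z.mean() + 2.0 * (s - sigma_t) * (s / a)
        a -= lr * grad_a
        b -= lr * grad_b
    return a, b
```

After training, a ≈ 0.5 and b ≈ 2.0 (up to sampling noise), i.e. the generator has learned to reshape standard-normal input into the target Gaussian, which is the adjustment effect the method claims.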
- Other aspects are the same as the general GAN principle; the data generation network, the data discrimination network, and the Gaussian distribution deviation judgment network can be common networks such as VGG or ResNet, or custom networks.
- This embodiment performs Gaussian distribution data adjustment with an improved GAN network, so that the randomly generated data follows a specified distribution no matter how the original real data is distributed.
- The GAN network changes only the distribution of the data, not the data itself, so the generated data remains consistent with the real data in its characteristics.
- This embodiment is described using the Gaussian distribution as an example; other statistical distributions can likewise be used for directional normalization, so as to meet the expected requirements of data generation.
Abstract
Description
Claims (4)
- 1. A Gaussian distribution data adjustment method based on an improved GAN network, characterized in that it comprises the following steps:
  S01), natural random numbers, manually set values, or data produced by other systems are transmitted to the data generation network, which outputs generated data according to the input;
  S02), the generated data of step S01 and the real data against which its similarity is measured are transmitted to the data discrimination network, which obtains the loss Loss1 from the similarity between the generated data and the real data;
  S03), the n data generated in batches by the data generation network are transmitted to the Gaussian distribution deviation judgment network, which uses the central limit theorem to estimate the distribution of the n generated data and computes the distance between that distribution and the standard Gaussian distribution, yielding the loss Loss2;
  S04), the weighted sum of the loss Loss2 and the loss Loss1 serves as the overall loss of the data generation network; this overall loss is used to train the generation network, and the trained data generation network adjusts the input data to conform to the Gaussian distribution.
- 2. The Gaussian distribution data adjustment method based on an improved GAN network according to claim 1, characterized in that the data generation network, the data discrimination network, and the Gaussian distribution deviation judgment network are trained or evaluated in sequence, and while one of them is being trained or evaluated, the other networks remain unchanged.
- 3. The Gaussian distribution data adjustment method based on an improved GAN network according to claim 1, characterized in that, in step S03, the mean square error between the distribution of the n batch-generated data and the standard Gaussian distribution is computed as the loss Loss2.
- 4. The Gaussian distribution data adjustment method based on an improved GAN network according to claim 1, characterized in that, denoting the overall loss of the data generation network as Loss, Loss = Loss2 + λLoss1, where λ is an adjustable weighting coefficient.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010985207.4A CN112116073A (en) | 2020-09-18 | 2020-09-18 | Gaussian distribution data adjusting method based on improved GAN network |
CN202010985207.4 | 2020-09-18 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022057691A1 true WO2022057691A1 (en) | 2022-03-24 |
Family
ID=73800066
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/117026 WO2022057691A1 (en) | 2020-09-18 | 2021-09-07 | Gaussian distribution data adjustment method based on improved gan |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112116073A (en) |
WO (1) | WO2022057691A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112116073A (en) * | 2020-09-18 | 2020-12-22 | 神思电子技术股份有限公司 | Gaussian distribution data adjusting method based on improved GAN network |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109146988A (en) * | 2018-06-27 | 2019-01-04 | 南京邮电大学 | Non-fully projection CT image rebuilding method based on VAEGAN |
CN109446339A (en) * | 2018-10-11 | 2019-03-08 | 广东工业大学 | A kind of knowledge mapping representation method based on multicore Gaussian Profile |
US20190378263A1 (en) * | 2018-06-08 | 2019-12-12 | Industrial Technology Research Institute | Industrial image inspection method and system and computer readable recording medium |
CN112116073A (en) * | 2020-09-18 | 2020-12-22 | 神思电子技术股份有限公司 | Gaussian distribution data adjusting method based on improved GAN network |
- 2020-09-18: CN — application CN202010985207.4A (publication CN112116073A), status: active, pending
- 2021-09-07: WO — application PCT/CN2021/117026 (publication WO2022057691A1), status: active, application filing
Also Published As
Publication number | Publication date |
---|---|
CN112116073A (en) | 2020-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111563275B (en) | Data desensitization method based on generation countermeasure network | |
WO2022057691A1 (en) | Gaussian distribution data adjustment method based on improved gan | |
CN108520272A (en) | A kind of semi-supervised intrusion detection method improving blue wolf algorithm | |
WO2023019601A1 (en) | Signal modulation recognition method for complex-valued neural network based on structure optimization algorithm | |
CN108052968B (en) | QSFLA-SVM perception intrusion detection method | |
CN110321451B (en) | Image retrieval algorithm based on distribution entropy gain loss function | |
CN111126602A (en) | Cyclic neural network model compression method based on convolution kernel similarity pruning | |
CN115131347B (en) | Intelligent control method for processing zinc alloy parts | |
CN111553348A (en) | Anchor-based target detection method based on centernet | |
CN110224862A (en) | Multi-agent system network appearance based on multilayer perceptron invades capability assessment method | |
CN108334900A (en) | Generation method and system, the sorting technique and system of the disaggregated model of power battery | |
CN113408648A (en) | Unit combination calculation method combined with deep learning | |
CN111027229B (en) | Wind power curve fitting method based on sparse heteroscedastic multi-spline regression | |
CN103824285B (en) | Image segmentation method based on bat optimal fuzzy clustering | |
WO2021258961A1 (en) | Network traffic classification method and system based on improved k-means algorithm | |
CN111079229B (en) | Large-scale high-speed rotation equipment unbalance dual-target optimization method | |
CN108171271A (en) | A kind of equipment deteriorates early warning method and system | |
CN110110673B (en) | Face recognition method based on bidirectional 2DPCA and cascade forward neural network | |
CN111652264A (en) | Negative migration sample screening method based on maximum mean difference | |
Wang et al. | Research on feature selection method of intrusion detection based on deep belief network | |
CN114662624A (en) | Non-intrusive load identification method and system based on retraining twin network | |
CN114004170A (en) | Reconstruction method of cascade flow field based on limited measuring points | |
CN112418317A (en) | Method for identifying and classifying precision machining structural part based on PSO-SVM | |
CN113205214A (en) | Photovoltaic power ultra-short-term prediction method based on parallel dendritic neural network | |
CN114238952A (en) | Abnormal behavior detection method, device and system and computer readable storage medium |
Legal Events
- 121 — EP: the EPO has been informed by WIPO that EP was designated in this application (ref document number: 21868516; country: EP; kind code: A1)
- NENP — Non-entry into the national phase (ref country code: DE)
- 122 — EP: PCT application non-entry in European phase (ref document number: 21868516; country: EP; kind code: A1)
- 32PN — EP: public notification in the EP bulletin as the address of the addressee cannot be established (noting of loss of rights pursuant to Rule 112(1) EPC; EPO Form 1205A dated 20.09.2023)