CN113390848A

CN113390848A - DCGAN spectral data expansion method

Info

Publication number: CN113390848A
Application number: CN202010179231.9A
Authority: CN
Inventors: 李彦晖; 吴鹏飞; 刘勇飞; 殷琳琳
Original assignee: Guilin University of Electronic Technology
Current assignee: Guilin University of Electronic Technology
Priority date: 2020-03-13
Filing date: 2020-03-13
Publication date: 2021-09-14

Abstract

The invention discloses a DCGAN spectral data expansion method, which introduces convolution on the basis of original GAN, extracts deep features of Raman spectrum by means of feature extraction capability of convolution layer, and generates highly similar spectrum. Compared with an infrared spectrum method, the Raman spectrum provides nondestructive qualitative and quantitative analysis, has no special requirements on a sample, is simple and convenient in short time and high in sensitivity, avoids errors caused by the damage of the sample or the defects of the sample, has smaller difference of the distortion degree of the generated spectrum compared with the original spectrum, and well retains the information of the original spectrum of the generated spectrum.

Description

DCGAN spectral data expansion method

Technical Field

The invention relates to the field of spectral metrology, in particular to a DCGAN spectral data expansion method.

Background

The safety of food and medicine has always been the object of major concern, and the commonly used food and medicine detection means include absorption coefficient method, chemical method, HPLC and the like, and these detection methods are not only cumbersome, but also limited to laboratories, so a means capable of rapid detection is needed, and in recent years, the development is better for near infrared spectrum detection and Raman spectrum detection, wherein the Raman spectrum detection technology is a detection technology generated based on Raman spectrum photon fingerprints, when light impacts on object molecules, elastic scattering occurs, and a small amount of photons generate inelastic scattering, these photons are Raman photons, Raman photons transfer energy to molecules, and generate displacement scattered light, and the displacement distance is the information of the molecules. Different distances correspond to different molecular structures, thereby generating a Raman spectrum. The chemical and molecular information and content of the sample can be clarified according to the spectrogram.

The use of raman analysis for identification and classification of food and drug products has been widely used due to improved upgrading of instruments and methods. The application of deep learning to spectroscopy is a necessary trend. The existing Raman spectrum acquisition needs higher manpower and time cost, the amount of acquired data samples is less, interference factors exist, and the condition that deep learning needs to be trained by large samples cannot be met, so that the method for applying the deep learning to the Raman spectrum is less.

Disclosure of Invention

In view of the above, the present invention is to provide a method for expanding DCGAN spectral data.

A DCGAN spectral data expansion method is to introduce convolution on the basis of original GAN, and extract deep features of Raman spectrum by means of feature extraction capability of convolution layer to generate highly similar spectrum, and the method comprises the following steps:

(1) using deep convolution to generate a countermeasure network and a new spectrum, and inputting CNN for classification;

(2) generating a picture by using a countermeasure network, inputting random noise, and judging the authenticity of the picture;

(3) training a generation network, and achieving the purpose of optimizing the generation network by giving a judgment network parameter, so that the judgment network cannot identify 'false' samples, can output larger probability values of all true samples, and is mapped into a function to be maximized D (G (z)), namely minimized 1-D (G (z));

(4) training a discrimination network, and giving parameters of a generated network to achieve the purpose of optimizing the discrimination network, so that the accuracy of the discrimination network can be greatly improved, wherein a real image x is expected to output a larger probability value, namely, the maximum D (x); for the generated sample g (z), D (g (z)) is minimized. Therefore, an objective function optimization objective during network training is obtained: lnD (x) + ln (1-D (G (x)));

finally, an objective function is obtained:

training criteria for discriminant networks, i.e. maximizing V (D, G) given a generating network:

where equation (2) is desired to be maximized, this requires that each x in the equation lets

P_data(x)ln(D(x))+P_g(x)ln(1-D(x)) (3)

The maximum value is taken. Where x, P_data(x)，P_g(x) Are all fixed values, obviously: for any nonzero P_data(x)，P_g(x) And a real value D (x) e [0,1 ∈]When the function (3) is in P_data(x)/(P_data(x)+P_g(x) Take the maximum value, list the optimal function to generate network D:

when optimizing the generation network, there is P_data＝P_gThe generation network obtains an optimal solution, so that the generation network better reproduces the distribution of real samples;

(5) and modifying the convolution kernel of the convolution layer in the DCGAN into a one-dimensional vector convolution kernel so as to process the Raman spectrum data.

The invention has the advantages that: compared with an infrared spectrum method, the Raman spectrum provides nondestructive qualitative and quantitative analysis, has no special requirements on a sample, is simple and convenient in short time and high in sensitivity, avoids errors caused by the damage of the sample or the defects of the sample, has smaller difference of the distortion degree of the generated spectrum compared with the original spectrum, and well retains the information of the original spectrum of the generated spectrum.

Drawings

FIG. 1 is a schematic diagram of DCGAN network structure and Raman spectrum classification according to an embodiment of the present invention;

FIG. 2 is a comparison between the original spectrum and the generated spectrum of DCGAN in the example of the present invention ((a) original spectrum (b) generated spectrum).

Detailed Description

The invention is further illustrated with reference to the following figures and examples.

Example (b):

the network structure of DCGAN is shown in fig. 1, and the operation of DCGAN is illustrated here: two networks, G (Generator) and D (Discrimatoror), can be seen. Their function confirms their name:

g this network is used to generate a picture, to which a random noise z is input, and finally a picture, denoted G (z), can be generated.

D this network is used to determine the true degree of a picture. Inputting a picture x to the device, and outputting D (x), which means that x is the probability of the real picture, and if the probability value is 1, the picture is completely real. If the probability value is 0, the picture is false.

Training is then performed, as explained above, where G requires the generation of as many pictures as possible to verify D. And D should distinguish the "false" picture generated by G from the actual picture as much as possible. It can be seen that G and D form a "left-right interpulsation".

Finally, we get the result of a game, i.e. G generates a picture G (z) of "false-to-false". In this case, it is difficult to determine the true degree of g (z) for D, and D (g (z)) is 0.5. The convolution is introduced because CNN does not process each single pixel point, but calculates the whole area, so that the formed DCGAN can better extract Raman spectrum characteristic information. Since partial loss of image information is caused by down-sampling of the pooling layer (posing) in the convolutional network, deconvolution (deconvolution) and stepped convolution (stepped convolution) are introduced instead of generating and countering the network pooling layer, respectively, so that the loss of image information can be reduced. Subsequently, Batch Normalization (BN) is introduced to construct a more stable network.

The convolutional layer of the conventional DCGAN network is mainly classified into image-oriented layers. The network layer default input is typically a two-dimensional image, so the network layer convolution kernel and pooling window are both matrices of size n dimensions. Therefore, the network structure is not suitable for spectral images, and therefore, the convolutional layer of the conventional DCGAN network needs to be improved, that is, the convolutional core of the convolutional layer in the DCGAN network is modified into a one-dimensional vector convolutional core, so that raman spectral data can be processed.

The structures of the generation network and the discrimination network designed for the raman spectrum data are shown in tables 1 and 2.

Table 1 generating a network

TABLE 2 Confrontation network

The training is divided into two parts:

(1) training a generation network, and achieving the purpose of optimizing the generation network by giving a discrimination network parameter, so that the discrimination network cannot identify 'false' samples, and the larger probability values of all true samples which can be output are mapped into a function, namely, the maximization D (G (z)), namely, the minimization 1-D (G (z)).

(2) And training the discrimination network, and giving the parameters of the generated network to achieve the purpose of optimizing the discrimination network, so that the precision of the discrimination network can be greatly improved, and the real image x is expected to output a larger probability value, namely, the maximum D (x). For the generated sample g (z), D (g (z)) is minimized. Therefore, an objective function optimization objective during network training is obtained: lnD (x) + ln (1-D (G (x))).

Finally, an objective function is obtained:

P_data(x)ln(D(x))+P_g(x)ln(1-D(x)) (3)

when optimizing the generation network, there is P_data＝P_gThe generation network obtains an optimal solution, so that the generation network better reproduces the distribution of real samples.

Example (b):

raman spectral data was used in the invention as a published raman spectral data set and the samples of the study were 16 pork samples taken from the slaughterhouse daily production inventory. The cylinders were fat drilled with a 12 mm rotary biopsy drill and then cut into disks (height 1.8 mm) to create a depth profile. 105 disc data were obtained from the waist of a total of 16 pork samples.

The activation function of the convolutional network in the DCGAN selects LeakyReLU, the slope value of the leak is set to be 0.2, the value of the whole network is set to be 2, the learning rate of the network cannot be too large, otherwise, the time is too long, the learning rate is set to be 0.0002, the convolutional layer also needs to use an optimizer and set momentum parameters, the optimizer uses Adam, and the parameters are set to be 0.5. The spectrum number of each drug is expanded to 100 by a DCGAN mode, 70% of the spectrum is selected to train CNN, and the rest 30% of the spectrum is tested. In order to avoid the unsatisfactory results caused by the inconsistent sample bands in the experiments, the spectrum of 100-1000 bands of each drug was selected in the following experiments.

The number of spectra per drug was also expanded to 100 using DCGAN, and the training and test sets were partitioned as in Table 3. FIG. 2 shows the 10 randomly selected sample spectra from the original discharge data, as shown in FIG. 2(a), and 10 new spectra generated by the DCGAN challenge, as shown in FIG. 2 (b). The generated spectrum is visually seen to be smoother and clearer than the original spectrum. And inputting the divided training set and test set into CNN for training and classification to obtain a discrimination result shown in Table 4. The experimental result shows that the generated spectrogram has high classification precision. Since the DCGAN generated spectrum is a regenerated original spectrum, it is necessary to evaluate the degree of distortion of the generated spectrum compared to the original spectrum. The local Variance Estimation method (LVE) is a better method capable of estimating the image distortion degree, and the algorithm principle is that the local Variance of each picture pixel is calculated firstly, the largest local Variance is the signal Variance, the smallest local Variance is the noise Variance, and the signal Variance is compared with the noise Variance to obtain the ratio and converted into the noise unit dB. It can be seen from table 5 that the distortion degree of the generated spectrum is smaller than the difference of the original spectrum, and the generated spectrum better retains the information of the original spectrogram.

TABLE 3 pork sample training set, test set partitioning

Note that in Table 4, the detailed result-DCGAN + CNN (%)

Table 5 corresponds to the LVE method signal-to-noise ratio of FIGS. 2(a) (b)

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A DCGAN spectral data expansion method is characterized in that: introducing convolution on the basis of original GAN, extracting deep features of Raman spectrum by means of feature extraction capability of convolution layer, and generating highly similar spectrum, wherein the method comprises the following steps:

finally, an objective function is obtained:

max V(D,G)＝∫_xP_data(x)ln(D(x))dx+∫_zP_z(z)ln(1-D(G(z)))dz＝∫_x[P_data(x)ln(D(x))+P_g(x)ln(1-D(x))]]dx (2)

P_data(x)ln(D(x))+P_g(x)ln(1-D(x)) (3)