CN114091504A

CN114091504A - Rotary machine small sample fault diagnosis method based on generation countermeasure network

Info

Publication number: CN114091504A
Application number: CN202110813313.9A
Authority: CN
Inventors: 曹智; 伏洪勇; 王珂; 李振祥; 张俊华
Original assignee: University of Chinese Academy of Sciences; Technology and Engineering Center for Space Utilization of CAS
Current assignee: University of Chinese Academy of Sciences; Technology and Engineering Center for Space Utilization of CAS
Priority date: 2021-07-19
Filing date: 2021-07-19
Publication date: 2022-02-25

Abstract

The invention discloses a rotary machine small sample fault diagnosis method based on a generation countermeasure network, which comprises the steps of firstly converting collected rotary machine time domain signals into frequency spectrum signals with characteristics easy to observe through Fast Fourier Transform (FFT); and then, on the basis of ACWGAN-GP, an ACWGAN-GP-ARGMAX diagnosis model is obtained by introducing an ARGMAX multi-classification idea. The diagnosis model endows the discriminator with classification and identification capabilities, enhances the classification and identification capabilities of the discriminator, and can effectively improve the accuracy and efficiency of fault diagnosis of small samples of rotary machines.

Description

Rotary machine small sample fault diagnosis method based on generation countermeasure network

Technical Field

The invention relates to the technical field of fault diagnosis of rotary machines, in particular to a fault diagnosis method for a small sample of a rotary machine based on a generation countermeasure network.

Background

The rotating machinery is very commonly applied in the fields of aerospace, ships, wind power, petrochemicals and the like, is one of key components in an industrial scene, and is equipment which is easy to break down. Most rotary machines are of a circularly symmetric structure and are stable under normal operating conditions. However, when the rotary machine fails, its symmetry is broken, resulting in unstable performance and easily causing a serious accident. In order to avoid catastrophic failures of the rotating machine, it is of great significance to continuously monitor and diagnose the operating state of the rotating machine.

The operating state of a rotating machine can be detected and diagnosed by measuring signals such as vibration, noise, and oil. At present, the collection and analysis technology of vibration signals is relatively mature, and fault diagnosis of rotary machines based on vibration signals is one of the most common and widely applied methods. Vibration signal characteristics of rotary machines are complex, and non-professional people generally do not know and cannot identify fault characteristics of vibration signals, so that intelligent fault diagnosis research based on multidisciplinary intersection is the main development direction of fault diagnosis of rotary machines at present. However, in the training process of the intelligent fault diagnosis method, corresponding data distribution needs to be learned from a large amount of historical data, so that accurate fault diagnosis is realized. However, in a real industrial scenario, due to: 1) the failure occurrence rate is low, and the comprehensive real operation data corresponding to various health states are difficult to acquire; 2) the fault injection experiment of the rotating machinery has high cost and has great risk, and the personnel safety and the property safety can be endangered; 3) even if a large amount of operation monitoring data exist, a large amount of manpower and material resources are consumed for effectively labeling the data, the cost is extremely high, and therefore, various types of data which are enough to be effectively labeled are generally difficult to obtain for model training. Therefore, it is necessary to develop a diagnostic method that can achieve a high-precision fault diagnosis result by training with only a small sample.

Disclosure of Invention

The invention aims to provide a rotary machine small sample fault diagnosis method based on a generation countermeasure network, so as to solve the problems in the prior art.

In order to achieve the purpose of the invention, the technical scheme adopted by the invention is as follows:

a rotary machine small sample fault diagnosis method based on a generation countermeasure network comprises the following steps:

s1, collecting vibration signals of the rotating machine;

s2, preprocessing the rotary mechanical vibration signal, and dividing the signal into a training set and a testing set;

s3, establishing an ACWGAN-GP-ARGMAX network model;

s4, training the ACWGAN-GP-ARGMAX network model;

s5, respectively carrying out countermeasure training on the data in each health state in the training set, and fusing the trained network models into a diagnosis model;

s6: and inputting the test set into the diagnosis model for diagnosis test, and outputting a diagnosis result.

Further, a specific implementation manner of step S1 includes arranging a vibration acceleration sensor on the rotary machine, and acquiring data of one-dimensional time series vibration acceleration of the rotary machine in various health states by using the vibration acceleration sensor to form a raw data set, where the raw data set includes 1 normal state and N-1 fault states.

Further, the specific implementation manner of step S2 includes performing feature extraction on the vibration signals in various health states in the original data set by a fast fourier transform and standard normalization processing manner, and constructing a new data set from the processed data;

the standard normalization method is as follows: x ═ X (X-X)_min)/(x_max-x_min) Wherein x is_maxIs the maximum value, x, of the data sample_minIs the minimum value of the data sample;

according to algorithm requirements, dividing the new data set into a training set and a testing set, wherein the training set is expressed as

Where k represents the k-th health state,

the ith training sample in the healthy state.

Further, the ACWGAN-GP-ARGMAX network model in the step S3 includes a generator and a discriminator;

the generator is represented by a network model structure comprising 5 deconvolution layers, and input data is 4096-dimensional random noise which follows normal distribution; in the generator, the random noise is firstly input into a full-connection layer consisting of 4096 neurons, and then passes through the 5-layer deconvolution layer in sequence; the number of the filters in the first four layers of deconvolution layers is 256, 128, 64 and 32 in turn, and the last layer of deconvolution layer only has 1 filter; the size of the first layer of deconvolution kernel is 4 x 4 and the step size is 1, and the sizes of the last four layers of deconvolution kernels are all 3 x 3 and the step size is 2; through the five deconvolution layers, input data is converted into a 64 x 64 two-dimensional matrix;

the discriminator is expressed as a network model structure comprising 5 layers of convolution layers, and input data is a 64 x 64 two-dimensional matrix generated by the generator and a 64 x 64 two-dimensional matrix obtained by shaping real data; in the discriminator, the number of the filters in the first four convolutional layers is 32, 64, 128 and 256 in turn, and the last convolutional layer only has 1 filter; the sizes of the first four convolutional layer convolutional cores are 3 multiplied by 3 and the step length is 2, and the sizes of the last convolutional layer convolutional cores are 4 multiplied by 4 and the step length is 1; through the five convolutional layers, the input data is converted into a tag value.

Further, the specific implementation manner of step S4 includesSelecting Gaussian distribution as noise distribution P_noiseAnd assigning a corresponding class label from which a subset of size n is sampled, denoted as

Random noise subset with class label

Input to a generator G corresponding to the health state k_kForward propagation results in generating a subset of samples

Training set P from real data of said health state k_k～dataTaking the data sample as a positive sample, and recording as

Then training set P from real data outside the health state k_{(C≠k)～data}Taking the data sample as a negative sample, and recording as

By

And

composing the true sample of the health state k, and generating the sample by

Combined to form a training set subset { X_k}；

Setting the discriminator D_kAnd the generator G_kThe ratio of training times is m, first using the training set subset { X_kThe algorithm pairs the discriminator D with Adam_kPerforming m times of training; then using said random noise subset with class labels

And Adam's algorithm to the generator G_kPerforming 1 training; selecting a cross entropy loss function as an objective function, and specifically defining the cross entropy loss function as follows:

where x represents the true sample or samples and where,

it is indicated that the generation of the sample,

by adding at x and

random samples obtained by linear interpolation, lambda being a gradient penalty factor,

the expectation of the distribution of the real data is represented,

indicating the desire to generate a distribution of data,

denotes a gradient penalty term, P (C ═ C | X_real) Representing the conditional probability distribution of the real data on class labels,

representing a conditional probability distribution of the generated data on the class label,

according to the above-mentioned ratio m of training times of each round, for the said discriminator D_kAnd the generator G_kPerforming iterative training until the discriminator D_kAnd the generator G_kAchieve nash equilibrium between them.

Further, the specific implementation manner of step S5 includes,

firstly, marking data samples except the kth health state as negative samples and marking the data samples as class 0; correspondingly, marking the data sample in the kth health state as a positive sample and marking the data sample as a class 1;

second, the labeled data is inputted as a real sample to the discriminator D_kPerforming countermeasure training until the model loss function converges to the minimum value, thereby obtaining a generator and a discriminator of a kth sample;

thirdly, performing the training on each type of sample in turn to obtain n trained ACWGAN-GP-ARGMAX models; separating n discriminators from the data to be used as a trained diagnosis model;

and fourthly, model fusion is carried out on the n discriminators, the probability values of the output results are combined into n-dimensional vectors, and the n-dimensional vectors are accessed into a Softmax activation layer for activation, so that the probability that each value output by the output layer corresponds to each test sample and belongs to each health state is obtained.

Further, between the step S4 and the step S5, there is further included a step S500: and performing quality evaluation on the network model trained in the step S4, and if the network model cannot pass the evaluation, repeating the step S4.

Further, the quality evaluation in step S500 includes:

first, evaluate the generator G_kThe method comprises the steps of comparing the frequency spectrums of a generated sample and a real sample qualitatively, and then measuring the similarity of the generated sample and the real sample quantitatively by calculating the Pearson Correlation Coefficient (PCC) and the Cosine Similarity (CS) of the generated sample and the real sample;

second, the discriminator D is evaluated_kThe classification and identification capability of (2) is specifically realized by using the trained discriminator D_kGenerating a data sample if said discriminator D_kCapable of generating sufficiently realistic generationSamples, these generated samples are further used to continuously align the discriminator D_kTraining is performed so that the discriminator D_kThe hidden distribution in the current sample is learned.

The invention has the beneficial effects that:

1) on the basis of the ACWGAN-GP model, the invention introduces the multi-classification idea of 'ARGMAX', so that the ACWGAN-GP-ARGMAX model has strong generating capability and strong classification and identification capability.

2) The invention integrates the generation of the countermeasure network and the fault diagnosis model, and overcomes the defect that the generation of the countermeasure network cannot be classified under the unsupervised condition, so that the fault diagnosis can be realized by directly utilizing the data set without additionally introducing a discrimination model.

3) The rotary machine small sample fault diagnosis model based on the generated countermeasure network has the advantages of simple structure, few parameters, low requirement on hardware resources and high economy, and has certain generalization capability, and the gear box data set and the bearing data set experiments prove that the rotary machine small sample fault diagnosis accuracy can be effectively improved.

Drawings

FIG. 1 is a basic flow diagram of the rotary machine small sample fault diagnosis method based on the generation countermeasure network of the present invention;

FIG. 2 is a flow chart of an embodiment implementation of the present invention;

FIG. 3 is a diagram of the ACWGAN-GP-ARGMAX network model architecture according to the present invention;

FIG. 4 is a time domain diagram of the raw vibration signal of the class 8 gearbox according to the embodiment of the invention;

FIG. 5 is a graph comparing various types of generated data with the spectrum of the original data in the embodiment of the present invention;

FIG. 6 is a graph of the diagnostic results of 10 independent experiments at different sample ratios in the examples described herein;

FIG. 7 is a graph of the mean values of the diagnostic results of 10 independent experiments at different sample ratios in the example of the present invention;

FIG. 8 is a confusion matrix of the mean values of the diagnostic results of 10 independent experiments under different sample ratios in the example described in the present invention;

FIG. 9 is the average of the diagnostic results of 10 independent experiments under different sample ratios for different diagnostic methods in the examples described herein.

Detailed Description

The invention relates to a rotary machine small sample fault diagnosis method based on a generated countermeasure network, which comprises the following steps:

and S1, acquiring vibration signals of the rotating machine. The method comprises the steps of arranging a vibration acceleration sensor on a rotary machine, collecting data of one-dimensional time series vibration acceleration of the rotary machine in various health states by using the vibration acceleration sensor, and obtaining an original data set, wherein the data set comprises 1 normal state and N-1 fault states.

And S2, preprocessing the rotating mechanical vibration signal. And performing feature extraction on the vibration signals in various health states in the data set by processing modes such as Fast Fourier Transform (FFT), standard normalization and the like, and constructing the processed data into a new data set. The standard normalization method comprises the following steps: x ═ X (X-X)_min)/(x_max-x_min) Wherein x is_maxIs the maximum value of the data sample, x_minIs the minimum value of the data sample. And according to algorithm requirements, dividing the new data set into a training set and a testing set. The training set may be represented as

Where k represents the k-th health state,

the ith training sample in the healthy state.

And S3, establishing an ACWGAN-GP-ARGMAX network model corresponding to the health state. In view of the excellent performance of CNNs in feature extraction and classification, the present invention builds generators and discriminators on the basis of two-dimensional CNNs.

The network model structure of the generator consists of 5 deconvolution layers, and input data is 4096-dimensional random noise which follows normal distribution. In the generator, random noise is first input into a fully connected layer of 4096 neurons, then sequentially through 5 deconvolution layers. The number of the filters in the first four deconvolution layers is 256, 128, 64 and 32 in sequence, and the last deconvolution layer only has 1 filter; the size of the first deconvolution layer convolution kernel is 4 x 4 and the step size is 1, and the sizes of the last four layers of deconvolution layer convolution kernels are all 3 x 3 and the step size is 2; through the five deconvolution layers, the input data is converted into a 64 × 64 two-dimensional matrix.

The network structure of the arbiter is symmetric to the generator. The network model structure of the discriminator is composed of 5 layers of convolution layers, and input data is a 64 x 64 two-dimensional matrix generated by a generator and a 64 x 64 two-dimensional matrix obtained by shaping real data. In the discriminator, the number of filters in the first four convolutional layers is 32, 64, 128 and 256 in turn, and the last convolutional layer only has 1 filter; the sizes of the convolution kernels of the first four layers are 3 multiplied by 3 and the step length is 2, and the sizes of the convolution kernels of the last layer are 4 multiplied by 4 and the step length is 1; through the five convolutional layers, the input data is converted to a tag value.

And S4, training the network model. Selecting Gaussian distribution as noise distribution P_noiseAnd assigning a corresponding class label, from which a subset of size n is sampled, denoted as

Random noise subset with class label

Training set P of real data from health state k_k～dataTaking the data sample as a positive sample, and recording as

By

And

forming a real sample under the condition of the health state k, and generating the sample by

Combined to form a training set subset { X_k}. Setting discriminator D_kAnd generator G_kThe ratio of training times is m, first using a subset of the training set { X }_kDiscriminator D of the pair of Adam and Algorithm_kTraining for m times; then with random noise subsets of class labels

And Adam Algorithm Pair Generator G_kPerforming 1 training; selecting a cross entropy loss function as an objective function, which is specifically defined as follows:

where x represents the true sample or samples and where,

it is indicated that the generation of the sample,

by adding at x and

the expectation of the distribution of the real data is represented,

indicating the desire to generate a distribution of data,

And S5, performing quality evaluation on the network model trained in the S4. The trained model is generally evaluated for quality in two ways. The similarity of generated samples is qualitatively and quantitatively evaluated, namely, the generating capacity of a model generator is evaluated. The method comprises the steps of comparing frequency spectrums of a generated sample and a real sample qualitatively, and then measuring similarity of the generated sample and the real sample quantitatively by calculating a Pearson Correlation Coefficient (PCC) and a Cosine Similarity (CS) of the generated sample and the real sample; and secondly, evaluating the influence of the generated sample on fault diagnosis under the condition of a small sample by sample expansion, namely evaluating the classification and identification capability of the model discriminator. Generating data samples by using the trained generator, and if the generator can generate sufficiently vivid generated samples, further continuously training the discriminator by using the generated samples to ensure that the discriminator D_kThe hidden distribution in the current sample is learned.

S6, respectively carrying out countermeasure training on the data in each health state in the data set, firstly, marking the data samples except the kth health state as negative samples and marking the negative samples as class 0; accordingly, the data sample in the k-th health state is labeled as a positive sample, labeled as class 1. Secondly, inputting the marked data as a true sample into a discriminator for countermeasure training until the model loss function converges to the minimum value, thereby obtaining a generator and a discriminator of the kth sample. Thirdly, the training is carried out on each type of sample in turn to obtain n trained ACWGAN-GP-ARGMAX models. From which n discriminators D are connected_iAnd separating the model to be used as a well-trained diagnosis model. Fourthly, D is_iModel fusion is carried out, and n D are obtained through model fusion_iAnd combining probability values of the output results into an n-dimensional vector, and accessing a Softmax activation layer for activation, so that each value output by the output layer corresponds to the probability that the test sample belongs to each health state.

Thus, a complete fault diagnosis process is achieved.

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings. It should be understood that the detailed description and examples, while indicating the preferred embodiment of the invention, are given by way of illustration only, not limitation.

Example (b):

FIG. 2 is a flow chart of an embodiment of the present invention based on a rotating machinery small sample fault diagnosis method for generating a countermeasure network. As shown in fig. 2, the method for diagnosing a fault of a small sample of a rotating machine based on a generated countermeasure network of the present invention specifically includes the steps of:

and S1, collecting vibration signals of the rotary machine by using the vibration acceleration sensor.

And S2, preprocessing the acquired vibration signals of the rotating machine to construct an original data set, and dividing the original data set into a training set and a testing set.

S3, constructing a network model based on ACWGAN-GP-ARGMAX.

And S4, performing model training on the network model based on ACWGAN-GP-ARGMAX, and fusing the trained network models in each health state to construct a diagnosis model.

And S5, inputting the test set into a diagnosis model to perform diagnosis test.

FIG. 3 is a diagram of the ACWGAN-GP-ARGMAX network model architecture according to the present invention. As shown in fig. 3, the ACWGAN-GP-ARGMAX network model comprises the following modules: the device comprises a generator module, a discriminator module, a data input module and a diagnosis module.

The present invention and its effects are specifically described below by way of an example.

The experimental data for this example is from the gearbox dataset provided by the IEEE PHM challenge in 2009, which is representative of general industrial gearbox data. The physical diagram of the gearbox is shown in figure 3. The data set contains a total of 8 time-domain vibration signals of different health states. All data in the data set were collected from the input shaft under high load conditions, with a shaft speed of 30Hz, a sampling frequency of 66.7kHz, and a sampling time of 4 s. On the premise of ensuring that the information contained in the diagnosis signal of the gearbox is not lost, 400 health state data samples of each type are selected. The sample length is 8192, and each sample is a time domain signal, so after the FFT, each type of data sample becomes a spectrum vector with a sample length of 4096. Details of the samples used are shown in table 1, with the time domain waveforms shown in fig. 4.

TABLE 1 gearbox data set operating mode details

IS:Input Shaft；ID:Idler Shaft；OS:Output Slide.

G:Good；C:Chipped；E:Eccentric；Br:Broken；B:Ball；I:Inner race；O:Outer race；Im:Imbalance；K:Keyway sheared.

Taking 200 samples of each type of healthy state in the data set as training sets respectivelyAnd used for testing. A part of samples are taken from the training set to be used as a training subset, and the ratio of the number of training subset samples to the number of test set samples is called a sample ratio. In the experiment, sample comparison is divided according to 1:20, 1:10, 1:5, 1:4, 1:2 and 1:1 respectively, and the number of various training samples in each sample proportion is shown in table 2. During the course of the confrontation training, the size of minipatch is set to 8 at a sample ratio of 1:20, and the size of minipatch is set to 16 in all the other cases. In the anti-training process of each wheel, the ratio of the update iteration times of the discriminator to the generator is set to be 5:1, namely, the generator is updated for 1 time when the discriminator is updated for 5 times. The learning rate is set to 0.0001, and the number of training iterations is 10000; arbiter for learning kth health status

For example, all training samples in the kth health state are selected as positive samples, the label type is 1, and part of the training samples in the remaining 7 health states are selected as negative samples, and the label type is 0, where the number of the positive samples and the number of the negative samples are approximately equal.

TABLE 2 number of training samples in various health states under different sample ratios

After the network model is pre-trained, 200 new samples are generated for each type of health state using each generator model. In order to evaluate a new sample, firstly, the frequency spectrums of the real sample and the generated sample are visually displayed in the same graph, as shown in fig. 5; next, the generated samples were evaluated by the values of the similarity indices PCC and CS, which are shown in table 3.

TABLE 3 PCC and CS between generated data and raw data under each type of health status

As can be seen from fig. 5, the overlap ratio of the generated sample and the true sample of each health state is high; as shown in table 3, the PCC and CS were higher than 0.5 for all healthy state samples, indicating that the generated samples have a higher similarity to the real samples. In conclusion, the generator subjected to the countermeasure training can accurately learn the implicit data distribution in the real sample and generate the sample with high similarity to the real sample.

In order to test the performance of the fault diagnosis method based on ACWGAN-GP-ARGMAX, the comparison experiment based on the ACWGAN-GP diagnosis model is arranged. For each sample ratio, 10 independent experiments were performed, and the experimental results are shown in fig. 6 and 7. Wherein data1 represents ACWGAN-GP diagnostic model and data2 represents ACWGAN-GP-ARGMAX diagnostic model. As can be seen from FIG. 6, the ACWGAN-GP-ARGMAX diagnostic model has significantly improved diagnostic accuracy and stability compared to the ACWGAN-GP diagnostic model. As can be seen from FIG. 7, with the increase of the proportion of the training samples, the diagnosis accuracy of the ACWGAN-GP-ARGMAX diagnosis model and the ACWGAN-GP diagnosis model is obviously improved, but the diagnosis effect of the ACWGAN-GP-ARGMAX diagnosis model is better. In order to more intuitively explain the diagnosis result of the ACWGAN-GP-ARGMAX diagnosis model, the diagnosis process is visualized by using a confusion matrix by taking each sample ratio as an example, as shown in FIG. 8. As can be seen from FIG. 8, when the sample ratio is 1:20, some of the data in Class2,4,6, and 8 are classified as erroneous. With the increasing sample proportion, the sample diagnosis accuracy rate is increased gradually, and when the sample proportion reaches 1:1, only few samples are classified wrongly.

In order to prove the superiority of the method, several common methods are adopted to be compared with ACWGAN-GP-ARGMAX in the aspect of solving the problem of a small sample for fault diagnosis, and the common methods comprise the following steps: CNN, DNN, SVM, SAE, GAN. The fault diagnosis accuracy of all the above methods was obtained by training and testing each method under the same conditions, as shown in fig. 9.

As can be seen from fig. 9, the failure accuracy of all methods increases with the sample scale, indicating that these methods are effective in failure diagnosis. Meanwhile, as is also apparent from fig. 9, the method provided by the invention has higher fault diagnosis accuracy and better effect under the condition of small samples compared with other methods.

In summary, according to the rotary machine small sample fault diagnosis method based on the generation countermeasure network, the generation countermeasure network is combined with the ARGMAX multi-classification idea to construct the diagnosis model based on the ACWGAN-GP-ARGMAX, and therefore the accuracy of rotary machine small sample fault diagnosis can be effectively improved.

The foregoing is only a preferred embodiment of the present invention, and it should be noted that it is obvious to those skilled in the art that various modifications and improvements can be made without departing from the principle of the present invention, and such modifications and improvements should be considered within the scope of the present invention.

Claims

1. A rotary machine small sample fault diagnosis method based on a generation countermeasure network is characterized by comprising the following steps:

s1, collecting vibration signals of the rotating machine;

s3, establishing an ACWGAN-GP-ARGMAX network model;

s4, training the ACWGAN-GP-ARGMAX network model;

2. The method for diagnosing the fault of the rotating machine based on the small sample generated the countermeasure network as claimed in claim 1, wherein the step S1 is implemented by disposing a vibration acceleration sensor on the rotating machine, and acquiring data of one-dimensional time series vibration acceleration of the rotating machine in various health states by using the vibration acceleration sensor to form a raw data set, wherein the raw data set includes 1 normal state and N-1 fault states.

3. The method for diagnosing the fault of the rotating machinery small sample based on the generation countermeasure network as claimed in claim 1, wherein the step S2 is implemented by performing feature extraction on the vibration signals under various health states in the original data set through fast fourier transform and standard normalization processing, and constructing the processed data into a new data set;

the standard normalization method is as follows: x ═ X (X-X)_min)/(x_max-x_min) Wherein x is_maxIs the maximum value of the data sample, x_minIs the minimum value of the data sample;

Where k represents the k-th health state,

the ith training sample in the healthy state.

4. The rotating machine small sample fault diagnosis method based on generation countermeasure network as claimed in claim 3, wherein said ACWGAN-GP-ARGMAX network model in step S3 comprises a generator and a discriminator;

the generator is represented by a network model structure comprising 5 deconvolution layers, and input data is 4096-dimensional random noise which follows normal distribution; in the generator, the random noise is firstly input into a full-connection layer consisting of 4096 neurons, and then passes through the 5-layer deconvolution layer in sequence; the number of the filters in the first four layers of deconvolution layers is 256, 128, 64 and 32 in turn, and the last layer of deconvolution layer only has 1 filter; the size of the first layer of deconvolution kernel is 4 x 4 and the step size is 1, and the sizes of the last four layers of deconvolution kernels are all 3 x 3 and the step size is 2; through the five deconvolution layers, the input data is converted into a 64 x 64 two-dimensional matrix;

5. The method for fault diagnosis of rotating machinery small sample based on generation countermeasure network as claimed in claim 4, wherein the step S4 is implemented by selecting Gaussian distribution as noise distribution P_noiseAnd assigning a corresponding class label from which a subset of size n is sampled, denoted as

Random noise subset with class label

Training set P from real data of said health state k_k～dataIs taking a data sample as positiveSample, is marked as

By

And

composing the true sample of the health state k, and generating the sample by

Combined to form a training set subset { X_k}；

And Adam's algorithm to the generator G_kPerforming 1 training; selecting a cross entropy loss function as an objective function, which is specifically defined as follows:

wherein x represents a true sampleThe utility model relates to a novel water-saving device,

it is indicated that the generation of the sample,

by adding at x and

the expectation of the distribution of the real data is represented,

indicating the desire to generate a distribution of data,

denotes a gradient penalty term, P (C ═ C | X_real) Representing the conditional probability distribution of the real data on the class label,

6. The rotating machinery small sample fault diagnosis method based on generation countermeasure network as claimed in claim 5, wherein the step S5 includes,

firstly, marking data samples except the kth health state as negative samples and marking the data samples as class 0; correspondingly, marking the data sample in the kth health state as a positive sample and marking the data sample as a type 1;

and fourthly, model fusion is carried out on the n discriminators, the probability value of the output result is combined into an n-dimensional vector, and the n-dimensional vector is accessed into a Softmax activation layer for activation, so that the probability that each value output by the output layer corresponds to each test sample and belongs to each health state is obtained.

7. The rotating machinery small sample fault diagnosis method based on generation of countermeasure network as claimed in claim 5, wherein between said step S4 and said step S5 further comprising the step S500: and performing quality evaluation on the network model trained in the step S4, and if the network model cannot pass the evaluation, repeating the step S4.

8. The method for diagnosing faults of small samples of rotating machinery based on generation of countermeasure network as claimed in claim 5, wherein the quality evaluation in the step S500 comprises:

second, the discriminator D is evaluated_kThe classification and identification capability of (2) is specifically realized by using the trained discriminator D_kGenerating a data sample if said discriminator D_kCan generate sufficiently realistic generated samples, the generated samples are further used to continuously perform a comparison with the discriminator D_kTraining is performed so that the discriminator D_kThe hidden distribution in the current sample is learned.