CN113778811A

CN113778811A - Method and system for fault monitoring of software system based on deep convolution transfer learning

Info

Publication number: CN113778811A
Application number: CN202111157772.2A
Authority: CN
Inventors: 吴勇; 廖明霞; 董一英; 沈谷峰; 杨婷婷
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2021-09-28
Filing date: 2021-09-28
Publication date: 2021-12-10

Abstract

The invention relates to a software system fault monitoring method and system based on deep convolution transfer learning, belonging to the field of computer software testing, comprising the following steps: collecting a software system load data set under an existing load S, constructing a source domain data set; A set of original response times are divided by points to construct the source domain sample data set; the target domain data set is constructed, and each group of original response times in the target domain data set is divided by points to construct the target domain sample data set; the source domain sample data set is constructed; The dataset and target domain sample dataset utilize deep convolution transfer learning to implement fault monitoring for software systems. The present invention can still obtain a relatively ideal fault monitoring effect when there are few fault samples under multiple loads or a certain fault sample is missing, and the data set under the new load does not need to retrain the network model, and can Save a lot of time.

Description

Fault monitoring method and system based on deep convolution migration learning software system

Technical Field

The invention belongs to the field of computer software testing, and relates to a method and a system for monitoring system faults based on deep convolution transfer learning software.

Background

With the increasing size and complexity of computer software, the quality of the computer software is difficult to be effectively controlled and guaranteed. In the software system, when the load is applied to the adjacent boundary in the running process of a large number of users, the software system can have faults of different degrees. How to effectively extract and utilize the existing response time information to quickly and accurately identify and predict software faults is a key problem in the field of software fault monitoring at present.

The software system cannot respond or stop running, which causes poor user experience for users, and may cause great surface influence on company image, even may cause great damage. Therefore, by monitoring, monitoring and downtime prediction of the software system, the equipment is expanded or distributed for maintenance when the equipment fails or is about to fail, and the method has important significance for improving the reliability and the economy of the software system.

In the process of monitoring the actual load, the software system is usually operated under the condition of different loads, the response time is short, and the fault state of the software system is less. Therefore, the fault state data monitored and collected by the software system in the actual load has the characteristics of multiple loads, fewer fault state samples and even the defect of a certain fault state sample. When the traditional diagnosis method is faced with fault data samples under different loads, a network model needs to be re-established when the loads change, and the process of the traditional diagnosis method takes a lot of time. Moreover, most of the traditional diagnosis methods rely on a large amount of fault label data, and when the conditions that the fault state samples are insufficient or the fault state samples are missing occur, the generalization capability of the traditional network model is poor, and the fault monitoring effect is not ideal.

Disclosure of Invention

In view of the above, an object of the present invention is to provide a method and a system for monitoring a fault of a software system based on deep convolution migration learning, which can still obtain an ideal fault monitoring effect when the conditions of fewer fault samples under multiple loads or missing a certain fault sample occur, and can save a lot of time without retraining a network model for a data set under a new load. The transfer learning is a learning method for solving problems in different but related fields by using existing knowledge, and the method realizes field knowledge sharing by transferring the knowledge obtained by learning in a source field into a target field, thereby solving the problem of poor performance of a training model caused by few learning samples and unbalanced sample distribution in the target field. Compared with methods such as incremental learning, multi-task learning and self-learning, the migration learning emphasizes the correlation between learning tasks and utilizes the correlation to complete the migration between knowledge. The concept of Deep learning originates from the field of artificial intelligence machine learning, and a Deep Neural Network (DNN) model composed of multiple hidden layers is a remarkable characteristic of the Deep learning model. Compared with a shallow neural network model, the DNN can combine bottom layer features to form more abstract high-level feature representation, so that implicit feature expression of data is found, and features of information are effectively extracted and represented through layer-by-layer conversion of data features. Transfer Learning (Transfer Learning) is a machine Learning method, which transfers knowledge in one field (i.e., a source field) to another field (i.e., a target field) to enable the target field to obtain a better Learning effect.

In order to achieve the purpose, the invention provides the following technical scheme:

on one hand, the invention provides a software system fault monitoring method based on deep convolution transfer learning, which comprises the following steps:

s1: collecting a software system load data set under the existing load S, and constructing a source domain sample data set;

s2: point division is carried out on each group of original response time, and a source domain data set is constructed;

s3: constructing a target domain sample data set, and performing point segmentation on each group of original response time in the target domain data set to construct a target domain data set;

s4: and carrying out fault monitoring on the software system by using the source domain data set and the target domain data set through deep convolution transfer learning.

Further, in step S1, the software system load sample data set under the existing load S is classified into w states according to the fault type, and the original response time under each fault type

Where w represents the data class, w is 1, 2, 3 … n, x₀～x_nRepresented as the 1 st to n +1 th group fault signals in the w fault state.

Further, in the step S2, the source domain data set construction method includes the following steps:

s21: setting a window sliding step length s and a window length l according to the number N of data points, and generating a sample number t; sample d_i＝{X₀，X₁，X₂，...X_L1, 2, 3, ·, t; obtaining a source domain data set M from a sample^s＝{d₁，，，d₂，，d₃，…d_t，}；

S22: setting a source domain test set in a source domain data set

And source domain training set

R, the source domain training set

Sample number a ═ t · r, source domain test set

The sample number b is t (1-r).

Further, in step S3, the machine response time of the software system under different loads in the four states of normal operation state, data abnormality, local user abnormality, and downtime is collected

Constructing a target domain sample data set according to the machine response time

Wherein, w' is 1, 2, 3, 4, which respectively represents four states of normal operation state, program data abnormity, local error and downtime.

Further, setting window sliding step length and window length, and constructing a target domain data set M^T(ii) a Setting the proportion of the test set and the training set, and constructing a target domain data training set

And target domain data test set

Further, in step S4, the fault monitoring includes the following steps:

s41: training source domain data

Inputting a set of one-dimensional depth convolution neural network I to pre-train and initialize network parameters, and testing the set through a source domain

Testing the network effect, if the testing effect is ideal, pre-training to finish determining parameters and finishing training the network, otherwise, continuously adjusting the network to perform back propagation and continuously updating the parameters until the network achieves the ideal effect on the test set to finish training;

s42: targeting domain dataset M using convolutional neural network hierarchy^TPerforming transfer learning, freezing the global mean pooling layer L in the feature extraction module and the feature classification module of the one-dimensional deep convolutional neural network I_GAnd L in the full connection layer_FAdding a new Softmax layer for the network model I to adapt to the target domain data set

Completing network level adjustment and constructing new network I₂；

S43: to network I₂Fine tuning is performed by locking feature classification modules D, and L₁，L₂，L₃Weight parameter of layer, unfreezing L₄Layer parameters, obtaining network I after fine adjustment₃；

S44: acquiring original fault signals of the software system in real time and transmitting the signals to a network I₃And obtaining a fault monitoring result of the current software system.

Further, the one-dimensional depth convolution neural network I model construction method in step S41 includes the following steps:

(1) construction of a convolution pooling layer L_j：

L_j＝{C_j，P_j，B_j}

In the formula, C_j、P_j、B_jThe convolution layer, the pooling layer and the normalization layer are respectively used for feature extraction; j is the number of the convolution pooling module;

(2) stacking 4 convolution pooling layers to construct a feature extraction module S', S ═ L₁，L₂，L₃，L₄}；

(3) Adding a characteristic classification module D, D ═ L_G，L_F，L_softmaxThe feature classification module comprises a global mean pooling layer L_GAll-connected layer L_FSoftmax layer

And completing the network construction.

Further, in the step S43, the target domain training data set is used

To network I₃Training is carried out to enable the network to extract deep abstract features from the target domain data set

Via the full connection layer L_FSoftmax layer

And outputting the fault probability distribution of each fault type of the target domain, wherein the maximum probability of the fault probability distribution corresponds to the fault type and serves as a diagnosis result.

On the other hand, the invention provides a load diagnosis system based on a deep convolution migration learning software system, which comprises a source domain sample data set construction module, a source domain data set construction module, a target domain data set construction module and a fault monitoring module;

the source domain sample data set construction module collects a software system load data set under the existing load S and constructs a source domain sample data set;

the source domain data set construction module performs point number segmentation on each group of original response time to construct a source domain data set;

the target domain data set construction module constructs a target domain sample data set, and performs point segmentation on each group of original response time in the target domain data set to construct a target domain sample data set;

and the fault monitoring module carries out fault monitoring on the software system by using the deep convolution transfer learning on the source domain data set and the target domain data set.

Further, when the fault monitoring module detects a fault, the method includes: training set of source domain data

Inputting one-dimensional deep convolution neural network I to pre-train and initialize network parameters, and passing through a source domain test set

Testing the network effect of the network, if the test effect is ideal, pre-training to complete the determination of parameters and complete the training of the network, otherwise, continuously adjusting the network to perform back propagation and continuously updating the parameters until the network achieves the ideal effect on the test set to complete the training; targeting domain dataset M using convolutional neural network hierarchy^TPerforming transfer learning, freezing the global mean pooling layer L in the feature extraction module S' and the feature classification module of the one-dimensional deep convolutional neural network I_GAnd weighting parameters of LF in the full connection layer, and adding a new Softmax layer for adapting the network model I to the target domain data set

New network I is constructed by adjusting transmission completion network level₂(ii) a Acquiring original fault signals of the software system in real time and transmitting the signals to a network I₃And obtaining a fault monitoring result of the current software system.

The invention has the beneficial effects that: compared with the traditional fault monitoring method, the deep winding machine migration learning method provided by the invention still has higher fault monitoring precision when a few sample data sets or missing sample data sets are faced. 2. Compared with the traditional fault monitoring method, the deep convolution transfer learning method provided by the invention utilizes the convolutional neural network hierarchical structure transfer learning, and can save a large amount of time when new load training is faced.

Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objectives and other advantages of the invention may be realized and attained by the means of the instrumentalities and combinations particularly pointed out hereinafter.

Drawings

For the purposes of promoting a better understanding of the objects, aspects and advantages of the invention, reference will now be made to the following detailed description taken in conjunction with the accompanying drawings in which:

FIG. 1 is a schematic overall flow chart of a fault monitoring method based on a deep convolution migration learning software system according to the present invention;

FIG. 2 is a schematic diagram of a one-dimensional convolutional network structure according to the present invention;

FIG. 3 is a schematic diagram of the network migration learning of the present invention;

fig. 4 is a schematic diagram of network migration learning fine tuning according to the present invention.

Detailed Description

The embodiments of the present invention are described below with reference to specific embodiments, and other advantages and effects of the present invention will be easily understood by those skilled in the art from the disclosure of the present specification. The invention is capable of other and different embodiments and of being practiced or of being carried out in various ways, and its several details are capable of modification in various respects, all without departing from the spirit and scope of the present invention. It should be noted that the drawings provided in the following embodiments are only for illustrating the basic idea of the present invention in a schematic way, and the features in the following embodiments and examples may be combined with each other without conflict.

Wherein the showings are for the purpose of illustrating the invention only and not for the purpose of limiting the same, and in which there is shown by way of illustration only and not in the drawings in which there is no intention to limit the invention thereto; to better illustrate the embodiments of the present invention, some parts of the drawings may be omitted, enlarged or reduced, and do not represent the size of an actual product; it will be understood by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted.

The same or similar reference numerals in the drawings of the embodiments of the present invention correspond to the same or similar components; in the description of the present invention, it should be understood that if there is an orientation or positional relationship indicated by terms such as "upper", "lower", "left", "right", "front", "rear", etc., based on the orientation or positional relationship shown in the drawings, it is only for convenience of description and simplification of description, but it is not an indication or suggestion that the referred device or element must have a specific orientation, be constructed in a specific orientation, and be operated, and therefore, the terms describing the positional relationship in the drawings are only used for illustrative purposes, and are not to be construed as limiting the present invention, and the specific meaning of the terms may be understood by those skilled in the art according to specific situations.

As shown in fig. 1, the present invention provides a method for monitoring system faults based on deep convolution transfer learning software, which includes the following steps:

s1, collecting the software system load data set under the existing load S, and constructing the source domain sample data set

The software system load sample data set under the existing load S is classified into w states according to fault types, and the original response time under each fault type

Where w represents the data class, w is 1, 2, 3 … n, x_NRepresented as the nth set of fault signals in the w fault state.

S2, performing point number segmentation on each group of original response time to construct a source domain data set;

to be provided with

One set of signals x in (1)₁For example, for x₁Point segmentation is carried out to construct a source domain sample data set, and the specific steps are as follows:

and S21, setting a window sliding step length S and a window length l according to the number N of the data points, and generating a sample with the number t. Sample d_i＝{X₀，X₁，X₂，...X_L1, 2, 3, ·, t; obtaining a source domain data set M from a sample^s＝{d₁，，，d₂，，d₃，…d_t，}；

S22, setting a source domain test set in a source domain data set

And source domain training set

R, the source domain training set

Sample number a ═ t · r, source domain test set

Sample number b ═ t · (1-r); in the present embodiment, the ratio r is preferably 0.3.

S3, constructing a target domain sample data set

Dividing the number of points of each group of original response time in the target domain data set to construct a target domain data set;

collecting original response time of software system under different loads in four states of normal operation state, abnormal program data, local error and downtime

Constructing a target domain sample data set according to the response time

W is 1, 2, 3 and 4, which respectively represent a normal running state, tooth surface abrasion, planet gear tooth breakage and rolling element bearing loss;

setting window sliding step size and window length according to step S21, and constructing target domain data set M^TSetting the ratio of the test set to the training set according to the step S22, and constructing a training set of target domain data

And target domain data test set

S4, carrying out fault monitoring on the software system by using the deep convolution transfer learning of the source domain data set and the target domain data set, which comprises the following specific steps:

s41, training set of source domain data

And testing the network effect of the network, if the test effect is ideal, pre-training to finish determining parameters and finishing training the network, otherwise, continuously adjusting the network to perform back propagation and continuously updating the parameters until the network achieves the ideal effect on the test set to finish training. The initialization of the internal parameters of the network comprises the steps of setting learning rate, activating function, weighting parameters, extracting characteristics and the like.

As shown in fig. 2, the method for constructing the one-dimensional depth convolution neural network I model includes:

(1) construction of a convolution pooling layer L_j：

L_j＝{C_j，P_j，B_j}

In the formula, C_j、P_j、B_jThe convolution layer, the pooling layer and the normalization layer are respectively used for feature extraction; j is the convolution pooling module number.

(2) Superposing 4 convolution pooling layers to construct a feature extraction module S, S ═ L₁，L₂，L₃，L₄}。

And completing the network construction.

The source domain data training set passes through C of each convolution pooling layer in the feature extraction module_j、P_j、B_jConvolution kernel operation, pooling operation, normalization operation output characteristics of

Superposition of 4 convolutional pooling layers S ═ L₁，L₂，L₃，L₄Get the final characteristics

Final characteristics

Outputting the characteristic value y after passing through the global mean pooling layer_fg(ii) a Full connection layer pair y_fgPerforming characteristic combination and Dropodt operation to output characteristic value y_tAnd is combined with y_tAnd (4) inputting the probability distribution of each fault type of the source domain into a Softmax classifier, and taking the maximum probability of the probability distribution corresponding to the fault type as a diagnosis result.

S42, utilizing the convolutional neural network hierarchy structure to carry out the data set M of the target domain^TPerforming transfer learning, freezing a feature extraction module S of the network model I and a global mean pooling layer L in the feature classification module_GAnd L in the full connection layer_FAdding a new Softmax layer for the network model I to adapt to the target domain data set

Completing network level adjustment and constructing new network I₂As shown in fig. 3.

Training set using target domain data

For new network 1₂Training and updating Softmax layer

Pass the target domain test set

And testing the network, finishing the transfer learning if the testing effect is ideal, and otherwise, continuing to perform network iteration and performing back propagation until the network achieves the ideal effect on the testing set.

S43, network I₂Fine tuning is performed by locking feature classification modules D, and L₁，L₂，L₃Weight parameter of layer, unfreezing L₄Layer parameters, obtaining network I after fine adjustment₃As shown in fig. 4.

Training a data set using a target domain

Via the full connection layer L_FSoftmax layer

S44, real-time obtaining step S3_SoftwareThe original fault signal of the system is transmitted to the network I in step S43₃And obtaining a fault monitoring result of the current software system.

The invention also provides a system load diagnosis system based on the deep convolution transfer learning software, which comprises: the system comprises a source domain sample data set construction module, a source domain data set construction module, a target domain data set construction module and a fault monitoring module;

a source domain data set construction module collects a software system load data set under the existing load S and constructs a source domain sample data set;

the target domain data set construction module constructs a target domain sample data set, and performs point segmentation on each group of original response time in the target domain sample data set to construct a target domain data set;

In the above embodiment, in the fault monitoring module, the fault monitoring includes the following steps:

training set of source domain data

Testing the network effect of the network, if the test effect is ideal, pre-training to complete the determination of parameters and complete the training of the network, otherwise, continuously adjusting the network to perform back propagation and continuously updating the parameters until the network achieves the ideal effect on the test set to complete the training;

targeting domain dataset M using convolutional neural network hierarchy^TPerforming transfer learning, freezing a feature extraction module S of the network model I and a global mean pooling layer L in the feature classification module_GAnd L in the full connection layer_FAdding a new Softmax layer for the network model I to adapt to the target domain data set

Completing network level adjustment and constructing new network I₂；

To network I₂Fine tuning is performed by locking feature classification modules D, and L₁，L₂，L₃Weight parameter of layer, unfreezing L₄Layer parameters, obtaining network I after fine adjustment₃；

Acquiring original fault signals of the software system in real time and transmitting the signals to a network I₃And obtaining a fault monitoring result of the current software system.

In conclusion, the invention constructs the one-dimensional deep convolutional neural network, performs the migration learning by utilizing the hierarchical structure of the one-dimensional convolutional neural network, and provides the software system fault monitoring method based on the deep convolutional migration learning. The invention uses the existing source domain data set to pre-train the one-dimensional convolutional neural network, and uses the hierarchical structure of the one-dimensional convolutional neural network to complete the transfer learning of the target domain data set.

Finally, the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit the present invention, and although the present invention has been described in detail with reference to the preferred embodiments, it will be understood by those skilled in the art that modifications or equivalent substitutions may be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions, and all of them should be covered by the claims of the present invention.

Claims

1. a software system fault monitoring method based on deep convolution migration learning, is characterized in that: comprise the following steps:

S1: Collect the software system load data set under the existing load S, and construct the source domain sample data set;

S2: Divide each group of original response times by points to construct a source domain data set;

S3: Construct the target domain sample data set, and divide each group of original response times in the target domain data set by points to construct the target domain data set;

S4: The source domain dataset and the target domain dataset are used for deep convolution transfer learning to realize fault monitoring of the software system.

2. The software system fault monitoring method based on deep convolution transfer learning according to claim 1, wherein in step S1, the software system load sample data set under the existing load S is classified as w according to the fault type states, raw response time under each fault type

Where w represents the data type, w=1, 2, 3...n, x ₀ ～x _n are represented as the 1st ～n+1 groups of fault signals in the w fault state.

3. The software system fault monitoring method based on deep convolution transfer learning according to claim 1, wherein in the step S2, the source domain data set construction method comprises the following steps:

S21: Set the window sliding step size _s and the window length _l _according to the number of data points _N , and the number of generated samples is t _; 2, 3,..., t; obtain the source domain data set M ^s ={d ₁ ,,,d ₂ ,,d ₃ ,...d _t ,} according to the sample;

S22: Set the source domain test set in the source domain dataset

with the source domain training set

The ratio of r is r, then the source domain training set

Number of samples a = t r, source domain test set

Number of samples b=t·(1-r).

4. The method for monitoring software system faults based on deep convolution transfer learning according to claim 1, characterized in that: in the step S3, the software systems under different loads are collected in normal operation state, data abnormality, local user abnormality and Machine response time in four states of downtime

Build a target domain sample dataset based on the machine response time

Among them, w'=1, 2, 3, and 4, respectively representing four states of normal running state, abnormal program data, partial error and downtime.

5. the software system fault monitoring method based on deep convolution transfer learning according to claim 4, is characterized in that: setting window sliding step size and window length, build target domain data set ^MT ; Set test set and training set Proportion, construct the target domain data training set

and the target domain data test set

6. The fault monitoring method based on deep convolution transfer learning software system according to claim 1, is characterized in that: in described step S4, fault monitoring comprises the following steps:

S41: Train the source domain data

Set input one-dimensional deep convolutional neural network I for pre-training to initialize network parameters, and pass the source domain test set

Test the network effect. If the test effect is ideal, the pre-training is completed to determine the parameters, and the training network is completed. Otherwise, continue to adjust the network for back-propagation and continuously update the parameters until the network achieves the ideal effect on the test set to complete the training;

S42: Use the convolutional neural network hierarchical structure to perform migration learning on the target domain data set ^MT , and freeze the global mean pooling layer _LG and the global mean pooling layer LG in the feature extraction module and the feature classification module of the one-dimensional deep convolutional neural network I. The weight parameter of LF in the connection layer, adding a new _Softmax layer for the adaptation of the network model I to the target domain dataset

Complete the network level adjustment to construct a new network I ₂ ;

S43: Fine-tune the network I ₂ , and obtain the fine-tuned network I ₃ by locking the feature classification module D, and the weight parameters of the L ₁ , L ₂ , and L ₃ layers, and unfreezing the parameters of the L ₄ layer;

S44: Acquire the original fault signal of the software system in real time, transmit it to the network _I3 , and obtain the fault monitoring result of the current software system.

7. the software system fault monitoring method based on deep convolution transfer learning according to claim 6, is characterized in that: the one-dimensional deep convolutional neural network I model construction method described in step S41 comprises the following steps:

(1) Construct the convolution pooling layer L _j :

L _j ={C _j , P _j , B _j }

In the formula, C _j , P _j , and B _j are the convolution layer, pooling layer, and normalization layer, which are used for feature extraction; j is the number of the convolution pooling module;

(2) Stacking four convolution pooling layers to construct a feature extraction module S', S'={L ₁ , L ₂ , L ₃ , L ₄ };

(3) Add a feature classification module D, D={L _G , L _F , L _softmax }, the feature classification module includes a global mean pooling layer _LG , a fully connected layer _LF , and a Softmax layer

Complete the network construction.

8. The fault monitoring method for a software system based on deep convolution transfer learning according to claim 6, wherein in the step S43, a target domain training data set is used

Train the network _I3 so that the network extracts deep abstract features from the target domain dataset

Via the fully connected layer LF and the _Softmax layer

The fault probability distribution of each fault type in the target domain is output, and the maximum probability corresponds to the fault type as the diagnosis result.

9. A software system load diagnosis system based on deep convolution transfer learning, characterized in that: it comprises a source domain sample data set building module, a source domain data set building module, a target domain data set building module and a fault monitoring module;

The source domain sample data set construction module collects the software system load data set under the existing load S, and constructs the source domain sample data set;

The source domain data set building module divides each group of original response times by points to construct a source domain data set;

The target domain data set building module constructs a target domain sample data set, and divides each group of original response times in the target domain data set by points to construct a target domain sample data set;

The fault monitoring module utilizes the deep convolution transfer learning of the source domain data set and the target domain data set to realize fault monitoring of the software system.

10. The system load diagnosis system based on deep convolution transfer learning software system according to claim 9, characterized in that: when the fault monitoring module performs fault detection, it comprises: the source domain data training set

Input the one-dimensional deep convolutional neural network I for pre-training to initialize the network parameters, and pass the source domain test set

Test the network effect of the network. If the test effect is ideal, the pre-training is completed to determine the parameters, and the training network is completed. Otherwise, continue to adjust the network for back-propagation and continuously update the parameters until the network achieves the ideal effect on the test set to complete the training; use convolution The neural network hierarchical structure performs migration learning on the target domain data set MT, and freezes the feature extraction module S′ of the one-dimensional deep convolutional neural network I, the global mean pooling layer _LG in the feature classification module, and the fully connected layer L. The weight parameter of _F , adding a new Softmax layer to adapt the network model I to the target domain dataset

After completing the network level adjustment and constructing a new network I ₂ ; obtain the original fault signal of the software system in real time, transmit it to the network I ₃ , and obtain the fault monitoring result of the current software system.