CN112766303A

CN112766303A - CNN-based aeroengine fault diagnosis method

Info

Publication number: CN112766303A
Application number: CN202011535827.4A
Authority: CN
Inventors: 全哲; 高晋峰; 肖桐; 郭燕; 李磊
Original assignee: Hunan University
Current assignee: Hunan University
Priority date: 2020-12-23
Filing date: 2020-12-23
Publication date: 2021-05-07
Anticipated expiration: 2040-12-23
Also published as: CN112766303B

Abstract

The invention discloses a CNN-based aeroengine fault diagnosis method, wherein the data set used for prevention is gas path parameters collected by an aeroengine sensor, wherein the gas path parameters comprise gas path parameters when various faults occur and gas path parameters under normal conditions, and the data are collected according to a time sequence, the characteristic that a convolutional neural network can fully excavate the change between the gas path parameters back and forth is used, compared with the traditional method for modeling discrete data (analyzing the data at a specific moment), the method not only considers the change of specific values of the gas path parameters at different moments, but also further considers the trend characteristic of the parameter change at continuous moments and the relation back and forth, and as the CNN is used with certain translational invariance, the generalization capability is better, more comprehensive, more advanced and more complex characteristics can be obtained, and then a novel loss function is provided, and the method is used for evaluating the classification result of the model so as to realize the diagnosis of the fault.

Description

CNN-based aeroengine fault diagnosis method

Technical Field

The invention belongs to the field of engines, and particularly relates to a CNN-based aircraft engine fault diagnosis method.

Background

The aircraft engine is one of the most central components of an aircraft, is a system with high complexity, and the health condition of the aircraft engine is an important prerequisite for ensuring the flight safety of the aircraft. The relevant data show that more than 50% of the flight accidents in the last decade are caused by the failure of the aircraft engine, and in addition, the maintenance expenditure of the aircraft engine accounts for up to 40% of the global aircraft maintenance industry, so that the reliable and stable operation of the engine is guaranteed, and the method has great significance for reducing the maintenance cost of the airlines and manufacturers, shortening the maintenance period and the engine stop time and improving the operation efficiency of the engine. The fault detection technology of the aircraft engine is one of the most important core technologies. At present, the mainstream intelligent fault detection algorithm is mainly based on a neural network method and a support vector regression method, and an offset is obtained by converting an aero-engine gas path measurement parameter into a standard state and calculating a difference value with a corresponding engine performance baseline (or a reference value), and fault diagnosis and performance prediction are performed through the offset and a change trend thereof. On the other hand, feature extraction is carried out on limited gas circuit measurement data change through an artificial intelligence technology, and the method becomes a new means for diagnosing faults of the aero-engine.

The current common practice is mainly divided into the following steps:

1. a baseline modeling method based on a neural network. With the rapid development of artificial intelligence, the Neural Network provides possibility for solving uncertain input and output description existing in engine baseline modeling, an aircraft engine performance parameter baseline library is constructed by analyzing performance parameters of a factory monitoring system by adopting a nonlinear regression analysis method, and an engine gas circuit state parameter prediction method based on a Process Neural Network (PNN) is adopted. Or using NeuroSolution6 software to realize a Radial Basis Function (RBF) neural network algorithm and establishing EGT, FF and N2 healthy baselines. Establishing a baseline model of aeroengine gas path parameters (EGT, FF and N2) by utilizing a Back Propagation (BP) neural network optimized by a genetic algorithm. Although the neural network has strong nonlinear fitting capability, the neural network has the defect that when the training sample set is small, divergence easily occurs.

2. A baseline modeling method based on Support Vector Regression (SVR). In recent years, the support vector regression has been studied as a data mining method by many scholars, and the defects of the neural network can be well avoided. The SVR algorithm has the advantages of high processing speed and accurate calculation when processing the nonlinear regression problem, and performs multi-parameter and single-parameter regression analysis. However, the algorithm based on the SVR still has the problems of sensitivity in model parameter and kernel function selection, and the like, and has a general effect on the problem of multi-classification.

3. The deep learning is used as a hotspot technology of machine learning, and has been successfully applied to the field of fault diagnosis by virtue of excellent feature extraction capability, the feature extraction is carried out on the real-time monitoring data and the historical data of the engine by utilizing the strong feature learning capability of the deep learning, and the classification and the diagnosis of the engine fault can be better completed by utilizing the classifier to classify the features, so that the deep confidence network-based feature extractor and the fault classification method have stronger generalization and practicability. However, if the number of layers of the neural network is too deep, the problem of overfitting is easy to occur, an extremely large data set needs to be used for training, and if a traditional shallow network is used, the problems of local minimum and overfitting are easy to occur, so that the generalization of the system is influenced.

In summary, each method has certain limitations, and the method closest to the third scheme of the invention is very easy to train and have poor generalization capability, but because the traditional neural network is used, the method provided by the invention uses a one-dimensional convolutional neural network to extract features according to the gas path monitoring data of the time sequence of the aircraft engine, integrates the variation trend of a plurality of monitoring parameters, and can obtain better multi-classification fault features. The method has the advantages that the shallow one-dimensional convolutional neural network is used for extracting features, the SVM and the softmax classifier are used for fusion learning of the extracted features, and the fault detection technology with better generalization effect and higher accuracy can be obtained under the condition that the high calculation cost of the deep neural network is avoided.

The noun explains:

one hot: one hot coding is a common label coding mode in multiple classifications, which is also called one-bit effective coding, and one N-dimensional vector is adopted to represent N states, where N represents the total number of the classifications in the multiple classifications, and when a classification label is i, the position of the i-th index in the N-dimensional vector is set to be 1, and the other positions are all 0 values.

Relu activation function: the Rectified linear Unit, modified linear units, is of the form

The method is a nonlinear activation function, the output of a part of neurons is 0, thus the sparsity of the network is caused, the interdependence relation of parameters is reduced, and the occurrence of the overfitting problem is relieved.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provide a CNN-based aeroengine fault diagnosis method. The method firstly uses the one-dimensional convolutional neural network to intelligently extract the characteristics of the aeroengine gas path parameters based on the time sequence, replaces the traditional artificial characteristic design, has better generalization and stability, and has low calculation cost due to the shallow convolutional neural network. And performing pooling operation on the result after the convolution to further extract features so as to obtain the feature size with a fixed scale, so that the technology can be applied to sequence data with different lengths and has good flexibility. And finally, classifying the sequence characteristics by combining a classifier and a back propagation algorithm, thereby accurately diagnosing the fault mode.

The purpose of the invention can be realized by the following technical scheme:

a CNN-based aeroengine fault diagnosis method comprises the following steps:

the method comprises the following steps of firstly, collecting operation data when an aircraft engine fails, classifying fault categories, labeling to construct a data set, and segmenting the data set to form a training set and a testing set;

preprocessing the data of the training set to finish data cleaning, and adopting min-max normalization to perform dimension removal on the data;

constructing a multi-classification model, wherein the multi-classification model comprises a shallow convolutional layer and a pooling layer, and is fused with a classifier;

taking the sampled gas path parameters as input, carrying out convolution, pooling and output layer, transmitting the output value to a classifier, optimizing a cross entropy loss function, and then continuing training;

step five, carrying out repeated iterative computation to preset times to obtain a trained model;

step six, carrying out the same data preprocessing on the test set for testing;

and step seven, inputting the running data of the engine into the trained model in real time to obtain the diagnosis result when the engine fails.

In the first step, data containing null values or abnormal values in the data set are removed, and interference data are removed; then, the data are segmented to ensure that the training set and the test set are distributed consistently; with 80% of the data as the training set and 20% as the test set.

In a further improvement, the second step includes the following steps:

2.1, counting the engine parameters, and sorting out practical data of each engine parameter to obtain a counting result; the prime number practical data comprises an actual range interval and an occurrence frequency; the engine parameters comprise torque, inter-turbine temperature, rotating speed of a low-pressure turbine compressor, rotating speed of a high-pressure turbine compressor, rotating speed of a propeller, outlet pressure of the high-pressure compressor, fuel flow, take-off height, flight speed Mach number and flight height;

2.2 counting outliers or dirty data by utilizing a box plot principle according to the statistical result;

2.3 removing outliers or dirty data to obtain a data set;

2.4, carrying out normalization processing on the data in the data set;

2.5 the data is subjected to a dispersion normalization process, and the original data is subjected to a linear transformation, so that the result is mapped between [0, 1], and the conversion function is as follows:

wherein x^*Representing the data after normalization, x representing the data before normalization, max being the maximum value of the sample data, and min being the minimum value of the sample data;

2.6 further sorting the normalized data set to form an engine parameter matrix:

wherein the fault label represents the fault type, X₀ ^mThe value represents the state value of the mth variable at time 0, and n represents the nth time.

In the third step, the building of the multi-classification model includes the following steps:

3.1 the data of n continuous time points are sampled in the engine gas path parameter matrix as the input of the model.

3.2 the true value of the output y of the model is coded by one hot, y represents the fault label corresponding to the input matrix, the true value dimension of y is consistent with the actual label, each element corresponds to the possible probability of one fault category, and the sum of all the probabilities is 1;

3.3 the model uses a one-dimensional convolution neural network for feature extraction, the one-dimensional convolution has two basic features: firstly, the data is a one-dimensional matrix; secondly, each line is arranged according to a time sequence and has a front-back incidence relation; the calculation formula for one-dimensional convolution is as follows:

wherein u is one-dimensional data with the sequence length of s, and u is used as the input of the model; each element in the one-dimensional data is a vector of fixed size; f (i, j) represents that the row index is i and the column index is j in the one-dimensional convolution kernel represents the parameter of the convolution kernel; i represents a row index of the one-dimensional convolution kernel, j represents a column index of the one-dimensional convolution kernel, u (i, j) represents an element which is represented by i and j in the input parameter u, b represents a bias parameter, and Conv1D (u) represents the output of the input parameter u after the one-dimensional convolution operation; sigma represents a Relu activation function, is used for increasing the nonlinear fitting capability of the neural network, overcomes the problem of gradient disappearance and accelerates the training speed;

the formula of the 3.4SVM classifier is as follows:

wherein L is_iRepresenting the value of the loss function obtained after the ith input matrix is subjected to model calculation,

y_iindicating the actual correct label, s_jA probability value representing a class j of an actual output of the model;

class y representing the actual prediction output of the model_iA probability value of (d); Δ represents a threshold value if

If the difference is equal to or higher than the threshold value, the correct category and the compared category are judged to be well distinguished, and a 0 loss value is given; if less than the threshold, the model is said to have poor classification between the correct class and the class being compared, and the difference between the class scores is determinedAdding a threshold value delta as a loss;

3.5Softmax classifier:

firstly, the softmax classifier normalizes the actual output of the model through the normalization function of the formula to ensure that each is positive and the sum of all classes is 1; after normalization, if a certain class value is larger and closer to 1, the model judges that the most possible class is the corresponding class, conversely, if the model judges that the correct class probability value is closer to 0, the model is worse, according to the characteristic, the loss value is taken as the-log value of the correct class, and the smaller the correct class probability is, the larger the loss is, as shown in the following formula:

wherein P(s) represents the probability of the actually predicted vector of the model after being subjected to softmax normalization, s represents the actually output vector of the model, k represents the kth fault category, e^sIndicating that the predicted probability value for a certain class is exponentially operated, M indicating the number of classes,

represents 0 or 1, is 1 if the actual class of sample i is consistent with c, otherwise is 0,

representing the probability that the type of the model actual prediction sample i is c; n represents the number of samples;

3.6 combining loss functions

Where i denotes the ith sample, N denotes the total number of samples, and α denotes L_svmLoss function stationThe factor occupied.

In a further improvement, the fourth step includes the following steps:

4.1 one-dimensional convolution process:

4.1.1 traversing a sliding window with the same size as the convolution kernel in the input features;

4.1.3 performing dot product operation on the convolution kernel and the corresponding characteristic matrix window in the previous step;

4.1.4 traversing the whole feature matrix to calculate the result of point multiplication for summation;

4.1.5 sending the result after summation into Relu activation function to increase the fitting capability of the nonlinear characteristic of the model;

4.1.6 after summing, reducing the dimension of the original feature matrix and extracting the original feature matrix into high-level features related to front and back time series;

4.1.7 returning the extracted high-level features;

4.2 the pooling process:

a pooling layer is added after the convolutional layer, so that the complexity of data is reduced, overfitting of the model is prevented, and maximum pooling or average pooling is selected according to final needs;

4.2.1 the global maximum pooling is to select the maximum eigenvalue from the eigenvalues as the output of the maximum pooling according to the size of the eigenvalue;

4.2.2 Global average pooling refers to selecting an average value group in the features as the output of the final average pooling layer according to the size of the feature value;

4.3 classifier: connecting all extracted features and sending output values to a classifier

4.3.1 converting the output value of the network into a vector;

4.3.2 replacing a full connection layer by adopting a global average pooling technology; or a full connection layer technology is used, and a dropout layer is used in a matching way;

4.4 dropout layer:

the Dropout layer randomly discards a part of input in the training process, and parameters corresponding to the input of the lost part cannot be updated at the moment so as to solve the problem of overfitting and reduce the problem of complex adaptation among neurons;

4.4.1 at first, randomly deleting half of hidden neurons in the network, and keeping the input and output neurons unchanged;

4.4.2 then propagating the input x forward through the modified network and then propagating the resulting loss result backward through the modified network; after the partial training samples are executed, updating corresponding parameters on the undeleted neurons according to a random gradient descent method;

4.4.3 repeat step 4.4.1 and step 4.4.2.

In a further improvement, the step five comprises the following steps:

5.1 obtaining the super parameter batch size through an experiment, namely the range of batch size;

5.2 using convolution kernels with different sizes in the convolution process;

and 5.3, carrying out grid search or hyper-parameter search of the network model on the number of convolution kernels and even the size of the convolution kernels according to a preset rule in a preset range.

Because the state m of the engine parameter is fixed at each time, the sizes of the convolution kernels can be selected from different combinations of (3, m), (5, m), (7, m), (9, m), and the like, and the number of the convolution kernels can be 64, 128, 256, and the like. And training the models according to different combinations so as to select the optimal combination mode. Setting the epoch of training to be 400, fixing the size of the batch size and the combination of convolution kernels in the process of training each epoch, performing back propagation to adjust the parameters of the model by using a gradient descent algorithm in the training of each batch, and performing repeated iteration to complete the model training. Thereby obtaining the optimal parameter to complete the parameter search.

Compared with the prior art, the invention has the following advantages and characteristics:

1. the method comprises the steps of firstly preprocessing data more reasonably, counting an effective range interval of a gas circuit parameter of the aircraft engine by using a statistical method, analyzing abnormal values of outliers by using a box diagram, and rejecting noise at the same time, so that the data are more reasonable; meanwhile, min-max normalization processing is carried out on the data, dimensions among different variables are removed, and the normalized data can accelerate the training speed of the model and the convergence of the model.

2. And thirdly, a more reasonable model architecture is constructed, the convolution neural network is used for extracting the gas path parameter characteristics, so that more characteristics which cannot be extracted by the traditional manual method can be extracted, a large amount of workload of manually selecting the characteristics is avoided, the automatic extraction of the characteristics is completely dependent on the extraction capability of the model, the extracted characteristics are further selected by adopting a pooling layer, and the model is simpler and lighter. Meanwhile, the dropout layer is used for further improving the generalization capability of the model and reducing the model to a great extent

Drawings

FIG. 1 is a process flow diagram of the present invention;

fig. 2 is a structural view of feature extraction.

Detailed Description

The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

The method firstly uses the one-dimensional convolutional neural network to intelligently extract the characteristics of the aeroengine gas path parameters based on the time sequence, replaces the traditional artificial characteristic design, has better generalization and stability, and has low calculation cost due to the shallow convolutional neural network. And performing pooling operation on the result after the convolution to further extract features so as to obtain the feature size with a fixed scale, so that the technology can be applied to sequence data with different lengths and has good flexibility. And finally, classifying the sequence characteristics by combining a classifier and a back propagation algorithm, thereby accurately diagnosing the fault mode. The specific technical scheme of the invention is as follows:

firstly, constructing a data set:

1.1 the data set is cut and shuffled, comprising 80% of training set and 20% of test set, which are used to verify the model effect.

1.2 the data set is classified and labeled according to the fault category of the aircraft engine, and the data containing null values or other abnormal values are removed, so that the accuracy can be obviously improved by eliminating interference data.

1.3 the data are divided to ensure that the training set and the testing set are distributed consistently.

Secondly, preprocessing the data of the training set to finish data cleaning, and adopting min-max normalization to perform dimensionless operation on the data

2.1, carrying out statistics on parameters such as torque, inter-turbine temperature, low-pressure turbine compressor rotating speed, high-pressure turbine compressor rotating speed, propeller rotating speed, high-pressure compressor outlet pressure, fuel flow, takeoff height, flight speed Mach number, flight height and the like, and sorting out practical data such as actual range intervals, appearance frequency and the like of each parameter;

and 2.3, rejecting partial data which does not meet the requirement.

2.4 because different evaluation indexes (parameters) often have different dimensions and dimension units, such a situation affects the result of data analysis, and in order to eliminate the dimension influence among the indexes, data standardization processing is required.

2.5 discrete normalization of the data, linear transformation of the original data, mapping the result between [0, 1], the transfer function is shown in FIG. 2 as follows:

FIG. 2 Min-Max Normalization (Min-Max Normalization)

Wherein max is the maximum value of the sample data, min is the minimum value of the sample data, and a reasonable value can be set according to an empirical value.

2.6 the normalized data set is further collated so that it is shown as follows:

first row X of the matrix₀ ⁰To X₀ ^mThe state values of m variables at 0 th time are represented, and the variable states from 0 th row to nth row are based on time series from 0 th to n th. The label corresponding to the whole matrix is a certain type of fault as a label.

And thirdly, constructing a classification model (multi-classification), which comprises a feature extractor and a classifier:

3.1 inputting a matrix shown in the above formula as a variable and a class label in the form of one hot code as a supervision signal;

3.2 using one hot coding for the true value of the model output y, wherein the dimension should be consistent with the actual label, each element corresponds to a possible probability of a fault category, and the sum of all the probabilities is 1;

3.3 the model uses a one-dimensional convolutional neural network, the one-dimensional convolution has two basic features: one is a matrix in which the data is one-dimensional, appearing to be 2-dimensional, with each row being a whole; secondly, each line in the graph is arranged according to a time sequence and has a certain pre-and post-association relation; the following figure is a calculation formula of one-dimensional convolution, wherein u is data of a certain length sequence dimension in the input:

u: one-dimensional data of sequence length s, where each element is also a vector of fixed size;

f (i, j) one-dimensional convolution kernel parameters;

σ: the Relu activating function increases the nonlinear fitting capability of the neural network, achieves a better fitting effect, can overcome the problem of gradient disappearance and accelerates the training speed.

The formula calculates the convolution kernel parameter and the corresponding state parameter matrix in a dot product mode according to a fixed convolution step length to obtain a corresponding convolution characteristic, and meanwhile, the original dimensionality can be compressed. And strictly moving the convolution kernel in a sliding window-like manner according to the step length until the state matrix is completely traversed, and obtaining the characteristic parameters of the input matrix, namely a characteristic set.

3.4SVM classifier: the formula is shown below

y_iindicating the actual correct label, s_jA score of class j representing the actual output of the model.

Delta denotes a threshold if above the threshold we consider the correct class to distinguish well from a certain class, we give a 0 penalty to distinguish between the two classes, on the contrary, we show that the model distinguishes the two classes very poorly, we add the difference between the class scores to the threshold as the penalty.

3.5Softmax classifier:

first, the softmax classifier normalizes the actual model output (which may be positive or negative) by the normalization function in the above equation, ensuring that each is positive and the sum of the classes is 1. After normalization, if a certain class value is larger and closer to 1, the model judges that the most possible class is the class, conversely, if the model judges that the correct class probability value is closer to 0, the model is worse, and according to the characteristic, the loss can be regarded as the-log value of the correct class (the smaller the correct class probability is, the larger the loss is), and the formula is as follows:

and fourthly, taking the sampled gas path parameters as input, and after convolution and pooling operations, transmitting an output value to a classifier, optimizing a loss function and continuing training. The specific process is shown in fig. 2:

4.1 one-dimensional convolution process:

4.1.6 after summing, reducing the dimension of the original feature matrix and extracting the original feature matrix into higher-level features related to the front and back time series;

4.1.7 return the high level features that have already been extracted.

4.2 the pooling process:

usually, a pooling layer is added after the convolutional layer to reduce the complexity of the data and prevent overfitting of the model, and the maximum pooling or average pooling can be selected according to the final requirement

4.2.1 the maximum pooling is that the maximum eigenvalue is selected from the adjacent eigenvalues according to the size of the pooling kernel as the output of the maximum pooling;

4.2.2 average pooling means that instead of being the maximum of the de-feature values, the average values in the features are selected to be the output of the final average pooling layer.

4.3 classifier:

connecting all extracted features and sending output values to a classifier

4.3.1 convert the output values of the network into a vector.

4.3.2 Global average pooling technique can be used instead of a fully-connected layer

4.3.3 if full connectivity layer technology is used, it is necessary to work with dropout layers because full connectivity is scale sensitive

4.4 dropout layer:

the Dropout layer randomly discards a part of input in the training process, and parameters corresponding to the input of the lost part cannot be updated at the moment, so that the overfitting problem can be solved to a great extent, and the complex fitting problem among neurons is reduced.

4.4.1 at first, half of the hidden neurons in the network are deleted randomly (temporarily according to a certain probability), and the input and output neurons are kept unchanged.

4.4.2 then propagate the input x forward through the modified network and then propagate the resulting loss results back through the modified network. After a small batch of training samples finishes the process, the corresponding parameters are updated on the neurons which are not deleted according to a random gradient descent method.

4.4.3 this process is repeated continuously.

And fifthly, obtaining a trained model through multiple iterative computations:

5.1 there is an important over-parameter batch size in random gradient descent training, the size of which has a great influence on the whole model training. Larger batch sizes will calculate more accurate gradient estimates because the more data is used per parameter update, the more representative the gradient of the global loss function and therefore the more accurate the gradient, but it may be that the network falls into a local minimum. And if the data volume is too large, the data loaded into the GPU video memory at one time can be too much bottleneck, and if the batch size is too small, the model can be not converged, so that the batch size needs to be increased within a reasonable range.

5.2 convolution process can use convolution kernel combination mode of different sizes, because the bigger the size of convolution kernel, the bigger the corresponding receptive field will be, the more the characteristics that can be learned the learning ability is stronger, but also increase the parameter quantity of model, increase the training difficulty of model. On the contrary, if the size of the convolution kernel is too small, the receptive field is too small, and the learning capability of the model may be limited, because the mode of combining convolution kernels with different sizes can be considered, different receptive fields can be considered, the learning capability of the model can be increased, and model parameters can not be greatly increased.

5.3 the number of the one-dimensional convolution kernels is also an important hyper-parameter, because the model finally adopts a global average pooling layer, one convolution kernel corresponds to a high-level feature, if the number of the convolution kernels is too small, the number of the finally extracted features is too small, and further, the learning capability of the model is possibly too poor, if the number of the convolution kernels is too large, the model becomes too complex, the training time is too long, and the like.

And sixthly, carrying out the test by the same pretreatment of the open test set.

The data set used by the invention is the gas circuit parameters collected by the aeroengine sensor, which comprises the gas circuit parameters when various faults occur and the gas circuit parameters under normal conditions, and the data is collected according to a time sequence, the invention uses the characteristic that the convolutional neural network can fully excavate the front and back change among the gas circuit parameters, compared with the traditional method for modeling discrete data (analyzing the data at a specific moment), the invention not only considers the change of the specific values of the gas circuit parameters at different moments, but also further considers the trend characteristic of the parameter change at continuous moments and the front and back relation, as the CNN has certain translation invariance, the generalization capability is better, more comprehensive, higher and more complex characteristics can be obtained, and then the invention provides a novel loss function, and the method is used for evaluating the classification result of the model so as to realize the diagnosis of the fault.

Claims

1. A CNN-based aeroengine fault diagnosis method is characterized by comprising the following steps:

the method comprises the following steps of firstly, collecting operation data when an aircraft engine fails, classifying fault categories, labeling to construct a data set, and segmenting the data set to form a training set and a test set;

step six, carrying out the same data preprocessing on the test set for testing;

2. The CNN-based aeroengine fault diagnosis method of claim 1, wherein in the first step, data including null values or abnormal values in the data set are removed, and interference data are removed; then, the data are segmented to ensure that the training set and the test set are distributed consistently; with 80% of the data as the training set and 20% as the test set.

3. The CNN-based aircraft engine fault diagnosis method according to claim 1, wherein the second step comprises the following steps:

2.1, counting the engine parameters, and sorting out practical data of each engine parameter to obtain a statistical result; the prime number practical data comprises an actual range interval and an occurrence frequency; the engine parameters comprise torque, inter-turbine temperature, rotating speed of a low-pressure turbine compressor, rotating speed of a high-pressure turbine compressor, rotating speed of a propeller, outlet pressure of the high-pressure compressor, fuel flow, take-off height, flight speed Mach number and flight height;

2.3 removing outliers or dirty data to obtain a data set;

2.4, carrying out normalization processing on the data in the data set;

2.6 further sorting the normalized data set to form an engine parameter matrix:

4. The CNN-based aircraft engine fault diagnosis method according to claim 3, wherein in the third step, the building of the multi-classification model comprises the following steps:

3.3 the model uses a one-dimensional convolution neural network for feature extraction, the one-dimensional convolution has two basic features: firstly, the data is a one-dimensional matrix; secondly, each line is arranged according to a time sequence and has a front-back incidence relation; the calculation formula of the one-dimensional convolution is as follows:

the formula of the 3.4SVM classifier is as follows:

If the value is equal to or higher than the threshold value, the classification is judged to be correct and the classification is comparedThe classification is well differentiated, and a 0 loss value is given; if the value is less than the threshold value, the model distinguishes the correct category from the compared categories badly, and the difference of the category scores is added with a threshold value delta to be used as loss;

3.5Softmax classifier:

wherein P(s) represents the probability of the actually predicted vector of the model after being subjected to softmax normalization, s represents the actually output vector of the model, k represents the kth fault category, and e^sIndicating that the predicted probability value for a certain class is exponentially operated, M indicating the number of classes,

3.6 combining loss functions

Where i denotes the ith sample, N denotes the total number of samples, and α denotes L_svmThe factor occupied by the loss function.

5. The CNN-based aircraft engine fault diagnosis method according to claim 1, wherein the fourth step comprises the steps of:

4.1 one-dimensional convolution process:

4.1.6 after summing, reducing the dimension of the original feature matrix and extracting the original feature matrix into high-level features related to the front and back time series;

4.1.7 returning the extracted high-level features;

4.2 the pooling process:

4.3.1 converting the output value of the network into a vector;

4.4 dropout layer:

4.4.3 repeat step 4.4.1 and step 4.4.2.

6. The CNN-based aircraft engine fault diagnosis method according to claim 1, wherein the fifth step includes the steps of:

5.2 using convolution kernels with different sizes in the convolution process;