Disclosure of Invention
The invention aims to provide a method for diagnosing faults of sensors of an aircraft system that precisely locates equipment fault points and reduces errors to the maximum extent.
In order to achieve the above object, the embodiments of the present invention provide the following technical solutions:
a method for diagnosing aircraft system sensor faults, comprising the steps of:
performing sample and feature processing on the acquired sensor data to be used as a training set for training a fault diagnosis model;
training a fault diagnosis model by using a method of a decision tree, a random forest or a deep neural network according to the training set to construct the fault diagnosis model;
using sensor data which is not subjected to sample and feature processing as a test set, and verifying the constructed fault diagnosis model;
and after the fault diagnosis model is verified, inputting the newly acquired sensor data into the fault diagnosis model to obtain a diagnosis result.
In this scheme, real sensor data are used as the training set; when the fault diagnosis model is constructed, the decision tree, random forest, or deep neural network method can be applied to a training set with a huge data volume while reducing the difficulty of manual labeling. After the fault diagnosis model is built, it is evaluated with a test set to verify whether it can accurately output the fault result.
The step of performing sample and feature processing on the acquired sensor data as a training set for training a fault diagnosis model includes:
injecting faults of different performances into the equipment whose data is collected by a sensor, using the sensor to collect the data of the equipment under each performance fault as a training set C, and taking the data collected under each performance fault as a training subset C_1, C_2, ..., C_N, where N is the number of equipment performance faults;
wherein the data of each performance fault further includes a plurality of condition data, and one training subset is C_i = {a_1^i, a_2^i, ..., a_M^i}, where C_i is the ith training subset, a is the condition data, and M is the number of condition data.
According to the training set, the step of training the fault diagnosis model by using a deep neural network method comprises the following steps:
carrying out DNN forward propagation calculation and DNN backward propagation calculation through a deep neural network layer;
the deep neural network layer comprises an input layer, a hidden layer and an output layer, wherein the hidden layer is an intermediate layer and comprises a plurality of layers;
performing DNN forward propagation calculation:

a_j^i = σ(z_j^i) = σ(Σ_{k=1}^{m} w_jk^i · a_k^{i-1} + b_j^i)

wherein w_jk^i is a linear relation coefficient, representing the linear coefficient from the kth neuron of the (i-1)th layer to the jth neuron of the ith layer; b_j^i is the bias of the jth neuron of the ith layer; σ is the activation function; and a_j^i is the output value calculated by forward propagation for the jth neuron of the ith layer, where there are m neurons in total at the (i-1)th layer;

the output value of the ith layer is represented using a matrix method:

a^i = σ(z^i) = σ(W^i · a^{i-1} + b^i)

wherein, with m neurons at the (i-1)th layer and n neurons at the ith layer, the linear coefficients w of the ith layer form an n × m matrix W^i, the biases b of the ith layer form an n × 1 vector b^i, the output a of the (i-1)th layer forms an m × 1 vector a^{i-1}, the linear output z of the ith layer before activation forms an n × 1 vector z^i, and the output a of the ith layer forms an n × 1 vector a^i;
performing DNN back propagation calculation:
Input: the total number of layers L, the number of neurons of each hidden layer and the output layer, the activation function, the loss function, the iteration step size β, the maximum number of iterations MAX, the stop-iteration threshold ε, and m input training subsets C_1, C_2, ..., C_m;
Output: the linear relation coefficient matrix W and bias vector b of each hidden layer and the output layer.
The step of verifying the constructed fault diagnosis model by using the sensor data which is not subjected to sample and feature processing as a test set comprises the following steps:
the sensor data not subjected to sample and feature processing is: data of the equipment collected by a sensor under arbitrary conditions, used as a test set; for the collected test set, it is unknown whether the equipment performance has a fault, and unknown what fault the equipment performance has; Z = {b_1, b_2, ..., b_n}, where b is data of the equipment acquired by the sensor under an arbitrary condition and n is the number of data acquired by the sensor;
and inputting the data of the test set into a fault diagnosis model, and judging whether the result output by the fault diagnosis model is consistent with the original equipment performance fault of the data.
Compared with the prior art, the invention has the beneficial effects that:
(1) The training set of the invention contains data of various performance faults of the equipment, so the trained fault diagnosis model can identify which performance of the equipment has failed and can precisely locate the fault point.
(2) Real sensor data are used as the training set; when the fault diagnosis model is constructed, the decision tree, random forest, or deep neural network method can be applied to a training set with a huge data volume while reducing the difficulty of manual labeling. After the fault diagnosis model is built, it is evaluated with a test set to verify whether it can accurately output the fault result.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.
The invention is realized by the following technical scheme, as shown in fig. 1, a method for diagnosing the faults of the sensors of the aircraft system comprises the following steps:
step S1: and carrying out sample and feature processing on the acquired sensor data to be used as a training set for training a fault diagnosis model.
Usually, a sensor on the aircraft is used to acquire data of a fixed device: for example, a rate gyroscope acquires the angular rate of the vehicle, an accelerometer acquires the acceleration of the vehicle, a fuel sensor acquires the amount of fuel in the fuel tank, and a pitch lever sensor acquires the operation data of the pitch lever.
However, one device often suffers from different performance failures, such as a possible fuel deficiency in the fuel tank, a possible fuel tank leak, and a possible fuel sensor power failure; for another example, the failure of the operation of the pitch lever may be a failure of a broken pitch lever, a failure of a power supply of a sensor of the pitch lever, or the like.
Therefore, different performance failures occur for a device. They can be roughly divided into electrical performance failures and mechanical performance failures: electrical performance failures involve voltage, current, power, temperature, and the like, and mechanical performance failures involve breakage, jamming, and the like. However, when the sensor collects data, even if the collected data is abnormal, it cannot be directly known which performance of the device has failed.
According to the scheme, faults of different performances are first actively injected into the equipment whose data is collected by the sensor, the data of the equipment under each performance fault are collected by the sensor, and the data under each performance fault are used as training subsets C_1, C_2, ..., C_N, where N is the number of device performance failures.
For example, when the C-redundancy pitch rod fails, it may be caused by a disconnection fault of the C-redundancy pitch rod or a power supply fault of the sensor corresponding to the C-redundancy pitch rod (for the moment, only these two cases are discussed). A disconnection fault is injected into the C-redundancy pitch rod, and the data of the C-redundancy pitch rod during the disconnection performance fault are collected by the sensor as training subset C_1. A sensor power supply fault is then injected into the C-redundancy pitch rod, and the data during the sensor power supply performance fault are collected as training subset C_2.
Training subset C_1 and training subset C_2 are then labeled with their performance faults separately: for example, training subset C_1 is labeled 'C-redundancy pitch rod disconnection fault' and training subset C_2 is labeled 'C-redundancy pitch rod sensor power supply fault'.
When collecting the data of a training subset, several pieces of data of the equipment in different states need to be collected; the collected data are called condition data. For example, when collecting the data of the disconnection performance fault of the C-redundancy pitch rod, the pitch rod is adjusted to 9 different gear values: -20, -15, -10, -5, 0, 5, 10, 15, 20, so that 9 sets of data acquired by the sensor are obtained: C_1 = {a_1^1(-20), a_2^1(-15), a_3^1(-10), a_4^1(-5), a_5^1(0), a_6^1(5), a_7^1(10), a_8^1(15), a_9^1(20)}.
Similarly, when collecting the data of the sensor power supply performance fault, the pitch rod is adjusted to the same 9 gear positions, giving 9 sets of data acquired by the sensor: C_2 = {a_1^2(-20), a_2^2(-15), a_3^2(-10), a_4^2(-5), a_5^2(0), a_6^2(5), a_7^2(10), a_8^2(15), a_9^2(20)}.
Each piece of data in training subset C_1 and training subset C_2 is condition data; the two training subsets are used as the training set, 18 pieces of condition data in total. Each piece of condition data is also labeled with a condition label; for example, condition data a_1^1(-20) is labeled with the condition label '-20'. Thus, for each piece of data in the training set, its specific performance fault and the current state of the device are known.
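As a concrete illustration, the 18 labeled condition data of the pitch-rod example can be sketched in Python. The record layout and the placeholder sensor readings are assumptions for illustration only; the gear values and fault labels come from the example above.

```python
# Illustrative sketch of the labeled training set from the pitch-rod example.
# The dict keys and the placeholder readings are assumptions, not part of the method.

GEARS = [-20, -15, -10, -5, 0, 5, 10, 15, 20]  # the 9 gear values

def build_subset(fault_label, readings):
    """Pair each condition (gear value) with a reading, a condition label,
    and the performance-fault label of the whole subset."""
    return [{"condition": g, "reading": r, "fault": fault_label}
            for g, r in zip(GEARS, readings)]

# Hypothetical readings collected under each injected fault.
C1 = build_subset("C-redundancy pitch rod disconnection fault", [0.0] * 9)
C2 = build_subset("C-redundancy pitch rod sensor power supply fault", [None] * 9)

training_set = C1 + C2  # 2 subsets x 9 condition data = 18 labeled samples
```

Every sample thus carries both a condition label (the gear value) and a performance-fault label, as required for training.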
Step S2: and training the fault diagnosis model by using a method of a decision tree, a random forest or a deep neural network according to the training set so as to construct the fault diagnosis model.
As an implementable mode, the fault diagnosis model is trained and constructed using a decision tree. The condition data in the training set C (comprising training subset C_1 and training subset C_2) are used as leaf nodes of the decision tree, and training subset C_1 and training subset C_2 are each used as a root node. The training set is segmented by recursive optimal feature selection, so that each piece of condition data has an optimal classification result.
Because each condition data is labeled with a condition label, each condition data can be correctly classified into a corresponding training subset after the decision tree training is carried out. And repeating the steps until all the condition data in the training set are correctly classified, namely, all the condition data are finally segmented into corresponding root nodes, so that a decision tree is generated, and the training of the fault diagnosis model is completed. And inputting the test set into a decision tree to complete the construction of the fault diagnosis model. In the present embodiment, the condition tags are used to classify the condition data, and therefore the condition tags are selected features.
However, when the data volume of the training set is very large, labeling the condition data one by one increases the workload. Therefore, when selecting features for classification, the selection criterion may be the information gain, the information gain ratio, or the Gini index.
(I) When features are selected by the information gain method, the condition data are taken as a random variable X with the probability distribution:

P(X = x_i) = p_i,  i = 1, 2, ..., n

wherein x_i is the ith condition datum, n is the number of condition data, and p_i is the probability of the ith condition datum.
The entropy of the random variable X is then:

H(X) = -Σ_{i=1}^{n} p_i log p_i
Entropy is a measure of the uncertainty of a random variable: the larger the entropy value, the larger the uncertainty. From the entropy of each random variable X, the joint entropy of several random variables can be obtained; for example, the joint entropy of the random variables X and Y is:

H(X, Y) = -Σ_{x,y} p(x, y) log p(x, y)
After the joint entropy is obtained, the expression of the conditional entropy follows:

H(X|Y) = H(X, Y) - H(Y) = -Σ_{x,y} p(x, y) log p(x|y)
The conditional entropy measures the uncertainty of the random variable X that remains after the random variable Y is known, so the information gain represents the degree to which knowing the information of feature Y reduces the uncertainty of feature X. Assuming A is a certain feature in the training set C, the information gain of feature A with respect to the training set C is expressed as:

g(C, A) = H(C) - H(C|A)
H(C) represents the uncertainty of classifying the training set C, and H(C|A) represents the uncertainty of classifying the training set C given the feature A; their difference, the information gain g(C, A), represents the degree to which the uncertainty of classifying the training set C is reduced by the given feature A. The larger the information gain, the stronger the classification capability of the feature, so the feature with the larger information gain can be selected as the classification feature.
The method of selecting features by the information gain criterion is to compute the information gain of each feature and select the feature with the largest information gain for classification. Suppose the training set is C, |C| is its sample capacity, and there are K classes D_k, k = 1, 2, ..., K, with |D_k| the number of samples of class D_k.
Feature A has n different values {a_1, a_2, ..., a_n}; according to feature A, the training set C is divided into n training subsets C_1, C_2, ..., C_i, ..., C_n, where |C_i| is the number of samples taking the ith value of feature A. Let the subset of C_i belonging to class D_k be C_ik, i.e. C_ik = C_i ∩ D_k, with |C_ik| the number of samples of C_ik. The information gain algorithm is as follows:
1. Input the training set C and the feature A, and calculate the entropy H(C):

H(C) = -Σ_{k=1}^{K} (|D_k|/|C|) log2(|D_k|/|C|)

2. Calculate the conditional entropy H(C|A):

H(C|A) = Σ_{i=1}^{n} (|C_i|/|C|) H(C_i) = -Σ_{i=1}^{n} (|C_i|/|C|) Σ_{k=1}^{K} (|C_ik|/|C_i|) log2(|C_ik|/|C_i|)

3. Calculate the information gain g(C, A):

g(C, A) = H(C) - H(C|A)
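The entropy and information-gain computation above can be sketched directly in Python; the function names and the dict-of-samples representation are illustrative assumptions, while the formulas follow the algorithm just described.

```python
import math
from collections import Counter

def entropy(labels):
    """H(C) = -sum over classes of (|D_k|/|C|) * log2(|D_k|/|C|)."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(samples, feature, label):
    """g(C, A) = H(C) - H(C|A), with samples as a list of dicts.
    H(C|A) weights the entropy of each subset C_i by |C_i|/|C|."""
    base = entropy([s[label] for s in samples])
    cond = 0.0
    for v in {s[feature] for s in samples}:          # each value a_i of feature A
        part = [s[label] for s in samples if s[feature] == v]
        cond += (len(part) / len(samples)) * entropy(part)
    return base - cond
```

For a feature that perfectly separates two balanced classes, the gain equals the full entropy H(C) = 1 bit, matching the intuition that such a feature removes all classification uncertainty.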
(II) When features are selected by the information gain ratio, the adverse effect of the information gain being biased toward features with many values as a division basis can be avoided.
The information gain ratio g_R(C, A) of feature A with respect to the training set C is defined as the ratio of its information gain g(C, A) to the entropy H_A(C) of the training set C with respect to feature A:

g_R(C, A) = g(C, A) / H_A(C)

The feature entropy H_A(C) is expressed as:

H_A(C) = -Σ_{i=1}^{n} (|C_i|/|C|) log2(|C_i|/|C|)

wherein n is the number of values of feature A, |C_i| is the number of samples taking the ith value, and |C| is the sample capacity.
(III) When features are selected by the Gini coefficient, suppose there are K classes and the probability of the kth class is p_k; then the Gini coefficient is expressed as:

Gini(p) = Σ_{k=1}^{K} p_k (1 - p_k) = 1 - Σ_{k=1}^{K} p_k^2

The larger the Gini coefficient, the larger the uncertainty of the training set. For the training set C, if it is divided into training subsets C_1 and C_2 according to a certain value a of feature A, then under the condition of feature A the Gini coefficient of the training set C is expressed as:

Gini(C, A) = (|C_1|/|C|) Gini(C_1) + (|C_2|/|C|) Gini(C_2)
in conclusion, features are selected in the mode of information gain, information gain ratio or a kini coefficient, so that the classification method of the decision tree is generated, and is suitable for processing samples with missing attributes, for example, when data in a training set C has attribute missing; the method is suitable for processing mass data, for example, when the data volume in the training set C is huge, feasible and reliable results can be made for a large data source in a relatively short time; the method is suitable for the cases of which the classification details need to be visually displayed and has strong interpretability.
As another implementable mode, a random forest is used to train and construct the fault diagnosis model. The random forest is an ensemble algorithm: by combining multiple weak classifiers and voting on the final result, the overall fault diagnosis model achieves high accuracy and generalization capability.
The random forest uses decision trees whose features are selected by the Gini coefficient as weak classifiers, and improves the building of the decision tree: an ordinary decision tree selects an optimal feature from all n sample features to divide the left and right subtrees.
A random forest instead selects a portion of the sample features, n_sub (n_sub < n), and chooses an optimal feature among them to divide the left and right subtrees of the decision tree, which further enhances the generalization capability of the constructed fault diagnosis model; the smaller n_sub, the more robust the fault diagnosis model. The random forest algorithm is as follows:
1. Input the training set C and the number of classifier iterations T; for t = 1, 2, ..., T, randomly sample the training set C with replacement to obtain a sampling set C_t.
2. Use the sampling set C_t to train the tth decision tree model G_t(x). When training the nodes of the decision tree model, select a portion of the sample features from all the sample features at each node, and choose an optimal feature among them to divide the left and right subtrees of the decision tree.
3. The class receiving the largest number of votes over the T iterations is taken as the final class of the data in the training set; if two or more classes tie for the largest number of votes, one of them is selected as the final class.
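The bootstrap-and-vote skeleton of the three steps above can be sketched as follows. For brevity the weak classifier is a stub that predicts the majority class of its sampling set; a real implementation would grow a Gini-based decision tree at that point, and all names here are illustrative assumptions.

```python
import random
from collections import Counter

def bootstrap(training_set, rng):
    """Step 1: draw a sampling set C_t of the same size, with replacement."""
    return [rng.choice(training_set) for _ in training_set]

def majority_vote(labels):
    """Step 3: the class with the most votes wins (ties broken by count order)."""
    return Counter(labels).most_common(1)[0][0]

def train_weak_classifier(sampling_set):
    """A deliberately weak G_t(x): always predict the majority class of C_t.
    (A real random forest would train a Gini-criterion decision tree here.)"""
    majority = majority_vote([label for _, label in sampling_set])
    return lambda x: majority

def random_forest_predict(training_set, x, T=5, seed=0):
    """Train T weak classifiers on bootstrap samples and vote on the input x."""
    rng = random.Random(seed)
    classifiers = [train_weak_classifier(bootstrap(training_set, rng))
                   for _ in range(T)]
    return majority_vote([g(x) for g in classifiers])
```

The design choice to sample with replacement means each weak classifier sees a slightly different view of the training set, which is what makes the vote more robust than any single tree.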
A random forest can thus classify the data. Its application conditions include those of the decision tree, and it is additionally suitable when no feature selection is made, when no generalization processing is made, and when the weak classifiers need to be processed in parallel.
Whether a decision tree or a random forest is used, a fault diagnosis model can be constructed from the training set and test set prepared in advance. After the fault diagnosis model is constructed, however, it still needs to be evaluated to guarantee its accuracy in use and to ensure that no errors occurred during its construction.
As another implementable mode, the fault diagnosis model is trained by a deep neural network method; if the output of the fault diagnosis model has errors, the model learns repeatedly to reduce or eliminate them.
The deep neural network (DNN) is a multi-layer feedforward neural network trained by the error back propagation algorithm and is currently the most widely applied neural network. Training the fault diagnosis model consists of two parts: forward propagation of the signal and backward propagation of the error.
In DNN forward propagation, an input sample enters at the input layer of the fault diagnosis model, is processed layer by layer through the hidden layers, and is passed to the output layer. If the output of the output layer differs from the expectation, the error is passed back layer by layer as an adjustment signal, and the connection weight matrices between neurons are adjusted so that the error decreases. Through repeated learning, the error finally falls within an acceptable range.
The deep neural network layers can be divided into three types: input layer, hidden layer, and output layer. Referring to fig. 2, the first layer is the input layer, the middle layers are hidden layers, and the last layer is the output layer. The layers are fully connected: any neuron of the ith layer is connected to every neuron of the (i+1)th layer.
In defining the linear relation coefficient w, referring to fig. 3, for example w_24^3 represents the linear coefficient from the 4th neuron of the second layer to the 2nd neuron of the third layer: the superscript 3 corresponds to the layer number, and the subscripts correspond to the output index 2 of the third layer and the input index 4 of the second layer. The linear coefficient from the kth neuron of the (i-1)th layer to the jth neuron of the ith layer is defined as w_jk^i.
In defining the bias b, referring to fig. 4, for example b_3^2 indicates the bias of the third neuron of the second layer: the superscript 2 represents the layer in which it is located, and the subscript 3 represents the index of the neuron in which it is located. Likewise, the bias of the first neuron of the third layer is expressed as b_1^3. The bias of the jth neuron of the ith layer is defined as b_j^i.
In carrying out the DNN forward propagation algorithm, the activation function is σ and the output values of the hidden layers and the output layer are a; the output of the next layer is calculated from the output of the previous layer. Referring to fig. 5, for example, for the outputs a_1^2, a_2^2, a_3^2 of the second layer (the superscript of a represents the layer number, the subscript represents the neuron index, and x represents the neurons of the input layer):

a_j^2 = σ(z_j^2) = σ(Σ_k w_jk^2 · x_k + b_j^2),  j = 1, 2, 3

Assuming that there are m neurons in the (i-1)th layer, the output a_j^i of the jth neuron of the ith layer is:

a_j^i = σ(z_j^i) = σ(Σ_{k=1}^{m} w_jk^i · a_k^{i-1} + b_j^i)
if the output of the representation of each element by using an algebraic method is complex, the matrix method is simple to use. Assuming that there are m neurons in the i-1 th layer and n neurons in the i-th layer, the linear coefficients w of the i-th layer form an n × m matrix
The offset b of the ith layer constitutes an
n x 1 vector
The output a of the i-1 th layer constitutes an
m x 1 vector
. The output of the ith layer is represented by a matrix method as:
the forward propagation of DNN is to use several weight coefficient matrixes W, bias vectors b and input value vectors x to perform a series of linear operations and activation operations, starting from an input layer, calculating backwards layer by layer until an output layer is operated to obtain an output result.
Thus, DNN forward propagation can be summarized as:
Input: the total number of layers L, the matrices W and bias vectors b corresponding to all hidden layers and the output layer, and the input value vector x;
Output: the output a^L of the output layer.
The method comprises the following steps:
1. Initialize a^1 = x;
2. For i = 2 to L, calculate:

a^i = σ(z^i) = σ(W^i · a^{i-1} + b^i)

The final result is the output a^L.
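The forward propagation summarized above can be sketched in plain Python. This is a minimal illustration: the sigmoid activation and the nested-list representation of W and b are assumptions, not mandated by the method.

```python
import math

def sigmoid(z):
    """An example activation function sigma (an assumption for illustration)."""
    return 1.0 / (1.0 + math.exp(-z))

def layer_forward(W, b, a_prev):
    """a^i = sigma(W^i a^{i-1} + b^i), with W as a list of rows (n x m)."""
    return [sigmoid(sum(w_jk * a_k for w_jk, a_k in zip(row, a_prev)) + b_j)
            for row, b_j in zip(W, b)]

def dnn_forward(weights, biases, x):
    """Step 1: a^1 = x. Step 2: for i = 2..L, propagate layer by layer."""
    a = x
    for W, b in zip(weights, biases):
        a = layer_forward(W, b, a)
    return a  # a^L, the output of the output layer
```

With all weights and biases zero, each layer outputs sigmoid(0) = 0.5 regardless of the input, which is a convenient sanity check of the propagation order.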
During DNN back propagation, if errors exist, the errors are passed back layer by layer as adjustment signals, and the connection weight matrices between neurons are adjusted. The back propagation can be summarized as:
Input: the total number of layers L, the number of neurons of each hidden layer and the output layer, the activation function, the loss function, the iteration step size β, the maximum number of iterations MAX, the stop-iteration threshold ε, and m input training subsets C_1, C_2, ..., C_m;
Output: the linear relation coefficient matrix W and bias vector b of each hidden layer and the output layer.
The method comprises the following steps:
1. Initialize the linear relation coefficient matrix W and the bias vector b of each hidden layer and the output layer to random values;
2. For iter = 1 to MAX:
2-1. For i = 1 to m:
2-1a. Set the DNN input a^{i,1} to x_i;
2-1b. For l = 2 to L, calculate by the forward propagation algorithm: a^{i,l} = σ(z^{i,l}) = σ(W^l · a^{i,l-1} + b^l);
2-1c. Calculate the output-layer gradient δ^{i,L} from the loss function;
2-1d. For l = L-1 to 2, calculate by the back propagation algorithm: δ^{i,l} = (W^{l+1})^T δ^{i,l+1} ⊙ σ'(z^{i,l});
2-2. For l = 2 to L, update the linear relation coefficient matrix W^l and bias vector b^l of the lth layer:

W^l = W^l - β Σ_{i=1}^{m} δ^{i,l} (a^{i,l-1})^T
b^l = b^l - β Σ_{i=1}^{m} δ^{i,l}

2-3. If all changes of W and b are smaller than the stop-iteration threshold ε, jump out of the iteration loop to the next step;
3. Output the linear relation coefficient matrix W and the bias vector b of each hidden layer and the output layer.
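The training loop above can be illustrated for the smallest possible case: a single sigmoid neuron with squared loss. The choice of loss, activation, and parameter names are assumptions for illustration; the structure mirrors steps 2-1 through 2-3 (forward pass, gradient, update, stop threshold).

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_single_neuron(samples, beta=0.5, max_iter=200, eps=1e-6):
    """Gradient descent on one sigmoid neuron with squared loss.
    delta = (a - y) * a * (1 - a) is the output-layer gradient (step 2-1c);
    the summed updates mirror step 2-2; eps mirrors the stop threshold 2-3."""
    w, b = 0.0, 0.0
    for _ in range(max_iter):
        dw = db = 0.0
        for x, y in samples:                    # loop over the m samples (2-1)
            a = sigmoid(w * x + b)              # forward pass (2-1b)
            delta = (a - y) * a * (1.0 - a)     # output-layer gradient (2-1c)
            dw += delta * x
            db += delta
        w, b = w - beta * dw, b - beta * db     # update step (2-2)
        if abs(beta * dw) < eps and abs(beta * db) < eps:  # threshold (2-3)
            break
    return w, b

def loss(samples, w, b):
    """Squared loss J = (1/2) * sum of (a - y)^2 over the samples."""
    return sum((sigmoid(w * x + b) - y) ** 2 for x, y in samples) / 2.0
```

On the toy data {(-1, 0), (1, 1)} the loss starts at 0.25 (both outputs stuck at 0.5) and drops as the weight grows toward separating the two targets.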
The deep neural network DNN can realize data classification, is particularly suitable for the condition of discovering the nonlinear relation between model input and model output, can learn and store a large number of input-output mode mapping relations without a mathematical equation for describing the mapping relations in advance, and is a good choice for training a fault diagnosis model.
Step S3: and using the sensor data which is not subjected to sample and characteristic processing as a test set to verify the constructed fault diagnosis model.
A sensor is used to collect data of the equipment under arbitrary conditions as a test set; for the collected test set, it is unknown whether the equipment performance has a fault, and unknown what fault the equipment performance has. Z = {b_1, b_2, ..., b_n}, where b is data of the equipment acquired by the sensor under an arbitrary condition and n is the number of data acquired by the sensor.
For example, the C-redundancy pitch rod is now in an arbitrary situation, and it is unknown whether its performance is faulty or, specifically, what kind of fault it has. The pitch rod is again adjusted to the 9 different gear positions, so that 9 sets of data acquired by the sensor are obtained as the test set: Z = {b_1(-20), b_2(-15), b_3(-10), b_4(-5), b_5(0), b_6(5), b_7(10), b_8(15), b_9(20)}.
Since it is unknown whether the equipment corresponding to the data in the test set has a performance failure, and also unknown what specific performance failure has occurred, the data in the test set are not labeled; they are random data.
And inputting the data of the test set into a fault diagnosis model, and judging whether the result output by the fault diagnosis model is consistent with the original equipment performance fault of the data.
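The verification step can be sketched as a simple accuracy check, under the assumption (left implicit above) that the true performance fault of each test sample is known to the evaluator for comparison; the function and parameter names are illustrative.

```python
def verify_model(model, test_set):
    """Compare the model's output with the true performance fault of each
    test sample. `model` is any callable mapping sensor data to a fault
    label (an assumption for illustration); returns the fraction correct."""
    correct = sum(1 for data, true_fault in test_set
                  if model(data) == true_fault)
    return correct / len(test_set)
```

A verification accuracy close to 1.0 indicates the constructed fault diagnosis model outputs results consistent with the original equipment performance faults.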
Step S4: and after the fault diagnosis model is verified, inputting the newly acquired sensor data into the fault diagnosis model to obtain a diagnosis result.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.