CN111830408B

CN111830408B - Motor fault diagnosis system and method based on edge calculation and deep learning

Info

Publication number: CN111830408B
Application number: CN202010581758.4A
Authority: CN
Inventors: 张品佳; 吴志良; 袁巍
Original assignee: Lonston Technology Beijing Co ltd
Current assignee: Lonston Technology Beijing Co ltd
Priority date: 2020-06-23
Filing date: 2020-06-23
Publication date: 2023-04-18
Anticipated expiration: 2040-06-23
Also published as: CN111830408A

Abstract

The invention relates to a motor fault diagnosis system and method based on edge calculation and deep learning, wherein edge side equipment can configure an acquisition device, perform Empirical Mode Decomposition (EMD) after preprocessing vibration or current signals, input n IMF components by a deep residual error network diagnosis model built in a diagnosis module to perform fault diagnosis, and upload a diagnosis result and fault data to a cloud service center; the cloud service center utilizes the historical data and the fault data to regularly train the deep residual error network diagnosis model, sends the deep residual error network diagnosis model to each edge side device after training, and updates the deep residual error network diagnosis model in the edge side device. The motor equipment can be diagnosed in real time by computing the edge side equipment, the motor state can be diagnosed quickly in time, and processed data are transmitted to the cloud center, so that the time delay caused by the limitation of data transmission speed and bandwidth is greatly reduced, and the data processing pressure of the cloud center is remarkably relieved.

Description

Motor fault diagnosis system and method based on edge calculation and deep learning

Technical Field

The invention relates to the technical field of motor fault diagnosis, in particular to a motor fault diagnosis system and method based on edge calculation and deep learning.

Background

The motor is very common in the modern industry, whether large factory enterprises or small factory enterprises can see the shadow of the motor everywhere, but large motors on some important production lines are more important to the enterprises, once a fault occurs, the whole production line can be stopped, and more serious economic loss and casualties can be brought to the enterprises, so that the normal and stable operation of the motor is very important to the enterprises.

At present, most of the maintenance of the motor is to carry out vibration noise, shell temperature inspection and the like on the motor every day by field maintenance personnel, the state of the motor is evaluated according to field experience for many years, and the other main measure is to carry out regular factory return maintenance on the motor. The experience of field maintenance personnel is likely to fail to find the fault of the motor in time, so that major accidents are caused, in addition, the motor is disassembled and maintained regularly, the work of the whole production line needs to be suspended, and the economic loss caused by excessive maintenance is also caused.

At present, most of the existing technical schemes are to arrange a data acquisition device on site, transmit data acquired in real time on site back to a cloud server, and apply a corresponding algorithm to detect the state of a motor and diagnose a fault. At present, common algorithms also stay in traditional neural network algorithms more frequently, such as BP neural network and probabilistic neural network, but in the face of huge data volume of a motor group, the shallow neural network algorithms have no advantages, so that the training speed is slow, and the precision of a diagnosis model is low;

on the other hand, the cloud processing method cannot meet the real-time requirement, and the detection is performed after a fault occurs, and in addition, the method has high requirements on the storage and calculation capacity of the server, and the safety and stability of transmitting a large amount of original data cannot be guaranteed.

With the development of network technology, the quantity of devices connected to a network is larger and larger, massive data is generated, if the data are directly transmitted back to a cloud server, large bandwidth and delay are generated, and meanwhile, the safety of processing massive data is difficult to guarantee; in addition, the shallow neural network algorithms are difficult to process huge data volume of a motor group, so that the problems of low network training speed, overfitting and the like can be caused, the diagnosis capability of the network is finally reduced, and the actual state of the motor cannot be well reflected.

Disclosure of Invention

The invention provides a motor fault diagnosis system and method based on edge calculation and deep learning, aiming at the problems that diagnosis calculation can be carried out only by transmitting all diagnosis data back to a cloud server and the accuracy rate of shallow neural network diagnosis is low.

In order to achieve the purpose, the invention provides a motor fault diagnosis system based on edge calculation and deep learning, which comprises a plurality of acquisition devices, a plurality of edge side devices and a cloud service center, wherein the acquisition devices are connected with the edge side devices through a network;

the acquisition device is used for acquiring vibration or current signals of the motor and sending the vibration or current signals to the corresponding edge side equipment;

the edge side equipment comprises an acquisition configuration module, a preprocessing module, an empirical mode decomposition module and a diagnosis module; the acquisition configuration module can configure the acquisition device; the preprocessing module is used for preprocessing the vibration or current signal; the empirical mode decomposition module is used for performing Empirical Mode Decomposition (EMD) on the preprocessed vibration or current signals to decompose the signals into n IMF components; the diagnostic module is internally provided with a depth residual error network diagnostic model, n IMF components are input for fault diagnosis, and a diagnostic result and fault data are uploaded to the cloud service center;

the cloud service center utilizes historical data and fault data to perform EMD, trains the depth residual error network diagnosis model periodically, sends the trained depth residual error network diagnosis model to each edge side device, and updates the depth residual error network diagnosis model in the edge side device.

Further, the empirical mode decomposition module processing flow includes:

s1 obtaining a pre-processed vibration or current signal x (t), further i =0;

s2, obtaining an envelope f on x (t) _max (t) and lower envelope f _min (t), calculating the mean value of the upper envelope line and the lower envelope line as m (t);

s2, making a difference between x (t) and the mean value m (t), and recording the difference as h ₁ (t)；

S3, judging whether i is smaller than n, if so, adding 1 to i to obtain the ith IMF component c _i (t)＝h ₁ (t) and using x (t) -h ₁ (t) replacing x (t), and returning to S2; if i is not less than n, n IMF components are output.

Further, the diagnostic module comprises an input convolution unit, 6 residual error units and an output unit; after the n IMF components are subjected to feature extraction through an input convolution unit, sequentially performing treatment through 6 residual error units connected in series; and finally, obtaining a diagnosis result by an output unit.

Further, the residual error unit includes a first batch normalization layer, a first ReLu activation function, a first convolution layer, a second batch normalization layer, a second ReLu activation function, a second convolution layer, a third batch normalization layer, a third ReLu activation function, a third convolution layer, and an identity mapping unit, which are connected in series in sequence, and the identity mapping unit superimposes and outputs an output of the third convolution layer and an input of the first batch normalization layer.

Further, the output unit comprises an output batch standardization layer, an output ReLu activation function, a global mean pooling layer, a Dropout layer and a full-connection output layer which are sequentially connected in series; the method comprises the steps that an output batch standardization layer carries out standardization processing on input features, an output ReLu activation function carries out nonlinear processing, a global mean value pooling layer averages feature maps of all input channels and outputs the averaged feature maps, a Dropout layer discards partial neurons, a full-connection output layer adopts a softmax function to calculate classification results, and the probabilities of various types of faults and no faults are output.

Further, the operation process of the first, second and third batch normalization layers and the output batch normalization layer includes processing n one-dimensional features characterized by n IMF components of each sample, respectively obtaining output features, and calculating a certain one-dimensional feature x of the s-th input sample _s Calculating output characteristic y _s Comprises the following steps:

wherein: n is a radical of _batch Refers to the number of samples contained in each batch, μ being N _batch Mean value of samples in a dimension, σ being N _batch The standard deviation of each sample in a dimension, γ and β are the scale and shift learnable parameters, respectively, and ε is a constant that is close to zero.

Further, the cloud service center includes: the system comprises a database, a selection module, an IMF component extraction module, a model training module and a packaging output module; the database stores historical vibration or current signals of the motor, vibration or current signals of the motor during the fault uploaded by each edge side device and a manually labeled diagnosis result; the selection module selects a sample from a database and inputs the sample to the IMF component extraction module, and the IMF component extraction module performs Empirical Mode Decomposition (EMD) on the input to be decomposed into n IMF components; the model training module inputs n IMF components into an internal deep residual error network diagnosis model for training, and when the requirements of training times or training precision are met, the deep residual error network diagnosis model is packaged and sent to each edge side device through the packaging output module.

And further, the system also comprises an application layer, and the application layer is used for reading and displaying the data of the cloud service center.

The invention also provides a motor fault diagnosis method based on edge calculation and deep learning, which comprises the following steps:

training a deep residual error network diagnosis model and loading the model to edge side equipment;

each edge side device receives vibration or current signals of the motor collected by the corresponding collecting device and carries out pretreatment; performing Empirical Mode Decomposition (EMD) on the preprocessed vibration or current signals to decompose the signals into n IMF components; inputting n IMF components by the deep residual error network diagnosis model to perform fault diagnosis, and uploading a diagnosis result and fault data to the cloud service center;

the cloud service center conducts EMD by using the diagnosis result and the fault data, trains the depth residual error network diagnosis model regularly, sends the trained depth residual error network diagnosis model to each edge side device, and updates the depth residual error network diagnosis model in the edge side device.

Further, the EMD module performs EMD on the preprocessed vibration or current signals and decomposes the signals into n IMF components;

the processing flow of EMD by the EMD module comprises the following steps:

s1 obtaining a pre-processed vibration or current signal x (t), further i =0;

S3, judging whether i is smaller than n, if so, adding 1 to i to obtain the ith IMF component c _i (t)＝h ₁ (t) and using x (t) -h ₁ (t) replacing x (t), and returning to S1; if i is not less than n, n IMF components are output.

Further, the depth residual error network diagnosis model comprises an input convolution unit, 6 residual error units and an output unit; after the n IMF components are subjected to feature extraction through an input convolution unit, the n IMF components are sequentially subjected to treatment through 6 residual error units which are connected in series; and finally, obtaining a diagnosis result by an output unit.

Further, the output module comprises an output batch standardization layer, an output ReLu activation function, a global mean pooling layer, a Dropout layer and a full-connection output layer which are sequentially connected in series; the method comprises the steps that an output batch standardization layer carries out standardization processing on input features, an output ReLu activation function carries out nonlinear processing, a global mean value pooling layer averages feature maps of all input channels and outputs the averaged feature maps, a Dropout layer discards partial neurons, a full-connection output layer adopts a softmax function to calculate classification results, and the probabilities of various types of faults and no faults are output.

Further, the cloud service center includes: the system comprises a database, a selection module, an IMF component extraction module, a model training module and a packaging output module; the database stores historical vibration or current signals of the motor, vibration or current signals of the motor during fault uploading by each edge side device and diagnosis results manually marked; the selection module selects a sample from a database and inputs the sample to the IMF component extraction module, and the IMF component extraction module performs Empirical Mode Decomposition (EMD) on the input to be decomposed into n IMF components; the model training module inputs n IMF components into an internal deep residual error network diagnosis model for training, and when the requirements of training times or training precision are met, the packaging output module packages the deep residual error network diagnosis model and sends the deep residual error network diagnosis model to each edge side device.

The technical scheme of the invention has the following beneficial technical effects:

(1) The existing common method adopts a data acquisition device to send locally acquired data to a cloud center for calculation and analysis to obtain a diagnosis result, but due to the huge amount of original data and the numerous field acquisition devices, real-time data transmission and storage are very great challenges for the cloud center, on-site equipment is easy to break down, but fault data are not timely uploaded to the cloud center due to various reasons, so that fault finding is delayed. The method and the device have the advantages that the deep residual error network model is trained in the cloud center, the trained deep residual error network model is issued to the edge side equipment, the motor equipment can be diagnosed in real time through calculation of the edge side equipment, the motor state can be diagnosed rapidly in time, and after the edge side equipment diagnoses a fault, the diagnosis result is transmitted to the cloud server, and fault data are transmitted to the cloud center, so that a fault database can be enriched continuously, and the trained diagnosis model can better accord with the actual condition of the motor.

(2) The existing method directly uses the collected original data as the input of the deep learning network, which also introduces much noise and causes inaccurate diagnosis result; according to the method, the current or vibration signals are collected, the original data are processed through a preprocessing method, and then the intrinsic mode function of the fault signal is extracted through an empirical mode decomposition algorithm, so that useful information of the original signal can be fully obtained, and the accuracy of a depth model is increased;

(3) The invention relates to a fault diagnosis model, in particular to an improved version of a basic convolutional neural network and a deep residual error network model, wherein the network can greatly reduce the training time of the network, improve the accuracy of the model and provide great support for quick update of a diagnosis model by introducing a residual error module.

(4) In addition, the processing of the edge computing is closer to a data source, only the processed data is transmitted to the cloud center, so that the time delay caused by the limitation of data transmission speed and bandwidth is greatly reduced, and the data processing pressure of the cloud center is remarkably relieved.

Drawings

FIG. 1 is a flow chart of a motor fault diagnosis system;

FIG. 2 is a diagram of a ResNet residual unit;

fig. 3 is a schematic diagram of a ResNet network structure.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the accompanying drawings in conjunction with the following detailed description. It should be understood that the description is intended to be exemplary only, and is not intended to limit the scope of the present invention. Moreover, in the following description, descriptions of well-known structures and techniques are omitted so as to not unnecessarily obscure the concepts of the present invention.

The invention aims to provide a motor fault diagnosis system and a motor fault diagnosis method, wherein the diagnosis system comprises a plurality of acquisition devices, a plurality of edge side devices, a cloud service center and an application layer, and is combined with a figure 1.

the edge side equipment has a data acquisition function, a data preprocessing function, an application deep learning model diagnosis function and functions of data uploading, local data storage and display; the edge side equipment comprises an acquisition configuration module, a preprocessing module, an empirical mode decomposition module and a diagnosis module; the acquisition configuration module can initialize the acquisition device and configure acquisition strategy parameters; the preprocessing module is used for preprocessing the vibration or current signals and adopts preprocessing methods such as filtering processing and the like; the EMD module is used for carrying out EMD on the preprocessed vibration or current signals and decomposing the preprocessed vibration or current signals into n IMF components; the diagnostic module is internally provided with a depth residual error network diagnostic model, n IMF components are input for fault diagnosis, and a diagnostic result and fault data are uploaded to the cloud service center; and applying the trained deep residual error network diagnosis model on the edge side equipment to diagnose the motor in real time, storing and displaying the diagnosis result only without storing normal real-time data when the diagnosis result is normal, and storing and displaying the diagnosis result and fault data when the diagnosis result is faulty. And uploading the diagnosis result and the fault data to a cloud center for displaying and storing, uploading the diagnosis result and the fault data of the corresponding fault to the cloud center when the diagnosis result indicates that a certain part has a fault, and manually confirming and calibrating the fault result. When all the parts of the diagnosis result are normal, the diagnosis result is only sent to the cloud service center, and the real-time data under normal working conditions is not sent to the cloud service center, so that the cost of data transmission can be greatly reduced.

The cloud service center has a function of carrying out initialization configuration on edge side equipment, has functions of receiving data, storing data and displaying, and comprises a fault database module function; the cloud service center utilizes the historical data and the fault data to regularly train the deep residual error network diagnosis model, sends the trained deep residual error network diagnosis model to each edge side device, and updates the deep residual error network diagnosis model in the edge side device.

The application layer comprises portable equipment such as office computers and notebooks, mobile equipment such as mobile phones and the like and functions of various applications for controlling and displaying real-time running states of the motors.

Furthermore, the empirical mode decomposition is a novel adaptive signal time-frequency processing method, and is particularly suitable for analysis processing of nonlinear non-stationary signals, and the method has the advantages that the method processes the signals according to the time scale characteristics of the method, and no basis function needs to be set, which is essentially different from the Fourier decomposition and the wavelet decomposition which are established on the prior harmonic basis function and the wavelet basis function.

The core of the method is empirical mode decomposition, which can decompose a complex signal into a finite number of eigenmode functions (IMFs), each decomposed IMF component comprises local characteristic signals of different time scales of an original signal, and the decomposition is based on the local characteristics of the time scales of a signal sequence, so that the method has self-adaptability. The eigenmode function must have 2 conditions met:

condition 1: in the whole time range of the function, the number of local extreme points and zero-crossing points must be equal, or the difference is at most 1;

condition 2: at any point in time, the local maximum envelope and the local minimum envelope must be 0 on average.

For a given signal, an EMD method is adopted to solve an intrinsic mode function, and the screening process of EMD analysis is as follows: the empirical mode decomposition module processing flow comprises the following steps:

s1 obtaining a pre-processed vibration or current signal x (t), further i =0;

s2 findingAll extreme points on x (t) are obtained, and all extreme points and minimum points are respectively connected by a curve to form an upper envelope f _max (t) and lower envelope f _min (t), taking the mean value of the upper envelope line and the lower envelope line as m (t);

s3, making a difference between x (t) and the mean value m (t), and recording the difference as h ₁ (t); namely, the following equation:

h ₁ (t)＝x(t)-m(t)

ideally, h ₁ (t) is a fundamental mode component, whereas the actual signal is generally more complex, h ₁ (t) repeating the above process as new x (t) until a condition is met, then h ₁ (t) is the first fundamental mode component, noted:

c ₁ (t)＝h ₁ (t)

s4, judging whether i is smaller than n, if so, adding 1 to i to obtain the ith IMF component c _i (t)＝h ₁ (t) and using x (t) -h ₁ (t) replace x (t), i.e., x (t) = x (t) -c ₁ (t), return to S2; if i is not less than n, n IMF components are output. Sequentially obtaining a second IMF component c ₂ (t), third IMF component c ₃ (t) up to the Nth IMF component c _n And (t), finally decomposing the original signal into n IMF components after EMD decomposition. The n IMF components characterize n one-dimensional features of the sample.

Further, with reference to fig. 3, the diagnostic module includes an input convolution unit, 6 residual error units, and an output unit; after the n IMF components are subjected to feature extraction through an input convolution unit, the n IMF components are sequentially subjected to input convolution unit and 6 residual error units connected in series; and finally, obtaining a diagnosis result by an output unit. With reference to fig. 2, the residual error unit includes a first batch normalization layer, a first ReLu activation function, a first convolution layer, a second batch normalization layer, a second ReLu activation function, a second convolution layer, a third batch normalization layer, a third ReLu activation function, a third convolution layer, and an identity mapping unit, which are sequentially connected in series, and the identity mapping unit superimposes and outputs an output of the third convolution layer and an input of the first batch normalization layer.

The output unit comprises an output batch standardization layer, an output ReLu activation function, a global mean pooling layer, a Dropout layer and a full-connection output layer which are sequentially connected in series; the method comprises the steps that an output batch standardization layer carries out standardization processing on input features, an output ReLu activation function carries out nonlinear processing, a global mean value pooling layer averages feature maps of all input channels and outputs the averaged feature maps, a Dropout layer discards partial neurons, a full-connection output layer adopts a softmax function to calculate classification results, and the probabilities of various types of faults and no faults are output.

The method comprises the steps of extracting Intrinsic Mode Functions (IMF) on the edge equipment by applying an empirical mode decomposition algorithm to the acquired signals, using a matrix formed by extracted IMF components of various orders as input of a deep learning diagnosis model, diagnosing a field motor in real time through the diagnosis model, uploading a diagnosis result and fault data to a cloud center, storing the fault data, training a deep neural network model regularly by applying historical data and the uploaded fault data to the cloud center, and issuing the trained deep learning model to each edge equipment, so that the deep learning diagnosis model can be updated continuously, the diagnosis response to the motor is faster, the efficiency is higher, and the accuracy of the fault diagnosis result is higher. The system can display the states of the motors through a plurality of platforms, so that a client can clearly master the real-time conditions of the motors.

The residual error network model is an upgraded convolutional neural network (ConvNet) model in nature, and can solve the problem that model training is gradually difficult as the number of layers of the convolutional network model is continuously deepened. The residual network model is typically mapped by input layer, convolutional layer, batch normalization, identity. The global mean pooling and the full-link output layer are combined by important basic modules. For the deep residual network (ResNet) diagnostic model, different basic units have different functions. The ResNet model usually contains a certain number of basic residual units, which are the core components of the ResNet model. The main part of the ResNet model is formed by stacking a plurality of basic residual modules. Common basic residual units are composed of two convolutional layers, two ReLu activation functions, two batch normalizations, and one identity map. The invention applies an improved version of a residual error module, the path of which comprises 'batch standardization-ReLu activation function-convolution layer-batch standardization-ReLu activation function-convolution layer', the residual error unit is more beneficial to the back propagation of errors in the neural network, and the parameters are easier to train. The residual error module used in the present invention is shown in fig. 2.

(1) Convolutional layer

The convolutional layer is one of the core parts of the ConvNet model and the ResNet model. One of its main advantages over the conventional fully-connected layer is that the training weights required for training are significantly reduced. In practice, the convolution kernel may act as a trainable local feature extraction operator. During the convolution operation, the convolution kernel is slid over the input data to extract the basic local features at different locations. The same convolution kernel can be used for extracting the same type local features at different positions, so that the utilization efficiency of the weight in the neural network is improved. As shown in fig. 3, the present invention first uses 3 × 3 input convolution units for feature extraction. And in each residual error unit, three convolution units of 3 x 3 are respectively arranged for feature extraction.

(2) Batch standardization layer (BN)

When the minimum batch stochastic gradient descent method is adopted, a certain number of randomly selected training samples are input into the deep neural network in each iteration of the deep neural network training process, and forward propagation and backward propagation are calculated. However, the feature distribution between these small batches of training samples tends to change continuously during the iterative process, in which case the weights and biases need to be updated continuously to adapt to the distribution after the change, slowing down the speed of deep neural network training. Batch normalization is a normalization method proposed for this internal covariance drift problem. Unlike the general standardization method, the batch standardization is an operation embedded between layers inside the deep neural network, and can be expressed as:

wherein: n is a radical of hydrogen _batch Refers to the number of samples contained in each batch, and u is N _batch The mean value of the samples in a certain dimension, sigma being N _batch The standard deviation of each sample in one dimension, γ and β, are learnable parameters used to scale and translate the normalized value, respectively, and ε is a constant that is close to zero. Y of _s Is x _s The corresponding output characteristic. Three batch normalization layers (BN) are arranged in each residual unit, and the output unit is also provided with the batch normalization layer (BN) for normalization processing.

During deep neural network training, with continuous updating of parameters, the data distribution input by each middle layer is often greatly different from that before parameter updating, so that the network is required to adapt to new number distribution continuously, and the training of the whole deep network is extremely difficult. By introducing batch standardization, the input layers are not only subjected to standardization treatment, but also the input of each middle layer is subjected to standardization treatment, so that the problem of drift of internal covariates can be avoided, and the problems of difficulty in training and low training speed are solved.

(3) Activating a function

The activation function is usually an indispensable part of a neural network algorithm, and the core role of the activation function is to realize the nonlinear mapping of input data in the neural network, so that the neural network can fit the nonlinear relation between the input data and the label. Because most of the current neural network algorithms need to adopt an error back propagation algorithm to train internal parameters, the output of an activation function needs to be differentiable for input. Currently common activation functions include Sigmoid, tanh, and Relu activation functions, among others. Because the partial derivative number of the Relu activation function is only possibly 0 or 1, compared with the traditional Sigmoid and Tanh activation functions, the Relu activation function can effectively reduce the risks of gradient disappearance and gradient explosion of the deep neural network in the training process and reduce the training difficulty of internal parameters of the deep neural network, so the Relu activation function is adopted by default, and the formula is expressed as follows:

wherein x is _ReLu And y _ReLu The input characteristic and the output characteristic of the ReLu activation function are respectively.

As shown in fig. 3, each batch normalization layer (BN) is followed by a ReLu activation function, which performs non-linear processing on the output characteristics of the batch normalization layer (BN). The activation function is used for introducing nonlinear factors into the neural network, and the neural network can fit various curves through the activation function, so that the expression capacity of the neural network is stronger.

(4) Global mean pooling GAP

The ResNet model adopted by the invention adopts the operation of global mean pooling at the position close to the output layer. Global mean pooling refers to calculating a corresponding mean value from the feature map of each channel as an output feature, and can be expressed as:

Y _GAP (1,1,i _ch )＝average(XGAP(i _ro ,i _co ,i _ch ))

wherein: x _GAP And Y _GAP An input feature map and an output feature map representing global mean pooling, respectively.

The global mean pooling not only can significantly reduce the number of weights to be trained, but also can effectively solve the problem of translational changes. That is, for the vibration signals of the same fault type, the positions of the fault impacts in the sample are different, and the learned writing characteristics in the conventional neural network are also different accordingly. The global mean pooling can realize invariance of the deep learning model to fault impact positions by calculating the global feature of mean, and solves the problem to a certain extent, thereby improving the generalization effect of the deep learning model on test data.

(5) Dropout layer

Dropout is a more common method of mitigating multiple fits. Dropout randomly discards some neurons during each training process, allowing the neural network to forward propagate and reverse update only the remaining neuron parameters. In this way, drop can weaken the "cooperative relationship" in front of the neurons, making the effect of each neuron more independent, thereby achieving the effect of model regularization. During the test, if there is no Dropout operation, i.e. all neurons participate in the calculation, the invention performs Dropout operation on the global mean pooled output, setting the discard ratio to 50%.

(6) Output layer

The cross entropy loss function is a classical loss function and is often used for multi-classification problems of neural networks and deep neural networks. In the ResNet model, the feature dimensions of the output layer are the same as the number of classes participating in training. Before calculating the cross entropy loss function, firstly, the characteristic x of the output layer needs to be processed by adopting a softmax function _j ，j＝1,2,…,N _class Forced conversion to [0,1 ]]Fraction of interval, wherein N _class Indicating the number of categories to which the classification task relates. The Softmax function can be expressed in the form of a formula:

in general, in the multi-classification problem of the shallow and deep neural networks, the labels of each class are in the form of vectors, only one value is 1, and the other values are 0, and in this label form, the cross-entropy loss function can be defined as:

wherein, t _j A true label, y, representing the specimen _j Representing the output of the softmax function. In general, cross-entropy loss functions can lead to higher training speeds in neural network-based multi-classification problems than more traditional squared error functions.

Step3: first fundamental mode component c ₁ After (t) is decomposed, c is subtracted from the original signal x (t) ₁ (t) obtaining a residual signal x ₁ (t)

x ₁ (t)＝x(t)-c ₁ (t)

Step4: x is to be ₁ (t) as an original signal x (t), repeating the steps 1 to 3 to obtain a second IMF component c ₂ (t), third IMF component c ₃ (t) up to the Nth IMF component c _n And (t), finally decomposing the original signal into n IMF components after EMD decomposition.

The cloud computing transmits data to the cloud end, the application deployed by the cloud end is used for computing, and the computing result is transmitted back to each user after computing is completed, so that the user does not need to pay attention to the problems of environment, hardware conditions and the like. In addition, the processing of the edge computing is closer to a data source, only the processed data is transmitted to the cloud center, so that the time delay caused by the limitation of data transmission speed and bandwidth is greatly reduced, and the data processing pressure of the cloud center is remarkably relieved.

In summary, the present invention relates to a motor fault diagnosis system and method based on edge calculation and deep learning, where the edge device may configure an acquisition device, perform Empirical Mode Decomposition (EMD) after preprocessing a vibration or current signal, input n IMF components to a diagnosis module built-in deep residual error network diagnosis model for fault diagnosis, and upload a diagnosis result and fault data to a cloud service center; the cloud service center utilizes the historical data and the fault data to regularly train the deep residual error network diagnosis model, sends the deep residual error network diagnosis model to each edge side device after training, and updates the deep residual error network diagnosis model in the edge side device. The motor equipment can be diagnosed in real time through calculation on the edge side equipment, and the motor state can be diagnosed in time and rapidly. By sinking the detection calculation to the edge equipment, the collected field data are directly processed and calculated on the edge equipment combined with the deep learning algorithm, and the detection result and the fault data are sent back to the cloud center server, so that the detection efficiency and the diagnosis accuracy are greatly improved, and the data transmission cost can be reduced.

It is to be understood that the above-described embodiments of the present invention are merely illustrative of or explaining the principles of the invention and are not to be construed as limiting the invention. Therefore, any modification, equivalent replacement, improvement and the like made without departing from the spirit and scope of the present invention should be included in the protection scope of the present invention. Further, it is intended that the appended claims cover all such variations and modifications as fall within the scope and boundaries of the appended claims or the equivalents of such scope and boundaries.

Claims

1. A motor fault diagnosis system based on edge calculation and deep learning is characterized by comprising a plurality of acquisition devices, a plurality of edge side devices and a cloud service center;

the cloud service center performs EMD (empirical mode decomposition) by using historical data and fault data, trains the depth residual error network diagnosis model periodically, sends the trained depth residual error network diagnosis model to each edge side device, and updates the depth residual error network diagnosis model in the edge side device;

the diagnosis module comprises an input convolution unit, 6 residual error units and an output unit; after the n IMF components are subjected to feature extraction through an input convolution unit, the n IMF components are sequentially subjected to treatment through 6 residual error units which are connected in series; finally, obtaining a diagnosis result by an output unit;

the residual error unit comprises a first batch of standardized layers, a first ReLu activation function, a first convolution layer, a second batch of standardized layers, a second ReLu activation function, a second convolution layer, a third batch of standardized layers, a third ReLu activation function, a third convolution layer and an identity mapping unit which are sequentially connected in series, and the identity mapping unit outputs the output of the third convolution layer after being overlapped with the input of the first batch of standardized layers.

2. The system according to claim 1, wherein the EMD module process flow comprises:

s1 obtaining a pre-processed vibration or current signal x (t), further i =0;

S3, judging whether i is smaller than n, if so, adding 1 to i to obtain the ith IMF component c _i (t)＝h ₁ (t) and using x (t) -h ₁ (t) replacing x (t), and returning to S2; such asIf i is not less than n, n IMF components are output.

3. The system for motor fault diagnosis based on edge calculation and deep learning of claim 2, wherein the output unit comprises an output batch normalization layer, an output ReLu activation function, a global mean pooling layer, a Dropout layer and a fully-connected output layer which are connected in series in sequence; the method comprises the steps that an output batch standardization layer carries out standardization processing on input features, an output ReLu activation function carries out nonlinear processing, a global mean value pooling layer averages feature maps of all input channels and outputs the averaged feature maps, a Dropout layer discards partial neurons, a full-connection output layer adopts a softmax function to calculate classification results, and the probabilities of various types of faults and no faults are output.

4. The system of claim 3, wherein the first, second, third and output batch normalization layers are operated by processing n one-dimensional features of the samples characterized by the n IMF components of each sample to obtain an output feature, and calculating a certain one-dimensional feature x of the s-th input sample _s Computing output feature y _s Comprises the following steps:

/>

wherein: n is a radical of _batch Refers to the number of samples contained in each batch, and μ is N _batch Mean value of samples in a dimension, σ being N _batch The standard deviation of each sample in one dimension, gamma and beta are the scale and translation, respectivelyThe learnable parameter, ε, is a constant that is close to zero.

5. The edge-computing and deep-learning based motor fault diagnosis system according to claim 1 or 2, wherein the cloud service center comprises: the system comprises a database, a selection module, an IMF component extraction module, a model training module and a packaging output module; the database stores historical vibration or current signals of the motor, vibration or current signals of the motor during fault uploading by each edge side device and diagnosis results manually marked; the selection module selects a sample from a database and inputs the sample to the IMF component extraction module, and the IMF component extraction module performs Empirical Mode Decomposition (EMD) on the input to be decomposed into n IMF components; the model training module inputs n IMF components into an internal deep residual error network diagnosis model for training, and when the requirements of training times or training precision are met, the packaging output module packages the deep residual error network diagnosis model and sends the deep residual error network diagnosis model to each edge side device.

6. The motor fault diagnosis system based on edge computing and deep learning as claimed in claim 1 or 2, further comprising an application layer for reading and displaying data of the cloud service center.

7. A motor fault diagnosis method based on edge calculation and deep learning is characterized by comprising the following steps:

each edge side device receives vibration or current signals of the motor collected by the corresponding collecting device and carries out pretreatment; performing Empirical Mode Decomposition (EMD) on the preprocessed vibration or current signals to decompose the signals into n IMF components; inputting n IMF components by the deep residual error network diagnosis model to perform fault diagnosis, and uploading a diagnosis result and fault data to a cloud service center;

the cloud service center performs Empirical Mode Decomposition (EMD) by using the diagnosis result and the fault data, trains the deep residual error network diagnosis model at regular intervals, sends the trained deep residual error network diagnosis model to each edge side device, and updates the deep residual error network diagnosis model in the edge side device;

the depth residual error network diagnosis model comprises an input convolution unit, 6 residual error units and an output unit; after the n IMF components are subjected to feature extraction through an input convolution unit, sequentially performing treatment through 6 residual error units connected in series; finally, obtaining a diagnosis result by an output unit;

8. The edge calculation and deep learning based motor fault diagnosis method according to claim 7, wherein the EMD module performs EMD on the preprocessed vibration or current signals and decomposes the EMD into n IMF components;

the processing flow of EMD by the EMD module comprises the following steps:

s1 obtaining a preprocessed vibration or current signal x (t), where i =0;

s2, obtaining an envelope f on x (t) _max (t) and the lower envelope f _min (t), calculating the mean value of the upper envelope line and the lower envelope line as m (t);

9. The motor fault diagnosis method based on the edge calculation and the deep learning as claimed in claim 8, wherein the output module comprises an output batch standardization layer, an output ReLu activation function, a global mean pooling layer, a Dropout layer and a fully-connected output layer which are connected in series in sequence; the method comprises the steps that an output batch standardization layer carries out standardization processing on input features, an output ReLu activation function carries out nonlinear processing, a global mean value pooling layer averages feature maps of all input channels and outputs the averaged feature maps, a Dropout layer discards partial neurons, a full-connection output layer adopts a softmax function to calculate classification results, and the probabilities of various types of faults and no faults are output.

10. The motor fault diagnosis method based on edge computing and deep learning of claim 8, wherein the cloud service center comprises: the system comprises a database, a selection module, an IMF component extraction module, a model training module and a packaging output module; the database stores historical vibration or current signals of the motor, vibration or current signals of the motor during the fault uploaded by each edge side device and a manually labeled diagnosis result; the selection module selects a sample from a database and inputs the sample to the IMF component extraction module, and the IMF component extraction module performs Empirical Mode Decomposition (EMD) on the input to be decomposed into n IMF components; the model training module inputs n IMF components into an internal deep residual error network diagnosis model for training, and when the requirements of training times or training precision are met, the packaging output module packages the deep residual error network diagnosis model and sends the deep residual error network diagnosis model to each edge side device.