CN114048787B - Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model - Google Patents

Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model Download PDF

Info

Publication number
CN114048787B
CN114048787B CN202210024754.5A CN202210024754A CN114048787B CN 114048787 B CN114048787 B CN 114048787B CN 202210024754 A CN202210024754 A CN 202210024754A CN 114048787 B CN114048787 B CN 114048787B
Authority
CN
China
Prior art keywords
bearing
fault
model
attention
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210024754.5A
Other languages
Chinese (zh)
Other versions
CN114048787A (en
Inventor
蔡绍滨
陈鑫
王宇昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Qianruisheng Intelligent Technology Co.,Ltd.
Original Assignee
Hangzhou Yunzhisheng Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Yunzhisheng Intelligent Technology Co ltd filed Critical Hangzhou Yunzhisheng Intelligent Technology Co ltd
Priority to CN202210024754.5A priority Critical patent/CN114048787B/en
Publication of CN114048787A publication Critical patent/CN114048787A/en
Application granted granted Critical
Publication of CN114048787B publication Critical patent/CN114048787B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01MTESTING STATIC OR DYNAMIC BALANCE OF MACHINES OR STRUCTURES; TESTING OF STRUCTURES OR APPARATUS, NOT OTHERWISE PROVIDED FOR
    • G01M13/00Testing of machine parts
    • G01M13/04Bearings
    • G01M13/045Acoustic or vibration analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Abstract

The invention provides a bearing fault real-time intelligent diagnosis method and system based on an Attention CNN model, which comprises the steps of collecting a fault bearing vibration signal by using a vibration sensor, and then segmenting the fault bearing vibration signal by adopting a fixed-length random segmentation method to obtain a data sample; after labels corresponding to various types are attached to the data samples according to the state types of the rolling bearings, the data samples are divided into a training set, a verification set and a test set according to a certain proportion; respectively manufacturing a plurality of bearing fault data sets in an unbalanced state according to the training set and the verification set, and forming an unbalanced data set by all the manufactured bearing fault data sets; constructing the model, and respectively training the model by using different bearing fault data sets to obtain the training model; and carrying out real-time fault detection on the rolling bearing by using the training model. The invention can accurately and automatically identify the running state of the bearing in real time, thereby effectively maintaining the normal running of mechanical equipment.

Description

Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model
Technical Field
The invention relates to the technical field of equipment health management, in particular to a real-time intelligent diagnosis method and a real-time intelligent diagnosis system for bearing faults based on an Attention CNN model.
Background
The bearing is a key component of modern industrial equipment, the working scene of the bearing is complex, and once the bearing breaks down, serious safety accidents can be caused, so that a great amount of casualties and huge economic losses are caused. The bearing is one of the key supporting components in equipment such as helicopters, aero-engines, wind driven generators and the like, so whether faults can be detected timely and accurately is of great importance to eliminating the potential safety hazards of machinery. Therefore, how to diagnose the fault of the rotating machinery accurately and automatically in real time has important significance for ensuring the normal operation and the safe production of the rotating machinery.
The conventional signal-based method refers to extracting fault features from time domain signals or frequency domain signals for diagnosis by using various signal analysis techniques.
The machine learning method is also applied to fault diagnosis of the rotary machine, and the machine learning can extract features for classification without abundant expert knowledge for judgment, so that the diagnosis difficulty is reduced. Machine learning uses vibration signals as samples for input in fault diagnosis, and then performs classification by extracting features. The traditional fault classification algorithm and the machine learning algorithm have certain achievements in the field of fault identification, but the traditional fault detection algorithm depends on expert experience, meanwhile, the machine learning algorithm cannot well learn complex nonlinear relations in vibration signals, fault features cannot be automatically extracted, and different types of fault identification needs to design different types of feature extractors.
The deep learning method is greatly developed in the field of fault identification, deep features can be extracted from original signals, massive complex data are processed, representative features can be extracted in a self-adaptive mode based on the deep learning method, manual intervention is not needed, and compared with a traditional algorithm, the fault identification has higher accuracy. In the deep learning field, the convolutional neural network reduces the parameter quantity due to the convolution and pooling operation, and improves the speed and the accuracy of identification through local receptive fields (local perceptual fields) and shared weights (shared weights).
Although various rotary machine fault detection models are available and good experimental results are obtained, many challenges still exist in the field of rotary machine fault diagnosis. In order to obtain higher accuracy, the traditional deep learning fault detection model usually adopts a multilayer neural network superposition mode, so that the model complexity is too high. The model with the stacked multilayer networks has extremely high requirements on training equipment, and the fault detection time of the successfully trained model is too long, so that the method is not suitable for the problem of fault real-time diagnosis in an industrial scene. Secondly, a large number of high-quality samples are needed for the deeply learned fault diagnosis model, so that the model can be ensured not to be over-fitted. The existing model is tested under the condition of ideal sample quantity, and the actual industrial condition is that a fault sample is less than a normal sample, so that the data imbalance phenomenon generally exists in the industrial field.
Aiming at the defects of the prior art, the invention provides a real-time intelligent bearing fault diagnosis method based on an Attention CNN model under the condition of data imbalance. The bearing fault detection method has the advantages of high bearing fault detection precision, short detection time, high accuracy and good stability under the condition of unbalanced data.
Disclosure of Invention
The invention provides an Attention CNN model-based bearing fault real-time intelligent diagnosis method and system, which aim to overcome the defects in the prior art.
In one aspect, the invention provides a bearing fault real-time intelligent diagnosis method based on an Attention CNN model, which comprises the following steps:
s1: collecting a vibration signal of a fault bearing by using a vibration sensor, and then segmenting the vibration signal of the fault bearing by adopting a fixed-length random segmentation method to obtain a data sample;
s2: after labels corresponding to all types are attached to the data sample according to the state type of the rolling bearing, the data sample is divided into a training set, a verification set and a test set according to a certain proportion;
s3: respectively manufacturing a plurality of bearing fault data sets in an unbalanced state according to the training set and the verification set, and forming an unbalanced data set by all the manufactured bearing fault data sets;
s4: constructing an Attention CNN model, respectively training the Attention CNN model by using different bearing fault data sets, and simultaneously carrying out model test by using the test set to finally obtain an Attention CNN training model;
s5: and carrying out real-time fault detection on the rolling bearing by utilizing the Attention CNN training model.
The method firstly adopts fixed-length random segmentation to enhance the randomness of data, thereby enhancing the robustness of the model; secondly, when the model is constructed, the overfitting of the model is relieved by batch normalization, Dropout and L2 regularization, so that the detection efficiency of the model is improved; then, the importance degree of the features to the final output is calculated through an attention mechanism, so that the method still has excellent fault diagnosis capability under the condition of no multilayer neural network, and has excellent stability under the condition of unbalanced data; in the model training stage, an exponential decay learning rate and a callback function are adopted to prevent the model from being over-fitted and store the optimal model in the training process; finally, the stored model is applied to real-time fault detection of the rotary machine; the Attention CNN model network structure comprises: a convolutional layer, a pooling layer, a Batch Normalization (BN) layer, an Attention module, a full link layer, and a classification layer. The convolution layer carries out feature extraction on the original vibration signal; the pooling layer performs down-sampling on the characteristics of the convolution layer to reduce model parameters; the BN layer accelerates the fitting speed of the network, and the calculation efficiency is accelerated; the Attention module calculates the importance degree of the features to the final output; the full connection layer maps the features; the classification layer classifies features. The invention can accurately and automatically identify the running state of the bearing in real time, thereby effectively maintaining the normal running of mechanical equipment.
In a specific embodiment, the state types of the rolling bearing are divided into a normal state and a single-point defect with different diameters is respectively manufactured on the outer ring, the inner ring and the rolling body of the bearing by using an electric discharge machining technology.
In a specific embodiment, the creating a plurality of bearing fault data sets in an unbalanced state according to the data in the training set and the verification set respectively specifically includes:
performing the following operations on the training set and the validation set, respectively:
under the condition that the number of normal data samples is fixed, simulating the conditions under different unbalanced states according to the ratio of normal data to fault data in various different proportions;
and obtaining a plurality of corresponding bearing fault data sets in an unbalanced state for the training set and the verification set respectively.
In a specific embodiment, the segmenting the vibration signal of the faulty bearing by using a fixed-length random segmentation method to obtain a data sample specifically includes:
when the total Length of the sampling data of the data sample of the faulty bearing vibration signal is Length, and the sampling window is W, and a random number index is taken in the (0, Length-W) section, the Length of the data sample is index + W. This is done to enhance the randomness of the data, thereby increasing the robustness of the model.
In a specific embodiment, the specific steps of constructing the Attention CNN model include:
t1: constructing a first layer of convolution, wherein the first convolution layer is a one-dimensional convolution neural network, the size of convolution kernels is set to be 64, the number of the convolution kernels is set to be 32, the step length is 2, and Same filling is adopted;
t2: adding Batch Normalization (BN) after the T1 to improve the calculation speed of the model;
t3: adding a Relu activation function after the T2, and accelerating convergence and relieving gradient disappearance by using the Relu activation function;
t4: introducing a maximum pooling layer after the T3, and performing down-sampling by using the maximum pooling layer to reduce the parameter number of the model;
t5: dropout is introduced after the T4, and is utilized to make a part of neurons inactive, so that the number of model neurons is reduced, and the model complexity is reduced;
t6: introducing a second layer of convolutional neural network after the T5, wherein the second convolutional layer is a one-dimensional convolutional neural network, setting the size of convolutional kernels and the number of convolutional kernels, and adopting Same filling;
t7: adding Batch Normalization (BN) after the T6 to improve the calculation speed of the model;
t8 addition of Relu activation function after the T7 accelerates convergence and relieves gradient disappearance;
t9 introducing a Flatten layer after the T8, wherein the Flatten layer tiles high-dimensional features into a one-dimensional space;
t10, adding an Attention mechanism after the T9 to focus Attention on the part with fault information, thereby improving the classification accuracy and the detection efficiency;
t11: mapping the characteristics output by the full connection layer mechanism by using the full connection layer;
t12: and the number of the neurons of the classification layer is equal to the total number of bearing fault classes, and a Softmax activation function is adopted.
In a specific embodiment, the step of constructing the Attention CNN model includes:
taking characteristic information in Source as a plurality of Key, Value data pairs, obtaining a weight coefficient of Value corresponding to each Key by calculating the correlation between Query and each Key, and then carrying out weighted summation on Value according to the weight coefficient so as to obtain a final Attention Value;
the method specifically comprises the following steps:
a1 computing Query sum by dot product
Figure 398753DEST_PATH_IMAGE002
The dot product model can better utilize the matrix product in the aspect of realization, so that the calculation efficiency is higher, and the requirements of real-time fault detection are met;
Figure 41087DEST_PATH_IMAGE004
a2: and numerically converting the score of the first stage by introducing a calculation mode of Softmax, wherein the numerical conversion comprises the following steps: normalizing and sorting the originally calculated values into probability distribution with the sum of element weights being 1; the weight of the important features is more prominent through the intrinsic mechanism of Softmax; obtaining the weight coefficient after the numerical value conversion
Figure 255031DEST_PATH_IMAGE006
Figure 941096DEST_PATH_IMAGE008
A3: wherein
Figure 506069DEST_PATH_IMAGE010
To represent
Figure 420936DEST_PATH_IMAGE012
And weighting and summing the corresponding weight coefficients to obtain an Attention value, wherein the Attention value is as follows:
Figure 754965DEST_PATH_IMAGE014
in a specific embodiment, the training the Attention CNN model with the different bearing fault data sets respectively specifically includes:
in order to prevent overfitting of the model during training, L2 regularization was used at the convolutional layer;
the exponential decay learning rate is adopted in the process of training the Attention CNN model, the efficiency of model training can be greatly improved, a better solution is quickly obtained by using a larger learning rate, then the learning rate is gradually reduced along with the continuation of iteration, so that the model is more stable in the later training period, and the method can avoid a loss platform (loss platform) and is an effective strategy for jumping out a local optimal solution.
Figure 113397DEST_PATH_IMAGE016
Wherein
Figure 165666DEST_PATH_IMAGE018
In order to be the initial learning rate,
Figure 618644DEST_PATH_IMAGE020
in order to achieve the final learning rate,
Figure 322027DEST_PATH_IMAGE022
for the attenuation rate, decay _ step is the attenuation speed, global _ step is the current iteration number;
using a callback function (callback) in training the Attention CNN model; the callback function can interrupt the training when overfitting is just started, thereby avoiding using fewer rounds to train the model from the beginning, and simultaneously directly saving the optimal model appearing in the training process.
And using an Adma optimizer and a cross entropy loss function in the training process of the Attention CNN model, and updating the network weight by using a gradient descent algorithm until a callback function is triggered to terminate the training to obtain the Attention CNN training model.
In a specific embodiment, the performing real-time fault detection on the rolling bearing by using the Attention CNN training model specifically includes:
deploying the Attention CNN training model offline and testing the fault diagnosis accuracy rate in various unbalanced states by using the bearing fault data set;
and respectively recording the bearing fault data set used for testing and the time used for testing the corresponding fault data for the Attention CNN training model which is deployed off line.
According to a second aspect of the present invention, a computer-readable storage medium is proposed, on which a computer program is stored, which computer program, when being executed by a computer processor, carries out the above-mentioned method.
According to a third aspect of the present invention, a real-time intelligent diagnosis system for bearing faults based on an Attention CNN model is provided, which comprises:
a data sample acquisition module: the method comprises the steps that a vibration sensor is used for collecting a vibration signal of a fault bearing, and then the vibration signal of the fault bearing is divided by adopting a fixed-length random dividing method to obtain a data sample;
a data sample division module: after the data sample is configured and used for attaching labels corresponding to all types to the data sample according to the state type of the rolling bearing, the data sample is divided into a training set, a verification set and a test set according to a certain proportion;
an unbalanced data set construction module: the system is configured and used for respectively manufacturing a plurality of bearing fault data sets in an unbalanced state according to the training set and the verification set and forming an unbalanced data set by all the manufactured bearing fault data sets;
attention CNN model training module: configuring and constructing an Attention CNN model, respectively training the Attention CNN model by using different bearing fault data sets, and simultaneously carrying out model test by using the test set to finally obtain an Attention CNN training model;
a real-time fault detection module: and the real-time fault detection device is configured for utilizing the Attention CNN training model to detect the fault of the rolling bearing in real time.
Firstly, fixed-length random segmentation is adopted to enhance the randomness of data, so that the robustness of a model is enhanced; secondly, when the model is constructed, the overfitting of the model is relieved by batch normalization, Dropout and L2 regularization, so that the detection efficiency of the model is improved; then, the importance degree of the features to the final output is calculated through an attention mechanism, so that the method still has excellent fault diagnosis capability under the condition of no multilayer neural network, and has excellent stability under the condition of unbalanced data; in the model training stage, an exponential decay learning rate and a callback function are adopted to prevent the model from being over-fitted and store the optimal model in the training process; finally, the stored model is applied to real-time fault detection of the rotary machine; the Attention CNN model network structure comprises: a convolutional layer, a pooling layer, a Batch Normalization (BN) layer, an Attention module, a full link layer, and a classification layer. The convolution layer carries out feature extraction on the original vibration signal; the pooling layer performs down-sampling on the characteristics of the convolution layer to reduce model parameters; the BN layer accelerates the fitting speed of the network, and the calculation efficiency is accelerated; the Attention module calculates the importance degree of the features to the final output; the full connection layer maps the features; the classification layer classifies features. The invention can accurately and automatically identify the running state of the bearing in real time, thereby effectively maintaining the normal running of mechanical equipment.
Drawings
The accompanying drawings are included to provide a further understanding of the embodiments and are incorporated in and constitute a part of this specification. The drawings illustrate embodiments and together with the description serve to explain the principles of the invention. Other embodiments and many of the intended advantages of embodiments will be readily appreciated as they become better understood by reference to the following detailed description. Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is an exemplary system architecture diagram in which the present application may be applied;
FIG. 2 is a flowchart of a bearing fault real-time intelligent diagnosis method based on the Attention CNN model according to an embodiment of the present invention;
FIG. 3 is a flowchart of a real-time intelligent fault diagnosis method for a bearing based on an Attention CNN model under the condition of data imbalance according to a specific embodiment of the present invention;
FIG. 4 is a data fixed-length sampling illustration of a particular embodiment of the present invention;
FIG. 5 is a schematic illustration of normal bearing vibration signals for a specific embodiment of the present invention;
FIG. 6 is a graph of rolling element fault bearing vibration signals at 0.007 (601), 0.014 (602) and 0.021 (603) inch damage diameters for a specific embodiment of the present invention;
FIG. 7 is a graph of inner ring fault bearing vibration signals at 0.007 (701), 0.014 (702), and 0.021 (703) inch damage diameters for a specific embodiment of the present invention;
FIG. 8 is a graph of outer ring fault bearing vibration signals at 0.007 (801), 0.014 (802) and 0.021 (803) inch gage diameters for a specific embodiment of the present invention;
FIG. 9 is a t-SNE feature visualization of raw data distribution for a particular embodiment of the present invention;
FIG. 10 is a t-SNE feature visualization of data distribution after model processing for a specific embodiment of the invention;
FIG. 11 is a graph illustrating accuracy curves and loss function curves of a training set and a validation set of a bearing data set Attention CNN model according to an embodiment of the present invention;
FIG. 12 is a confusion matrix of a balanced data set (1201), a normal data set to failure data set (1202) and a normal data set to failure data set 10:1 data set (1203) of a particular embodiment of the present invention;
FIG. 13 is a table of data divisions for an imbalance condition in accordance with a specific embodiment of the present invention;
FIG. 14 illustrates the accuracy of testing under various scenarios of unbalanced data in accordance with an exemplary embodiment of the present invention;
FIG. 15 is a block diagram of a real-time intelligent diagnosis system for bearing faults based on the Attention CNN model according to an embodiment of the present invention;
FIG. 16 is a block diagram of a computer system suitable for use in implementing the electronic device of an embodiment of the present application.
Detailed Description
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 shows an exemplary system architecture 100 to which an Attention CNN model-based bearing fault real-time intelligent diagnosis method according to an embodiment of the present application may be applied.
As shown in fig. 1, the system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. Various applications, such as a data processing application, a data visualization application, a web browser application, etc., may be installed on the terminal devices 101, 102, 103.
The terminal apparatuses 101, 102, and 103 may be hardware or software. When the terminal devices 101, 102, 103 are hardware, they may be various electronic devices including, but not limited to, smart phones, tablet computers, laptop portable computers, desktop computers, and the like. When the terminal apparatuses 101, 102, 103 are software, they can be installed in the electronic apparatuses listed above. It may be implemented as multiple pieces of software or software modules (e.g., software or software modules used to provide distributed services) or as a single piece of software or software module. And is not particularly limited herein.
The server 105 may be a server providing various services, such as a background information processing server providing support for data samples presented on the terminal devices 101, 102, 103. The backend information processing server may process the acquired unbalanced data set and generate a processing result (e.g., the Attention CNN training model).
It should be noted that the method provided in the embodiment of the present application may be executed by the server 105, or may be executed by the terminal devices 101, 102, and 103, and the corresponding apparatus is generally disposed in the server 105, or may be disposed in the terminal devices 101, 102, and 103.
The server may be hardware or software. When the server is hardware, it may be implemented as a distributed server cluster formed by multiple servers, or may be implemented as a single server. When the server is software, it may be implemented as multiple pieces of software or software modules (e.g., software or software modules used to provide distributed services), or as a single piece of software or software module. And is not particularly limited herein.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
Fig. 2 shows a flowchart of a bearing fault real-time intelligent diagnosis method based on an Attention CNN model according to an embodiment of the present invention. As shown in fig. 2, the method comprises the steps of:
s1: collecting a vibration signal of a fault bearing by using a vibration sensor, and then segmenting the vibration signal of the fault bearing by adopting a fixed-length random segmentation method to obtain a data sample;
s2: after labels corresponding to all types are attached to the data sample according to the state type of the rolling bearing, the data sample is divided into a training set, a verification set and a test set according to a certain proportion;
s3: respectively manufacturing a plurality of bearing fault data sets in an unbalanced state according to the training set and the verification set, and forming an unbalanced data set by all the manufactured bearing fault data sets;
s4: constructing an Attention CNN model, respectively training the Attention CNN model by using different bearing fault data sets, and simultaneously carrying out model test by using the test set to finally obtain an Attention CNN training model;
s5: and carrying out real-time fault detection on the rolling bearing by utilizing the Attention CNN training model.
In a specific embodiment, the state types of the rolling bearing are divided into a normal state and a single-point defect of which the diameters are different from 0.007 inches, 0.014 inches and 0.021 inches is respectively manufactured on the outer ring, the inner ring and the rolling body of the bearing by using an electric discharge machining technology.
In a specific embodiment, the creating a plurality of bearing fault data sets in an unbalanced state according to the data in the training set and the verification set respectively specifically includes:
performing the following operations on the training set and the validation set, respectively:
under the condition that the number of normal data samples is fixed, the ratio of the normal data to the fault data is as follows: 10: 10; 10: 9; 10: 8; 10: 7; 10: 6; 10: 5; 10: 4; 10: 3; 10: 2; 10:1 to simulate different imbalance conditions;
and obtaining a plurality of corresponding bearing fault data sets in an unbalanced state for the training set and the verification set respectively.
In a specific embodiment, the segmenting the vibration signal of the faulty bearing by using a fixed-length random segmentation method to obtain a data sample specifically includes:
when the total Length of the sampling data of the data sample of the faulty bearing vibration signal is Length, and the sampling window is W, and a random number index is taken in the (0, Length-W) section, the Length of the data sample is index + W. This is done to enhance the randomness of the data, thereby increasing the robustness of the model.
In a specific embodiment, the specific steps of constructing the Attention CNN model include:
t1: constructing a first layer of convolution, wherein the first convolution layer is a one-dimensional convolution neural network, the size of convolution kernels is set to be 64, the number of the convolution kernels is set to be 32, the step length is 2, and Same filling is adopted;
t2: adding Batch Normalization (BN) after the T1 to improve the calculation speed of the model;
t3: adding a Relu activation function after the T2, and accelerating convergence and relieving gradient disappearance by using the Relu activation function;
t4: introducing a maximum pooling layer after the T3, and performing down-sampling by using the maximum pooling layer to reduce the parameter number of the model;
t5: dropout is introduced after the T4, and is utilized to make a part of neurons inactive, so that the number of model neurons is reduced, and the model complexity is reduced;
t6: introducing a second layer of convolutional neural network after the T5, wherein the second layer of convolutional neural network is a one-dimensional convolutional neural network, setting the size of convolutional kernels and the number of convolutional kernels, and adopting Same filling, wherein the size of the convolutional kernels is set to be 128, the number of convolutional kernels is set to be 64, and the step size is 8;
t7: adding Batch Normalization (BN) after the T6 to improve the calculation speed of the model;
t8 addition of Relu activation function after the T7 accelerates convergence and relieves gradient disappearance;
t9 introducing a Flatten layer after the T8, wherein the Flatten layer tiles high-dimensional features into a one-dimensional space;
t10, adding an Attention mechanism after the T9 to focus Attention on the part with fault information, thereby improving the classification accuracy and the detection efficiency;
t11: mapping the characteristics output by the full-connection layer mechanism by using the full-connection layer, wherein the number of the neurons of the full-connection layer is set to be 100;
t12: and the number of the neurons of the classification layer is equal to the total number of bearing fault classes, and a Softmax activation function is adopted.
In a specific embodiment, the step of constructing the Attention CNN model includes:
the method comprises the steps that characteristic information in Source is regarded as being composed of a plurality of Key and Value data pairs, the correlation between Query and each Key is calculated to obtain the weight coefficient of Value corresponding to each Key, and then the Value is subjected to weighted summation according to the weight coefficient, so that the final Attention Value is obtained;
the method specifically comprises the following steps:
a1 computing Query sum by dot product
Figure 100627DEST_PATH_IMAGE024
The dot product model can better utilize the matrix product in the aspect of realization, so that the calculation efficiency is higher, and the requirements of real-time fault detection are met;
Figure 640193DEST_PATH_IMAGE026
a2: and numerically converting the score of the first stage by introducing a calculation mode of Softmax, wherein the numerical conversion comprises the following steps: normalizing and sorting the originally calculated values into probability distribution with the sum of element weights being 1; the weight of the important features is more prominent through the intrinsic mechanism of Softmax; obtaining the weight coefficient after the numerical value conversion
Figure 896862DEST_PATH_IMAGE028
Figure 956216DEST_PATH_IMAGE030
A3: wherein
Figure 905717DEST_PATH_IMAGE032
To represent
Figure 667000DEST_PATH_IMAGE034
And weighting and summing the corresponding weight coefficients to obtain an Attention value, wherein the Attention value is as follows:
Figure 992939DEST_PATH_IMAGE036
in a specific embodiment, the training the Attention CNN model with the different bearing fault data sets respectively specifically includes:
to prevent overfitting of the model during training, L2 regularization was used at the convolutional layer, with the L2 regularization factor set to 0.0004;
the exponential decay learning rate is adopted in the process of training the Attention CNN model, the efficiency of model training can be greatly improved, a better solution is quickly obtained by using a larger learning rate, then the learning rate is gradually reduced along with the continuation of iteration, so that the model is more stable in the later training stage, and the final learning rate is obtained:
Figure 405335DEST_PATH_IMAGE038
wherein
Figure 260158DEST_PATH_IMAGE040
In order to be the initial learning rate,
Figure 774316DEST_PATH_IMAGE042
in order to achieve the final learning rate,
Figure 389099DEST_PATH_IMAGE044
for the decay rate, decay _ step is the decay rate, global _ step is the current iteration number. The initial learning rate of index attenuation in the Attention CNN model is set to be 0.05, and the attenuation rate is set to be 0.05
Figure 406734DEST_PATH_IMAGE046
Set to 0.96;
using a callback function (callback) in training the Attention CNN model; the callback function can interrupt the training when overfitting is just started, thereby avoiding using fewer rounds to train the model from the beginning, and simultaneously directly saving the optimal model appearing in the training process. And setting the parameter of the Attention CNN model callback function to be 5, namely terminating the training if the verification precision of the model in 5 rounds is not improved, and saving the optimal model.
And using an Adma optimizer and a cross entropy loss function in the training process of the Attention CNN model, and updating the network weight by using a gradient descent algorithm until a callback function is triggered to terminate the training to obtain the Attention CNN training model.
In a specific embodiment, the performing real-time fault detection on the rolling bearing by using the Attention CNN training model specifically includes:
deploying the Attention CNN training model offline and testing the fault diagnosis accuracy rate in various unbalanced states by using the bearing fault data set;
and respectively recording the bearing fault data set used for testing and the time used for testing the corresponding fault data for the Attention CNN training model which is deployed off line.
To illustrate and verify the method of the present invention, the following test is performed by using test data of normal bearings and faulty bearings in a laboratory as a sample data set, fig. 3 is a flowchart of a real-time intelligent fault diagnosis method for bearings based on an Attention CNN model under a data imbalance condition according to an embodiment of the present invention, and fig. 13 is a data division table of an imbalance condition according to an embodiment of the present invention, and the training and the testing are performed by using bearing data. The test bench for testing comprises a motor, a torque sensor, a power meter and a controller, wherein the sample data set acquires data of a normal bearing, a single-point driving end and a fan end under the motor load of 0hp, 1hp, 2hp and 3 hp. The experimental data was completed with an accelerometer motor driven mechanical system with 12000 samples per second. For the failed bearing, single-point defects with the diameters of 0.007 inches, 0.014 inches and 0.021 inches are manufactured on the outer ring, the inner ring and the rolling body of the bearing by using an electric spark machining technology.
In the following experiment, ten different state signal data under 0ph are selected as experimental data:
fig. 4 is an explanatory diagram of fixed-length sampling of data according to a specific embodiment of the present invention, as shown in fig. 4, a fixed-length random sample is extracted from a signal to be tested, for example, sample data is randomly extracted from the signal to be tested with a data length of 1024, and the sample data is divided into a training sample and a test set, and the training sample may be further divided into a training set and a verification set;
FIG. 5 is a schematic diagram of vibration signals of a normal bearing according to an embodiment of the present invention, and as shown in FIG. 5, the vibration signals of the normal bearing are distributed regularly;
FIG. 6 is a graph showing rolling element fault bearing vibration signals at 0.007 (601), 0.014 (602) and 0.021 (603) inch damage diameters for a specific embodiment of the present invention, as shown in FIG. 6, showing different transitions for different sized damage defects;
FIG. 7 is a graph showing the vibration signals of a failed inner race bearing at 0.007 (701), 0.014 (702) and 0.021 (703) inch damage diameters for a specific embodiment of the present invention, as shown in FIG. 7, showing different transitions in the vibration signals of a failed inner race bearing at different sized damage defects;
FIG. 8 is a graph showing the outer ring fault bearing vibration signals at 0.007 (801), 0.014 (802) and 0.021 (803) inch damage diameters for one embodiment of the present invention, as shown in FIG. 8, showing different transitions for the outer ring fault bearing vibration signals at different sized damage defects.
The diagrams of the vibration signals in the ten different states are shown in fig. 5, 601-603 in fig. 6, 701-702 in fig. 7, and 801-803 in fig. 8, and a subsequent required data set is obtained based on performing fixed-length data sampling on the vibration signals in the different states as measured signals.
The frequency and the rotating speed of data collection are obtained from the description of the bearing data set of the embodiment, and the quantity of data collected by each circle of the bearing is deduced, wherein the formula is that the sampling count/circle number is 60 sampling frequency/rotating speed is 12000 and 60/1797 is 400. Thus to ensure that each sample has a faulty signal, the length of each sample is set to 1024.
The experimental environment of the invention is a software environment: windows 1064, Keras 2.3.1, TensorFlow 1.15.0, Python 3.6.13; the hardware environment is CPU AMD Ryzen 74800H, RAM 32.0.0 GB, GPU RTX2060.
The accuracy of the test under different unbalanced data scenes is shown in fig. 14.
Although it has been successfully applied to fault diagnosis, it is a black box model, and fault diagnosis based on what the neural network is a problem to be solved. The effectiveness of the invention is illustrated by using a characteristic visualization t-SNE (t-distributed stored probabilistic Neighbor Embedding) algorithm, wherein the t-SNE characteristic visualization of the original data distribution situation is shown in FIG. 9, and the t-SNE characteristic visualization of the data distribution situation after model processing is shown in FIG. 10.
FIG. 11 is a graph illustrating accuracy curves and loss function curves of a training set and a validation set of a bearing data set Attention CNN model according to an embodiment of the present invention; FIG. 12 is a confusion matrix of a balanced data set (1201), a normal data set to failure data set (1202) and a normal data set to failure data set 10:1 data set (1203) of a particular embodiment of the present invention.
According to the experiment in the embodiment, the time for testing 10 pieces of fault data is 20.27 milliseconds, and the time for testing 100 pieces of data is 45 milliseconds under the current experimental environment. According to the data sampling frequency, 12000 sampling points are acquired every second, but the invention only needs 20.27 milliseconds to test 10240 sampling points.
From the accuracy and detection speed of the test set in various unbalanced data set states, the diagnosis accuracy of the method under balanced data set conditions and slightly unbalanced data sets is close to 100%, and the diagnosis accuracy under extreme unbalanced conditions is over 97%.
The invention has the beneficial effects that:
the invention designs a new model with low structural complexity aiming at the bearing real-time intelligent fault diagnosis scene in the data unbalance state, and obtains excellent performance in the aspect of real-time diagnosis under the condition of not using any data enhancement means.
Aiming at real-time fault diagnosis, the invention obtains excellent diagnosis performance by using the characteristics of attention mechanism automatic learning and calculation of the contribution of input characteristics to output data under the condition of not overlapping convolution and pooling layers, greatly shortens the model detection time and meets the real-time fault detection.
The Attention CNN model provided by the invention can be used for real-time fault detection and can effectively solve the problem of data imbalance.
The invention focuses more on real-time fault diagnosis of a real scene, directly adopts original signals for input, does not carry out any data signal preprocessing, and adopts fixed-length random sampling to eliminate sampling contingency. The original signal input is directly adopted, so that the dependence on a manual signal processing method is avoided.
Fig. 15 shows a frame diagram of a real-time intelligent diagnosis system for bearing faults based on the Attention CNN model according to an embodiment of the present invention. The system comprises a data sample acquisition module 1501, a data sample partitioning module 1502, an unbalanced data set construction module 1503, an Attention CNN model training module 1504 and a real-time fault detection module 1505.
In a specific embodiment, the data sample acquisition module 1501 is configured to acquire a vibration signal of a faulty bearing by using a vibration sensor, and then segment the vibration signal of the faulty bearing by using a fixed-length random segmentation method to obtain a data sample;
the data sample dividing module 1502 is configured to, after attaching labels corresponding to each type to the data samples according to the state types of the rolling bearings, divide the data samples into a training set, a verification set and a test set according to a certain proportion;
the unbalanced data set construction module 1503 is configured to make a plurality of bearing fault data sets in an unbalanced state according to the data in the training set and the verification set, and configure all the made bearing fault data sets into an unbalanced data set;
the Attention CNN model training module 1504 is configured to construct an Attention CNN model, respectively train the Attention CNN model with different bearing fault data sets, and simultaneously perform model testing with the test set to finally obtain the Attention CNN training model;
the real-time fault detection module 1505 is configured for real-time fault detection of the rolling bearing using the Attention CNN training model.
The system firstly adopts fixed-length random segmentation to enhance the randomness of data, thereby enhancing the robustness of the model; secondly, when the model is constructed, the overfitting of the model is relieved by batch normalization, Dropout and L2 regularization, so that the detection efficiency of the model is improved; then, the importance degree of the features to the final output is calculated through an attention mechanism, so that the method still has excellent fault diagnosis capability under the condition of no multilayer neural network, and has excellent stability under the condition of unbalanced data; in the model training stage, an exponential decay learning rate and a callback function are adopted to prevent the model from being over-fitted and store the optimal model in the training process; finally, the stored model is applied to real-time fault detection of the rotary machine; the Attention CNN model network structure comprises: a convolutional layer, a pooling layer, a Batch Normalization (BN) layer, an Attention module, a full link layer, and a classification layer. The convolution layer carries out feature extraction on the original vibration signal; the pooling layer performs down-sampling on the characteristics of the convolution layer to reduce model parameters; the BN layer accelerates the fitting speed of the network, and the calculation efficiency is accelerated; the Attention module calculates the importance degree of the features to the final output; the full connection layer maps the features; the classification layer classifies features. The invention can accurately and automatically identify the running state of the bearing in real time, thereby effectively maintaining the normal running of mechanical equipment.
Referring now to FIG. 16, shown is a block diagram of a computer system 1600 suitable for use in implementing an electronic device of an embodiment of the present application. The electronic device shown in fig. 16 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 16, the computer system 1600 includes a Central Processing Unit (CPU) 1601 which can perform various appropriate actions and processes in accordance with a program stored in a Read Only Memory (ROM) 1602 or a program loaded from a storage portion 1608 into a Random Access Memory (RAM) 1603. In the RAM 1603, various programs and data necessary for the operation of the system 1600 are also stored. The CPU 1601, ROM 1602, and RAM 1603 are connected to each other via a bus 1604. An input/output (I/O) interface 1605 is also connected to the bus 1604.
The following components are connected to the I/O interface 1605: an input portion 1606 including a keyboard, a mouse, and the like; an output portion 1607 including a display such as a Liquid Crystal Display (LCD) and a speaker; a storage portion 1608 including a hard disk and the like; and a communication section 1609 including a network interface card such as a LAN card, a modem, or the like. The communication section 1609 performs communication processing via a network such as the internet. The driver 1610 is also connected to the I/O interface 1605 as needed. A removable medium 1611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 1610 as necessary, so that a computer program read out therefrom is mounted in the storage portion 1608 as necessary.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable storage medium, the computer program containing program code for performing the method illustrated by the flow chart. In such embodiments, the computer program may be downloaded and installed from a network via the communication portion 1609, and/or installed from the removable media 1611. The above-described functions defined in the method of the present application are executed when the computer program is executed by the Central Processing Unit (CPU) 1601. It should be noted that the computer readable storage medium described herein can be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable storage medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable storage medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The modules described in the embodiments of the present application may be implemented by software or hardware. The units described may also be provided in a processor, and the names of the units do not in some cases constitute a limitation of the unit itself.
Embodiments of the present invention also relate to a computer-readable storage medium having stored thereon a computer program which, when executed by a computer processor, implements the method above. The computer program comprises program code for performing the method illustrated in the flow chart. It should be noted that the computer readable medium of the present application can be a computer readable signal medium or a computer readable medium or any combination of the two.
Firstly, fixed-length random segmentation is adopted to enhance the randomness of data, so that the robustness of a model is enhanced; secondly, when the model is constructed, the overfitting of the model is relieved by batch normalization, Dropout and L2 regularization, so that the detection efficiency of the model is improved; then, the importance degree of the features to the final output is calculated through an attention mechanism, so that the method still has excellent fault diagnosis capability under the condition of no multilayer neural network, and has excellent stability under the condition of unbalanced data; in the model training stage, an exponential decay learning rate and a callback function are adopted to prevent the model from being over-fitted and store the optimal model in the training process; finally, the stored model is applied to real-time fault detection of the rotary machine; the Attention CNN model network structure comprises: a convolutional layer, a pooling layer, a Batch Normalization (BN) layer, an Attention module, a full link layer, and a classification layer. The convolution layer carries out feature extraction on the original vibration signal; the pooling layer performs down-sampling on the characteristics of the convolution layer to reduce model parameters; the BN layer accelerates the fitting speed of the network, and the calculation efficiency is accelerated; the Attention module calculates the importance degree of the features to the final output; the full connection layer maps the features; the classification layer classifies features. The invention can accurately and automatically identify the running state of the bearing in real time, thereby effectively maintaining the normal running of mechanical equipment.
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention herein disclosed is not limited to the particular combination of features described above, but also encompasses other arrangements formed by any combination of the above features or their equivalents without departing from the spirit of the invention. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.

Claims (8)

1. An Attention CNN model-based bearing fault real-time intelligent diagnosis method is characterized by comprising the following steps:
s1: collecting a vibration signal of a fault bearing by using a vibration sensor, and then segmenting the vibration signal of the fault bearing by adopting a fixed-length random segmentation method to obtain a data sample;
the method for segmenting the fault bearing vibration signal by adopting a fixed-length random segmentation method to obtain a data sample specifically comprises the following steps:
when the total Length of sampling data of a data sample of the fault bearing vibration signal is Length and a sampling window is W, and a random number index is taken in a (0, Length-W) section, the Length of the data sample is index + W;
s2: after labels corresponding to all types are attached to the data sample according to the state type of the rolling bearing, the data sample is divided into a training set, a verification set and a test set according to a certain proportion;
s3: respectively manufacturing a plurality of bearing fault data sets in an unbalanced state according to the training set and the verification set, and forming an unbalanced data set by all the manufactured bearing fault data sets;
s4: constructing an Attention CNN model, respectively training the Attention CNN model by using different bearing fault data sets, and simultaneously carrying out model test by using the test set to finally obtain an Attention CNN training model;
the specific steps for constructing the Attention CNN model comprise:
t1: constructing a first layer of convolution, wherein the first convolution layer is a one-dimensional convolution neural network, the size of convolution kernels is set to be 64, the number of the convolution kernels is set to be 32, the step length is 2, and Same filling is adopted;
t2: (ii) adding Batch Normalization after said T1;
t3: adding a Relu activation function after the T2, and accelerating convergence and relieving gradient disappearance by using the Relu activation function;
t4: introducing a maximum pooling layer after the T3, and performing down-sampling by using the maximum pooling layer;
t5: introducing Dropout after the T4, with which a portion of the neurons are inactivated;
t6: introducing a second layer of convolutional neural network after the T5, wherein the second convolutional layer is a one-dimensional convolutional neural network, setting the size of convolutional kernels and the number of convolutional kernels, and adopting Same filling;
t7: (ii) adding Batch Normalization after said T6;
t8 adding a Relu activation function after the T7;
t9 introducing a Flatten layer after the T8, wherein the Flatten layer tiles high-dimensional features into a one-dimensional space;
t10 adding Attention to the Attention mechanism after the T9 to focus on the part with the fault information;
t11: mapping the characteristics output by the full connection layer mechanism by using the full connection layer;
t12: the number of the neurons of the classification layer is equal to the total number of bearing fault classes, and a Softmax activation function is adopted;
s5: and carrying out real-time fault detection on the rolling bearing by utilizing the Attention CNN training model.
2. The method according to claim 1, wherein the condition types of the rolling bearing are classified into a normal condition and a single point defect having a different diameter is respectively made on the outer ring, the inner ring and the rolling element of the bearing using an electric discharge machining technique.
3. The method according to claim 1, wherein the creating a plurality of bearing fault data sets under unbalanced conditions from the data in the training set and the verification set respectively comprises:
performing the following operations on the training set and the validation set, respectively:
under the condition that the number of normal data samples is fixed, simulating the conditions under different unbalanced states according to the ratio of normal data to fault data in various different proportions;
and obtaining a plurality of corresponding bearing fault data sets in an unbalanced state for the training set and the verification set respectively.
4. The method of claim 1, wherein the Attention calculating step in constructing the Attention CNN model comprises:
the method comprises the steps that characteristic information in Source is regarded as being composed of a plurality of Key and Value data pairs, the correlation between Query and each Key is calculated to obtain the weight coefficient of Value corresponding to each Key, and then the Value is subjected to weighted summation according to the weight coefficient, so that the final Attention Value is obtained;
the method specifically comprises the following steps:
a1 computing Query sum by dot product
Figure 662545DEST_PATH_IMAGE002
The dot product model can better utilize the matrix product in the aspect of realization, so that the calculation efficiency is higher, and the requirements of real-time fault detection are met;
Figure 40437DEST_PATH_IMAGE004
a2: and numerically converting the score of the first stage by introducing a calculation mode of Softmax, wherein the numerical conversion comprises the following steps: normalizing and sorting the originally calculated values into probability distribution with the sum of element weights being 1; the weight of the important features is more prominent through the intrinsic mechanism of Softmax; obtaining the weight coefficient after the numerical value conversion
Figure 77401DEST_PATH_IMAGE006
Figure 540743DEST_PATH_IMAGE008
A3: wherein
Figure 822820DEST_PATH_IMAGE010
To represent
Figure 789639DEST_PATH_IMAGE012
The corresponding weight coefficient is utilized to carry out weighted summation to obtain an Attention value, wherein the Attention value is
Figure DEST_PATH_IMAGE014
5. The method of claim 1, wherein the training of the Attention CNN model with different sets of bearing fault data, respectively, comprises:
regularization using L2 at the convolutional layer during training;
in the training process of the Attention CNN model, exponential decay learning rate is adopted, a better solution is quickly obtained by using a larger learning rate, then the learning rate is gradually reduced along with the continuation of iteration, and the final learning rate is obtained:
wherein, the initial learning rate, the final learning rate and the attenuation rate are included, the decay _ step is the attenuation speed, and the global _ step is the current iteration number;
using a callback function (callback) in training the Attention CNN model;
and using an Adma optimizer and a cross entropy loss function in the training process of the Attention CNN model, and updating the network weight by using a gradient descent algorithm until a callback function is triggered to terminate the training to obtain the Attention CNN training model.
6. The method according to claim 1, wherein the real-time fault detection of the rolling bearing using the Attention CNN training model specifically comprises:
deploying the Attention CNN training model offline and testing the fault diagnosis accuracy rate in various unbalanced states by using the bearing fault data set;
and respectively recording the bearing fault data set used for testing and the time used for testing the corresponding fault data for the Attention CNN training model which is deployed off line.
7. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a computer processor, carries out the method of any one of claims 1 to 6.
8. The utility model provides a bearing trouble real-time intelligent diagnosis system based on Attention CNN model which characterized in that includes:
a data sample acquisition module: the method comprises the steps that a vibration sensor is used for collecting a vibration signal of a fault bearing, and then the vibration signal of the fault bearing is divided by adopting a fixed-length random dividing method to obtain a data sample;
the method for segmenting the fault bearing vibration signal by adopting a fixed-length random segmentation method to obtain a data sample specifically comprises the following steps:
when the total Length of sampling data of a data sample of the fault bearing vibration signal is Length and a sampling window is W, and a random number index is taken in a (0, Length-W) section, the Length of the data sample is index + W;
a data sample division module: after the data sample is configured and used for attaching labels corresponding to all types to the data sample according to the state type of the rolling bearing, the data sample is divided into a training set, a verification set and a test set according to a certain proportion;
an unbalanced data set construction module: the system is configured and used for respectively manufacturing a plurality of bearing fault data sets in an unbalanced state according to the training set and the verification set and forming an unbalanced data set by all the manufactured bearing fault data sets;
attention CNN model training module: configuring and constructing an Attention CNN model, respectively training the Attention CNN model by using different bearing fault data sets, and simultaneously carrying out model test by using the test set to finally obtain an Attention CNN training model;
the specific steps for constructing the Attention CNN model comprise:
t1: constructing a first layer of convolution, wherein the first convolution layer is a one-dimensional convolution neural network, the size of convolution kernels is set to be 64, the number of the convolution kernels is set to be 32, the step length is 2, and Same filling is adopted;
t2: (ii) adding Batch Normalization after said T1;
t3: adding a Relu activation function after the T2, and accelerating convergence and relieving gradient disappearance by using the Relu activation function;
t4: introducing a maximum pooling layer after the T3, and performing down-sampling by using the maximum pooling layer;
t5: introducing Dropout after the T4, with which a portion of the neurons are inactivated;
t6: introducing a second layer of convolutional neural network after the T5, wherein the second convolutional layer is a one-dimensional convolutional neural network, setting the size of convolutional kernels and the number of convolutional kernels, and adopting Same filling;
t7: (ii) adding Batch Normalization after said T6;
t8 adding a Relu activation function after the T7;
t9 introducing a Flatten layer after the T8, wherein the Flatten layer tiles high-dimensional features into a one-dimensional space;
t10 adding Attention to the Attention mechanism after the T9 to focus on the part with the fault information;
t11: mapping the characteristics output by the full connection layer mechanism by using the full connection layer;
t12: the number of the neurons of the classification layer is equal to the total number of bearing fault classes, and a Softmax activation function is adopted;
a real-time fault detection module: and the real-time fault detection device is configured for utilizing the Attention CNN training model to detect the fault of the rolling bearing in real time.
CN202210024754.5A 2022-01-11 2022-01-11 Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model Active CN114048787B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210024754.5A CN114048787B (en) 2022-01-11 2022-01-11 Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210024754.5A CN114048787B (en) 2022-01-11 2022-01-11 Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model

Publications (2)

Publication Number Publication Date
CN114048787A CN114048787A (en) 2022-02-15
CN114048787B true CN114048787B (en) 2022-04-22

Family

ID=80196205

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210024754.5A Active CN114048787B (en) 2022-01-11 2022-01-11 Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model

Country Status (1)

Country Link
CN (1) CN114048787B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114662680B (en) * 2022-05-24 2022-08-30 杭州谨煜科技有限公司 Bearing fault diagnosis method and system based on Gaussian Noise CNN model under variable working conditions
CN116128882B (en) * 2023-04-19 2023-06-23 中汽数据(天津)有限公司 Motor bearing fault diagnosis method, equipment and medium based on unbalanced data set

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902399A (en) * 2019-03-01 2019-06-18 哈尔滨理工大学 Rolling bearing fault recognition methods under a kind of variable working condition based on ATT-CNN

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11017177B2 (en) * 2019-06-27 2021-05-25 Conduent Business Services, Llc Neural network systems and methods for target identification from text
CN112577747B (en) * 2020-12-07 2023-08-08 东南大学 Rolling bearing fault diagnosis method based on space pooling network

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902399A (en) * 2019-03-01 2019-06-18 哈尔滨理工大学 Rolling bearing fault recognition methods under a kind of variable working condition based on ATT-CNN

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Attention guided 3D CNN-LSTM model for accurate speech based emotion recognition;Orhan Atila et al.;《Applied Acoustics》;20210707;第182卷;全文 *
基于深度神经网络的轴承故障诊断方法研究;刘望;《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》;20210915;第2021年卷(第9期);第C029-76页 *

Also Published As

Publication number Publication date
CN114048787A (en) 2022-02-15

Similar Documents

Publication Publication Date Title
CN110516305B (en) Intelligent fault diagnosis method under small sample based on attention mechanism meta-learning model
CN114048787B (en) Method and system for intelligently diagnosing bearing fault in real time based on Attention CNN model
CN112254964A (en) Rolling bearing fault diagnosis method based on rapid multi-scale convolution neural network
CN111340238A (en) Fault diagnosis method, device, equipment and storage medium of industrial system
CN111914883A (en) Spindle bearing state evaluation method and device based on deep fusion network
CN112926728B (en) Small sample turn-to-turn short circuit fault diagnosis method for permanent magnet synchronous motor
CN113076834B (en) Rotating machine fault information processing method, processing system, processing terminal, and medium
CN113887136A (en) Improved GAN and ResNet based electric vehicle motor bearing fault diagnosis method
CN113391207A (en) Motor fault detection method, medium and system
CN116502175A (en) Method, device and storage medium for diagnosing fault of graph neural network
CN114037478A (en) Advertisement abnormal flow detection method and system, electronic equipment and readable storage medium
CN114755017B (en) Variable-speed bearing fault diagnosis method of cross-domain data driving unsupervised field shared network
CN116702076A (en) Small sample migration learning fault diagnosis method, system, computer and storage medium based on CNN feature fusion
CN116186641A (en) Cross-equipment rolling bearing small sample fault diagnosis method based on feature fusion
CN112146882B (en) Bearing fault diagnosis method based on transfer learning vibration signal image recognition
CN111766513B (en) Capsule network-based variable-working-condition multi-fault diagnosis method for three-phase induction motor
CN116227586A (en) Meta learning fault diagnosis method and device based on depth residual error shrinkage prototype network
CN116541771A (en) Unbalanced sample bearing fault diagnosis method based on multi-scale feature fusion
CN113092044B (en) Rotary machine fault diagnosis method based on weighted level visible graph
CN115290326A (en) Rolling bearing fault intelligent diagnosis method
CN114462446A (en) Rolling bearing fault diagnosis method based on vibration signal and electronic equipment
CN114662680B (en) Bearing fault diagnosis method and system based on Gaussian Noise CNN model under variable working conditions
CN114970044B (en) Rolling bearing fault diagnosis method and system based on threshold convolutional neural network
CN113312719B (en) Rotary machine fault diagnosis method based on class unbalance weight cross entropy
CN116010805B (en) Rolling bearing fault feature extraction method and device based on convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220823

Address after: Room 2966, 2nd Floor, Building 1, No. 200, Binwen Road, Changhe Street, Binjiang District, Hangzhou City, Zhejiang Province, 310051 (self-declaration)

Patentee after: Hangzhou Runzhen Information Technology Co.,Ltd.

Address before: 311400 room 1003, 10th floor, building 14, China Zhigu Fuchun Park, Yinhu street, Fuyang District, Hangzhou City, Zhejiang Province

Patentee before: Hangzhou yunzhisheng Intelligent Technology Co.,Ltd.

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20221123

Address after: Room 5100, Building 1, No. 10, Fangjing Road, Suzhou Industrial Park, Jiangsu Province, 215000

Patentee after: Suzhou Guangxi Intelligent Technology Co.,Ltd.

Address before: Room 2966, 2nd Floor, Building 1, No. 200, Binwen Road, Changhe Street, Binjiang District, Hangzhou City, Zhejiang Province, 310051 (self-declaration)

Patentee before: Hangzhou Runzhen Information Technology Co.,Ltd.

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20231229

Address after: Room 404, Room A241, No. 50 Chengyi North Street, Software Park Phase III, Torch High tech Zone, Xiamen City, Fujian Province, 361022

Patentee after: Xiamen Qianruisheng Intelligent Technology Co.,Ltd.

Address before: Room 5100, Building 1, No. 10, Fangjing Road, Suzhou Industrial Park, Jiangsu Province, 215000

Patentee before: Suzhou Guangxi Intelligent Technology Co.,Ltd.

TR01 Transfer of patent right