CN115204280A - Rolling bearing fault diagnosis method based on graph Markov attention network - Google Patents
Rolling bearing fault diagnosis method based on graph Markov attention network Download PDFInfo
- Publication number
- CN115204280A CN115204280A CN202210760515.6A CN202210760515A CN115204280A CN 115204280 A CN115204280 A CN 115204280A CN 202210760515 A CN202210760515 A CN 202210760515A CN 115204280 A CN115204280 A CN 115204280A
- Authority
- CN
- China
- Prior art keywords
- target
- network
- gat
- trainer
- label
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000005096 rolling process Methods 0.000 title claims abstract description 17
- 238000000034 method Methods 0.000 title claims abstract description 13
- 238000003745 diagnosis Methods 0.000 title claims abstract description 12
- 238000009826 distribution Methods 0.000 claims abstract description 15
- 238000012549 training Methods 0.000 claims abstract description 12
- 239000011159 matrix material Substances 0.000 claims abstract description 4
- 239000000126 substance Substances 0.000 claims description 5
- 102100039956 Geminin Human genes 0.000 claims description 3
- 101000886596 Homo sapiens Geminin Proteins 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 claims description 3
- 230000003313 weakening effect Effects 0.000 claims description 3
- 230000006870 function Effects 0.000 description 5
- 238000010586 diagram Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 102100040954 Ephrin-A1 Human genes 0.000 description 1
- 101000965523 Homo sapiens Ephrin-A1 Proteins 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000003631 expected effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/901—Indexing; Data structures therefor; Storage structures
- G06F16/9024—Graphs; Linked lists
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Software Systems (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)
Abstract
The invention discloses a rolling bearing fault diagnosis method based on a graph Markov attention network, which is characterized by comprising the following steps of: constructing input information inputs _ q, labels target _ q and an adjacency relation matrix adj, training a two-layer GAT network trainer _ q to obtain preds, coding the preds into new coding input, training a network, updating, then obtaining new preds and target _ q, and replacing the part owned by idx _ train with a real label by the target _ q to obtain new target _ q; the original inputs _ q and the new target _ q are used as features and labels, and then the trainer _ q is trained, and the trainer _ q is updated. The method ensures that the probability distribution learned by GAT theta is as consistent as possible with the probability distribution learned by GAT phi, the two models are mutually constrained, the two models are alternately updated by adopting an EM (effective vector regression) algorithm, the problem that the traditional GAT network cannot model the label correlation is solved, and the problem of model failure caused by insufficient label nodes is avoided.
Description
Technical Field
The invention belongs to the technical field of bearing fault diagnosis, and particularly relates to a rolling bearing fault diagnosis method based on a graph Markov attention network.
Background
GAT (GRAPH ATTENTION NETWORKS) is a neural network which uses self attribute mechanism, calculates the ATTENTION of a certain node in the GRAPH relative to each adjacent node in a mode similar to self attribute in a transducer, takes the characteristics of the node and the ATTENTION characteristic as the characteristics of the node, and carries out tasks such as classification of the node and the like on the basis of the characteristics.
GAT uses a similar process to calculate self attentions of nodes, first calculate the attention score of the current node and each neighboring node, then multiply the features of each node by this score, accumulate them and go through a non-linear mapping as the features of the current node.
Graph Attention Network (GAT) models can also model the correlation of labels between Network nodes (objects). The essence of the GAT network is information (feature) propagation between nodes, and the correlation of labels between nodes can be modeled only by inputting the node labels into the GAT network (named as GAT phi) as features, so that the function of statistical relationship learning is realized. However, this implementation has problems in that: in the semi-supervised learning scenario, labeled nodes tend to be rare, and thus a peripheral neighbor of a labeled node may not have a label. The expected effect of modeling labels between nodes is therefore difficult to achieve using the GAT-based approach described above.
In order to solve the problem of label scarcity, another GAT network model (named GAT theta) is adopted to predict node labels, and the aim is to use the node labels predicted by GAT phi as input characteristics of the GAT theta model. It is worth noting that GAT φ only takes the attribute features of the nodes as input, and there is no need to pay attention to the correlation of labels between nodes. The GAT φ model described above is constrained by the GAT θ model because the GAT φ model relies on GAT θ to provide node tag information as input. However, GAT θ is not constrained, which tends to cause the results of model training to not converge. To solve this problem, the probability distribution learned by GAT θ can be made as consistent as possible with the probability distribution learned by GAT Φ, so that the two models are constrained to each other and the two models can be alternately updated by using the EM algorithm.
Therefore, in order to solve the above problems, a rolling bearing fault diagnosis method based on a graph markov attention network is proposed herein.
Disclosure of Invention
In order to solve the technical problems, the invention designs a rolling bearing fault diagnosis method based on a graph Markov attention network, which enables the probability distribution learned by GAT theta to be as consistent as possible with the probability distribution learned by GAT phi, and the two models are mutually constrained, so that the two models can be alternately updated by adopting an EM (effective-energy) algorithm, the problem that the traditional GAT network cannot model the correlation of labels is solved, and the problem that the GAT phi network using the labels as characteristics has insufficient label nodes to cause model failure is also avoided.
In order to achieve the technical effects, the invention is realized by the following technical scheme: a rolling bearing fault diagnosis method based on a graph Markov attention network is characterized by comprising the following steps:
step1: constructing input information inputs _ q;
step2: training a two-layer GAT network trainer _ q by inputting characteristics inputs _ q, a label target _ q and an adjacency relation matrix adj, wherein the label takes a part owned by idx _ train;
step3: the inputs _ q obtains the press through the trainer _ q.presct, which is a distribution, each dimension is a value, the index of the dimension with larger value is more likely to be selected, and the index is encoded into a new code with 0/1 only, the inputs _ p and the target _ p;
step4: using the obtained new code as a label and a characteristic as the input of a network trainer _ p (two-layer GMNN), training the network, and updating the trainer _ p;
step5: input _ p obtains a new preds and a target _ q through a feeder _ p.predict, and the target _ q replaces the part owned by idx _ train with a real label to obtain a new target _ q;
step6: taking the initial inputs _ q and the new target _ q as features and labels, and then training the trainer _ q, wherein the trainer _ q is updated;
further, the Step1 specifically comprises the following steps:
step1.1: different fault types (including normal state (NC), rolling Element Fault (REF), outer Ring Fault (ORF) and Inner Ring Fault (IRF)), rolling bearing vibration signals of different fault sizes are respectively maximum-minimum normalized: x nol =normalize(X);
Step1.2: constructing a data set in a segmented manner;
step1.3: weakening the influence of noise, carrying out FFT on each sample set, and taking the first half part of the result;
step1.4: each sample is assigned a corresponding label target _ q;
step1.5: determining the number of nodes; and further searching the neighbor of each node in the affinity graph by using epsilon-radius, and calculating by the following formula:
wherein the content of the first and second substances,is neighborhood ofepsilon is the radius selected for the selected radius,return toWherein the content of the first and second substances,is by calculationandCosine similarity;
step1.6: calculating the weight between each node through a threshold Gaussian kernel weight function;
where β is the bandwidth variance of the gaussian function.
The invention has the beneficial effects that:
the invention designs a rolling bearing fault diagnosis method based on a graph Markov attention network, wherein probability distribution learned by GAT theta is as consistent as possible with probability distribution learned by GAT phi, and the two models are mutually constrained, so that the two models can be alternately updated by adopting an EM (effective and effective) algorithm, the problem that the traditional GAT network cannot model the correlation of labels is solved, and the problem of model failure caused by insufficient label nodes of the GAT phi network using the labels as characteristics is avoided.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a block diagram of the framework of the present invention;
FIG. 2 is a diagram of the network code structure of the algorithm of the present invention;
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example 1
Referring to fig. 1 to 2, a rolling bearing fault diagnosis method based on a graph markov attention network is characterized by comprising the following steps:
step1: constructing input information _ q;
step2: training a two-layer GAT network trainer _ q by inputting characteristics inputs _ q, a label target _ q and an adjacency relation matrix adj, wherein the label takes a part owned by idx _ train;
step3: the inputs _ q obtains the press through the trainer _ q.presct, which is a distribution, each dimension is a value, the index of the dimension with larger value is more likely to be selected, and the index is encoded into a new code with 0/1 only, the inputs _ p and the target _ p;
step4: using the obtained new code as a label and a characteristic as the input of a network trainer _ p (two-layer GMNN), training the network, and updating the trainer _ p;
step5: input _ p obtains a new preds and a target _ q through a feeder _ p.predict, and the target _ q replaces the part owned by idx _ train with a real label to obtain a new target _ q;
step6: taking the initial inputs _ q and the new target _ q as features and labels, and then training the trainer _ q, wherein the trainer _ q is updated;
further, the specific steps in Step1 are as follows:
step1.1: different fault types (including normal state (NC), rolling Element Fault (REF), outer Ring Fault (ORF) and Inner Ring Fault (IRF)), rolling bearing vibration signals of different fault sizes are normalized by maximum-minimum respectively: x nol =normalize(X);
Step1.2: constructing a data set in a segmented manner;
step1.3: weakening the influence of noise, carrying out FFT on each sample set, and taking the first half part of the result;
step1.4: each sample is allocated with a corresponding label target _ q;
step1.5: determining the number of nodes; and further searching the neighbor of each node in the affinity graph by using epsilon-radius, and calculating by the following formula:
wherein the content of the first and second substances,is neighborhood ofepsilon is the radius selected for the selected radius,return toWherein the content of the first and second substances,is by calculationandCosine similarity;
step1.6: calculating the weight between each node through a threshold Gaussian kernel weight function;
where β is the bandwidth variance of the gaussian function.
Example 2
The invention designs a rolling bearing fault diagnosis method based on a graph Markov attention network, and by enabling probability distribution learned by GAT theta to be as consistent as possible with probability distribution learned by GAT phi, and mutually restricting the two models, the two models can be alternately updated by adopting an EM algorithm, so that the problem that the traditional GAT network cannot model the relevance of labels is solved, and the problem of model failure caused by insufficient label nodes of the GAT phi network using the labels as characteristics is also avoided.
Example 3
Fig. 1 is a frame overview. Squares are tagged and untagged objects, white grids are attributes, and histograms are tag distributions of objects. The orange triple circle is the object representation. GMAN is trained by alternating between E and M steps.
Figure 1 gives an illustration of the frame. For a given node, q θ predicts the label using the attribute features of the neighbor nodes and itself as inputs, so that a vector representation of the node can be learned. In contrast, p Φ uses not only the attributes of the neighbor nodes but also the labels of the neighbor nodes as input features, and if the neighbor nodes do not have labels, the labels predicted by q θ are used as input features. In step M, q theta is used for predicting the label of the label-free node, and a part of labels are sampled from q theta, and p phi is trained and updated by combining the existing labels in the data set; in step E, p Φ first predicts the labels of the unlabeled nodes using the node attributes and labels, and then trains and updates q θ with the probability distribution of the labels predicted by p Φ as the learning target of q θ.
In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.
Claims (2)
1. A rolling bearing fault diagnosis method based on a graph Markov attention network is characterized by comprising the following steps:
step1: constructing input information inputs _ q;
step2: training a two-layer GAT network trainer _ q by inputting characteristics input _ q, a label target _ q and an adjacency relation matrix adj, wherein the label takes a part owned by idx _ train;
step3: the inputs _ q obtains the press through the trainer _ q.presct, which is a distribution, each dimension is a value, the index of the dimension with larger value is more likely to be selected, and the index is encoded into a new code with 0/1 only, the inputs _ p and the target _ p;
step4: using the obtained new code as a label and characteristics as the input of a network train _ p (two-layer GMNN), training the network, and updating the train _ p;
step5: obtaining a new presd and a target _ q by input _ p through a sender _ p.predict, and replacing the part owned by idx _ train by the target _ q to obtain a new target _ q;
step6: and training the trainer _ q by taking the initial inputs _ q and the new target _ q as features and labels, wherein the trainer _ q is updated.
2. The X of claim 1, wherein: the specific steps in Step1 are as follows:
step1.1: different fault types (including normal state (NC), rolling Element Fault (REF), outer Ring Fault (ORF) and Inner Ring Fault (IRF)), rolling bearing vibration signals of different fault sizes are normalized by maximum-minimum respectively: x nol =normalize(X);
Step1.2: constructing a data set in a segmented manner;
step1.3: weakening the influence of noise, carrying out FFT on each sample set, and taking the first half part of the result;
step1.4: each sample is allocated with a corresponding label target _ q;
step1.5: determining the number of nodes; and further searching the neighbor of each node in the affinity graph by using epsilon-radius, and calculating by the following formula:
wherein, the first and the second end of the pipe are connected with each other,epsilon is the radius selected for the selected radius,return toWherein the content of the first and second substances,is by calculationCosine similarity;
step1.6: calculating the weight between each node through a threshold Gaussian kernel weight function;
where β is the bandwidth variance of the gaussian function.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210760515.6A CN115204280A (en) | 2022-06-29 | 2022-06-29 | Rolling bearing fault diagnosis method based on graph Markov attention network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210760515.6A CN115204280A (en) | 2022-06-29 | 2022-06-29 | Rolling bearing fault diagnosis method based on graph Markov attention network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115204280A true CN115204280A (en) | 2022-10-18 |
Family
ID=83577305
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210760515.6A Pending CN115204280A (en) | 2022-06-29 | 2022-06-29 | Rolling bearing fault diagnosis method based on graph Markov attention network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115204280A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115929495A (en) * | 2022-11-30 | 2023-04-07 | 昆明理工大学 | Engine Valve Fault Diagnosis Method Based on Markov Transition Field and Improved Gaussian Prototype Network |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113343581A (en) * | 2021-06-28 | 2021-09-03 | 山东华科信息技术有限公司 | Transformer fault diagnosis method based on graph Markov neural network |
CN114167180A (en) * | 2021-12-03 | 2022-03-11 | 国网山西省电力公司晋城供电公司 | Oil-filled electrical equipment fault diagnosis method based on graph attention neural network |
-
2022
- 2022-06-29 CN CN202210760515.6A patent/CN115204280A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113343581A (en) * | 2021-06-28 | 2021-09-03 | 山东华科信息技术有限公司 | Transformer fault diagnosis method based on graph Markov neural network |
CN114167180A (en) * | 2021-12-03 | 2022-03-11 | 国网山西省电力公司晋城供电公司 | Oil-filled electrical equipment fault diagnosis method based on graph attention neural network |
Non-Patent Citations (2)
Title |
---|
LI,Z. ZHAO ET AL.: "Multireceptive field graph convolutional networks for machine fault diagnosis", 《IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS》, vol. 68, no. 12, 3 December 2020 (2020-12-03), pages 12739 - 12749, XP011875210, DOI: 10.1109/TIE.2020.3040669 * |
MENG QU ET AL.: "GMNN: Graph Markov Neural Networks", 《ARXIV》, 23 July 2020 (2020-07-23), pages 1 - 13 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115929495A (en) * | 2022-11-30 | 2023-04-07 | 昆明理工大学 | Engine Valve Fault Diagnosis Method Based on Markov Transition Field and Improved Gaussian Prototype Network |
CN115929495B (en) * | 2022-11-30 | 2024-05-14 | 昆明理工大学 | Engine valve fault diagnosis method based on Markov transition field and improved Gaussian prototype network |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2023065545A1 (en) | Risk prediction method and apparatus, and device and storage medium | |
CN111914156B (en) | Cross-modal retrieval method and system for self-adaptive label perception graph convolution network | |
CN112100369A (en) | Semantic-combined network fault association rule generation method and network fault detection method | |
Li et al. | Improve relation extraction with dual attention-guided graph convolutional networks | |
CN112597296B (en) | Abstract generation method based on plan mechanism and knowledge graph guidance | |
CN113361627A (en) | Label perception collaborative training method for graph neural network | |
CN112100372B (en) | Head news prediction classification method | |
Tian et al. | Twitter user location inference based on representation learning and label propagation | |
Xu et al. | A scientific research topic trend prediction model based on multi‐LSTM and graph convolutional network | |
CN115204280A (en) | Rolling bearing fault diagnosis method based on graph Markov attention network | |
CN113591971A (en) | User individual behavior prediction method based on DPI time series word embedded vector | |
CN113920379B (en) | Zero sample image classification method based on knowledge assistance | |
CN110807601A (en) | Park road deterioration analysis method based on truncation data | |
CN116090504A (en) | Training method and device for graphic neural network model, classifying method and computing equipment | |
CN113128667A (en) | Cross-domain self-adaptive graph convolution balance migration learning method and system | |
CN112925907A (en) | Microblog comment viewpoint object classification method based on event graph convolutional neural network | |
CN116932938A (en) | Link prediction method and system based on topological structure and attribute information | |
CN115129875A (en) | Building accident report classification system and method based on graph neural network | |
CN112765489B (en) | Social network link prediction method and system | |
CN115048510A (en) | Criminal name prediction method based on hierarchical legal knowledge and double-graph joint representation learning | |
CN114254738A (en) | Double-layer evolvable dynamic graph convolution neural network model construction method and application | |
Chen et al. | Sampled in Pairs and Driven by Text: A New Graph Embedding Framework | |
Guo et al. | User Identity Linkage Across Social Networks Based on Neural Tensor Network | |
CN112784057B (en) | Three-network industrial map construction method based on regional industrial enterprises | |
CN117435970A (en) | Universal domain self-adaptive node classification method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |