CN115204280A - Rolling bearing fault diagnosis method based on graph Markov attention network - Google Patents

Rolling bearing fault diagnosis method based on graph Markov attention network Download PDF

Info

Publication number
CN115204280A
CN115204280A CN202210760515.6A CN202210760515A CN115204280A CN 115204280 A CN115204280 A CN 115204280A CN 202210760515 A CN202210760515 A CN 202210760515A CN 115204280 A CN115204280 A CN 115204280A
Authority
CN
China
Prior art keywords
target
network
gat
trainer
label
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210760515.6A
Other languages
Chinese (zh)
Inventor
李祥
马军
熊新
王晓东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kunming University of Science and Technology
Original Assignee
Kunming University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kunming University of Science and Technology filed Critical Kunming University of Science and Technology
Priority to CN202210760515.6A priority Critical patent/CN115204280A/en
Publication of CN115204280A publication Critical patent/CN115204280A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Testing Of Devices, Machine Parts, Or Other Structures Thereof (AREA)

Abstract

The invention discloses a rolling bearing fault diagnosis method based on a graph Markov attention network, which is characterized by comprising the following steps of: constructing input information inputs _ q, labels target _ q and an adjacency relation matrix adj, training a two-layer GAT network trainer _ q to obtain preds, coding the preds into new coding input, training a network, updating, then obtaining new preds and target _ q, and replacing the part owned by idx _ train with a real label by the target _ q to obtain new target _ q; the original inputs _ q and the new target _ q are used as features and labels, and then the trainer _ q is trained, and the trainer _ q is updated. The method ensures that the probability distribution learned by GAT theta is as consistent as possible with the probability distribution learned by GAT phi, the two models are mutually constrained, the two models are alternately updated by adopting an EM (effective vector regression) algorithm, the problem that the traditional GAT network cannot model the label correlation is solved, and the problem of model failure caused by insufficient label nodes is avoided.

Description

Rolling bearing fault diagnosis method based on graph Markov attention network
Technical Field
The invention belongs to the technical field of bearing fault diagnosis, and particularly relates to a rolling bearing fault diagnosis method based on a graph Markov attention network.
Background
GAT (GRAPH ATTENTION NETWORKS) is a neural network which uses self attribute mechanism, calculates the ATTENTION of a certain node in the GRAPH relative to each adjacent node in a mode similar to self attribute in a transducer, takes the characteristics of the node and the ATTENTION characteristic as the characteristics of the node, and carries out tasks such as classification of the node and the like on the basis of the characteristics.
GAT uses a similar process to calculate self attentions of nodes, first calculate the attention score of the current node and each neighboring node, then multiply the features of each node by this score, accumulate them and go through a non-linear mapping as the features of the current node.
Graph Attention Network (GAT) models can also model the correlation of labels between Network nodes (objects). The essence of the GAT network is information (feature) propagation between nodes, and the correlation of labels between nodes can be modeled only by inputting the node labels into the GAT network (named as GAT phi) as features, so that the function of statistical relationship learning is realized. However, this implementation has problems in that: in the semi-supervised learning scenario, labeled nodes tend to be rare, and thus a peripheral neighbor of a labeled node may not have a label. The expected effect of modeling labels between nodes is therefore difficult to achieve using the GAT-based approach described above.
In order to solve the problem of label scarcity, another GAT network model (named GAT theta) is adopted to predict node labels, and the aim is to use the node labels predicted by GAT phi as input characteristics of the GAT theta model. It is worth noting that GAT φ only takes the attribute features of the nodes as input, and there is no need to pay attention to the correlation of labels between nodes. The GAT φ model described above is constrained by the GAT θ model because the GAT φ model relies on GAT θ to provide node tag information as input. However, GAT θ is not constrained, which tends to cause the results of model training to not converge. To solve this problem, the probability distribution learned by GAT θ can be made as consistent as possible with the probability distribution learned by GAT Φ, so that the two models are constrained to each other and the two models can be alternately updated by using the EM algorithm.
Therefore, in order to solve the above problems, a rolling bearing fault diagnosis method based on a graph markov attention network is proposed herein.
Disclosure of Invention
In order to solve the technical problems, the invention designs a rolling bearing fault diagnosis method based on a graph Markov attention network, which enables the probability distribution learned by GAT theta to be as consistent as possible with the probability distribution learned by GAT phi, and the two models are mutually constrained, so that the two models can be alternately updated by adopting an EM (effective-energy) algorithm, the problem that the traditional GAT network cannot model the correlation of labels is solved, and the problem that the GAT phi network using the labels as characteristics has insufficient label nodes to cause model failure is also avoided.
In order to achieve the technical effects, the invention is realized by the following technical scheme: a rolling bearing fault diagnosis method based on a graph Markov attention network is characterized by comprising the following steps:
step1: constructing input information inputs _ q;
step2: training a two-layer GAT network trainer _ q by inputting characteristics inputs _ q, a label target _ q and an adjacency relation matrix adj, wherein the label takes a part owned by idx _ train;
step3: the inputs _ q obtains the press through the trainer _ q.presct, which is a distribution, each dimension is a value, the index of the dimension with larger value is more likely to be selected, and the index is encoded into a new code with 0/1 only, the inputs _ p and the target _ p;
step4: using the obtained new code as a label and a characteristic as the input of a network trainer _ p (two-layer GMNN), training the network, and updating the trainer _ p;
step5: input _ p obtains a new preds and a target _ q through a feeder _ p.predict, and the target _ q replaces the part owned by idx _ train with a real label to obtain a new target _ q;
step6: taking the initial inputs _ q and the new target _ q as features and labels, and then training the trainer _ q, wherein the trainer _ q is updated;
further, the Step1 specifically comprises the following steps:
step1.1: different fault types (including normal state (NC), rolling Element Fault (REF), outer Ring Fault (ORF) and Inner Ring Fault (IRF)), rolling bearing vibration signals of different fault sizes are respectively maximum-minimum normalized: x nol =normalize(X);
Step1.2: constructing a data set in a segmented manner;
Figure BDA0003720883700000031
step1.3: weakening the influence of noise, carrying out FFT on each sample set, and taking the first half part of the result;
step1.4: each sample is assigned a corresponding label target _ q;
step1.5: determining the number of nodes; and further searching the neighbor of each node in the affinity graph by using epsilon-radius, and calculating by the following formula:
Figure BDA0003720883700000041
wherein the content of the first and second substances,
Figure BDA0003720883700000042
is neighborhood of
Figure BDA0003720883700000043
epsilon is the radius selected for the selected radius,
Figure BDA0003720883700000044
return to
Figure BDA0003720883700000045
Wherein the content of the first and second substances,
Figure BDA0003720883700000046
is by calculation
Figure BDA0003720883700000047
and
Figure BDA0003720883700000048
Cosine similarity;
step1.6: calculating the weight between each node through a threshold Gaussian kernel weight function;
Figure BDA0003720883700000049
where β is the bandwidth variance of the gaussian function.
The invention has the beneficial effects that:
the invention designs a rolling bearing fault diagnosis method based on a graph Markov attention network, wherein probability distribution learned by GAT theta is as consistent as possible with probability distribution learned by GAT phi, and the two models are mutually constrained, so that the two models can be alternately updated by adopting an EM (effective and effective) algorithm, the problem that the traditional GAT network cannot model the correlation of labels is solved, and the problem of model failure caused by insufficient label nodes of the GAT phi network using the labels as characteristics is avoided.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a block diagram of the framework of the present invention;
FIG. 2 is a diagram of the network code structure of the algorithm of the present invention;
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example 1
Referring to fig. 1 to 2, a rolling bearing fault diagnosis method based on a graph markov attention network is characterized by comprising the following steps:
step1: constructing input information _ q;
step2: training a two-layer GAT network trainer _ q by inputting characteristics inputs _ q, a label target _ q and an adjacency relation matrix adj, wherein the label takes a part owned by idx _ train;
step3: the inputs _ q obtains the press through the trainer _ q.presct, which is a distribution, each dimension is a value, the index of the dimension with larger value is more likely to be selected, and the index is encoded into a new code with 0/1 only, the inputs _ p and the target _ p;
step4: using the obtained new code as a label and a characteristic as the input of a network trainer _ p (two-layer GMNN), training the network, and updating the trainer _ p;
step5: input _ p obtains a new preds and a target _ q through a feeder _ p.predict, and the target _ q replaces the part owned by idx _ train with a real label to obtain a new target _ q;
step6: taking the initial inputs _ q and the new target _ q as features and labels, and then training the trainer _ q, wherein the trainer _ q is updated;
further, the specific steps in Step1 are as follows:
step1.1: different fault types (including normal state (NC), rolling Element Fault (REF), outer Ring Fault (ORF) and Inner Ring Fault (IRF)), rolling bearing vibration signals of different fault sizes are normalized by maximum-minimum respectively: x nol =normalize(X);
Step1.2: constructing a data set in a segmented manner;
Figure BDA0003720883700000061
step1.3: weakening the influence of noise, carrying out FFT on each sample set, and taking the first half part of the result;
step1.4: each sample is allocated with a corresponding label target _ q;
step1.5: determining the number of nodes; and further searching the neighbor of each node in the affinity graph by using epsilon-radius, and calculating by the following formula:
Figure BDA0003720883700000062
wherein the content of the first and second substances,
Figure BDA0003720883700000063
is neighborhood of
Figure BDA0003720883700000064
epsilon is the radius selected for the selected radius,
Figure BDA0003720883700000065
return to
Figure BDA0003720883700000066
Wherein the content of the first and second substances,
Figure BDA0003720883700000067
is by calculation
Figure BDA0003720883700000068
and
Figure BDA0003720883700000069
Cosine similarity;
step1.6: calculating the weight between each node through a threshold Gaussian kernel weight function;
Figure BDA0003720883700000071
where β is the bandwidth variance of the gaussian function.
Example 2
The invention designs a rolling bearing fault diagnosis method based on a graph Markov attention network, and by enabling probability distribution learned by GAT theta to be as consistent as possible with probability distribution learned by GAT phi, and mutually restricting the two models, the two models can be alternately updated by adopting an EM algorithm, so that the problem that the traditional GAT network cannot model the relevance of labels is solved, and the problem of model failure caused by insufficient label nodes of the GAT phi network using the labels as characteristics is also avoided.
Example 3
Fig. 1 is a frame overview. Squares are tagged and untagged objects, white grids are attributes, and histograms are tag distributions of objects. The orange triple circle is the object representation. GMAN is trained by alternating between E and M steps.
Figure 1 gives an illustration of the frame. For a given node, q θ predicts the label using the attribute features of the neighbor nodes and itself as inputs, so that a vector representation of the node can be learned. In contrast, p Φ uses not only the attributes of the neighbor nodes but also the labels of the neighbor nodes as input features, and if the neighbor nodes do not have labels, the labels predicted by q θ are used as input features. In step M, q theta is used for predicting the label of the label-free node, and a part of labels are sampled from q theta, and p phi is trained and updated by combining the existing labels in the data set; in step E, p Φ first predicts the labels of the unlabeled nodes using the node attributes and labels, and then trains and updates q θ with the probability distribution of the labels predicted by p Φ as the learning target of q θ.
In the description herein, references to the description of "one embodiment," "an example," "a specific example" or the like are intended to mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.

Claims (2)

1. A rolling bearing fault diagnosis method based on a graph Markov attention network is characterized by comprising the following steps:
step1: constructing input information inputs _ q;
step2: training a two-layer GAT network trainer _ q by inputting characteristics input _ q, a label target _ q and an adjacency relation matrix adj, wherein the label takes a part owned by idx _ train;
step3: the inputs _ q obtains the press through the trainer _ q.presct, which is a distribution, each dimension is a value, the index of the dimension with larger value is more likely to be selected, and the index is encoded into a new code with 0/1 only, the inputs _ p and the target _ p;
step4: using the obtained new code as a label and characteristics as the input of a network train _ p (two-layer GMNN), training the network, and updating the train _ p;
step5: obtaining a new presd and a target _ q by input _ p through a sender _ p.predict, and replacing the part owned by idx _ train by the target _ q to obtain a new target _ q;
step6: and training the trainer _ q by taking the initial inputs _ q and the new target _ q as features and labels, wherein the trainer _ q is updated.
2. The X of claim 1, wherein: the specific steps in Step1 are as follows:
step1.1: different fault types (including normal state (NC), rolling Element Fault (REF), outer Ring Fault (ORF) and Inner Ring Fault (IRF)), rolling bearing vibration signals of different fault sizes are normalized by maximum-minimum respectively: x nol =normalize(X);
Step1.2: constructing a data set in a segmented manner;
Figure FDA0003720883690000021
step1.3: weakening the influence of noise, carrying out FFT on each sample set, and taking the first half part of the result;
step1.4: each sample is allocated with a corresponding label target _ q;
step1.5: determining the number of nodes; and further searching the neighbor of each node in the affinity graph by using epsilon-radius, and calculating by the following formula:
Figure FDA0003720883690000022
wherein, the first and the second end of the pipe are connected with each other,
Figure FDA0003720883690000023
epsilon is the radius selected for the selected radius,
Figure FDA0003720883690000024
return to
Figure FDA0003720883690000025
Wherein the content of the first and second substances,
Figure FDA0003720883690000026
is by calculation
Figure FDA0003720883690000027
Cosine similarity;
step1.6: calculating the weight between each node through a threshold Gaussian kernel weight function;
Figure FDA0003720883690000028
where β is the bandwidth variance of the gaussian function.
CN202210760515.6A 2022-06-29 2022-06-29 Rolling bearing fault diagnosis method based on graph Markov attention network Pending CN115204280A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210760515.6A CN115204280A (en) 2022-06-29 2022-06-29 Rolling bearing fault diagnosis method based on graph Markov attention network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210760515.6A CN115204280A (en) 2022-06-29 2022-06-29 Rolling bearing fault diagnosis method based on graph Markov attention network

Publications (1)

Publication Number Publication Date
CN115204280A true CN115204280A (en) 2022-10-18

Family

ID=83577305

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210760515.6A Pending CN115204280A (en) 2022-06-29 2022-06-29 Rolling bearing fault diagnosis method based on graph Markov attention network

Country Status (1)

Country Link
CN (1) CN115204280A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115929495A (en) * 2022-11-30 2023-04-07 昆明理工大学 Engine Valve Fault Diagnosis Method Based on Markov Transition Field and Improved Gaussian Prototype Network

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113343581A (en) * 2021-06-28 2021-09-03 山东华科信息技术有限公司 Transformer fault diagnosis method based on graph Markov neural network
CN114167180A (en) * 2021-12-03 2022-03-11 国网山西省电力公司晋城供电公司 Oil-filled electrical equipment fault diagnosis method based on graph attention neural network

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113343581A (en) * 2021-06-28 2021-09-03 山东华科信息技术有限公司 Transformer fault diagnosis method based on graph Markov neural network
CN114167180A (en) * 2021-12-03 2022-03-11 国网山西省电力公司晋城供电公司 Oil-filled electrical equipment fault diagnosis method based on graph attention neural network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LI,Z. ZHAO ET AL.: "Multireceptive field graph convolutional networks for machine fault diagnosis", 《IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS》, vol. 68, no. 12, 3 December 2020 (2020-12-03), pages 12739 - 12749, XP011875210, DOI: 10.1109/TIE.2020.3040669 *
MENG QU ET AL.: "GMNN: Graph Markov Neural Networks", 《ARXIV》, 23 July 2020 (2020-07-23), pages 1 - 13 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115929495A (en) * 2022-11-30 2023-04-07 昆明理工大学 Engine Valve Fault Diagnosis Method Based on Markov Transition Field and Improved Gaussian Prototype Network
CN115929495B (en) * 2022-11-30 2024-05-14 昆明理工大学 Engine valve fault diagnosis method based on Markov transition field and improved Gaussian prototype network

Similar Documents

Publication Publication Date Title
WO2023065545A1 (en) Risk prediction method and apparatus, and device and storage medium
CN111914156B (en) Cross-modal retrieval method and system for self-adaptive label perception graph convolution network
CN112100369A (en) Semantic-combined network fault association rule generation method and network fault detection method
Li et al. Improve relation extraction with dual attention-guided graph convolutional networks
CN112597296B (en) Abstract generation method based on plan mechanism and knowledge graph guidance
CN113361627A (en) Label perception collaborative training method for graph neural network
CN112100372B (en) Head news prediction classification method
Tian et al. Twitter user location inference based on representation learning and label propagation
Xu et al. A scientific research topic trend prediction model based on multi‐LSTM and graph convolutional network
CN115204280A (en) Rolling bearing fault diagnosis method based on graph Markov attention network
CN113591971A (en) User individual behavior prediction method based on DPI time series word embedded vector
CN113920379B (en) Zero sample image classification method based on knowledge assistance
CN110807601A (en) Park road deterioration analysis method based on truncation data
CN116090504A (en) Training method and device for graphic neural network model, classifying method and computing equipment
CN113128667A (en) Cross-domain self-adaptive graph convolution balance migration learning method and system
CN112925907A (en) Microblog comment viewpoint object classification method based on event graph convolutional neural network
CN116932938A (en) Link prediction method and system based on topological structure and attribute information
CN115129875A (en) Building accident report classification system and method based on graph neural network
CN112765489B (en) Social network link prediction method and system
CN115048510A (en) Criminal name prediction method based on hierarchical legal knowledge and double-graph joint representation learning
CN114254738A (en) Double-layer evolvable dynamic graph convolution neural network model construction method and application
Chen et al. Sampled in Pairs and Driven by Text: A New Graph Embedding Framework
Guo et al. User Identity Linkage Across Social Networks Based on Neural Tensor Network
CN112784057B (en) Three-network industrial map construction method based on regional industrial enterprises
CN117435970A (en) Universal domain self-adaptive node classification method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination