CN112885372B - Intelligent diagnosis method, system, terminal and medium for power equipment fault sound - Google Patents

Intelligent diagnosis method, system, terminal and medium for power equipment fault sound Download PDF

Info

Publication number
CN112885372B
CN112885372B CN202110053647.0A CN202110053647A CN112885372B CN 112885372 B CN112885372 B CN 112885372B CN 202110053647 A CN202110053647 A CN 202110053647A CN 112885372 B CN112885372 B CN 112885372B
Authority
CN
China
Prior art keywords
audio
power equipment
samples
recognition model
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110053647.0A
Other languages
Chinese (zh)
Other versions
CN112885372A (en
Inventor
江丽
徐明月
马全江
赵晓楠
肖驰
张卫东
李喆
于文牮
刘腾
辛昂倍
刘震
蔡英明
刘嘉
姚晓林
丛龙琦
程波涛
王嫚嫚
张启红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Weihai Power Supply Co of State Grid Shandong Electric Power Co Ltd
Original Assignee
Shanghai Jiaotong University
Weihai Power Supply Co of State Grid Shandong Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University, Weihai Power Supply Co of State Grid Shandong Electric Power Co Ltd filed Critical Shanghai Jiaotong University
Priority to CN202110053647.0A priority Critical patent/CN112885372B/en
Publication of CN112885372A publication Critical patent/CN112885372A/en
Application granted granted Critical
Publication of CN112885372B publication Critical patent/CN112885372B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
    • Y04S10/52Outage or fault management, e.g. fault detection or location

Abstract

The invention provides a power equipment fault sound intelligent diagnosis method and system based on MFCC and attention mechanism, firstly establishing a power equipment fault audio sample database; then, preprocessing the audio sample; extracting a Mel cepstrum coefficient from each frame of signal obtained after preprocessing to serve as a feature vector of the frame of signal; then, taking adjacent multi-frame signals as a group of samples, and optimizing by using an attention mechanism to form a new feature vector of the samples; and finally, inputting the optimized feature vector into an audio recognition model for judgment, wherein the audio recognition model can recognize various types of fault sounds of the power equipment. A terminal and medium are also provided. The method can effectively solve the problem of adaptability of the Mel cepstrum coefficient to the sound of the power equipment, and can achieve better recognition effect when recognizing the fault sound of the power equipment.

Description

Intelligent diagnosis method, system, terminal and medium for power equipment fault sound
Technical Field
The invention relates to the technical field of signal processing, in particular to an intelligent diagnosis method, system, terminal and medium for power equipment fault sound based on MFCC and attention mechanism.
Background
In recent years, with the rapid development of the economy of China, the development of the power grid of China begins to present a new situation, the scale of the power grid is continuously enlarged, and meanwhile, the access of an extra-high voltage system makes the running condition of the power grid increasingly complex, and power equipment is required to have higher reliability indexes. In addition to the electrical quantity, the failure of the electrical equipment transmits a lot of information such as abnormal sound generated. Therefore, the non-electrical quantity can also be used for the status analysis of the power equipment. The sound signal has the advantages that the collection mode is flexible and simple, the sensor does not directly contact the equipment, and the normal operation of the equipment is not interfered, so that the sound signal is widely applied to the field of power equipment diagnosis. With the rapid development of artificial intelligence technology, the application of voice recognition is becoming mature day by day, which provides a new idea for a power equipment fault diagnosis method based on sound signal processing. By using the principle of voice recognition for reference, the accumulated fault diagnosis research experience of the power equipment for many years is combined with the artificial intelligence technology, the value of the field data is maximized, and the intelligent development of the power equipment diagnosis is promoted.
Mel-Frequency Cepstral Coefficients (MFCC) is a speech feature based on human auditory mechanism, and is widely applied in the field of speech recognition. The frequency of the voice signal is mainly concentrated on 200Hz-5000Hz, and the sound signal of the electric power equipment is mainly concentrated on 100Hz-2000Hz, and the frequency ranges of the voice signal and the sound signal are slightly different. Through comparing the spectrogram of power equipment sound and pronunciation discovery, the energy distribution of power equipment sound is more even, is the linearity, and the change of similar ripple appears in the spectrogram of pronunciation, reflects the energy change of power equipment sound relatively steady, and the similarity between two adjacent frames of sound signal is high. To make the MFCC better suited for electrical equipment sound, further processing is required. The attention mechanism is an information processing mode for quickly screening out high-value information from a plurality of information by simulating the human brain, and can screen out effective input information which is more important to the current task for processing, so that the global relation can be grasped, the local relation is not ignored, and the learning efficiency of the neural network is improved. However, if the MFCC and attention mechanism are directly combined, the model may suffer from overfitting.
At present, no explanation or report of the similar technology of the invention is found, and similar data at home and abroad are not collected.
Disclosure of Invention
The invention provides a power equipment fault sound intelligent diagnosis method, a system, a terminal and a medium based on MFCC and attention mechanism, aiming at the defects in the prior art.
According to one aspect of the invention, the intelligent sound diagnosis method for the power equipment fault comprises the following steps:
establishing a power equipment fault audio sample database, and dividing a training set and a test set;
respectively carrying out preprocessing operation on the audio samples of the training set and the testing set;
extracting n-dimensional Mel cepstrum coefficients from each preprocessed frame of audio signal as a feature vector of one frame;
taking adjacent m frames of audio signals as a group of samples, and optimizing the feature vectors of the group of samples by using an attention mechanism to form optimized feature vectors;
inputting the optimized feature vector into an audio recognition model for judgment, and finishing training and testing the audio recognition model;
and inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the electrical equipment.
Preferably, the establishing an audio sample database of the power equipment fault, and dividing the training set and the test set includes: collecting audio frequencies of common power equipment under different working conditions and defects, and carrying out type marking on the audio frequencies to form a complete power equipment fault audio frequency sample database;
for each type of audio sample, randomly extracting a part of the audio samples in proportion to be used as a training set for training a model, and using the rest of the audio samples as a test set for verifying the effectiveness of the model;
all audio samples in the training set and the testing set and corresponding labels thereof are randomly arranged.
Preferably, the preprocessing operation comprises: pre-emphasis, de-mute, framing, and windowing.
Preferably, the preprocessing operation further comprises any one or more of the following operations:
-the pre-emphasis factor is 0.97;
-the de-mute threshold is an average energy of 40%;
-the framing comprises: dividing the audio sample into 25ms segments, and setting the frame displacement as 10 ms;
-said windowing comprises: windowing each frame of audio signal with a hamming window.
Preferably, the value of n is 13-20.
Preferably, the value of m is 10-50.
Preferably, the method for constructing the audio recognition model includes:
and (3) adopting a deep neural network, and arranging a Dropout layer behind each full-connection layer of the deep neural network to construct and obtain an audio recognition model.
Preferably, the training of the audio recognition model takes the optimized feature vector formed based on the MFCC and the attention mechanism as the input of the audio recognition model, and the parameters are continuously updated through forward propagation and error backward propagation to enable the deep neural network to learn, so as to finally generalize the weight model capable of classifying the audio recognition under different operating states and defects of the power equipment.
Preferably, the activation function of the network training employs a tanh function.
According to another aspect of the present invention, there is provided an audio intelligent diagnosis system for power equipment failure, comprising:
the database construction module is used for establishing a power equipment fault audio sample database and dividing a training set and a test set;
the data preprocessing module is used for respectively preprocessing the audio samples of the training set and the testing set;
a feature vector extraction module which extracts n-dimensional mel cepstrum coefficients from each frame of preprocessed audio signal as a feature vector of one frame;
the feature optimization module takes the adjacent m frames of audio signals as a group of samples, optimizes feature vectors of the group of samples by using an attention mechanism and forms optimized feature vectors;
the audio recognition model module is used for constructing an audio recognition model, inputting the optimized feature vector into the audio recognition model for judgment, and training and testing the audio recognition model; and inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the electrical equipment.
According to a third aspect of the present invention, there is provided a terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor when executing the program being operable to perform any of the methods described above.
According to a fourth aspect of the invention, there is provided a computer readable storage medium having stored thereon a computer program which, when executed by a processor, is operable to perform the method of any of the above.
Due to the adoption of the technical scheme, compared with the prior art, the invention has the following beneficial effects:
according to the intelligent diagnosis method, system, terminal and medium for the power equipment fault sound, deep learning is applied to power equipment fault sound recognition, an acoustic-based power equipment fault diagnosis model can be effectively established, and compared with a traditional shallow classifier, the intelligent diagnosis method has the powerful advantages that a traditional artificial intelligence algorithm cannot compare with the traditional artificial intelligence algorithm in the aspects of feature self-learning, end-to-end modeling and the like.
According to the intelligent diagnosis method, system, terminal and medium for the power equipment fault sound, provided by the invention, an attention mechanism is introduced to improve the MFCC, so that the MFCC is better suitable for representing the sound of the power equipment, adjacent m frames (for example, 50 frames) of MFCC vectors in a sample are comprehensively considered, some effective information which is more important for a current task is screened out and processed, and the identification effect is effectively improved.
According to the intelligent diagnosis method, system, terminal and medium for the power equipment fault sound, a proper deep neural network structure is built, namely, a Dropout layer is arranged after each layer is fully connected, and the problem of model overfitting is avoided.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:
fig. 1 is a flowchart of an intelligent sound diagnostic method for a fault of an electrical device according to an embodiment of the present invention.
Fig. 2 is a flowchart of an intelligent sound diagnosis method for power equipment failure according to a preferred embodiment of the present invention.
Fig. 3 is a schematic structural diagram of a sound intelligent diagnosis system for power equipment failure according to an embodiment of the present invention.
Detailed Description
The following examples illustrate the invention in detail: the embodiment is implemented on the premise of the technical scheme of the invention, and a detailed implementation mode and a specific operation process are given. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention.
Fig. 1 is a flowchart of an intelligent sound diagnostic method for a fault of an electrical device according to an embodiment of the present invention.
As shown in fig. 1, the method for intelligently diagnosing a fault of an electrical device according to this embodiment may include the following steps:
s100, establishing a power equipment fault audio sample database, and dividing a training set and a test set;
s200, respectively carrying out preprocessing operation on the audio samples of the training set and the testing set;
s300, extracting n-dimensional Mel cepstrum coefficients from each preprocessed frame of audio signal to be used as a feature vector of one frame;
s400, using the adjacent m frames of audio signals as a group of samples, and optimizing the feature vectors of the group of samples by using an attention mechanism to form optimized feature vectors;
s500, inputting the optimized feature vector into an audio recognition model for judgment, and finishing training and testing the audio recognition model;
and S600, inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the power equipment.
In S100 of this embodiment, as a preferred embodiment, establishing an audio sample database of the power equipment failure, and dividing the training set and the test set, the method may include the following steps:
s101, collecting audio frequencies of common power equipment under different working conditions and defects, and carrying out type marking on the audio frequencies to form a complete power equipment fault audio frequency sample database;
s102, for each type of audio sample, randomly extracting a part of the audio samples in proportion to be used as a training set for training a model, and using the rest of the audio samples as a test set for verifying the effectiveness of the model;
s103, randomly arranging all the audio samples and the corresponding labels in the training set and the testing set.
In a specific application example, 80% can be randomly drawn as a training set, and the remaining 20% can be drawn as a testing set.
In S200 of this embodiment, as a preferred embodiment, the preprocessing operation may include: pre-emphasis, de-mute, framing, and windowing.
In a specific application example, in the pre-emphasis process, the pre-emphasis coefficient may be 0.97.
In one embodiment, the de-muting threshold may be 40% of the average energy in de-muting.
In a specific application example, in the framing process, the framing may include: the audio samples are sliced into 25ms segments with a frame shift of 10 ms.
In a specific application example, in the windowing process, the windowing may include: windowing each frame of audio signal with a hamming window.
In S300 of this embodiment, n may be 13 to 20 as a preferred embodiment.
In S400 of this embodiment, m may be 10 to 50 as a preferred embodiment.
In S500 of this embodiment, as a preferred embodiment, the method for constructing an audio recognition model may include the following steps:
and (3) adopting a deep neural network, and arranging a Dropout layer behind each full-connection layer of the deep neural network to construct and obtain an audio recognition model.
In S500 of this embodiment, as a preferred embodiment, the training of the audio recognition model may include the following steps:
and (3) taking the optimized feature vector formed based on the MFCC and the attention mechanism as the input of the audio recognition model, continuously updating parameters through forward propagation and error backward propagation to enable the deep neural network to learn, and finally generalizing a weight model capable of recognizing and classifying the audio under different operating states and defects of the power equipment.
In S500 of this embodiment, as a preferred embodiment, the activation function of the network training employs a tanh function.
Fig. 2 is a flowchart of an intelligent sound diagnosis method for power equipment failure according to a preferred embodiment of the present invention.
As shown in fig. 2, the method for intelligently diagnosing a fault of an electrical device according to the preferred embodiment may include the following steps:
step 1, establishing an audio sample database of common power equipment faults, and then dividing a training set and a test set according to a proportion.
In this preferred embodiment, step 1 specifically includes the following steps:
step 1.1, collecting audio frequencies of power equipment such as a transformer and a switch under different working conditions and defects, and labeling the audio frequencies to form a complete power equipment fault audio frequency sample database.
And 1.2, randomly extracting 80% of samples of each type as a training set to train the model, and using the rest 20% as test samples to verify the effectiveness of the model. In order to ensure the effectiveness of learning, all samples and corresponding labels are randomly arranged and input into the network in a disordered sequence.
And 2, respectively carrying out pre-emphasis, de-muting, framing and windowing on the audio samples of the training set and the test set.
In this preferred embodiment, step 2 specifically includes the following steps:
step 2.1, pre-emphasis: the pre-emphasis processing is realized by passing the signal through a high-pass filter with first-order finite-length unit impulse response, and the transfer function of the high-pass filter is as follows:
H(z)=1-αz -1 (1)
in the formula, alpha is a pre-emphasis coefficient and is 0.97.
Step 2.2, de-muting: whether the sound signal is mute or not is judged by judging whether the short-time energy of the sound signal reaches a certain threshold value or not. Generally, the average energy is used as a measure, and the average energy of the sound signal is calculated by the following formula:
Figure BDA0002900062990000061
in the formula, L represents the number of sampling points;
x (n) -data for each sample point.
Step 2.3, framing: a plurality of sampling points are grouped into an observation unit, which is called a sub-frame. The covered time is about as. An overlap region is set between two adjacent frames, the overlap region includes a plurality of sampling points, and the time covered is about bs.
Further, the number of the plurality of sampling points is 400; the covered time is 0.025 s; the overlap region contains 160 sampling points covering 0.01 s.
And 2.4, windowing each frame of audio signal by using a Hamming window. The Hamming window function is expressed as:
Figure BDA0002900062990000062
in the formula, N represents the frame length, and α is a window function parameter, which is generally 0.46.
And 3, extracting 20-dimensional Mel cepstrum coefficient from each frame of signal obtained after preprocessing to serve as a feature vector of one frame.
In this preferred embodiment, step 3 specifically includes the following steps:
step 3.1, performing windowing on each frame signal after framingThe fast fourier transform obtains the frequency spectrum of each frame, and the power spectrum of the sound signal is obtained by performing a modulo square on the frequency spectrum of the sound signal (sound sample). Setting sound signal X a (k) The DFT of (1) is:
Figure BDA0002900062990000071
where k denotes the kth frequency of the fourier transform, x (N) denotes an input speech signal, and N denotes the number of points of the fourier transform.
Step 3.2, passing the energy spectrum through a group of Mel-scale triangular filter banks, wherein the frequency response H of the triangular filter banks m (k) Comprises the following steps:
Figure BDA0002900062990000072
wherein f (-) represents the center frequency,
Figure BDA0002900062990000073
m is the number of filters.
Step 3.3, calculating the logarithmic energy s (m) output by each filter bank, wherein the form is as follows:
Figure BDA0002900062990000074
step 3.4, obtaining MFCC coefficient C (n) through Discrete Cosine Transform (DCT) in the form of:
Figure BDA0002900062990000075
and 4, taking the adjacent 50 frames of audio signals as a group of samples, optimizing the feature vectors of the group of samples by using an attention mechanism, and constructing new feature vectors.
In this preferred embodiment, step 4 specifically includes the following steps:
step 4.1, input signal, with X ═ { X 1 ,x 2 ,...,x n Denotes n input signals.
Step 4.2, attention distribution calculation, for convenience of understanding, assume key i =value i =x i Then the attention distribution is:
α i =softmax(s(key i ,q))=softmax(s(x i ,q)) (8)
in the formula of alpha i -the weight of the ith input message;
softmax — map the input between 0-1 and normalize the guaranteed sum to 1;
s(key i q) -attention scoring mechanism, herein adopted dot product models, i.e.
Figure BDA0002900062990000076
And 4.3, carrying out weighted average on the information to obtain an attention vector:
Figure BDA0002900062990000077
in the formula of alpha i -the weight of the ith input message.
And 5, inputting the optimized feature vector into an audio recognition model constructed by a deep neural network for judgment, and finishing training and testing the audio recognition model.
In the preferred embodiment, step 5 specifically includes the following steps:
step 5.1, building a deep neural network architecture, wherein an activation function of network training adopts a tanh function:
Figure BDA0002900062990000081
and 5.2, arranging a Dropout layer behind each full-connection layer, randomly discarding some hidden neurons in the network, and keeping the number of input neurons and output neurons unchanged.
And 5.3, after training and debugging the network structure parameters to be optimal, repeating the steps of preprocessing and feature extraction on the test set sample, inputting the feature vector into the network to obtain the corresponding probability of each fault sound, and taking the fault with the highest probability as the recognition result of the audio sample.
And 6, inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the power equipment.
The intelligent diagnosis method for the power equipment fault sound provided by the embodiment of the invention can effectively solve the problem of adaptability of the Mel cepstrum coefficient to the power equipment sound, and can obtain better recognition effect when recognizing the power equipment fault sound.
Fig. 3 is a schematic structural diagram of a sound intelligent diagnosis system for power equipment failure according to an embodiment of the present invention.
As shown in fig. 3, the intelligent sound diagnostic system for power equipment failure provided by this embodiment may include the following modules:
the database construction module is used for establishing a power equipment fault audio sample database and dividing a training set and a test set;
the data preprocessing module is used for respectively preprocessing the audio samples of the training set and the testing set;
a feature vector extraction module which extracts n-dimensional mel cepstrum coefficients from each frame of preprocessed audio signal as a feature vector of one frame;
the feature optimization module takes the adjacent m frames of audio signals as a group of samples, optimizes feature vectors of the group of samples by using an attention mechanism and forms optimized feature vectors;
the audio recognition model module is used for constructing an audio recognition model, inputting the optimized feature vector into the audio recognition model for judgment, and training and testing the audio recognition model; and inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the electrical equipment.
A third embodiment of the present invention provides a terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor being operable to execute the method according to any one of the above embodiments of the present invention when executing the program.
Optionally, a memory for storing a program; a Memory, which may include a volatile Memory (RAM), such as a Random Access Memory (SRAM), a Double Data Rate Synchronous Dynamic Random Access Memory (DDR SDRAM), and the like; the memory may also comprise a non-volatile memory, such as a flash memory. The memories are used to store computer programs (e.g., applications, functional modules, etc. that implement the above-described methods), computer instructions, etc., which may be stored in partition in the memory or memories. And the computer programs, computer instructions, data, etc. described above may be invoked by a processor.
The computer programs, computer instructions, etc. described above may be stored in one or more memories in a partitioned manner. And the computer programs, computer instructions, data, etc. described above may be invoked by a processor.
A processor for executing the computer program stored in the memory to implement the steps of the method according to the above embodiments. Reference may be made in particular to the description relating to the preceding method embodiment.
The processor and the memory may be separate structures or may be an integrated structure integrated together. When the processor and the memory are separate structures, the memory, the processor may be coupled by a bus.
A fourth embodiment of the invention provides a computer-readable storage medium, on which a computer program is stored, which program, when being executed by a processor, is adapted to carry out the method of any of the above-mentioned embodiments of the invention.
According to the intelligent diagnosis method, the system, the terminal and the medium for the power equipment fault sound provided by the embodiment of the invention, firstly, a common power equipment fault audio sample database is established; then preprocessing operations such as pre-emphasis, de-mute, framing, windowing and the like are carried out on the audio samples; then extracting n-dimensional (such as 20-dimensional) Mel Cepstral Coefficients (MFCC) from each frame signal obtained after preprocessing as a feature vector of the frame signal; then, taking the adjacent m frames (such as 50 frames) as a group of samples, and optimizing by using an attention mechanism to form a new feature vector of the sample; and finally, inputting the optimized feature vector into a built deep neural network for judgment, and identifying fault sounds of various types of power equipment. The embodiment of the invention can effectively solve the problem of adaptability of the Mel cepstrum coefficient to the sound of the power equipment, and the method can obtain better identification effect when identifying the fault sound of the power equipment.
It should be noted that, the steps in the method provided by the present invention may be implemented by using corresponding modules, devices, units, and the like in the system, and those skilled in the art may implement the composition of the system by referring to the technical solution of the method, that is, the embodiment in the method may be understood as a preferred example for constructing the system, and will not be described herein again.
Those skilled in the art will appreciate that, in addition to implementing the system and its various devices provided by the present invention in purely computer readable program code means, the method steps can be fully programmed to implement the same functions by implementing the system and its various devices in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like. Therefore, the system and various devices thereof provided by the present invention can be regarded as a hardware component, and the devices included in the system and various devices thereof for realizing various functions can also be regarded as structures in the hardware component; means for performing the functions may also be regarded as structures within both software modules and hardware components for performing the methods.
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes and modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention.

Claims (10)

1. An intelligent sound diagnosis method for power equipment failure is characterized by comprising the following steps:
establishing a power equipment fault audio sample database, and dividing a training set and a test set;
respectively carrying out preprocessing operation on the audio samples of the training set and the testing set;
extracting n-dimensional Mel cepstrum coefficients from each preprocessed frame of audio signal as a feature vector of one frame;
the method comprises the following steps of taking adjacent m frames of audio signals as a group of samples, optimizing feature vectors of the group of samples by using an attention mechanism, and forming optimized feature vectors, wherein the method specifically comprises the following steps:
step 4.1, input signal, with X ═ { X 1 ,x 2 ,...,x n Denotes n input signals;
step 4.2, attention distribution calculation, for convenience of understanding, assume key i =value i =x i Then the attention distribution is:
α i =softmax(s(key i ,q))=softmax(s(x i ,q)) (8)
in the formula of alpha i -the weight of the ith input message;
softmax — map the input between 0-1 and normalize the guaranteed sum to 1;
s(key i q) -attention scoring mechanism, herein using a dot product model, i.e.
Figure FDA0003712419600000011
And 4.3, carrying out weighted average on the information to obtain an attention vector:
Figure FDA0003712419600000012
in the formula of alpha i -the weight of the ith input message;
inputting the optimized feature vector into an audio recognition model for judgment, and finishing training and testing the audio recognition model;
and inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the electrical equipment.
2. The intelligent sound diagnosis method for power equipment failure according to claim 1, wherein the establishing a power equipment failure audio sample database and dividing a training set and a test set comprises:
collecting audio frequencies of common power equipment under different working conditions and defects, and carrying out type marking on the audio frequencies to form a complete power equipment fault audio frequency sample database;
for each type of audio sample, randomly extracting a part of the audio samples in proportion to be used as a training set for training a model, and using the rest of the audio samples as a test set for verifying the effectiveness of the model;
all audio samples in the training set and the testing set and corresponding labels thereof are randomly arranged.
3. The power equipment fault sound intelligent diagnosis method according to claim 1, characterized in that the preprocessing operation comprises: pre-emphasis, de-mute, framing, and windowing.
4. The intelligent sound diagnosis method for the power equipment fault according to claim 3, characterized by further comprising any one or more of the following:
-the pre-emphasis factor is 0.97;
-the de-mute threshold is an average energy of 40%;
-the framing comprises: dividing the audio sample into 25ms segments, and setting the frame displacement as 10 ms;
-said windowing comprises: windowing each frame of audio signal with a hamming window.
5. The intelligent sound diagnosis method for the power equipment fault according to claim 1, wherein n is a vimel cepstrum coefficient n with a value of 13-20;
the value of m is 10-50.
6. The intelligent sound diagnosis method for the power equipment fault is characterized in that the audio identification model building method comprises the following steps:
and (3) adopting a deep neural network, and arranging a Dropout layer behind each full-connection layer of the deep neural network to construct and obtain an audio recognition model.
7. The intelligent sound diagnosis method for the power equipment fault is characterized in that the training of the audio recognition model comprises any one or more of the following items:
taking the optimized feature vector formed based on the MFCC and the attention mechanism as the input of the audio recognition model, continuously updating parameters through forward propagation and error backward propagation to enable the deep neural network to learn, and finally generalizing a weight model capable of recognizing and classifying the audio under different operating states and defects of the power equipment;
-the activation function of the network training employs the tanh function.
8. An intelligent sound diagnostic system for power equipment failure, comprising:
the database construction module is used for establishing a power equipment fault audio sample database and dividing a training set and a test set;
the data preprocessing module is used for respectively preprocessing the audio samples of the training set and the testing set;
a feature vector extraction module which extracts n-dimensional mel cepstrum coefficients from each frame of preprocessed audio signal as a feature vector of one frame;
the feature optimization module takes the adjacent m frames of audio signals as a group of samples, optimizes feature vectors of the group of samples by using an attention mechanism, and forms optimized feature vectors, and specifically comprises the following steps:
step 4.1, input signal, with X ═ { X 1 ,x 2 ,...,x n Denotes n input signals;
step 4.2, attention distribution calculation, for convenience of understanding, assume key i =value i =x i Then the attention distribution is:
α i =softmax(s(key i ,q))=softmax(s(x i ,q)) (8)
in the formula of alpha i -the weight of the ith input message;
softmax — map the input between 0-1 and normalize the guaranteed sum to 1;
s(key i q) -attention scoring mechanism, herein using a dot product model, i.e.
Figure FDA0003712419600000031
And 4.3, carrying out weighted average on the information to obtain an attention vector:
Figure FDA0003712419600000032
in the formula of alpha i -the weight of the ith input message;
the audio recognition model module is used for constructing an audio recognition model, inputting the optimized feature vector into the audio recognition model for judgment, and training and testing the audio recognition model; and inputting the audio to be identified into the audio identification model, and identifying and outputting the fault sound of the corresponding type of the electrical equipment.
9. A terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor, when executing the program, is operative to perform the method of any of claims 1-7.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, is adapted to carry out the method of any one of claims 1 to 7.
CN202110053647.0A 2021-01-15 2021-01-15 Intelligent diagnosis method, system, terminal and medium for power equipment fault sound Active CN112885372B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110053647.0A CN112885372B (en) 2021-01-15 2021-01-15 Intelligent diagnosis method, system, terminal and medium for power equipment fault sound

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110053647.0A CN112885372B (en) 2021-01-15 2021-01-15 Intelligent diagnosis method, system, terminal and medium for power equipment fault sound

Publications (2)

Publication Number Publication Date
CN112885372A CN112885372A (en) 2021-06-01
CN112885372B true CN112885372B (en) 2022-08-09

Family

ID=76048006

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110053647.0A Active CN112885372B (en) 2021-01-15 2021-01-15 Intelligent diagnosis method, system, terminal and medium for power equipment fault sound

Country Status (1)

Country Link
CN (1) CN112885372B (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113409814B (en) * 2021-06-02 2022-09-20 国家石油天然气管网集团有限公司西气东输分公司 Intelligent monitoring method and system for operation of natural gas station equipment
CN113553465B (en) * 2021-06-15 2023-12-19 深圳供电局有限公司 Sound data warehousing method, device, computer equipment and storage medium
CN114330431A (en) * 2021-12-23 2022-04-12 天津工业大学 Carrier roller fault diagnosis method based on dynamic self-attention and related equipment
CN114638256A (en) * 2022-02-22 2022-06-17 合肥华威自动化有限公司 Transformer fault detection method and system based on sound wave signals and attention network
CN116825131A (en) * 2022-06-24 2023-09-29 南方电网调峰调频发电有限公司储能科研院 Power plant equipment state auditory monitoring method integrating frequency band self-downward attention mechanism
CN115392293A (en) * 2022-08-01 2022-11-25 中国南方电网有限责任公司超高压输电公司昆明局 Transformer fault monitoring method and device, computer equipment and storage medium
CN115240691B (en) * 2022-09-23 2022-12-06 山西振中电力股份有限公司 Substation equipment running state monitoring control system based on data analysis
CN117095695B (en) * 2023-10-19 2023-12-22 国网山西省电力公司超高压变电分公司 Wide-area voiceprint compression acquisition method and system for transformer body

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108010514A (en) * 2017-11-20 2018-05-08 四川大学 A kind of method of speech classification based on deep neural network
CN109346103A (en) * 2018-10-30 2019-02-15 交通运输部公路科学研究所 A kind of audio-frequency detection for highway tunnel traffic event
CN109801635A (en) * 2019-01-31 2019-05-24 北京声智科技有限公司 A kind of vocal print feature extracting method and device based on attention mechanism
CN110782878A (en) * 2019-10-10 2020-02-11 天津大学 Attention mechanism-based multi-scale audio scene recognition method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018005996A1 (en) * 2016-06-30 2018-01-04 Massachusetts Institute Of Technology System, device, and method for feature generation, selection, and classification for audio detection of anomalous engine operation
US10210860B1 (en) * 2018-07-27 2019-02-19 Deepgram, Inc. Augmented generalized deep learning with special vocabulary

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108010514A (en) * 2017-11-20 2018-05-08 四川大学 A kind of method of speech classification based on deep neural network
CN109346103A (en) * 2018-10-30 2019-02-15 交通运输部公路科学研究所 A kind of audio-frequency detection for highway tunnel traffic event
CN109801635A (en) * 2019-01-31 2019-05-24 北京声智科技有限公司 A kind of vocal print feature extracting method and device based on attention mechanism
CN110782878A (en) * 2019-10-10 2020-02-11 天津大学 Attention mechanism-based multi-scale audio scene recognition method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Research on Voice Print Recognition of Electrical Faults Based on Attention-MFCC Algorithm;Lin Chen et al.;《2021 Power System and Green Energy Conference (PSGEC)》;20210924;第748-751页 *

Also Published As

Publication number Publication date
CN112885372A (en) 2021-06-01

Similar Documents

Publication Publication Date Title
CN112885372B (en) Intelligent diagnosis method, system, terminal and medium for power equipment fault sound
US11908455B2 (en) Speech separation model training method and apparatus, storage medium and computer device
CN109065030B (en) Convolutional neural network-based environmental sound identification method and system
CN110491416B (en) Telephone voice emotion analysis and identification method based on LSTM and SAE
CN109740523A (en) A kind of method for diagnosing fault of power transformer based on acoustic feature and neural network
CN111325095A (en) Intelligent equipment health state detection method and system based on sound wave signals
Das et al. Urban sound classification using convolutional neural network and long short term memory based on multiple features
CN113566948A (en) Fault audio recognition and diagnosis method for robot coal pulverizer
CN111161715A (en) Specific sound event retrieval and positioning method based on sequence classification
CN115862684A (en) Audio-based depression state auxiliary detection method for dual-mode fusion type neural network
CN114023354A (en) Guidance type acoustic event detection model training method based on focusing loss function
CN113203914A (en) Underground cable early fault detection and identification method based on DAE-CNN
CN114627895A (en) Acoustic scene classification model training method and device, intelligent terminal and storage medium
Whitehill et al. Whosecough: In-the-wild cougher verification using multitask learning
CN111080168A (en) Power communication network equipment reliability evaluation method based on capsule network
CN114331214A (en) Domain-adaptive bearing voiceprint fault diagnosis method and system based on reinforcement learning
Gao et al. A multi-scale fully convolutional network for singing melody extraction
Li et al. Research on environmental sound classification algorithm based on multi-feature fusion
CN112329819A (en) Underwater target identification method based on multi-network fusion
Lim et al. Sound event detection in domestic environments using ensemble of convolutional recurrent neural networks
CN116026588A (en) Bearing fault diagnosis and early warning method based on ensemble learning
CN114898527A (en) Wearable old man falling detection system and method based on voice assistance
CN104102834A (en) Method for identifying sound recording locations
Luo et al. Polyphonic sound event detection based on CapsNet-RNN and post processing optimization
Vesperini et al. Hierarchic conv nets framework for rare sound event detection

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Jiang Li

Inventor after: Xin Angbei

Inventor after: Liu Zhen

Inventor after: Cai Yingming

Inventor after: Liu Jia

Inventor after: Yao Xiaolin

Inventor after: Cong Longqi

Inventor after: Cheng Botao

Inventor after: Wang Manman

Inventor after: Zhang Qihong

Inventor after: Xu Mingyue

Inventor after: Ma Quanjiang

Inventor after: Zhao Xiaonan

Inventor after: Xiao Chi

Inventor after: Zhang Weidong

Inventor after: Li Zhe

Inventor after: Yu Wenjian

Inventor after: Liu Teng

Inventor before: Jiang Li

Inventor before: Xin Angbei

Inventor before: Liu Zhen

Inventor before: Cai Yingming

Inventor before: Liu Jia

Inventor before: Yao Xiaolin

Inventor before: Cong Longqi

Inventor before: Cheng Botao

Inventor before: Wang Manman

Inventor before: Zhang Qihong

Inventor before: Xu Mingyue

Inventor before: Ma Quanjiang

Inventor before: Zhao Xiaonan

Inventor before: Xiao Chi

Inventor before: Zhang Weidong

Inventor before: Li Zhe

Inventor before: Yu Wenjian

Inventor before: Liu Teng

GR01 Patent grant
GR01 Patent grant