CN112581940A - Discharging sound detection method based on edge calculation and neural network - Google Patents
Discharging sound detection method based on edge calculation and neural network Download PDFInfo
- Publication number
- CN112581940A CN112581940A CN202010979821.XA CN202010979821A CN112581940A CN 112581940 A CN112581940 A CN 112581940A CN 202010979821 A CN202010979821 A CN 202010979821A CN 112581940 A CN112581940 A CN 112581940A
- Authority
- CN
- China
- Prior art keywords
- model
- neural network
- frequency
- mel
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004364 calculation method Methods 0.000 title claims abstract description 24
- 238000001514 detection method Methods 0.000 title claims abstract description 23
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 16
- 238000007599 discharging Methods 0.000 title claims abstract description 12
- 238000012423 maintenance Methods 0.000 claims abstract description 8
- 230000004044 response Effects 0.000 claims abstract description 8
- 230000006870 function Effects 0.000 claims description 37
- 238000000034 method Methods 0.000 claims description 19
- 238000012549 training Methods 0.000 claims description 17
- 238000004422 calculation algorithm Methods 0.000 claims description 10
- 238000003062 neural network model Methods 0.000 claims description 10
- 230000003595 spectral effect Effects 0.000 claims description 9
- 230000004913 activation Effects 0.000 claims description 8
- 238000005457 optimization Methods 0.000 claims description 8
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 7
- 230000008569 process Effects 0.000 claims description 4
- 238000005311 autocorrelation function Methods 0.000 claims description 3
- 230000008859 change Effects 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 239000011159 matrix material Substances 0.000 claims description 3
- 238000005070 sampling Methods 0.000 claims description 3
- 230000009466 transformation Effects 0.000 claims description 3
- 238000002372 labelling Methods 0.000 claims description 2
- 238000009413 insulation Methods 0.000 abstract description 9
- 230000015556 catabolic process Effects 0.000 abstract description 3
- 238000006731 degradation reaction Methods 0.000 abstract description 3
- 230000002159 abnormal effect Effects 0.000 abstract description 2
- 230000032683 aging Effects 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 4
- 238000012544 monitoring process Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- 230000003044 adaptive effect Effects 0.000 description 3
- 238000013135 deep learning Methods 0.000 description 3
- 230000006866 deterioration Effects 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000007477 logistic regression Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012567 pattern recognition method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01R—MEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
- G01R31/00—Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
- G01R31/12—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing
- G01R31/1209—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing using acoustic measurements
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
Abstract
The invention provides a discharging sound detection method based on edge calculation and a neural network, which aims at the partial discharging phenomenon caused by the insulation aging of equipment in an electric power system, provides a signal detection model arranged at an edge node to monitor three states of normal operation, partial discharging and fault occurrence of the electric power equipment in real time, and feeds an abnormal state back to an operation and maintenance center to help the operation and maintenance center to monitor the equipment fault in real time, thereby improving the operation and maintenance response time of the electric power system, avoiding major electric power accidents caused by the insulation degradation of the equipment and reducing the operation and maintenance cost. The detection method is easy to implement, high in detection accuracy and suitable for popularization and use.
Description
Technical Field
The invention relates to the field of voice recognition, in particular to a discharging sound detection method based on edge calculation and a neural network.
Background
High-voltage switch cabinets are one of the most important electrical devices in electrical power systems. The safety and reliability of the power equipment are important links of ultra-large-scale power transmission and distribution and power grid safety guarantee, and the safety and reliability of the high-voltage switch cabinet as widely applied power equipment are also concerned more. According to the statistical information of accidents of 6 kV-10 kV switch cabinets of the national power system between 2005 and 2011, the total number of the accidents caused by insulation and current carrying is 50.2%, wherein the total number of the accidents caused by the deterioration of an insulation part is 79.0%, and the total number of the accidents caused by poor contact of an isolation plug is 71.1%. It can be seen that the rate of failure due to insulation deterioration or contact failure is high. Before the fault occurs, partial discharge and other phenomena may exist in the high-voltage switch cabinet, so that the equipment running state parameters can be obtained by detecting related information. In view of this, how to effectively find the partial discharge and the development rule thereof in the switch cabinet and detect the potential insulation fault in time is a problem that the power supervision department is concerned with and needs to solve more and more urgently, and is also a difficulty and challenge faced by related scientific research personnel and scientific research units. Therefore, the switch cabinet and the ring main unit adopt proper partial discharge live monitoring in actual operation, and the method has great significance.
Deep learning is one of the research fields that have developed rapidly in recent years, and has made a breakthrough in the sub-field of many human intelligence. Most of the early techniques of machine learning and signal processing, which use shallow structures, including support vector machine, gaussian mixture, logistic regression, etc., are in a predicament when some complex natural signals are involved. Therefore, researchers have proposed a more efficient deep learning method by simulating a deep hierarchical structure in systems such as human vision and hearing, extracting a complex structure from rich sensory input signals, and establishing an internal representation.
Since 90 s, the pattern recognition method is introduced into the field of partial discharge defect type recognition, but partial discharge signals of GIS and large transformers are mostly researched, and the research level is still in the first stage. Neural Network (NN) recognition is currently widely used, and is a machine learning method that follows the principle of empirical risk minimization.
Disclosure of Invention
The technical purpose of the invention is to provide a discharging sound detection method based on edge calculation and a neural network, which is used for carrying out training by utilizing known discharging sound data and carrying out classification detection on actual discharging sound data.
The technical scheme of the invention is as follows:
a discharging sound detection method based on edge calculation and a neural network is characterized by comprising the following steps:
collecting voice samples of the power equipment in three states of normal work, partial discharge and failure, extracting audio features of the voice samples, labeling and constructing a data set;
step (B) establishing a neural network model with a multi-classification function, training the model by using the data set established in the step (A), and compiling the model by using multi-classification cross entropy and adam optimization algorithm;
step (C) carrying out accuracy and error analysis on the trained model;
step (D), taking the model with the standard accuracy as a target model, deploying the target model at an edge node, detecting the electric leakage state of the power equipment at a node terminal by using the target model, and returning the detection result to an operation and maintenance center;
and (E) aiming at the detection result returned by the edge node, the operation and maintenance center carries out data statistics and analysis at the cloud end and makes a corresponding operation and maintenance response.
On the basis of the above scheme, a further improved or preferred scheme further comprises:
in the step (B), the neural network model with multi-classification functions comprises an input layer, a hidden layer and an output layer;
the calculation method of the hidden layer is as follows:
hidden=f(W×X+b)
wherein, hide layer output, f is a nonlinear activation function, W is the weight of the network, X is the input layer vector, i.e. the audio features extracted in step (A), and b is the bias of the network;
the calculation method of the output layer is as follows:
Y=softmax(WY×hiddenlast+bY)
wherein Y is the output of the output layer, hiddenlastIs the output value of the last hidden layer, WYIs the weight of the output layer, bYIs the offset of the output layer, softmax is the activation function of the output layer;
since here three leakage states are detected, W isYIs a two-dimensional matrix with one dimension of 3, and bYThen a length 3 vector;
the softmax function is defined as follows:
wherein, S is 3, corresponding to 3 states of the power equipment, j is any one of the 3 states, and j is more than or equal to 1 and less than or equal to 3; the formula represents the probability that the speech sample x is judged to be in the state j, and when the output node is selected finally, the node with the maximum probability is selected as the prediction target.
Further, in step (B), the multi-class cross-entropy loss function is defined as follows:
where C denotes the number of speech samples, YqIs the classification result of the q-th speech sample by the neural network model, yqIt is the true label for the sample.
Further, in step (C), the accuracy is defined as:
wherein P is the accuracy, TP is the number of samples correctly classified by the model, and FP is the number of samples incorrectly classified by the model.
Further, the audio features are one or more of short-time average energy, short-time average amplitude function, short-time average zero crossing rate, short-time autocorrelation function, mel-frequency cepstrum correlation parameter, and formant correlation parameter.
Preferably, the present invention adopts mel cepstrum related parameters as audio features, and the specific process of step (a) includes:
a1) extracting the same voice frame number for each voice sample, and then extracting corresponding voice characteristics for each frame of voice to prepare for model training;
speech signal x after windowing of a framei(n) calculating its fast fourier transform, where n is the sample point of the speech timing and i is the index of the frame;
transformation from time domain data to frequency domain data:
Xi(k)=FFT[xi(n)]
a2) FFT of each framei(k) Calculating the spectral line energy Ei(k) K denotes the kth spectral line in the frequency domain;
Ei(k)=|Xi(k)|2
a3) passing the energy of each frame of spectral line through a Mel filter bank, calculating its energy in each filter bank
Where N is the length of FFT change, M is the number of filters, M is the total number of filters, Hm(k) Is the frequency response of the mel filter:
wherein, f (m), f (m-1) and f (m +1) respectively represent the center frequencies of the m-th filter, the m-1-th filter and the m + 1-th filter, and the calculation formulas of f (m-1) and f (m +1) are analogized according to the calculation formula of f (m);
fland fHRespectively the lowest and highest frequency of the filter frequency range, N the length of the FFT variation, fsIs the sampling frequency, FMel() A function that converts the actual frequency in brackets to mel-frequency is shown,is FMel() The inverse function of (1), i.e. conversion of mel frequency to actual frequency;
a4) decorrelating the S obtained in step a3) by means of a Discrete Cosine Transform (DCT)i(m) substituting the following formula to obtain a final voice characteristic parameter MFCC;
in the above formula, mfcciAnd (n) extracting corresponding voice characteristics of the ith frame voice.
Has the advantages that:
the invention provides a discharging sound detection method based on edge calculation and a neural network, which is used for monitoring three states of normal operation, partial discharge and fault occurrence of electric power equipment in real time through a signal detection model arranged at an edge node aiming at a partial discharge phenomenon caused by insulation aging of the equipment in an electric power system, and feeding back an abnormal state to an operation and maintenance center, thereby helping the operation and maintenance center to monitor the equipment fault in real time, improving the operation and maintenance response time of the electric power system, avoiding major electric power accidents caused by insulation degradation of the equipment and reducing the operation and maintenance cost. Compared with the traditional leakage detection, the method needs a professional to detect on site, and all-weather unmanned monitoring can be realized through the terminal edge node; compare in traditional monitoring facilities, need the professional just can understand data, differentiate the electric leakage state, here can directly give the electric leakage state through the model probability, reduce the dependence to relevant professional knowledge.
Drawings
FIG. 1 is a simplified flow diagram of the method of the present invention;
FIG. 2 is a block diagram of a neural network model for multiple classification functions;
FIG. 3 is a diagram of a system architecture corresponding to the detection method of the present invention;
FIG. 4 is a graph of the accuracy of the trained model in the training set and the validation set;
FIG. 5 is a graph of the error curves of the trained model over the training set and the validation set;
fig. 6 is a schematic diagram of MFCC feature parameter extraction.
Detailed Description
To clarify the technical solution and working principle of the present invention, the present invention will be further described with reference to the accompanying drawings and specific embodiments.
As shown in fig. 1, a method for detecting a sounding discharge based on edge calculation and a neural network specifically includes the following steps:
(A) the method comprises the steps of collecting voice samples of the power equipment in three states of normal work, partial discharge and failure by using a voice sensor, extracting audio features of the voice samples, marking the voice samples with labels, and constructing a data set, wherein the data set comprises a training set, a verification set and a test set.
The audio features are one or more of short-time average energy, short-time average amplitude function, short-time average zero-crossing rate, short-time autocorrelation function, mel-frequency cepstrum related parameters and formant related parameters. In this embodiment, the Mel Frequency Cepstrum Coefficients (MFCCs) are used as audio features, and the Mel Frequency Cepstrum Coefficients are preferably selected.
Mel Frequency Cepstrum Coefficients (MFCCs) are used to analyze the Frequency spectrum of speech according to the results of human auditory experiments, because the division of the human subjective perceptual Frequency domain is not linear, and the relationship between the Frequency domain and the actual Frequency is shown in the following formula.
FMel=1125×log(1+f/700)
In the formula, FMelIs the perceived frequency in Mel (Mel) units, and f is the actual frequency in Hz. The MFCC characteristic parameter extraction principle is shown in fig. 6, and the specific process is as follows:
a1) extracting the same voice frame number for each voice sample, and then extracting corresponding voice characteristics for each frame of voice to prepare for model training;
speech signal x after windowing of a framei(n) calculating its fast fourier transform, where n is the sample point of the speech timing and i is the index of the frame;
transformation from time domain data to frequency domain data:
Xi(k)=FFT[xi(n)]
a2) FFT of each framei(k) Calculating the spectral line energy Ei(k) K denotes the kth spectral line in the frequency domain;
Ei(k)=|Xi(k)|2
a3) passing the energy of each frame of spectral line through a Mel filter bank, calculating its energy in each filter bank
Where N is the length of FFT change, M is the number of filters, M is the total number of filters, Hm(k) Is the frequency response of the mel filter:
wherein, f (m), f (m-1) and f (m +1) respectively represent the center frequencies of the m-th filter, the m-1-th filter and the m + 1-th filter, and the calculation formulas of f (m-1) and f (m +1) are analogized according to the calculation formula of f (m);
fland fHRespectively the lowest and highest frequency of the filter frequency range, N the length of the FFT variation, fsIs the sampling frequency, FMel() A function that converts the actual frequency in brackets to mel-frequency is shown,is FMel() The inverse function of (i.e. conversion of mel frequency to actual frequency),d represents the argument of the inverse function, namely the mel frequency;
a4) decorrelating the S obtained in step a3) by means of a Discrete Cosine Transform (DCT)i(m) substituting the following formula to obtain a final voice characteristic parameter MFCC;
in the above formula, mfcciAnd (n) extracting corresponding voice characteristics of the ith frame voice.
(B) Establishing a neural network model with a multi-classification function, training the model by using the data of the training set, and compiling the model by using a multi-classification cross entropy and adam optimization algorithm.
In this step, the neural network model with multi-classification functions includes an input layer, a hidden layer and an output layer; the calculation method of the hidden layer is as follows:
hidden=f(W×X+b)
wherein, hide the output of layer, f is the nonlinear activation function, W is the weight of the network, X is the input layer vector, namely the audio characteristic extracted in step (A), b is the bias of the network;
when a plurality of hidden layers exist, the calculation method of each hidden layer is as follows:
hiddenp=fp(Wp×hiddenp-1+bp)
where p is the number of the hidden layer, WpAs a weight of the p-th hidden layer, bpFor biasing of the p-th hidden layer, fpHidden function for p-th hidden layer, hiddenpFor output of p-th hidden layer, hiddenp-1And hiding the output value of the layer above the p-th layer.
For the first layer hidden layer, the input value is the audio feature MFCC extracted above, the weight and bias of each hidden layer may be different, the activation function is not necessarily required to be the same, and finally output is obtained through softmax.
A softmax function, which maps the outputs of a plurality of neurons into the interval between (0,1), is defined as follows:
wherein, S is 3 corresponding to 3 states of the power equipment, j is any one of 3 states, j is more than or equal to 1 and less than or equal to 3, xjIndicating that the speech sample x belongs to the j leakage state,in its exponential form, the form of the,this represents a summary of all leakage states, and this equation represents the probability of discriminating it as state j for speech sample x.
Since the task of the detection of the sparkling voice is a multi-classification problem, the activation function of the output layer uses a normalized exponential function, namely a softmax function, which is to map the outputs of a plurality of neurons into the interval between (0,1), and then when the output node is selected finally, the node with the maximum probability can be selected as the prediction target.
The calculation method of the output layer is as follows:
Y=softmax(WY×hiddenlast+bY)
among them, hiddenlastIs the output value of the last hidden layer, WYIs the weight of the output layer, bYIs the bias of the output layer and softmax is the activation function of the output layer.
Since here three leakage states are detected, W isYIs a two-dimensional matrix with one dimension of 3, and bYIt is a length 3 vector.
The cross entropy describes the distance between two probability distributions, which is a loss function widely used in the classification problem, and in this embodiment, the multi-classification cross entropy loss function is defined as follows:
where C denotes the number of speech samples, YqIs the classification result of the q-th speech sample by the neural network model, yqIt is the true label for the sample. Since the program index is typically computed starting from 0 and the total is C, starting from 0, the upper bound of the summation is C-1. It is also possible to start with 1, when the upper summation limit is C.
In order to make the result of the model output close to the real result, it is required to minimize the above-mentioned loss function, i.e., to minimize the entropy between the model output and the real result. For this purpose, Adam optimization algorithms were introduced. The Adam optimization algorithm is a first-order optimization algorithm that can replace the traditional stochastic gradient descent process, and can iteratively update neural network weights based on training data. The optimization algorithm integrates the advantages of an adaptive gradient algorithm (AdaGrad) and a root mean square propagation (RMSProp) algorithm, Adam not only calculates the adaptive parameter learning rate based on the first moment mean value, but also fully utilizes the second moment mean value of the gradient, which is the prior art, so that the description is not repeated.
In the embodiment, a deep learning method is used, and a known data set is trained to obtain a discharge sound multi-classification detection model. By applying a multi-classification cross entropy loss function and an adam optimization algorithm, independent adaptive learning rates are designed for different parameters by calculating first moment estimation and second moment estimation of the gradient, so that the model training efficiency can be improved, and the robustness of the recognition effect can be enhanced.
(C) Performing accuracy and error analysis on the trained model
The accuracy is defined as:
where P is the accuracy and TP is the number of samples for the correct classification of the model, i.e., argmax (Y)i)=argmax(yi) (ii) a FP is the number of samples for model misclassification.
(D) The method comprises the steps of taking a model with the standard accuracy as a target model, deploying the target model at an edge node, detecting the electric leakage state of the power equipment at the edge node terminal by using the target model based on audio data fed back by a voice sensor, namely judging the working state of the power equipment, returning a detection result to an operation and maintenance center through an edge computing private network, and helping the operation and maintenance center to monitor equipment faults in real time.
(E) And aiming at the detection result returned by the edge node, the operation and maintenance center performs data statistics and analysis at the cloud end to make a corresponding operation and maintenance response, so that a major power accident caused by insulation degradation of equipment is avoided, and the operation and maintenance cost is reduced.
To verify the validity of the method of the embodiment, the data set is as follows: 2: 2 (training set, verification set, test set), randomly dividing, setting batch _ size to 50, and training round number epochs to 30, and finally obtaining a graph 4 of the error and accuracy of the model on the training set and the verification set, as shown in fig. 5. As can be seen from the figure, when the number of training rounds reaches 16 rounds, the precision of the data set on the test set and the verification set reaches 98.15%, the error is gradually reduced, and the accuracy on the test set is 96.3%.
The foregoing shows and describes the general principles, essential features, and advantages of the invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the foregoing description only for the purpose of illustrating the principles of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the invention as defined by the appended claims, specification, and equivalents thereof.
Claims (6)
1. A discharging sound detection method based on edge calculation and a neural network is characterized by comprising the following steps:
collecting voice samples of the power equipment in three states of normal work, partial discharge and failure, extracting audio features of the voice samples, labeling and constructing a data set;
step (B) establishing a neural network model with a multi-classification function, training the model by using the data set established in the step (A), and compiling the model by using multi-classification cross entropy and adam optimization algorithm;
step (C) carrying out accuracy and error analysis on the trained model;
step (D), taking the model with the standard accuracy as a target model, deploying the target model at an edge node, detecting the electric leakage state of the power equipment at a node terminal by using the target model, and returning the detection result to an operation and maintenance center;
and (E) aiming at the detection result returned by the edge node, the operation and maintenance center carries out data statistics and analysis at the cloud end and makes a corresponding operation and maintenance response.
2. The method for detecting sparkling sound based on edge calculation and neural network of claim 1, wherein in the step (B), the neural network model with multi-classification function comprises an input layer, a hidden layer and an output layer;
the calculation method of the hidden layer is as follows:
hidden=f(W×X+b)
hidden layer output, and f (W multiplied by X + b) is a nonlinear activation function, wherein W is the weight of the network, X is an input layer vector, namely the audio feature extracted in the step (A), and b is the bias of the network;
the calculation method of the output layer is as follows:
Y=softmax(WY×hiddenlast+bY)
wherein Y is the output of the output layer, hiddenlastIs the output value of the last hidden layer, WYIs the weight of the output layer, bYIs the offset of the output layer, softmax is the activation function of the output layer;
since here three leakage states are detected, W isYIs a two-dimensional matrix with one dimension of 3, and bYThen a length 3 vector;
the softmax function is defined as follows:
wherein, S is 3, corresponding to 3 states of the power equipment, j is any one of the 3 states, and j is more than or equal to 1 and less than or equal to 3;
the formula represents the probability that the speech sample x is judged to be in the state j, and when the output node is selected finally, the node with the maximum probability is selected as the prediction target.
3. The method for detecting sound discharge based on edge computing and neural network of claim 1, wherein in step (B), the multi-class cross entropy loss function is defined as follows:
where C denotes the number of speech samples, YqIs the classification result of the q-th speech sample by the neural network model, yqIt is the true label for the sample.
4. The method for detecting sound discharge based on edge calculation and neural network as claimed in claim 1, wherein in step (C), the accuracy is defined as:
wherein P is the accuracy, TP is the number of samples correctly classified by the model, and FP is the number of samples incorrectly classified by the model.
5. The method as claimed in any one of claims 1-4, wherein the audio features are one or more of a short-time average energy, a short-time average amplitude function, a short-time average zero-crossing rate, a short-time autocorrelation function, a Mel cepstrum correlation parameter, and a formant correlation parameter.
6. The method for detecting sparkling sound based on edge computing and neural network as claimed in claim 5, wherein Mel cepstrum related parameters are used as audio features, and the specific process of step (A) comprises:
a1) extracting the same voice frame number for each voice sample, and then extracting corresponding voice characteristics for each frame of voice to prepare for model training;
speech signal x after windowing of a framei(n) calculating its fast fourier transform, where n is the sample point of the speech timing and i is the index of the frame;
transformation from time domain data to frequency domain data:
Xi(k)=FFT[xi(n)]
a2) FFT of each framei(k) Calculating the spectral line energy Ei(k) K denotes the kth spectral line in the frequency domain;
Ei(k)=|Xi(k)|2
a3) passing the energy of each frame of spectral line through a Mel filter bank, calculating its energy in each filter bank
Where N is the length of FFT change, M is the number of filters, M is the total number of filters, Hm(k) Is the frequency response of the mel filter:
wherein, f (m), f (m-1) and f (m +1) respectively represent the center frequencies of the m-th filter, the m-1-th filter and the m + 1-th filter, and the calculation formulas of f (m-1) and f (m +1) are analogized according to the calculation formula of f (m);
fland fHRespectively the lowest and highest frequency of the filter frequency range, N the length of the FFT variation, fsIs the sampling frequency, FMel() A function that converts the actual frequency in brackets to mel-frequency is shown,is FMel() The inverse function of (1), i.e. conversion of mel frequency to actual frequency;
a4) decorrelating the S obtained in step a3) by means of a Discrete Cosine Transform (DCT)i(m) substituting the following formula to obtain a final voice characteristic parameter MFCC;
in the above formula, mfcciAnd (n) extracting corresponding voice characteristics of the ith frame voice.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010979821.XA CN112581940A (en) | 2020-09-17 | 2020-09-17 | Discharging sound detection method based on edge calculation and neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010979821.XA CN112581940A (en) | 2020-09-17 | 2020-09-17 | Discharging sound detection method based on edge calculation and neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112581940A true CN112581940A (en) | 2021-03-30 |
Family
ID=75119563
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010979821.XA Pending CN112581940A (en) | 2020-09-17 | 2020-09-17 | Discharging sound detection method based on edge calculation and neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112581940A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113640635A (en) * | 2021-10-18 | 2021-11-12 | 广东电网有限责任公司惠州供电局 | Power cable insulation state online monitoring method |
CN114113943A (en) * | 2021-11-25 | 2022-03-01 | 广东电网有限责任公司广州供电局 | Transformer partial discharge detection system, method and equipment based on current and ultrasonic signals |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008090529A (en) * | 2006-09-29 | 2008-04-17 | Matsushita Electric Works Ltd | Abnormality detection device, abnormality detection method |
CN109357749A (en) * | 2018-09-04 | 2019-02-19 | 南京理工大学 | A kind of power equipment audio signal analysis method based on DNN algorithm |
CN109856517A (en) * | 2019-03-29 | 2019-06-07 | 国家电网有限公司 | A kind of method of discrimination of extra-high voltage equipment Partial Discharge Detection data |
WO2019232846A1 (en) * | 2018-06-04 | 2019-12-12 | 平安科技(深圳)有限公司 | Speech differentiation method and apparatus, and computer device and storage medium |
CN111612279A (en) * | 2020-06-10 | 2020-09-01 | 江苏方天电力技术有限公司 | Power grid state prediction method and system based on edge calculation |
-
2020
- 2020-09-17 CN CN202010979821.XA patent/CN112581940A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008090529A (en) * | 2006-09-29 | 2008-04-17 | Matsushita Electric Works Ltd | Abnormality detection device, abnormality detection method |
WO2019232846A1 (en) * | 2018-06-04 | 2019-12-12 | 平安科技(深圳)有限公司 | Speech differentiation method and apparatus, and computer device and storage medium |
CN109357749A (en) * | 2018-09-04 | 2019-02-19 | 南京理工大学 | A kind of power equipment audio signal analysis method based on DNN algorithm |
CN109856517A (en) * | 2019-03-29 | 2019-06-07 | 国家电网有限公司 | A kind of method of discrimination of extra-high voltage equipment Partial Discharge Detection data |
CN111612279A (en) * | 2020-06-10 | 2020-09-01 | 江苏方天电力技术有限公司 | Power grid state prediction method and system based on edge calculation |
Non-Patent Citations (4)
Title |
---|
孙汉文 等: "基于机器学习与卷积神经网络的放电声音识别研究", 高压电器, vol. 56, no. 9, pages 107 - 113 * |
宋知用: "MATLAB语音信号分析与合成", 31 January 2018, 北京航空航天大学出版社, pages: 38 - 39 * |
王菲菲;阮爱民;魏刚;孙海渤;: "基于卷积神经网络的开关柜局部放电故障识别", 电气技术, no. 04, pages 76 - 81 * |
韩志艳: "面向语音与面部表情信号的多模式情感识别技术研究", 31 January 2017, 东北大学出版社, pages: 70 - 72 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113640635A (en) * | 2021-10-18 | 2021-11-12 | 广东电网有限责任公司惠州供电局 | Power cable insulation state online monitoring method |
CN114113943A (en) * | 2021-11-25 | 2022-03-01 | 广东电网有限责任公司广州供电局 | Transformer partial discharge detection system, method and equipment based on current and ultrasonic signals |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109685138B (en) | XLPE power cable partial discharge type identification method | |
CN109357749A (en) | A kind of power equipment audio signal analysis method based on DNN algorithm | |
CN109856517B (en) | Method for distinguishing partial discharge detection data of extra-high voltage equipment | |
CN112885372B (en) | Intelligent diagnosis method, system, terminal and medium for power equipment fault sound | |
CN108761287B (en) | Transformer partial discharge type identification method | |
CN106405339A (en) | Power transmission line fault reason identification method based on high and low frequency wavelet feature association | |
CN102279358B (en) | MCSKPCA based neural network fault diagnosis method for analog circuits | |
CN110120230B (en) | Acoustic event detection method and device | |
Zhang et al. | Fault identification based on PD ultrasonic signal using RNN, DNN and CNN | |
CN102809718A (en) | Ultra-high-frequency partial discharge signal identification method for gas insulated switchgear (GIS) | |
CN108169639A (en) | Method based on the parallel long identification switch cabinet failure of Memory Neural Networks in short-term | |
CN103558519A (en) | GIS partial discharge ultrasonic signal identification method | |
KR20200104019A (en) | Machine learning based voice data analysis method, device and program | |
CN102623009A (en) | Abnormal emotion automatic detection and extraction method and system on basis of short-time analysis | |
CN112581940A (en) | Discharging sound detection method based on edge calculation and neural network | |
CN111368892A (en) | Generalized S transformation and SVM electric energy quality disturbance efficient identification method | |
CN116778956A (en) | Transformer acoustic feature extraction and fault identification method | |
CN115728612A (en) | Transformer discharge fault diagnosis method and device | |
CN115909675A (en) | Distributed edge computing power equipment sound monitoring method | |
Rahman et al. | Dynamic time warping assisted svm classifier for bangla speech recognition | |
CN113111786B (en) | Underwater target identification method based on small sample training diagram convolutional network | |
CN113158781B (en) | Lightning trip type identification method | |
CN103678773A (en) | Sphere gap discharge voltage detection method | |
CN113921041A (en) | Recording equipment identification method and system based on packet convolution attention network | |
CN105006231A (en) | Distributed large population speaker recognition method based on fuzzy clustering decision tree |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |