CN111767398A - Secondary equipment fault short text data classification method based on convolutional neural network - Google Patents

Secondary equipment fault short text data classification method based on convolutional neural network Download PDF

Info

Publication number
CN111767398A
CN111767398A CN202010610878.2A CN202010610878A CN111767398A CN 111767398 A CN111767398 A CN 111767398A CN 202010610878 A CN202010610878 A CN 202010610878A CN 111767398 A CN111767398 A CN 111767398A
Authority
CN
China
Prior art keywords
layer
data
secondary equipment
vector
short text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010610878.2A
Other languages
Chinese (zh)
Inventor
王开科
南东亮
孙永辉
吴杰
于同伟
卜强生
庞福滨
杨毅
杨飞
钱海
赵启
张路
王利超
卢盛阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Hohai University HHU
Electric Power Research Institute of State Grid Liaoning Electric Power Co Ltd
Electric Power Research Institute of State Grid Sichuan Electric Power Co Ltd
Electric Power Research Institute of State Grid Xinjiang Electric Power Co Ltd
Electric Power Research Institute of State Grid Jiangsu Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Hohai University HHU
Electric Power Research Institute of State Grid Liaoning Electric Power Co Ltd
Electric Power Research Institute of State Grid Sichuan Electric Power Co Ltd
Electric Power Research Institute of State Grid Xinjiang Electric Power Co Ltd
Electric Power Research Institute of State Grid Jiangsu Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Hohai University HHU, Electric Power Research Institute of State Grid Liaoning Electric Power Co Ltd, Electric Power Research Institute of State Grid Sichuan Electric Power Co Ltd, Electric Power Research Institute of State Grid Xinjiang Electric Power Co Ltd, Electric Power Research Institute of State Grid Jiangsu Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN202010610878.2A priority Critical patent/CN111767398A/en
Publication of CN111767398A publication Critical patent/CN111767398A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Economics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Databases & Information Systems (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for classifying secondary equipment fault short text data based on a convolutional neural network, which comprises the steps of firstly, acquiring text information data of secondary equipment faults, determining a data set and obtaining a training sample set; preprocessing the short text data of the secondary equipment fault, and mapping word data and word vectors one by one to obtain text vector data; then training the convolutional neural network model by using the training sample to obtain a trained convolutional neural network model; and then, according to the verification set data, inputting test input data into the trained convolutional neural network model, wherein the output value of the model is the classification result of the secondary equipment fault short text information to be classified. The secondary equipment fault short text data classification method based on the convolutional neural network utilizes the good characteristic screening and analyzing capability of the convolutional neural network model, improves the prediction precision and strengthens the generalization capability of the model.

Description

Secondary equipment fault short text data classification method based on convolutional neural network
Technical Field
The invention belongs to a Chinese natural language processing technology, and particularly relates to a secondary equipment fault short text data classification method based on a convolutional neural network.
Background
In the process of construction and operation of an intelligent power grid, the occurrence of electric power big data explosions can be roughly divided into two types according to Chinese electric power big data development white papers compiled by the information special committee of the Chinese electro-mechanical engineering society in 2013, wherein one type is structured data represented by output power, equipment and environment temperature and humidity thereof, light intensity of an optical module and the like in a time sequence mode, and the other type is semi-structured and non-structured data represented by texts, images, audios and the like and difficult to express by using a relational database. Structured data mining work is mature, however, the problem of low data value density brought by the fact that normal data are always generated in the running process of a power grid and only a very small amount of fault data exist restricts the mining of unstructured data. The short text fault information occurring in the operation process of the secondary equipment is also the information which is focused in the construction process of the internet of things.
In the operation process of the secondary equipment, a plurality of fault defect short text data are accumulated, the data are often manually recorded by a transportation and inspection person to finish the classification work of the defect levels, the accurate classification is difficult to achieve due to the fact that the subjectivity and the experience of the transportation and inspection person are different, and the fault data are more, a large amount of manpower is needed to participate, and the efficiency is difficult to guarantee. With the development of Chinese text classification technology, automatic classification of a large amount of fault short text information in a power grid production management system becomes possible by means of machine learning.
At present, for short text classification, foreign countries already research emotion classification on hotel evaluation through natural language processing, but the natural language processing of English is difficult to use in Chinese due to different Chinese and English structures, such as characters of capitalization of proper nouns in English, space connection between words and the like, and is difficult to use in Chinese due to different industries, and a lot of proper nouns exist in the field of secondary equipment text classification, so that the research on the improvement of classification models is lacked, and most of the researches are based on the field of traditional machine learning. And because the reason that personnel of recording record by hand, the spoken language is recorded more, the text message is shorter, there is no method disclosed to the short text message classification of the secondary equipment at present.
Disclosure of Invention
The purpose of the invention is as follows: the invention provides a method for classifying secondary equipment fault short text data based on a convolutional neural network, and aims to solve the problem that fault short text information recorded by secondary equipment is insufficient in identification and classification in the operation process of a smart grid.
The technical scheme is as follows: a secondary equipment fault short text data classification method based on a convolutional neural network comprises the following steps:
(1) establishing a sample data set, collecting fault short text data generated by secondary equipment in an electric power system in the operation process, marking the fault short text data according to different defect grades, and dividing the fault short text data into a training set, a verification set and a test set;
(2) text preprocessing, namely constructing a stop word dictionary and a secondary equipment special dictionary, wherein the stop word dictionary is used for filtering and removing noise in secondary equipment fault short text information, and the noise comprises words and punctuations which have no actual physical significance in secondary equipment fault short text data; the secondary equipment special dictionary is used for identifying attribute data of the secondary equipment, and the attribute data comprises the name, the model, the station name and the route name of the secondary equipment;
(3) word vector modeling, namely establishing a word2vec model for the preprocessed text data, acquiring sample data set secondary equipment fault short text data, performing word vector training, mining context semantic relations among words, and establishing a word and word vector mapping relation;
(4) constructing a multi-size convolutional neural network secondary equipment fault short text data classification model, taking the word vectors trained in the step (3) as an input matrix of the convolutional neural network model, carrying out convolutional operation on the secondary equipment fault short text data matrix according to different word quantity combination modes by adopting a multi-size convolutional kernel to obtain a plurality of phrase sequences, outputting the phrase sequences to a pooling layer, carrying out maximum pooling extraction on features through the pooling layer, reducing feature dimensions, and screening the phrase sequences with the highest weight values;
(5) predicting a classification result, setting a layer of full-connection layer after the structures of the convolution layer and the pooling layer, performing feature extraction and combination on vectors output by different pooling layers, and outputting the combined vectors to a subsequent SoftMax layer for classification and judgment; and obtaining the probability belonging to each fault grade type, obtaining the maximum probability in the probabilities, and outputting the text type corresponding to the maximum probability as the type of the text to be classified.
Further, the text data of the fault short messages in the step (1) are divided into three types of fault defects of 'serious defect', 'critical defect' and 'general defect' according to the evaluation guide of the relay protection state of the national power grid company, corresponding fault feature labels are arranged on the fault short messages, and the fault short messages are randomly combined according to the following steps of 7: 2: 1 into a training set, a validation set and a test set.
Further, the text preprocessing in the step (2) includes filtering noise, extracting stem features, restoring part of speech, and recognizing entity nouns and proper nouns which appear in the short text data of the secondary equipment failure; the method comprises the steps of constructing a stop word dictionary and a secondary equipment special dictionary by performing feature sequencing on an original language database text, traversing a language database according to dictionary contents, filtering data noise and extracting data features.
Further, the construction process of the Word2vec Word vector model in the step (3) is as follows:
(31) expressing the Chinese text into a structured word vector according to the context relationship between words, and adopting WordEmbedding to construct vector mapping so that the semantic information of the word context is not lost by the word vector generated after mapping, and the dimension of the text vector is reduced;
(32) constructing three-layer structures which are an input layer, a hidden layer and an output layer respectively, entering discrete numerical values obtained by the input layer into a linear unit of the hidden layer for training, and finally using SoftMax regression on the output layer;
(33) two neural network model structures, namely a CBOW structure and a Skip-gram structure, are constructed according to the step (31) and the step (32), and are specifically as follows:
(A) CBOW structure model
The CBOW structure model predicts the final output result through the context relation of words in an input layer, supposing that the dimension of the words is V, the context of the input context is C, the size of a dictionary is D, inputting One-hot coding data into a hidden layer structure, and multiplying all discrete data by an input weight matrix WV×NN is the number of the set hidden layer structures, and the obtained vector is multiplied by an output weight matrix W'V×NObtaining the probability distribution with one-dimensional vector, and taking the maximum probability as the predicted intermediate words;
continuously selecting words, training an input weight matrix and an output weight matrix by adopting a gradient descent algorithm, wherein the smaller the error is, the better the error is;
the conditional probability of CBOW for a target word is calculated as follows:
Figure BDA0002561750320000031
the specific maximization formula of the objective function of CBOW is as follows:
(ω,c)∈DlogP(ω,c)
(B) skip-gram structure model
The Skip-gram structure model constructs words of an input layer to make predicted word vectors form a context form, and an objective function of the Skip-gram model is as follows:
Figure BDA0002561750320000032
further, the convolutional neural network secondary device failure short text information classification model in step (4) includes an input layer, a convolutional layer, a pooling layer, and an output layer, and specifically includes the following steps:
the first layer is an input layer, selects text data to be classified, completes vectorization of the text data according to the step (3), and outputs a matrix I ∈ Rm×nM is the number of words of the text, i.e., the number of rows of the input layer, n is the number of dimensions of the text vector, i.e., the number of columns of the input layer, and each word data is divided into word vectors of equal dimensions according to the division in step (3) so that the number of columns of the input layer is equal, thereby forming a matrix I ∈ Rm×nIn the training process, the word vector is adjusted by a random gradient descent method;
selecting convolution kernels of different sizes, wherein each convolution kernel has multiple convolution kernels, and inputting matrix I ∈ R to input layer respectivelym×nPerforming convolution operation to extract the matrix characteristics of the input layer to obtain a convolution result vector ri(i ═ 1,2,3,4, 5, 6, L), the formula is as follows:
ri=W·Ii:i+h-1
wherein the matrix W represents a weight coefficient, "·" represents a dot product operation;
activating the convolution result through an activation function ReLU, and performing nonlinear processing to obtain a result ciThe formula is as follows:
ci=ReLU(ri+b)
c is toiThe convolutional layer vector c ∈ R is obtained from left to right, in the order from top to bottoms-h+1The formula is as follows:
c=[c1,c2,K,cs-h+1]
the third layer is a pooling layer, which is pooled by maximum pooling method according to the result vector c ∈ R extracted from the convolutional layers -h+1The element with the largest value in the sequence is extracted as the characteristic value pj(j is 1,2,3,4, 5, 6, L, n) and all the characteristic values pjSequentially spliced into a vector p ∈ Rn×1Inputting the data into a fourth output layer, wherein a vector p represents a vector of global features of the text data;
the fourth layer is an output layer: fully connecting the pooling layer with the output layer, taking a vector p of the pooling layer as input, classifying the vector p by adopting a Softmax classifier, and outputting a final classification result; the probability of Softmax classification calculation is as follows:
Figure BDA0002561750320000041
wherein the function L (p)j) And expressing the probability of belonging to the secondary equipment category, selecting the result with the maximum probability, and outputting the defect level of the secondary equipment fault.
Has the advantages that: compared with the prior art, the method has the advantages that the stop word dictionary and the special dictionary in the secondary equipment field are constructed, and the text convolution neural network has good feature screening capability on the short text, so that the feature model structure is simplified, and the classification precision and generalization capability are improved. And a secondary equipment failure short text classification model based on deep learning is established, the prediction precision is obviously superior to that of the traditional neural network model, and the accuracy of secondary equipment failure short text classification is greatly improved.
Drawings
FIG. 1 is a flow chart of an embodiment of the method of the present invention.
Detailed Description
For the purpose of illustrating the technical solutions disclosed in the present invention in detail, the following description is further provided with reference to the accompanying drawings and specific embodiments.
As shown in fig. 1, the method for classifying short text data of secondary device failure based on convolutional neural network provided by the present invention includes the following steps:
s1: determining a data set;
collecting short text data of faults generated in the operation process of secondary equipment, dividing the fault types into 'serious defects', 'critical defects' and 'general defects' according to the requirements of relevant guide rules, and dividing a text data set into: training set, verification set and test set.
S2: preprocessing a text;
constructing a stop word dictionary, filtering and removing noise in the short text information of the secondary equipment fault, and keeping words with specific practical meanings such as nouns, verbs, quantifiers and the like; and constructing a secondary equipment professional dictionary, and identifying low-frequency words such as equipment names and equipment models and special nouns such as plant stations and lines where the equipment is located.
S3: constructing a word vector;
word vector modeling, namely establishing a word2vec model for the preprocessed text data, acquiring sample data set secondary equipment fault short text data, performing word vector training, mining context semantic relations among words, and establishing a word and word vector mapping relation;
s4: constructing a convolutional neural network;
setting four layers of convolution neural network
(1) The first layer is an input layer;
selecting text data to be classified, completing vectorization of the text data according to the step (S3), and outputting a matrix I ∈ Rm×nM is the number of words of the text, i.e., the number of rows of the input layer, n is the number of dimensions of the text vector, i.e., the number of columns of the input layer, and each word data is divided into word vectors of equal dimensions according to the division in step (3) so that the number of columns of the input layer is equal, thereby forming a matrix I ∈ Rm ×nIn the training process, the word vector is adjusted by a random gradient descent method;
(2) the second layer is a convolution layer;
selecting convolution kernels of different sizes, wherein each convolution kernel of different size has a plurality of matrixes I ∈ R respectively input to input layersm×nPerforming convolution operation to extract the matrix characteristics of the input layer to obtain a convolution result vector ri(i ═ 1,2,3,4, 5, 6, L), the formula is as follows:
ri=W·Ii:i+h-1
wherein the matrix W represents a weight coefficient, "·" represents a dot product operation;
activating the convolution result through an activation function ReLU, and performing nonlinear processing to obtain a result ciThe formula is as follows:
ci=ReLU(ri+b)
c is toiThe convolutional layer vector c ∈ R is obtained from left to right, in the order from top to bottoms-h+1The formula is as follows:
c=[c1,c2,K,cs-h+1]
(3) the third layer is a pooling layer;
pooling by maximum pooling based on the result vector c ∈ R extracted from the convolutional layers-h+1The element with the largest value in the sequence is extracted as the characteristic value pj(j is 1,2,3,4, 5, 6, L, n) and all the characteristic values pjSequentially spliced into a vector p ∈ Rn×1Inputting the data into a fourth output layer, wherein a vector p represents a vector of global features of the text data;
(4) the fourth layer is an output layer;
and fully connecting the pooling layer with the output layer, taking the vector p of the pooling layer as input, classifying the vector p by adopting a Softmax classifier, and outputting a final classification result. The probability of Softmax classification calculation is as follows:
Figure BDA0002561750320000061
wherein the function L (p)j) Representing the probability of belonging to a secondary device class.
The following description will take the classification of the secondary device failure short text message as an example.
2000 data of a Xinjiang power grid relay protection action statistical table from 2015 to 2019 are analyzed, and 1471 data of repeated invalid data with too large noise are removed. According to the following steps: 2: 1, dividing a data set, and performing word segmentation on each piece of data, wherein the word segmentation is performed on short text data, such as '220 kv/river park line/b/sleeve/protection/fiber channel/fault'. And training the text convolutional neural network, and bringing the training result into the trained convolutional neural network model, wherein the classification result is correct.
Three criteria are generally used to evaluate classification performance: accuracy, recall, F1 value:
Figure BDA0002561750320000071
Figure BDA0002561750320000072
Figure BDA0002561750320000073
the prediction results are shown in the following table, and as can be seen from table 1, the method of the invention has a good effect on short text information of secondary equipment failure. In conclusion, the invention can realize the classification of the faults and can be used for practical engineering application.
TABLE 1 text convolution neural model classification result evaluation index
Figure BDA0002561750320000074
The method provided by the invention eliminates the subjectivity of the traditional classification method for classifying the fault data depending on the experience of operation and maintenance maintainers to a certain extent. And a large amount of fault short text data generated in a production management system is operated around secondary equipment, and related training and analysis of automatic text classification based on a convolutional neural network are carried out. The secondary equipment fault short text data classification method based on the convolutional neural network utilizes the good characteristic screening and analyzing capability of the convolutional neural network model, improves the prediction precision and strengthens the generalization capability of the model.

Claims (5)

1. A secondary equipment fault short text data classification method based on a convolutional neural network is characterized by comprising the following steps: the method comprises the following steps:
(1) establishing a sample data set, collecting fault short text data generated by secondary equipment in an electric power system in the operation process, marking the fault short text data according to different defect grades, and dividing the fault short text data into a training set, a verification set and a test set;
(2) text preprocessing, namely constructing a stop word dictionary and a secondary equipment special dictionary, wherein the stop word dictionary is used for filtering and removing noise in secondary equipment fault short text information, and the noise comprises words and punctuations which have no actual physical significance in secondary equipment fault short text data; the secondary equipment special dictionary is used for identifying attribute data of the secondary equipment, and the attribute data comprises the name, the model, the station name and the route name of the secondary equipment;
(3) word vector modeling, namely establishing a word2vec model for the preprocessed text data, acquiring sample data set secondary equipment fault short text data, performing word vector training, mining context semantic relations among words, and establishing a word and word vector mapping relation;
(4) constructing a multi-size convolutional neural network secondary equipment fault short text data classification model, taking the word vectors trained in the step (3) as an input matrix of the convolutional neural network model, carrying out convolutional operation on the secondary equipment fault short text data matrix according to different word quantity combination modes by adopting a multi-size convolutional kernel to obtain a plurality of phrase sequences, outputting the phrase sequences to a pooling layer, carrying out maximum pooling extraction on features through the pooling layer, reducing feature dimensions, and screening the phrase sequences with the highest weight values;
(5) predicting a classification result, setting a layer of full-connection layer after the structures of the convolution layer and the pooling layer, performing feature extraction and combination on vectors output by different pooling layers, and outputting the combined vectors to a subsequent SoftMax layer for classification and judgment; and obtaining the probability belonging to each fault grade type, obtaining the maximum probability in the probabilities, and outputting the text type corresponding to the maximum probability as the type of the text to be classified.
2. The convolutional neural network-based secondary device failure short text data classification method according to claim 1, characterized in that: dividing the fault short text data in the step (1) into three fault defect characteristic labels of 'serious defect', 'critical defect' and 'general defect' according to the relay protection state evaluation guide rule of the national power grid company, and randomly combining the fault short text data according to the following steps of 7: 2: 1 into a training set, a validation set and a test set.
3. The convolutional neural network-based secondary device failure short text data classification method according to claim 1, characterized in that: the text preprocessing in the step (2) comprises noise filtering, word stem feature extraction, part of speech reduction, entity noun and proper noun identification of short text data of secondary equipment faults; the method comprises the steps of constructing a stop word dictionary and a secondary equipment special dictionary by performing feature sequencing on an original language database text, traversing a language database according to dictionary contents, filtering data noise and extracting data features.
4. The convolutional neural network-based secondary device failure short text data classification method according to claim 1, characterized in that: the construction process of the Word2vec Word vector model in the step (3) is as follows:
(31) expressing the Chinese text into a structured word vector according to the context relationship between words, and adopting WordEmbedding to construct vector mapping so that the semantic information of the word context is not lost by the word vector generated after mapping, and the dimension of the text vector is reduced;
(32) constructing three-layer structures which are an input layer, a hidden layer and an output layer respectively, entering discrete numerical values obtained by the input layer into a linear unit of the hidden layer for training, and finally using SoftMax regression on the output layer;
(33) two neural network model structures, namely a CBOW structure and a Skip-gram structure, are constructed according to the step (31) and the step (32), and are specifically as follows:
(A) CBOW structure model
The CBOW structure model predicts the final output result through the context relation of words in an input layer, supposing that the dimension of the words is V, the context of the input context is C, the size of a dictionary is D, inputting One-hot coding data into a hidden layer structure, and multiplying all discrete data by an input weight matrix WV×NN is the number of the set hidden layer structures, and the obtained vector is multiplied by an output weight matrix W'V×NObtaining the probability distribution with one-dimensional vector, and taking the maximum probability as the predicted intermediate words; continuously selecting words, and training an input weight matrix and an output weight matrix by adopting a gradient descent algorithm so as to reduce errors;
the conditional probability of CBOW for a target word is calculated as follows:
Figure FDA0002561750310000021
the specific maximization formula of the objective function of CBOW is as follows:
(ω,c)∈DlogP(ω,c)
(B) skip-gram structure model
The Skip-gram structure model constructs words of an input layer to make predicted word vectors form a context form, and an objective function of the Skip-gram model is as follows:
Figure FDA0002561750310000022
5. the convolutional neural network-based secondary device failure short text data classification method according to claim 1, characterized in that: the convolutional neural network secondary equipment fault short text information classification model in the step (4) comprises an input layer, a convolutional layer, a pooling layer and an output layer, and specifically comprises the following steps:
the first layer is an input layer: selecting text data to be classified, and finishing the text according to the step (3)Vectorization of data, output matrix I ∈ Rm×nM is the number of words of the text, i.e., the number of rows of the input layer, n is the number of dimensions of the text vector, i.e., the number of columns of the input layer, and each word data is divided into word vectors of equal dimensions according to the division in step (3) so that the number of columns of the input layer is equal, thereby forming a matrix I ∈ Rm×nIn the training process, the word vector is adjusted by a random gradient descent method;
selecting convolution kernels of different sizes, wherein each convolution kernel has multiple convolution kernels, and inputting matrix I ∈ R to input layer respectivelym×nPerforming convolution operation to extract the matrix characteristics of the input layer to obtain a convolution result vector ri(i ═ 1,2,3,4, 5, 6, L), the formula is as follows:
ri=W·Ii:i+h-1
wherein the matrix W represents a weight coefficient, "·" represents a dot product operation;
activating the convolution result through an activation function ReLU, and performing nonlinear processing to obtain a result ciThe formula is as follows:
ci=ReLU(ri+b)
c is toiThe convolutional layer vector c ∈ R is obtained from left to right, in the order from top to bottoms-h+1The formula is as follows:
c=[c1,c2,K,cs-h+1]
the third layer is a pooling layer, which is pooled by maximum pooling method according to the result vector c ∈ R extracted from the convolutional layers-h+1The element with the largest value in the sequence is extracted as the characteristic value pj(j is 1,2,3,4, 5, 6, L, n) and all the characteristic values pjSequentially spliced into a vector p ∈ Rn×1Inputting the data into a fourth output layer, wherein a vector p represents a vector of global features of the text data;
the fourth layer is an output layer: fully connecting the pooling layer with the output layer, taking a vector p of the pooling layer as input, classifying the vector p by adopting a Softmax classifier, and outputting a final classification result; the probability of Softmax classification calculation is as follows:
Figure FDA0002561750310000041
wherein the function L (p)j) And expressing the probability of belonging to the secondary equipment category, selecting the result with the maximum probability, and outputting the defect level of the secondary equipment fault.
CN202010610878.2A 2020-06-30 2020-06-30 Secondary equipment fault short text data classification method based on convolutional neural network Pending CN111767398A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010610878.2A CN111767398A (en) 2020-06-30 2020-06-30 Secondary equipment fault short text data classification method based on convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010610878.2A CN111767398A (en) 2020-06-30 2020-06-30 Secondary equipment fault short text data classification method based on convolutional neural network

Publications (1)

Publication Number Publication Date
CN111767398A true CN111767398A (en) 2020-10-13

Family

ID=72724137

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010610878.2A Pending CN111767398A (en) 2020-06-30 2020-06-30 Secondary equipment fault short text data classification method based on convolutional neural network

Country Status (1)

Country Link
CN (1) CN111767398A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112182249A (en) * 2020-10-23 2021-01-05 四川大学 Automatic classification method and device for aviation safety report
CN112465052A (en) * 2020-12-07 2021-03-09 重庆忽米网络科技有限公司 Equipment fault diagnosis report generation method and system based on convolutional neural network
CN113111183A (en) * 2021-04-20 2021-07-13 通号(长沙)轨道交通控制技术有限公司 Traction power supply equipment defect grade classification method
CN113434667A (en) * 2021-04-20 2021-09-24 国网浙江省电力有限公司杭州供电公司 Text classification method based on distribution network automation terminal text classification model
CN114638558A (en) * 2022-05-19 2022-06-17 天津市普迅电力信息技术有限公司 Data set classification method for operation accident analysis of comprehensive energy system
CN116738323A (en) * 2023-08-08 2023-09-12 北京全路通信信号研究设计院集团有限公司 Fault diagnosis method, device, equipment and medium for railway signal equipment
CN117270482A (en) * 2023-11-22 2023-12-22 博世汽车部件(苏州)有限公司 Automobile factory control system based on digital twin
CN117495338A (en) * 2023-09-30 2024-02-02 国网江苏省电力有限公司信息通信分公司 System fault diagnosis and repair method based on automatic operation and maintenance

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107885883A (en) * 2017-12-01 2018-04-06 北京国信宏数科技有限公司 A kind of macroeconomy field sentiment analysis method and system based on Social Media
CN108399230A (en) * 2018-02-13 2018-08-14 上海大学 A kind of Chinese financial and economic news file classification method based on convolutional neural networks
CN108536801A (en) * 2018-04-03 2018-09-14 中国民航大学 A kind of civil aviaton's microblogging security public sentiment sentiment analysis method based on deep learning
CN108596470A (en) * 2018-04-19 2018-09-28 浙江大学 A kind of power equipments defect text handling method based on TensorFlow frames
CN109977224A (en) * 2019-03-11 2019-07-05 南京邮电大学 The news category method combined based on convolutional neural networks and term vector
CN110134786A (en) * 2019-05-14 2019-08-16 南京大学 A kind of short text classification method based on theme term vector and convolutional neural networks
CN110232188A (en) * 2019-06-04 2019-09-13 上海电力学院 The Automatic document classification method of power grid user troublshooting work order

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107885883A (en) * 2017-12-01 2018-04-06 北京国信宏数科技有限公司 A kind of macroeconomy field sentiment analysis method and system based on Social Media
CN108399230A (en) * 2018-02-13 2018-08-14 上海大学 A kind of Chinese financial and economic news file classification method based on convolutional neural networks
CN108536801A (en) * 2018-04-03 2018-09-14 中国民航大学 A kind of civil aviaton's microblogging security public sentiment sentiment analysis method based on deep learning
CN108596470A (en) * 2018-04-19 2018-09-28 浙江大学 A kind of power equipments defect text handling method based on TensorFlow frames
CN109977224A (en) * 2019-03-11 2019-07-05 南京邮电大学 The news category method combined based on convolutional neural networks and term vector
CN110134786A (en) * 2019-05-14 2019-08-16 南京大学 A kind of short text classification method based on theme term vector and convolutional neural networks
CN110232188A (en) * 2019-06-04 2019-09-13 上海电力学院 The Automatic document classification method of power grid user troublshooting work order

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JANG B 等: "Word2vec convolutional neural networks for classification of news articles and tweets", 《PLOS ONE》 *
黄鑫 等: "卷积神经网络在机械设备故障诊断领域应用与挑战", 《制造技术与机床》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112182249A (en) * 2020-10-23 2021-01-05 四川大学 Automatic classification method and device for aviation safety report
CN112182249B (en) * 2020-10-23 2022-12-13 四川大学 Automatic classification method and device for aviation safety report
CN112465052A (en) * 2020-12-07 2021-03-09 重庆忽米网络科技有限公司 Equipment fault diagnosis report generation method and system based on convolutional neural network
CN112465052B (en) * 2020-12-07 2023-04-07 重庆忽米网络科技有限公司 Equipment fault diagnosis report generation method and system based on convolutional neural network
CN113111183A (en) * 2021-04-20 2021-07-13 通号(长沙)轨道交通控制技术有限公司 Traction power supply equipment defect grade classification method
CN113434667A (en) * 2021-04-20 2021-09-24 国网浙江省电力有限公司杭州供电公司 Text classification method based on distribution network automation terminal text classification model
CN113434667B (en) * 2021-04-20 2024-01-23 国网浙江省电力有限公司杭州供电公司 Text classification method based on distribution network automation terminal text classification model
CN114638558A (en) * 2022-05-19 2022-06-17 天津市普迅电力信息技术有限公司 Data set classification method for operation accident analysis of comprehensive energy system
CN116738323A (en) * 2023-08-08 2023-09-12 北京全路通信信号研究设计院集团有限公司 Fault diagnosis method, device, equipment and medium for railway signal equipment
CN116738323B (en) * 2023-08-08 2023-10-27 北京全路通信信号研究设计院集团有限公司 Fault diagnosis method, device, equipment and medium for railway signal equipment
CN117495338A (en) * 2023-09-30 2024-02-02 国网江苏省电力有限公司信息通信分公司 System fault diagnosis and repair method based on automatic operation and maintenance
CN117270482A (en) * 2023-11-22 2023-12-22 博世汽车部件(苏州)有限公司 Automobile factory control system based on digital twin

Similar Documents

Publication Publication Date Title
CN111767398A (en) Secondary equipment fault short text data classification method based on convolutional neural network
CN110597735B (en) Software defect prediction method for open-source software defect feature deep learning
CN111767397A (en) Electric power system secondary equipment fault short text data classification method
CN111966812B (en) Automatic question answering method based on dynamic word vector and storage medium
CN108549658A (en) A kind of deep learning video answering method and system based on the upper attention mechanism of syntactic analysis tree
CN110895565A (en) Method and system for classifying fault defect texts of power equipment
CN111709244B (en) Deep learning method for identifying cause and effect relationship of contradictory dispute
CN111126057B (en) Case scenario accurate sentencing system of hierarchical neural network
CN108052504A (en) Mathematics subjective item answers the structure analysis method and system of result
CN113065356B (en) IT equipment operation and maintenance fault suggestion processing method based on semantic analysis algorithm
CN112836809A (en) Device characteristic extraction method and fault prediction method of convolutional neural network based on differential feature fusion
CN112883286A (en) BERT-based method, equipment and medium for analyzing microblog emotion of new coronary pneumonia epidemic situation
CN112199496A (en) Power grid equipment defect text classification method based on multi-head attention mechanism and RCNN (Rich coupled neural network)
CN109740164A (en) Based on the matched electric power defect rank recognition methods of deep semantic
CN113420548A (en) Entity extraction sampling method based on knowledge distillation and PU learning
CN114036993A (en) Multi-modal ironic object detection method based on multi-scale trans-modal neural network
CN113204967A (en) Resume named entity identification method and system
CN109063983A (en) A kind of natural calamity loss real time evaluating method based on social media data
CN113378024B (en) Deep learning-oriented public inspection method field-based related event identification method
CN113868422A (en) Multi-label inspection work order problem traceability identification method and device
CN112559741B (en) Nuclear power equipment defect record text classification method, system, medium and electronic equipment
CN113111183A (en) Traction power supply equipment defect grade classification method
CN116029295A (en) Electric power text entity extraction method, defect positioning method and fault diagnosis method
CN114020901A (en) Financial public opinion analysis method combining topic mining and emotion analysis
CN114547294A (en) Rumor detection method and system based on comprehensive information of propagation process

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201013

RJ01 Rejection of invention patent application after publication