A kind of electric power customer service work order emotion quantitative analysis method based on Word2Vec
Technical field
The present invention relates to a kind of electric power customer service work order analysis method more particularly to a kind of electric power customer services based on Word2Vec
Work order emotion quantitative analysis method.
Background technique
With the development of the social economy, power system reform deepens constantly, power supply enterprise only adheres to is with client
The heart is improved customer satisfaction, and Market Competition advantage could be obtained.And 95598 important channel as customer communication and communication
Window realizes quantization client's demand emotion by carrying out depth excavation to the client characteristics, the emotion information that imply in customer service work order
Analysis, is conducive to the focus for quickly understanding client, is conducive to identify potential complaint client according to client's emotion tendency,
Be conducive to support work order urgency priority processing and analysis, be conducive to the implementation result for differentiating a certain business according to feedback information,
These will all have a very important significance electric power enterprise and client.
Under traditional approach, for work order demand sentiment analysis, need to set up several demand analysis sole duties to client's demand work
It is single to carry out manual analysis and processing, expend a large amount of human costs.
Sentiment analysis is mainly the natural language processing towards non-structured text, by believing the emotion hidden in text
Breath is analyzed, and viewpoint and attitude that people holds things or event are excavated.Currently, universal sentiment analysis lays particular emphasis on feelings
Feel polarity classification, being represented simply as emotion is positive or negative sense;And emotional intensity embodiment then needs to quantify feeling means
It realizes.Traditional sentiment analysis process is broadly divided into three parts: Feature Engineering, feature selecting and machine learning algorithm application.It is partially
To in improving accuracy rate using engineering characteristics or polarity transition rule.And the emotion quantitative analysis of text is calculated at home
Outer research is simultaneously few, and majority research lays particular emphasis on emotion tendency classification.
Traditional sentiment analysis method is based on bag of words (bag-of-word) feature and word frequency statistics, while most research sides
Emotion tendency classification is overweighted, there are following tripartite's planar defects for this method: (1) lacking the context sequence and semantic reason of word
Solution;(2) ignore difference between the semanteme of word;(3) emotion power difference can not be embodied by laying particular emphasis on emotion tendency classification.In a word cannot
Effectively screen emotional intensity.
Summary of the invention
The technical problem to be solved in the present invention and the technical assignment of proposition are to be improved and improved to prior art,
A kind of electric power customer service work order emotion quantitative analysis method based on Word2Vec is provided, to reach the mesh for effectively screening emotional intensity
's.For this purpose, the present invention takes following technical scheme.
A kind of electric power customer service work order emotion quantitative analysis method based on Word2Vec, it is special in conjunction with electric power customer service work order text
Sign carries out classification combing, data cleansing to history electric power customer service work order and dissatisfied work order, then combs to be formed based on Baidu's dictionary
Polynary emotion dictionary is initialized, work order text participle is carried out using reverse maximum matching algorithm, is based on Word2Vec neural network
The term vector of the positive word of client's demand semanteme, passiveness word, negative word, degree adverb and word order is merged in building, passes through history
Customer service work order carry out machine learning training generate fusion demand emotion learning model, based on the part of speech close and distant relation in model come
Part of speech corpus is expanded, emotion quantum chemical method is carried out using similarity word order matrix quantization algorithm, completes customer service work order emotion amount
Change analysis, determines urgency level;Wherein emotion quantization formula are as follows:
AN-P=WordsNearest (Lnegative, Lpositive, 10000)
AP-N=WordsNearest (Lpositive, Lnegative, 10000)
In formula: SmoleculeIndicate all accumulative summations of word emotion quantization molecule, T indicates single all word total amounts of demand, SEQ
Indicate single work order demand similarity emotion quantization score, words [i] is indicated in the work order demand array by participle i-th
Word, AN-PBased on passive word to the nearest word ordered set of positive word association, AP-NBased on positive word to the nearest word of passive word association
Language ordered set, LdegreeIndicate degree adverb set, LnayIndicate negative set of words, LpositiveIndicate positive set of words,
LnegativeIndicate passive set of words, LneutralIndicate neutral set of words,Indicate the i-th word in AP-NRow in ordered set
Column position,Indicate the i-th word in AN-PArrangement position in ordered set, WordsNearest are then the space Word2vec
Incidence relation method, δlow、δneutral、δintervalIt respectively indicates emotion and quantifies passive lower coefficient limit, in emotion quantization under property coefficient
Limit, emotion quantify neutral section.
The technical program is based on Word2Vec neural network and carries out deep learning to historic customer work order, passes through learning model
Middle term vector space length characterizes close semantic and word close and distant relation, while passing through the same polarity parent after Word2Vec modeling
Thin relationship constructs word order, indicates the strong or weak relation of same Sentiment orientation by word order.By above-mentioned technological means in emotion
Context sequence, the emotion power difference between semantic understanding and word that word is merged in quantum chemical method, according to history trouble ticket demand with
Semantic similarity understands, two word emotion power differences of same part of speech influence emotion quantum chemical method result.
It is enriched under unsupervised mode and expands limited Emotional Corpus, simplified a large amount of manpower emotion corpus and comb work,
The emotion corpus of abundant missing is expanded based on limited Emotional Corpus.Based on limited emotion corpus, it is special to merge power business
Have dictionary, based on Word2Vec neural network to history trouble ticket deep learning formed positive word, passive word, neutral words, negative word,
Space correlation relationship between degree adverb, i.e., the tendentiousness word space close and distant relation centered on limited Emotional Corpus, from
And it further refines and expands polynary part of speech corpus.
The technical program can effectively distinguish emotion power difference to each work order demand, realize emotion quantitative analysis evaluation and
Non- complaint tendentiousness classification, to determine the urgency level of business processing.Emotion is added to intervene in line computation and analysis, it can be entirely square
Position ground is that client considers, understands customer anger, by the anticipation to client's emotion, intervenes differentiation emotion peace for machine in due course
It comforts and dredges, reduce client's on-line consulting time, improve customer satisfaction.
As optimization technique means: δlow、δneutral、δintervalRespectively 0.2,0.5,0.1.
As optimization technique means: electric power customer service work order emotion quantitative analysis method includes being based on Word2Vec similarity feelings
Feel word and expand association, for initializing multivariate classification dictionary, classified lexicon be divided into positive word, passive word, neutral words, negative word,
Degree adverb realizes that close emotion word expands association by Word2Vec similarity matrix, while these words merge client's demand
Semantic part of speech tendency and word order strong or weak relation;By forming positive word to work order demand progress Word2Vec deep learning, disappearing
Pole word, neutral words, negative word, the space correlation relationship between degree adverb.
As optimization technique means: electric power customer service work order emotion quantitative analysis method includes polynary Emotional Corpus building,
For word-based vector space apart from close and distant relation, enriches and expand initialization emotional semantic classification word, building fusion client's demand emotion
The multi-source Emotional Corpus of tendency;During carrying out text feature participle to electric power customer service work order and dissatisfied work order,
Preliminary corpus combing is formed, is related to actively using reverse maximum matching algorithm based on Baidu's dictionary and the proprietary dictionary of electric power
Word, passive word, neutral words, negative word, five class parts of speech classification of degree adverb;Term vector is constructed by Word2Vec, by machine
Learning training forms the tendentiousness word space length relationship centered on preliminary Emotional Corpus, so that it is more to further refine expansion
First part of speech corpus.
As optimization technique means: the building of polynary Emotional Corpus the following steps are included:
A) electric power customer service work order text feature is combined, classification comb is carried out to history electric power customer service work order and dissatisfied work order
Reason, data cleansing, comb to form word material based on Baidu's dictionary;
B) initialization corpus combing, corpus are divided into positive word, passive word, neutral words, negative word, degree adverb;
C) corpus dictionary is initialized;
D) the work order demand information of one section of timing is inputted;
E) deep learning is segmented by Word2Vec neural network work order, obtains learning model;
F) part of speech is separated by the close and distant incidence relation of Word2Vec;
G) polynary corpus is updated.
As optimization technique means: step e), including following sub-step:
E01) definition study model parameter, parameter include regular parameter, the number of iterations, learning rate, minimum word frequency, window
Size;
E02 Word2Vec learning model) is created;
E03 the work order demand information of one section of timing) is inputted;
E04) demand text is segmented;
E05) judge whether to be less than study the number of iterations, if entering step e06 less than the number of iterations), if more than iteration
Number then enters step e08);
E06) real-time synchronization recording learning progress, including currently learn number, learning time interval;
E07) Word2Vec network stochastic gradient descent studies in groups, including is biased to update with weight study;It is back to step
e05);
E08 learning tasks) are completed by learning model write-back library.
As optimization technique means: in step e01), the number of iterations 100;Learning rate is 0.001.
As optimization technique means: after emotion quantum chemical method, carrying out the normalized of data, pass through normalized
Finally obtain work order emotion quantized result.
The utility model has the advantages that
One, the technical program realizes feeling polarities distance zone quantitative relationship with Word2Vec and similarity word order matrix,
Break traditional sentiment analysis method based on bag of words feature and word frequency statistics, the context sequence of technological incorporation history trouble ticket,
The key factors such as the difference between semantic understanding, semanteme are realized that emotion tendency is categorized into the transformation of emotion quantitatively evaluating, are supported online
Real-time emotion operational analysis promotes daily client's emotion anticipation and Risk-warning ability;It realizes from client's emotion angle to work order
Urgency level divides, and distinguishes the urgency level of work order;The daily emotion unusual fluctuation early warning of work order, control visitor are realized from complaint risk angle
Family emotion complaint risk;It can apply and attend a banquet in service process in day electronic, insertion emotion is managed in real time in line computation and analysis
Customer anger is solved, by the anticipation to client's emotion, machine intervention differentiation emotion is pacified and dredged in due course, is reduced client and is existed
Line seeks advice from the time, improves customer satisfaction.
Two, the technical program can realize that assembly type encapsulates, and scalability is preferable, adaptable, while to model training process
Unified monitoring is carried out, all modeling parameters functions are realized to configure by front page layout and be completed, and are reduced developer's pressure, are mentioned
Rise demand response timeliness;
Three, the emotion quantitative analysis technology based on Word2Vec and similarity word order matrix merges the proprietary word of electric power customer service
Library and peculiar service application, can carry out demand personalization customization, and function has the visual control of model training overall process, emotion meter
The characteristics of calculating real-time operation, region/business emotion variance analysis on line.
Detailed description of the invention
Fig. 1 is general flow chart of the invention.
Fig. 2 is the proprietary term vector space length relational graph of client's demand of the invention.
Fig. 3 is polynary Emotional Corpus building flow chart of the invention.
Fig. 4 is the online electronics seat process flow diagram of emotion embedded intelligence of the invention.
Fig. 5 is service request arrearage telegram in reply RT register traffic emotion quantization tendency chart of the invention.
Fig. 6 is emotion quantitative analysis machine learning training process monitoring function figure of the invention.
Fig. 7 is work order emotion quantitative analysis functional diagram of the invention.
Specific embodiment
Technical solution of the present invention is described in further detail below in conjunction with Figure of description.
As shown in Figure 1, the technical program is realized based on Word2Vec depth learning technology, in conjunction with electric power customer service work order text
Feature carries out classification combing, data cleansing to history electric power customer service work order and dissatisfied work order, then combs shape based on Baidu's dictionary
At polynary emotion dictionary is initialized, work order text participle is carried out using reverse maximum matching algorithm, is based on Word2Vec nerve net
The term vector of the positive word of client's demand semanteme, passiveness word, negative word, degree adverb and word order is merged in network building, by going through
History customer service work order carries out the learning model that machine learning training generates fusion demand emotion, based on the part of speech close and distant relation in model
Part of speech corpus is expanded, similarity word order matrix quantization algorithm is finally researched and developed and completes customer service work order emotion quantitative analysis;
Wherein similarity word order matrix emotion quantization algorithm are as follows: achievement is refined according to above-mentioned Emotional Corpus, is based on
The building term vector polarity spatial relationship of Word2Vec neural network quantifies to calculate by the similarity matrix emotion of independent research
Method realizes the customer perception emotion quantum chemical method of each demand work order.Specific algorithm is as follows:
AN-P=WordsNearest (Lnegative, Lpositive, 10000)
AP-N=WordsNearest (Lpositive, Lnegative, 10000)
In formula: SmoleculeIndicate all accumulative summations of word emotion quantization molecule, T indicates single all word total amounts of demand, SEQ
Indicate single work order demand similarity emotion quantization score, words [i] is indicated in the work order demand array by participle i-th
Word, AN-PBased on passive word to the nearest word ordered set of positive word association, AP-NBased on positive word to the nearest word of passive word association
Language ordered set, LdegreeIndicate degree adverb set, LnayIndicate negative set of words, LpositiveIndicate positive set of words,
LnegativeIndicate passive set of words, LneutralIndicate neutral set of words,Indicate the i-th word in AP-NRow in ordered set
Column position,Indicate the i-th word in AN-PArrangement position in ordered set, WordsNearest are then the space Word2vec
Incidence relation method, δlow、δneutral、δintervalIt respectively indicates emotion and quantifies passive lower coefficient limit, in emotion quantization under property coefficient
Limit, emotion quantify neutral section, default setting difference 0.2,0.5,0.1.
General design idea is as shown in Figure 1.
Itself comprising steps of
One, work order participle cluster;Word is divided into positive word, passive word, neutral words, negative word, degree adverb;
Two, polynary corpus refines;
Three, deep learning models;
Four, model learning training;Using Word2Vec neural network, momentum coefficient 0.9, learning rate 0.1, activation
Function Relu, stochastic gradient descent algorithm, 100 wheel iterative learnings;
Five, emotion quantum chemical method is based on Word2Vec term vector similarity measure using similarity matrix emotion quantization algorithm
Change, emotion tendency quantum chemical method, normalized.
The technical program is by expanding association, the building of polynary Emotional Corpus, phase based on Word2Vec similarity emotion word
Realize that one kind that three key links are completed is based on Word2Vec and similarity word order square like degree word order matrix emotion quantization algorithm
The electric power customer service work order emotion quantitative analysis method of battle array effectively supports work order urgency level to screen and complains emotion risk analysis.
1) expand association based on Word2Vec similarity emotion word: the process realizes initialization multivariate classification dictionary (actively
Word, passive word, neutral words, negative word, degree adverb), it realizes that close emotion word is expanded by Word2Vec similarity matrix and closes
Connection, while the part of speech tendency and word order strong or weak relation of these words fusion client's demand semanteme.
Word2Vec is one for handling the double-deck neural network of text.Its input is corpus of text, and output is then
One group of vector: the feature vector of word in the corpus.Word2Vec realize in vector space by the vector of word by similitude into
Row grouping, numeric form in a distributed manner is come features such as the contexts that indicates word.Such as provide enough data, usage and up and down
Text, Word2Vec can carry out the prediction of pin-point accuracy according to past experience to the meaning of word.Such prediction result can be used
In establishing contacting between a word and other words.Word2Vec measures the cosine similarity of word, and no similitude is expressed as 90 degree
Angle, and similarity be 1 it is complete similar, be expressed as 0 degree of angle, that is, being completely coincident.
Positive word, passive word, neutral words, no is formed by carrying out Word2Vec deep learning to work order demand in nearly 2 years
Determine the space correlation relationship between word, degree adverb, as shown in Figure 2.
2) polynary Emotional Corpus building: the process, which is realized, is based on above-mentioned term vector space length close and distant relation, abundant to open up
Exhibition initialization emotional semantic classification word, the polynary Emotional Corpus of building fusion client's demand Sentiment orientation.To electric power customer service
During work order and dissatisfied work order carry out text feature participle, based on Baidu's dictionary and the proprietary dictionary of electric power, using inversely most
Big matching algorithm forms preliminary corpus combing, relates generally to positive word, passive word, neutral words, negative word, degree adverb five
Class parts of speech classification.In order to realize that abundant Emotional Corpus is expanded in optimization, term vector is constructed by Word2Vec, by machine learning
Training forms the tendentiousness word space length relationship centered on preliminary Emotional Corpus, expands polynary word to further refine
Property corpus, part of speech corpus dictionary is shown in Table 1, and Word2Vec neural network model training parameter is as shown in table 2, specific real
Existing process is as shown in Figure 3.
More than 1 yuan of Emotional Corpus dictionary example of table
2 Word2Vec model training parameter of table
The emotion power difference between context sequence, semantic understanding and word to merge word in emotion quantum chemical method, i.e.,
How to be understood according to history trouble ticket demand and semantic similarity, two word emotion power differences of same part of speech quantify to influence emotion
Calculated result.The technical program is based on Word2Vec neural network and carries out deep learning to historic customer work order, by learning mould
Term vector space length characterizes close semantic and word close and distant relation in type, as shown in Fig. 2, after being modeled simultaneously by Word2Vec
Same polarity close and distant relation construct word order, the strong or weak relation of same Sentiment orientation is indicated by word order.
Limited Emotional Corpus is expanded to enrich under unsupervised mode, i.e., how to simplify a large amount of manpower emotion corpus combs
Science and engineering is made, and the emotion corpus of abundant missing is expanded based on limited Emotional Corpus.It needs based on limited emotion corpus, fusion
The proprietary dictionary of power business;The technical program be based on Word2Vec neural network to history trouble ticket deep learning formed positive word,
Passive word, neutral words, negative word, the space correlation relationship between degree adverb, i.e. inclining centered on limited Emotional Corpus
Tropism word space close and distant relation expands polynary part of speech corpus to further refine.
For realize emotion power difference is effectively distinguished to each work order demand, i.e., how to realize emotion quantitative analysis evaluate and
Non- complaint tendentiousness classification.For client's emotional appeals angle, emotion degree of strength determines business processing to a certain extent
Urgency level, feeling polarities be more biased to it is passive be then easy to cause to complain, therefore emotion quantum chemical method is particularly important.This technology side
Case by refining achievement according to above-mentioned Emotional Corpus, close by the building term vector polarity space based on Word2Vec neural network
System realizes the customer perception emotion quantization meter of each demand work order by the similarity matrix emotion quantization algorithm of independent research
It calculates, is specifically shown in above-mentioned similarity word order matrix emotion quantization algorithm and realizes.
Electric power customer service work order emotion quantitative analysis based on Word2Vec and similarity word order matrix is in addition to effectively supporting work
Single urgency level is screened and is complained outside emotion risk analysis, it may also be used in day electronic seat call, as shown in figure 4, each
In the day electronic seat call of net provincial company, emotion is added and is intervened in line computation and analysis, can be examined in all directions for client
Consider, understand customer anger, by the anticipation to client's emotion, intervenes differentiation emotion for machine in due course and pacify and dredge, subtract
It few client's on-line consulting time, improves customer satisfaction.
The present invention is segmented by accepting content to work order, in conjunction with similarity matrix emotion quantization algorithm, realize to by
One participle emotion quantization is decomposed, and specific example is as shown in table 1, finally obtains work order emotion quantized result by normalized,
The emotion quantization score difference of each type of service work order is obvious simultaneously, can realize from client's emotion angle to work order urgency level
It divides, by the sentiment analysis to work order client's demand, the urgency level of work order can be distinguished.Same business emotion score is got over
Low, the urgency level of work order is higher, theoretically needs preferentially to be handled.
3 work order of table accepts content emotion quantization example
As shown in figure 5, can be seen that service request arrearage telegram in reply RT register traffic client emotion is total April from 7 days, line on the 14th
Body tends to balance, and what is wherein had outstanding performance in figure is rapid decrease on April 25, and the range of decrease 11.16% passes through specific aim analysis 4
Moon arrearage on the 25th telegram in reply registration work order, same type work order is found compared to before, and there are two Zhang Gongs singly clearly to mark medium/high potential throwing
It tells tendency, while being expressed in client's demand wish and further complaining tendency.Therefore, have by the daily emotion unusual fluctuation difference of work order
Complaint risk early warning and control conducive to building based on client's emotion.
Above it is found that the technical program has the advantages that the following in practical applications: first is that with Word2Vec and similar
It spends word order matrix and realizes feeling polarities distance zone quantitative relationship, break traditional emotion based on bag of words feature and word frequency statistics
Analysis method, the context sequence of technological incorporation history trouble ticket, semantic understanding, it is semantic between the key factors such as difference, realize feelings
Sense tendentiousness is categorized into the transformation of emotion quantitatively evaluating, supports online emotion operational analysis in real time, promotes daily client's emotion anticipation
With Risk-warning ability;Second is that the DL4j depth learning technology breach that can be increased income with mainstream, the functional realization assembly type envelope of institute
Dress, scalability is preferable, adaptable, while carrying out unified monitoring to model training process, and all modeling parameters function realizations can
It is completed with being configured by front page layout, reduces developer's pressure, promote demand response timeliness, wherein model training monitors page
Face such as Fig. 6 emotion quantitative analysis machine learning training process monitoring function figure;Third is that it is as based on Word2Vec and similarity
The emotion quantitative analysis technology of word order matrix merges the proprietary dictionary of electric power customer service and peculiar service application, and carries out demand individual character
Change customization, function is poor with real-time operation, region/business emotion on the visual control of model training overall process, affection computation line
Fig. 7 work order emotion quantitative analysis functional diagram is seen at the features such as different analysis, concrete function interface;And this key technology is in practical application
It is mainly shown as following three aspect: (1) being realized from client's emotion angle and work order urgency level is divided, distinguish the urgent journey of work order
Degree;(2) the daily emotion unusual fluctuation early warning of work order is realized from complaint risk angle, manage client's emotion complaint risk.(3) it can apply
It attends a banquet in service process in day electronic, insertion emotion understands customer anger, by client's feelings in line computation and analysis in real time
The anticipation of sense, machine intervention differentiation emotion is pacified and is dredged in due course, reduces client's on-line consulting time, promotes customer satisfaction
Degree.
Figure 1 above, a kind of electric power customer service work order emotion quantitative analysis method based on Word2Vec is the present invention shown in 2
Specific embodiment, embodied substantive distinguishing features of the present invention and progress, can according to it is actual use needs, of the invention
Under enlightenment, the equivalent modifications of shape, structure etc., the column in the protection scope of this programme are carried out to it.