CN106529804A - Client complaint early-warning monitoring analyzing method based on text mining technology - Google Patents

Client complaint early-warning monitoring analyzing method based on text mining technology Download PDF

Info

Publication number
CN106529804A
CN106529804A CN201610984456.5A CN201610984456A CN106529804A CN 106529804 A CN106529804 A CN 106529804A CN 201610984456 A CN201610984456 A CN 201610984456A CN 106529804 A CN106529804 A CN 106529804A
Authority
CN
China
Prior art keywords
complaint
text
data
grade
early warning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610984456.5A
Other languages
Chinese (zh)
Other versions
CN106529804B (en
Inventor
胡宏
高昇宇
倪炜
常飞
秦韶杨
施萱轩
汤宁
张玮
梁明
于涛
曹仁红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
State Grid Jiangsu Electric Power Co Ltd
Nari Information and Communication Technology Co
Nanjing Power Supply Co of State Grid Jiangsu Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
State Grid Jiangsu Electric Power Co Ltd
Nari Information and Communication Technology Co
Nanjing Power Supply Co of State Grid Jiangsu Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, State Grid Jiangsu Electric Power Co Ltd, Nari Information and Communication Technology Co, Nanjing Power Supply Co of State Grid Jiangsu Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201610984456.5A priority Critical patent/CN106529804B/en
Publication of CN106529804A publication Critical patent/CN106529804A/en
Application granted granted Critical
Publication of CN106529804B publication Critical patent/CN106529804B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0203Market surveys; Market polls
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Game Theory and Decision Science (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Data Mining & Analysis (AREA)
  • Educational Administration (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a client complaint early-warning monitoring analyzing method based on text mining technology. The method comprises the steps of a text data standardizing step for converting recorded text data into data with a standard data format in a unified rule; and a standard data analysis early-warning step, analyzing the standard data format through establishing a complaint analysis grade clustering model, determining a risk grade according to the clustering result, and transmitting early-warning according to the risk grade. The complaint early-warning monitoring analyzing method satisfies a precondition of ensuring client satisfaction degree and furthermore has advantages of greatly reducing workload in manual sampling observation, effectively improving passive after-event tracking responsibility-determining management mode and facilitating before-event active server in a targeted manner, thereby realizing a professional management requirement for in-time response for client demands.

Description

Customer complaint early warning and monitoring analysis method based on Text Mining Technology
Technical field
The present invention relates to the customer complaint early warning and monitoring analysis method based on Text Mining Technology, art is electric power row Industry client's crisis management field.
Background technology
With the progressively in-depth of power system reform, electric power sale market competition aggravation, in the urgent need to power supply enterprise is rapid Change traditional mode of thinking and mode of operation, further establish market-oriented service awareness, Business Innovation service mode is lifted Customization, personalized service level, the trust of Win Clients, it is ensured that the market share.Meanwhile, with common people's sense of independence and right-safeguarding The continuous lifting of consciousness, power supply enterprise's bidding price adjustment each time, service entries change even breakdown repair, all receives vast The close attention of the common people.
Used as the important window with customer communication, communication, 95598 customer service systems have recorded the customer information of magnanimity. Data in the system are broadly divided into structural data and unstructured data.At present, for the structural data in system, it is Unite by complaining the aspects such as quantity, the satisfaction marking of client or issue handling timeliness to carry out statistical analysiss.It is anti-for client Feedforward information unstructured data mainly manually to inspect by random samples based on combing, ask by the grasp client's focus of attention being not easy to promptly and accurately Topic;And be only limitted to carry out statistical analysiss from dimensions such as time, region, types of service so that what monitoring was analyzed becomes more meticulous degree not It is enough;Although text data to complaining work order carries out artificial Source Tracing one by one simultaneously, lacks automatic monitoring analysis, and side Overweight and call to account afterwards, it is impossible to realize early warning in advance.
The content of the invention
The technical problem to be solved is to overcome prior art not enough, there is provided a kind of to be based on Text Mining Technology Complaint early-warning monitoring method, can in time, accurately grasp client feedback hot issue for departments that are in charge of manging enterprises, and in advance The requirement of early warning customer complaint risk, throws to client in client feedback information numerous and complicated, expression way lower realization versatile and flexible The early warning of risk is told, solves the problems, such as that at present artificial carding efficiency is low and the Passive Management called to account of tracing to the source afterwards, so as to protect The complaint risk grade forecast to every work order is demonstrate,proved, has been easy to carry out in time and is taken the initiative in offering a hand, improve CSAT.
For solving above-mentioned technical problem, the technical solution used in the present invention is:
A kind of complaint early warning and monitoring analysis method based on Text Mining Technology, including:
The text data of typing is changed into the normalized number of uniform rules according to mould by step 1, text data standardizing step Formula;
Step 2, normalized number complain analytical grade Clustering Model to normalized number evidence according to analysis and early warning step by setting up Pattern is analyzed, and divides complaint risk grade according to cluster result, sends corresponding early warning further according to risk place grade.
Text data is converted into by structuring, standardized standardization expression formula using text data standardizing step, just Process in the later stage and apply;Code requirement data analysiss warning step realizes the judgement and early warning to customer complaint risk class, Artificial combing and differentiation is saved, work efficiency is effectively increased.
Used as the further limits scheme of the present invention, text data standardizing step includes:
Step 1.1, text initial processing step carry out participle and denoising to the text data of typing, obtain each Key word in text data;
Each key word is carried out vectorization and makees normalized by step 1.2, Text eigenvector step, is obtained By each key word WiIn file djIn vectorization normalization result set up real-valued matrix, key word WiIn file djIn Vectorization normalization result be:
In formula:N represents total number of files amount;NiRepresent comprising key word WiQuantity of documents;N is key word total degree;WiFor I-th key word;djRepresent j-th file;tfijFor key word WiIn file djIn word frequency;For relevant Keyword is in file djIn word frequency quadratic sum;For total number of files amount and comprising key word WiThe quantity of file Ratio is taken the logarithm after adding adjustment item 0.01;
Step 1.3, text data similarity matching step, according to the real-valued matrix set up, are calculated using the cosine law Cosine similarity between each key word, and COS distance nearest text data is matched, form near synonym dictionary;
Step 1.4, normalized number according to generation step, by the text data for matching according to setting unified standard pattern Generate normalized number evidence.
Used as the further limits scheme of the present invention, normalized number includes according to analysis and early warning step:
Step 2.1, Sentiment orientation degree calculation procedure carry out Judgment by emotion and are divided into positive, passive to normalized number evidence With three class of center;
Step 2.2, complaint risk grade classification step, according to customer grade, actively type of service, tendency degree and complaint History parameters are set up and complain analytical grade Clustering Model, and formulate complaint risk hierarchy rules according to cluster result;
Step 2.3, complain analysis and early warning step, according to formulate complaint risk hierarchy rules, preference pattern variable parameter, Classification learning model is set up using Bayes's classification, the throwing to unknown text data is realized by the training to classification learning model Tell the prediction of risk class.
Used as the further limits scheme of the present invention, the specific works step of Sentiment orientation degree calculation procedure is:
(1) conjunction and negative word dictionary are set up;
(2) conjunction and negative word, and labelling corresponding words are extracted according in from normalized number according to conjunction and negative word dictionary In position of the normalized number according in;
(3) existing sentiment dictionary storehouse is matched, obtains polarity and its emotion score value of vocabulary;
(4) by conjunction position, determine front sentence and rear sentence proportion, further according to negative word position judgment double denial with And the polarity inversion of neighbouring vocabulary;
(5) using vocabulary polarity and its emotion score value bring conjunction and negative word into after to normalized number according to adding up Obtain affection computation scoring;
(6) circulation step (2) to (5), if affection computation scoring be positive for canonical, are that to bear be then passive, otherwise in The heart.
As the further limits scheme of the present invention, complaint risk hierarchy rules for according to cluster result by complaint risk Grade classification is high-risk pole, hazard class, has complaint tendency level, general level and without this five risk class of complaint tendency level.
As the further limits scheme of the present invention, need to verify rule complaint risk hierarchy rules are formulated, The classification learning model that checking collection data input is set up, obtains the complaint risk grade of each text data in checking collection data, And result set is compared with corresponding level data in checking collection data, the accuracy of computation model prediction.
The beneficial effects of the present invention is:Using Text Mining Technology, text data is converted into into structuring, standardized Standardization expression formula;Secondly sentiment analysis are carried out to the text message of client feedback, and calculates its Sentiment orientation degree;Finally utilize There is the classification learning algorithm of supervision, set up and complain early warning and monitoring model, realize the judgement and early warning to customer complaint risk class.
Description of the drawings
Fig. 1 is standardization expression formula Establishing process schematic diagram;
Fig. 2 is complaint risk rating calculation flow chart proposed by the present invention;
Fig. 3 is complaint Early-warning Model establishment step schematic diagram proposed by the present invention;
Fig. 4 is complaint Early-warning Model checking schematic flow sheet proposed by the present invention;
Fig. 5 is each region work order number scattergram;
Fig. 6 is each grade work order data profile.
Specific embodiment
Below in conjunction with the accompanying drawings, the present invention is elaborated:
A kind of complaint early warning and monitoring analysis method based on Text Mining Technology of the present invention, walks including text data standardization , according to analysis and early warning step, wherein, text data standardizing step, for changing into system by the text data of typing for rapid and normalized number The standardization data pattern of one rule;Normalized number complains analytical grade to cluster mould according to analysis and early warning step for passing through to set up Type is analyzed to data pattern of standardizing, and is divided complaint risk grade according to cluster result, is sent according to risk place grade Corresponding early warning.
Text data is converted into by structuring, standardized standardization expression formula using text data standardizing step, just Process in the later stage and apply;Code requirement data analysiss warning step realizes the judgement and early warning to customer complaint risk class, Artificial combing and differentiation is saved, work efficiency is effectively increased.
Text data standardizing step includes text initial processing step, Text eigenvector step, text data phase Like property matching step and normalized number according to generation step.The concrete process step of text data standardizing step as shown in figure 1, The text data (95598 work order data) of typing is carried out into participle, denoising first;Then carry out vectorization and be processed into vector Matrix, word correlation matrix;Then it is associated analysis and generates near synonym dictionary;According to type of service and near synonym dictionary to text Data carry out standardization expression.
Wherein, text initial processing step, carries out participle and denoising for the text data to typing, obtains each Key word in text data;
Text eigenvector step, for each key word being carried out vectorization and making normalized, obtains by each Individual key word WiIn file djIn vectorization normalization result set up real-valued matrix, key word WiIn file djIn to Quantifying normalization result is:
In formula:N represents total number of files amount;NiRepresent comprising key word WiQuantity of documents;N is key word total degree;WiFor I-th key word;djRepresent j-th file;tfijFor key word WiIn file djIn word frequency;For relevant Keyword is in file djIn word frequency quadratic sum;For total number of files amount and comprising key word WiThe quantity of file Ratio is taken the logarithm after adding adjustment item 0.01;
Text data similarity matching step, for according to the real-valued matrix set up, calculating each pass using the cosine law Cosine similarity between keyword, and COS distance nearest text data is matched, form near synonym dictionary;
Normalized number according to generation step, for by the text data for matching according to setting unified standard schema creation Normalized number evidence.
Scheme is further disclosed as the present invention, and normalized number includes according to analysis and early warning step:
Sentiment orientation degree calculation procedure, for normalized number according to carry out Judgment by emotion and be divided into it is positive, passive and in Three class of the heart;
Complaint risk grade classification step, for according to customer grade, actively type of service, tendency degree and complaint history Parameter is set up and complains analytical grade Clustering Model, and formulates complaint risk hierarchy rules according to cluster result;Wind is complained formulating Dangerous hierarchy rules need to verify rule that the classification learning model for setting up checking collection data input obtains checking collection number The complaint risk grade of each text data according in, and result set is compared with corresponding level data in checking collection data, The accuracy of computation model prediction;
Complain analysis and early warning step, for according to formulate complaint risk hierarchy rules, preference pattern variable parameter (if any Without complaint history, type of service and customer grade), classification learning model is set up using Bayes's classification, by taxonomy The prediction of the complaint risk grade to unknown text data is realized in the training for practising model.
The present invention is based on concretely comprising the following steps that the complaint early warning and monitoring analysis system of Text Mining Technology is realized:
(1) set up standardization expression formula
1st, Chinese word segmentation
Based on 95598 work order data, set up participle corpus and special symbol table, to expect storehouse in adjacent co-occurrence each The frequency of combinatorics on words is counted, and calculates its degree of association.Computing formula is as follows:
Wherein, M represents expectation storehouse number of words, NARepresent the number of times that A occurs in storehouse is expected, NBRepresent that B occurs in storehouse is expected Number of times, NABRepresent the number of times that AB occurs in storehouse is expected together.
2nd, Text eigenvector
A part of maximally efficient feature is selected from 95598 work order content of text so that the dimension in new feature space Luv space dimension is often much smaller than, the further purification to Text eigenvector is realized, and is being kept the feelings of original text implication Under condition, content of text, and succinct characteristic vector can be most fed back in calculating.By doing normalized to text feature item, mitigate Impact of the different length text to Text similarity computing result.Computing formula is as follows:
In formula:N represents total number of files amount;NiRepresent comprising key word WiQuantity of documents;N is key word total degree;WiFor I-th key word;djRepresent j-th file;tfijFor key word WiIn file djIn word frequency;For relevant Keyword is in file djIn word frequency quadratic sum;For total number of files amount and comprising key word WiThe quantity of file Ratio is taken the logarithm after adding adjustment item 0.01.
3rd, according to the real-valued matrix after term vector conversion, using the cosine law, the cosine calculated between each phrase is similar Degree, and COS distance nearest word is matched, form near synonym dictionary.
4th, set up standardization expression formula
Classify with reference to 95598 system business, realize the canonical representation to client feedback text message, shape is such as:Complain-super Cui personnel-attitude.
(2) set up and complain early warning and monitoring model
1st, calculate Sentiment orientation degree
(1) Dalian University of Technology's sentiment dictionary is based on, using 95598 work order classs of service as praising text to repair as supplement Change, while setting up conjunction and negative word dictionary, set up emotion training storehouse.
(2) extraction conjunction and negative word from 95598 work orders, and labelling corresponding words position.
(3) sentiment dictionary is matched, determines that vocabulary polarity and its emotion score value, vocabulary polarity and emotion score value are emotion Existing basic parameter in dictionary.
(4) by conjunction position, front sentence and rear sentence proportion are determined, while dual no according to the interpretation of negative word position Determine, and neighbour enters the polarity inversion of vocabulary.
(5) the work order text emotion that adds up calculates scoring.
(6) circulation step (2) is to (5), if canonical is positive, be it is negative be then passiveness, otherwise centered on.
2nd, formulate complaint risk grade
By work order text data is converted into vectorization matrix, selection customer grade, actively tendency degree, complaint history etc. Parameter, set up complain analytical grade Clustering Model, finally according to cluster result, by complaint risk grade classification be 5 classes, such as Fig. 2 It is shown.
3rd, set up and complain analysis and early warning model
Data source is divided into into model training collection and checking collection, and according to the complaint risk hierarchy rules formulated, preference pattern Variable (such as whether there is complaint history, type of service, customer grade etc.), sets up classification learning model using Bayes's classification, passes through The step of model training realizes the prediction of the complaint risk grade to unknown work order, model training is as shown in Figure 3.
4th, model checking
As shown in figure 4, by checking collection data input classification learning model, by complaint risk of each work order of model prediction etc. Level, and the data that result set is concentrated with checking are compared, the accuracy of computation model prediction.
Embodiment
Collect 95598 system Nanjing August part work order data, 87359 altogether.Including business consultation, troublshooting, build The data of the nine class work order such as view, suggestion, complaint, report.Wherein Nanjing City produces work order up to 58151, is secondly river Peaceful area is 13248, and each region work order number distribution refers to Fig. 5.
1st, August part take the initiative in offering a hand grade work order monitoring analysis
Calculated by model, August part Nanjing work order is divided into into five classes and is taken the initiative in offering a hand grade.Wherein one-level work order is 232 Bar, two grades of work orders are 208, and as one-level, two grades of work orders are taken the initiative in offering a hand higher ranked, needs are processed in time, so Following selective analysiss one-level, two grades of work orders distribution situation and model checking monitoring analysis in each region.Each grade work order number Fig. 6 is seen according to distribution details:
(1) monitoring of each grade work order in region is analyzed
According to model monitoring result, Nanjing City one-level, two grades of work order quantity are at most respectively 109 and 90;Secondly 66 and 62 are respectively for Jiangning District;Pukou, the six directions, Lishui are relatively fewer, and wherein Pukou is respectively 20,24, the six directions point Not Wei 14,14, Lishui area be respectively 14,10;Gaochun area is at least respectively 9,8.Business of each grade work order in region Following table is referred to distributed number:
(2) one-level, two grades of work order content analysis
According to model monitoring result, each regional level work order content top ranked for reporting for repairment-without electricity, accounting is respectively river 26% is accounted for rather, Pukou accounts for 28%, and urban district accounts for 22%, and the six directions accounts for 31%, and Gaochun accounts for 26%, and Lishui accounts for 31%; One-level work order content be number two for equipment-failure and reporting-many families-for repairment without electricity, wherein Jiangning, Pukou and Nanjing City ranking Second content is equipment-failure, accounts for 16%, 18% and 21% respectively, and the content that the six directions, Gaochun and Lishui are number two is report Xiu-many families-and without electricity, 18%, 19% and 25% is accounted for respectively.Accounting ranking and first etc. of two grades of work order contents in each region Level is similar.
2nd, model pre-warning monitoring analysis
In monitoring cycle, using Early-warning Model of taking the initiative in offering a hand, common early warning one-level, two grades of work orders, 440 (complaint class work orders 128, non-complaint class work order 312).Find in one-level, two grades of non-complaint class work orders there are 35 finally to cause through checking Customer complaint, wherein has 13 work orders (one-level work order 7, two grades work order 6) directly to cause customer complaint, main business type It is troublshooting 5, service request 3, business consultation 5;Remaining 22 (one-level work order 12, two grades work order 10) work order It is that customer complaint is finally caused by association, totally 9.In this 35 work orders, business consultation accounting is up to about 54%, secondly It is troublshooting and service request, respectively may be about 34%, 12%.
Analyzed by the monitoring to model pre-warning result, in 128 for actually occurring complain work order, there are 106 to be visitor Directly complained in the case of no historical behavior at family.Remaining work order be client have before complaint corresponding troublshooting, The historical behaviors such as business consultation, and be integrally incorporated in this monitoring result.Model is demonstrated to non-complaint class in monitoring cycle Work order later transformation is the capture ability for complaining work order.
Using Early-warning Model is complained, in time capture complaint risk degree compared with work order, be easy to business department to carry out in time actively Service, so as to reduce complaining work order data, lifts the satisfaction of client.

Claims (6)

1. a kind of complaint early warning and monitoring analysis method based on Text Mining Technology, it is characterised in that include:
The text data of typing is changed into the standardization data pattern of uniform rules by step 1, text data standardizing step;
Step 2, normalized number complain analytical grade Clustering Model to data pattern of standardizing according to analysis and early warning step by setting up Be analyzed, complaint risk grade is divided according to cluster result, corresponding early warning is sent further according to risk place grade.
2. the complaint early warning and monitoring analysis method based on Text Mining Technology according to claim 1, it is characterised in that text Notebook data standardizing step includes:
Step 1.1, text initial processing step carry out participle and denoising to the text data of typing, obtain each text Key word in data;
Each key word is carried out vectorization and makees normalized by step 1.2, Text eigenvector step, is obtained by each Individual key word WiIn file djIn vectorization normalization result set up real-valued matrix, key word WiIn file djIn to Quantifying normalization result is:
W i ( d j ) = tf i j × log ( N N i + 0.01 ) Σ k = 1 n ( tf k j ) 2 × [ log ( N N i + 0.01 ) ] 2
In formula:N represents total number of files amount;NiRepresent comprising key word WiQuantity of documents;N is key word total degree;WiFor i-th Individual key word;djRepresent j-th file;tfijFor key word WiIn file djIn word frequency;For all key words In file djIn word frequency quadratic sum;For total number of files amount and comprising key word WiThe quantity ratio of file Plus adjustment item 0.01 after take the logarithm;
Step 1.3, text data similarity matching step, according to the real-valued matrix set up, calculate each pass using the cosine law Cosine similarity between keyword, and COS distance nearest text data is matched, form near synonym dictionary;
Step 1.4, normalized number according to generation step, by the text data for matching according to setting unified standard schema creation Normalized number evidence.
3. the complaint early warning and monitoring analysis method based on Text Mining Technology according to claim 2, it is characterised in that rule Generalized data analysiss warning step includes:
Step 2.1, Sentiment orientation degree calculation procedure, to normalized number according to carry out Judgment by emotion and be divided into it is positive, passive and in Three class of the heart;
Step 2.2, complaint risk grade classification step, according to customer grade, actively type of service, tendency degree and complaint history Parameter is set up and complains analytical grade Clustering Model, and formulates complaint risk hierarchy rules according to cluster result;
Step 2.3, complains analysis and early warning step, according to the complaint risk hierarchy rules formulated, preference pattern variable parameter, utilizes Bayes's classification sets up classification learning model, realizes the complaint wind to unknown text data by the training to classification learning model The prediction of dangerous grade.
4. the complaint early warning and monitoring analysis method based on Text Mining Technology according to claim 3, it is characterised in that feelings Sense tendency degree calculation procedure specific works step be:
(1) conjunction and negative word dictionary are set up;
(2) conjunction and negative word, and labelling corresponding words are extracted on rule according in from normalized number according to conjunction and negative word dictionary Position in generalized data;
(3) existing sentiment dictionary storehouse is matched, obtains polarity and its emotion score value of vocabulary;
(4) by conjunction position, front sentence and rear sentence proportion are determined, further according to negative word position judgment double denial and neighbour The polarity inversion of nearly vocabulary;
(5) using vocabulary polarity and its emotion score value bring conjunction and negative word into after to normalized number according to carrying out cumulative acquisition Affection computation scores;
(6) circulation step (2) to (5), if affection computation scoring be positive for canonical, be bear be then it is passive, otherwise centered on.
5. the complaint early warning and monitoring analysis method based on Text Mining Technology according to claim 3, it is characterised in that throw Tell risk class rule for according to cluster result by complaint risk grade classification be high-risk pole, hazard class, have complaint tendency level, General level and without complaining tendency level this five risk class.
6. the complaint early warning and monitoring analysis method based on Text Mining Technology according to claim 3, it is characterised in that Formulating complaint risk hierarchy rules needs to verify rule, the classification learning model that checking collection data input is set up, and obtains The complaint risk grade of each text data in collection data must be verified, and result set and checking are collected into corresponding level data in data Compare, the accuracy of computation model prediction.
CN201610984456.5A 2016-11-09 2016-11-09 Customer complaint early warning monitoring analysis method based on text mining technology Active CN106529804B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610984456.5A CN106529804B (en) 2016-11-09 2016-11-09 Customer complaint early warning monitoring analysis method based on text mining technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610984456.5A CN106529804B (en) 2016-11-09 2016-11-09 Customer complaint early warning monitoring analysis method based on text mining technology

Publications (2)

Publication Number Publication Date
CN106529804A true CN106529804A (en) 2017-03-22
CN106529804B CN106529804B (en) 2023-08-18

Family

ID=58350353

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610984456.5A Active CN106529804B (en) 2016-11-09 2016-11-09 Customer complaint early warning monitoring analysis method based on text mining technology

Country Status (1)

Country Link
CN (1) CN106529804B (en)

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107194568A (en) * 2017-05-18 2017-09-22 广东中科国志科技发展有限公司 A kind of method and apparatus of analytical technology
CN107220732A (en) * 2017-05-31 2017-09-29 福州大学 A kind of power failure complaint risk Forecasting Methodology based on gradient boosted tree
CN107240033A (en) * 2017-06-07 2017-10-10 国家电网公司客户服务中心 The construction method and system of a kind of electric power identification model
CN107729919A (en) * 2017-09-15 2018-02-23 国网山东省电力公司电力科学研究院 In-depth based on big data technology is complained and penetrates analysis method
CN107992609A (en) * 2017-12-15 2018-05-04 广东电网有限责任公司信息中心 A kind of complaint tendency determination methods based on Text Classification and decision tree
CN107992613A (en) * 2017-12-18 2018-05-04 广东广业开元科技有限公司 A kind of Text Mining Technology protection of consumers' rights index analysis method based on machine learning
CN108062392A (en) * 2017-12-18 2018-05-22 广东广业开元科技有限公司 A kind of protection of consumers' rights index calculation method based on big data sorting algorithm
CN108108352A (en) * 2017-12-18 2018-06-01 广东广业开元科技有限公司 A kind of enterprise's complaint risk method for early warning based on machine learning Text Mining Technology
CN108256552A (en) * 2017-12-18 2018-07-06 广东广业开元科技有限公司 Common people close friend's index assessment method and system based on big data sorting algorithm
CN108492033A (en) * 2018-03-26 2018-09-04 国家电网公司客户服务中心 Power grid client, which concentrates, complains intelligent early-warning method
CN109255499A (en) * 2018-10-25 2019-01-22 阿里巴巴集团控股有限公司 Complaint, tip-offs about environmental issues processing method, device and equipment
CN109558486A (en) * 2018-10-30 2019-04-02 国家电网有限公司客户服务中心 Electric power customer service client's demand intelligent identification Method
CN109636607A (en) * 2018-12-18 2019-04-16 平安科技(深圳)有限公司 Business data processing method, device and computer equipment based on model deployment
CN109905269A (en) * 2018-01-17 2019-06-18 华为技术有限公司 The method and apparatus for determining network failure
CN110097381A (en) * 2019-05-01 2019-08-06 青岛民航凯亚系统集成有限公司 A kind of complaint automatic identification early warning system and method applied to airport service
CN110275956A (en) * 2019-06-24 2019-09-24 成都数之联科技有限公司 A kind of personal identification method and system
CN110414819A (en) * 2019-07-19 2019-11-05 中国电信集团工会上海市委员会 A kind of work order methods of marking
CN110713088A (en) * 2019-10-25 2020-01-21 日立楼宇技术(广州)有限公司 Early warning method, device, equipment and medium for elevator complaints
CN110889770A (en) * 2019-10-12 2020-03-17 中国平安财产保险股份有限公司 Data processing method, device, equipment and storage medium
CN111415060A (en) * 2020-01-21 2020-07-14 国网浙江省电力有限公司湖州供电公司 Complaint risk analysis method based on customer label
CN112256836A (en) * 2020-11-04 2021-01-22 中国建设银行股份有限公司 Recording data processing method and device and server
CN112836517A (en) * 2021-01-27 2021-05-25 浪潮云信息技术股份公司 Method for processing mining risk signal based on natural language
CN112927091A (en) * 2021-04-08 2021-06-08 泰康保险集团股份有限公司 Complaint early warning method and device for annuity insurance, computer equipment and medium
CN113220545A (en) * 2021-05-11 2021-08-06 中国工商银行股份有限公司 Work order assignment method and device and electronic equipment
CN113673905A (en) * 2021-08-31 2021-11-19 广东省信息网络有限公司 Complaint service early warning monitoring system based on big data
CN114298425A (en) * 2021-12-30 2022-04-08 成都数联云算科技有限公司 Customer complaint risk prediction method, apparatus, device, and medium
CN115081869A (en) * 2022-06-20 2022-09-20 中国银行股份有限公司 Audit project generation method and device
CN118396393A (en) * 2024-06-27 2024-07-26 广东烟草东莞市有限公司 Multi-algorithm fusion risk prediction method for tobacco monopoly retailers

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120047053A1 (en) * 2010-08-18 2012-02-23 The Western Union Company Systems and methods for assessing fraud risk
US20120259673A1 (en) * 2011-04-08 2012-10-11 Welch Allyn, Inc. Risk-Based Complaint Management System
US20130110497A1 (en) * 2011-10-27 2013-05-02 Microsoft Corporation Functionality for Normalizing Linguistic Items
CN105335496A (en) * 2015-10-22 2016-02-17 国网山东省电力公司电力科学研究院 Customer service repeated call treatment method based on cosine similarity text mining algorithm
CN105389341A (en) * 2015-10-22 2016-03-09 国网山东省电力公司电力科学研究院 Text clustering and analysis method for repeating caller work orders of customer service calls
CN105930347A (en) * 2016-04-05 2016-09-07 浙江远传信息技术股份有限公司 Text analysis based power outage cause recognition system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120047053A1 (en) * 2010-08-18 2012-02-23 The Western Union Company Systems and methods for assessing fraud risk
US20120259673A1 (en) * 2011-04-08 2012-10-11 Welch Allyn, Inc. Risk-Based Complaint Management System
US20130110497A1 (en) * 2011-10-27 2013-05-02 Microsoft Corporation Functionality for Normalizing Linguistic Items
CN105335496A (en) * 2015-10-22 2016-02-17 国网山东省电力公司电力科学研究院 Customer service repeated call treatment method based on cosine similarity text mining algorithm
CN105389341A (en) * 2015-10-22 2016-03-09 国网山东省电力公司电力科学研究院 Text clustering and analysis method for repeating caller work orders of customer service calls
CN105930347A (en) * 2016-04-05 2016-09-07 浙江远传信息技术股份有限公司 Text analysis based power outage cause recognition system

Cited By (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107194568A (en) * 2017-05-18 2017-09-22 广东中科国志科技发展有限公司 A kind of method and apparatus of analytical technology
CN107220732A (en) * 2017-05-31 2017-09-29 福州大学 A kind of power failure complaint risk Forecasting Methodology based on gradient boosted tree
CN107240033A (en) * 2017-06-07 2017-10-10 国家电网公司客户服务中心 The construction method and system of a kind of electric power identification model
CN107240033B (en) * 2017-06-07 2021-02-12 国家电网公司客户服务中心 Method and system for constructing electric power identification model
CN107729919A (en) * 2017-09-15 2018-02-23 国网山东省电力公司电力科学研究院 In-depth based on big data technology is complained and penetrates analysis method
CN107992609B (en) * 2017-12-15 2021-05-18 广东电网有限责任公司信息中心 Complaint tendency judgment method based on text classification technology and decision tree
CN107992609A (en) * 2017-12-15 2018-05-04 广东电网有限责任公司信息中心 A kind of complaint tendency determination methods based on Text Classification and decision tree
CN108062392A (en) * 2017-12-18 2018-05-22 广东广业开元科技有限公司 A kind of protection of consumers' rights index calculation method based on big data sorting algorithm
CN108256552A (en) * 2017-12-18 2018-07-06 广东广业开元科技有限公司 Common people close friend's index assessment method and system based on big data sorting algorithm
CN108108352A (en) * 2017-12-18 2018-06-01 广东广业开元科技有限公司 A kind of enterprise's complaint risk method for early warning based on machine learning Text Mining Technology
CN107992613A (en) * 2017-12-18 2018-05-04 广东广业开元科技有限公司 A kind of Text Mining Technology protection of consumers' rights index analysis method based on machine learning
CN109905269B (en) * 2018-01-17 2020-11-17 华为技术有限公司 Method and device for determining network fault
CN109905269A (en) * 2018-01-17 2019-06-18 华为技术有限公司 The method and apparatus for determining network failure
CN108492033A (en) * 2018-03-26 2018-09-04 国家电网公司客户服务中心 Power grid client, which concentrates, complains intelligent early-warning method
CN109255499A (en) * 2018-10-25 2019-01-22 阿里巴巴集团控股有限公司 Complaint, tip-offs about environmental issues processing method, device and equipment
CN109255499B (en) * 2018-10-25 2021-12-07 创新先进技术有限公司 Complaint and complaint case processing method, device and equipment
CN109558486A (en) * 2018-10-30 2019-04-02 国家电网有限公司客户服务中心 Electric power customer service client's demand intelligent identification Method
CN109636607B (en) * 2018-12-18 2024-03-15 平安科技(深圳)有限公司 Service data processing method and device based on model deployment and computer equipment
CN109636607A (en) * 2018-12-18 2019-04-16 平安科技(深圳)有限公司 Business data processing method, device and computer equipment based on model deployment
CN110097381A (en) * 2019-05-01 2019-08-06 青岛民航凯亚系统集成有限公司 A kind of complaint automatic identification early warning system and method applied to airport service
CN110275956A (en) * 2019-06-24 2019-09-24 成都数之联科技有限公司 A kind of personal identification method and system
CN110414819A (en) * 2019-07-19 2019-11-05 中国电信集团工会上海市委员会 A kind of work order methods of marking
CN110414819B (en) * 2019-07-19 2023-05-26 中国电信集团工会上海市委员会 Work order scoring method
CN110889770A (en) * 2019-10-12 2020-03-17 中国平安财产保险股份有限公司 Data processing method, device, equipment and storage medium
CN110889770B (en) * 2019-10-12 2024-05-24 中国平安财产保险股份有限公司 Data processing method, device, equipment and storage medium
CN110713088B (en) * 2019-10-25 2021-06-01 日立楼宇技术(广州)有限公司 Early warning method, device, equipment and medium for elevator complaints
CN110713088A (en) * 2019-10-25 2020-01-21 日立楼宇技术(广州)有限公司 Early warning method, device, equipment and medium for elevator complaints
CN111415060A (en) * 2020-01-21 2020-07-14 国网浙江省电力有限公司湖州供电公司 Complaint risk analysis method based on customer label
CN111415060B (en) * 2020-01-21 2022-07-29 国网浙江省电力有限公司湖州供电公司 Complaint risk analysis method based on customer label
CN112256836A (en) * 2020-11-04 2021-01-22 中国建设银行股份有限公司 Recording data processing method and device and server
CN112836517A (en) * 2021-01-27 2021-05-25 浪潮云信息技术股份公司 Method for processing mining risk signal based on natural language
CN112927091A (en) * 2021-04-08 2021-06-08 泰康保险集团股份有限公司 Complaint early warning method and device for annuity insurance, computer equipment and medium
CN112927091B (en) * 2021-04-08 2023-11-10 泰康保险集团股份有限公司 Complaint early warning method and device for annual gold insurance, computer equipment and medium
CN113220545A (en) * 2021-05-11 2021-08-06 中国工商银行股份有限公司 Work order assignment method and device and electronic equipment
CN113673905A (en) * 2021-08-31 2021-11-19 广东省信息网络有限公司 Complaint service early warning monitoring system based on big data
CN114298425A (en) * 2021-12-30 2022-04-08 成都数联云算科技有限公司 Customer complaint risk prediction method, apparatus, device, and medium
CN115081869A (en) * 2022-06-20 2022-09-20 中国银行股份有限公司 Audit project generation method and device
CN118396393A (en) * 2024-06-27 2024-07-26 广东烟草东莞市有限公司 Multi-algorithm fusion risk prediction method for tobacco monopoly retailers

Also Published As

Publication number Publication date
CN106529804B (en) 2023-08-18

Similar Documents

Publication Publication Date Title
CN106529804A (en) Client complaint early-warning monitoring analyzing method based on text mining technology
CN106530127A (en) Complaint early warning and monitoring analysis system based on text mining
CN107704637B (en) knowledge graph construction method for emergency
CN108108352A (en) A kind of enterprise's complaint risk method for early warning based on machine learning Text Mining Technology
Yang et al. Coordinated development path of metropolitan logistics and economy in Belt and Road using DEMATEL–Bayesian analysis
CN107122432A (en) CSR analysis method, device and system
CN103150333A (en) Opinion leader identification method in microblog media
CN107885849A (en) A kind of moos index analysis system based on text classification
CN114860882A (en) Fair competition review auxiliary method based on text classification model
Yuan et al. A hybrid method for multi-class sentiment analysis of micro-blogs
CN106227802A (en) A kind of based on Chinese natural language process and the multiple source Forecasting of Stock Prices method of multi-core classifier
Du et al. Text similarity detection method of power customer service work order based on tfidf algorithm
Li et al. Credit risk management of scientific and technological enterprises based on text mining
Featherstone et al. Validating sentiment analysis on opinion mining using self-reported attitude scores
CN111241077A (en) Financial fraud behavior identification method based on internet data
Seo et al. Measuring News Sentiment of Korea Using Transformer
Wan et al. Evaluation model of power operation and maintenance based on text emotion analysis
Huang Web mining for the mayoral election prediction in Taiwan
CN110750622A (en) Financial event discovery method based on big data
CN114266646A (en) User consumption behavior monitoring and analyzing method and system based on internet summary calculation
Wang et al. Fault location of strip steel surface quality defects on hot-rolling production line based on information fusion of historical cases and process data
CN110968795B (en) Data association matching system of company image lifting system
AlFarasani et al. ATAM: arabic traffic analysis model for Twitter
CN114328819A (en) Power safety production hidden danger pre-control method based on knowledge graph
Jin et al. Diagnosis of corporate insolvency using massive news articles for credit management

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant