CN108021704B - Agent optimal configuration method based on social public opinion data mining technology - Google Patents

Agent optimal configuration method based on social public opinion data mining technology Download PDF

Info

Publication number
CN108021704B
CN108021704B CN201711445217.3A CN201711445217A CN108021704B CN 108021704 B CN108021704 B CN 108021704B CN 201711445217 A CN201711445217 A CN 201711445217A CN 108021704 B CN108021704 B CN 108021704B
Authority
CN
China
Prior art keywords
data
public opinion
social public
agent
social
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711445217.3A
Other languages
Chinese (zh)
Other versions
CN108021704A (en
Inventor
孔祥明
杨晓霖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Guangye Kaiyuan Technology Co ltd
Original Assignee
Guangdong Guangye Kaiyuan Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Guangye Kaiyuan Technology Co ltd filed Critical Guangdong Guangye Kaiyuan Technology Co ltd
Priority to CN201711445217.3A priority Critical patent/CN108021704B/en
Publication of CN108021704A publication Critical patent/CN108021704A/en
Application granted granted Critical
Publication of CN108021704B publication Critical patent/CN108021704B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Data Mining & Analysis (AREA)
  • Marketing (AREA)
  • General Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Game Theory and Decision Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Educational Administration (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an agent optimal configuration method based on social public sentiment data mining technology, which comprises the following steps: step1, collecting public opinion data by using a web crawler technology; step2, public opinion data preprocessing, including data cleaning, data integration, data conversion and data reduction; step3, establishing a social public opinion confidence model by using a text mining technology and a support vector machine algorithm, and dividing the social public opinion information confidence level; and 4, establishing an agent optimization configuration model. According to the agent optimal configuration algorithm model based on social public opinion data mining, provided by the invention, the social public opinions are divided into confidence levels according to real-time public opinion data of the Internet and algorithms such as text mining and SVM (support vector machine) and the like, so that the call volume reported by 12345 complaints is estimated and predicted, and the agent is scientifically and reasonably configured by using a big data technology.

Description

Agent optimal configuration method based on social public opinion data mining technology
Technical Field
The invention relates to the technical field of databases, in particular to an agent optimal configuration method based on social public sentiment data mining technology.
Background
With the steady development of politics, economy, culture and society in China, the right-maintaining consciousness of people is gradually enhanced, the attention to social affairs is continuously improved, and the 12345 government affair service hotline becomes an effective window for reflecting social problem phenomena and expressing social appeal of people. In order to effectively exert the influence and the acting force of a 12345 government affair service hotline, the reasonable arrangement of the seats is a basic work which cannot be ignored, and the reasonable configuration of the seats is the basis and the key for the masses to effectively express complaints and reflect problems in time.
The existing seat configuration model only sets seats based on historical data such as call quantity, average processing time and the like, the seat model considers single factors, and the phenomenon that the seat configuration is unreasonable is easily caused by neglecting social public opinions which are closely related to the number of complaints of the masses. With the high-speed transmission of internet information, the relevance between the real-time public sentiment data of the internet and the complaint reporting information is continuously enhanced, and the mining of the social public sentiment data can provide stronger leading significance for the optimal configuration of the seat.
Disclosure of Invention
In view of the above defects in the prior art, the technical problem to be solved by the present invention is to provide an agent optimal configuration method based on social public opinion data mining technology, which performs confidence level division on social public opinions according to real-time public opinion data of the internet and algorithms such as text mining and SVM, evaluates and predicts telephone incoming call volume and hot-line average processing duration of 12345 complaint reporting, and provides effective data support for scientific and reasonable configuration of an agent by using big data technology.
In order to achieve the purpose, the invention provides an agent optimal configuration method based on social public sentiment data mining technology, which comprises the following steps:
step1, collecting public opinion data by using a web crawler technology;
step2, public opinion data preprocessing, including data cleaning, data integration, data conversion and data reduction;
step3, establishing a social public opinion confidence model by using a text mining technology and a support vector machine algorithm, and dividing the social public opinion information confidence level;
and 4, establishing an agent optimization configuration model.
Further, the step2 specifically includes:
step 21: data cleaning, namely identifying and processing the vacant data, the incomplete data and the unreasonable data;
step 22: data integration, namely organically centralizing and integrating data of different sources, formats and characteristics;
step 23: data conversion, converting the format of the data;
step 24: and (4) data reduction, namely, on the premise of keeping the integrity of the data, simplifying the data.
Further, the step3 specifically includes:
step 31: manually labeling, namely randomly extracting texts in a certain proportion, performing labeling classification by a plurality of related professionals, and counting the consistency of corpus labeling according to the labeling result;
step 32: feature selection, wherein the feature selection refers to selecting some representative words from a dictionary to realize dimension reduction, a Chi method is adopted to perform feature selection, and feature words w and categories a are assumedkThe chi-square distribution of the first-order degree of freedom is satisfied between the feature words w and the class akThe chi-square formula of (c) is:
Figure GDA0002958566550000021
n1 is the total number of documents, A is belonging to akThe number of documents in class and containing the feature word w, B being not akThe number of documents in class but containing the feature word w, E being akClass but no number of documents containing feature word w, D being not akThe number of documents which are similar and do not contain the feature word w;
for the situation of multiple categories, calculating chi-square statistic of the feature word w under each category;
if the feature word w and the category akIf the chi-square statistic value of (a) is 0, the feature word w and the text category a are describedkAre independent of each other; if the chi-square statistic value is larger, the characteristic word w and the category a are explainedkThe stronger the correlation of (c); removing the features lower than a specific threshold value through a chi-square formula, and reserving the features higher than the threshold value to realize feature selection;
step 33: feature extraction, namely, mapping a high-dimensional space to a low-dimensional space to realize dimension reduction, and performing feature extraction on a text by using an LSA algorithm, wherein the method mainly comprises the following steps:
1) establishing a word frequency matrix M;
2) calculating singular value decomposition of a word frequency matrix M, and decomposing the M into U, S, V three matrixes, wherein U and V are orthogonal matrixes, and S is a diagonal matrix;
3) mapping other training samples into a U space;
4) indexing and calculating similarity of the converted documents, and obtaining an LSA classifier through training;
step 34: constructing feature vectors, converting each text into an n-dimensional text vector, forming a text vector space by a plurality of text vectors, and assuming that n feature words exist, each text is an n-dimensional vector after being represented by the text;
step 35: constructing SVM classifier, and setting { (x 1),y1),(x2,y2),…,(xn,yn) The problem is transformed to the optimized hyperplane problem if the training set can be linearly divided by a hyperplane W · X + b { -0 } for a training set where xi represents the input vector and yi ∈ { -1,1} represents the output vector:
Figure GDA0002958566550000022
if the linear division is not linear, the input space R with low dimension can be divided by the kernel function K (x1, x2)nMapping to a high-dimensional characteristic space H to realize linear divisibility, and selecting a polynomial kernel function, wherein the formula is as follows:
K(x1,x2)=(<x1,x2>+R)d
1) selecting a proper kernel function K (x1, x2) and a penalty coefficient C >0, the formula of the objective function is as follows:
Figure GDA0002958566550000031
2) calculating an ai vector corresponding to minimization of the formula (2) by using an SMO algorithm;
3) calculating w, the formula is
Figure GDA0002958566550000032
4) Finding all samples (xm, ym) that meet 0< ai < C, assuming a total of M support vectors;
5) by passing
Figure GDA0002958566550000033
Calculating bm corresponding to each support vector (xm, ym), and obtaining
Figure GDA0002958566550000034
6) Thus, the classification hyperplane is
Figure GDA0002958566550000035
The classification decision function is
Figure GDA0002958566550000036
Step 36: and (5) predicting a result by the model.
Further, the step4 specifically includes:
step 41, according to the social public opinion information confidence level classification result in the step3, combining with the recent historical data of 12345 service hotlines, drawing an analysis curve related to time, and performing relevance analysis on hotline incoming call quantity and average processing time of each day and different time periods by using a relevance analysis algorithm;
step 42, by using a multiple regression analysis algorithm, taking historical data of social public opinion confidence level X1, and 12345 complaint report number X2, work order item type X3, and work order severity X4 as input variables, realizing weight distribution through multiple fitting, and finally constructing a daily hot-line incoming call quantity calculation formula in the following form:
F1(X)=W1*f1(X1)+W2*f2(X2)+W3*f3(X3)+W4*f4(X4)
hotline call volume F in different periods2(X) and hotline average processing time period F3(X) is calculated in a similar manner;
step 41, constructing an agent optimization configuration model by utilizing a multiple regression analysis algorithm, and assuming that the hot line incoming call quantity per day is F1(X) the hot incoming call quantity in different periods is F2(X) average hot line processing time length F3(X) if the call completing rate is a and the maximum occupancy rate is j, the agent optimization configuration function is as follows: g (X, a, j) ═ U1 × F1(X)+U2*F2(X)+U3*F3(X) + U4 g (a) + U5 h (j) Ui represents a weight.
Further, the step3 divides the social public opinion information into five confidence levels of optimism, prudent optimism, neutrality, prudent pessimism and pessimism.
The invention has the beneficial effects that:
according to the agent optimal configuration algorithm model based on social public opinion data mining, provided by the invention, the social public opinions are divided into confidence levels according to real-time public opinion data of the Internet and algorithms such as text mining and SVM (support vector machine) and the like, so that the call volume reported by 12345 complaints is estimated and predicted, and the agent is scientifically and reasonably configured by using a big data technology.
The conception, the specific structure and the technical effects of the present invention will be further described with reference to the accompanying drawings to fully understand the objects, the features and the effects of the present invention.
Drawings
FIG. 1 is an overall flow chart of the present invention.
Fig. 2 is a flow chart of establishing a social public opinion confidence model according to the present invention.
FIG. 3 is a flow chart of agent optimization configuration model establishment according to the present invention.
Detailed Description
As shown in fig. 1, the method for optimal configuration of an agent based on social public sentiment data mining technology of the present invention specifically comprises the following operation steps:
the method comprises the following steps: public opinion data collection
The method comprises the steps of collecting social public opinion data on the Internet by utilizing a web crawler technology, for example, regularly crawling and collecting the social public opinion data on social media such as various news websites, microblogs, forums, blogs and the like, and mainly using unstructured data mainly comprising text information.
Step two: public opinion data preprocessing
And preprocessing the crawled public opinion data, including the steps of data cleaning, data integration, data conversion, data reduction and the like.
Step 1: data cleaning: and the method identifies and processes the blank data, the incomplete data and the unreasonable data, and ensures the integrity, the reasonability, the authority and the consistency of the data.
Step 2: data integration: the data of different sources, formats and characteristics are organically centralized and integrated.
Step 3: data conversion: the format of the data is converted, so that the data can be analyzed and mined conveniently in the follow-up process.
Step 4: and (3) data reduction: on the premise of keeping the integrity of the data as much as possible, the data is simplified and processed by common dimensionality reduction methods such as PCA (principal component analysis).
Step three: the social public opinion confidence model is shown in fig. 2:
a social public opinion confidence model is established by utilizing a text mining technology and a support vector machine algorithm, and public opinion information is divided into five confidence levels of optimism, judicious optimism, neutrality, judicious pessimism and pessimism.
Step 1: manual labeling: randomly extracting texts in a certain proportion, carrying out labeling classification by a plurality of related professionals, counting the consistency of corpus labeling according to the labeling result, and using the passed labeling for information classification.
Step 2: selecting characteristics: the feature selection refers to selecting some representative words from a dictionary to realize dimension reduction.
Selecting characteristics by using a Chi method, and assuming characteristic words w and categories akThe chi-square distribution of the first-order degree of freedom is satisfied between the feature words w and the class akThe chi-square formula of (c) is:
Figure GDA0002958566550000051
n1 is the total number of documents, A is belonging to akThe number of documents in class and containing the feature word w, B being not akThe number of documents in class but containing the feature word w, E being akClass but no number of documents containing feature word w, D being not akClass and number of documents without the feature word w.
For the case of multiple categories, it is necessary to calculate chi-square statistics of the feature word w under each category.
If the feature word w and the category akIf the chi-square statistic value of (a) is 0, the feature word w and the text category a are describedkAre independent of each other; if the chi-square statistic value is larger, the characteristic word w and the category a are explainedkIn (2) correlation ofThe stronger the sex. Through a chi-square formula, the features lower than a specific threshold value can be removed, the features higher than the threshold value are reserved, and feature selection is realized.
Step 3: characteristic extraction: dimension reduction is achieved by mapping the high-dimensional space to the low-dimensional space.
The method is characterized by comprising the following steps of applying an LSA algorithm to extract the features of a text:
1) establishing a word frequency matrix M;
2) calculating singular value decomposition of a word frequency matrix M, and decomposing the M into U, S, V three matrixes, wherein U and V are orthogonal matrixes, and S is a diagonal matrix;
3) mapping other training samples into a U space;
4) and indexing and calculating the similarity of the converted documents, and obtaining the LSA classifier through training.
Step 4: constructing a feature vector: each text is converted into an n-dimensional text vector, and a plurality of text vectors form a text vector space. Assuming that n feature words are provided, each text is an n-dimensional vector after being represented by the text.
Step 5: constructing SVM classifier, and setting { (x 1),y1),(x2,y2),…,(xn,yn) The problem is transformed to the optimized hyperplane problem if the training set can be linearly divided by a hyperplane W · X + b { -0 } for a training set where xi represents the input vector and yi ∈ { -1,1} represents the output vector:
Figure GDA0002958566550000052
if the linear division is not linear, the input space R with low dimension can be divided by the kernel function K (x1, x2)nMapping to a high-dimensional feature space H to realize linear divisibility.
The kernel function refers to an inner product function of two vectors in a space after implicit mapping, common kernel functions include a polynomial kernel function, a linear kernel function, a gaussian kernel function and the like, and the polynomial kernel function is selected herein, and the formula is as follows:
K(x1,x2)=(<x1,x2>+R)d
1) selecting a proper kernel function K (x1, x2) and a penalty coefficient C >0, the formula of the objective function is as follows:
Figure GDA0002958566550000061
2) calculating a corresponding a vector when the formula (2) is minimized by using an SMO algorithm;
3) calculating w, the formula is
Figure GDA0002958566550000062
4) Finding all samples (xm, ym) that meet 0< ai < C, assuming a total of M support vectors;
5) by passing
Figure GDA0002958566550000063
Calculating bm corresponding to each support vector (xm, ym), and obtaining
Figure GDA0002958566550000064
6) Thus, the classification hyperplane is
Figure GDA0002958566550000065
The classification decision function is
Figure GDA0002958566550000066
Step 6: model prediction results
Confidence level classification is carried out on unmarked social public opinion data by adopting SVM parameters and models trained in step5
Step four: the agent optimization configuration model is as shown in FIG. 3:
step 1: and (4) according to the social public opinion confidence level classification result in the step three, combining the recent historical data of the 12345 service hotline, drawing an analysis curve related to time, and performing relevance analysis on the hotline incoming call quantity and the average processing time of each day and different time intervals by using a relevance analysis algorithm.
Step 2: by utilizing a multiple regression analysis algorithm, taking historical data such as social public opinion confidence level X1, number of complaints reported by 12345X 2, work order item type X3, work order severity X4 and the like as input variables, realizing weight distribution through multiple fitting, and finally constructing a hot line incoming call quantity calculation formula in the following form each day:
F1(X)=W1*f1(X1)+W2*f2(X2)+W3*f3(X3)+W4*f4(X4)
hotline call volume F in different periods2(X) and hotline average processing time period F3The calculation method of (X) is similar.
Step 3: and constructing an agent optimization configuration model by using a multiple regression analysis algorithm. Suppose that the daily hot line incoming call amount is F1(X) the hot incoming call quantity in different periods is F2(X) average hot line processing time length F3(X) if the call completing rate is a and the maximum occupancy rate is j, the agent optimization configuration function is as follows: g (X, a, j) ═ U1 × F1(X)+U2*F2(X)+U3*F3(X) + U4 g (a) + U5 h (j) Ui represents a weight.
According to the agent optimal configuration algorithm model based on social public opinion data mining, provided by the invention, the social public opinions are divided into confidence levels according to real-time public opinion data of the Internet and algorithms such as text mining and SVM (support vector machine) and the like, so that the call volume reported by 12345 complaints is estimated and predicted, and the agent is scientifically and reasonably configured by using a big data technology.
The foregoing detailed description of the preferred embodiments of the invention has been presented. It should be understood that numerous modifications and variations could be devised by those skilled in the art in light of the present teachings without departing from the inventive concepts. Therefore, the technical solutions available to those skilled in the art through logic analysis, reasoning and limited experiments based on the prior art according to the concept of the present invention should be within the scope of protection defined by the claims.

Claims (2)

1. A method for optimally configuring an agent based on a social public sentiment data mining technology is characterized by comprising the following steps:
step1, collecting public opinion data by using a web crawler technology;
step2, public opinion data preprocessing, including data cleaning, data integration, data conversion and data reduction;
step3, establishing a social public opinion confidence model by using a text mining technology and a support vector machine algorithm, and dividing the social public opinion information confidence level; the step3 divides the social public sentiment information into five confidence levels of optimism, judicious optimism, neutrality, judicious pessimism and pessimism,
step4, establishing an agent optimization configuration model, wherein the step4 specifically comprises the following steps:
step 41, according to the social public opinion information confidence level classification result in the step3, combining with the recent historical data of 12345 service hotlines, drawing an analysis curve related to time, and performing relevance analysis on hotline incoming call quantity and average processing time of each day and different time periods by using a relevance analysis algorithm;
step 42, by using a multiple regression analysis algorithm, taking historical data of social public opinion confidence level X1, and 12345 complaint report number X2, work order item type X3, and work order severity X4 as input variables, realizing weight distribution through multiple fitting, and finally constructing a daily hot-line incoming call quantity calculation formula in the following form:
F1(X)=W1*f1(X1)+W2*f2(X2)+W3*f3(X3)+W4*f4(X4)
hotline call volume F in different periods2(X) and hotline average processing time period F3(X) is calculated in a similar manner;
step 43, constructing an agent optimization configuration model by utilizing a multiple regression analysis algorithm, and assuming that the hot line incoming call quantity per day is F1(X) the hot incoming call quantity in different periods is F2(X) average hot line processing time length F3(X) if the call completing rate is a and the maximum occupancy rate is j, the agent optimization configuration function is as follows: g (X, a, j) ═ U1 × F1(X)+U2*F2(X)+U3*F3(X) + U4 g (a) + U5 h (j) Ui represents a weight.
2. The method for optimal configuration of the agent based on the social public opinion data mining technology as claimed in claim 1, wherein the step2 specifically comprises:
step 21: data cleaning, namely identifying and processing the vacant data, the incomplete data and the unreasonable data;
step 22: data integration, namely organically centralizing and integrating data of different sources, formats and characteristics;
step 23: data conversion, converting the format of the data;
step 24: and (4) data reduction, namely, on the premise of keeping the integrity of the data, simplifying the data.
CN201711445217.3A 2017-12-27 2017-12-27 Agent optimal configuration method based on social public opinion data mining technology Active CN108021704B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711445217.3A CN108021704B (en) 2017-12-27 2017-12-27 Agent optimal configuration method based on social public opinion data mining technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711445217.3A CN108021704B (en) 2017-12-27 2017-12-27 Agent optimal configuration method based on social public opinion data mining technology

Publications (2)

Publication Number Publication Date
CN108021704A CN108021704A (en) 2018-05-11
CN108021704B true CN108021704B (en) 2021-05-04

Family

ID=62071068

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711445217.3A Active CN108021704B (en) 2017-12-27 2017-12-27 Agent optimal configuration method based on social public opinion data mining technology

Country Status (1)

Country Link
CN (1) CN108021704B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871889B (en) * 2019-01-31 2019-12-24 内蒙古工业大学 Public psychological assessment method under emergency
CN110889556B (en) * 2019-11-28 2022-08-12 福建亿榕信息技术有限公司 Enterprise operation risk characteristic data information extraction method and extraction system
CN115048487B (en) * 2022-05-30 2024-05-03 平安科技(深圳)有限公司 Public opinion analysis method, device, computer equipment and medium based on artificial intelligence

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104113643A (en) * 2014-06-27 2014-10-22 国家电网公司 Customer service center on-site monitoring system and method
US9536191B1 (en) * 2015-11-25 2017-01-03 Osaro, Inc. Reinforcement learning using confidence scores
CN106530127A (en) * 2016-11-09 2017-03-22 国网江苏省电力公司南京供电公司 Complaint early warning and monitoring analysis system based on text mining
CN106791225A (en) * 2017-03-23 2017-05-31 国家电网公司客户服务中心 A kind of alarm method and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070100779A1 (en) * 2005-08-05 2007-05-03 Ori Levy Method and system for extracting web data
US9141966B2 (en) * 2009-12-23 2015-09-22 Yahoo! Inc. Opinion aggregation system
US20120323627A1 (en) * 2011-06-14 2012-12-20 Microsoft Corporation Real-time Monitoring of Public Sentiment
CN103970864B (en) * 2014-05-08 2017-09-22 清华大学 Mood classification and mood component analyzing method and system based on microblogging text

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104113643A (en) * 2014-06-27 2014-10-22 国家电网公司 Customer service center on-site monitoring system and method
US9536191B1 (en) * 2015-11-25 2017-01-03 Osaro, Inc. Reinforcement learning using confidence scores
CN106530127A (en) * 2016-11-09 2017-03-22 国网江苏省电力公司南京供电公司 Complaint early warning and monitoring analysis system based on text mining
CN106791225A (en) * 2017-03-23 2017-05-31 国家电网公司客户服务中心 A kind of alarm method and device

Also Published As

Publication number Publication date
CN108021704A (en) 2018-05-11

Similar Documents

Publication Publication Date Title
Jain et al. An intelligent cognitive-inspired computing with big data analytics framework for sentiment analysis and classification
CN103678670B (en) Micro-blog hot word and hot topic mining system and method
CN104199965B (en) Semantic information retrieval method
CN106874292B (en) Topic processing method and device
CN108021704B (en) Agent optimal configuration method based on social public opinion data mining technology
CN103279478A (en) Method for extracting features based on distributed mutual information documents
CN109471946A (en) A kind of classification method and system of Chinese text
CN109271514A (en) Generation method, classification method, device and the storage medium of short text disaggregated model
CN104834651A (en) Method and apparatus for providing answers to frequently asked questions
CN116843162B (en) Contradiction reconciliation scheme recommendation and scoring system and method
Rao et al. Result prediction for political parties using Twitter sentiment analysis
CN108334573B (en) High-correlation microblog retrieval method based on clustering information
Onwuegbuche et al. Support vector machine for sentiment analysis of Nigerian banks financial tweets
CN112270189A (en) Question type analysis node generation method, question type analysis node generation system and storage medium
Kulkarni et al. Tweet Sentiment Analysis and Study and Comparison of Various Approaches and Classification Algorithms Used
CN114372145A (en) Operation and maintenance resource dynamic allocation scheduling method based on knowledge graph platform
Ashwini et al. Impact of Text Representation Techniques on Clustering Models
Najadat et al. Analyzing social media opinions using data analytics
Bhuvaneswari et al. Enhancing the sentiment classification accuracy of twitter data using machine learning algorithms
Rana et al. News headlines classification using probabilistic approach
CN113806534B (en) Hot event prediction method for social network
Al-Hadheri et al. Text Classification in Arabic Natural Language Processing: A Review
CN111159393B (en) Text generation method for abstract extraction based on LDA and D2V
Koulali et al. A comparative study on text representation models for topic detection in arabic
CN118378792B (en) Data processing analysis platform based on artificial intelligence

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant