CN108021704B - Agent optimal configuration method based on social public opinion data mining technology - Google Patents
Agent optimal configuration method based on social public opinion data mining technology Download PDFInfo
- Publication number
- CN108021704B CN108021704B CN201711445217.3A CN201711445217A CN108021704B CN 108021704 B CN108021704 B CN 108021704B CN 201711445217 A CN201711445217 A CN 201711445217A CN 108021704 B CN108021704 B CN 108021704B
- Authority
- CN
- China
- Prior art keywords
- data
- public opinion
- social public
- agent
- social
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000005516 engineering process Methods 0.000 title claims abstract description 21
- 238000000034 method Methods 0.000 title claims abstract description 19
- 238000007418 data mining Methods 0.000 title claims abstract description 12
- 239000003795 chemical substances by application Substances 0.000 claims abstract description 26
- 238000012706 support-vector machine Methods 0.000 claims abstract description 14
- 238000005457 optimization Methods 0.000 claims abstract description 11
- 238000005065 mining Methods 0.000 claims abstract description 9
- 238000006243 chemical reaction Methods 0.000 claims abstract description 7
- 238000004140 cleaning Methods 0.000 claims abstract description 7
- 230000010354 integration Effects 0.000 claims abstract description 7
- 238000007781 pre-processing Methods 0.000 claims abstract description 5
- 238000012545 processing Methods 0.000 claims description 13
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000000611 regression analysis Methods 0.000 claims description 6
- 238000004364 calculation method Methods 0.000 claims description 4
- 239000013598 vector Substances 0.000 description 21
- 238000002372 labelling Methods 0.000 description 9
- 238000012549 training Methods 0.000 description 8
- 238000013507 mapping Methods 0.000 description 7
- 239000011159 matrix material Substances 0.000 description 6
- 238000000605 extraction Methods 0.000 description 3
- 238000000354 decomposition reaction Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000000513 principal component analysis Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000009193 crawling Effects 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Tourism & Hospitality (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- Data Mining & Analysis (AREA)
- Marketing (AREA)
- General Engineering & Computer Science (AREA)
- Development Economics (AREA)
- General Business, Economics & Management (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Game Theory and Decision Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Entrepreneurship & Innovation (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Evolutionary Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Educational Administration (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses an agent optimal configuration method based on social public sentiment data mining technology, which comprises the following steps: step1, collecting public opinion data by using a web crawler technology; step2, public opinion data preprocessing, including data cleaning, data integration, data conversion and data reduction; step3, establishing a social public opinion confidence model by using a text mining technology and a support vector machine algorithm, and dividing the social public opinion information confidence level; and 4, establishing an agent optimization configuration model. According to the agent optimal configuration algorithm model based on social public opinion data mining, provided by the invention, the social public opinions are divided into confidence levels according to real-time public opinion data of the Internet and algorithms such as text mining and SVM (support vector machine) and the like, so that the call volume reported by 12345 complaints is estimated and predicted, and the agent is scientifically and reasonably configured by using a big data technology.
Description
Technical Field
The invention relates to the technical field of databases, in particular to an agent optimal configuration method based on social public sentiment data mining technology.
Background
With the steady development of politics, economy, culture and society in China, the right-maintaining consciousness of people is gradually enhanced, the attention to social affairs is continuously improved, and the 12345 government affair service hotline becomes an effective window for reflecting social problem phenomena and expressing social appeal of people. In order to effectively exert the influence and the acting force of a 12345 government affair service hotline, the reasonable arrangement of the seats is a basic work which cannot be ignored, and the reasonable configuration of the seats is the basis and the key for the masses to effectively express complaints and reflect problems in time.
The existing seat configuration model only sets seats based on historical data such as call quantity, average processing time and the like, the seat model considers single factors, and the phenomenon that the seat configuration is unreasonable is easily caused by neglecting social public opinions which are closely related to the number of complaints of the masses. With the high-speed transmission of internet information, the relevance between the real-time public sentiment data of the internet and the complaint reporting information is continuously enhanced, and the mining of the social public sentiment data can provide stronger leading significance for the optimal configuration of the seat.
Disclosure of Invention
In view of the above defects in the prior art, the technical problem to be solved by the present invention is to provide an agent optimal configuration method based on social public opinion data mining technology, which performs confidence level division on social public opinions according to real-time public opinion data of the internet and algorithms such as text mining and SVM, evaluates and predicts telephone incoming call volume and hot-line average processing duration of 12345 complaint reporting, and provides effective data support for scientific and reasonable configuration of an agent by using big data technology.
In order to achieve the purpose, the invention provides an agent optimal configuration method based on social public sentiment data mining technology, which comprises the following steps:
step1, collecting public opinion data by using a web crawler technology;
step2, public opinion data preprocessing, including data cleaning, data integration, data conversion and data reduction;
step3, establishing a social public opinion confidence model by using a text mining technology and a support vector machine algorithm, and dividing the social public opinion information confidence level;
and 4, establishing an agent optimization configuration model.
Further, the step2 specifically includes:
step 21: data cleaning, namely identifying and processing the vacant data, the incomplete data and the unreasonable data;
step 22: data integration, namely organically centralizing and integrating data of different sources, formats and characteristics;
step 23: data conversion, converting the format of the data;
step 24: and (4) data reduction, namely, on the premise of keeping the integrity of the data, simplifying the data.
Further, the step3 specifically includes:
step 31: manually labeling, namely randomly extracting texts in a certain proportion, performing labeling classification by a plurality of related professionals, and counting the consistency of corpus labeling according to the labeling result;
step 32: feature selection, wherein the feature selection refers to selecting some representative words from a dictionary to realize dimension reduction, a Chi method is adopted to perform feature selection, and feature words w and categories a are assumedkThe chi-square distribution of the first-order degree of freedom is satisfied between the feature words w and the class akThe chi-square formula of (c) is:
n1 is the total number of documents, A is belonging to akThe number of documents in class and containing the feature word w, B being not akThe number of documents in class but containing the feature word w, E being akClass but no number of documents containing feature word w, D being not akThe number of documents which are similar and do not contain the feature word w;
for the situation of multiple categories, calculating chi-square statistic of the feature word w under each category;
if the feature word w and the category akIf the chi-square statistic value of (a) is 0, the feature word w and the text category a are describedkAre independent of each other; if the chi-square statistic value is larger, the characteristic word w and the category a are explainedkThe stronger the correlation of (c); removing the features lower than a specific threshold value through a chi-square formula, and reserving the features higher than the threshold value to realize feature selection;
step 33: feature extraction, namely, mapping a high-dimensional space to a low-dimensional space to realize dimension reduction, and performing feature extraction on a text by using an LSA algorithm, wherein the method mainly comprises the following steps:
1) establishing a word frequency matrix M;
2) calculating singular value decomposition of a word frequency matrix M, and decomposing the M into U, S, V three matrixes, wherein U and V are orthogonal matrixes, and S is a diagonal matrix;
3) mapping other training samples into a U space;
4) indexing and calculating similarity of the converted documents, and obtaining an LSA classifier through training;
step 34: constructing feature vectors, converting each text into an n-dimensional text vector, forming a text vector space by a plurality of text vectors, and assuming that n feature words exist, each text is an n-dimensional vector after being represented by the text;
step 35: constructing SVM classifier, and setting { (x 1),y1),(x2,y2),…,(xn,yn) The problem is transformed to the optimized hyperplane problem if the training set can be linearly divided by a hyperplane W · X + b { -0 } for a training set where xi represents the input vector and yi ∈ { -1,1} represents the output vector:
if the linear division is not linear, the input space R with low dimension can be divided by the kernel function K (x1, x2)nMapping to a high-dimensional characteristic space H to realize linear divisibility, and selecting a polynomial kernel function, wherein the formula is as follows:
K(x1,x2)=(<x1,x2>+R)d
1) selecting a proper kernel function K (x1, x2) and a penalty coefficient C >0, the formula of the objective function is as follows:
2) calculating an ai vector corresponding to minimization of the formula (2) by using an SMO algorithm;
4) Finding all samples (xm, ym) that meet 0< ai < C, assuming a total of M support vectors;
Step 36: and (5) predicting a result by the model.
Further, the step4 specifically includes:
step 41, according to the social public opinion information confidence level classification result in the step3, combining with the recent historical data of 12345 service hotlines, drawing an analysis curve related to time, and performing relevance analysis on hotline incoming call quantity and average processing time of each day and different time periods by using a relevance analysis algorithm;
step 42, by using a multiple regression analysis algorithm, taking historical data of social public opinion confidence level X1, and 12345 complaint report number X2, work order item type X3, and work order severity X4 as input variables, realizing weight distribution through multiple fitting, and finally constructing a daily hot-line incoming call quantity calculation formula in the following form:
F1(X)=W1*f1(X1)+W2*f2(X2)+W3*f3(X3)+W4*f4(X4)
hotline call volume F in different periods2(X) and hotline average processing time period F3(X) is calculated in a similar manner;
step 41, constructing an agent optimization configuration model by utilizing a multiple regression analysis algorithm, and assuming that the hot line incoming call quantity per day is F1(X) the hot incoming call quantity in different periods is F2(X) average hot line processing time length F3(X) if the call completing rate is a and the maximum occupancy rate is j, the agent optimization configuration function is as follows: g (X, a, j) ═ U1 × F1(X)+U2*F2(X)+U3*F3(X) + U4 g (a) + U5 h (j) Ui represents a weight.
Further, the step3 divides the social public opinion information into five confidence levels of optimism, prudent optimism, neutrality, prudent pessimism and pessimism.
The invention has the beneficial effects that:
according to the agent optimal configuration algorithm model based on social public opinion data mining, provided by the invention, the social public opinions are divided into confidence levels according to real-time public opinion data of the Internet and algorithms such as text mining and SVM (support vector machine) and the like, so that the call volume reported by 12345 complaints is estimated and predicted, and the agent is scientifically and reasonably configured by using a big data technology.
The conception, the specific structure and the technical effects of the present invention will be further described with reference to the accompanying drawings to fully understand the objects, the features and the effects of the present invention.
Drawings
FIG. 1 is an overall flow chart of the present invention.
Fig. 2 is a flow chart of establishing a social public opinion confidence model according to the present invention.
FIG. 3 is a flow chart of agent optimization configuration model establishment according to the present invention.
Detailed Description
As shown in fig. 1, the method for optimal configuration of an agent based on social public sentiment data mining technology of the present invention specifically comprises the following operation steps:
the method comprises the following steps: public opinion data collection
The method comprises the steps of collecting social public opinion data on the Internet by utilizing a web crawler technology, for example, regularly crawling and collecting the social public opinion data on social media such as various news websites, microblogs, forums, blogs and the like, and mainly using unstructured data mainly comprising text information.
Step two: public opinion data preprocessing
And preprocessing the crawled public opinion data, including the steps of data cleaning, data integration, data conversion, data reduction and the like.
Step 1: data cleaning: and the method identifies and processes the blank data, the incomplete data and the unreasonable data, and ensures the integrity, the reasonability, the authority and the consistency of the data.
Step 2: data integration: the data of different sources, formats and characteristics are organically centralized and integrated.
Step 3: data conversion: the format of the data is converted, so that the data can be analyzed and mined conveniently in the follow-up process.
Step 4: and (3) data reduction: on the premise of keeping the integrity of the data as much as possible, the data is simplified and processed by common dimensionality reduction methods such as PCA (principal component analysis).
Step three: the social public opinion confidence model is shown in fig. 2:
a social public opinion confidence model is established by utilizing a text mining technology and a support vector machine algorithm, and public opinion information is divided into five confidence levels of optimism, judicious optimism, neutrality, judicious pessimism and pessimism.
Step 1: manual labeling: randomly extracting texts in a certain proportion, carrying out labeling classification by a plurality of related professionals, counting the consistency of corpus labeling according to the labeling result, and using the passed labeling for information classification.
Step 2: selecting characteristics: the feature selection refers to selecting some representative words from a dictionary to realize dimension reduction.
Selecting characteristics by using a Chi method, and assuming characteristic words w and categories akThe chi-square distribution of the first-order degree of freedom is satisfied between the feature words w and the class akThe chi-square formula of (c) is:
n1 is the total number of documents, A is belonging to akThe number of documents in class and containing the feature word w, B being not akThe number of documents in class but containing the feature word w, E being akClass but no number of documents containing feature word w, D being not akClass and number of documents without the feature word w.
For the case of multiple categories, it is necessary to calculate chi-square statistics of the feature word w under each category.
If the feature word w and the category akIf the chi-square statistic value of (a) is 0, the feature word w and the text category a are describedkAre independent of each other; if the chi-square statistic value is larger, the characteristic word w and the category a are explainedkIn (2) correlation ofThe stronger the sex. Through a chi-square formula, the features lower than a specific threshold value can be removed, the features higher than the threshold value are reserved, and feature selection is realized.
Step 3: characteristic extraction: dimension reduction is achieved by mapping the high-dimensional space to the low-dimensional space.
The method is characterized by comprising the following steps of applying an LSA algorithm to extract the features of a text:
1) establishing a word frequency matrix M;
2) calculating singular value decomposition of a word frequency matrix M, and decomposing the M into U, S, V three matrixes, wherein U and V are orthogonal matrixes, and S is a diagonal matrix;
3) mapping other training samples into a U space;
4) and indexing and calculating the similarity of the converted documents, and obtaining the LSA classifier through training.
Step 4: constructing a feature vector: each text is converted into an n-dimensional text vector, and a plurality of text vectors form a text vector space. Assuming that n feature words are provided, each text is an n-dimensional vector after being represented by the text.
Step 5: constructing SVM classifier, and setting { (x 1),y1),(x2,y2),…,(xn,yn) The problem is transformed to the optimized hyperplane problem if the training set can be linearly divided by a hyperplane W · X + b { -0 } for a training set where xi represents the input vector and yi ∈ { -1,1} represents the output vector:
if the linear division is not linear, the input space R with low dimension can be divided by the kernel function K (x1, x2)nMapping to a high-dimensional feature space H to realize linear divisibility.
The kernel function refers to an inner product function of two vectors in a space after implicit mapping, common kernel functions include a polynomial kernel function, a linear kernel function, a gaussian kernel function and the like, and the polynomial kernel function is selected herein, and the formula is as follows:
K(x1,x2)=(<x1,x2>+R)d
1) selecting a proper kernel function K (x1, x2) and a penalty coefficient C >0, the formula of the objective function is as follows:
2) calculating a corresponding a vector when the formula (2) is minimized by using an SMO algorithm;
4) Finding all samples (xm, ym) that meet 0< ai < C, assuming a total of M support vectors;
Step 6: model prediction results
Confidence level classification is carried out on unmarked social public opinion data by adopting SVM parameters and models trained in step5
Step four: the agent optimization configuration model is as shown in FIG. 3:
step 1: and (4) according to the social public opinion confidence level classification result in the step three, combining the recent historical data of the 12345 service hotline, drawing an analysis curve related to time, and performing relevance analysis on the hotline incoming call quantity and the average processing time of each day and different time intervals by using a relevance analysis algorithm.
Step 2: by utilizing a multiple regression analysis algorithm, taking historical data such as social public opinion confidence level X1, number of complaints reported by 12345X 2, work order item type X3, work order severity X4 and the like as input variables, realizing weight distribution through multiple fitting, and finally constructing a hot line incoming call quantity calculation formula in the following form each day:
F1(X)=W1*f1(X1)+W2*f2(X2)+W3*f3(X3)+W4*f4(X4)
hotline call volume F in different periods2(X) and hotline average processing time period F3The calculation method of (X) is similar.
Step 3: and constructing an agent optimization configuration model by using a multiple regression analysis algorithm. Suppose that the daily hot line incoming call amount is F1(X) the hot incoming call quantity in different periods is F2(X) average hot line processing time length F3(X) if the call completing rate is a and the maximum occupancy rate is j, the agent optimization configuration function is as follows: g (X, a, j) ═ U1 × F1(X)+U2*F2(X)+U3*F3(X) + U4 g (a) + U5 h (j) Ui represents a weight.
According to the agent optimal configuration algorithm model based on social public opinion data mining, provided by the invention, the social public opinions are divided into confidence levels according to real-time public opinion data of the Internet and algorithms such as text mining and SVM (support vector machine) and the like, so that the call volume reported by 12345 complaints is estimated and predicted, and the agent is scientifically and reasonably configured by using a big data technology.
The foregoing detailed description of the preferred embodiments of the invention has been presented. It should be understood that numerous modifications and variations could be devised by those skilled in the art in light of the present teachings without departing from the inventive concepts. Therefore, the technical solutions available to those skilled in the art through logic analysis, reasoning and limited experiments based on the prior art according to the concept of the present invention should be within the scope of protection defined by the claims.
Claims (2)
1. A method for optimally configuring an agent based on a social public sentiment data mining technology is characterized by comprising the following steps:
step1, collecting public opinion data by using a web crawler technology;
step2, public opinion data preprocessing, including data cleaning, data integration, data conversion and data reduction;
step3, establishing a social public opinion confidence model by using a text mining technology and a support vector machine algorithm, and dividing the social public opinion information confidence level; the step3 divides the social public sentiment information into five confidence levels of optimism, judicious optimism, neutrality, judicious pessimism and pessimism,
step4, establishing an agent optimization configuration model, wherein the step4 specifically comprises the following steps:
step 41, according to the social public opinion information confidence level classification result in the step3, combining with the recent historical data of 12345 service hotlines, drawing an analysis curve related to time, and performing relevance analysis on hotline incoming call quantity and average processing time of each day and different time periods by using a relevance analysis algorithm;
step 42, by using a multiple regression analysis algorithm, taking historical data of social public opinion confidence level X1, and 12345 complaint report number X2, work order item type X3, and work order severity X4 as input variables, realizing weight distribution through multiple fitting, and finally constructing a daily hot-line incoming call quantity calculation formula in the following form:
F1(X)=W1*f1(X1)+W2*f2(X2)+W3*f3(X3)+W4*f4(X4)
hotline call volume F in different periods2(X) and hotline average processing time period F3(X) is calculated in a similar manner;
step 43, constructing an agent optimization configuration model by utilizing a multiple regression analysis algorithm, and assuming that the hot line incoming call quantity per day is F1(X) the hot incoming call quantity in different periods is F2(X) average hot line processing time length F3(X) if the call completing rate is a and the maximum occupancy rate is j, the agent optimization configuration function is as follows: g (X, a, j) ═ U1 × F1(X)+U2*F2(X)+U3*F3(X) + U4 g (a) + U5 h (j) Ui represents a weight.
2. The method for optimal configuration of the agent based on the social public opinion data mining technology as claimed in claim 1, wherein the step2 specifically comprises:
step 21: data cleaning, namely identifying and processing the vacant data, the incomplete data and the unreasonable data;
step 22: data integration, namely organically centralizing and integrating data of different sources, formats and characteristics;
step 23: data conversion, converting the format of the data;
step 24: and (4) data reduction, namely, on the premise of keeping the integrity of the data, simplifying the data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711445217.3A CN108021704B (en) | 2017-12-27 | 2017-12-27 | Agent optimal configuration method based on social public opinion data mining technology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711445217.3A CN108021704B (en) | 2017-12-27 | 2017-12-27 | Agent optimal configuration method based on social public opinion data mining technology |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108021704A CN108021704A (en) | 2018-05-11 |
CN108021704B true CN108021704B (en) | 2021-05-04 |
Family
ID=62071068
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711445217.3A Active CN108021704B (en) | 2017-12-27 | 2017-12-27 | Agent optimal configuration method based on social public opinion data mining technology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108021704B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109871889B (en) * | 2019-01-31 | 2019-12-24 | 内蒙古工业大学 | Public psychological assessment method under emergency |
CN110889556B (en) * | 2019-11-28 | 2022-08-12 | 福建亿榕信息技术有限公司 | Enterprise operation risk characteristic data information extraction method and extraction system |
CN115048487B (en) * | 2022-05-30 | 2024-05-03 | 平安科技(深圳)有限公司 | Public opinion analysis method, device, computer equipment and medium based on artificial intelligence |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104113643A (en) * | 2014-06-27 | 2014-10-22 | 国家电网公司 | Customer service center on-site monitoring system and method |
US9536191B1 (en) * | 2015-11-25 | 2017-01-03 | Osaro, Inc. | Reinforcement learning using confidence scores |
CN106530127A (en) * | 2016-11-09 | 2017-03-22 | 国网江苏省电力公司南京供电公司 | Complaint early warning and monitoring analysis system based on text mining |
CN106791225A (en) * | 2017-03-23 | 2017-05-31 | 国家电网公司客户服务中心 | A kind of alarm method and device |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070100779A1 (en) * | 2005-08-05 | 2007-05-03 | Ori Levy | Method and system for extracting web data |
US9141966B2 (en) * | 2009-12-23 | 2015-09-22 | Yahoo! Inc. | Opinion aggregation system |
US20120323627A1 (en) * | 2011-06-14 | 2012-12-20 | Microsoft Corporation | Real-time Monitoring of Public Sentiment |
CN103970864B (en) * | 2014-05-08 | 2017-09-22 | 清华大学 | Mood classification and mood component analyzing method and system based on microblogging text |
-
2017
- 2017-12-27 CN CN201711445217.3A patent/CN108021704B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104113643A (en) * | 2014-06-27 | 2014-10-22 | 国家电网公司 | Customer service center on-site monitoring system and method |
US9536191B1 (en) * | 2015-11-25 | 2017-01-03 | Osaro, Inc. | Reinforcement learning using confidence scores |
CN106530127A (en) * | 2016-11-09 | 2017-03-22 | 国网江苏省电力公司南京供电公司 | Complaint early warning and monitoring analysis system based on text mining |
CN106791225A (en) * | 2017-03-23 | 2017-05-31 | 国家电网公司客户服务中心 | A kind of alarm method and device |
Also Published As
Publication number | Publication date |
---|---|
CN108021704A (en) | 2018-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Jain et al. | An intelligent cognitive-inspired computing with big data analytics framework for sentiment analysis and classification | |
CN103678670B (en) | Micro-blog hot word and hot topic mining system and method | |
CN104199965B (en) | Semantic information retrieval method | |
CN106874292B (en) | Topic processing method and device | |
CN108021704B (en) | Agent optimal configuration method based on social public opinion data mining technology | |
CN103279478A (en) | Method for extracting features based on distributed mutual information documents | |
CN109471946A (en) | A kind of classification method and system of Chinese text | |
CN109271514A (en) | Generation method, classification method, device and the storage medium of short text disaggregated model | |
CN104834651A (en) | Method and apparatus for providing answers to frequently asked questions | |
CN116843162B (en) | Contradiction reconciliation scheme recommendation and scoring system and method | |
Rao et al. | Result prediction for political parties using Twitter sentiment analysis | |
CN108334573B (en) | High-correlation microblog retrieval method based on clustering information | |
Onwuegbuche et al. | Support vector machine for sentiment analysis of Nigerian banks financial tweets | |
CN112270189A (en) | Question type analysis node generation method, question type analysis node generation system and storage medium | |
Kulkarni et al. | Tweet Sentiment Analysis and Study and Comparison of Various Approaches and Classification Algorithms Used | |
CN114372145A (en) | Operation and maintenance resource dynamic allocation scheduling method based on knowledge graph platform | |
Ashwini et al. | Impact of Text Representation Techniques on Clustering Models | |
Najadat et al. | Analyzing social media opinions using data analytics | |
Bhuvaneswari et al. | Enhancing the sentiment classification accuracy of twitter data using machine learning algorithms | |
Rana et al. | News headlines classification using probabilistic approach | |
CN113806534B (en) | Hot event prediction method for social network | |
Al-Hadheri et al. | Text Classification in Arabic Natural Language Processing: A Review | |
CN111159393B (en) | Text generation method for abstract extraction based on LDA and D2V | |
Koulali et al. | A comparative study on text representation models for topic detection in arabic | |
CN118378792B (en) | Data processing analysis platform based on artificial intelligence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |