CN105335496B - 基于余弦相似度文本挖掘算法的客服重复来电处理方法 - Google Patents
基于余弦相似度文本挖掘算法的客服重复来电处理方法 Download PDFInfo
- Publication number
- CN105335496B CN105335496B CN201510695559.5A CN201510695559A CN105335496B CN 105335496 B CN105335496 B CN 105335496B CN 201510695559 A CN201510695559 A CN 201510695559A CN 105335496 B CN105335496 B CN 105335496B
- Authority
- CN
- China
- Prior art keywords
- text
- work order
- vector
- incoming call
- customer service
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000005065 mining Methods 0.000 title claims abstract description 18
- 238000003672 processing method Methods 0.000 title claims abstract description 12
- 239000013598 vector Substances 0.000 claims abstract description 47
- 239000000284 extract Substances 0.000 claims abstract description 8
- 238000012545 processing Methods 0.000 claims abstract description 4
- 238000005201 scrubbing Methods 0.000 claims abstract description 4
- 230000005856 abnormality Effects 0.000 claims abstract description 3
- 238000000034 method Methods 0.000 claims description 23
- 238000004364 calculation method Methods 0.000 claims description 13
- 238000012360 testing method Methods 0.000 claims description 4
- 230000002159 abnormal effect Effects 0.000 claims description 3
- 238000013178 mathematical model Methods 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 238000004458 analytical method Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 4
- 230000011218 segmentation Effects 0.000 description 3
- 230000005611 electricity Effects 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3325—Reformulation based on results of preceding query
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510695559.5A CN105335496B (zh) | 2015-10-22 | 2015-10-22 | 基于余弦相似度文本挖掘算法的客服重复来电处理方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510695559.5A CN105335496B (zh) | 2015-10-22 | 2015-10-22 | 基于余弦相似度文本挖掘算法的客服重复来电处理方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105335496A CN105335496A (zh) | 2016-02-17 |
CN105335496B true CN105335496B (zh) | 2019-05-21 |
Family
ID=55286023
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510695559.5A Active CN105335496B (zh) | 2015-10-22 | 2015-10-22 | 基于余弦相似度文本挖掘算法的客服重复来电处理方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105335496B (zh) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106227718A (zh) * | 2016-07-18 | 2016-12-14 | 中国民航大学 | 基于cnn的陆空通话语义一致性校验方法 |
CN106530127B (zh) * | 2016-11-09 | 2023-07-14 | 国网江苏省电力公司南京供电公司 | 基于文本挖掘技术的客户投诉预警监测分析系统 |
CN106529804B (zh) * | 2016-11-09 | 2023-08-18 | 国网江苏省电力公司南京供电公司 | 基于文本挖掘技术的客户投诉预警监测分析方法 |
CN108280766B (zh) * | 2017-01-06 | 2022-05-13 | 创新先进技术有限公司 | 交易行为风险识别方法及装置 |
CN106997345A (zh) * | 2017-03-31 | 2017-08-01 | 成都数联铭品科技有限公司 | 基于词向量和词统计信息的关键词抽取方法 |
CN107346344A (zh) * | 2017-07-24 | 2017-11-14 | 北京京东尚科信息技术有限公司 | 文本匹配的方法和装置 |
CN107798047B (zh) * | 2017-07-26 | 2021-03-02 | 深圳壹账通智能科技有限公司 | 重复工单检测方法、装置、服务器和介质 |
CN107463705A (zh) * | 2017-08-17 | 2017-12-12 | 陕西优百信息技术有限公司 | 一种数据清洗方法 |
CN107562853B (zh) * | 2017-08-28 | 2021-02-23 | 武汉烽火普天信息技术有限公司 | 一种面向海量互联网文本数据的流式聚类及展现的方法 |
CN107729919A (zh) * | 2017-09-15 | 2018-02-23 | 国网山东省电力公司电力科学研究院 | 基于大数据技术的深化投诉穿透分析方法 |
CN107861942B (zh) * | 2017-10-11 | 2021-10-26 | 国网浙江省电力有限公司营销服务中心 | 一种基于深度学习的电力疑似投诉工单识别方法 |
CN108550019B (zh) * | 2018-03-22 | 2022-03-25 | 创新先进技术有限公司 | 一种简历筛选方法及装置 |
CN108376178B (zh) * | 2018-03-22 | 2020-08-11 | 北京航空航天大学 | 一种异常访谈记录文本的确定方法及装置 |
CN109636538A (zh) * | 2018-12-20 | 2019-04-16 | 成都知数科技有限公司 | 银行产品推荐方法、装置及服务器 |
CN109885813B (zh) * | 2019-02-18 | 2023-04-28 | 武汉瓯越网视有限公司 | 一种基于词语覆盖度的文本相似度的运算方法及系统 |
CN110225036B (zh) * | 2019-06-12 | 2022-03-22 | 北京奇艺世纪科技有限公司 | 一种账号检测方法、装置、服务器及存储介质 |
CN110457473A (zh) * | 2019-07-16 | 2019-11-15 | 广州番禺职业技术学院 | 一种电力客服工单的问题聚合方法 |
CN111144109B (zh) * | 2019-12-27 | 2023-07-21 | 北京明略软件系统有限公司 | 文本相似度确定方法和装置 |
CN113626328A (zh) * | 2021-08-11 | 2021-11-09 | 中国银行股份有限公司 | 测试案例相似性排查方法及装置 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101196904A (zh) * | 2007-11-09 | 2008-06-11 | 清华大学 | 一种基于词频和多元文法的新闻关键词抽取方法 |
CN102446254A (zh) * | 2011-12-30 | 2012-05-09 | 中国信息安全测评中心 | 一种基于文本挖掘的相似漏洞查询方法 |
CN102937960A (zh) * | 2012-09-06 | 2013-02-20 | 北京邮电大学 | 突发事件热点话题的识别与评估装置和方法 |
-
2015
- 2015-10-22 CN CN201510695559.5A patent/CN105335496B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101196904A (zh) * | 2007-11-09 | 2008-06-11 | 清华大学 | 一种基于词频和多元文法的新闻关键词抽取方法 |
CN102446254A (zh) * | 2011-12-30 | 2012-05-09 | 中国信息安全测评中心 | 一种基于文本挖掘的相似漏洞查询方法 |
CN102937960A (zh) * | 2012-09-06 | 2013-02-20 | 北京邮电大学 | 突发事件热点话题的识别与评估装置和方法 |
Non-Patent Citations (2)
Title |
---|
Text Mining – Going Way Beyond Just Listening to the Voice of the Customer;Forte Consultancy;《https://forteconsultancy.wordpress.com/2010/05/17/text-mining-going-way-beyond-just-listening-to-the-voice-of-the-customer/》;20100517;正文第1页第7-13段 |
基于词频差异的特征选取及改进的TF-IDF公式;罗欣等;《计算机应用》;20050929;第25卷(第9期);正文第1.1节 |
Also Published As
Publication number | Publication date |
---|---|
CN105335496A (zh) | 2016-02-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105335496B (zh) | 基于余弦相似度文本挖掘算法的客服重复来电处理方法 | |
CN105389341B (zh) | 一种客服电话重复来电工单的文本聚类与分析方法 | |
CN103544255B (zh) | 基于文本语义相关的网络舆情信息分析方法 | |
CN103745000B (zh) | 一种中文微博客的热点话题检测方法 | |
CN103678670B (zh) | 一种微博热词与热点话题挖掘系统及方法 | |
CN107729336A (zh) | 数据处理方法、设备及系统 | |
CN104573130B (zh) | 基于群体计算的实体解析方法及装置 | |
CN108776671A (zh) | 一种网络舆情监控系统及方法 | |
CN106055539B (zh) | 姓名消歧的方法和装置 | |
CN105912524B (zh) | 基于低秩矩阵分解的文章话题关键词提取方法和装置 | |
CN109190051B (zh) | 一种用户行为分析方法和基于该分析方法的资源推荐方法 | |
CN103279478A (zh) | 一种基于分布式互信息文档特征提取方法 | |
CN108304382B (zh) | 基于制造过程文本数据挖掘的质量分析方法与系统 | |
CN107577724A (zh) | 一种大数据处理方法 | |
WO2019196259A1 (zh) | 一种虚假消息的识别方法及其设备 | |
CN114357117A (zh) | 事务信息查询方法、装置、计算机设备及存储介质 | |
CN110019820A (zh) | 一种病历中主诉与现病史症状时间一致性检测方法 | |
CN109783633A (zh) | 数据分析服务流程模型推荐方法 | |
CN106919997A (zh) | 一种基于lda的电子商务的用户消费预测方法 | |
CN111522950A (zh) | 一种针对非结构化海量文本敏感数据的快速识别系统 | |
Yu et al. | Exploiting structured news information to improve event detection via dual-level clustering | |
CN108268461A (zh) | 一种基于混合分类器的文本分类装置 | |
CN109213793A (zh) | 一种流式数据处理方法和系统 | |
CN107609921A (zh) | 一种数据处理方法及服务器 | |
Shen et al. | A cross-database comparison to discover potential product opportunities using text mining and cosine similarity |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder |
Address after: 250002 Wang Yue Road, Ji'nan City, Shandong Province, No. 2000 Patentee after: ELECTRIC POWER RESEARCH INSTITUTE OF STATE GRID SHANDONG ELECTRIC POWER Co. Patentee after: STATE GRID CORPORATION OF CHINA Address before: 250002 Wang Yue Road, Ji'nan City, Shandong Province, No. 2000 Patentee before: ELECTRIC POWER RESEARCH INSTITUTE OF STATE GRID SHANDONG ELECTRIC POWER Co. Patentee before: State Grid Corporation of China |
|
CP01 | Change in the name or title of a patent holder | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210409 Address after: No. 150, Jinger Road, Daguanyuan, Shizhong District, Jinan City, Shandong Province Patentee after: Shandong Electric Power Marketing Center Patentee after: ELECTRIC POWER RESEARCH INSTITUTE OF STATE GRID SHANDONG ELECTRIC POWER Co. Patentee after: STATE GRID CORPORATION OF CHINA Address before: 250002 Wang Yue Road, Ji'nan City, Shandong Province, No. 2000 Patentee before: ELECTRIC POWER RESEARCH INSTITUTE OF STATE GRID SHANDONG ELECTRIC POWER Co. Patentee before: STATE GRID CORPORATION OF CHINA |
|
TR01 | Transfer of patent right |