CN107357837A - 基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 - Google Patents
基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 Download PDFInfo
- Publication number
- CN107357837A CN107357837A CN201710481733.5A CN201710481733A CN107357837A CN 107357837 A CN107357837 A CN 107357837A CN 201710481733 A CN201710481733 A CN 201710481733A CN 107357837 A CN107357837 A CN 107357837A
- Authority
- CN
- China
- Prior art keywords
- word
- comment
- frequent
- mrow
- trainset
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 71
- 239000013598 vector Substances 0.000 claims abstract description 83
- 230000002996 emotional effect Effects 0.000 claims abstract description 23
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 12
- 230000011218 segmentation Effects 0.000 claims abstract description 11
- 230000008451 emotion Effects 0.000 claims description 86
- 238000005065 mining Methods 0.000 claims description 60
- 238000012549 training Methods 0.000 claims description 34
- 239000011159 matrix material Substances 0.000 claims description 27
- 238000012360 testing method Methods 0.000 claims description 19
- 238000004458 analytical method Methods 0.000 claims description 18
- 238000012795 verification Methods 0.000 claims description 17
- 238000013145 classification model Methods 0.000 claims description 10
- 238000004364 calculation method Methods 0.000 claims description 7
- 238000010801 machine learning Methods 0.000 claims description 7
- 238000012706 support-vector machine Methods 0.000 claims description 6
- 238000007781 pre-processing Methods 0.000 claims description 5
- 238000012552 review Methods 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 4
- 238000013507 mapping Methods 0.000 claims description 3
- 238000013138 pruning Methods 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 238000011156 evaluation Methods 0.000 abstract description 6
- 238000009412 basement excavation Methods 0.000 abstract 1
- 230000000694 effects Effects 0.000 description 7
- 238000000605 extraction Methods 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 6
- 238000003058 natural language processing Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000003066 decision tree Methods 0.000 description 3
- 230000007547 defect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000007637 random forest analysis Methods 0.000 description 2
- 238000010187 selection method Methods 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000007635 classification algorithm Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- DQJCHOQLCLEDLL-UHFFFAOYSA-N tricyclazole Chemical compound CC1=CC=CC2=C1N1C=NN=C1S2 DQJCHOQLCLEDLL-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0201—Market modelling; Market analysis; Collecting market data
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Finance (AREA)
- General Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Artificial Intelligence (AREA)
- Development Economics (AREA)
- Strategic Management (AREA)
- Data Mining & Analysis (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Entrepreneurship & Innovation (AREA)
- Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Game Theory and Decision Science (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710481733.5A CN107357837B (zh) | 2017-06-22 | 2017-06-22 | 基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710481733.5A CN107357837B (zh) | 2017-06-22 | 2017-06-22 | 基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107357837A true CN107357837A (zh) | 2017-11-17 |
CN107357837B CN107357837B (zh) | 2019-10-08 |
Family
ID=60273250
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710481733.5A Active CN107357837B (zh) | 2017-06-22 | 2017-06-22 | 基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107357837B (zh) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107967258A (zh) * | 2017-11-23 | 2018-04-27 | 广州艾媒数聚信息咨询股份有限公司 | 文本信息的情感分析方法和系统 |
CN108132930A (zh) * | 2017-12-27 | 2018-06-08 | 曙光信息产业(北京)有限公司 | 特征词提取方法及装置 |
CN108596637A (zh) * | 2018-04-24 | 2018-09-28 | 北京航空航天大学 | 一种电商服务问题自动发现系统 |
CN108984775A (zh) * | 2018-07-24 | 2018-12-11 | 南京新贝金服科技有限公司 | 一种基于商品评论的舆情监控方法及系统 |
CN109145187A (zh) * | 2018-07-23 | 2019-01-04 | 浙江大学 | 基于评论数据的跨平台电商欺诈检测方法和系统 |
CN109408621A (zh) * | 2018-10-29 | 2019-03-01 | 苏州派维斯信息科技有限公司 | 对话情感分析方法和系统 |
CN109408802A (zh) * | 2018-08-28 | 2019-03-01 | 厦门快商通信息技术有限公司 | 一种提升句向量语义的方法、系统及存储介质 |
CN109446528A (zh) * | 2018-10-30 | 2019-03-08 | 南京中孚信息技术有限公司 | 新型诈骗手法识别方法及装置 |
CN110347822A (zh) * | 2019-06-03 | 2019-10-18 | 佛山科学技术学院 | 一种评论文本的情感倾向分析方法及装置 |
CN110704710A (zh) * | 2019-09-05 | 2020-01-17 | 上海师范大学 | 一种基于深度学习的中文电商情感分类方法 |
CN111400495A (zh) * | 2020-03-17 | 2020-07-10 | 重庆邮电大学 | 一种基于模板特征的视频弹幕消费意图识别方法 |
CN111400432A (zh) * | 2020-06-04 | 2020-07-10 | 腾讯科技(深圳)有限公司 | 事件类型信息处理方法、事件类型识别方法及装置 |
CN112417093A (zh) * | 2020-11-11 | 2021-02-26 | 北京三快在线科技有限公司 | 一种模型训练的方法及装置 |
CN112463959A (zh) * | 2020-10-29 | 2021-03-09 | 中国人寿保险股份有限公司 | 一种基于上行短信的业务处理方法及相关设备 |
CN112825078A (zh) * | 2019-11-21 | 2021-05-21 | 北京沃东天骏信息技术有限公司 | 一种信息处理方法和装置 |
CN112905736A (zh) * | 2021-01-27 | 2021-06-04 | 郑州轻工业大学 | 一种基于量子理论的无监督文本情感分析方法 |
CN113111167A (zh) * | 2020-02-13 | 2021-07-13 | 北京明亿科技有限公司 | 基于深度学习模型的接处警文本车辆型号提取方法和装置 |
CN113393276A (zh) * | 2021-06-25 | 2021-09-14 | 食亨(上海)科技服务有限公司 | 评论数据的分类方法、装置和计算机可读介质 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105260437A (zh) * | 2015-09-30 | 2016-01-20 | 陈一飞 | 文本分类特征选择方法及其在生物医药文本分类中的应用 |
US20160085750A1 (en) * | 2014-09-24 | 2016-03-24 | Fujitsu Limited | Storage apparatus and storage apparatus control method |
-
2017
- 2017-06-22 CN CN201710481733.5A patent/CN107357837B/zh active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160085750A1 (en) * | 2014-09-24 | 2016-03-24 | Fujitsu Limited | Storage apparatus and storage apparatus control method |
CN105260437A (zh) * | 2015-09-30 | 2016-01-20 | 陈一飞 | 文本分类特征选择方法及其在生物医药文本分类中的应用 |
Non-Patent Citations (2)
Title |
---|
YUN XUE等: "Mining Order-Preserving Submatrices Based on Frequent Sequential Pattern Mining", 《SPRINGER INTERNATIONAL PUBLISHING SWITZERLAND 2014》 * |
薛云 等: "基于公共子序列的 OPSM 双聚类算法", 《华南师范大学学报(自然科学版)》 * |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107967258A (zh) * | 2017-11-23 | 2018-04-27 | 广州艾媒数聚信息咨询股份有限公司 | 文本信息的情感分析方法和系统 |
CN108132930A (zh) * | 2017-12-27 | 2018-06-08 | 曙光信息产业(北京)有限公司 | 特征词提取方法及装置 |
CN108596637B (zh) * | 2018-04-24 | 2022-05-06 | 北京航空航天大学 | 一种电商服务问题自动发现系统 |
CN108596637A (zh) * | 2018-04-24 | 2018-09-28 | 北京航空航天大学 | 一种电商服务问题自动发现系统 |
CN109145187A (zh) * | 2018-07-23 | 2019-01-04 | 浙江大学 | 基于评论数据的跨平台电商欺诈检测方法和系统 |
CN108984775A (zh) * | 2018-07-24 | 2018-12-11 | 南京新贝金服科技有限公司 | 一种基于商品评论的舆情监控方法及系统 |
CN109408802A (zh) * | 2018-08-28 | 2019-03-01 | 厦门快商通信息技术有限公司 | 一种提升句向量语义的方法、系统及存储介质 |
CN109408621B (zh) * | 2018-10-29 | 2021-04-02 | 苏州派维斯信息科技有限公司 | 对话情感分析方法和系统 |
CN109408621A (zh) * | 2018-10-29 | 2019-03-01 | 苏州派维斯信息科技有限公司 | 对话情感分析方法和系统 |
CN109446528A (zh) * | 2018-10-30 | 2019-03-08 | 南京中孚信息技术有限公司 | 新型诈骗手法识别方法及装置 |
CN110347822A (zh) * | 2019-06-03 | 2019-10-18 | 佛山科学技术学院 | 一种评论文本的情感倾向分析方法及装置 |
CN110704710A (zh) * | 2019-09-05 | 2020-01-17 | 上海师范大学 | 一种基于深度学习的中文电商情感分类方法 |
CN112825078A (zh) * | 2019-11-21 | 2021-05-21 | 北京沃东天骏信息技术有限公司 | 一种信息处理方法和装置 |
CN113111167A (zh) * | 2020-02-13 | 2021-07-13 | 北京明亿科技有限公司 | 基于深度学习模型的接处警文本车辆型号提取方法和装置 |
CN111400495A (zh) * | 2020-03-17 | 2020-07-10 | 重庆邮电大学 | 一种基于模板特征的视频弹幕消费意图识别方法 |
CN111400432B (zh) * | 2020-06-04 | 2020-09-25 | 腾讯科技(深圳)有限公司 | 事件类型信息处理方法、事件类型识别方法及装置 |
CN111400432A (zh) * | 2020-06-04 | 2020-07-10 | 腾讯科技(深圳)有限公司 | 事件类型信息处理方法、事件类型识别方法及装置 |
CN112463959A (zh) * | 2020-10-29 | 2021-03-09 | 中国人寿保险股份有限公司 | 一种基于上行短信的业务处理方法及相关设备 |
CN112417093A (zh) * | 2020-11-11 | 2021-02-26 | 北京三快在线科技有限公司 | 一种模型训练的方法及装置 |
CN112417093B (zh) * | 2020-11-11 | 2024-03-08 | 北京三快在线科技有限公司 | 一种模型训练的方法及装置 |
CN112905736A (zh) * | 2021-01-27 | 2021-06-04 | 郑州轻工业大学 | 一种基于量子理论的无监督文本情感分析方法 |
CN112905736B (zh) * | 2021-01-27 | 2023-09-19 | 郑州轻工业大学 | 一种基于量子理论的无监督文本情感分析方法 |
CN113393276A (zh) * | 2021-06-25 | 2021-09-14 | 食亨(上海)科技服务有限公司 | 评论数据的分类方法、装置和计算机可读介质 |
CN113393276B (zh) * | 2021-06-25 | 2023-06-16 | 食亨(上海)科技服务有限公司 | 评论数据的分类方法、装置和计算机可读介质 |
Also Published As
Publication number | Publication date |
---|---|
CN107357837B (zh) | 2019-10-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107357837A (zh) | 基于保序子矩阵和频繁序列挖掘的电商评论情感分类方法 | |
CN107491531B (zh) | 基于集成学习框架的中文网络评论情感分类方法 | |
Ishaq et al. | Aspect-based sentiment analysis using a hybridized approach based on CNN and GA | |
Kandhro et al. | Sentiment analysis of students’ comment using long-short term model | |
CN112434164B (zh) | 一种兼顾话题发现和情感分析的网络舆情分析方法及系统 | |
Rauf et al. | Using BERT for checking the polarity of movie reviews | |
CN108694176B (zh) | 文档情感分析的方法、装置、电子设备和可读存储介质 | |
Chong et al. | Comparison of naive bayes and svm classification in grid-search hyperparameter tuned and non-hyperparameter tuned healthcare stock market sentiment analysis | |
Rajalakshmi et al. | Sentimental analysis of code-mixed Hindi language | |
Jayakody et al. | Sentiment analysis on product reviews on twitter using Machine Learning Approaches | |
Hicham et al. | An efficient approach for improving customer Sentiment Analysis in the Arabic language using an Ensemble machine learning technique | |
Yadu et al. | A Hybrid Model Integrating Adaboost Approach for Sentimental Analysis of Airline Tweets. | |
Nithya et al. | Deep learning based analysis on code-mixed tamil text for sentiment classification with pre-trained ulmfit | |
Dhar et al. | Bengali news headline categorization using optimized machine learning pipeline | |
Patil et al. | Hate speech detection using deep learning and text analysis | |
CN117291190A (zh) | 一种基于情感词典和lda主题模型的用户需求计算方法 | |
Zhang et al. | Probabilistic verb selection for data-to-text generation | |
Awajan et al. | A review on sentiment analysis in arabic using document level | |
CN107729509A (zh) | 基于隐性高维分布式特征表示的篇章相似度判定方法 | |
Shanmugam et al. | Twitter sentiment analysis using novelty detection | |
Lubis et al. | Sentiment Analysis in social media: Handling Noisy Data and Detecting Sarcasm Using a Deep Learning Approach | |
Iqbal et al. | Implementation of supervised learning techniques for sentiment analysis of customer Tweets on airline services | |
Sharma et al. | A review of text mining techniques and applications | |
Banu et al. | Sentiment analysis for real-time micro blogs using twitter data | |
Bhatti et al. | Benchmarking Performance of Document Level Classification and Topic Modeling |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20210225 Address after: No. 411, block a, Xinxing science and technology building, No. 1, Guangmao Road, Weihai comprehensive bonded zone (South District), Wendeng District, Weihai City, Shandong Province, 264200 Patentee after: Shandong Yuncong Software Technology Co.,Ltd. Address before: Room 1703, building 1, Linghui Business Plaza, 278 Suzhou Avenue East, Suzhou Industrial Park, Suzhou area, China (Jiangsu) pilot Free Trade Zone, Suzhou 215000, Jiangsu Province Patentee before: Suzhou high Airlines intellectual property rights Operation Co.,Ltd. Effective date of registration: 20210225 Address after: Room 1703, building 1, Linghui Business Plaza, 278 Suzhou Avenue East, Suzhou Industrial Park, Suzhou area, China (Jiangsu) pilot Free Trade Zone, Suzhou 215000, Jiangsu Province Patentee after: Suzhou high Airlines intellectual property rights Operation Co.,Ltd. Address before: 510631 No. 55, Zhongshan Avenue, Tianhe District, Guangdong, Guangzhou Patentee before: SOUTH CHINA NORMAL University |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240228 Address after: Room 403, No. 7, Block B, Changhong Community, Torch High tech Industrial Development Zone, Weihai City, Shandong Province, 264209 Patentee after: Wang Shanshan Country or region after: China Address before: No. 411, block a, Xinxing science and technology building, No. 1, Guangmao Road, Weihai comprehensive bonded zone (South District), Wendeng District, Weihai City, Shandong Province, 264200 Patentee before: Shandong Yuncong Software Technology Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240325 Address after: 264205 Blue Star MixC No.23-B1203, Economic and Technological Development Zone, Weihai City, Shandong Province (self declared) Patentee after: Shandong Yuncong Software Technology Co.,Ltd. Country or region after: China Address before: Room 403, No. 7, Block B, Changhong Community, Torch High tech Industrial Development Zone, Weihai City, Shandong Province, 264209 Patentee before: Wang Shanshan Country or region before: China |