CN110765762B - 一种大数据背景下在线评论文本最佳主题提取系统和方法 - Google Patents
一种大数据背景下在线评论文本最佳主题提取系统和方法 Download PDFInfo
- Publication number
- CN110765762B CN110765762B CN201910933579.XA CN201910933579A CN110765762B CN 110765762 B CN110765762 B CN 110765762B CN 201910933579 A CN201910933579 A CN 201910933579A CN 110765762 B CN110765762 B CN 110765762B
- Authority
- CN
- China
- Prior art keywords
- word
- text
- module
- comment
- topic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 26
- 238000009826 distribution Methods 0.000 claims abstract description 33
- 238000010606 normalization Methods 0.000 claims abstract description 26
- 238000007781 pre-processing Methods 0.000 claims abstract description 21
- 238000000605 extraction Methods 0.000 claims abstract description 20
- 239000013598 vector Substances 0.000 claims abstract description 15
- 238000012552 review Methods 0.000 claims description 15
- 238000001514 detection method Methods 0.000 claims description 14
- 238000004140 cleaning Methods 0.000 claims description 11
- 238000012545 processing Methods 0.000 claims description 11
- 230000011218 segmentation Effects 0.000 claims description 10
- 238000013075 data extraction Methods 0.000 claims description 9
- 230000014509 gene expression Effects 0.000 claims description 8
- 239000000284 extract Substances 0.000 claims description 7
- 238000009499 grossing Methods 0.000 claims description 6
- 238000012216 screening Methods 0.000 claims description 6
- 238000006243 chemical reaction Methods 0.000 claims description 4
- 238000010276 construction Methods 0.000 claims description 2
- 239000000463 material Substances 0.000 claims 2
- 230000007547 defect Effects 0.000 abstract description 2
- 238000012804 iterative process Methods 0.000 abstract description 2
- 238000002360 preparation method Methods 0.000 abstract description 2
- 238000011160 research Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 238000005065 mining Methods 0.000 description 2
- 238000004451 qualitative analysis Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Machine Translation (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910933579.XA CN110765762B (zh) | 2019-09-29 | 2019-09-29 | 一种大数据背景下在线评论文本最佳主题提取系统和方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910933579.XA CN110765762B (zh) | 2019-09-29 | 2019-09-29 | 一种大数据背景下在线评论文本最佳主题提取系统和方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110765762A CN110765762A (zh) | 2020-02-07 |
CN110765762B true CN110765762B (zh) | 2023-04-18 |
Family
ID=69329074
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910933579.XA Active CN110765762B (zh) | 2019-09-29 | 2019-09-29 | 一种大数据背景下在线评论文本最佳主题提取系统和方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110765762B (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111899832B (zh) * | 2020-08-13 | 2024-03-29 | 东北电力大学 | 基于上下文语义分析的医疗主题管理系统与方法 |
CN112507064B (zh) * | 2020-11-09 | 2022-05-24 | 国网天津市电力公司 | 一种基于主题感知的跨模态序列到序列生成方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004185135A (ja) * | 2002-11-29 | 2004-07-02 | Mitsubishi Electric Corp | 話題変化抽出方法とその装置及び話題変化抽出プログラムとその情報記録伝送媒体 |
KR20160077446A (ko) * | 2014-12-23 | 2016-07-04 | 고려대학교 산학협력단 | 시맨틱 엔티티 토픽 추출 방법 |
CN108513176A (zh) * | 2017-12-06 | 2018-09-07 | 北京邮电大学 | 一种基于话题模型的社会化视频主题提取系统及方法 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8180755B2 (en) * | 2009-09-04 | 2012-05-15 | Yahoo! Inc. | Matching reviews to objects using a language model |
US10296837B2 (en) * | 2015-10-15 | 2019-05-21 | Sap Se | Comment-comment and comment-document analysis of documents |
-
2019
- 2019-09-29 CN CN201910933579.XA patent/CN110765762B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004185135A (ja) * | 2002-11-29 | 2004-07-02 | Mitsubishi Electric Corp | 話題変化抽出方法とその装置及び話題変化抽出プログラムとその情報記録伝送媒体 |
KR20160077446A (ko) * | 2014-12-23 | 2016-07-04 | 고려대학교 산학협력단 | 시맨틱 엔티티 토픽 추출 방법 |
CN108513176A (zh) * | 2017-12-06 | 2018-09-07 | 北京邮电大学 | 一种基于话题模型的社会化视频主题提取系统及方法 |
Also Published As
Publication number | Publication date |
---|---|
CN110765762A (zh) | 2020-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yasen et al. | Movies reviews sentiment analysis and classification | |
CN108255813B (zh) | 一种基于词频-逆文档与crf的文本匹配方法 | |
CN102622338A (zh) | 一种短文本间语义距离的计算机辅助计算方法 | |
CN110879831A (zh) | 基于实体识别技术的中医药语句分词方法 | |
Singh et al. | Sentiment analysis of Twitter data using TF-IDF and machine learning techniques | |
Sarwadnya et al. | Marathi extractive text summarizer using graph based model | |
CN113360647B (zh) | 一种基于聚类的5g移动业务投诉溯源分析方法 | |
CN106202065A (zh) | 一种跨语言话题检测方法及系统 | |
Manjari | Extractive summarization of Telugu documents using TextRank algorithm | |
CN110765762B (zh) | 一种大数据背景下在线评论文本最佳主题提取系统和方法 | |
CN107451116B (zh) | 一种移动应用内生大数据统计分析方法 | |
Britzolakis et al. | A review on lexicon-based and machine learning political sentiment analysis using tweets | |
CN114491062B (zh) | 一种融合知识图谱和主题模型的短文本分类方法 | |
Munnes et al. | Examining sentiment in complex texts. A comparison of different computational approaches | |
Hao et al. | SCESS: a WFSA-based automated simplified chinese essay scoring system with incremental latent semantic analysis | |
Alnajran et al. | A heuristic based pre-processing methodology for short text similarity measures in microblogs | |
US11599580B2 (en) | Method and system to extract domain concepts to create domain dictionaries and ontologies | |
Setiawan et al. | Social media emotion analysis in indonesian using fine-tuning bert model | |
CN117291190A (zh) | 一种基于情感词典和lda主题模型的用户需求计算方法 | |
Patel et al. | Influence of Gujarati STEmmeR in supervised learning of web page categorization | |
Wang et al. | Sentence-Ranking-Enhanced Keywords Extraction from Chinese Patents. | |
Medagoda et al. | Keywords based temporal sentiment analysis | |
Kuş et al. | An Extractive Text Summarization Model for Generating Extended Abstracts of Medical Papers in Turkish | |
Shaikh et al. | An intelligent framework for e-recruitment system based on text categorization and semantic analysis | |
CN115238093A (zh) | 一种模型训练的方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231025 Address after: 312300 No. 77, Fuxing West Road, phase 1, Shangyu Industry Education Integration Innovation Park, waiwujia village, Wuxing West Road, Cao'e street, Shangyu District, Shaoxing City, Zhejiang Province (residence declaration) Patentee after: SHANGYU SCIENCE AND ENGINEERING RESEARCH INSTITUTE CO., LTD. OF HANGZHOU DIANZI University Patentee after: HANGZHOU DIANZI University Address before: Room 810, A2 / F, Zhejiang University network new science and Technology Park, 2288 Jiangxi Road, Cao'e street, Shangyu District, Shaoxing City, Zhejiang Province, 312300 Patentee before: SHANGYU SCIENCE AND ENGINEERING RESEARCH INSTITUTE CO., LTD. OF HANGZHOU DIANZI University |
|
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: 310000 Xiasha Higher Education Park, Hangzhou City, Zhejiang Province Patentee after: HANGZHOU DIANZI University Country or region after: China Patentee after: SHANGYU SCIENCE AND ENGINEERING RESEARCH INSTITUTE CO., LTD. OF HANGZHOU DIANZI University Address before: 312300 No. 77, Fuxing West Road, phase 1, Shangyu Industry Education Integration Innovation Park, waiwujia village, Wuxing West Road, Cao'e street, Shangyu District, Shaoxing City, Zhejiang Province (residence declaration) Patentee before: SHANGYU SCIENCE AND ENGINEERING RESEARCH INSTITUTE CO., LTD. OF HANGZHOU DIANZI University Country or region before: China Patentee before: HANGZHOU DIANZI University |