WO2019006642A1 - System for identifying quality of comment for product in electronic commerce - Google Patents

System for identifying quality of comment for product in electronic commerce Download PDF

Info

Publication number
WO2019006642A1
WO2019006642A1 PCT/CN2017/091592 CN2017091592W WO2019006642A1 WO 2019006642 A1 WO2019006642 A1 WO 2019006642A1 CN 2017091592 W CN2017091592 W CN 2017091592W WO 2019006642 A1 WO2019006642 A1 WO 2019006642A1
Authority
WO
WIPO (PCT)
Prior art keywords
comment
module
similar
fake
comments
Prior art date
Application number
PCT/CN2017/091592
Other languages
French (fr)
Chinese (zh)
Inventor
陈钦鹏
Original Assignee
深圳齐心集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳齐心集团股份有限公司 filed Critical 深圳齐心集团股份有限公司
Priority to PCT/CN2017/091592 priority Critical patent/WO2019006642A1/en
Publication of WO2019006642A1 publication Critical patent/WO2019006642A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising

Definitions

  • the invention belongs to the field of electronic commerce, and in particular relates to an e-commerce product review quality identification system.
  • e-commerce has become a widely used form of commercial trade.
  • the buyers and sellers mainly conduct trading activities through the e-commerce webpage or software. Since e-commerce does not have a traditional physical storefront, the number of sales personnel is not high, so it is more controllable than the traditional transaction mode, so it has a greater price advantage.
  • e-commerce does not have a traditional physical storefront, the number of sales personnel is not high, so it is more controllable than the traditional transaction mode, so it has a greater price advantage.
  • there are many unscrupulous merchants who hire professional brush evaluation teams to create a large number of false comments to make false advertisements about their products in order to increase their sales, thereby deceiving consumers to improve their real sales.
  • the embodiment of the invention provides an e-commerce product review quality identification system, which aims to solve the problem that the prior art lacks accurate and effective related equipment to realize product identification quality identification.
  • an e-commerce product review quality authentication system comprising: a comment document construction module, a similar comment screening module, a similar ID extraction module, and a comment quality authentication module; wherein the comment document construction module, For extracting comment data, and classifying the review data by product category to construct a product review document corresponding to the product; the similar comment screening module, for filtering out similar comments from the product review document; the similar ID extraction Module for extracting similar IDs from product review documents
  • the comment quality identification module is configured to match the filtered similar comments with the extracted similar IDs according to the product review document, and capture product reviews with similar IDs and reviews according to the matching results, and capture the captured products.
  • the product review identifiers whose IDs and comments are similar are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID.
  • the comment document construction module output is respectively connected to the input of the similar comment screening module and the similar ID extraction module; the comment quality authentication module and the display comment screening module respectively, the similar ID The output of the extraction module is connected.
  • the comment quality authentication module comprises:
  • a receiving unit configured to receive a similar comment filtered by a similar comment screening module and a similar ID extracted by a similar ID extraction module;
  • a matching unit configured to match the filtered similar comments with the extracted similar IDs according to the product review document
  • a false comment identification unit configured to capture product reviews having similar IDs and comments according to the matching result, and identify the product reviews with similar captured IDs and comments as false comments, and ID identifiers corresponding to the fake comments A false comment ID.
  • the system further includes: an identifier start time entry module, configured to enter an identifier start time for the fake ID identified in the comment quality authentication module.
  • the system further comprises: a storage module, configured to store the identified fake ID.
  • the system further includes: a fake ID timing deletion module, configured to calculate a time value stored in the storage module according to the identifier start time and the current system time entered by the fake ID, and the time value is The preset time threshold is compared. When the time value is greater than the preset time threshold, the fake ID is deleted from the storage module.
  • a fake ID timing deletion module configured to calculate a time value stored in the storage module according to the identifier start time and the current system time entered by the fake ID, and the time value is The preset time threshold is compared. When the time value is greater than the preset time threshold, the fake ID is deleted from the storage module.
  • the time threshold is 30 to 60 days.
  • the system further includes: a data redundancy judging module, connected to the comment quality discriminating module and the fake ID storage module, configured to determine a false ID identified in the comment quality discriminating module and a false stored in the storage module Is the ID the same?
  • a data redundancy judging module connected to the comment quality discriminating module and the fake ID storage module, configured to determine a false ID identified in the comment quality discriminating module and a false stored in the storage module Is the ID the same?
  • the system further includes: a same fake ID deletion module, configured to delete the fake ID identified in the comment quality authentication module when the fake ID identified in the comment quality authentication module is the same as the fake ID stored in the storage module.
  • a same fake ID deletion module configured to delete the fake ID identified in the comment quality authentication module when the fake ID identified in the comment quality authentication module is the same as the fake ID stored in the storage module.
  • the e-commerce product review quality authentication system captures the comment data by the comment document building module, classifies the comment data according to the product category, and constructs a product review document corresponding to the product; and filters the module through the similar comment screening module. A similar comment is filtered out in the comment document; a similar ID extraction module extracts a similar ID from the product review document Finally, through the comment quality identification module, the selected similar comments are matched with the extracted similar IDs according to the product review document, and the product reviews with similar IDs and comments are captured according to the matching results, and the captured IDs and comments are extracted.
  • the similar product reviews are identified as false comments, and the ID corresponding to the false comments is identified as a false comment ID, which can identify false comments in the target product evaluation, and the determination result is highly reliable.
  • FIG. 1 is a schematic structural diagram of an e-commerce product review quality authentication system according to an embodiment of the present invention
  • FIG. 2 is a schematic structural diagram of a comment quality authentication module according to an embodiment of the present invention.
  • FIG. 3 is a schematic structural diagram of another e-commerce product review quality authentication system according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural diagram of still another e-commerce product review quality authentication system according to an embodiment of the present invention.
  • the e-commerce product review quality authentication system captures the comment data by the comment document building module, classifies the comment data according to the product category, and constructs a product review document corresponding to the product; and filters the module through the similar comment screening module. A similar comment is filtered out in the comment document; a similar ID extraction module extracts a similar ID from the product review document Finally, through the comment quality identification module, the selected similar comments are matched with the extracted similar IDs according to the product review document, and the product reviews with similar IDs and comments are captured according to the matching results, and the captured IDs and comments are extracted.
  • the similar product reviews are identified as false comments, and the ID corresponding to the false comments is identified as a false comment ID, which can identify false comments in the target product evaluation, and the determination result is highly reliable.
  • an e-commerce product review quality authentication system 100 includes: a comment document construction module 110, a similar comment screening module 120, a similar ID extraction module 130, and a comment quality authentication module 140;
  • the comment document construction module 110 is configured to capture the comment data, and classify the comment data by the product category to construct a product review document corresponding to the product;
  • the similar comment screening module 120 is configured to review the document from the product.
  • Similar comments are filtered out; the similar ID extraction module 130 is configured to extract a similar ID from the product review document
  • the comment quality discriminating module 140 is configured to match the filtered similar comments and the extracted similar IDs according to the product review document, and capture product reviews with similar IDs and reviews according to the matching results, and grab the products
  • the product review identifiers whose IDs and comments are similar are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID. It is possible to identify false comments in the evaluation of the target product, and the judgment result is highly reliable.
  • the output of the comment document construction module 110 is respectively connected to the input of the similar comment screening module 120 and the similar ID extraction module 130; the comment quality identification module 140 and the display respectively The comment screening module 120 and the output of the similar ID extraction module 130 are connected.
  • the comment quality discriminating module 140 includes: a receiving unit 141, configured to receive a similar comment filtered by a similar comment screening module and a similar ID extracted by a similar ID extraction module;
  • the matching unit 142 is configured to match the filtered similar comments and the extracted similar IDs according to the product review document, and the fake comment identifying unit 143, configured to capture the product reviews with similar IDs and comments according to the matching result, and
  • the product reviews with similar IDs and comments are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID.
  • the system 100 further includes: an identifier start time entry module 150, a storage module 160, and a fake ID timing deletion module 170.
  • the identifier start time entry module 150 is configured to enter an identifier start time for the fake ID identified in the comment quality authentication module.
  • the storage module 160 is configured to store the identified fake ID.
  • the fake ID timing deletion module 170 is configured to calculate a time value stored in the storage module according to the identifier start time and the current system time entered by the fake ID, and set the time value to a preset time threshold. The comparison is performed. When the time value is greater than the preset time threshold, the fake ID is deleted from the storage module.
  • the time threshold may be 30 to 60 days.
  • the identifier start time entry module enters the identifier start time of a fake ID identifier in the comment quality authentication module as 2011-06-06, and the current system time is 2011-077.
  • the fake ID timing deletion module deletes the modified false ID stored in the storage module; for example, when the time threshold is 45 days, the identifier start time entry module identifies the identifier in the comment quality authentication module.
  • the initial ID of the fake ID entry is 2011-06-06, and the current system time is 2011-07-21.
  • the fake ID timing deletion module deletes the modified false ID stored in the storage module;
  • the time threshold is 60 days.
  • the identifier start time entry module enters the identifier ID of the fake ID ID in the comment quality authentication module.
  • the start time is 2011-06-06, and the current system time is 2011-08-06.
  • the fake ID timing deletion module deletes the modified fake ID stored in the storage module.
  • the system 100 further includes: a data redundancy judging module 180 and an identical fake ID deleting module 190.
  • the data redundancy judging module 180 is connected to the comment quality discriminating module and the fake ID storage module, and is configured to determine whether the fake ID identified in the comment quality discriminating module is the same as the spurious ID stored in the storage module.
  • the same fake ID deletion module 190 is configured to delete the fake ID identified in the comment quality authentication module when the fake ID identified in the comment quality authentication module is the same as the fake ID stored in the storage module.
  • the storage module stores a fake ID of 123456
  • the database redundancy module identifies that the fake ID identified in the comment quality authentication module is 123456
  • the same fake ID deletion module will comment on the false ID identified in the quality authentication module.
  • the fake ID with the ID 123456 is deleted.
  • the e-commerce product review quality authentication system captures the comment data by the comment document building module, classifies the comment data by the product category, and constructs a product review document corresponding to the product; and filters the module from the product through the similar comment screening module. A similar comment is filtered out in the comment document; a similar ID extraction module extracts a similar ID from the product review document Finally, through the comment quality identification module, the selected similar comments are matched with the extracted similar IDs according to the product review document, and the product reviews with similar IDs and comments are captured according to the matching results, and the captured IDs and comments are extracted.
  • the similar product reviews are identified as false comments, and the ID corresponding to the false comments is identified as a false comment ID, which can identify false comments in the target product evaluation, and the determination result is highly reliable.

Abstract

The present invention provides a system for identifying the quality of a comment for a product in electronic commerce. The system comprises a comment file construction module, a similar comment screening module, a similar ID extraction module, and a comment quality identification module. The comment file construction module is used for capturing comment data and classifying the comment data according to the categories of commodities, so as to construct product comment files corresponding to the commodities. The similar comment screening module is used to sifting out similar comments in the product comment files. The similar ID extraction module is used for extracting similar IDs from the product comment files. The comment quality identification module is used for matching the selected similar comments with the extracted similar IDs according to the product comment files, capturing product comments with similar IDs and comments according to the matching results, marking the captured product comments with the similar IDs and the similar comments as fake comments, and marking the IDs corresponding to the fake comments as fake comment IDs. Fake comments in target product comments can be identified, and the reliability of the determining results is high.

Description

一种电子商务产品评论质量鉴别系统  E-commerce product review quality identification system 技术领域Technical field
本发明属于电子商务领域,尤其涉及一种电子商务产品评论质量鉴别系统。The invention belongs to the field of electronic commerce, and in particular relates to an e-commerce product review quality identification system.
背景技术Background technique
在当代,随着互联网的普及,电子商务已经成为一种被广泛利用的商业贸易方式。买卖双方主要是通过电商的网页或者是软件进行交易活动。由于电子商务没有传统的实体店面,对销售人员的数量要求也不高,所以相比传统交易模式更能够控制运营成本,因而有着更大的价格优势。但是,有很多不法商家为了提高自己的销量从而雇佣专业刷评价团队也制造大量的虚假评论来对自己的商品进行虚假的宣传,从而欺骗消费者来提高自己的真实销量。In the modern era, with the popularity of the Internet, e-commerce has become a widely used form of commercial trade. The buyers and sellers mainly conduct trading activities through the e-commerce webpage or software. Since e-commerce does not have a traditional physical storefront, the number of sales personnel is not high, so it is more controllable than the traditional transaction mode, so it has a greater price advantage. However, there are many unscrupulous merchants who hire professional brush evaluation teams to create a large number of false comments to make false advertisements about their products in order to increase their sales, thereby deceiving consumers to improve their real sales.
目前电子商务的发展迅猛,体量巨大,电商环境中的卖家数量众多,用户在进行购买决定时难以判断商品描述的真实性,对商品评价的依赖度很高,由于卖家评价作弊而造成的商品的性能好评度虚高的情况引起的买家利益损失的情况严重。在这样的情况下,如何对电子商务中商家的评价作弊行为进行识别和判断成电子商务发展过程中亟待解决的问题;在判断虚假评论过程中如何提高判断的准确性,避免误判情况的发生也是十分重要的考量因素;目前现有技术中还缺乏准确有效的相关设备实现产品评论质量的鉴别。At present, the development of e-commerce is rapid and huge, and there are many sellers in the e-commerce environment. It is difficult for users to judge the authenticity of the product description when making the purchase decision, and the dependence on the product evaluation is very high, which is caused by the seller’s evaluation cheating. The loss of buyer's interest caused by the high performance of the product's performance is serious. Under such circumstances, how to identify and judge the cheating behavior of merchants in e-commerce is an urgent problem to be solved in the process of e-commerce development; how to improve the accuracy of judgment in the process of judging false comments and avoid the occurrence of misjudgment It is also a very important consideration factor; at present, there is still a lack of accurate and effective related equipment in the prior art to realize the identification of product review quality.
技术问题technical problem
本发明实施例提供一种电子商务产品评论质量鉴别系统,旨在解决现有技术中还缺乏准确有效的相关设备实现产品评论质量的鉴别的问题。The embodiment of the invention provides an e-commerce product review quality identification system, which aims to solve the problem that the prior art lacks accurate and effective related equipment to realize product identification quality identification.
技术解决方案Technical solution
本发明实施例是这样实现的,一种电子商务产品评论质量鉴别系统,包括:评论文档构建模块、相似评论筛选模块、相似ID提取模块以及评论质量鉴别模块;其中,所述评论文档构建模块,用于抓取评论数据,同时将评论数据按商品类别进行分类构建与商品相对应的产品评论文档;所述相似评论筛选模块,用于从产品评论文档内筛选出相似评论;所述相似ID提取模块,用于从产品评论文档内提取出相似ID ;所述评论质量鉴别模块,用于根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配,并根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID。The embodiment of the present invention is implemented as follows: an e-commerce product review quality authentication system, comprising: a comment document construction module, a similar comment screening module, a similar ID extraction module, and a comment quality authentication module; wherein the comment document construction module, For extracting comment data, and classifying the review data by product category to construct a product review document corresponding to the product; the similar comment screening module, for filtering out similar comments from the product review document; the similar ID extraction Module for extracting similar IDs from product review documents The comment quality identification module is configured to match the filtered similar comments with the extracted similar IDs according to the product review document, and capture product reviews with similar IDs and reviews according to the matching results, and capture the captured products. The product review identifiers whose IDs and comments are similar are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID.
优选地,所述评论文档构建模块输出端分别与所述相似评论筛选模块、所述相似ID提取模块的输入端连接;所述评论质量鉴别模块分别与所述显示评论筛选模块、所述相似ID提取模块的输出端连接。Preferably, the comment document construction module output is respectively connected to the input of the similar comment screening module and the similar ID extraction module; the comment quality authentication module and the display comment screening module respectively, the similar ID The output of the extraction module is connected.
优选地,所述评论质量鉴别模块,包括:Preferably, the comment quality authentication module comprises:
接收单元,用于接收相似评论筛选模块筛选出的相似评论和相似ID提取模块提取出的相似ID;a receiving unit, configured to receive a similar comment filtered by a similar comment screening module and a similar ID extracted by a similar ID extraction module;
匹配单元,用于根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配;以及a matching unit, configured to match the filtered similar comments with the extracted similar IDs according to the product review document;
虚假评论标识单元,用于根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID。a false comment identification unit, configured to capture product reviews having similar IDs and comments according to the matching result, and identify the product reviews with similar captured IDs and comments as false comments, and ID identifiers corresponding to the fake comments A false comment ID.
优选地,所述系统还包括:标识起始时间录入模块,用于对评论质量鉴别模块内标识的虚假ID录入标识起始时间。Preferably, the system further includes: an identifier start time entry module, configured to enter an identifier start time for the fake ID identified in the comment quality authentication module.
优选地,所述系统还包括:存储模块,用于存储已标识的虚假ID。Preferably, the system further comprises: a storage module, configured to store the identified fake ID.
优选地,所述系统还包括:虚假ID定时删除模块,用于根据虚假ID录入的标识起始时间和当前系统时间,计算出该虚假ID在存储模块内存储的时间值,并将该时间值与预设的时间阈值进行比对,当该时间值大于预设的时间阈值时,则从存储模块内删除该虚假ID。Preferably, the system further includes: a fake ID timing deletion module, configured to calculate a time value stored in the storage module according to the identifier start time and the current system time entered by the fake ID, and the time value is The preset time threshold is compared. When the time value is greater than the preset time threshold, the fake ID is deleted from the storage module.
优选地,所述时间阈值为30~60天。Preferably, the time threshold is 30 to 60 days.
优选地,所述系统还包括:数据冗余判断模块,与所述评论质量鉴别模块、所述虚假ID存储模块连接,用于判断评论质量鉴别模块内标识的虚假ID与存储模块内存储的虚假ID是否相同。Preferably, the system further includes: a data redundancy judging module, connected to the comment quality discriminating module and the fake ID storage module, configured to determine a false ID identified in the comment quality discriminating module and a false stored in the storage module Is the ID the same?
优选地,所述系统还包括:相同虚假ID删除模块,用于当评论质量鉴别模块内标识的虚假ID与存储模块内存储的虚假ID相同时,则删除评论质量鉴别模块内标识的虚假ID。Preferably, the system further includes: a same fake ID deletion module, configured to delete the fake ID identified in the comment quality authentication module when the fake ID identified in the comment quality authentication module is the same as the fake ID stored in the storage module.
有益效果Beneficial effect
本发明实施例提供的电子商务产品评论质量鉴别系统,通过评论文档构建模块抓取评论数据,将评论数据按商品类别进行分类构建与商品相对应的产品评论文档;并通过相似评论筛选模块从产品评论文档内筛选出相似评论;相似ID提取模块,从产品评论文档内提取出相似ID ;最后通过评论质量鉴别模块根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配,并根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID,能够鉴别出目标商品评价中的虚假评论,判断结果可靠性高。The e-commerce product review quality authentication system provided by the embodiment of the present invention captures the comment data by the comment document building module, classifies the comment data according to the product category, and constructs a product review document corresponding to the product; and filters the module through the similar comment screening module. A similar comment is filtered out in the comment document; a similar ID extraction module extracts a similar ID from the product review document Finally, through the comment quality identification module, the selected similar comments are matched with the extracted similar IDs according to the product review document, and the product reviews with similar IDs and comments are captured according to the matching results, and the captured IDs and comments are extracted. The similar product reviews are identified as false comments, and the ID corresponding to the false comments is identified as a false comment ID, which can identify false comments in the target product evaluation, and the determination result is highly reliable.
附图说明DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are Some embodiments of the present invention may also be used to obtain other drawings based on these drawings without departing from the art.
以下附图仅旨在于对本发明做示意性说明和解释,并不限定本发明的范围。The following drawings are only intended to illustrate and explain the present invention, and do not limit the scope of the invention.
图1是本发明实施例提供的一种电子商务产品评论质量鉴别系统的结构示意图;1 is a schematic structural diagram of an e-commerce product review quality authentication system according to an embodiment of the present invention;
图2是本发明实施例提供的评论质量鉴别模块的结构示意图;2 is a schematic structural diagram of a comment quality authentication module according to an embodiment of the present invention;
图3是本发明实施例提供的另一种电子商务产品评论质量鉴别系统的结构示意图;3 is a schematic structural diagram of another e-commerce product review quality authentication system according to an embodiment of the present invention;
图4是本发明实施例提供的又一种电子商务产品评论质量鉴别系统的结构示意图。FIG. 4 is a schematic structural diagram of still another e-commerce product review quality authentication system according to an embodiment of the present invention.
本发明的实施方式Embodiments of the invention
为了使本发明的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本发明进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
本发明实施例提供的电子商务产品评论质量鉴别系统,通过评论文档构建模块抓取评论数据,将评论数据按商品类别进行分类构建与商品相对应的产品评论文档;并通过相似评论筛选模块从产品评论文档内筛选出相似评论;相似ID提取模块,从产品评论文档内提取出相似ID ;最后通过评论质量鉴别模块根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配,并根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID,能够鉴别出目标商品评价中的虚假评论,判断结果可靠性高。The e-commerce product review quality authentication system provided by the embodiment of the present invention captures the comment data by the comment document building module, classifies the comment data according to the product category, and constructs a product review document corresponding to the product; and filters the module through the similar comment screening module. A similar comment is filtered out in the comment document; a similar ID extraction module extracts a similar ID from the product review document Finally, through the comment quality identification module, the selected similar comments are matched with the extracted similar IDs according to the product review document, and the product reviews with similar IDs and comments are captured according to the matching results, and the captured IDs and comments are extracted. The similar product reviews are identified as false comments, and the ID corresponding to the false comments is identified as a false comment ID, which can identify false comments in the target product evaluation, and the determination result is highly reliable.
以下结合具体实施例对本发明的具体实现进行详细描述。The specific implementation of the present invention will be described in detail below with reference to specific embodiments.
如图1所示,在本发明实施例中,一种电子商务产品评论质量鉴别系统100,包括:评论文档构建模块110、相似评论筛选模块120、相似ID提取模块130以及评论质量鉴别模块140;其中,所述评论文档构建模块110,用于抓取评论数据,同时将评论数据按商品类别进行分类构建与商品相对应的产品评论文档;所述相似评论筛选模块120,用于从产品评论文档内筛选出相似评论;所述相似ID提取模块130,用于从产品评论文档内提取出相似ID ;所述评论质量鉴别模块140,用于根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配,并根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID。能够鉴别出目标商品评价中的虚假评论,判断结果可靠性高。As shown in FIG. 1 , in an embodiment of the present invention, an e-commerce product review quality authentication system 100 includes: a comment document construction module 110, a similar comment screening module 120, a similar ID extraction module 130, and a comment quality authentication module 140; The comment document construction module 110 is configured to capture the comment data, and classify the comment data by the product category to construct a product review document corresponding to the product; the similar comment screening module 120 is configured to review the document from the product. Similar comments are filtered out; the similar ID extraction module 130 is configured to extract a similar ID from the product review document The comment quality discriminating module 140 is configured to match the filtered similar comments and the extracted similar IDs according to the product review document, and capture product reviews with similar IDs and reviews according to the matching results, and grab the products The product review identifiers whose IDs and comments are similar are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID. It is possible to identify false comments in the evaluation of the target product, and the judgment result is highly reliable.
在本发明实施例中,所述评论文档构建模块110输出端分别与所述相似评论筛选模块120、所述相似ID提取模块130的输入端连接;所述评论质量鉴别模块140分别与所述显示评论筛选模块120、所述相似ID提取模块130的输出端连接。In the embodiment of the present invention, the output of the comment document construction module 110 is respectively connected to the input of the similar comment screening module 120 and the similar ID extraction module 130; the comment quality identification module 140 and the display respectively The comment screening module 120 and the output of the similar ID extraction module 130 are connected.
在本发明实施例中,如图2所示,所述评论质量鉴别模块140,包括:接收单元141,用于接收相似评论筛选模块筛选出的相似评论和相似ID提取模块提取出的相似ID;匹配单元142,用于根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配;以及虚假评论标识单元143,用于根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID。In the embodiment of the present invention, as shown in FIG. 2, the comment quality discriminating module 140 includes: a receiving unit 141, configured to receive a similar comment filtered by a similar comment screening module and a similar ID extracted by a similar ID extraction module; The matching unit 142 is configured to match the filtered similar comments and the extracted similar IDs according to the product review document, and the fake comment identifying unit 143, configured to capture the product reviews with similar IDs and comments according to the matching result, and The product reviews with similar IDs and comments are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID.
在本发明实施例中,如图3所示,所述系统100还包括:标识起始时间录入模块150、存储模块160和虚假ID定时删除模块170。其中,所述标识起始时间录入模块150,用于对评论质量鉴别模块内标识的虚假ID录入标识起始时间。所述存储模块160,用于存储已标识的虚假ID。所述虚假ID定时删除模块170,用于根据虚假ID录入的标识起始时间和当前系统时间,计算出该虚假ID在存储模块内存储的时间值,并将该时间值与预设的时间阈值进行比对,当该时间值大于预设的时间阈值时,则从存储模块内删除该虚假ID。In the embodiment of the present invention, as shown in FIG. 3, the system 100 further includes: an identifier start time entry module 150, a storage module 160, and a fake ID timing deletion module 170. The identifier start time entry module 150 is configured to enter an identifier start time for the fake ID identified in the comment quality authentication module. The storage module 160 is configured to store the identified fake ID. The fake ID timing deletion module 170 is configured to calculate a time value stored in the storage module according to the identifier start time and the current system time entered by the fake ID, and set the time value to a preset time threshold. The comparison is performed. When the time value is greater than the preset time threshold, the fake ID is deleted from the storage module.
在本实施例中,所述时间阈值可为30~60天。例如,当所述时间阈值为30天,所述标识起始时间录入模块对评论质量鉴别模块内标识的一虚假ID录入的标识起始时间为2011-06-06,当前系统时间为2011-07-06,则所述虚假ID定时删除模块删除存储模块内存储的改虚假ID;又如,当所述时间阈值为45天,所述所述标识起始时间录入模块对评论质量鉴别模块内标识的一虚假ID录入的标识起始时间为2011-06-06,当前系统时间为2011-07-21,则所述虚假ID定时删除模块删除存储模块内存储的改虚假ID;再如,当所述时间阈值为60天,所述所述标识起始时间录入模块对评论质量鉴别模块内标识的一虚假ID录入的标识起始时间为2011-06-06,当前系统时间为2011-08-06,则所述虚假ID定时删除模块删除存储模块内存储的改虚假ID。In this embodiment, the time threshold may be 30 to 60 days. For example, when the time threshold is 30 days, the identifier start time entry module enters the identifier start time of a fake ID identifier in the comment quality authentication module as 2011-06-06, and the current system time is 2011-077. -06, the fake ID timing deletion module deletes the modified false ID stored in the storage module; for example, when the time threshold is 45 days, the identifier start time entry module identifies the identifier in the comment quality authentication module. The initial ID of the fake ID entry is 2011-06-06, and the current system time is 2011-07-21. The fake ID timing deletion module deletes the modified false ID stored in the storage module; The time threshold is 60 days. The identifier start time entry module enters the identifier ID of the fake ID ID in the comment quality authentication module. The start time is 2011-06-06, and the current system time is 2011-08-06. The fake ID timing deletion module deletes the modified fake ID stored in the storage module.
在本发明实施例中,如图4所示,所述系统100还包括:数据冗余判断模块180和相同虚假ID删除模块190。其中,所述数据冗余判断模块180,与所述评论质量鉴别模块、所述虚假ID存储模块连接,用于判断评论质量鉴别模块内标识的虚假ID与存储模块内存储的虚假ID是否相同。所述相同虚假ID删除模块190,用于当评论质量鉴别模块内标识的虚假ID与存储模块内存储的虚假ID相同时,则删除评论质量鉴别模块内标识的虚假ID。例如,当存储模块内存储有一虚假ID为123456,所述数据库冗余模块识别出评论质量鉴别模块内标识的虚假ID为123456,则相同虚假ID删除模块将评论质量鉴别模块内标识的虚假ID中ID为123456的虚假ID删除。In the embodiment of the present invention, as shown in FIG. 4, the system 100 further includes: a data redundancy judging module 180 and an identical fake ID deleting module 190. The data redundancy judging module 180 is connected to the comment quality discriminating module and the fake ID storage module, and is configured to determine whether the fake ID identified in the comment quality discriminating module is the same as the spurious ID stored in the storage module. The same fake ID deletion module 190 is configured to delete the fake ID identified in the comment quality authentication module when the fake ID identified in the comment quality authentication module is the same as the fake ID stored in the storage module. For example, when the storage module stores a fake ID of 123456, and the database redundancy module identifies that the fake ID identified in the comment quality authentication module is 123456, the same fake ID deletion module will comment on the false ID identified in the quality authentication module. The fake ID with the ID 123456 is deleted.
上述发明实施例提供的电子商务产品评论质量鉴别系统,通过评论文档构建模块抓取评论数据,将评论数据按商品类别进行分类构建与商品相对应的产品评论文档;并通过相似评论筛选模块从产品评论文档内筛选出相似评论;相似ID提取模块,从产品评论文档内提取出相似ID ;最后通过评论质量鉴别模块根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配,并根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID,能够鉴别出目标商品评价中的虚假评论,判断结果可靠性高。The e-commerce product review quality authentication system provided by the above embodiment of the invention captures the comment data by the comment document building module, classifies the comment data by the product category, and constructs a product review document corresponding to the product; and filters the module from the product through the similar comment screening module. A similar comment is filtered out in the comment document; a similar ID extraction module extracts a similar ID from the product review document Finally, through the comment quality identification module, the selected similar comments are matched with the extracted similar IDs according to the product review document, and the product reviews with similar IDs and comments are captured according to the matching results, and the captured IDs and comments are extracted. The similar product reviews are identified as false comments, and the ID corresponding to the false comments is identified as a false comment ID, which can identify false comments in the target product evaluation, and the determination result is highly reliable.
以上所述仅为本发明的较佳实施例而已,并不用以限制本发明,凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明的保护范围之内。The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the protection of the present invention. Within the scope.

Claims (9)

  1. 一种电子商务产品评论质量鉴别系统,其特征在于,包括:评论文档构建模块、相似评论筛选模块、相似ID提取模块以及评论质量鉴别模块;其中,所述评论文档构建模块,用于抓取评论数据,同时将评论数据按商品类别进行分类构建与商品相对应的产品评论文档;所述相似评论筛选模块,用于从产品评论文档内筛选出相似评论;所述相似ID提取模块,用于从产品评论文档内提取出相似ID ;所述评论质量鉴别模块,用于根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配,并根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID。 An e-commerce product review quality authentication system, comprising: a comment document construction module, a similar comment screening module, a similar ID extraction module, and a comment quality authentication module; wherein the comment document construction module is configured to capture a comment Data, while classifying the review data by product category to construct a product review document corresponding to the product; the similar comment screening module for filtering out similar comments from the product review document; the similar ID extraction module for A similar ID is extracted from the product review document The comment quality identification module is configured to match the filtered similar comments with the extracted similar IDs according to the product review document, and capture product reviews with similar IDs and reviews according to the matching results, and capture the captured products. The product review identifiers whose IDs and comments are similar are identified as false comments, and the ID corresponding to the fake comments is identified as a fake comment ID.
  2. 如权利要求1所述的电子商务产品评论质量鉴别系统,其特征在于,所述评论文档构建模块输出端分别与所述相似评论筛选模块、所述相似ID提取模块的输入端连接;所述评论质量鉴别模块分别与所述显示评论筛选模块、所述相似ID提取模块的输出端连接。The e-commerce product review quality authentication system according to claim 1, wherein the comment document construction module output is respectively connected to the input of the similar comment screening module and the similar ID extraction module; The quality authentication module is respectively connected to the output of the display comment screening module and the similar ID extraction module.
  3. 如权利要求1所述的电子商务产品评论质量鉴别系统,其特征在于,所述评论质量鉴别模块,包括:The e-commerce product review quality authentication system according to claim 1, wherein the comment quality authentication module comprises:
    接收单元,用于接收相似评论筛选模块筛选出的相似评论和相似ID提取模块提取出的相似ID;a receiving unit, configured to receive a similar comment filtered by a similar comment screening module and a similar ID extracted by a similar ID extraction module;
    匹配单元,用于根据产品评论文档将筛选出的相似评论和提取出的相似ID进行匹配;以及a matching unit, configured to match the filtered similar comments with the extracted similar IDs according to the product review document;
    虚假评论标识单元,用于根据匹配结果抓取出ID和评论都相似的产品评论,并将抓取的ID和评论都相似的产品评论标识为虚假评论,以及将所述虚假评论对应的ID标识为虚假评论ID。a false comment identification unit, configured to capture product reviews having similar IDs and comments according to the matching result, and identify the product reviews with similar captured IDs and comments as false comments, and ID identifiers corresponding to the fake comments A false comment ID.
  4. 如权利要求3所述的电子商务产品评论质量鉴别系统,其特征在于,还包括:标识起始时间录入模块,用于对评论质量鉴别模块内标识的虚假ID录入标识起始时间。The e-commerce product review quality authentication system according to claim 3, further comprising: an identification start time entry module, configured to enter an identification start time for the fake ID identified in the comment quality authentication module.
  5. 如权利要求4所述的电子商务产品评论质量鉴别系统,其特征在于,还包括:存储模块,用于存储已标识的虚假ID。The e-commerce product review quality authentication system according to claim 4, further comprising: a storage module, configured to store the identified fake ID.
  6. 如权利要求5所述的电子商务产品评论质量鉴别系统,其特征在于,还包括:虚假ID定时删除模块,用于根据虚假ID录入的标识起始时间和当前系统时间,计算出该虚假ID在存储模块内存储的时间值,并将该时间值与预设的时间阈值进行比对,当该时间值大于预设的时间阈值时,则从存储模块内删除该虚假ID。The e-commerce product review quality authentication system according to claim 5, further comprising: a fake ID timing deletion module, configured to calculate the false ID according to the identifier start time and the current system time entered by the fake ID The time value stored in the storage module is compared with a preset time threshold. When the time value is greater than the preset time threshold, the fake ID is deleted from the storage module.
  7. 如权利要求6所述的电子商务产品评论质量鉴别系统,其特征在于,所述时间阈值为30~60天。The e-commerce product review quality authentication system according to claim 6, wherein the time threshold is 30 to 60 days.
  8. 如权利要求7所述的电子商务产品评论质量鉴别系统,其特征在于,还包括:数据冗余判断模块,与所述评论质量鉴别模块、所述虚假ID存储模块连接,用于判断评论质量鉴别模块内标识的虚假ID与存储模块内存储的虚假ID是否相同。The e-commerce product review quality authentication system according to claim 7, further comprising: a data redundancy judging module, coupled to the comment quality discriminating module and the fake ID storage module, for judging the quality of the comment. The fake ID identified in the module is the same as the fake ID stored in the storage module.
  9. 如权利要求8所述的电子商务产品评论质量鉴别系统,其特征在于,还包括:相同虚假ID删除模块,用于当评论质量鉴别模块内标识的虚假ID与存储模块内存储的虚假ID相同时,则删除评论质量鉴别模块内标识的虚假ID。 The e-commerce product review quality authentication system according to claim 8, further comprising: a same fake ID deletion module, configured to: when the fake ID identified in the comment quality authentication module is the same as the fake ID stored in the storage module , delete the fake ID identified in the comment quality authentication module.
PCT/CN2017/091592 2017-07-04 2017-07-04 System for identifying quality of comment for product in electronic commerce WO2019006642A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/091592 WO2019006642A1 (en) 2017-07-04 2017-07-04 System for identifying quality of comment for product in electronic commerce

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/091592 WO2019006642A1 (en) 2017-07-04 2017-07-04 System for identifying quality of comment for product in electronic commerce

Publications (1)

Publication Number Publication Date
WO2019006642A1 true WO2019006642A1 (en) 2019-01-10

Family

ID=64949555

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/091592 WO2019006642A1 (en) 2017-07-04 2017-07-04 System for identifying quality of comment for product in electronic commerce

Country Status (1)

Country Link
WO (1) WO2019006642A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117076812B (en) * 2023-10-13 2023-12-12 西安康奈网络科技有限公司 Intelligent monitoring management system of network information release and propagation platform
US20240062264A1 (en) * 2021-10-13 2024-02-22 Abhishek Trikha Ai- backed e-commerce for all the top rated products on a single platform

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090198677A1 (en) * 2008-02-05 2009-08-06 Nuix Pty.Ltd. Document Comparison Method And Apparatus
CN104867018A (en) * 2015-05-16 2015-08-26 成都数联铭品科技有限公司 Electronic commerce evaluation judgment system based on evaluation content and ID similarity identification
CN104881796A (en) * 2015-05-16 2015-09-02 成都数联铭品科技有限公司 False comment judgment system based on comment content and ID recognition
CN107392654A (en) * 2017-07-04 2017-11-24 深圳齐心集团股份有限公司 A kind of e-commerce product comments on quality discrimination system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090198677A1 (en) * 2008-02-05 2009-08-06 Nuix Pty.Ltd. Document Comparison Method And Apparatus
CN104867018A (en) * 2015-05-16 2015-08-26 成都数联铭品科技有限公司 Electronic commerce evaluation judgment system based on evaluation content and ID similarity identification
CN104881796A (en) * 2015-05-16 2015-09-02 成都数联铭品科技有限公司 False comment judgment system based on comment content and ID recognition
CN107392654A (en) * 2017-07-04 2017-11-24 深圳齐心集团股份有限公司 A kind of e-commerce product comments on quality discrimination system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240062264A1 (en) * 2021-10-13 2024-02-22 Abhishek Trikha Ai- backed e-commerce for all the top rated products on a single platform
CN117076812B (en) * 2023-10-13 2023-12-12 西安康奈网络科技有限公司 Intelligent monitoring management system of network information release and propagation platform

Similar Documents

Publication Publication Date Title
US9495445B2 (en) Document sorting system, document sorting method, and document sorting program
WO2022021400A1 (en) E-commerce comment identification and marking system
CN111104798B (en) Resolution method, system and computer readable storage medium for sentencing episodes in legal documents
CN107636662A (en) Web content certification
CN104867017A (en) Electronic commerce client false evaluation identification system
CN104881796A (en) False comment judgment system based on comment content and ID recognition
US20160125404A1 (en) Face recognition business model and method for identifying perpetrators of atm fraud
CN109885597A (en) Tenant group processing method, device and electric terminal based on machine learning
WO2019006642A1 (en) System for identifying quality of comment for product in electronic commerce
CN106408334A (en) Verification method and system of network advertisements
WO2022021391A1 (en) Electronic commerce information push monitoring system
CN109145187A (en) Cross-platform electric business fraud detection method and system based on comment data
CN112765565A (en) Copyright protection method and system based on block chain
CN205581878U (en) Two camera cabinet -type air conditioner testimony of a witness recognition device
KR20150061539A (en) Providing method and system for preventing fraud trading
CN107392654A (en) A kind of e-commerce product comments on quality discrimination system
CN111383109B (en) Picture copyright trading method and device
CN104867018A (en) Electronic commerce evaluation judgment system based on evaluation content and ID similarity identification
KR102482969B1 (en) Artificial intelligence-based system and method for online counterfeit product crackdown
CN205581899U (en) Single camera cabinet -type air conditioner testimony of a witness recognition device
JP2002024539A (en) Individual identification system for credit accommodation
CN105046511B (en) Commodity transaction information on-line acquisition system based on information collection box
CN205427932U (en) Process control system of accounting statement
WO2019006643A1 (en) Electronic commerce information push system
CN108763233A (en) The method and apparatus of the identification of doubtful fake products commodity and classification based on big data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17917150

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17917150

Country of ref document: EP

Kind code of ref document: A1