WO2020177463A1 - Information processing method and apparatus, storage medium, and electronic device

Info

Publication number
WO2020177463A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
level information
level
comment
review
Application number
PCT/CN2019/129063
Other languages
French (fr)
Chinese (zh)
Inventor
徐凯姮
Original Assignee
拉扎斯网络科技(上海)有限公司
Application filed by 拉扎斯网络科技(上海)有限公司
Publication of WO2020177463A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G06F40/42 Data-driven translation
    • G06F40/44 Statistical methods, e.g. probability models
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/06 Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising

Definitions

  • the valid information determining unit 304 can determine whether the comment information is valid according to the difference between the comment level information and the first level information. When the difference between the comment level information and the first level information is less than a predetermined range, the valid information determining unit 305 determines that the comment information is valid. When the difference between the comment level information and the first level information is large, the text information in the user's comment information can be considered inconsistent with what the comment level reflects; the valid information determining unit 305 then determines that the comment information is invalid, and the comment information is disregarded. In this way, invalid user comments can be filtered out, improving the accuracy and reliability of merchant evaluation.
  • the second level information determining module 3032 is configured to perform a weighted sum operation on the weight and corresponding level information to determine the second level information.
  • the predetermined rule may be based on industry characteristics, or may be based on industry experience.
  • a can be 0.5 and b can be 0.5.
  • the computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium.
  • the computer-readable storage medium can be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Educational Administration (AREA)
  • Probability & Statistics with Applications (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Machine Translation (AREA)

Abstract

Disclosed are an information processing method and apparatus, a storage medium, and an electronic device. The method comprises: acquiring comment information of a user, wherein the comment information comprises text information and comment level information; analyzing the text information according to a predetermined rule, so as to determine first level information; and determining second level information of the comment information according to the comment level information and the first level information. By means of the present invention, a relatively accurate evaluation of a merchant can be obtained.

Description

Information processing method, apparatus, storage medium, and electronic device
This application claims priority to the Chinese patent application filed on March 4, 2019, with application number 2019101602650 and the invention title "Information processing method, apparatus, storage medium, and electronic device", the entire contents of which are incorporated herein by reference.
Technical Field
The present disclosure relates to the field of information processing, and in particular to an information processing method, apparatus, storage medium, and electronic device.
Background
At present, a user's comment on a merchant's product contains an objective score (for example, 0 to 5 points) and a text review. The overall rating of a merchant is usually derived from the objective scores in users' product comments. Specifically, whether a user comment is valid is first judged by criteria such as whether the length of the comment exceeds a threshold and whether the comment type corresponds to the product; the scores in the valid comments are then averaged to obtain the merchant's overall rating.
However, platforms often offer incentives that encourage users to submit comments in order to increase user activity, and some merchants run campaigns such as "cash back for positive reviews" to raise their own ratings. As a result, some users submit invalid comments just to obtain the reward: the attached pictures are often irrelevant, the text review is often unrelated text copied and pasted at random, and the score given is the full 5 points. There are even cases where a user gives a high score although the text review expresses dissatisfaction with the service.
These situations make a merchant's overall rating inaccurate or even inflated, so that other users cannot effectively rely on the rating to make decisions or to evaluate the merchant.
Summary of the Invention
In view of this, embodiments of the present invention provide an information processing method, apparatus, storage medium, and electronic device to solve the prior-art problem that merchants are evaluated inaccurately on the basis of user comment information.
According to a first aspect of the embodiments of the present invention, an information processing method is provided. The method includes: acquiring user comment information, the comment information including text information and comment level information; analyzing the text information according to a predetermined rule to determine first level information; and determining second level information of the comment information according to the comment level information and the first level information.
According to a second aspect of the embodiments of the present invention, an information processing apparatus is provided. The apparatus includes: an information acquisition unit, configured to acquire user comment information, the comment information including text information and comment level information; a first level information determining unit, configured to analyze the text information according to a predetermined rule to determine first level information; and a second level information determining unit, configured to determine second level information of the comment information according to the comment level information and the first level information.
According to a third aspect of the embodiments of the present invention, a computer-readable storage medium is provided, on which computer program instructions are stored, wherein the computer program instructions, when executed by a processor, implement the method described in the first aspect.
According to a fourth aspect of the embodiments of the present invention, an electronic device is provided, including a memory and a processor, wherein the memory is configured to store one or more computer program instructions, and the one or more computer program instructions are executed by the processor to implement the method described in the first aspect.
The embodiments of the present invention analyze the text information in the acquired user comment information according to a predetermined rule to obtain first level information, and then determine second level information of the comment information according to the comment level information in the acquired user comment information and the first level information. A more accurate merchant evaluation can be obtained from the second level information. Because the embodiments introduce a predetermined rule, they can overcome the prior-art problem that evaluating a merchant solely on the basis of user comment information yields an insufficiently accurate evaluation.
Brief Description of the Drawings
The above and other objects, features, and advantages of the present invention will become clearer from the following description of its embodiments with reference to the accompanying drawings, in which:
Fig. 1 is a flowchart of an information processing method according to an embodiment of the present invention;
Fig. 2 is an example flowchart of an information processing method according to an embodiment of the present invention;
Fig. 3 is a structural block diagram of an information processing apparatus according to an embodiment of the present invention;
Fig. 4 is a detailed structural block diagram of an information processing apparatus according to an embodiment of the present invention;
Fig. 5 is a structural block diagram of the first level information determining unit 302 according to an embodiment of the present invention;
Fig. 6 is a structural block diagram of the second level information determining unit 303 according to an embodiment of the present invention;
Fig. 7 is an application scenario diagram of an information processing apparatus according to an embodiment of the present invention;
Fig. 8 is a schematic diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The present invention is described below on the basis of embodiments, but it is not limited to these embodiments. Some specific details are described at length in the following detailed description. Those skilled in the art can fully understand the present invention even without these details. To avoid obscuring the essence of the present invention, well-known methods, processes, procedures, components, and circuits are not described in detail.
In addition, those of ordinary skill in the art should understand that the drawings provided herein are for illustrative purposes and are not necessarily drawn to scale.
Unless the context clearly requires otherwise, words such as "include" and "comprise" throughout the specification and claims should be interpreted in an inclusive sense rather than an exclusive or exhaustive sense; that is, in the sense of "including but not limited to".
In the description of the present invention, it should be understood that the terms "first", "second", and the like are used for descriptive purposes only and should not be understood as indicating or implying relative importance. In addition, unless otherwise specified, "multiple" means two or more.
An embodiment of the present invention provides an information processing method. Fig. 1 is a flowchart of the method. As shown in Fig. 1, the method includes:
Step 101: acquire user comment information, the comment information including text information and comment level information;
Step 102: analyze the text information according to a predetermined rule to determine first level information;
Step 103: determine second level information of the comment information according to the comment level information and the first level information.
The embodiment of the present invention analyzes the text information in the acquired user comment information according to a predetermined rule and grades the text information according to the analysis result to obtain the first level information. It then determines the second level information of the comment information according to the comment level information in the acquired user comment information and the first level information. A more accurate merchant evaluation can be obtained from the second level information. Because the embodiment introduces a predetermined rule (which is mainly used to analyze text information and is therefore referred to below as the predetermined analysis rule), it can overcome the prior-art problem that evaluating a merchant solely on the basis of user comment information yields an insufficiently accurate evaluation.
In practice, the predetermined analysis rule needs to be constructed first. Specifically: multiple pieces of historical comment information are obtained; high-frequency words are extracted from them according to a predetermined rule (for example, industry characteristics) to construct the predetermined analysis rule; and a level is then set for each of the extracted high-frequency words. The high-frequency words here may include high-frequency nouns, high-frequency adjectives, high-frequency adverbs, and the like.
For example, for the catering industry, multiple pieces of historical comment information are obtained for each restaurant, and each comment is processed by word segmentation and similar steps to obtain a set of words. Nouns that occur frequently are selected as high-frequency nouns, for example "菜量" (portion size), "口感" (taste), "米饭" (rice), "服务" (service), and "态度" (attitude); adjectives that occur frequently are selected as high-frequency adjectives, for example "好" (good), "棒" (great), and "差" (poor); and adverbs that occur frequently are selected as high-frequency adverbs, for example "非常" (very) and "一般" (fairly). The predetermined analysis rule is then constructed from the selected high-frequency words, and a level is set for each of them.
In the embodiments of the present invention, a level may be a grade such as level 1 (very satisfied), level 2 (satisfied), or level 3 (average), or it may be expressed as a score, for example on a scale of 0 to 5 points, where 5 points means very satisfied and 0 points means very dissatisfied.
In step 102, analyzing the text information according to the predetermined analysis rule includes obtaining keywords from the text information according to the predetermined analysis rule. That is, the keywords in the text information that correspond to the high-frequency words in the predetermined analysis rule are obtained.
The obtained keywords are then graded according to the level information of the high-frequency words in the predetermined analysis rule to obtain the first level information.
Then, in step 103, the second level information of the comment information is determined according to the comment level information acquired in step 101 and the first level information obtained in step 102. Specifically, when the difference between the comment level information and the first level information is less than a predetermined range, the comment information is determined to be valid, and the second level information is then determined according to the comment level information and the first level information. When the difference between the comment level information and the first level information is greater than the predetermined range, the comment information is determined to be invalid and is disregarded.
Comparing the comment level information with the first level information shows whether the user comment information is valid. If the two differ considerably, the text information in the user comment information can be considered inconsistent with what the comment level information reflects. For example, in a comment written for a "cash back for positive reviews" campaign, the text information is an irrelevant passage copied and pasted at random, while the comment level information is the full 5 points. For such a comment, the first level information obtained by applying the predetermined analysis rule (possibly 0 or 1 point) differs greatly from the level information in the user comment (5 points), and the first level information obtained through the predetermined analysis rule can be regarded as the more accurate, objective level information. In this way, invalid user comments can be filtered out, improving the accuracy and reliability of merchant evaluation.
When the difference between the comment level information and the first level information is less than the predetermined range (for example, the difference does not exceed 1 point), the user comment information can be considered valid, and the second level information is then determined according to the comment level information and the first level information. Specifically, weights are first set for the comment level information and the first level information respectively; a weighted sum of the weights and the corresponding level information is then computed to determine the second level information.
For example, according to a predetermined rule (for example, an industry rule), the weight of the comment level information is set to a and the weight of the first level information is set to b, and the second level information is determined by the following formula:
second level information = a × comment level information + b × first level information, where a and b are real numbers between 0 and 1, and a + b = 1.
The predetermined rule here may be based on industry characteristics or on industry experience. Preferably, a may be 0.5 and b may be 0.5.
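The validity check and the weighted sum described above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function name `second_level`, the parameter `max_diff`, and the choice to return `None` for an invalid comment are assumptions made for the example. The default weights of 0.5 and the 1-point tolerance are the example values given in the text.

```python
from typing import Optional


def second_level(comment_level: float, first_level: float,
                 a: float = 0.5, b: float = 0.5,
                 max_diff: float = 1.0) -> Optional[float]:
    """Return the second level information, or None if the comment is invalid.

    comment_level: score given by the user (comment level information).
    first_level:   score derived from the text (first level information).
    a, b:          weights in [0, 1] that must sum to 1.
    max_diff:      hypothetical name for the "predetermined range".
    """
    if not (0.0 <= a <= 1.0 and 0.0 <= b <= 1.0) or abs(a + b - 1.0) > 1e-9:
        raise ValueError("a and b must lie in [0, 1] and sum to 1")
    # A large gap between the user's score and the text-derived score
    # marks the comment as invalid, so it is not scored at all.
    if abs(comment_level - first_level) > max_diff:
        return None
    return a * comment_level + b * first_level


print(second_level(5.0, 4.0))   # valid comment, equal weights
print(second_level(5.0, 1.0))   # text and score disagree: None
```

With unequal weights, for instance `a=0.7, b=0.3`, the user's own score dominates the result; the text leaves the choice of weights to industry characteristics or experience.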
Fig. 2 is an example flowchart of an information processing method according to an embodiment of the present invention. As shown in Fig. 2, the example includes the following steps:
Step 201: construct a public opinion analysis dictionary, which may serve as the predetermined analysis rule described above.
The public opinion analysis dictionary is constructed from scenario keywords of each industry and is updated regularly for different scenarios, for example by adding keywords such as new dish names and restaurant names in the catering industry, or industry-specific adjectives such as "香糯" (fragrant and glutinous) and "酥脆" (crispy). This improves the accuracy of word segmentation and the quality of the subsequent analysis.
Specifically, multiple pieces of historical user comment information are obtained, and the text part of each comment is converted into a word list through word segmentation and part-of-speech tagging. The frequencies of these words are tallied, and the nouns, adjectives, adverbs, and so on that occur most frequently (for example, the top 20 by frequency) are selected, scored on a scale of 0 to 5 points, and added to the dictionary to form the public opinion analysis dictionary.
For example, the following piece of historical comment information is obtained: "米饭口感超级好，菜量很足，总之非常满意！" ("The rice tastes super good, the portions are generous, and overall I am very satisfied!")
The word segmentation result is: 米饭/口感/超级/好/，/菜量/很足/，/总之/非常/满意/！ (rice/taste/super/good/,/portion size/very generous/,/overall/very/satisfied/!)
The part-of-speech tagging is: (米饭, n) (口感, n) (超级, b) (好, a) (，, x) (菜量, n) (很足, a) (，, x) (总之, c) (非常, d) (满意, v) (！, x), where n denotes a noun; b denotes a distinguishing word (for example, "女" in "女司机" (female driver) and "总" in "总公司" (head office) are both tagged b); a denotes an adjective; x denotes a punctuation mark; c denotes a conjunction (for example, "与" (and)); d denotes an adverb (for example, "进一步" in "进一步发展" (further development) is tagged d); and v denotes a verb.
The high-frequency nouns here are "米饭" (rice), "口感" (taste), and "菜量" (portion size), which can be added to the segmentation dictionary to help improve segmentation accuracy; an adjective such as "很足" (very generous) is assigned a level of 4 points and added to the public opinion dictionary.
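A minimal sketch of the dictionary-building step, assuming the comments have already been segmented and tagged by some external tool (the text does not name one). The function name `top_words_by_pos`, the tag set, and the sample data are illustrative assumptions; the levels are assigned by hand, which is how the text describes the 0 to 5 scoring.

```python
from collections import Counter

def top_words_by_pos(tagged_comments, pos_tags=("n", "a", "d"), top_n=20):
    """Count words per part-of-speech tag and keep the top_n of each tag.

    tagged_comments: list of comments, each a list of (word, tag) pairs.
    """
    counts = {tag: Counter() for tag in pos_tags}
    for comment in tagged_comments:
        for word, tag in comment:
            if tag in counts:
                counts[tag][word] += 1
    return {tag: [w for w, _ in c.most_common(top_n)]
            for tag, c in counts.items()}

# The tagged comment from the example above.
tagged = [[
    ("米饭", "n"), ("口感", "n"), ("超级", "b"), ("好", "a"), ("，", "x"),
    ("菜量", "n"), ("很足", "a"), ("，", "x"), ("总之", "c"),
    ("非常", "d"), ("满意", "v"), ("！", "x"),
]]

high_freq = top_words_by_pos(tagged)
print(high_freq["n"])  # candidate nouns for the segmentation dictionary

# Levels are assigned manually; 4 points for "很足" is the value in the text.
opinion_dict = {"很足": 4.0}
```

Only the tags listed in `pos_tags` are counted, so punctuation (x) and conjunctions (c) never enter the candidate lists.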
Step 202: acquire and parse the user comment information.
Specifically, the user comment information is acquired, and the text part and the level part (0 to 5 points) of the comment are then extracted separately to facilitate subsequent processing and statistics.
Step 203: analyze the user comment information and perform level analysis on the text part.
For example, the acquired user comment information is: "米饭口感超级好，菜量很足，总之非常满意！" ("The rice tastes super good, the portions are generous, and overall I am very satisfied!")
In the first clause, "米饭口感超级好", the word "好" (good) has a level of 3.5 points in the public opinion dictionary, and "超级好" (super good), as modified by "超级" (super), has a level of 5 points, so the first clause has a level of 5 points;
In the second clause, "菜量很足", the word "很足" (very generous) has a level of 4 points, so the second clause has a level of 4 points;
In the third clause, "总之非常满意", the word "满意" (satisfied) has a level of 4.5 points, which becomes 5 points when modified by "非常" (very), so the third clause has a level of 5 points.
The three clauses are parallel, so the overall level of the text part is the average of the three clause levels, 4.67 points.
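The clause-by-clause calculation above can be reproduced with a short sketch. The text does not say how a modifier changes a word's level, so the rule used here (an intensifier adds a fixed boost, capped at 5 points) is an assumption chosen only because it matches the three example clauses; `BOOST`, `clause_level`, and `text_level` are hypothetical names.

```python
OPINION_LEVELS = {"好": 3.5, "很足": 4.0, "满意": 4.5}  # levels from the text
INTENSIFIERS = {"超级", "非常"}
BOOST = 1.5       # assumed boost applied by an intensifier
MAX_LEVEL = 5.0   # top of the 0 to 5 scale

def clause_level(tokens):
    """Level of the first opinion word in a clause, boosted if an
    intensifier precedes it; None if the clause has no opinion word."""
    boosted = False
    for tok in tokens:
        if tok in INTENSIFIERS:
            boosted = True
        elif tok in OPINION_LEVELS:
            level = OPINION_LEVELS[tok]
            return min(MAX_LEVEL, level + BOOST) if boosted else level
    return None

def text_level(clauses):
    """Average the levels of the clauses that contain an opinion word."""
    levels = [lv for lv in map(clause_level, clauses) if lv is not None]
    return sum(levels) / len(levels) if levels else None

clauses = [["米饭", "口感", "超级", "好"], ["菜量", "很足"], ["总之", "非常", "满意"]]
print([clause_level(c) for c in clauses])   # [5.0, 4.0, 5.0]
print(round(text_level(clauses), 2))        # 4.67
```

Clauses without any dictionary word are skipped rather than counted as zero; whether that is the intended behavior is not stated in the text.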
Step 204: calculate the comprehensive level of the user review information (that is, the second-level information described above). Specifically, the comprehensive level is calculated from the difference between the level score obtained in step 203 from the public opinion dictionary (the public opinion score) and the level score contained in the user review information obtained in step 202 (the user score).
When |public opinion score − user score| > threshold, the user review is treated as invalid and discarded. This can happen when the user's rating does not match the text: for example, the text expresses dissatisfaction with the merchant, but the rating actually given is relatively high. The threshold can be set according to the actual situation.
When |public opinion score − user score| ≤ threshold, the user review is treated as valid, and the average of the two scores is taken as the comprehensive score for the merchant.
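The threshold test of step 204 can be illustrated with a short Python sketch. The function name `comprehensive_score` and the default threshold of 1.5 are assumptions made for illustration; the embodiment only states that the threshold depends on the actual situation.

```python
def comprehensive_score(opinion_score, user_score, threshold=1.5):
    """Combine the dictionary-based score with the user's own rating.

    Returns None when the two scores differ by more than the threshold
    (the review is discarded), otherwise the average of the two scores.
    The default threshold is a hypothetical value.
    """
    if abs(opinion_score - user_score) > threshold:
        return None  # text and rating disagree: invalid review
    return (opinion_score + user_score) / 2


print(comprehensive_score(3.5, 5.0))  # 4.25 (difference equals the threshold, still valid)
print(comprehensive_score(1.0, 5.0))  # None (difference exceeds the threshold)
```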
By using the public opinion dictionary to screen user reviews and recalculating the level score, the accuracy and reliability of merchant scores can be effectively improved.
An embodiment of the present invention further provides an information processing apparatus. FIG. 3 is a structural block diagram of the apparatus. As shown in FIG. 3, the apparatus includes an information acquisition unit 301, a first-level information determining unit 302, and a second-level information determining unit 303, wherein:
the information acquisition unit 301 is configured to acquire user review information, the review information including text information and review level information;
the first-level information determining unit 302 is configured to analyze the text information according to a predetermined rule to determine first-level information;
the second-level information determining unit 303 is configured to determine second-level information of the review information according to the review level information and the first-level information.
In this embodiment, the first-level information determining unit 302 analyzes, according to the predetermined rule, the text information in the user review information acquired by the information acquisition unit 301 to determine the first-level information. The second-level information determining unit 303 then determines the second-level information of the review information according to the review level information and the first-level information, and a more accurate merchant evaluation can be obtained from the second-level information. Because the embodiment introduces a predetermined rule (hereinafter also referred to as a predetermined analysis rule), it can overcome the problem in the prior art that evaluating a merchant based only on user review information is not accurate enough.
In actual operation, as shown in FIG. 4, the apparatus further includes a valid information judging unit 304 and a valid information determining unit 305, wherein:
the valid information judging unit 304 is configured to judge whether the review information is valid according to the review level information and the first-level information;
the valid information determining unit 305 is configured to determine that the review information is valid in response to the difference between the review level information and the first-level information being less than a predetermined range, and otherwise to determine that the review information is invalid.
The valid information judging unit 304 can judge whether the review information is valid from the difference between the review level information and the first-level information. When the difference is less than the predetermined range, the valid information determining unit 305 determines that the review information is valid. When the difference is large, the text information in the user review can be considered inconsistent with what the review level reflects, so the valid information determining unit 305 determines that the review information is invalid, and the review is not taken into account. In this way, invalid user reviews can be filtered out, improving the accuracy and reliability of merchant evaluation.
As shown in FIG. 5, the first-level information determining unit 302 includes a text information analysis module 3021 and a first-level information determining module 3022, wherein:
the text information analysis module 3021 is configured to analyze the text information according to the predetermined analysis rule;
the first-level information determining module 3022 is configured to classify the text information into levels according to the analysis result to determine the first-level information.
Specifically, the text information analysis module 3021 acquires keywords in the text information according to the predetermined analysis rule, and the first-level information determining module 3022 then classifies the acquired keywords into levels according to the predetermined analysis rule to determine the first-level information.
As shown in FIG. 6, the second-level information determining unit 303 includes a weight setting module 3031 and a second-level information determining module 3032, wherein:
the weight setting module 3031 is configured to set a weight for the review level information and a weight for the first-level information;
the second-level information determining module 3032 is configured to perform a weighted summation of the weights and the corresponding level information to determine the second-level information.
Specifically, the weight setting module 3031 may, according to a predetermined rule (for example, an industry rule), set the weight of the review level information to a and the weight of the first-level information to b. The second-level information determining module 3032 may then determine the second-level information by the following formula: second-level information = a × review level information + b × first-level information, where a and b are real numbers between 0 and 1 and a + b = 1.
The predetermined rule may be determined according to industry characteristics or industry experience. Preferably, a may be 0.5 and b may be 0.5.
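The weighted summation can be written as a small Python function. This is a sketch for illustration only; the function name `second_level` and the input validation are assumptions, while the constraints on a and b and the default of 0.5 each come from the description above.

```python
import math


def second_level(review_level, first_level, a=0.5, b=0.5):
    """Return a * review_level + b * first_level, with a, b in [0, 1] and a + b = 1."""
    if not (0.0 <= a <= 1.0 and 0.0 <= b <= 1.0):
        raise ValueError("a and b must each lie between 0 and 1")
    if not math.isclose(a + b, 1.0):
        raise ValueError("a and b must sum to 1")
    return a * review_level + b * first_level


print(second_level(5.0, 4.0))                  # 4.5 with the preferred weights
print(second_level(5.0, 4.0, a=0.75, b=0.25))  # 4.75 when the user's rating is weighted more
```

With a = b = 0.5 the weighted sum reduces to the plain average of the two levels.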
FIG. 7 is an application scenario diagram of the information processing apparatus according to an embodiment of the present invention. As shown in FIG. 7, the information acquisition unit 301 acquires multiple user reviews of a merchant (review 1, review 2, ..., review N). The first-level information determining unit 302 analyzes the text information in each user review according to the predetermined analysis rule to determine its first-level information. The valid information judging unit 304 judges whether each review is valid according to its review level information and its first-level information. When the difference between the review level information and the first-level information is less than the predetermined range, the valid information determining unit 305 determines that the review is valid (shown in the figure as review a, ..., review f); otherwise the review is invalid (not shown in the figure). The second-level information determining unit 303 then determines the second-level information of each valid review according to its review level information and first-level information, and a more accurate merchant evaluation can be obtained from the second-level information of the multiple valid reviews.
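The flow of FIG. 7 can be sketched end to end in Python. The sketch makes several assumptions that the description does not fix: the `Review` data class, the `max_diff` value of 1.5, and the use of the arithmetic mean of the second-level information as the merchant evaluation are all hypothetical choices for illustration.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Review:
    review_level: float  # level given directly by the user
    first_level: float   # level derived from the review text


def merchant_score(reviews, max_diff=1.5, a=0.5, b=0.5):
    """Filter out invalid reviews, then aggregate second-level information.

    A review is valid when its two levels differ by less than max_diff.
    Aggregating by the mean is an assumption; returns None if nothing is valid.
    """
    valid = [r for r in reviews if abs(r.review_level - r.first_level) < max_diff]
    if not valid:
        return None
    return mean(a * r.review_level + b * r.first_level for r in valid)


reviews = [
    Review(review_level=5.0, first_level=4.5),  # valid
    Review(review_level=5.0, first_level=1.0),  # invalid: rating contradicts text
    Review(review_level=3.0, first_level=3.0),  # valid
]
print(merchant_score(reviews))  # 3.875
```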
FIG. 8 is a schematic diagram of an electronic device according to an embodiment of the present invention. The electronic device shown in FIG. 8 is a general-purpose data processing apparatus with a general-purpose computer hardware structure that includes at least a processor 801 and a memory 802. The processor 801 and the memory 802 are connected by a bus 803. The memory 802 is adapted to store one or more instructions or programs executable by the processor 801. The one or more instructions or programs are executed by the processor 801 to implement the following steps:
acquiring user review information, the review information including text information and review level information;
analyzing the text information according to a predetermined rule to determine first-level information;
determining second-level information of the review information according to the review level information and the first-level information.
Analyzing the text information according to the predetermined rule to determine the first-level information includes: analyzing the text information according to the predetermined rule; and classifying the text information into levels according to the analysis result to determine the first-level information.
Analyzing the text information according to the predetermined rule includes: acquiring keywords in the text information according to the predetermined rule.
Classifying the text information into levels according to the analysis result to determine the first-level information includes: classifying the acquired keywords into levels according to the predetermined rule to determine the first-level information.
Determining the second-level information according to the review level information and the first-level information includes: setting a weight for the review level information and a weight for the first-level information; and performing a weighted summation of the weights and the corresponding level information to determine the second-level information.
Before the second-level information of the review information is determined, the steps further include: judging whether the review information is valid according to the review level information and the first-level information; and, in response to the difference between the review level information and the first-level information being less than a predetermined range, determining that the review information is valid, and otherwise determining that the review information is invalid.
The processor 801 may be a single microprocessor or a set of one or more microprocessors. The processor 801 executes the instructions stored in the memory 802 and thereby carries out the method flow of the embodiments of the present invention described above, processing data and controlling other devices. The bus 803 connects the above components together and also connects them to a display controller 804, a display device, and an input/output (I/O) device 805. The input/output (I/O) device 805 may be a mouse, a keyboard, a modem, a network interface, a touch input device, a motion-sensing input device, a printer, or another device known in the art. Typically, the input/output (I/O) device 805 is connected to the system through an input/output (I/O) controller 806.
The memory 802 may store software components such as an operating system, a communication module, an interaction module, and application programs. Each of these modules and application programs corresponds to a set of executable program instructions that performs one or more functions and the methods described in the embodiments of the invention.
In summary, merchants are currently evaluated directly from the level information in user reviews. Because user reviews include meaningless reviews and reviews whose text does not match the level given, evaluating a merchant directly from user reviews is not accurate enough. The embodiments of the present invention analyze the text portion of a user review according to a predetermined analysis rule to obtain analyzed level information, and consider it together with the level information in the user review when evaluating the merchant. Meaningless reviews and reviews that do not match their level can thus be excluded, and a more accurate merchant evaluation can be obtained.
The flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to the embodiments of the present invention describe various aspects of the invention. It should be understood that each block of the flowcharts and/or block diagrams, and combinations of blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device to produce a machine, such that the instructions (executed via the processor of the computer or other programmable data processing device) create means for implementing the functions/actions specified in the flowchart and/or block diagram block or blocks.
As those skilled in the art will appreciate, aspects of the embodiments of the present invention may be implemented as a system, a method, or a computer program product. Accordingly, aspects of the embodiments may take the form of an entirely hardware implementation, an entirely software implementation (including firmware, resident software, microcode, and the like), or an implementation combining software and hardware aspects, all of which may generally be referred to herein as a "circuit", "module", or "system". Furthermore, aspects of the present invention may take the form of a computer program product embodied in one or more computer-readable media having computer-readable program code embodied thereon.
Any combination of one or more computer-readable media may be used. A computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium may be, for example (but not limited to), an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of computer-readable storage media include: an electrical connection having one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of the embodiments of the present invention, a computer-readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer-readable signal medium may include a propagated data signal with computer-readable program code embodied therein, for example in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including but not limited to electromagnetic, optical, or any suitable combination thereof. A computer-readable signal medium may be any computer-readable medium that is not a computer-readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including object-oriented programming languages such as Java, Smalltalk, C++, PHP, and Python, and conventional procedural programming languages such as the "C" programming language or similar languages. The program code may execute entirely on the user's computer as a stand-alone software package, partly on the user's computer, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. In the latter case, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The above are only preferred embodiments of the present invention and are not intended to limit it. Those skilled in the art may make various modifications and changes to the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (14)

  1. An information processing method, characterized in that the method comprises:
    acquiring user review information, the review information including text information and review level information;
    analyzing the text information according to a predetermined rule to determine first-level information;
    determining second-level information of the review information according to the review level information and the first-level information.
  2. The information processing method according to claim 1, characterized in that analyzing the text information according to the predetermined rule to determine the first-level information comprises:
    analyzing the text information according to the predetermined rule;
    classifying the text information into levels according to the analysis result to determine the first-level information.
  3. The information processing method according to claim 2, characterized in that analyzing the text information according to the predetermined rule comprises:
    acquiring keywords in the text information according to the predetermined rule.
  4. The information processing method according to claim 3, characterized in that classifying the text information into levels according to the analysis result to determine the first-level information comprises:
    classifying the acquired keywords into levels according to the predetermined rule to determine the first-level information.
  5. The information processing method according to claim 1, characterized in that determining the second-level information according to the review level information and the first-level information comprises:
    setting a weight for the review level information and a weight for the first-level information;
    performing a weighted summation of the weights and the corresponding level information to determine the second-level information.
  6. The information processing method according to claim 1, characterized in that, before the second-level information of the review information is determined, the method further comprises:
    judging whether the review information is valid according to the review level information and the first-level information;
    in response to the difference between the review level information and the first-level information being less than a predetermined range, determining that the review information is valid, and otherwise determining that the review information is invalid.
  7. An information processing apparatus, characterized in that the apparatus comprises:
    an information acquisition unit, configured to acquire user review information, the review information including text information and review level information;
    a first-level information determining unit, configured to analyze the text information according to a predetermined rule to determine first-level information;
    a second-level information determining unit, configured to determine second-level information of the review information according to the review level information and the first-level information.
  8. The information processing apparatus according to claim 7, characterized in that the first-level information determining unit comprises:
    a text information analysis module, configured to analyze the text information according to the predetermined rule;
    a first-level information determining module, configured to classify the text information into levels according to the analysis result to determine the first-level information.
  9. The information processing apparatus according to claim 8, characterized in that the text information analysis module is specifically configured to:
    acquire keywords in the text information according to the predetermined rule.
  10. The information processing apparatus according to claim 9, characterized in that the first-level information determining module is specifically configured to:
    classify the acquired keywords into levels according to the predetermined rule to determine the first-level information.
  11. The information processing apparatus according to claim 7, characterized in that the second-level information determining unit comprises:
    a weight setting module, configured to set a weight for the review level information and a weight for the first-level information;
    a second-level information determining module, configured to perform a weighted summation of the weights and the corresponding level information to determine the second-level information.
  12. The information processing apparatus according to claim 7, characterized in that the apparatus further comprises:
    a valid information judging unit, configured to judge whether the review information is valid according to the review level information and the first-level information;
    a valid information determining unit, configured to determine that the review information is valid in response to the difference between the review level information and the first-level information being less than a predetermined range, and otherwise to determine that the review information is invalid.
  13. A computer-readable storage medium storing computer program instructions, characterized in that the computer program instructions, when executed by a processor, implement the method according to any one of claims 1-6.
  14. An electronic device comprising a memory and a processor, characterized in that the memory is configured to store one or more computer program instructions, wherein the one or more computer program instructions are executed by the processor to implement the method according to any one of claims 1-6.
PCT/CN2019/129063 2019-03-04 2019-12-27 Information processing method and apparatus, storage medium, and electronic device WO2020177463A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910160265.0A CN109902304A (en) 2019-03-04 2019-03-04 Information processing method, device, storage medium and electronic equipment
CN201910160265.0 2019-03-04

Publications (1)

Publication Number Publication Date
WO2020177463A1 true WO2020177463A1 (en) 2020-09-10

Family

ID=66946211

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/129063 WO2020177463A1 (en) 2019-03-04 2019-12-27 Information processing method and apparatus, storage medium, and electronic device

Country Status (2)

Country Link
CN (1) CN109902304A (en)
WO (1) WO2020177463A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902304A (en) * 2019-03-04 2019-06-18 拉扎斯网络科技(上海)有限公司 Information processing method, device, storage medium and electronic equipment
CN111079428B (en) * 2019-12-27 2023-09-19 北京羽扇智信息科技有限公司 Word segmentation and industry dictionary construction method and device and readable storage medium
CN113836410B (en) * 2021-09-22 2024-03-15 中国第一汽车股份有限公司 Vehicle sound quality evaluation method, device, evaluation equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105630793A (en) * 2014-10-28 2016-06-01 阿里巴巴集团控股有限公司 Information weight determination method and device
US20170221111A1 (en) * 2016-01-28 2017-08-03 Institut Mines-Telecom Method for detecting spam reviews written on websites
CN107977798A (en) * 2017-12-21 2018-05-01 中国计量大学 A kind of risk evaluating method of e-commerce product quality
CN108595562A (en) * 2018-04-12 2018-09-28 西安邮电大学 User's evaluation data analysing method based on accurate sex determination
CN108665339A (en) * 2018-03-27 2018-10-16 北京航空航天大学 A kind of electric business product reliability index and its implementation estimated based on subjective emotion
CN109902304A (en) * 2019-03-04 2019-06-18 拉扎斯网络科技(上海)有限公司 Information processing method, device, storage medium and electronic equipment

Also Published As

Publication number Publication date
CN109902304A (en) 2019-06-18

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19917730

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19917730

Country of ref document: EP

Kind code of ref document: A1
