WO2018205178A1 - Text exploration and measurement system and method - Google Patents

Publication number
WO2018205178A1
WO2018205178A1 PCT/CN2017/083848 CN2017083848W WO2018205178A1 WO 2018205178 A1 WO2018205178 A1 WO 2018205178A1 CN 2017083848 W CN2017083848 W CN 2017083848W WO 2018205178 A1 WO2018205178 A1 WO 2018205178A1
Authority
WO
WIPO (PCT)
Prior art keywords
component
data
data set
weighting
measurement system
Prior art date
Application number
PCT/CN2017/083848
Other languages
French (fr)
Chinese (zh)
Inventor
曹修源
苏辛词
Original Assignee
曹修源
苏辛词
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 曹修源, 苏辛词
Priority to PCT/CN2017/083848
Publication of WO2018205178A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • The present invention relates to a text exploration measurement system and method, and more particularly to a network text exploration measurement system.
  • The invention builds a multi-faceted text data set to analyze the score of the meaning that a specific word in network text carries in its sentence and, through a classification system of feature words and weighted words, distinguishes whether a word expresses a target or a perception or attitude. This improves the validity of survey results, truly reflects consumers' opinions, and allows those opinions to be reflected in real time.
  • The present invention provides a text exploration measurement system comprising: a first data set having at least one data component to be compared; a second data set comprising a specific topic subset of at least one specific topic set, and a weighting component, wherein the specific topic subset includes a feature component corresponding to the content of the first data set; and an analysis server, informationally connected to the first data set and the second data set, which executes a comparison step between the first data set and the second data set and performs a weighting operation according to the result of the feature component and the weighting component corresponding to the at least one data component, to obtain a measurement reference value for the at least one data component with respect to the content of the at least one specific topic set and/or the specific topic subset.
  • The at least one data component comprises a result obtained by an automation system.
  • The automation system further includes a segmentation system, informationally connected to the at least one data component, which divides the at least one data component into at least one block to obtain a target word of the at least one block.
  • The at least one specific topic set and/or the specific topic subset has its corresponding weighting component.
  • The system further comprises a third data set, which includes the result of the weighting operation and the measurement reference value for the content of the at least one specific topic set and/or the specific topic subset.
  • The second data set is preset, or a user can set the specific topic set, the specific topic subset, the feature component, and/or the weighting component through a usage interface.
  • The weighting component has a measurement reference value ranging between -5 and +5.
  • The feature component is selected from keywords obtained from academic journals, papers, questionnaires, market survey reports, interviews, and machine learning algorithms.
  • The present invention also provides a method for verifying the validity of a text exploration measurement system, comprising a step of statistically verifying the result obtained by the text exploration measurement system against a result to be compared.
  • The statistical verification is selected from one or more statistical methods including the paired-sample t-test.
  • The present invention further provides a text exploration measurement method comprising: step one, obtaining, through an automation system, a first data set to be compared, wherein the first data set includes at least one data component; step two, establishing a second data set, wherein the second data set comprises a specific topic subset of at least one specific topic set, and a weighting component, and the specific topic subset includes a feature component corresponding to the content of the first data set; step three, executing, by an analysis server, a comparison step between the first data set and the second data set; and step four, performing, by the analysis server, a weighting operation according to the result of the feature component and the weighting component corresponding to the data component, to obtain a measurement reference value for the at least one data component with respect to the content of the at least one specific topic set and/or the specific topic subset.
  • The at least one data component comprises a result obtained by an automation system.
  • The at least one specific topic set and/or the specific topic subset has its corresponding weighting component.
  • The comparison step comprises: a first comparison step of comparing the data component with the feature component; and a second comparison step of comparing the result of the first comparison step with the weighting component.
  • The method further includes a step in which a segmentation system divides the at least one data component into at least one block to obtain a target word of the at least one block.
  • The method further includes a step in which a user sets the specific topic set, the specific topic subset, the feature component, and/or the weighting component through a usage interface.
  • The method further includes a step of establishing a third data set comprising the result of the weighting operation and the measurement reference value for the content of the at least one specific topic set and/or the specific topic subset.
  • The method further includes a step of applying the foregoing verification method to the result of the third data set, adjusting the content of the feature component according to the result of the statistical verification, and performing the text exploration measurement method again until the result of the statistical verification is statistically significant.
  • FIG. 1 is a schematic diagram of an embodiment of the text exploration measurement system of the present invention.
  • FIG. 2 is a schematic diagram of another embodiment of the text exploration measurement system of the present invention.
  • FIG. 3 is a schematic diagram of an embodiment of a facet subset of a specific topic of the present invention.
  • FIG. 4 is a schematic diagram of another embodiment of a facet subset of a specific topic of the present invention.
  • FIG. 5 is a schematic diagram of an embodiment showing the weighting values of the weighting component of the present invention.
  • FIG. 6 is a schematic diagram of an embodiment showing the measurement reference value obtained for the data sentence content to be compared.
  • FIG. 7 is a schematic diagram of the text exploration measurement method of the present invention.
  • FIG. 1 is a schematic diagram of an embodiment of the text exploration measurement system 1.
  • In this embodiment, the text exploration measurement system 1 includes an automation system 110, a second data set 120, an analysis server 130, and a usage interface 140.
  • In one embodiment, the automation system 110 includes an exploration program 113 and/or a segmentation system 114, which may be a crawler program and/or a word segmentation system; it collects text data from major social network platforms, including Facebook, YouTube, PTT, and Twitter, to obtain a first data set 111 to be compared.
  • The second data set 120 includes a multi-level text data set 121 of a specific topic, a facet subset 122 of the specific topic, and a feature text component 123 and a weighted text component 124 used for comparison.
  • The analysis server 130 is informationally connected to the data set 111 to be compared and to the second data set 120. It compares the contents of these data sets and performs a weighting operation according to the result of the feature text component 123 and the weighted text component 124 corresponding to the data sentence content 112 to be compared, assigning the target words in the data sentence content 112 a weighting value of -5 to +5 points, to obtain a measurement reference value of the data to be compared with respect to a specific topic or the different facets of a specific topic.
  • In another embodiment, the segmentation system 114 is informationally connected to at least one data sentence content 112 to be compared and divides it into a plurality of blocks to obtain the target words in those blocks.
  • In one embodiment, the feature text component 123 is selected from text components validated in journals and papers from various countries (e.g., the 42 brand personality traits of Aaker, 1997) and the translations of those text components into various languages, or from text components obtained after quantitative (questionnaire) and qualitative (expert interview) analysis of the relevant words provided by subjects in such journals and papers.
  • In one embodiment, the multi-level text data set 121 of a specific topic includes the facet subset 122 of the specific topic; in one embodiment, the second data set 120 contains Chinese and English text.
  • In one embodiment, the second data set 120 includes a language dictionary database, a slang database, and a self-built language database.
  • In one embodiment, the analysis server 130 may apply a machine learning algorithm to assign the weighting values.
  • In another embodiment, the multi-level text data set 121 of the specific topic is preset, or the user can set or build the multi-level text data set 121 of the specific topic, its facet subset 122, the feature text component 123, or the weighted text component 124 through the usage interface 140.
  • FIG. 2 is a schematic diagram of another embodiment, the text exploration measurement system 2 of the present invention.
  • In one embodiment, the second data set 220 includes a specific topic set 221 and a weighted text component 223 corresponding to the specific topic set 221; the specific topic set 221 includes a facet subset 222 of the specific topic and a weighted text component 224 corresponding to the facet subset 222.
  • The feature text component 225 is built according to the different facets of different cultures and industries; in one embodiment, users can build the feature text component 225 themselves.
  • In another embodiment, the text exploration measurement system further includes a result data set 240 of the comparison executed by the analysis server 230, which contains the result of the weighting operation and the measurement reference value of the data sentence content 212 to be compared with respect to a specific topic or its different facets.
  • FIG. 3 is a schematic diagram of an embodiment of a facet subset of a specific topic set of the present invention.
  • A specific topic set can be subdivided into several facet subsets. For example, the facet items for marketing tactical benefits include Action, Awareness, Desire, Excited, and Happy, and each facet item has corresponding feature text components under it.
  • FIG. 4 is a schematic diagram of another embodiment of a facet subset of a specific topic of the present invention.
  • The facet subsets and feature text components are built in three steps: extracting keywords from journals, papers, and questionnaires; obtaining topic-related keywords from focus interviews; and obtaining keywords or popular online terms through machine learning algorithms.
  • FIG. 5 is a schematic diagram of an embodiment of the weighting values of the weighting component of the present invention. As shown, different weighting components represent different weighting values.
  • Each target word in the data sentence content to be compared is given a weighting value of -5 to +5 points, with the polarity subdivided proportionally into a range in the manner of a Likert scale.
  • FIG. 6 is a schematic diagram of an embodiment of the measurement reference value of the data sentence content to be compared.
  • The measurement reference value may represent the score of the data sentence content to be compared with respect to a specific topic or the different facets of a specific topic.
  • The text exploration measurement method includes: step S101, obtaining, by a crawler program, a data set to be compared that includes at least one data sentence content to be compared; step S102, establishing a text database that includes a multi-level text data set of a specific topic, a facet subset of the specific topic, a feature text component, and a weighted text component; step S103, dividing the at least one data sentence content to be compared into one or more blocks with a Chinese-English word segmentation system to obtain the target word of at least one block; step S104, comparing, by the analysis server, the data set to be compared with the multi-level text data set of the specific topic; and step S105, performing, by the analysis server, a weighting operation according to the result of the feature component and the weighting component corresponding to the at least one data sentence content to be compared, to obtain a measurement reference value of the data sentence content to be compared with respect to a specific topic and/or the different facets of the specific topic.
  • Step S104 further includes step S1041, comparing the data sentence content to be compared with the feature text component, and step S1042, comparing the result of step S1041 with the weighted text component.
  • The data sentence content to be compared includes the results of the crawler program and/or the word segmentation system.
  • The multi-level text data set of a specific topic and the facet subset of a specific topic each have their corresponding weighted text components.
  • The text exploration measurement method further includes a step of setting the text data set of a specific topic, the facet subset of a specific topic, the feature text component, or the weighted text component through a usage interface.
  • The text exploration measurement method further comprises a step of establishing a comparison result data set comprising the result of the weighting operation and the measurement reference value of the data sentence content to be compared with respect to a specific topic and/or the different facets of the specific topic.
  • The text exploration measurement method further comprises a step of statistically verifying the comparison result data set, correcting the content of the feature text component according to the result of the statistical verification, and performing the text exploration measurement method and the statistical verification again until the result of the statistical verification is statistically significant.
  • The present invention further provides a method of verifying the validity of a text exploration measurement system.
  • The method for verifying a text exploration measurement system includes a step of comparing the result of the weighting operation of the text exploration measurement system, and the measurement reference value of the data sentence content to be compared with respect to a specific topic and/or the different facets of the specific topic, with the corresponding general questionnaire results, and performing statistical verification, such as a paired-sample t-test, to confirm validity.
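The proportional subdivision of the -5 to +5 weighting range in the manner of a Likert scale, mentioned for FIG. 5 in the list above, can be read in more than one way. The sketch below shows one possible reading, in which the range is cut into equal-width bins and each weighting value is reported as a level from 1 to 5. The function name `to_likert`, the five-level default, and the equal-width binning are assumptions made for illustration and are not taken from the source.

```python
import math

def to_likert(weight, levels=5, low=-5.0, high=5.0):
    """Map a weighting value in [low, high] to a level 1..levels.

    Assumption: equal-width bins; the source only says the range is
    subdivided proportionally, not how.
    """
    if not low <= weight <= high:
        raise ValueError(f"weight {weight} is outside [{low}, {high}]")
    position = (weight - low) / (high - low)  # 0.0 at low, 1.0 at high
    # The top edge (position == 1.0) would land in bin levels + 1, so clamp it.
    return min(levels, 1 + math.floor(position * levels))
```

With the defaults, each bin is two points wide, so strongly negative words land on level 1 and strongly positive words on level 5.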

Abstract

A text exploration and measurement system (1) comprising: an automation system (110) that may obtain a first data set (111) to be compared, wherein the first data set (111) has at least one data component (112); a second data set (120), comprising a specific topic subset (122) of at least one specific topic set (121) and a weighting component (124), wherein the specific topic subset (122) has a feature component (123) corresponding to the content of the first data set (111); an analysis server (130) that is informationally connected to the first data set (111) and the second data set (120), carries out a comparison operation between the first data set (111) and the second data set (120), and performs a weighting operation on the basis of the results of the feature component (123) and the weighting component (124) corresponding to the at least one data component (112) so as to obtain a measurement reference value for the content of the at least one data component (112) in the at least one specific topic set (121) and/or the specific topic subset (122).

Description

Text exploration measurement system and method
[Technical Field]
The present invention relates to a text exploration measurement system and method, and more particularly to a network text exploration measurement system.
[Background Art]
At present, traditional market research companies cannot monitor the effectiveness of corporate advertising immediately. According to statistics, a typical two-month advertising cycle is expected to contain 3 to 4 advertisements, and a single advertisement affects consumers' perception of and attitude toward its content within 2 weeks. However, by the time traditional market research has completed its evaluation, after waiting through the broadcast period, distributing and collecting questionnaires, and performing statistical analysis, the advertising cycle has long passed. The results can only be evaluated after the fact, and the influence of advertising content on consumer opinion cannot be understood in real time.
As online word of mouth and online media lead market trends, research and analysis of big data from online communities is also regarded as a source of consumer opinion. In the past, however, advertising was mostly evaluated by the number of website visitors and their dwell time, and recently the target has mostly shifted to the number of views; these evaluation criteria and methods still suffer from the problems described above.
To address these shortcomings, some vendors have developed various automatic analysis systems for online reviews. However, such systems and methods merely assign a weighted score to each word in a review and then combine all the scores into a rough overall result. They cannot determine whether those words are representative of the opinion expressed in the review, nor can they distinguish content that belongs to opinions on different facets, so the resulting evaluation does not truly present consumers' perceptions and attitudes.
Because conventional text exploration measurement systems and methods still leave much room for improvement, the applicant developed the present invention after careful study, so that network text exploration measurement systems and methods can be more complete, more accurate, and easier to operate, and can closely reflect the real state and response of the market.
[Summary of the Invention]
The invention builds a multi-faceted text data set to analyze the score of the meaning that a specific word in network text carries in its sentence and, through a classification system of feature words and weighted words, distinguishes whether a word expresses a target or a perception or attitude. This improves the validity of survey results, truly reflects consumers' opinions, and allows those opinions to be reflected in real time.
In one aspect, the present invention provides a text exploration measurement system comprising: a first data set having at least one data component to be compared; a second data set comprising a specific topic subset of at least one specific topic set, and a weighting component, wherein the specific topic subset includes a feature component corresponding to the content of the first data set; and an analysis server, informationally connected to the first data set and the second data set, which executes a comparison step between the first data set and the second data set and performs a weighting operation according to the result of the feature component and the weighting component corresponding to the at least one data component, to obtain a measurement reference value for the at least one data component with respect to the content of the at least one specific topic set and/or the specific topic subset.
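The relationship between the claimed elements can be easier to follow as a small data model. The sketch below is a minimal illustration, assuming each data component is already a list of words and that the weighting operation is a plain sum of word weights; the source does not state how weights are aggregated. The names `TopicSubset`, `SecondDataSet`, and `analyze` are hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class TopicSubset:
    """A specific topic subset together with its feature component."""
    name: str
    feature_component: set = field(default_factory=set)

@dataclass
class SecondDataSet:
    """A specific topic set, its subsets, and a weighting component."""
    topic_set: str
    subsets: list = field(default_factory=list)
    weighting_component: dict = field(default_factory=dict)  # word -> weight

def analyze(first_data_set, second):
    """Return one {subset name: reference value} dict per data component.

    A subset receives a value only if the data component contains at least
    one of its feature words. Summing the weights is an assumption.
    """
    results = []
    for words in first_data_set:
        values = {}
        for subset in second.subsets:
            if subset.feature_component & set(words):
                values[subset.name] = sum(
                    second.weighting_component.get(w, 0) for w in words)
        results.append(values)
    return results
```

A data component that mentions no feature word of a subset produces no value for that subset, which mirrors the idea that feature words decide what a sentence is about while weighted words decide how strongly.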
According to the above concept, the at least one data component comprises a result obtained by an automation system.
According to the above concept, the automation system further includes a segmentation system, informationally connected to the at least one data component, which divides the at least one data component into at least one block to obtain a target word of the at least one block.
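As a rough illustration of dividing a data component into blocks and picking out target words, the sketch below splits on punctuation and keeps the words found in a given vocabulary. This is a stand-in only: real Chinese text has no spaces between words and would need a dictionary-based or statistical word segmenter, which the source does not name. The function names and the `vocabulary` parameter are hypothetical.

```python
import re

def split_into_blocks(data_component):
    """Divide one data component into blocks at clause-level punctuation."""
    blocks = re.split(r"[.!?;,。!?;,]+", data_component)
    return [b.strip() for b in blocks if b.strip()]

def target_words(block, vocabulary):
    """Return the words of a block that appear in the vocabulary, in order."""
    return [w for w in re.findall(r"\w+", block.lower()) if w in vocabulary]
```

For example, a comment such as "The ad is fun, but the music is boring!" yields two blocks, and each block can then be scored separately.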
According to the above concept, the at least one specific topic set and/or the specific topic subset has its corresponding weighting component.
According to the above concept, the system further comprises a third data set, which includes the result of the weighting operation and the measurement reference value for the content of the at least one specific topic set and/or the specific topic subset.
According to the above concept, the second data set is preset, or a user can set the specific topic set, the specific topic subset, the feature component, and/or the weighting component through a usage interface.
According to the above concept, the measurement reference value of the weighting component ranges between -5 and +5.
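Because a user may set the weighting component through the usage interface, a simple range check helps keep every weight within the stated -5 to +5 bounds. The sketch below is one way to do that; the function name and the choice to reject out-of-range values (rather than clamp them) are assumptions.

```python
WEIGHT_MIN, WEIGHT_MAX = -5, 5

def validate_weighting_component(weights):
    """Return the weights unchanged, or raise if any lies outside -5..+5."""
    for word, value in weights.items():
        if not WEIGHT_MIN <= value <= WEIGHT_MAX:
            raise ValueError(
                f"weight for {word!r} is {value}, outside [-5, +5]")
    return weights
```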
According to the above concept, the feature component is selected from keywords obtained from academic journals, papers, questionnaires, market survey reports, interviews, and machine learning algorithms.
In another aspect, the present invention provides a method for verifying the validity of a text exploration measurement system, comprising a step of statistically verifying the result obtained by the text exploration measurement system against a result to be compared.
According to the above concept, the statistical verification is selected from one or more statistical methods including the paired-sample t-test.
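For reference, the paired-sample t statistic can be computed with the Python standard library alone. The sketch below assumes two equal-length lists, for example system scores and questionnaire scores for the same items; those variable names are illustrative. It returns the statistic and the degrees of freedom and leaves the significance level and the lookup of a p-value or critical value to the analyst, since the source does not specify them.

```python
import math
import statistics

def paired_t_statistic(system_scores, questionnaire_scores):
    """Return (t, degrees_of_freedom) for a paired-sample t-test."""
    if len(system_scores) != len(questionnaire_scores):
        raise ValueError("paired samples must have equal length")
    diffs = [s - q for s, q in zip(system_scores, questionnaire_scores)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two pairs")
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("differences have zero variance; t is undefined")
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1
```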
In one aspect, the present invention further provides a text exploration measurement method comprising: step one, obtaining, through an automation system, a first data set to be compared, wherein the first data set includes at least one data component; step two, establishing a second data set, wherein the second data set comprises a specific topic subset of at least one specific topic set, and a weighting component, and the specific topic subset includes a feature component corresponding to the content of the first data set; step three, executing, by an analysis server, a comparison step between the first data set and the second data set; and step four, performing, by the analysis server, a weighting operation according to the result of the feature component and the weighting component corresponding to the data component, to obtain a measurement reference value for the at least one data component with respect to the content of the at least one specific topic set and/or the specific topic subset.
According to the above concept, the at least one data component comprises a result obtained by an automation system.
According to the above concept, the at least one specific topic set and/or the specific topic subset has its corresponding weighting component.
According to the above concept, the comparison step comprises: a first comparison step of comparing the data component with the feature component; and a second comparison step of comparing the result of the first comparison step with the weighting component.
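The two comparison steps can be kept as separate functions so that the intermediate match is inspectable. The sketch below follows one reading of the text: the first step finds which feature words occur in the data component, and the second step looks up weights only when the first step matched something. That gating rule and the function names are assumptions, not details given in the source.

```python
def first_comparison(data_component, feature_component):
    """Step 1: keep the words of the data component that are feature words."""
    return [w for w in data_component if w in feature_component]

def second_comparison(data_component, matched_features, weighting_component):
    """Step 2: if step 1 matched anything, collect the weights of the
    weighted words found in the same data component."""
    if not matched_features:
        return {}
    return {w: weighting_component[w]
            for w in data_component if w in weighting_component}
```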
According to the above concept, the method further includes a step in which a segmentation system divides the at least one data component into at least one block to obtain a target word of the at least one block.
According to the above concept, the method further includes a step in which a user sets the specific topic set, the specific topic subset, the feature component, and/or the weighting component through a usage interface.
According to the above concept, the method further includes a step of establishing a third data set comprising the result of the weighting operation and the measurement reference value for the content of the at least one specific topic set and/or the specific topic subset.
According to the above concept, the method further includes a step of applying the foregoing verification method to the result of the third data set, adjusting the content of the feature component according to the result of the statistical verification, and performing the foregoing text exploration measurement method again until the result of the statistical verification is statistically significant.
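The measure, verify, and adjust cycle described above can be sketched as a loop over three caller-supplied functions. The parameter names `measure`, `verify`, and `adjust` are hypothetical, and the `max_rounds` cap is an added safeguard that the source does not mention; the source simply repeats until the verification is statistically significant.

```python
def refine_until_significant(measure, verify, adjust, feature_component,
                             max_rounds=10):
    """Repeat measure -> verify -> adjust until verify reports significance.

    measure(feature_component) -> third data set
    verify(third_data_set)     -> True when statistically significant
    adjust(feature_component, third_data_set) -> revised feature component
    """
    for round_number in range(1, max_rounds + 1):
        third_data_set = measure(feature_component)
        if verify(third_data_set):
            return feature_component, round_number
        feature_component = adjust(feature_component, third_data_set)
    raise RuntimeError("no statistically significant result within max_rounds")
```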
The following drawings and description of embodiments make it easier for those of ordinary skill in the art to understand the spirit of the present invention.
[Description of the Drawings]
FIG. 1 is a schematic diagram of an embodiment of the text exploration measurement system of the present invention.
FIG. 2 is a schematic diagram of another embodiment of the text exploration measurement system of the present invention.
FIG. 3 is a schematic diagram of an embodiment of a facet subset of a specific topic of the present invention.
FIG. 4 is a schematic diagram of another embodiment of a facet subset of a specific topic of the present invention.
FIG. 5 is a schematic diagram of an embodiment showing the weighting values of the weighting component of the present invention.
FIG. 6 is a schematic diagram of an embodiment showing the measurement reference value obtained for the data sentence content to be compared.
FIG. 7 is a schematic diagram of the text exploration measurement method of the present invention.
[Detailed Description]
The following embodiments are described so that those of ordinary skill in the art can understand the spirit of the inventors' creation and carry it out accordingly. The implementation of the present invention is, however, not limited to the following embodiments.
请参阅图1,其为文字探勘衡量系统1的一实施例示意图。如图所示,在本实施例中,文字探勘衡量系统1包含自动化系统110、第二数据集合120、分析服务器130以及使用接口140;在一实施例中,自动化系统110包含探勘程序113及/或分割系统114,其可为爬虫程序及/或断词系统,搜集各大社群网络平台包括Facebook、Youtube、PTT、推特等文字数据,取得待比对之第一数据集合111; 其中第二数据集合120包含特定主题的多阶层文字数据集合121、特定主题的构面子集合122以及用于比对的特征文字组件123以及加权文字组件124;其中分析服务器130信息连接于待比对数据集合111以及第二数据集合120,比对该等数据集合内容,并依据特征文字组件123以及加权文字组件124所对应到待比对数据文句内容112的结果进行加权操作,给予待比对数据文句内容112中的目标词-5至+5分的加权值,得到待比对数据有关特定主题或特定主题的不同构面的衡量参考值。Please refer to FIG. 1 , which is a schematic diagram of an embodiment of the text search measurement system 1 . As shown, in the present embodiment, the text search measurement system 1 includes an automation system 110, a second data set 120, an analysis server 130, and a usage interface 140. In an embodiment, the automation system 110 includes a exploration program 113 and/or Or a segmentation system 114, which may be a crawler program and/or a word breaker system, collecting text data of various social network platforms including Facebook, Youtube, PTT, Twitter, etc., and obtaining a first data set 111 to be compared; The second data set 120 includes a multi-level text data set 121 of a specific theme, a facet sub-set 122 of a specific topic, and a feature text component 123 and a weighted text component 124 for comparison; wherein the analysis server 130 is connected to the to-be-matched The data set 111 and the second data set 120 are weighted according to the content of the data set and the result of the feature text component 123 and the weighted text component 124 corresponding to the data sentence content 112 to be compared, and the data to be compared is given. The weighted value of the target word -5 to +5 points in the sentence content 112 obtains a measurement reference value of the different facets of the specific subject or the specific subject to be compared with the data.
在另一实施例中,分割系统114信息连接于至少一待比对数据文句内容112,区分待比对数据文句内容112成复数个区块,得到复数个区块中的目标词。In another embodiment, the segmentation system 114 is coupled to the at least one to-be-matched data sentence content 112, and distinguishes the to-be-matched data sentence content 112 into a plurality of blocks to obtain target words in the plurality of blocks.
In one embodiment, the feature text component 123 is selected from text components validated in journals and papers from various countries (for example, the 42 brand personality traits of Aaker, 1997) and the translations of those components into various languages, or from text components obtained after journals and papers from various countries have subjected related words supplied by respondents to quantitative (questionnaire) and qualitative (expert interview) analysis.
In one embodiment, the multi-level text data set 121 for a specific topic contains the facet subset 122 of that topic. In one embodiment, the second data set 120 contains Chinese and English text.
In one embodiment, the second data set 120 includes a language dictionary database, a slang database, and a self-built language database.
In one embodiment, the analysis server 130 may use a machine learning algorithm to assign the weights.
In another embodiment, the multi-level text data set 121 for a specific topic is a default, or the user may set or build the multi-level text data set 121 and its facet subset 122, feature text component 123, or weighting text component 124 through the user interface 140.
Please refer to FIG. 2, a schematic diagram of another embodiment of the text exploration and measurement system 2. As shown, in one embodiment the second data set 220 contains a specific topic set 221 and a weighting text component 223 corresponding to the specific topic set 221; the specific topic set 221 contains a facet subset 222 of the specific topic and a weighting text component 224 corresponding to the facet subset 222.
In one embodiment, the feature text component 225 is built according to different facets under different cultures and different industries. In one embodiment, the user may build the feature text component 225.
In another embodiment, the system further comprises a result data set 240 produced by the comparison performed by the analysis server 230, which contains the results of the weighting operation and the measurement reference values of the sentence content 212 to be compared with respect to the specific topic or its different facets.
Please refer to FIG. 3, a schematic diagram of one embodiment of the facet subsets of a specific topic set. As shown, in one embodiment a specific topic set can be subdivided into several facet subsets, such as facet items for the effectiveness of marketing tactics (Action, Awareness, Desire, Excited, Happy, and so on), and each facet item has its own corresponding feature text component.
Please refer to FIG. 4, a schematic diagram of another embodiment of the facet subset of a specific topic. The facet subsets and feature text components are built in three steps: extracting keywords from journal papers and questionnaires, collecting related keywords from focus interviews on each specific topic, and gathering keywords or popular internet terms obtained by machine learning algorithms.
Please refer to FIG. 5, a schematic diagram of an embodiment showing the weights of the weighting components. As shown, different weighting components represent different weights.
In one embodiment, each target word in the sentence content to be compared is assigned a weight between -5 and +5, whose polarity can be proportionally converted and subdivided into intervals like those of a Likert scale.
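One way to read the proportional conversion is as a linear rescaling of the -5 to +5 range onto the points of a Likert-style scale. The sketch below assumes exactly that; the function name `to_likert`, the default of five points, and the round-half-up rule are illustrative choices, not details given in the description.

```python
import math


def to_likert(weight: float, points: int = 5) -> int:
    """Map a weight in [-5, +5] linearly onto a 1..points Likert-style scale.

    Assumption: the conversion is a simple linear rescaling with
    round-half-up; the description does not specify the exact rule.
    """
    if not -5 <= weight <= 5:
        raise ValueError("weight must lie between -5 and +5")
    if points < 2:
        raise ValueError("a scale needs at least two points")
    # Rescale [-5, 5] to [0, 1], stretch to [0, points - 1], round half up.
    fraction = (weight + 5) / 10
    return 1 + math.floor(fraction * (points - 1) + 0.5)
```

Under this assumption a weight of -5 maps to the lowest point, 0 to the midpoint, and +5 to the highest point of the chosen scale.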
Please refer to FIG. 6, a schematic diagram of an embodiment showing the measurement reference values of the sentence content to be compared. In one embodiment, the measurement reference values express the scores of the sentence content to be compared on a specific topic or on different facets of a specific topic.
Please refer to FIG. 7, a schematic diagram of the text exploration and measurement method. In one embodiment, the method comprises: step S101, obtaining through a crawler program a data set to be compared that contains at least one sentence content to be compared; step S102, building a text database that contains a multi-level text data set for a specific topic, a facet subset of the specific topic, a feature text component, and a weighting text component; step S103, dividing the at least one sentence content to be compared into one or more blocks with a Chinese-English word segmentation system to obtain the target words of at least one block; step S104, the analysis server comparing the data set to be compared with the multi-level text data set for the specific topic; and step S105, the analysis server performing a weighting operation according to how the feature component and the weighting component match the at least one sentence content to be compared, to obtain a measurement reference value of the sentence content with respect to the specific topic and/or its different facets.
In one embodiment, step S104 further comprises step S1041, comparing the sentence content to be compared with the feature text component, and step S1042, comparing the result of step S1041 with the weighting text component.
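The two comparison stages and the subsequent weighting can be illustrated with a small sketch. Everything concrete here is hypothetical: the facet names are borrowed from FIG. 3, but the word lists, the weights, the rule that a weighting word modifies the feature word immediately after it, and the default weight of 1 are assumptions for the example only.

```python
from collections import defaultdict

# Hypothetical feature words per facet (facet names as in FIG. 3).
FEATURE_WORDS = {
    "Excited": {"exciting", "thrilling"},
    "Happy": {"happy", "glad"},
}

# Hypothetical weighting words, each with a weight between -5 and +5.
WEIGHT_WORDS = {"very": 2, "slightly": 1, "not": -3}


def score_tokens(tokens: list[str]) -> dict[str, int]:
    """Return a per-facet score for one tokenized sentence."""
    scores: dict[str, int] = defaultdict(int)
    for i, token in enumerate(tokens):
        for facet, words in FEATURE_WORDS.items():
            # First stage (cf. S1041): match the token against feature words.
            if token not in words:
                continue
            # Second stage (cf. S1042): look for a weighting word.
            # Assumption: only the word directly before the match counts,
            # and an unmodified feature word gets a default weight of 1.
            previous = tokens[i - 1] if i > 0 else None
            scores[facet] += WEIGHT_WORDS.get(previous, 1)
    return dict(scores)


example = score_tokens(["very", "exciting", "and", "not", "happy"])
```

With these hypothetical lists, `example` assigns a positive score to the "Excited" facet and a negative score to the "Happy" facet, which is the kind of per-facet measurement reference value the method aims to produce.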
In another embodiment, the sentence content to be compared includes the results obtained by the crawler program and/or the word segmentation system.
In one embodiment, the multi-level text data set for a specific topic and the facet subset of a specific topic each have their corresponding weighting text component.
In another embodiment, the method further comprises a step in which the user sets, through the user interface, the text data set for a specific topic, the facet subset of a specific topic, the feature text component, or the weighting text component.
In one embodiment, the method further comprises a step of building a comparison result data set, which contains the results of the weighting operation and the measurement reference values of the sentence content to be compared with respect to the specific topic and/or different facets of the specific topic.
In another embodiment, the method further comprises a step of statistically verifying the comparison result data set, revising the content of the feature text component according to the result of the statistical verification, and then performing the text exploration and measurement method and the statistical verification again, until the result of the statistical verification is statistically significant.
The present invention further provides a method for verifying the validity of the text exploration and measurement system. In one embodiment, the method comprises a step in which the results obtained by the system, namely the results of the weighting operation and the measurement reference values of the sentence content to be compared with respect to the specific topic and/or different facets of the specific topic, are statistically verified against the corresponding conventional questionnaire survey results, for example with a paired-sample t-test, to confirm validity.
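As an illustration of the paired-sample t-test mentioned above, the following sketch computes the t statistic and degrees of freedom for paired system and questionnaire scores using only the Python standard library. The function name and the sample numbers are hypothetical, and the sketch stops short of computing a p-value; in practice the statistic would be compared against a t distribution, for instance through a statistics package.

```python
import math
import statistics


def paired_t_statistic(system_scores, survey_scores):
    """Return (t, degrees_of_freedom) for a paired-sample t-test.

    t = mean(d) / (stdev(d) / sqrt(n)), where d are the pairwise differences.
    """
    if len(system_scores) != len(survey_scores):
        raise ValueError("both samples must have the same length")
    diffs = [a - b for a, b in zip(system_scores, survey_scores)]
    n = len(diffs)
    if n < 2:
        raise ValueError("at least two pairs are required")
    spread = statistics.stdev(diffs)  # sample standard deviation
    if spread == 0:
        raise ValueError("differences have zero variance")
    t = statistics.mean(diffs) / (spread / math.sqrt(n))
    return t, n - 1


# Hypothetical facet scores from the system and from a questionnaire.
t_value, dof = paired_t_statistic([3, 4, 5, 4], [2, 4, 3, 3])
```

The returned pair can then be checked against a critical value for the chosen significance level to decide whether the system results and the questionnaire results differ.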
The above describes only preferred embodiments of the present application and is not intended to limit its scope of implementation. Any changes and modifications made by a person of ordinary skill in the art without departing from the spirit and scope of the present application fall within the protection sought by the appended claims.
[Description of Reference Symbols]
1 Text exploration and measurement system
2 Text exploration and measurement system
100 Method
S101, S102, S103, S104, S105, S1041, S1042 Steps
110 Automation system
111 Data set to be compared
112 Sentence content to be compared
113 Exploration program
114 Segmentation system
120 Text database
121 Text data set for a specific topic
122 Facet subset of a specific topic
123 Feature text component
124 Weighting text component
130 Analysis server
140 User interface
210 Automation system
211 Data set to be compared
212 Sentence content to be compared
220 Text database
221 Text data set for a specific topic
222 Facet subset of a specific topic
223 Weighting text component
224 Weighting text component
225 Feature text component
230 Analysis server
240 Comparison result data set

Claims (18)

  1. A text exploration and measurement system, comprising:
    a first data set having at least one data component to be compared;
    a second data set comprising a specific topic subset of at least one specific topic set, and a weighting component, wherein the specific topic subset includes a feature component corresponding to the content of the first data set; and
    an analysis server communicatively connected to the first data set and the second data set, which performs a comparison step between the first data set and the second data set and performs a weighting operation according to the result of matching the feature component and the weighting component against the at least one data component, to obtain a measurement reference value of the at least one data component with respect to the at least one specific topic set and/or the content of the specific topic subset.
  2. The text exploration and measurement system of claim 1, wherein the at least one data component includes a result obtained by an automation system.
  3. The text exploration and measurement system of claim 2, wherein the automation system further comprises a segmentation system communicatively connected to the at least one data component, which divides the at least one data component into at least one block and obtains the target words of the at least one block.
  4. The text exploration and measurement system of claim 1, wherein the at least one specific topic set and/or the specific topic subset has its corresponding weighting component.
  5. The text exploration and measurement system of claim 1, further comprising a third data set, wherein the third data set contains the result of the weighting operation and the measurement reference value relating to the at least one specific topic set and/or the content of the specific topic subset.
  6. The text exploration and measurement system of claim 1, wherein the second data set is preset, or a user may set the specific topic set, the specific topic subset, the feature component, and/or the weighting component through a user interface.
  7. The text exploration and measurement system of claim 1, wherein the measurement reference value of the weighting component ranges from -5 to +5.
  8. The text exploration and measurement system of claim 1, wherein the feature component is selected from keywords obtained from academic journals, papers, questionnaires, market research reports, interviews, and machine learning algorithms.
  9. A method for verifying the validity of the text exploration and measurement system of any one of claims 1-8, comprising a step of statistically verifying the results obtained by the text exploration and measurement system against a result to be compared.
  10. The verification method of claim 9, wherein the statistical verification is selected from one or more statistical methods including the paired-sample t-test.
  11. A text exploration and measurement method, comprising:
    step one, obtaining through an automation system a first data set to be compared, wherein the first data set contains at least one data component;
    step two, building a second data set, wherein the second data set comprises a specific topic subset of at least one specific topic set, and a weighting component, and wherein the specific topic subset includes a feature component corresponding to the content of the first data set;
    step three, an analysis server performing a comparison step between the first data set and the second data set; and
    step four, the analysis server performing a weighting operation according to the result of matching the feature component and the weighting component against the data component, to obtain a measurement reference value of the at least one data component with respect to the at least one specific topic set and/or the content of the specific topic subset.
  12. The method of claim 11, wherein the at least one data component includes a result obtained by an automation system.
  13. The method of claim 11, wherein the at least one specific topic set and/or the specific topic subset has its corresponding weighting component.
  14. The method of any one of claims 11-13, wherein the comparison step comprises:
    a first comparison step of comparing the data component with the feature component; and
    a second comparison step of comparing the result of the first comparison step with the weighting component.
  15. The method of any one of claims 11-13, further comprising a step in which:
    a segmentation system divides the at least one data component into at least one block and obtains the target words of the at least one block.
  16. The method of any one of claims 11-13, further comprising a step in which:
    a user sets the specific topic set, the specific topic subset, the feature component, and/or the weighting component through a user interface.
  17. The method of any one of claims 11-13, further comprising a step of:
    building a third data set that contains the result of the weighting operation and the measurement reference value relating to the at least one specific topic set and/or the specific topic subset.
  18. The method of claim 17, further comprising a step of:
    performing the verification method of claim 9 or 10 on the results of the third data set, adjusting the content of the feature component according to the result of the statistical verification, and performing the method of claim 11 again until the result of the statistical verification is statistically significant.
PCT/CN2017/083848 2017-05-10 2017-05-10 Text exploration and measurement system and method WO2018205178A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/083848 WO2018205178A1 (en) 2017-05-10 2017-05-10 Text exploration and measurement system and method

Publications (1)

Publication Number Publication Date
WO2018205178A1 true WO2018205178A1 (en) 2018-11-15

Family

ID=64104094

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/083848 WO2018205178A1 (en) 2017-05-10 2017-05-10 Text exploration and measurement system and method

Country Status (1)

Country Link
WO (1) WO2018205178A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103488635A (en) * 2012-06-11 2014-01-01 腾讯科技(深圳)有限公司 Method and device for acquiring product information
CN104657425A (en) * 2014-10-06 2015-05-27 中华电信股份有限公司 Topic management type network public opinion evaluation management system and method
CN105095302A (en) * 2014-05-15 2015-11-25 财团法人工业技术研究院 Public praise-oriented analysis and inspection system, device and method
CN105589901A (en) * 2014-11-17 2016-05-18 财团法人资讯工业策进会 E-commerce public praise analysis system and method thereof
CN106021433A (en) * 2016-05-16 2016-10-12 北京百分点信息科技有限公司 Public praise analysis method and apparatus for product review data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17909201

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 15/07/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 17909201

Country of ref document: EP

Kind code of ref document: A1