JP2016207171A

JP2016207171A - Parameter estimation method, evaluation value estimation method, apparatus, and program

Info

Publication number: JP2016207171A
Application number: JP2015092094A
Authority: JP
Inventors: ジュンオウ; Jung Oh; 克仁須藤; Katsuto Sudo; 昌明永田; Masaaki Nagata
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2015-04-28
Filing date: 2015-04-28
Publication date: 2016-12-08

Abstract

PROBLEM TO BE SOLVED: To allow a parameter to be obtained so as to accurately estimate an evaluation value of review.SOLUTION: An estimation input part 10 receives a plurality of learning reviews about a plurality of objects to be evaluated. A parameter estimation part 26, on the basis of the plurality of learning reviews received by the estimation input part 10, estimates a model parameter according to a topic model which shows that a parameter in which an aspect overall evaluation value of each aspect with respect to the object to be evaluated and a total evaluation value of the learning reviews depend on a total overall evaluation value and an aspect evaluation value of each aspect of the learning reviews depends on the aspect overall evaluation value of each aspect with respect to objects to be evaluated and the total evaluation value of the learning reviews and which shows distribution of respective words appearing in review sentences to which aspects are assigned with respect to each aspect and a parameter in which aspects with respect to the respective aspects and the respective evaluation values are assigned and which shows distribution of respective words appearing in review sentences where an aspect evaluation value of the aspect becomes an evaluation value are calculated.SELECTED DRAWING: Figure 3

Description

本発明は、パラメータ推定方法、評価値推定方法、装置、及びプログラムに関する。 The present invention relates to a parameter estimation method, an evaluation value estimation method, an apparatus, and a program.

商品等に対する評価や感想を記したレビューデータが広く共有・公開されるようになってきており、その商品等の購入・利用を考えるユーザにとって、対象全体（例えばレストランやホテル）の総合評価、また個別の様相（例えば食事、サービス、場所、価格）に対する評価の情報は有益なものとして捉えられている。評価値推定とは、自然言語で記述されたレビューデータから、対象全体もしくは個別の様相に対する評価値を推定することをいう。 Review data describing evaluations and impressions on products has been widely shared and released, and for users who are considering purchasing and using such products, overall evaluation of the entire target (for example, restaurants and hotels) Evaluation information for individual aspects (eg meals, services, places, prices) is regarded as useful. Evaluation value estimation refers to estimating an evaluation value for an entire object or individual aspects from review data described in a natural language.

評価値推定に関係する従来技術としては、以下の非特許文献１〜非特許文献３が知られている。非特許文献１は、ディリクレ分布をモデルパラメータの事前分布として利用した潜在的ディリクレ配分法について述べたものである。非特許文献２及び非特許文献３に記載の技術は、いずれも非特許文献１の方法の改良である。非特許文献２は、潜在的ディリクレ配分法を文単位で扱うＬｏｃａｌ−ＬＤＡと呼ばれる方法を述べたものであり、文中で評価可能な様相を発見するために利用することが可能である。非特許文献３は、教師ありＬＤＡモデルによって個々のレビューに対応する評価値をモデル化し、評価値を予測するための方法について述べたものである。これらの方法は、レビューの文から個々の様相ごとの評価値を直接得ることを目的としている。例えば、あるホテルについてのレビューから「食事」「サービス」といった様相を評価する表現と評価値との関係をモデル化して、評価値が未知のレビューに対して個々の様相ごとの評価値を予測するといった処理を行うものである。 The following Non-Patent Document 1 to Non-Patent Document 3 are known as conventional techniques related to evaluation value estimation. Non-Patent Document 1 describes a potential Dirichlet distribution method using a Dirichlet distribution as a prior distribution of model parameters. The techniques described in Non-Patent Document 2 and Non-Patent Document 3 are all improvements of the method of Non-Patent Document 1. Non-Patent Document 2 describes a method called “Local-LDA” in which the latent Dirichlet allocation method is handled in sentence units, and can be used to find an aspect that can be evaluated in a sentence. Non-Patent Document 3 describes a method for predicting an evaluation value by modeling an evaluation value corresponding to each review using a supervised LDA model. These methods aim to obtain the evaluation value for each aspect directly from the review text. For example, by modeling the relationship between an evaluation value such as “meal” and “service” from a review of a hotel, and predicting the evaluation value for each individual aspect for reviews with unknown evaluation values Is performed.

David Blei, Andrew Ng and Michael Jordan,“Latent Dirichlet Allocation.”,In JMLR 2003.In JMLR.David Blei, Andrew Ng and Michael Jordan, “Latent Dirichlet Allocation.”, In JMLR 2003. In JMLR. Samuel Brody and Noemie Elhadad,“An unsupervised aspect-sentiment model for online reviews.”,In NAACL 2010Samuel Brody and Noemie Elhadad, “An unsupervised aspect-sentiment model for online reviews.”, In NAACL 2010 D. Blei and J. McAuliffe,“Supervised topic models.”,In NIPS 2007.D. Blei and J. McAuliffe, “Supervised topic models.”, In NIPS 2007.

しかし、上記非特許文献１〜３の方法では、個々の様相ごとの評価値は独立したものとして考えられている。一方で、総合評価の高い対象については個々の様相ごとの評価値も高いことや、あるレビューの全体評価値が高ければそのレビュー中の個々の様相ごとの評価値も高いことが予想されるが、従来の方法ではそうしたことは考慮されていない。そのため、総合評価の高い対象について過度に低い様相ごとの評価値が割り当てられるようなことが起こりやすくなり、評価の一貫性が低い予測が生じる可能性がある。 However, in the methods of Non-Patent Documents 1 to 3, the evaluation value for each aspect is considered to be independent. On the other hand, for subjects with a high overall evaluation, the evaluation value for each aspect is high, and if the overall evaluation value for a review is high, the evaluation value for each aspect in the review is also expected to be high. However, this is not taken into account in the conventional method. For this reason, it is likely that an evaluation value for each aspect that is excessively low is assigned to an object with a high overall evaluation, and prediction with low consistency of evaluation may occur.

本発明は、上記の事情を考慮してなされたもので、レビューの評価値を精度よく推定することができるパラメータ推定方法、評価値推定方法、装置、及びプログラムを提供することを目的とする。 The present invention has been made in view of the above circumstances, and an object thereof is to provide a parameter estimation method, an evaluation value estimation method, an apparatus, and a program that can accurately estimate an evaluation value of a review.

上記の目的を達成するために本発明に係るパラメータ推定方法は、推定用入力部及びパラメータ推定部を含むパラメータ推定装置におけるパラメータ推定方法であって、前記推定用入力部が、複数の様相を含む複数の評価対象に関する複数の学習用レビューであって、かつ前記評価対象に対する全体評価値と前記評価対象の各様相に対する様相評価値とが予め付与された複数の学習用レビューを受け付けるステップと、前記パラメータ推定部が、前記推定用入力部によって受け付けた複数の前記学習用レビューに基づいて、前記評価対象に対する各様相の様相総合評価値と前記学習用レビューの前記全体評価値とが前記評価対象に対する全体総合評価値に依存し、前記学習用レビューの各様相の様相評価値が前記評価対象に対する各様相の様相総合評価値と前記学習用レビューの全体評価値に依存して、各様相についての、前記様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータと、各様相及び各評価値についての、様相が割り当てられ、かつ、前記様相の様相評価値が前記評価値となるレビュー文で出現する各単語の分布を表すパラメータとが算出されることを表すトピックモデルに従って、レビュー文に割り当てられる各様相の分布を表すパラメータと、各様相についての、前記様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータと、各様相及び各評価値についての、様相が割り当てられ、かつ、前記様相の様相評価値が前記評価値となるレビュー文で出現する各単語の分布を表すパラメータと、前記複数の評価対象に対する全体総合評価値の平均及び標準偏差と、各様相についての前記複数の評価対象に対する様相総合評価値の標準偏差と、前記複数の学習用レビューの全体評価値の標準偏差と、各様相についての前記複数の学習用レビューの様相評価値の標準偏差とを含むモデルパラメータを推定するステップと、を含む。 In order to achieve the above object, a parameter estimation method according to the present invention is a parameter estimation method in a parameter estimation apparatus including an estimation input unit and a parameter estimation unit, and the estimation input unit includes a plurality of aspects. Receiving a plurality of reviews for learning, which are a plurality of reviews for learning related to a plurality of evaluation targets, and in which an overall evaluation value for the evaluation target and a mode evaluation value for each aspect of the evaluation target are given in advance; Based on the plurality of learning reviews received by the estimation input unit by the parameter estimation unit, the overall evaluation value of each aspect with respect to the evaluation object and the overall evaluation value of the learning review with respect to the evaluation object Depending on the overall overall evaluation value, the aspect evaluation value of each aspect of the review for learning is different from each aspect of the evaluation object. Depending on the overall evaluation value and the overall evaluation value of the review for learning, for each aspect, a parameter indicating the distribution of each word appearing in the review sentence to which the aspect is assigned, and each aspect and each evaluation value , Each assigned to a review sentence according to a topic model that indicates that an aspect is assigned and a parameter representing a distribution of each word appearing in the review sentence whose aspect evaluation value is the evaluation value is calculated. A parameter indicating the distribution of the aspect, a parameter indicating the distribution of each word appearing in the review sentence to which the aspect is assigned, and an aspect for each aspect and each evaluation value are assigned, and A parameter representing the distribution of each word appearing in the review sentence in which the aspect evaluation value of the aspect becomes the evaluation value, and all of the plurality of evaluation objects The average and standard deviation of the overall evaluation values, the standard deviation of the overall evaluation values for the plurality of evaluation targets for each aspect, the standard deviation of the overall evaluation values of the plurality of learning reviews, and the plurality for each aspect Estimating a model parameter including a standard deviation of an aspect evaluation value of the review for learning.

本発明に係るパラメータ推定装置は、複数の様相を含む複数の評価対象に関する複数の学習用レビューであって、かつ前記評価対象に対する全体評価値と前記評価対象の各様相に対する様相評価値とが予め付与された複数の学習用レビューを受け付ける推定用入力部と、前記推定用入力部によって受け付けた複数の前記学習用レビューに基づいて、前記評価対象に対する各様相の様相総合評価値と前記学習用レビューの前記全体評価値とが前記評価対象に対する全体総合評価値に依存し、前記学習用レビューの各様相の様相評価値が前記評価対象に対する各様相の様相総合評価値と前記学習用レビューの全体評価値に依存して、各様相についての、前記様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータと、各様相及び各評価値についての、様相が割り当てられ、かつ、前記様相の様相評価値が前記評価値となるレビュー文で出現する各単語の分布を表すパラメータとが算出されることを表すトピックモデルに従って、レビュー文に割り当てられる各様相の分布を表すパラメータと、各様相についての、前記様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータと、各様相及び各評価値についての、様相が割り当てられ、かつ、前記様相の様相評価値が前記評価値となるレビュー文で出現する各単語の分布を表すパラメータと、前記複数の評価対象に対する全体総合評価値の平均及び標準偏差と、各様相についての前記複数の評価対象に対する様相総合評価値の標準偏差と、前記複数の学習用レビューの全体評価値の標準偏差と、各様相についての前記複数の学習用レビューの様相評価値の標準偏差とを含むモデルパラメータを推定するパラメータ推定部と、を含んで構成されている。 The parameter estimation device according to the present invention is a plurality of reviews for learning related to a plurality of evaluation objects including a plurality of aspects, and an overall evaluation value for the evaluation object and an aspect evaluation value for each aspect of the evaluation object are preliminarily obtained. An estimation input unit that receives a plurality of given learning reviews, and the overall evaluation value of each aspect and the review for learning based on the plurality of learning reviews received by the estimation input unit And the overall evaluation value of each aspect of the review for learning is the overall evaluation value of each aspect of the review for evaluation and the overall evaluation of the review for learning Depending on the value, for each aspect, a parameter representing the distribution of each word appearing in the review sentence to which the aspect is assigned, and each aspect For each evaluation value, a review is performed according to a topic model indicating that an aspect is assigned and a parameter representing a distribution of each word appearing in a review sentence in which the aspect evaluation value of the aspect becomes the evaluation value is calculated. Parameters indicating the distribution of each aspect assigned to the sentence, parameters indicating the distribution of each word appearing in the review sentence to which the aspect is assigned, and the aspects of each aspect and each evaluation value are assigned. And a parameter representing the distribution of each word appearing in the review sentence in which the aspect evaluation value of the aspect becomes the evaluation value, the average and standard deviation of the overall comprehensive evaluation values for the plurality of evaluation objects, and each aspect Standard deviation of the overall evaluation value of the aspect for the plurality of evaluation objects, and standard deviation of the overall evaluation value of the plurality of learning reviews, Wherein and the plurality of configured to include a parameter estimation unit for estimating the model parameters, the containing and the standard deviation of the learning review aspects evaluation value for the.

本発明に係るパラメータ推定方法は、初期化部が、前記モデルパラメータを初期化するステップを更に含み、前記パラメータ推定部が前記モデルパラメータを推定するステップでは、前記モデルパラメータと、前記複数の学習用レビューの各々の前記全体評価値及び各様相の様相評価値とに基づいて、前記複数の評価対象の各々に対する、前記評価対象の全体総合評価値の平均及び標準偏差と、前記複数の評価対象の各々に対する、各様相についての前記評価対象に対する様相総合評価値の平均及び標準偏差と、前記複数の学習用レビューの各々に対する、前記学習用レビューにおける各様相の分布を表すパラメータと、前記複数の学習用レビューの各レビュー文に対する、前記レビュー文に割り当てられる各様相の分布を表すパラメータとを含む変分パラメータを推定すること、及び推定された前記変分パラメータに基づいて、前記モデルパラメータを推定することを、予め定められた反復終了条件を満たすまで繰り返すようにすることができる。 The parameter estimation method according to the present invention further includes a step in which an initialization unit initializes the model parameter, and the step in which the parameter estimation unit estimates the model parameter includes the model parameter and the plurality of learning parameters. Based on the overall evaluation value of each review and the aspect evaluation value of each aspect, for each of the plurality of evaluation objects, the average and standard deviation of the overall evaluation value of the evaluation object, and the evaluation object For each, the average and standard deviation of the overall evaluation values of the aspects of the evaluation object for each aspect, the parameter representing the distribution of each aspect in the learning review for each of the plurality of learning reviews, and the plurality of learnings For each review sentence for review, a parameter indicating the distribution of each aspect assigned to the review sentence; Estimating the variational parameters including, and estimated on the basis of the variational parameter to estimate the model parameters, it is possible to make repeated until iteration termination condition is satisfied predetermined.

本発明に係る評価値推定方法は、入力部及び評価値推定部を含む評価値推定装置における評価値推定方法であって、前記入力部が、評価対象に関するレビューを受け付けるステップと、前記評価値推定部が、前記入力部によって受け付けた前記レビューと、本発明のパラメータ推定方法によって推定された前記モデルパラメータとに基づいて、前記レビューの前記評価対象に対する全体評価値、及び前記レビューの前記評価対象の各様相に対する様相評価値を推定するステップと、を含む。 The evaluation value estimation method according to the present invention is an evaluation value estimation method in an evaluation value estimation device including an input unit and an evaluation value estimation unit, wherein the input unit receives a review related to an evaluation object, and the evaluation value estimation Based on the review received by the input unit and the model parameter estimated by the parameter estimation method of the present invention, the overall evaluation value for the evaluation target of the review, and the evaluation target of the review Estimating an aspect evaluation value for each aspect.

本発明に係るプログラムは、コンピュータを、本発明のパラメータ推定装置の各部として機能させるためのプログラムである。 The program according to the present invention is a program for causing a computer to function as each unit of the parameter estimation device of the present invention.

以上説明したように、本発明によれば、複数の学習用レビューに基づいて、評価対象に対する各様相の様相総合評価値と学習用レビューの全体評価値とが評価対象に対する全体総合評価値に依存し、学習用レビューの各様相の様相評価値が評価対象に対する各様相の様相総合評価値と学習用レビューの全体評価値に依存して、各様相についての、様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータと、各様相及び各評価値についての、様相が割り当てられ、かつ、様相の様相評価値が評価値となるレビュー文で出現する各単語の分布を表すパラメータとが算出されることを表すトピックモデルに従って、モデルパラメータを推定することにより、レビューの評価値を精度よく推定するためパラメータを得ることができる、という効果が得られる。 As described above, according to the present invention, based on a plurality of learning reviews, the overall aspect evaluation value of each aspect for the evaluation object and the overall evaluation value of the learning review depend on the overall evaluation value for the evaluation object. However, the aspect evaluation value of each aspect of the learning review depends on the overall evaluation value of each aspect and the overall evaluation value of the review for the evaluation target, and appears in the review sentence to which the aspect is assigned for each aspect. A parameter representing the distribution of each word and a parameter representing the distribution of each word appearing in the review sentence in which the aspect is assigned to each aspect and each evaluation value and the aspect evaluation value of the aspect is the evaluation value. By estimating the model parameters according to the topic model representing what is calculated, the parameters can be obtained to accurately estimate the review evaluation value. , An effect that can be obtained.

階層的事前分布に基づくトピックモデルによるレビューの生成過程を疑似コードによって表現したイメージ図である。It is the image figure which expressed the production | generation process of the review by the topic model based on hierarchical prior distribution by the pseudo code. 本発明の実施の形態に係るモデルパラメータ推定処理におけるＥ−ｓｔｅｐと初期化を表す疑似コードである。It is a pseudo code representing E-step and initialization in the model parameter estimation processing according to the embodiment of the present invention. 本発明の実施の形態に係るパラメータ推定装置の構成を示す概略図である。It is the schematic which shows the structure of the parameter estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る評価値推定装置の構成を示す概略図である。It is the schematic which shows the structure of the evaluation value estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係るパラメータ推定装置におけるパラメータ推定処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the parameter estimation process routine in the parameter estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係るパラメータ推定装置におけるＥ−ｓｔｅｐ処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the E-step process routine in the parameter estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係るパラメータ推定装置におけるＭ−ｓｔｅｐ処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the M-step process routine in the parameter estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係るパラメータ推定装置における評価値推定処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the evaluation value estimation process routine in the parameter estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態の実験結果を示す図である。It is a figure which shows the experimental result of embodiment of this invention. 本発明の実施の形態の実験結果を示す図である。It is a figure which shows the experimental result of embodiment of this invention.

以下、図面を参照して本発明の実施の形態を詳細に説明する。以下では、まず本発明の実施の形態の概要及び原理について説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. Below, the outline | summary and principle of embodiment of this invention are demonstrated first.

＜本発明の実施の形態の概要＞
本発明の実施の形態は、評価対象に関するレビューデータにおける評価対象全体または評価対象の個別の様相に対する評価値を高精度に推定するためのものである。評価対象としては、例えばレストランやホテルが挙げられる。また、評価対象は複数の様相を含み、様相としては、例えば食事、サービス、場所、価格等が挙げられる。評価対象全体に対する全体総合評価値、評価対象の個別の様相に対する様相総合評価値、また個々のレビューの評価値は相互に関係があることが予想されるため、本発明の実施の形態では、潜在的ディリクレ配分法（Latent Dirichlet Allocation：LDA）に基づくトピックモデルを利用して評価値の推定を行う。 <Outline of Embodiment of the Present Invention>
The embodiment of the present invention is for accurately estimating an evaluation value for the entire evaluation object or individual aspects of the evaluation object in the review data relating to the evaluation object. Examples of evaluation targets include restaurants and hotels. The evaluation target includes a plurality of aspects, and examples of the aspect include meals, services, places, prices, and the like. In the embodiment of the present invention, the overall comprehensive evaluation value for the entire evaluation target, the overall aspect evaluation value for the individual aspect of the evaluation target, and the evaluation value of each review are expected to be related to each other. The evaluation value is estimated by using a topic model based on Latent Dirichlet Allocation (LDA).

本発明の実施の形態では、評価対象に対する各評価値を確率変数と考え、以下の（１）〜（４）の条件を設定する。 In the embodiment of the present invention, each evaluation value for the evaluation target is considered as a random variable, and the following conditions (1) to (4) are set.

なお、評価対象に対する全体総合評価値は、当該評価対象に関する複数のレビューから得られる評価対象の全体に対する評価値を表す。評価対象に対する各様相の様相総合評価値は、当該評価対象に関する複数のレビューから得られる評価対象の様相に対する評価値を表す。レビューの全体評価値は、１つのレビューにおける評価対象に対する評価値を表す。レビューの各様相の様相評価値は、１つのレビューにおける評価対象の各様相に対する評価値を表す。 The overall comprehensive evaluation value for the evaluation object represents an evaluation value for the entire evaluation object obtained from a plurality of reviews related to the evaluation object. The aspect comprehensive evaluation value of each aspect with respect to the evaluation object represents an evaluation value for the aspect of the evaluation object obtained from a plurality of reviews regarding the evaluation object. The overall evaluation value of a review represents an evaluation value for an evaluation object in one review. The aspect evaluation value of each aspect of the review represents an evaluation value for each aspect to be evaluated in one review.

（１）評価対象の様相ごとの様相総合評価値は、当該評価対象の全体総合評価値を平均とする確率分布に従う。
（２）各レビューの全体評価値は、評価対象の全体総合評価値を平均とする確率分布に従う。
（３）各レビューの様相ごとの様相評価値は、評価対象の様相ごとの総合評価値および当該レビューの全体評価値の重み付き和を平均とする確率分布に従う。
（４）レビュー文中の各単語は、各レビューの様相ごとの様相評価値と潜在トピックの条件付き確率分布に従う。 (1) The aspect comprehensive evaluation value for each aspect to be evaluated follows a probability distribution that averages the overall evaluation value of the evaluation object.
(2) The overall evaluation value of each review follows a probability distribution that averages the overall comprehensive evaluation value of the evaluation target.
(3) The appearance evaluation value for each aspect of each review follows a probability distribution that averages the weighted sum of the overall evaluation value for each aspect to be evaluated and the overall evaluation value of the review.
(4) Each word in the review sentence follows the aspect evaluation value for each aspect of each review and the conditional probability distribution of the latent topic.

本発明の実施の形態では、各評価値は上記（１）〜（４）の条件に従うものとし、各評価値をＬＤＡ（Latent Dirichlet Allocation）に基づくトピックモデルの変数に含めることで、当該レビューの評価値の階層性を考慮することができる。 In the embodiment of the present invention, each evaluation value conforms to the above conditions (1) to (4), and each evaluation value is included in a variable of a topic model based on LDA (Latent Dirichlet Allocation). The hierarchy of evaluation values can be considered.

レビューの評価値の階層性を考慮することにより、全体総合評価値及び様相総合評価値の高低に依存して各レビューの全体評価値、レビューの様相ごとの様相評価値の高低が変化し、全体総合評価値及び様相総合評価値の高い対象に対しては平均的に高い全体評価値や様相ごとの様相評価値がつきやすくなるようなモデルを得られることが期待でき、また当該モデルが得られる結果、高いレビュー評価値の予測精度が得られることが期待できる。 By considering the hierarchy of review evaluation values, the overall evaluation value of each review and the level of the aspect evaluation value for each aspect of the review change depending on the level of the overall comprehensive evaluation value and the overall aspect evaluation value. It can be expected that an object with a high overall evaluation value and an overall aspect evaluation value will be able to obtain a model that is likely to have an average evaluation value that is high on average and an aspect evaluation value for each aspect. As a result, it can be expected that prediction accuracy of high review evaluation values can be obtained.

＜原理＞
次に本発明の実施の形態の原理について説明する。 <Principle>
Next, the principle of the embodiment of the present invention will be described.

[１.定式化]
レビューデータとして、評価対象（例えばレストランやホテル）がＮ個あるとする。各評価対象 [1. Formulation]
Assume that there are N evaluation objects (for example, restaurants and hotels) as review data. Each evaluation target

にはＮ_ｉ個のレビュー N _i reviews

が付与されているとする。また、個々のレビューは Is given. Individual reviews

個の文 Sentence

で記述されており、各文には And each sentence

個の単語 Words

が含まれているとする。さらに、評価対象にはＫ個の評価様相 Is included. Furthermore, there are K evaluation aspects for evaluation.

があるとする。また、Ｖをレビューで用いられる単語（語彙）の異なり数とする。各評価対象には全体総合評価値Ｒ^ａｌｌ _ｉと、様相ごとの様相総合評価値をまとめたＫ次元の様相の評価値ベクトル Suppose there is. Further, V is the number of different words (vocabulary) used in the review. Each evaluation object includes an overall evaluation value R ^all _i and an evaluation value vector of K-dimensional aspects that summarizes the aspect comprehensive evaluation values for each aspect.

が付与されるものとする。各評価値は非負の実数であるとする。各レビューには全体評価値ｒ^ａｌｌ _ｄと、Ｋ次元の様相評価値ベクトル Shall be granted. Each evaluation value is assumed to be a non-negative real number. Each review includes an overall evaluation value r ^all _d and a K-dimensional aspect evaluation value vector.

が付与されるものとする。なお、表記「Ｒ」は複数のレビューを総合して得られる評価値、表記「ｒ」は個別のレビューでの評価値を表す。 Shall be granted. The notation “R” represents an evaluation value obtained by integrating a plurality of reviews, and the notation “r” represents an evaluation value in individual reviews.

本実施の形態では、学習データとして評価値が既知の学習用レビューデータセットを利用してモデルを学習し、評価値が未知のレビューデータに対して評価値の推定を行う。学習用レビューデータ中の各レビューは全体評価値ｒ^ａｌｌ _ｄと様相ごとの様相評価値＾ｒ_ｉがすでに付与されているとする。また、本実施の形態で、評価値が未知のレビューデータに対して得られる評価値は、 In the present embodiment, a model is learned using a learning review data set with a known evaluation value as learning data, and the evaluation value is estimated for review data with an unknown evaluation value. Each review in the training for review data is a whole evaluation value r ^all _d and the appearance evaluation value for each aspect ^ r _i has already been granted. In the present embodiment, the evaluation value obtained for the review data whose evaluation value is unknown is:

１．評価値が未知のレビューデータ中の各評価対象についての全体総合評価値Ｒ^ａｌｌ _ｉ、および様相ごとの様相総合評価値＾Ｒ_ｉ
２．評価値が未知のレビューデータ中の各レビューについての全体評価値ｒ^ａｌｌ _d、および様相ごとの様相評価値ｒ_ｄｋである。 1. Overall comprehensive evaluation value R ^all _i for each evaluation object in the review data whose evaluation value is unknown, and aspect comprehensive evaluation value ^ R _i for each aspect
2. The overall evaluation value r ^all _d for each review in the review data whose evaluation value is unknown, and the aspect evaluation value r _dk for each aspect.

[２．モデル]
本発明の実施の形態では、レビューデータにおける自然言語で記述されたレビューを評価値に応じて生成するような「階層的事前分布に基づくトピックモデル」を利用する。 [2. model]
In the embodiment of the present invention, a “topic model based on a hierarchical prior distribution” that generates a review described in a natural language in review data according to an evaluation value is used.

本実施の形態における階層的事前分布に基づくトピックモデルでは、同一の評価対象や同一のレビューにおける様相ごとの様相評価値は互いに独立ではないと仮定する。また、各評価対象Ｅ_ｉについて、当該評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉは、以下の式（１）に示すように、平均μ、標準偏差Σの正規分布に従って生成されるものとする。 In the topic model based on the hierarchical prior distribution in the present embodiment, it is assumed that the aspect evaluation values for each aspect in the same evaluation object and the same review are not independent from each other. For each evaluation object E _i , the overall overall evaluation value R ^all _i of the evaluation object E _i is generated according to a normal distribution of mean μ and standard deviation Σ, as shown in the following equation (1). To do.

そして、評価対象Ｅ_ｉの様相ごとの様相総合評価値Ｒ_ｉｋとレビューごとの全体評価値ｒ^ａｌｌ _ｄとは、以下の式（２）〜（３）に示すように、それぞれ評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉを平均とする、標準偏差σ_１とσ_２の正規分布に従って生成されるものとする。 Then, the total evaluation value ^{r all} _d for each aspect each aspect review and overall evaluation value _{R ik} evaluated _{E i,} as shown in the following equation (2) to (3), each evaluated _{E i} It is assumed to be generated according to a normal distribution of standard deviations σ ₁ and σ _{2 with} the overall overall evaluation value R ^all _i as an average.

各レビューにおける様相ごとの様相評価値ｒ_ｄｋは、以下の式（４）に示すように、当該レビューの全体評価値ｒ^ａｌｌ _ｄと、当該レビューが評価する評価対象Ｅ_ｉにおける様相ごとの様相総合評価値Ｒ_ｉｋとを線形結合した値を平均とする、標準偏差σ_３の正規分布に従って生成されるものとする。 As shown in the following formula (4), the aspect evaluation value r _dk for each aspect in each review is the overall evaluation value r ^all _d of the review and the aspect total for each aspect in the evaluation object E _i evaluated by the review. It is assumed that the value is generated according to a normal distribution with a standard deviation σ ₃ that averages values obtained by linearly combining the evaluation values R _ik .

ここで、λはレビューｄの全体評価値ｒ^ａｌｌ _ｄと、評価対象Ｅ_ｉのｋ番目の様相に対する様相総合評価値Ｒ_ｉｋとの寄与度合いを調整するパラメータであり、予め定められた任意の定数（例えば０．５）としてもよい。 Here, λ is a parameter for adjusting the degree of contribution between the overall evaluation value r ^all _d of the review d and the overall aspect evaluation value R _ik for the k-th aspect of the evaluation object E _i , and is a predetermined arbitrary constant (For example, 0.5) may be used.

図１に、本実施の形態における階層的事前分布に基づくトピックモデルによるレビューの生成過程を示す。 FIG. 1 shows a review generation process based on a topic model based on a hierarchical prior distribution in the present embodiment.

レビューの各文は同一の様相に対するものであると仮定し、様相をトピックモデルにおける潜在トピックと考えることで、上記図１に示すような生成過程に基づいてレビューが生成されるものとする。つまり、レビューの各単語は、当該単語を含むレビュー文に割り当てられる様相ｚ_ｓとレビューの様相ごとの様相評価値 Assume that each sentence of the review is for the same aspect, and that the aspect is considered as a latent topic in the topic model, and the review is generated based on the generation process as shown in FIG. That is, each word of the review has an aspect z _s assigned to a review sentence including the word and an aspect evaluation value for each aspect of the review.

との条件付き確率分布 Conditional probability distribution with

とに基づいて生成される。 And is generated based on

は、以下の式（５）に示すように、様相ｚ_ｓにおいて出現する単語の分布 Is a distribution of words appearing in the aspect z _s as shown in the following equation (5):

と、様相ｚ_ｓにおいてレビューの様相ごとの様相評価値が And the aspect evaluation value for each aspect of the review in the aspect z _s

であるときに出現する単語の分布 Distribution of words that appear when

との積として表現される。 Expressed as a product of

なお、 In addition,

は、上記式（５）に示すように、任意の正の実数値をとり得るが、レビューデータ中でのレビューの様相ごとの様相評価値は通常離散値であるため、 Can take any positive real value as shown in the above formula (5), but the aspect evaluation value for each aspect of the review in the review data is usually a discrete value.

は整数等の離散値に丸めた値を利用する。 Uses a value rounded to a discrete value such as an integer.

本実施の形態におけるモデルの学習では、評価値が既知の学習用レビューデータにおける単語および評価値の事後確率ｐ（ｗ，ｒ^ａｌｌ _ｄ，ｒ_ｄｋ｜Υ) を最大化するようなモデルパラメータΥ=｛μ，Σ，σ_１，σ_２，σ_３，λ，α，φ，ψ｝を推定し、推定されたモデルパラメータΥを用いて、評価値が未知のレビューに対して、当該レビューの評価値 In the learning of the model in the present embodiment, the model parameter する = that maximizes the posterior probability p (w, r ^all _d , r _dk | Υ) of the word and the evaluation value in the review data for learning whose evaluation value is known. Estimate {μ, Σ, σ ₁ , σ ₂ , σ ₃ , λ, α, φ, ψ}, and use the estimated model parameter Υ to evaluate the review for an unknown evaluation value value

を推定する。 Is estimated.

なお、モデルパラメータΥのμは、複数の評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉの平均を表し、Σは、複数の評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉの標準偏差を表し、σ_１は様相ごとの様相総合評価値Ｒ_ｉｋの標準偏差を表し、σ_２は各レビューの全体評価値ｒ^ａｌｌ _ｄの標準偏差を表し、σ_３は各レビューの様相ごとの様相評価値ｒ_ｄｋの標準偏差を表す。また、λは各レビューの様相ごとの様相評価値における、レビューの全体評価値と様相ごとの様相総合評価値の寄与度合いのパラメータを表す。また、αはディリクレ分布のパラメータ（Ｋ次元）を表し、φは単語ｗの出現分布（様相Ａ_ｋのｋごとに表され、単語数次元である。）を表し、ψは単語ｗの出現分布（様相Ａ_ｋのｋ及び様相Ａ_ｋのｋごとの様相評価値ｒ_ｄｋごとに表され、単語数次元である。）を表す。 Note that μ of the model parameters Upsilon, represents the average of the entire total evaluation value R ^all _i of a plurality of evaluation target E _i, sigma represents a standard deviation of the entire total evaluation value R ^all _i of a plurality of evaluation target E _i , Σ ₁ represents the standard deviation of the overall evaluation value R _ik for each aspect, σ ₂ represents the standard deviation of the overall evaluation value r ^all _d for each review, and σ ₃ represents the aspect evaluation value r for each aspect of each review. _It represents the standard deviation of _dk . In addition, λ represents a parameter of the degree of contribution of the overall evaluation value of the review and the overall evaluation value of the aspect for each aspect in the aspect evaluation value for each aspect of each review. In addition, α represents a parameter (K dimension) of the Dirichlet distribution, φ represents an appearance distribution of the word w (expressed for each _k of the aspect Ak, and is a word number dimension), and ψ represents an appearance distribution of the word w. (represented in every aspect _{a k} of k and aspects _{a k} of k for each aspect evaluation value _{r dk,} is the number-dimensional word.) represents the.

上記図１の生成過程に基づくと、学習用レビューデータに対して求める単語および評価値の事後確率は以下の式（６）に示すようになる。 Based on the generation process of FIG. 1 above, the posterior probabilities of words and evaluation values obtained for the review data for learning are as shown in the following formula (6).

[３．モデルの導出]
上記式（６）に示す事後確率の最大化のための必要な隠れ変数の事後確率分布を効率的に計算することは困難であるため、本実施の形態では非特許文献４（MICHAEL I. JORDAN ，ZOUBIN GHAHRAMANI，TOMMI S. JAAKKOLA，LAWRENCE K. SAUL，“An Introduction to Variational Methods for Graphical Models”，Machine Learning, Volume 37, Issue 2, p.183-233, 1999）に記載されている変分法を用いる。本実施の形態では、隠れ変数の事後確率分布を、変分法によって以下の式（７）に示す変分分布を用いて近似する。 [3. Model derivation]
Since it is difficult to efficiently calculate the posterior probability distribution of the hidden variables necessary for maximizing the posterior probability shown in the above formula (6), in this embodiment, Non-Patent Document 4 (MICHAEL I. JORDAN , ZOUBIN GHAHRAMANI, TOMMI S. JAAKKOLA, LAWRENCE K. SAUL, “An Introduction to Variational Methods for Graphical Models”, Machine Learning, Volume 37, Issue 2, p.183-233, 1999). Is used. In the present embodiment, the posterior probability distribution of the hidden variable is approximated using the variation distribution shown in the following formula (7) by the variation method.

ここで、各レビュー文への様相の割り当てｚ_ｓはＫ次元多項分布 Here, the allocation z _s of the aspect to each review sentence is a K-dimensional multinomial distribution.

に従い、多項分布のパラメータθ_ｄはＫ次元のディリクレ分布Ｄｉｒ（γ_ｄ）に従い、評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉは平均η_ｉ、標準偏差Δ_ｉの正規分布に従い、評価対象Ｅ_ｉの様相に対する様相総合評価値Ｒ_ｉｋは平均β_ｉｋ、標準偏差δ_ｉｋの正規分布に従う。
そして、Ｊｅｎｓｅｎの不等式によって、上記式（６）に示した事後確率の対数尤度 The parameter θ _d of the multinomial distribution follows the K-dimensional Dirichlet distribution Dir (γ _d ), the overall overall evaluation value R ^all _i of the evaluation object E _i follows the normal distribution of the mean η _i and the standard deviation Δ _i , and the evaluation object E _The aspect comprehensive evaluation value R _ik for the aspect of _i follows a normal distribution with an average β _ik and a standard deviation δ _ik .
The log likelihood of the posterior probability shown in the above equation (6) by the Jensen inequality

の下界Ｌが以下の式（８）に示すように得られる。 Is obtained as shown in the following equation (8).

ここで、Ｈ（ｑ）＝−Ｅ［ｌｏｇｑ］は変分分布ｑのエントロピーである。
上記式（８）に基づき、変分パラメータ Here, H (q) = − E [log q] is the entropy of the variational distribution q.
Based on equation (8) above, variational parameters

に対する下界Ｌを最大化する過程（Ｅ−ｓｔｅｐ）と、モデルパラメータΥ=｛μ，Σ，σ_１，σ_２，σ_３，λ，α，φ，ψ｝に対する下界を最大化する過程（Ｍ−ｓｔｅｐ）を繰り返す、変分ＥＭ（Ｅｘｐｅｃｔａｔｉｏｎ−Ｍａｘｉｍｉｚａｔｉｏｎ：期待値最大化）と呼ばれる方法によって事後確率の最大化を行う。なお、変分パラメータΦのηは、複数の評価対象Ｅ_ｉの各々に対する、当該評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉの平均η_ｉからなり、Δは、複数の評価対象Ｅ_ｉの各々に対する、当該評価対象Ｅ_ｉの全体総合評価値Ｒ^ａｌｌ _ｉの標準偏差Δ_ｉからなる。また、βは、複数の評価対象Ｅ_ｉ及び複数の様相ｋの組み合わせの各々に対する、当該組み合わせの様相総合評価値Ｒ_ｉｋの平均β_ｉｋからなり、δは、複数の評価対象Ｅ_ｉ及び複数の様相ｋの組み合わせの各々に対する、当該組み合わせの様相総合評価値Ｒ_ｉｋの標準偏差δ_ｉｋからなる。γは、複数の学習用レビューｄの各々に対する、当該学習用レビューｄにおける各様相ｋの分布を表すディリクレ分布のパラメータγ_ｄｋからなり、 The process of maximizing the lower bound L for (E-step) and the process of maximizing the lower bound for the model parameters Υ = {μ, Σ, σ ₁ , σ ₂ , σ ₃ , λ, α, φ, ψ} (M The posterior probability is maximized by a method called variation EM (Expectation-Maximization). Note that eta variational parameter [Phi, for each of a plurality of evaluation target E _i, consists average eta _i overall total evaluation value R ^all _i of the evaluated E _i, delta is a plurality of evaluation target E _i It consists of the standard deviation Δ _i of the overall overall evaluation value R ^all _i of the evaluation object E _i for each. Β is composed of an average β _ik of the overall evaluation values R _ik of the combination for each of the combinations of the plurality of evaluation objects E _i and the plurality of aspects k, and δ is the plurality of evaluation objects E _i and the plurality of evaluation objects E _i For each combination of aspects k, it consists of the standard deviation δ _ik of the aspect total evaluation value R _ik of the combination. γ consists of a Dirichlet distribution parameter γ _dk representing the distribution of each aspect k in the learning review d for each of the plurality of learning reviews d,

は、複数の学習用レビューの複数のレビュー文ｓに対する、当該レビュー文ｓに割り当てられる各様相ｋの分布を表す多項分布のパラメータ Is a parameter of a multinomial distribution representing the distribution of each aspect k assigned to the review sentence s for a plurality of review sentences s of a plurality of review for learning.

からなる。 Consists of.

［３．１Ｅ−ｓｔｅｐ］ [3.1 E-step]

まず、Ｅ−ｓｔｅｐでは、 First, in E-step,

を満たす最適な変分パラメータ Optimal variational parameter satisfying

を求める。図２に変分ＥＭアルゴリズムのＥ−ｓｔｅｐの計算（ｉは評価対象のＩＤ、ｄはレビューのＩＤ、ｓは文のＩＤ、ｋは様相のＩＤを表す。）と初期化についての擬似コードを示す。 Ask for. FIG. 2 shows the calculation of E-step of the variational EM algorithm (i is the ID of the evaluation object, d is the ID of the review, s is the ID of the sentence, and k is the ID of the aspect) and pseudo code for initialization. Show.

変分パラメータΦのη_ｉ，Δ_ｉ，β_ｉｋ，δ_ｉｋは以下の式（９）〜（１２）に基づいて更新する。 Η _i , Δ _i , β _ik , and δ _ik of the variation parameter Φ are updated based on the following equations (9) to (12).

次に、変分パラメータΦのγ_ｄｋ, Next, the variation parameter Φ γ _dk ,

を以下の式（１３）〜（１４）に基づいて更新する（ｋ＝1,...,Ｋ）。 Is updated based on the following equations (13) to (14) (k = 1,..., K).

ここで、Ψ(・) はｌｏｇΓ関数の一次微分を表す。η_ｉとβ_ｉｋに対する微分、γと Here, Ψ (·) represents the first derivative of the log Γ function. The derivative of η _i and β _ik , γ and

に対する微分は相互に依存するため、上記図２に示すような繰り返し手順によって計算を行う。 Since the derivative of is dependent on each other, the calculation is performed according to the iterative procedure as shown in FIG.

［３．２Ｍ−ｓｔｅｐ］
続いて、Ｍ−ｓｔｅｐでは、モデルパラメータΥ＝｛μ，Σ，σ_１，σ_２，σ_３，λ，α，φ，ψ｝を更新して対数尤度の下界を最大化する。 [3.2 M-step]
Subsequently, in M-step, the model parameter Υ = {μ, Σ, σ ₁ , σ ₂ , σ ₃ , λ, α, φ, ψ} is updated to maximize the lower bound of the log likelihood.

まず、φおよびψを以下の式（１５）〜（１６）に基づいて更新する。 First, φ and ψ are updated based on the following formulas (15) to (16).

ただし、１(・)は条件が満たされたとき１、満たされないときに０を返す関数とする。また、φ_ｋ ^ｗｎは、ｋ番目の様相Ａ_ｋに対するｎ番目の単語ｗの出現確率を表し、ψ_ｋｍ ^ｗｎは、レビューｄの様相評価値がｍであり、かつｋ番目の様相Ａ_ｋに対するｎ番目の単語ｗの出現確率を表す。
次に、αを以下の式（１７）に基づいて更新する。 However, 1 (·) is a function that returns 1 when the condition is satisfied and returns 0 when the condition is not satisfied. Φ _k ^wn represents the appearance probability of the n-th word w with respect to the k-th aspect A _k , and ψ _km ^wn represents the aspect evaluation value of the review d and is n with respect to the k-th aspect A _k . This represents the appearance probability of the th word w.
Next, α is updated based on the following equation (17).

以下の式（１８）に示すように、ｋに対する微分はα_ｋ’（ｋ≠ｋ’）に依存するため、上記非特許文献１に記述されているＮｅｗｔｏｎ−Ｒａｐｈｓｏｎ法による繰り返し処理によって最大値を求める方法を必要とする。最大値を求めるためのヘッセ行列は以下の式（１８）に示すように計算する。 As shown in the following formula (18), since the derivative with respect to k depends on α _k ′ (k ≠ k ′), the maximum value is obtained by the iterative process by the Newton-Raphson method described in Non-Patent Document 1 above. You need a way to ask. The Hessian matrix for obtaining the maximum value is calculated as shown in the following equation (18).

次に、以下の式（１９）〜（２３）に基づいてμ，Σ，σ_１，σ_２，σ_３を更新する。 Next, μ, Σ, σ ₁ , σ ₂ , and σ ₃ are updated based on the following equations (19) to (23).

［４．予測］
上述した方法によってモデルパラメータΥを推定した後に、評価値が未知のレビューデータに対する評価値Θの予測を以下の式（２４）に従って行う。 [4. prediction]
After the model parameter Υ is estimated by the above-described method, the evaluation value Θ is predicted for review data whose evaluation value is unknown according to the following equation (24).

Θは評価値を表し、Υはモデルパラメータを表す。ｐ（ｗ｜Θ，Υ）はパラメータｚ_ｓ，α，θを積分消去することによって求められるが、直接計算することは難しい。そこで本実施の形態では、ｚ_ｓをまず各レビュー文に割り当てる。
なお、ｐ（ｗ｜Θ，Υ）の最大化はｐ（ｗ｜Θ，Υ，ｚ）の最大化に置き換えられ、以下の式（２５）に示すように最大化を行う。 Θ represents an evaluation value, and Υ represents a model parameter. Although p (w | Θ, Υ) is obtained by integrating and eliminating the parameters z _s , α, and θ, it is difficult to calculate directly. Therefore, in the present embodiment, z _s is first assigned to each review sentence.
Note that maximization of p (w | Θ, Υ) is replaced with maximization of p (w | Θ, Υ, z), and maximization is performed as shown in the following equation (25).

ｐ（ｗ∈Ｅ_ｉ｜Θ_ｉ，Υ，ｚ）のΘの各要素に対する微分が０となるように得られる以下の式（２６）〜（２９）によって、当該レビューの評価対象Ｅiについての全体総合評価値Ｒ^ａｌｌ _ｉ、および様相ごとの様相総合評価値＾Ｒ_ｉ、当該レビューについての全体評価値ｒ^ａｌｌ _d、および様相ごとの様相評価値ｒ_ｄｋを求める。 _{_{p (w∈E i | Θ i,}} Υ, z) by theta formula below are obtained as the derivative becomes zero for each element of the (26) to (29), the entire evaluation target Ei of the review The overall evaluation value R ^all _i , the aspect overall evaluation value ＲR _i for each aspect, the overall evaluation value r ^all _d for the review, and the aspect evaluation value r _dk for each aspect are obtained.

ここで、Ｔ（ｒ_ｄｋ）は以下の式（３０）によって計算される。 Here, T (r _dk ) is calculated by the following equation (30).

Θの各要素に対する微分は独立でないため、本実施の形態ではΘの全要素を０とし、上記式（２６）〜（２９）に示す更新式によって収束するまで繰り返し更新する。 Since the differentiation for each element of Θ is not independent, in this embodiment, all elements of Θ are set to 0, and are updated repeatedly until convergence is achieved by the update expressions shown in the above equations (26) to (29).

次に、評価対象のレビューに関するモデルパラメータを推定するパラメータ推定装置と、パラメータ推定装置によって推定されたモデルパラメータを用いて、評価値推定対象のレビューの評価値を推定する評価値推定装置に、本発明を適用した場合を例に説明する。 Next, the parameter estimation device for estimating the model parameter relating to the review of the evaluation target and the evaluation value estimation device for estimating the evaluation value of the review of the evaluation value estimation target using the model parameter estimated by the parameter estimation device are A case where the invention is applied will be described as an example.

＜パラメータ推定装置のシステム構成＞
図３に示すように、本実施の形態に係るパラメータ推定装置１００は、ＣＰＵと、ＲＡＭと、後述する学習処理ルーチンを実行するためのプログラムを記憶したＲＯＭとを備えたコンピュータで構成され、機能的には次に示すように構成されている。本実施の形態に係るパラメータ推定装置は、推定用入力部１０と、推定用演算部２０と、モデルパラメータ記憶部３０とを備えている。 <System configuration of parameter estimation device>
As shown in FIG. 3, parameter estimation apparatus 100 according to the present embodiment is configured by a computer including a CPU, a RAM, and a ROM that stores a program for executing a learning processing routine described later, and functions. Specifically, it is configured as follows. The parameter estimation apparatus according to the present embodiment includes an estimation input unit 10, an estimation calculation unit 20, and a model parameter storage unit 30.

推定用入力部１０は、複数の評価対象に関する複数の学習用レビューを受け付ける。学習用レビューには、評価対象に対する全体評価値と評価対象の各様相に対する様相評価値とが予め付与されている。 The estimation input unit 10 receives a plurality of learning reviews related to a plurality of evaluation targets. In the review for learning, the overall evaluation value for the evaluation object and the aspect evaluation value for each aspect of the evaluation object are given in advance.

推定用演算部２０は、レビューデータベース２２と、初期化部２４と、パラメータ推定部２６とを備えている。 The estimation calculation unit 20 includes a review database 22, an initialization unit 24, and a parameter estimation unit 26.

レビューデータベース２２には、推定用入力部１０によって受け付けた複数の学習用レビューが格納される。 The review database 22 stores a plurality of learning reviews received by the estimation input unit 10.

初期化部２４は、各パラメータの初期値を設定する。具体的には、初期化部２４は、モデルパラメータΥの｛μ，Σ，σ_１，σ_２，σ_３，λ，α，φ，ψ｝について、予め定められた初期値を設定し、モデルパラメータを初期化する。また、初期化部２４は、変分パラメータΦについて、以下の式に従って、初期値を設定する。 The initialization unit 24 sets initial values for each parameter. Specifically, the initialization unit 24 sets predetermined initial values for {μ, Σ, σ ₁ , σ ₂ , σ ₃ , λ, α, φ, ψ} of the model parameter 、, and the model Initialize parameters. The initialization unit 24 sets an initial value for the variation parameter Φ according to the following equation.

パラメータ推定部２６は、レビューデータベース２２に格納された複数の学習用レビューに基づいて、上述したトピックモデルに従って、モデルパラメータΥを推定する。モデルパラメータΥには、レビュー文に割り当てられる各様相の分布を表すパラメータαと、各様相についての、様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータφと、各様相及び各評価値についての、様相が割り当てられ、かつ、様相の様相評価値が評価値となるレビュー文で出現する各単語の分布を表すパラメータψと、複数の評価対象に対する全体総合評価値の平均μ及び標準偏差Σと、各様相についての複数の評価対象に対する様相総合評価値の標準偏差σ_１と、複数の学習用レビューの全体評価値の標準偏差σ_２と、各様相についての複数の学習用レビューの様相評価値の標準偏差σ_３とが含まれている。 The parameter estimation unit 26 estimates the model parameter Υ according to the above-described topic model based on the plurality of learning reviews stored in the review database 22. The model parameter Υ includes a parameter α representing the distribution of each aspect assigned to the review sentence, a parameter φ representing the distribution of each word appearing in the review sentence to which the aspect is assigned, and each aspect and each A parameter ψ representing the distribution of each word appearing in a review sentence in which an aspect is assigned to the evaluation value and the aspect evaluation value of the aspect becomes the evaluation value, and an average μ of the overall comprehensive evaluation values for a plurality of evaluation objects Standard deviation Σ, standard deviation σ ₁ of overall evaluation values for multiple evaluation targets for each aspect, standard deviation σ ₂ of overall evaluation values for multiple learning reviews, and multiple learning reviews for each aspect The standard deviation σ ₃ of the aspect evaluation value is included.

本実施の形態では、上記図１に示されるように、トピックモデルにおいて、評価対象に対する各様相の様相総合評価値と学習用レビューの全体評価値とが評価対象に対する全体総合評価値に依存し、学習用レビューの各様相の様相評価値が評価対象に対する各様相の様相総合評価値と学習用レビューの全体評価値に依存することが表されている。また、上記式（５）に示されるように、当該依存関係から、トピックモデルにおいてパラメータφとパラメータψとが算出されることが表されている。 In the present embodiment, as shown in FIG. 1 above, in the topic model, the aspect comprehensive evaluation value of each aspect for the evaluation object and the overall evaluation value of the review for learning depend on the overall evaluation value for the evaluation object, It is shown that the aspect evaluation value of each aspect of the review for learning depends on the overall evaluation value of each aspect for the evaluation object and the overall evaluation value of the review for learning. Further, as shown in the above equation (5), it is expressed that the parameter φ and the parameter ψ are calculated in the topic model from the dependency relationship.

パラメータ推定部２６は、変分ＥＭアルゴリズムを用いて、Ｅ−ｓｔｅｐにおいて変分パラメータΦを推定し、Ｍ−ｓｔｅｐにおいてモデルパラメータΥを推定する。パラメータ推定部２６は、Ｅ−ｓｔｅｐで推定された変分パラメータΦに基づいて、モデルパラメータΥを推定することを、予め定められた反復終了条件を満たすまで繰り返す。 The parameter estimation unit 26 uses the variational EM algorithm to estimate the variation parameter Φ in E-step and the model parameter において in M-step. The parameter estimation unit 26 repeats estimating the model parameter Υ based on the variation parameter Φ estimated in E-step until a predetermined iteration end condition is satisfied.

まず、パラメータ推定部２６は、初期化され、又は前回推定されたモデルパラメータΥと、複数の学習用レビューの各々の全体評価値及び各様相の様相評価値とに基づいて、複数の評価対象Ｅ_ｉの各々に対する、評価対象Ｅ_ｉの全体総合評価値の平均η_ｉ及び標準偏差Δ_ｉと、複数の評価対象Ｅ_ｉの各々に対する、各様相ｋについての評価対象Ｅ_ｉに対する様相総合評価値の平均β_ｉｋ及び標準偏差δ_ｉｋと、複数の学習用レビューｄの各々に対する、学習用レビューｄにおける各様相ｋの分布を表すパラメータγ_ｄｋと、複数の学習用レビューの各レビュー文ｓに対する、レビュー文ｓに割り当てられる各様相ｋの分布を表すパラメータ First, the parameter estimator 26 sets a plurality of evaluation targets E based on the model parameter Υ that has been initialized or previously estimated, and the overall evaluation values and the aspect evaluation values of each aspect of the plurality of learning reviews. for each _i, the average eta _i and standard deviation delta _i overall total evaluation value of the evaluation object E _i, for each of a plurality of evaluation target E _i, aspects comprehensive evaluation value for the evaluation object E _i for each aspect k Reviews for the average β _ik and standard deviation δ _ik , the parameter γ _dk representing the distribution of each aspect k in the learning review d for each of the plurality of learning reviews d, and the review sentences s of the plurality of learning reviews d Parameter indicating the distribution of each aspect k assigned to sentence s

とを含む変分パラメータΦを推定する。 A variational parameter Φ including

具体的には、パラメータ推定部２６は、Ｅ−ｓｔｅｐにおいて、上記式（１０）に従って、各評価対象ＩＤｉについて、変分パラメータΔ^２ _ｉを更新する。次に、パラメータ推定部２６は、各評価対象ＩＤｉ及び各様相ＩＤｋについて、上記式（１２）に従って、変分パラメータδ^２ _ｉｋを更新する。 Specifically, the parameter estimation unit 26 updates the variation parameter Δ ² _i for each evaluation target IDi in E-step according to the above equation (10). Next, the parameter estimation unit 26 updates the variation parameter δ ² _ik according to the above equation (12) for each evaluation target IDi and each aspect IDk.

次に、パラメータ推定部２６は、上記式（１３）及び（１４）に基づいて、複数の学習用レビューｄの各々に対する、学習用レビューｄにおける各様相ｋの分布を表すパラメータγ_ｄｋと、複数の学習用レビューの各レビュー文ｓに対する、レビュー文ｓに割り当てられる各様相ｋの分布を表すパラメータ Next, the parameter estimation unit 26, based on the above formulas (13) and (14), the parameter γ _dk representing the distribution of each aspect k in the learning review d for each of the plurality of learning reviews d, and a plurality of parameters γ _dk Representing the distribution of each aspect k assigned to the review sentence s for each review sentence s of the review for learning

とを更新する。パラメータ推定部２６は、更新処理が収束するまで上記式（１３）及び（１４）に基づいて更新を繰り返す。 And update. The parameter estimation unit 26 repeats the update based on the above formulas (13) and (14) until the update process converges.

そして、パラメータ推定部２６は、上記式（９）に基づいて、各評価対象ＩＤｉについて、η_ｉを更新する。また、パラメータ推定部２６は、上記式（１１）に基づいて、各評価対象ＩＤｉ及び各様相ＩＤｋについて、β_ｉｋを更新する。パラメータ推定部２６は、更新処理が収束するまで上記式（９）及び（１１）に基づいて更新を繰り返す。 And the parameter estimation part 26 updates (eta) _i about each evaluation object IDi based on the said Formula (9). Moreover, the parameter estimation part 26 updates (beta) _ik about each evaluation object IDi and each aspect IDk based on the said Formula (11). The parameter estimation unit 26 repeats the update based on the above formulas (9) and (11) until the update process converges.

次に、パラメータ推定部２６は、Ｍ−ｓｔｅｐにおいて、Ｅ−ｓｔｅｐで推定されたパラメータ Next, the parameter estimation unit 26 performs parameter estimation in E-step in M-step.

に基づいて、上記式（１５）及び（１６）に従って、パラメータφとパラメータψとを推定する。 Based on the above, the parameter φ and the parameter ψ are estimated according to the above equations (15) and (16).

また、パラメータ推定部２６は、Ｍ−ｓｔｅｐにおいて、Ｅ−ｓｔｅｐで推定されたパラメータγに基づいて、上記式（１７）に従って、レビュー文に割り当てられる各様相の分布を表すパラメータαを推定する。 Also, the parameter estimation unit 26 estimates the parameter α representing the distribution of each aspect assigned to the review sentence according to the above equation (17) based on the parameter γ estimated in the E-step in the M-step.

また、パラメータ推定部２６は、Ｅ−ｓｔｅｐで推定された変分パラメータΦに基づいて、上記式（１９）〜（２３）に従って、｛μ，Σ，σ_１，σ_２，σ_３｝を更新する。 Further, the parameter estimation unit 26 updates {μ, Σ, σ ₁ , σ ₂ , σ ₃ } according to the above formulas (19) to (23) based on the variation parameter Φ estimated in E-step. To do.

そして、パラメータ推定部２６は、推定されたモデルパラメータをモデルパラメータ記憶部３０に格納する。 Then, the parameter estimation unit 26 stores the estimated model parameter in the model parameter storage unit 30.

モデルパラメータ記憶部３０には、パラメータ推定部２６によって推定されたモデルパラメータΥが記憶される。 The model parameter storage unit 30 stores the model parameter 推定 estimated by the parameter estimation unit 26.

＜評価値推定装置＞
図４に示すように、本実施の形態に係る評価値推定装置２００は、ＣＰＵと、ＲＡＭと、後述する評価値推定処理ルーチンを実行するためのプログラムを記憶したＲＯＭとを備えたコンピュータで構成され、機能的には次に示すように構成されている。 <Evaluation value estimation device>
As shown in FIG. 4, evaluation value estimation apparatus 200 according to the present embodiment is configured by a computer that includes a CPU, a RAM, and a ROM that stores a program for executing an evaluation value estimation processing routine described later. Functionally, it is configured as follows.

本実施の形態に係る評価値推定装置は、入力部４０と、モデルパラメータ記憶部５０と、演算部６０と、出力部７０とを備えている。 The evaluation value estimation apparatus according to the present embodiment includes an input unit 40, a model parameter storage unit 50, a calculation unit 60, and an output unit 70.

入力部４０は、評価対象Ｅ_ｉに関するレビューを受け付ける。 The input unit 40 receives a review regarding the evaluation target E _i .

モデルパラメータ記憶部５０には、上記パラメータ推定装置１００のモデルパラメータ記憶部３０に記憶されているモデルパラメータと同じモデルパラメータが記憶される。 The model parameter storage unit 50 stores the same model parameters as the model parameters stored in the model parameter storage unit 30 of the parameter estimation apparatus 100.

演算部６０は、評価値推定部６２を備えている。 The calculation unit 60 includes an evaluation value estimation unit 62.

評価値推定部６２は、入力部４０によって受け付けたレビューと、モデルパラメータ記憶部５０に記憶されたモデルパラメータとに基づいて、上記式（２６）〜（２９）に従って、レビューに対する総合評価値ｒ^ａｌｌ _d、レビューの評価対象の各様相に対する様相評価値ｒ_ｄｋ、評価対象Ｅ_ｉに対する全体総合評価値Ｒ^ａｌｌ _ｉ、及び評価対象Ｅ_ｉの各様相ｋに対する様相総合評価値Ｒ_ｉｋを推定する。評価値推定部６２は、上記式（２６）〜（２９）に示す更新式によって収束するまで繰り返し更新する。 The evaluation value estimation unit 62, based on the review received by the input unit 40 and the model parameters stored in the model parameter storage unit 50, according to the above formulas (26) to (29), the overall evaluation value r ^all for the review. _d, aspects evaluation value _{r dk} for each aspect of the evaluation of the review, evaluation to estimate the appearance overall evaluation value _{R ik} for each aspect k of the entire comprehensive evaluation value ^{R all} _i, and evaluated _{E i} with respect to the target _{E i.} The evaluation value estimation unit 62 repeatedly updates until convergence is achieved by the update formulas shown in the above formulas (26) to (29).

出力部７０は、評価値推定部６２によって推定された評価値を出力する。 The output unit 70 outputs the evaluation value estimated by the evaluation value estimation unit 62.

＜パラメータ推定装置の作用＞
次に、本実施の形態に係るパラメータ推定装置の作用について説明する。まず、複数の学習用レビューがパラメータ推定装置に入力されると、推定用入力部１０が複数の学習用レビューを受け付け、レビューデータベース２２に格納する。そして、パラメータ推定装置において、図５に示すパラメータ推定処理ルーチンが実行される。 <Operation of parameter estimation device>
Next, the operation of the parameter estimation apparatus according to this embodiment will be described. First, when a plurality of learning reviews are input to the parameter estimation device, the estimation input unit 10 receives a plurality of learning reviews and stores them in the review database 22. Then, in the parameter estimation device, a parameter estimation processing routine shown in FIG. 5 is executed.

まず、ステップＳ１００において、初期化部２４は、各パラメータの初期値を設定する。 First, in step S100, the initialization unit 24 sets initial values for each parameter.

そして、ステップＳ１０２において、パラメータ推定部２６は、レビューデータベース２２に格納された複数の学習用レビューと、上記ステップＳ１００で初期化された変分パラメータΦ又は前回の本ステップＳ１０２で前回推定された変分パラメータΦと、上記ステップＳ１００で初期化されたモデルパラメータΥ又は後述するステップＳ１０４で前回推定されたモデルパラメータΥとに基づいて、上述したトピックモデルに従って、変分パラメータΦを推定する。ステップＳ１００は、図６に示すＥ−ｓｔｅｐ処理ルーチンによって実現される。 In step S102, the parameter estimation unit 26, the plurality of learning reviews stored in the review database 22, the variation parameter Φ initialized in step S100, or the variable estimated last time in the previous step S102. Based on the minute parameter Φ and the model parameter 初期 initialized in step S100 or the model parameter Υ estimated last time in step S104 described later, the variation parameter Φ is estimated according to the topic model described above. Step S100 is realized by an E-step processing routine shown in FIG.

＜Ｅ−ｓｔｅｐ処理ルーチン＞
ステップＳ２００において、パラメータ推定部２６は、各評価対象ＩＤｉについて、上記ステップＳ１００で初期化されたモデルパラメータ｛Σ，σ_１，σ_２｝又は後述するステップＳ１０４で前回更新されたモデルパラメータ｛Σ，σ_１，σ_２｝に基づいて、上記式（１０）に従って、変分パラメータΔ^２ _ｉを更新する。 <E-step processing routine>
In step S200, the parameter estimation unit 26 determines, for each evaluation target IDi, the model parameter {Σ, σ ₁ , σ ₂ } initialized in step S100 or the model parameter {Σ, Based on σ ₁ , σ ₂ }, the variation parameter Δ ² _i is updated according to the above equation (10).

ステップＳ２０２において、パラメータ推定部２６は、各評価対象ＩＤｉ及び各様相ＩＤｋについて、上記ステップＳ１００で初期化されたモデルパラメータ｛σ_１，σ_３｝又は後述するステップＳ１０４で前回更新されたモデルパラメータ｛σ_１，σ_３｝に基づいて、上記式（１２）に従って、変分パラメータδ^２ _ｉｋを更新する。 In step S202, the parameter estimation unit 26 determines, for each evaluation object IDi and each aspect IDk, the model parameters {σ ₁ , σ ₃ } initialized in step S100 or the model parameters { Based on σ ₁ , σ ₃ }, the variational parameter δ ² _ik is updated according to the above equation (12).

ステップＳ２０４において、パラメータ推定部２６は、各レビュー文ＩＤｓ及び各様相ＩＤｋについて、上記ステップＳ１００で初期化されたモデルパラメータ｛φ，ψ｝又は後述するステップＳ１０４で前回更新されたモデルパラメータ｛φ，ψ｝と、上記ステップＳ１００で初期化された変分パラメータ｛γ｝又は後述するステップＳ２０８で前回更新された変分パラメータ｛γ｝とに基づいて、上記式（１４）に従って、 In step S204, for each review sentence IDs and each aspect IDk, the parameter estimation unit 26 initializes the model parameter {φ, ψ} in step S100 or the model parameter {φ, last updated in step S104 described later. ψ} and the variation parameter {γ} initialized in step S100 or the variation parameter {γ} updated last time in step S208 described later, according to the above equation (14),

を更新する。 Update.

ステップＳ２０６において、パラメータ推定部２６は、上記ステップＳ２０４で更新された In step S206, the parameter estimation unit 26 is updated in step S204.

に基づいて、 On the basis of the,

を正規化する。 Is normalized.

ステップＳ２０８において、パラメータ推定部２６は、各レビューＩＤｄ及び各様相ＩＤｋについて、上記ステップＳ２０６で得られた In step S208, the parameter estimation unit 26 obtains each review IDd and each aspect IDk in step S206.

と、上記ステップＳ１００で初期化されたモデルパラメータ｛α｝又は後述するステップＳ１０４で前回更新されたモデルパラメータ｛α｝とに基づいて、上記式（１３）に従って、γ_ｄｋを更新する。 Then, γ _dk is updated according to the above equation (13) based on the model parameter {α} initialized in step S100 or the model parameter {α} updated last time in step S104 described later.

ステップＳ２１０において、パラメータ推定部２６は、上記ステップＳ２０４〜ステップＳ２０８の処理が収束したか否かを判定する。処理が収束した場合には、ステップＳ２１２へ進む。一方、処理が収束していない場合には、ステップＳ２０４へ戻る。 In step S210, the parameter estimation unit 26 determines whether or not the processing in steps S204 to S208 has converged. If the process has converged, the process proceeds to step S212. On the other hand, if the process has not converged, the process returns to step S204.

ステップＳ２１２において、パラメータ推定部２６は、各評価対象ＩＤｉについて、上記ステップＳ１００で初期化されたモデルパラメータ｛μ，Σ，σ_１，σ_２｝又は後述するステップＳ１０４で前回更新されたモデルパラメータ｛μ，Σ，σ_１，σ_２｝と、上記ステップＳ１００で初期化された変分パラメータ｛β｝又は後述するステップＳ２１４で前回更新された変分パラメータ｛β｝と、レビューデータベース２２に格納された学習用レビューｄの全体評価値ｒ_ｄ ^ａｌｌとに基づいて、上記式（９）に基づいて、変分パラメータη_ｉを更新する。 In step S212, the parameter estimation unit 26 for each evaluation target IDi, the model parameter {μ, Σ, σ ₁ , σ ₂ } initialized in step S100 or the model parameter { μ, Σ, σ ₁ , σ ₂ }, the variation parameter {β} initialized in step S100 or the variation parameter {β} updated last time in step S214 described later, and the review database 22 are stored. The variation parameter η _i is updated based on the above formula (9) based on the overall evaluation value r _d ^all of the learning review d.

ステップＳ２１４において、パラメータ推定部２６は、各評価対象ＩＤｉ及び各様相ＩＤｋについて、上記ステップＳ２１２で更新された変分パラメータ｛η｝と、上記ステップＳ１００で初期化されたモデルパラメータ｛σ_１，σ_３｝又は後述するステップＳ１０４で前回更新されたモデルパラメータ｛σ_１，σ_３｝と、レビューデータベース２２に格納された学習用レビューｄの全体評価値ｒ_ｄ ^ａｌｌ及び学習用レビューｄの様相ごとの様相評価値ｒ_ｄｋとに基づいて、上記式（１１）に従って、変分パラメータβ_ｉｋを更新する。 In step S214, the parameter estimation unit 26 for each evaluation object IDi and each aspect IDk, the variation parameter {η} updated in step S212 and the model parameters {σ ₁ , σ initialized in step S100. ₃ } or the model parameters {σ ₁ , σ ₃ } last updated in step S104 described later, the overall evaluation value r _d ^all of the learning review d stored in the review database 22, and the aspect of the learning review d Based on the aspect evaluation value r _dk , the variation parameter β _ik is updated according to the above equation (11).

ステップＳ２１６において、パラメータ推定部２６は、上記ステップＳ２１２〜ステップＳ２１４の処理が収束したか否かを判定する。処理が収束した場合には、ステップＳ２１８へ進む。一方、処理が収束していない場合には、ステップＳ２１２へ戻る。 In step S216, the parameter estimation unit 26 determines whether or not the processing in steps S212 to S214 has converged. If the process has converged, the process proceeds to step S218. On the other hand, if the process has not converged, the process returns to step S212.

ステップＳ２１８において、上記ステップＳ２００〜ステップＳ２０８及びステップＳ２１２〜ステップＳ２１４で更新された変分パラメータを結果として出力して、Ｅ−ｓｔｅｐ処理ルーチンを終了する。 In step S218, the variation parameter updated in steps S200 to S208 and S212 to S214 is output as a result, and the E-step processing routine is terminated.

次にパラメータ推定処理ルーチンに戻り、ステップＳ１０４において、上記ステップＳ１０２で出力された変分パラメータΦと、レビューデータベース２２に格納された複数の学習用レビューｄとに基づいて、モデルパラメータΥを推定する。ステップＳ２０２は、図７に示すＭ−ｓｔｅｐ処理ルーチンによって実現される。 Next, returning to the parameter estimation processing routine, in step S104, the model parameter Υ is estimated based on the variation parameter Φ output in step S102 and the plurality of learning reviews d stored in the review database 22. . Step S202 is realized by the M-step processing routine shown in FIG.

＜Ｍ−ｓｔｅｐ処理ルーチン＞
ステップＳ３００において、パラメータ推定部２６は、上記ステップＳ１０２で更新された変分パラメータ <M-step processing routine>
In step S300, the parameter estimation unit 26 updates the variation parameter updated in step S102.

と、レビューデータベース２２に格納された複数の学習用レビューｄとに基づいて、上記式（１５）及び（１６）に従って、モデルパラメータ｛φ，ψ｝を更新する。 Based on the plurality of learning reviews d stored in the review database 22, the model parameters {φ, ψ} are updated according to the above formulas (15) and (16).

ステップＳ３０２において、パラメータ推定部２６は、上記ステップＳ１０２で更新された変分パラメータ｛γ｝と、レビューデータベース２２に格納された複数の学習用レビューｄとに基づいて、上記式（１７）及び（１８）に従って、モデルパラメータ｛α｝を推定する。 In step S302, the parameter estimation unit 26, based on the variation parameter {γ} updated in step S102 and the plurality of learning reviews d stored in the review database 22, the above equations (17) and ( According to 18), the model parameter {α} is estimated.

ステップＳ３０４において、パラメータ推定部２６は、上記ステップＳ１０２で出力された変分パラメータΦと、レビューデータベース２２に格納された複数の学習用レビューｄとに基づいて、上記式（１９）〜（２３）に従って、｛μ，Σ，σ_１，σ_２，σ_３｝を更新する。 In step S304, the parameter estimation unit 26 calculates the above formulas (19) to (23) based on the variation parameter Φ output in step S102 and the plurality of learning reviews d stored in the review database 22. , {Μ, Σ, σ ₁ , σ ₂ , σ ₃ } are updated.

ステップＳ３０６において、上記ステップＳ３００〜ステップＳ３０４で更新されたモデルパラメータΥを結果として出力して、Ｍ−ｓｔｅｐ処理ルーチンを終了する。 In step S306, the model parameter 更新 updated in steps S300 to S304 is output as a result, and the M-step processing routine is terminated.

次にパラメータ推定処理ルーチンに戻り、パラメータ推定部２６は、ステップＳ１０６において、上記ステップＳ１０２〜ステップＳ１０４の処理が収束したか否かを判定する。上記ステップＳ１０２〜ステップＳ１０４の処理が収束した場合には、ステップＳ１０８へ進む。一方、上記ステップＳ１０２〜ステップＳ１０４の処理が収束していない場合には、ステップＳ１０２へ戻る。 Next, returning to the parameter estimation processing routine, in step S106, the parameter estimation unit 26 determines whether or not the processing in steps S102 to S104 has converged. If the processing from step S102 to step S104 has converged, the process proceeds to step S108. On the other hand, if the processes in steps S102 to S104 have not converged, the process returns to step S102.

ステップＳ１０８において、パラメータ推定部２６は、上記ステップＳ１０２〜ステップＳ１０４の処理で得られたモデルパラメータΥをモデルパラメータ記憶部３０に格納して、パラメータ推定処理ルーチンを終了する。 In step S108, the parameter estimation unit 26 stores the model parameter Υ obtained by the processing in steps S102 to S104 in the model parameter storage unit 30, and ends the parameter estimation processing routine.

＜評価値推定装置の作用＞
次に、本実施の形態に係る音響信号分析装置の作用について説明する。まず、パラメータ推定装置１００のモデルパラメータ記憶部３０に記憶されているモデルパラメータΥが、評価値推定装置２００に入力されると、モデルパラメータ記憶部５０に格納される。そして、評価値推定対象のレビューが評価値推定装置２００に入力されると図８に示す評価値推定処理ルーチンが実行される。 <Operation of evaluation value estimation device>
Next, the operation of the acoustic signal analyzer according to this embodiment will be described. First, when the model parameter Υ stored in the model parameter storage unit 30 of the parameter estimation device 100 is input to the evaluation value estimation device 200, it is stored in the model parameter storage unit 50. When the review of the evaluation value estimation target is input to the evaluation value estimation device 200, the evaluation value estimation processing routine shown in FIG. 8 is executed.

ステップＳ４００において、入力部４０は、評価対象Ｅiに関するレビューを受け付ける。 In step S400, the input unit 40 receives a review regarding the evaluation target Ei.

ステップＳ４０２において、モデルパラメータ記憶部５０に記憶されたモデルパラメータΥを読み込む。 In step S402, the model parameter 記憶 stored in the model parameter storage unit 50 is read.

ステップＳ４０４において、評価値推定部６２は、上記ステップＳ４００で受け付けたレビューと、上記ステップＳ４０２で読み込まれたモデルパラメータΥとに基づいて、上記式（２６）〜（２９）に従って、レビューの評価対象に対する総合評価値ｒ^ａｌｌ _ｄ、レビューの評価対象の各様相に対する様相評価値ｒ_ｄｋ、評価対象Ｅ_ｉに対する全体総合評価値Ｒ^ａｌｌ _ｉ、及び評価対象Ｅ_ｉの各様相ｋに対する様相総合評価値Ｒ_ｉｋを推定する。 In step S404, the evaluation value estimation unit 62 evaluates the review according to the above formulas (26) to (29) based on the review received in step S400 and the model parameter 読み込ま read in step S402. overall evaluation value ^{r all} _d, appearance evaluation value for evaluating each aspect of the subject of the review _{r dk} for, the entire overall evaluation value ^{R all} _i for evaluation _{E i,} and evaluation aspects comprehensive evaluation value for each aspect k of the target _{E i} R Estimate _ik .

ステップＳ４０６において、評価値推定部６２は、上記ステップＳ４０４の処理が収束したか否かを判定する。上記ステップＳ４０４の処理が収束した場合には、ステップＳ４０８へ進む。一方、上記ステップＳ４０４の処理が収束していない場合には、ステップＳ４０４へ戻る。 In step S406, the evaluation value estimation unit 62 determines whether or not the process of step S404 has converged. If the process of step S404 has converged, the process proceeds to step S408. On the other hand, if the process of step S404 has not converged, the process returns to step S404.

ステップＳ４０８において、上記ステップＳ４０４で推定された評価値を結果として出力して、評価値推定処理ルーチンを終了する。 In step S408, the evaluation value estimated in step S404 is output as a result, and the evaluation value estimation processing routine is terminated.

＜実施例＞
［データ］
本発明の実施の形態を、評価対象をホテルとするレビューデータに適用した。このレビューデータでは、各ホテルに対して全体評価値に加え、サービス、清潔さ、場所、等の７つの様相に対して１点から５点の様相評価値が付けられている。 <Example>
[data]
The embodiment of the present invention is applied to review data in which an evaluation object is a hotel. In this review data, in addition to the overall evaluation value for each hotel, one to five aspect evaluation values are assigned to seven aspects such as service, cleanliness, and location.

［実験設定］
（様相の同定）
約３００語のキーワードをブートストラップ法によって選択し、選択されたキーワードをＬＤＡモデルを学習させるための事前知識とした。トピックモデルの学習によって得られたトピックは各様相の内容を記述できると考えることができる。 [Experimental settings]
(Identification of aspects)
About 300 keywords were selected by the bootstrap method, and the selected keywords were used as prior knowledge for learning the LDA model. It can be considered that the topic obtained by learning the topic model can describe the contents of each aspect.

そして、ＫＬダイバージェンスを用いて、トピックモデルによって学習された各様相と、真の様相との分布間距離を測った。 Then, using KL divergence, the distance between distributions of each aspect learned by the topic model and the true aspect was measured.

図９に、学習された様相と真の様相との間のＫＬダイバージェンスについての実験結果を表す。本実施の形態から得られた結果と、事前知識なしのＬＤＡ（上記非特許文献１)、ｓＬＤＡ（非特許文献３）、Ｌｏｃａｌ−ＬＤＡ（上記非特許文献２）から得られた結果とを比較したところ、本実施の形態のモデルはＬＤＡやｓＬＤＡより真の様相との距離が近く、Ｌｏｃａｌ−ＬＤＡと同等であるという結果が得られた。 FIG. 9 shows the experimental results of KL divergence between the learned aspect and the true aspect. The results obtained from the present embodiment are compared with the results obtained from LDA without prior knowledge (Non-Patent Document 1), sLDA (Non-Patent Document 3), and Local-LDA (Non-Patent Document 2). As a result, the model according to the present embodiment is closer to the true aspect than LDA and sLDA, and the result is equivalent to Local-LDA.

（評価値推定）
本発明の実施の形態による評価値推定の性能評価を行った。なお、レビューデータで定義されている様相と自動的に学習される様相とが対応するようにするため、キーワードを事前知識として利用した。評価尺度としては、推定した評価値と実際の評価値との間の平均二乗誤差（ＭＳＥ）、Mean Average Precision（ＭＡＰＮ）、情報検索の評価によく使われるＮＤＣＫＮの３つの指標を用いた。 (Evaluation value estimation)
The performance evaluation of evaluation value estimation according to the embodiment of the present invention was performed. In addition, keywords were used as prior knowledge so that the aspects defined in the review data correspond to the aspects learned automatically. As the evaluation scale, three indices of mean square error (MSE) between the estimated evaluation value and the actual evaluation value, Mean Average Precision (MAPN), and NDCKN often used for evaluation of information retrieval were used.

図１０に、実験から得られた評価値の推定精度を表す。図１０に示すように、本発明の実施の形態と、ＬＤＡ、ｓＬＤＡ、Ｂｏｏｔｓｔｒａｐ、ＬｏｃａｌＬＤＡとを比較した結果、本発明の実施の形態は、対象全体の評価値、様相ごとの評価値のいずれでも他手法より優れた推定精度を示した。 FIG. 10 shows the estimation accuracy of the evaluation value obtained from the experiment. As shown in FIG. 10, as a result of comparing the embodiment of the present invention with LDA, sLDA, Bootstrap, and LocalLDA, the embodiment of the present invention can be used for both the evaluation value for the entire object and the evaluation value for each aspect. The estimation accuracy is better than other methods.

以上説明したように、本発明の実施の形態に係るパラメータ推定装置によれば、複数の学習用レビューに基づいて、評価対象に対する各様相の様相総合評価値と学習用レビューの全体評価値とが評価対象に対する全体総合評価値に依存し、学習用レビューの各様相の様相評価値が評価対象に対する各様相の様相総合評価値と学習用レビューの全体評価値に依存して、各様相についての、様相が割り当てられたレビュー文で出現する各単語の分布を表すパラメータと、各様相及び各評価値についての、様相が割り当てられ、かつ、様相の様相評価値が評価値となるレビュー文で出現する各単語の分布を表すパラメータとが算出されることを表すトピックモデルに従って、モデルパラメータを推定することにより、レビューの評価値を精度よく推定するためパラメータを得ることができる。 As described above, according to the parameter estimation device according to the embodiment of the present invention, based on a plurality of learning reviews, the overall evaluation value of each aspect for the evaluation target and the overall evaluation value of the learning review are calculated. Depending on the overall evaluation value for the evaluation target, the aspect evaluation value of each aspect of the review for learning depends on the overall evaluation value of each aspect for the evaluation object and the overall evaluation value of the learning review. Parameters that represent the distribution of each word that appears in a review sentence to which an aspect is assigned, and appearances in a review sentence in which an aspect is assigned to each aspect and each evaluation value, and the aspect evaluation value of the aspect is an evaluation value By estimating the model parameters according to the topic model representing that the parameters representing the distribution of each word are calculated, the evaluation value of the review can be accurately estimated. It is possible to obtain a parameter for.

また、本発明の実施の形態に係る評価値推定装置によれば、評価対象に関するレビュー
と、本発明の実施の形態に係るパラメータ推定装置によって推定されたモデルパラメータとに基づいて、レビューの評価対象に対する全体評価値、及びレビューの評価対象の各様相に対する様相評価値を推定することにより、レビューの評価対象に対する評価値を精度よく推定することができる。 In addition, according to the evaluation value estimation device according to the embodiment of the present invention, the evaluation target for review based on the review related to the evaluation target and the model parameter estimated by the parameter estimation device according to the embodiment of the present invention. By estimating the overall evaluation value for and the aspect evaluation value for each aspect of the review evaluation target, the evaluation value for the review evaluation target can be accurately estimated.

なお、本発明は、上述した実施形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 Note that the present invention is not limited to the above-described embodiment, and various modifications and applications are possible without departing from the gist of the present invention.

例えば、上記実施の形態では、パラメータ推定装置１００と評価値推定装置２００とを別々の装置として構成する場合を例に説明したが、パラメータ推定装置１００と評価値推定装置２００を１つの装置として構成してもよい。 For example, in the above embodiment, the case where the parameter estimation device 100 and the evaluation value estimation device 200 are configured as separate devices has been described as an example. However, the parameter estimation device 100 and the evaluation value estimation device 200 are configured as one device. May be.

また、上述のパラメータ推定装置１００及び評価値推定装置２００は、内部にコンピュータシステムを有しているが、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。 Moreover, although the parameter estimation apparatus 100 and the evaluation value estimation apparatus 200 described above have a computer system inside, if the “computer system” uses a WWW system, the homepage provision environment (or Display environment).

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、当該プログラムを、コンピュータ読み取り可能な記録媒体に格納して提供することも可能である。 In the present specification, the embodiment has been described in which the program is installed in advance. However, the program can be provided by being stored in a computer-readable recording medium.

１０推定用入力部
２０推定用演算部
２２レビューデータベース
２４初期化部
２６パラメータ推定部
３０モデルパラメータ記憶部
４０入力部
５０モデルパラメータ記憶部
６０演算部
６２評価値推定部
７０出力部
１００パラメータ推定装置
２００評価値推定装置 DESCRIPTION OF SYMBOLS 10 Estimation input part 20 Estimation operation part 22 Review database 24 Initialization part 26 Parameter estimation part 30 Model parameter storage part 40 Input part 50 Model parameter storage part 60 Operation part 62 Evaluation value estimation part 70 Output part 100 Parameter estimation apparatus 200 Evaluation value estimation device

Claims

A parameter estimation method in a parameter estimation device including an estimation input unit and a parameter estimation unit,
The estimation input unit is a plurality of review reviews for a plurality of evaluation objects including a plurality of aspects, and an overall evaluation value for the evaluation object and an aspect evaluation value for each aspect of the evaluation object are given in advance. Accepting multiple learning reviews, and
Based on the plurality of learning reviews received by the estimation input unit by the parameter estimation unit, the overall evaluation value of each aspect with respect to the evaluation target and the overall evaluation value of the learning review are the evaluation target. Depending on the overall evaluation value for each aspect, the aspect evaluation value of each aspect of the review for learning depends on the overall evaluation value of each aspect for the evaluation object and the overall evaluation value of the review for learning. , A parameter indicating the distribution of each word appearing in the review sentence to which the aspect is assigned, and a review in which an aspect is assigned to each aspect and each evaluation value, and the aspect evaluation value of the aspect is the evaluation value. According to the topic model representing that the parameters representing the distribution of each word appearing in the sentence are calculated,
Parameters indicating the distribution of each aspect assigned to the review sentence, parameters indicating the distribution of each word appearing in the review sentence to which the aspect is assigned, and the aspects of each aspect and each evaluation value are assigned. A parameter representing the distribution of each word appearing in a review sentence in which the aspect evaluation value of the aspect is the evaluation value, an average and a standard deviation of the overall comprehensive evaluation values for the plurality of evaluation objects, and each aspect A standard deviation of overall evaluation values of the plurality of evaluation targets for the plurality of evaluation targets, a standard deviation of overall evaluation values of the plurality of learning reviews, and a standard deviation of the aspect evaluation values of the plurality of learning reviews for each aspect, and Estimating model parameters including:
A parameter estimation method including:

The initialization unit further includes a step of initializing the model parameter, and the step of the parameter estimation unit estimating the model parameter includes the model parameter, the overall evaluation value of each of the plurality of learning reviews, and Based on the aspect evaluation value of each aspect, the average and standard deviation of the overall overall evaluation value of the evaluation object for each of the plurality of evaluation objects, and the evaluation for each aspect for each of the plurality of evaluation objects The average and standard deviation of the overall aspect evaluation values for the target, the parameter representing the distribution of each aspect in the review for learning for each of the plurality of learning reviews, and the review sentences of the plurality of review for learning, Estimating variational parameters including parameters representing the distribution of each aspect assigned to the review text And based on the estimated variation parameter, to estimate the model parameter, parameter estimation method according to claim 1 repeated until iteration termination condition is satisfied predetermined.

An evaluation value estimation method in an evaluation value estimation device including an input unit and an evaluation value estimation unit,
The input unit accepting a review related to the evaluation target; and
The evaluation value estimation unit, based on the review received by the input unit, and the model parameter estimated by the parameter estimation method according to claim 1 or 2, the entire evaluation target for the review Estimating an evaluation value and an aspect evaluation value for each aspect of the evaluation target of the review;
Evaluation value estimation method including

A plurality of reviews for learning related to a plurality of evaluation objects including a plurality of aspects, and a plurality of reviews for learning to which an overall evaluation value for the evaluation object and an aspect evaluation value for each aspect of the evaluation object are given in advance An input unit for estimation to accept;
Based on the plurality of learning reviews received by the estimation input unit, the overall aspect evaluation value of each aspect for the evaluation object and the overall evaluation value of the learning review become the overall evaluation value for the evaluation object. The aspect evaluation value of each aspect of the review for learning depends on the overall evaluation value of the aspect for each evaluation object and the overall evaluation value of the review for learning, and the aspect for each aspect is assigned. Parameters that represent the distribution of each word appearing in a review sentence, and each word appearing in a review sentence in which an aspect is assigned to each aspect and each evaluation value, and the aspect evaluation value of the aspect becomes the evaluation value According to the topic model representing that the parameter representing the distribution of
Parameters indicating the distribution of each aspect assigned to the review sentence, parameters indicating the distribution of each word appearing in the review sentence to which the aspect is assigned, and the aspects of each aspect and each evaluation value are assigned. A parameter representing the distribution of each word appearing in a review sentence in which the aspect evaluation value of the aspect is the evaluation value, an average and a standard deviation of the overall comprehensive evaluation values for the plurality of evaluation objects, and each aspect A standard deviation of overall evaluation values of the plurality of evaluation targets for the plurality of evaluation targets, a standard deviation of overall evaluation values of the plurality of learning reviews, and a standard deviation of the aspect evaluation values of the plurality of learning reviews for each aspect, and A parameter estimator for estimating model parameters including:
A parameter estimation apparatus including:

The program for functioning a computer as each part of the parameter estimation apparatus of Claim 4.