JP6558860B2

JP6558860B2 - Estimation device, prediction device, method, and program

Info

Publication number: JP6558860B2
Application number: JP2016103758A
Authority: JP
Inventors: 秀明金; 澤田　宏; 宏澤田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-05-24
Filing date: 2016-05-24
Publication date: 2019-08-14
Anticipated expiration: 2036-05-24
Also published as: JP2017211783A

Description

本発明は、推定装置、予測装置、方法、及びプログラムに関する。 The present invention relates to an estimation device, a prediction device, a method, and a program.

従来技術としてトピックモデルを利用した手法がある（非特許文献１）。トピックモデルでは、個体群の行動履歴データから有限の特徴的な行動パターンが抽出され、各個体の行動はその特徴的な行動パターンの組み合わせで表現され、予測が行われる。より具体的には、個体が複数の選択肢からある行動を選ぶ過程は多項分布で確率的にモデル化され、各個体の選択行動は有限の多項分布の線形和で表現され、有限の多項分布のパラメータ値及び各個体の線形和の重み係数は個体群の行動履歴データから推定される。各個体の行動の予測は、推定された多項分布の線形和に基づき実行される。また、各個体に対して推定された重み係数は各個体の行動パターンあるいは行動傾向を数値化したものであるため、その重み係数に基づき個体を分類あるいはセグメンテーションすることができる。 As a conventional technique, there is a technique using a topic model (Non-Patent Document 1). In the topic model, a finite characteristic behavior pattern is extracted from the behavior history data of an individual group, and the behavior of each individual is expressed by a combination of the characteristic behavior patterns and prediction is performed. More specifically, the process in which an individual selects an action from a plurality of options is modeled stochastically with a multinomial distribution, and the selection behavior of each individual is expressed as a linear sum of a finite multinomial distribution. The parameter value and the weight coefficient of the linear sum of each individual are estimated from the behavior history data of the individual group. The prediction of the behavior of each individual is executed based on the linear sum of the estimated multinomial distributions. Further, since the weighting factor estimated for each individual is a numerical value of the behavior pattern or behavior tendency of each individual, the individual can be classified or segmented based on the weighting factor.

また、個体の選択行動が直前の選択に依存することを許容する手法がある（非特許文献２）。この場合、個体が複数の選択肢からある行動を選ぶ過程は条件付きの多項分布で表現される。 In addition, there is a technique that allows an individual's selection behavior to depend on the previous selection (Non-patent Document 2). In this case, the process in which an individual selects an action from a plurality of options is expressed by a conditional multinomial distribution.

T. Iwata et al.,“Topic Tracking Model for Analyzing Consumer Purchase Behavior”, IJCAI, pp.1427-1432, 2009.T. Iwata et al., “Topic Tracking Model for Analyzing Consumer Purchase Behavior”, IJCAI, pp.1427-1432, 2009. H. M. Wallach, “Topic modeling: beyond bag-of-words”, Proceedings of the 23rd international conference on Machine learning, pp.977-984, 2006.H. M. Wallach, “Topic modeling: beyond bag-of-words”, Proceedings of the 23rd international conference on Machine learning, pp.977-984, 2006.

従来技術（非特許文献１，２）において、個体の選択行動の予測は選択の内容のみの予測であり、選択のタイミングは予測の対象外であった。個体の行動履歴データにおいては、個体が過去に取った選択肢の履歴のみが使用され、その選択行動が発生したタイミングあるいは時刻の履歴は使用されていなかった。ここでタイミングとは、直前の選択行動からの経過時間を意味する。このとき、以下の問題が発生する。 In the prior art (Non-Patent Documents 1 and 2), the prediction of the individual's selection behavior is prediction of only the content of selection, and the selection timing is outside the target of prediction. In the behavior history data of an individual, only the history of options taken by the individual in the past is used, and the history of the timing or time when the selected behavior occurs is not used. Here, the timing means an elapsed time from the immediately preceding selection action. At this time, the following problems occur.

（１）各個体が選択行動を行うタイミングを予測することができない。 (1) The timing at which each individual performs a selection action cannot be predicted.

（２）（１）の結果として、各個体に対して推薦あるいはナビゲーションする内容をタイミングによって変更することができない。例えば次回の購買でTシャツと靴下を購買する確率がそれぞれ０．６と０．４の顧客において、この顧客が実は昨日Ｔシャツを購入しており、かつこの顧客がＴシャツを二日連続で購入する確率が極めて低い場合、本日に推薦すべき商品はＴシャツでなく靴下であると考えられるが、従来技術ではタイミングに関係なくＴシャツが選ばれてしまう。 (2) As a result of (1), the content recommended or navigated for each individual cannot be changed depending on the timing. For example, in the next purchase, the probability of purchasing T-shirts and socks is 0.6 and 0.4 respectively, and this customer actually purchased T-shirts yesterday, and this customer purchased T-shirts for two consecutive days. If the probability of purchase is very low, it is considered that the product to be recommended today is not a T-shirt but a sock, but in the prior art, a T-shirt is selected regardless of timing.

（３）選択行動の内容とタイミングの両側面を考慮した行動パターンの特徴を、個体群の行動履歴データから抽出することができない。 (3) It is not possible to extract the behavior pattern characteristics in consideration of the both sides of the content and timing of the selected behavior from the behavior history data of the individual group.

本発明は、上記問題点を解決するために成されたものであり、個体の選択内容及び選択タイミングを精度よく予測するためのパラメータを推定することができる推定装置、方法、及びプログラムを提供することを目的とする。 The present invention has been made to solve the above-described problems, and provides an estimation device, method, and program capable of estimating parameters for accurately predicting the selection content and selection timing of an individual. For the purpose.

また、個体の選択内容及び選択タイミングを精度よく予測することができる予測装置、方法、及びプログラムを提供することを目的とする。 It is another object of the present invention to provide a prediction apparatus, method, and program capable of accurately predicting the selection content and selection timing of an individual.

上記目的を達成するために、第１の発明に係る推定装置は、複数の個体で観測された、前記個体の選択行動の選択内容、及び前回の選択行動からの間隔である選択タイミングを表す選択行動データの集合を入力とし、前記選択行動データの集合に含まれる各選択行動データについて、各トピックに対する、前記トピックを条件とした選択内容の確率を表す条件付き多項分布、及び前記トピックを条件とした選択タイミングの確率密度分布を用いて表される、各トピックに対する、前記トピックを条件とした、前記選択内容及び前記選択タイミングの同時確率分布に従って、前記選択行動データが所属するトピックを計算し、各選択行動データについての前記トピックの計算結果に基づいて、各個体及び各トピックに対する重みを表すトピック比率の分布を表すハイパーパラメータと、前記トピックにおける、前記選択内容及び前記選択タイミングを特徴付けるトピック分布のパラメータの分布を表すハイパーパラメータとを更新することを繰り返すトピック推定部と、前記トピック推定部によって更新された前記トピック分布のパラメータの分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについての前記トピックの計算結果に基づく、前記トピック分布のパラメータの事後確率に従って、前記トピック分布のパラメータを推定するトピック分布パラメータ推定部と、前記トピック推定部によって更新された前記トピック比率の分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについての前記トピックの計算結果に基づく、前記トピック比率の事後確率に従って、各個体について、前記トピック比率を推定するトピック比率推定部と、を含んで構成されている。 In order to achieve the above object, the estimation device according to the first aspect of the present invention provides a selection that represents a selection content of the selection behavior of the individual and a selection timing that is an interval from the previous selection behavior, which are observed in a plurality of individuals. A set of action data as an input, and for each selected action data included in the set of selected action data, a conditional multinomial distribution representing the probability of selection content with respect to each topic for each topic, and the topic as a condition The topic to which the selected action data belongs is calculated according to the selection content and the simultaneous probability distribution of the selection timing, with the topic as a condition, for each topic, expressed using the probability density distribution of the selected timing, A topic representing the weight for each individual and each topic based on the calculation result of the topic for each selected behavior data A topic estimation unit that repeatedly updates a hyper parameter that represents a distribution of a rate and a hyper parameter that represents a distribution of a topic distribution parameter that characterizes the selection content and the selection timing in the topic, and is updated by the topic estimation unit Hyperparameters representing the distribution of the topic distribution parameters and the posterior probabilities of the topic distribution parameters based on the calculation results of the topics for each selection behavior data obtained for each iteration. A topic distribution parameter estimator for estimating a parameter, a hyperparameter representing the distribution of the topic ratio updated by the topic estimator, and a calculation result of the topic for each selection behavior data obtained for each iteration. Z According posterior probability of the topic proportions, for each individual, and is configured to include a topic proportions estimator for estimating the topic proportions.

また、第２の発明に係る推定方法は、トピック推定部が、複数の個体で観測された、前記個体の選択行動の選択内容、及び前回の選択行動からの間隔である選択タイミングを表す選択行動データの集合を入力とし、前記選択行動データの集合に含まれる各選択行動データについて、各トピックに対する、前記トピックを条件とした選択内容の確率を表す条件付き多項分布、及び前記トピックを条件とした選択タイミングの確率密度分布を用いて表される、各トピックに対する、前記トピックを条件とした、前記選択内容及び前記選択タイミングの同時確率分布に従って、前記選択行動データが所属するトピックを計算し、各選択行動データについての前記トピックの計算結果に基づいて、各個体及び各トピックに対する重みを表すトピック比率の分布を表すハイパーパラメータと、前記トピックにおける、前記選択内容及び前記選択タイミングを特徴付けるトピック分布のパラメータの分布を表すハイパーパラメータとを更新することを繰り返し、トピック分布パラメータ推定部が、前記トピック推定部によって更新された前記トピック分布のパラメータの分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについての前記トピックの計算結果に基づく、前記トピック分布のパラメータの事後確率に従って、前記トピック分布のパラメータを推定し、トピック比率推定部が、前記トピック推定部によって更新された前記トピック比率の分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについての前記トピックの計算結果に基づく、前記トピック比率の事後確率に従って、各個体について、前記トピック比率を推定する。 In addition, the estimation method according to the second aspect of the present invention is directed to a selection behavior in which the topic estimation unit indicates a selection content of the selection behavior of the individual and a selection timing that is an interval from the previous selection behavior, observed by a plurality of individuals. With a set of data as input, for each selected action data included in the set of selected action data, a conditional multinomial distribution representing the probability of selection content on the topic for each topic, and the topic as a condition The topic to which the selected behavior data belongs is calculated according to the selection content and the simultaneous probability distribution of the selection timing, with the topic as a condition, for each topic, expressed using the probability density distribution of the selection timing, A topic ratio representing the weight for each individual and each topic based on the calculation result of the topic for the selected behavior data Repeatedly updating the hyperparameter representing the distribution of the topic and the hyperparameter representing the distribution of the parameter of the topic distribution that characterizes the selection content and the selection timing in the topic, and the topic distribution parameter estimation unit includes the topic estimation unit The topic according to the posterior probabilities of the parameter of the topic distribution based on the hyperparameter representing the distribution of the parameter of the topic distribution updated by the method and the calculation result of the topic for each selection behavior data obtained for each iteration. The parameter of the distribution is estimated, and the topic ratio estimation unit is a hyperparameter representing the distribution of the topic ratio updated by the topic estimation unit, and the calculation result of the topic for each selection behavior data obtained for each iteration In Brute according posterior probability of the topic proportions, for each individual, estimates the topic proportions.

また、第１の発明に係る推定装置及び第２の発明に係る推定方法において、前記トピックを条件とした選択タイミングの確率密度分布は、前記選択タイミングが前記選択内容とが独立に生成されるように定められるようにしてもよい。 Further, in the estimation device according to the first invention and the estimation method according to the second invention, the probability density distribution of the selection timing with the topic as a condition is such that the selection timing is generated independently of the selection content. It may be determined as follows.

また、第１の発明に係る推定装置及び第２の発明に係る推定方法において、前記トピックを条件とした選択タイミングの確率密度分布は、前記選択タイミングが直前の前記選択内容に依存して生成されるように定められるようにしてもよい。 In the estimation apparatus according to the first invention and the estimation method according to the second invention, the probability density distribution of the selection timing on the condition of the topic is generated depending on the selection content immediately before the selection timing. It may be determined to be as follows.

また、第１の発明に係る推定装置及び第２の発明に係る推定方法において、前記トピックを条件とした選択タイミングの確率密度分布は、前記選択タイミングが直後の前記選択内容に依存して生成されるように定められるようにしてもよい。 Further, in the estimation device according to the first invention and the estimation method according to the second invention, the probability density distribution of the selection timing based on the topic is generated depending on the selection content immediately after the selection timing. It may be determined to be as follows.

また、第３の発明に係る予測装置は、予測対象の個体の選択行動を予測する予測装置であって、前記予測対象の個体に対して予め求められた、各トピックに対する重みを表すトピック比率と、予め求められた前記トピックにおける、前記個体の選択行動の選択内容及び前記個体の前回の選択行動からの間隔である選択タイミングを特徴付けるトピック分布のパラメータとに基づいて、各トピックに対する、前記トピックを条件とした選択内容の確率を表す条件付き多項分布、及び前記トピックを条件とした、前記選択タイミングの確率密度分布を用いて表される、各トピックに対する、前記トピックを条件とした、前記個体の選択行動の選択内容、及び前記選択タイミングの同時確率分布に従って、前記予測対象の個体についての、前記選択内容及び前記選択タイミングの少なくとも一方を予測する予測部を含んで構成されている。 A prediction device according to a third aspect of the invention is a prediction device that predicts a selection behavior of an individual to be predicted, a topic ratio that represents a weight for each topic, which is obtained in advance for the individual to be predicted. The topic for each topic based on a topic distribution parameter that characterizes the selection content of the individual's selection behavior and a selection timing that is an interval from the individual's previous selection behavior in the topic determined in advance. A conditional multinomial distribution that represents the probability of selection content as a condition, and a probability density distribution at the selection timing that is conditional on the topic. The selection of the individual to be predicted according to the selection content of the selection action and the simultaneous probability distribution of the selection timing It includes prediction unit for predicting at least one of volume and the selected timing is constructed.

第４の発明に係る予測方法は、予測対象の個体の選択行動を予測する予測方法であって、予測部が、前記予測対象の個体に対して予め求められた、各トピックに対する重みを表すトピック比率と、予め求められた前記トピックにおける、前記個体の選択行動の選択内容及び前記個体の前回の選択行動からの間隔である選択タイミングを特徴付けるトピック分布のパラメータとに基づいて、各トピックに対する、前記トピックを条件とした選択内容の確率を表す条件付き多項分布、及び前記トピックを条件とした、前記選択タイミングの確率密度分布を用いて表される、各トピックに対する、前記トピックを条件とした、前記個体の選択行動の選択内容、及び前記選択タイミングの同時確率分布に従って、前記予測対象の個体についての、前記選択内容及び前記選択タイミングの少なくとも一方を予測する。 A prediction method according to a fourth aspect of the invention is a prediction method for predicting a selection behavior of an individual to be predicted, in which a prediction unit represents a weight for each topic, which is obtained in advance for the individual to be predicted. For each topic, based on the ratio and the topic distribution parameters characterizing the selection timing of the individual's selection behavior and the interval from the previous selection behavior of the individual in the topic determined in advance A conditional multinomial distribution representing the probability of selection content subject to a topic, and a probability density distribution of the selection timing, conditional on the topic, for each topic, subject to the topic, According to the selection content of the selection action of the individual, and the simultaneous probability distribution of the selection timing, for the individual to be predicted, Predicting at least one of-option content and the selected timing.

第５の発明に係るプログラムは、コンピュータを、上記推定装置、又は上記予測装置を構成する各部として機能させるためのプログラム。 A program according to a fifth aspect of the invention is a program for causing a computer to function as each unit constituting the estimation device or the prediction device.

本発明の推定装置、方法、及びプログラムによれば、各選択行動データについて、各トピックに対する、トピックを条件とした選択内容の確率を表す条件付き多項分布、及びトピックを条件とした選択タイミングの確率密度分布を用いて表される、選択内容及び選択タイミングの同時確率分布に従って、選択行動データが所属するトピックを計算し、各選択行動データについてのトピックの計算結果に基づいて、トピック比率の分布を表すハイパーパラメータとトピック分布のパラメータの分布を表すハイパーパラメータとを更新することを繰り返し、更新されたトピック分布のパラメータの分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについてのトピックの計算結果に基づく、トピック分布のパラメータの事後確率に従って、トピック分布のパラメータを推定し、更新されたトピック比率の分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについてのトピックの計算結果に基づく、トピック比率の事後確率に従って、各個体について、トピック比率を推定することにより、個体の選択内容及び選択タイミングを精度よく予測するためのパラメータを推定することができる、という効果が得られる。 According to the estimation apparatus, method, and program of the present invention, for each selected action data, a conditional multinomial distribution representing the probability of selection content on the topic for each topic, and the probability of selection timing on the topic The topic to which the selected action data belongs is calculated according to the simultaneous probability distribution of the selection contents and the selection timing represented by the density distribution, and the distribution of the topic ratio is calculated based on the calculation result of the topic for each selected action data. The hyperparameter representing the distribution of the topic parameter and the hyperparameter representing the distribution of the parameter of the topic distribution, and the hyperparameter representing the parameter distribution of the updated topic distribution and each selection behavior data obtained for each iteration. Topic distribution parameters based on topic calculation results Estimating topic distribution parameters according to posterior probabilities, posterior probabilities of topic ratios based on hyperparameters representing updated topic ratio distributions, and topic calculation results for each selected behavior data obtained for each iteration Thus, by estimating the topic ratio for each individual, it is possible to estimate the parameters for accurately predicting the selection content and selection timing of the individual.

また、本発明の予測装置、方法、及びプログラムによれば、予測対象の個体に対して予め求められた、各トピックに対する重みを表すトピック比率とトピック分布のパラメータとに基づいて、各トピックに対する、トピックを条件とした選択内容の確率を表す条件付き多項分布、及びトピックを条件とした、選択タイミングの確率密度分布を用いて表される、各トピックに対する、トピックを条件とした、個体の選択行動の選択内容及び選択タイミングの同時確率分布に従って、予測対象の個体についての、選択内容及び選択タイミングの少なくとも一方を予測することにより、個体の選択内容及び選択タイミングを精度よく予測することができる、という効果が得られる。 Further, according to the prediction apparatus, method, and program of the present invention, based on the topic ratio and the topic distribution parameter representing the weight for each topic, which are obtained in advance for the individual to be predicted, for each topic, Individual selection behavior for each topic, expressed using a conditional multinomial distribution that represents the probability of selection content on the condition of the topic, and a probability density distribution of the selection timing on the topic. According to the simultaneous probability distribution of the selection content and the selection timing, it is possible to accurately predict the selection content and the selection timing of the individual by predicting at least one of the selection content and the selection timing for the individual to be predicted. An effect is obtained.

本発明の実施の形態の概要を説明するための説明図である。It is explanatory drawing for demonstrating the outline | summary of embodiment of this invention. 本発明の実施の形態に係る推定装置の構成を示すブロック図である。It is a block diagram which shows the structure of the estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態のモデルタイプＩのグラフィカルモデルを表す図である。It is a figure showing the graphical model of the model type I of embodiment of this invention. 本発明の実施の形態のモデルタイプＩＩのグラフィカルモデルを表す図である。It is a figure showing the graphical model of model type II of embodiment of this invention. 本発明の実施の形態のモデルタイプＩＩＩのグラフィカルモデルを表す図である。It is a figure showing the graphical model of model type III of embodiment of this invention. 本発明の実施の形態のモデルタイプＩＶのグラフィカルモデルを表す図である。It is a figure showing the graphical model of model type IV of embodiment of this invention. 本発明の実施の形態に係る予測装置の構成を示すブロック図である。It is a block diagram which shows the structure of the prediction apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る推定装置における推定処理ルーチンを示すフローチャートである。It is a flowchart which shows the estimation process routine in the estimation apparatus which concerns on embodiment of this invention. 本発明の実施の形態に係る予測装置における予測処理ルーチンを示すフローチャートである。It is a flowchart which shows the prediction process routine in the prediction apparatus which concerns on embodiment of this invention.

以下、図面を参照して本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜本発明の実施の形態に係る概要＞ <Outline according to Embodiment of the Present Invention>

まず、本発明の実施の形態における概要を説明する。 First, an outline of the embodiment of the present invention will be described.

本発明の実施の形態では、トピックモデルにおいて、個体の選択行動の内容とタイミングとの二つの情報によって行動パターンの特徴を定義する。各個体の行動はその特徴的な行動パターン、すなわち「何を選択するか」と「いつ選択するか」の組み合わせで表現され、予測が行われる。 In the embodiment of the present invention, in the topic model, the feature of the behavior pattern is defined by two pieces of information, that is, the content and timing of the selected behavior of the individual. The behavior of each individual is expressed by a characteristic behavior pattern, that is, a combination of “what to select” and “when to select”, and prediction is performed.

本実施の形態では、人間の行動パターンを数理モデルによって記述し、各ユーザの行動パターンを各ユーザの行動履歴データから学習し、その学習された数理モデルに基づき各ユーザの将来の行動を予測する。例えば、本実施の形態をテーマパークの入園客に対し適用した場合、各入園客の行動パターンを行動履歴データから学習し、各入園客の入園後の行動を予測する。 In this embodiment, a human behavior pattern is described by a mathematical model, each user's behavior pattern is learned from each user's behavior history data, and the future behavior of each user is predicted based on the learned mathematical model. . For example, when this embodiment is applied to a theme park visitor, the behavior pattern of each visitor is learned from the behavior history data, and the behavior of each visitor after entering the park is predicted.

そして、得られた予測結果に基づき、各ユーザの行動パターン（趣味・嗜好含む）に適したナビゲーションを行う。テーマパークでは、予測結果に基づき、各入園客の行動パターン（趣味・嗜好含む）に適したナビゲーションが、スマホなどを通じて行われる。 Then, based on the obtained prediction result, navigation suitable for each user's behavior pattern (including hobbies and preferences) is performed. In the theme park, navigation suitable for each visitor's behavior pattern (including hobbies and preferences) is performed through a smartphone or the like based on the prediction result.

各ユーザの行動履歴データは（多くの場合）少ないため、各ユーザのモデルを正しく学習させることができない。例えば、テーマパークの各入園客の行動履歴データは少ないか、あるいは（会員としてデータが蓄積されてない場合）ほぼないため、各入園客の行動パターンを正しく学習できず、予測することができない。 Since each user's action history data is small (in many cases), the model of each user cannot be learned correctly. For example, since there is little or no action history data for each visitor in the theme park (when no data is stored as a member), the action pattern of each visitor cannot be learned correctly and cannot be predicted.

そこで、本実施の形態では、トピックモデルを用いて、各ユーザの行動パターンを低次元で表現する。ユーザのパターンにある種の共通性を見出すことで、データの不足を補うことができる（原理はＮＭＴＦ：Nonnegative Multiple Tensor Factorizationとほぼ同じ）。例えばテーマパークの場合、トピックモデルを用いて、入園客に見られる１０種類の行動パターンを抽出する。その後各入園客がどの行動パターンに当てはまるかを特定することで、各入園客の行動パターンを予測し、適切なナビゲーションを行うことができる。 Therefore, in the present embodiment, the action pattern of each user is expressed in a low dimension using a topic model. Finding some kind of commonality in the user's pattern can compensate for the lack of data (the principle is almost the same as NMTF: Nonnegative Multiple Tensor Factorization). For example, in the case of a theme park, 10 types of behavior patterns that are seen by visitors are extracted using a topic model. Then, by identifying which behavior pattern each visitor applies to, the behavior pattern of each visitor can be predicted and appropriate navigation can be performed.

また、従来技術では、これまで個体の行動のタイミングは考慮できなかった。そこで、本実施の形態では、ユーザの選択行動の「内容」と「タイミング」を同時に予測し、その予測結果に基づきナビゲーションの内容とタイミングを決める。また、本実施の形態では、トピックモデルを用いてユーザを低次元表現する。 In the prior art, the timing of an individual's action has not been considered so far. Therefore, in this embodiment, the “content” and “timing” of the user's selection action are predicted simultaneously, and the navigation content and timing are determined based on the prediction result. In the present embodiment, the user is expressed in a low-dimensional manner using the topic model.

従来のトピックモデルでは、多項分布によってユーザの次の行動内容を予測できるが、その行動が発生するタイミング（＝直前の行動発生時刻からの経過時間）を予測できない。そのため、（ｉ）情報を発信するタイミングを決めることができない、（ｉｉ）タイミングに応じて発信する情報の内容を変更することができない。 In the conventional topic model, the user's next action content can be predicted by the multinomial distribution, but the timing at which the action occurs (= elapsed time from the immediately preceding action occurrence time) cannot be predicted. Therefore, (i) the timing for transmitting information cannot be determined, and (ii) the content of information transmitted according to the timing cannot be changed.

そこで、本実施の形態では、ユーザの直前の行動発生時刻からの経過時間を確率密度関数で表現する。図１に、本実施の形態を説明するための図を示す。本実施の形態では、例えば、図１に示すように、ユーザの選択内容（Ａ，Ｂ，Ｃ）の遷移行列及び選択内容の時間変化に基づき、ユーザの行動をモデル化する。また、確率密度関数には周辺化可能な指数分布族を用いることで、従来のトピックモデルと同等の実装の容易さ及び計算コストが実現される。 Therefore, in the present embodiment, the elapsed time from the action occurrence time immediately before the user is expressed by a probability density function. FIG. 1 is a diagram for explaining the present embodiment. In the present embodiment, for example, as shown in FIG. 1, the user's behavior is modeled based on the transition matrix of the user's selection contents (A, B, C) and the change over time of the selection contents. Further, by using a marginal exponential distribution family for the probability density function, the ease of implementation and the calculation cost equivalent to those of the conventional topic model are realized.

より具体的には、個体が選択肢の中からいずれか一つを選ぶ過程を条件付きの多項分布で（選択行動が直前の選択行動の影響を受けない場合は単に多項分布）、その選択行動が発生する際の、直前の選択行動からの経過時間を確率密度分布でモデル化し、条件付き多項分布と確率密度分布の同時分布のパラメータによって行動パターンの特徴を表現する。各個体の選択行動、すなわち何をいつ選択するかは有限個の同時分布の線形和で表現され、パラメータ値及び各個体の線形和の重み係数は個体群の行動履歴データから推定される。本実施の形態により、従来技術における三つの課題が解決される。 More specifically, the process in which an individual chooses one of the options is a conditional multinomial distribution (or simply a multinomial distribution if the selection action is not affected by the previous selection action), and the selection action is The elapsed time from the immediately preceding selected action at the time of occurrence is modeled by a probability density distribution, and the characteristics of the action pattern are expressed by the parameters of the simultaneous distribution of the conditional multinomial distribution and the probability density distribution. The selection behavior of each individual, that is, when to select what is expressed by a linear sum of a finite number of simultaneous distributions, and the parameter value and the weight coefficient of the linear sum of each individual are estimated from the behavior history data of the individual group. The present embodiment solves three problems in the prior art.

本発明の実施の形態は、行動の選択肢が少なくとも二つ以上与えられたとき、個体が将来選択する行動を予測する。本実施の形態によれば、個体が将来どの行動を起こすかを予測することができ、またその予測結果を元に、個体に対し推薦する内容、あるいは個体をナビゲートする内容を適切に選ぶことができる。また、各個体がどのような選択を取る傾向にあるかを分析し、個体群を分類することができる。個体、行動の種類及び選択肢の例を表１に示す。また、個体に対する推薦／ナビゲートの例を表２に示す。ただし、個体、行動の種類及び選択肢は以下に限らない。 The embodiment of the present invention predicts the behavior that an individual will select in the future when at least two or more behavior options are given. According to the present embodiment, it is possible to predict what action an individual will take in the future, and based on the prediction result, appropriately select content recommended for the individual or content for navigating the individual. Can do. Moreover, it is possible to classify individual groups by analyzing how individual individuals tend to select. Table 1 shows examples of individuals, action types, and options. Table 2 shows an example of recommendation / navigation for an individual. However, individuals, types of actions and options are not limited to the following.

本発明の実施の形態では、各個体の選択行動データの集合から各パラメータを推定する推定装置と、予測装置とに本発明を適用させた場合を例に説明する。具体的には、まず推定装置において、特徴的な行動パターン及び各個体の行動傾向を表すパラメータがデータから推定され、各個体の選択内容と選択タイミングの同時確率分布が決定される。次の予測装置においては、推定された個体ごとの同時確率分布に基づき将来観測される選択行動が予測される。各装置の詳細を以下で説明する。 In the embodiment of the present invention, an example will be described in which the present invention is applied to an estimation device that estimates each parameter from a set of selection behavior data of each individual and a prediction device. Specifically, first, in the estimation device, parameters representing characteristic behavior patterns and behavior trends of each individual are estimated from the data, and the simultaneous probability distribution of the selection contents and selection timing of each individual is determined. In the next prediction device, a selection behavior to be observed in the future is predicted based on the estimated joint probability distribution for each individual. Details of each device will be described below.

＜本発明の実施の形態に係る推定装置の構成＞ <Configuration of estimation apparatus according to embodiment of the present invention>

次に、本発明の実施の形態に係る推定装置の構成について説明する。図２に示すように、本発明の実施の形態に係る推定装置１００は、ＣＰＵと、ＲＡＭと、後述する推定処理ルーチンを実行するためのプログラムや各種データを記憶したＲＯＭと、を含むコンピュータで構成することが出来る。この推定装置１００は、機能的には図２に示すように入力部１０と、演算部２０と、出力部４０とを備えている。 Next, the configuration of the estimation apparatus according to the embodiment of the present invention will be described. As shown in FIG. 2, the estimation apparatus 100 according to the embodiment of the present invention is a computer that includes a CPU, a RAM, and a ROM that stores a program and various data for executing an estimation processing routine described later. Can be configured. Functionally, the estimation apparatus 100 includes an input unit 10, a calculation unit 20, and an output unit 40 as shown in FIG.

入力部１０は、複数の個体で観測された、当該個体の選択行動の選択内容、及び前回の選択行動からの間隔である選択タイミングを表す選択行動データの集合の入力を受け付ける。 The input unit 10 accepts input of a selection content of a selection action of the individual observed by a plurality of individuals and a set of selection action data representing a selection timing that is an interval from the previous selection action.

選択行動データは、解析対象である個体から生成される。また、選択行動データは、個体ＩＤ（以下、ｕと表記する）、個体の総数（以下、Ｕと表記する）、個体ｕで観測された選択行動の回数（以下、Ｎ_ｕと表記する）、全個体で観測された選択行動データの総数から個体数を引いたもの（以下、Ｎと表記する）、全個体で観測された選択内容データの集合（｛ｙ^ｊ｝≡（ｙ^１，ｙ^２，…ｙ^Ｎ）と表記）、全個体で観測された選択タイミングデータの集合（｛ｔ^ｊ｝≡（ｔ^１，ｔ^２，…ｔ^Ｎ）と表記）、及び各選択行動データを生成した個体ＩＤの集合（｛ｕ_ｊ｝≡（ｕ_１，ｕ_２，…ｕ_Ｎ）と表記）を含む。 The selection behavior data is generated from an individual to be analyzed. The selection behavior data includes an individual ID (hereinafter referred to as u), a total number of individuals (hereinafter referred to as U), the number of selection behaviors observed in the individual u (hereinafter referred to as _Nu ), Subtracting the number of individuals from the total number of selection behavior data observed for all individuals (hereinafter referred to as N), a set of selection content data observed for all individuals ({y ^j } ≡ (y ¹ , y ² ,..., Y ^N )), a set of selection timing data observed in all individuals (represented as {t ^j } ≡ (t ¹ , t ² ,... T ^N )), and the individual that generated each selection action data A set of IDs (represented as {u _j } ≡ (u ₁ , u ₂ ,... U _N )).

ただし、選択内容データｙ^ｊは直前と直後の選択内容のペア（ｙ_１ ^ｊ，ｙ_２ ^ｊ）からなり、選択タイミングデータｔ^ｊは直前と直後の選択時刻の間隔を表すものとする。また、選択肢は１から始まる自然数で表現し（選択肢の数はＮ_ｃと表記）、選択内容のペアを（直前の購買，直後の購買）で表記するものとする。例えば、消費者による購買選択行動を考え、商品１，２，３，１がそれぞれ時刻ｓ_０，ｓ_１，ｓ_２，ｓ_３で購入されたとき、ｙ^１＝（１，２），ｙ^２＝（２，３），ｙ^３＝（３，１）であり、ｔ^１＝ｓ_１−ｓ_０，ｔ^２＝ｓ_２−ｓ_１，ｔ^３＝ｓ_３−ｓ_２である。 However, the selection content data y ^j is composed of a pair (y ₁ ^j , y ₂ ^j ) of the selection content immediately before and immediately after, and the selection timing data t ^j represents the interval between the selection time immediately before and immediately after. In addition, the options are expressed by natural numbers starting from 1 (the number of options is expressed as _Nc ), and a pair of selection contents is expressed as (immediate purchase, immediate purchase). For example, considering purchase selection behavior by a consumer, when products ₁ , ₂ , ₃ , and ₁ are purchased at times s ₀ , s ₁ , s ₂ , and s ₃ , respectively, y ¹ = (1,2,), y ² = ^(2,3), a y 3 = (3, ^1), a _{_{^{_{t 1 = s 1 -s 0,}}}} t 2 = s 2 -s 1, t 3 = s 3 -s 2.

また、入力部１０は、個体の特徴的な行動パターンを表すトピックの数Ｋを入力として受け付ける。以下、特徴的な行動パターンをトピックと称する。トピックは条件付き多項分布と確率密度分布の同時分布（以下、トピック分布と表記）で表現され、トピック毎に同時分布のパラメータが異なる。ただし、Ｋ個のトピックをそれぞれｋ＝１，２，３，…，Ｋで表記するものとする。 In addition, the input unit 10 receives the number K of topics representing a characteristic behavior pattern of an individual as an input. Hereinafter, characteristic behavior patterns are referred to as topics. A topic is expressed by a simultaneous distribution of conditional multinomial distribution and probability density distribution (hereinafter referred to as topic distribution), and the parameters of the simultaneous distribution differ for each topic. Here, K topics are denoted by k = 1, 2, 3,..., K, respectively.

また、入力部１０は、モデルタイプと選択タイミング分布との入力を受け付ける。後述するように、モデルタイプには選択タイミングが選択内容とは独立に決まるもの（タイプＩ）、直前の選択内容にのみ依存するもの（タイプＩＩ）、直後の選択内容にのみ依存するもの（タイプＩＩＩ）、直前と直後の両方に依存するもの（タイプＩＶ）の四種類存在し、選択タイミング分布は任意の指数分布族の中から指定される。指数分布族の例としては、例えば、正規分布、対数正規分布、ガンマ分布などである。 Further, the input unit 10 receives input of the model type and the selection timing distribution. As will be described later, the model type has a selection timing determined independently of the selection content (type I), depends only on the immediately preceding selection content (type II), and depends only on the immediately following selection content (type) III) There are four types (type IV) that depend on both immediately before and after (type IV), and the selection timing distribution is specified from an arbitrary exponential distribution family. Examples of the exponential distribution family include a normal distribution, a log normal distribution, and a gamma distribution.

ここで、本実施の形態で用いる生成モデルの原理について説明する。 Here, the principle of the generation model used in this embodiment will be described.

まず、以下のような生成モデルに基づき、各個体から選択内容と選択タイミングのデータが生成されると仮定する。ｊ番目のデータ（ｙ^ｊ，ｔ^ｊ）を生成した個体ｕ_ｊは、Ｋ個のトピック分布の線形和で表現される確率分布ｐ（ｙ，ｔ｜｛φ^ｋ｝，｛Υ^ｋ｝）からデータ（ｙ^ｊ，ｔ^ｊ）を生成する。 First, it is assumed that selection content and selection timing data are generated from each individual based on the following generation model. The individual u _{j that} has generated the j-th data (y ^j , t ^j ) is derived from the probability distribution p (y, t | {φ ^k }, {Υ ^k }) expressed by a linear sum of K topic distributions. Data (y ^j , t ^j ) is generated.

ただし、ｚ_ｊはｊ番目のデータが所属するトピック（１〜Ｋ）を表す潜在変数であり、 However, z _j is a latent variable representing the topic (1-K) to which the j-th data belongs,

は個体ｕ_ｊのトピックｋに対する重み（トピック比率と表記）、ｐ（ｙ^ｊ，ｔ^ｊ｜ｚ_ｊ＝ｋ，φ^ｋ，Υ^ｋ）はトピックｋのトピック分布、（｛φ^ｋ｝，｛Υ^ｋ｝）はトピックを特徴付けるトピック分布のパラメータ（テンソルのパラメータ）である。ここで、トピック分布は選択内容を生成する条件付き多項分布と、選択タイミングを生成する確率密度分布の掛け算で表現されるが、本実施の形態では、選択タイミングを生成する確率密度分布の定義の仕方について、以下の四通りがある。 Is the weight of the individual u _{j to} the topic k (represented as topic ratio), p (y ^j , t ^j | z _j = k, φ ^k , ^{ｋ k} ) is the topic distribution of topic k, ({φ ^k }, {Υ ^k }) is a topic distribution parameter (tensor parameter) that characterizes the topic. Here, the topic distribution is expressed by multiplication of the conditional multinomial distribution that generates the selection contents and the probability density distribution that generates the selection timing. In this embodiment, the definition of the probability density distribution that generates the selection timing is There are four ways to do this:

（１）タイプＩ：選択タイミングが選択内容とは独立に生成される。具体的には、以下の式（２）に示すように、トピックを条件とした選択タイミングの確率密度分布ｐ（ｔ^ｊ｜ｚ_ｊ＝ｋ，Υ^ｋ）は、選択タイミングｔ^ｊが選択内容ｙ^ｊとが独立に生成されるように定められる。 (1) Type I: Selection timing is generated independently of selection contents. Specifically, as shown in the following equation (2), the probability density distribution p (t ^j | z _j = k, Υ ^k ) of the selection timing on the condition of the topic indicates that the selection timing t ^j is the selection content y. ^j is generated independently.

（２）タイプＩＩ：選択タイミングが直前の選択内容ｙ_２ ^ｊ−１にのみ依存する。具体的には、以下の式（３）に示すように、トピックを条件とした選択タイミングの確率密度分布ｐ（ｔ^ｊ｜ｙ_２ ^ｊ−１，ｚ_ｊ＝ｋ，Υ^ｋ）は、選択タイミングｔ^ｊが直前の選択内容ｙ_２ ^ｊ−１に依存して生成されるように定められる。 (2) Type II: The selection timing depends only on the immediately preceding selection content y ₂ ^j−1 . Specifically, as shown in the following formula (3), the probability density distribution p (t ^j | y ₂ ^j−1 , z _j = k, Υ ^k ) of the selection timing on the condition of the topic is the selection timing. It is determined that t ^j is generated depending on the immediately preceding selection content y ₂ ^j−1 .

（３）タイプＩＩＩ：選択タイミングが直後の選択内容ｙ_２ ^ｊにのみ依存する。具体的には、以下の式（４）に示すように、トピックを条件とした選択タイミングの確率密度分布ｐ（ｔ^ｊ｜ｙ_２ ^ｊ，ｚ_ｊ＝ｋ，Υ^ｋ）は、選択タイミングｔ^ｊが直後の選択内容ｙ_２ ^ｊに依存して生成されるように定められる。 (3) Type III: The selection timing depends only on the immediately selected content y ₂ ^j . Specifically, as shown in the following formula (4), the probability density distribution p (t ^j | y ₂ ^j , z _j = k, ^{ｋ k} ) of the selection timing on the condition of the topic is the selection timing t ^j Is generated depending on the selection content y ₂ ^j immediately after.

（４）タイプＩＶ：選択タイミングが直前及び直後両方の選択内容（ｙ_２ ^ｊ−１，ｙ_２ ^ｊ）に依存する。具体的には、以下の式（５）に示すように、トピックを条件とした選択タイミングの確率密度分布ｐ（ｔ^ｊ｜ｙ_２ ^ｊ−１，ｙ_２ ^ｊ，ｚ_ｊ＝ｋ，Υ^ｋ）は、選択タイミングｔ^ｊが直前及び直後の選択内容（ｙ_２ ^ｊ−１，ｙ_２ ^ｊ）に依存して生成されるように定められる。 (4) Type IV: The selection timing depends on the selection contents (y ₂ ^j−1 , y ₂ ^j ) both immediately before and after. Specifically, as shown in the following formula (5), the probability density distribution p (t ^j | y ₂ ^j−1 , y ₂ ^j , z _j = k, Υ ^k ) of the selection timing with the topic as a condition. Is determined such that the selection timing t ^j is generated depending on the selection contents (y ₂ ^j−1 , y ₂ ^j ) immediately before and after.

ただし、ｐ（ｙ^ｊ｜ｚ_ｊ＝ｋ，φ^ｋ）は条件付き多項分布でφ^ｋは遷移行列 Where p (y ^j | z _j = k, φ ^k ) is a conditional multinomial distribution and φ ^k is a transition matrix

であり、ｐ（ｔ^ｊ｜ｚ_ｊ＝ｋ，Υ^ｋ）、ｐ（ｔ^ｊ｜ｙ_２ ^ｊ−１，ｚ_ｊ＝ｋ，Υ^ｋ）、ｐ（ｔ^ｊ｜ｙ_２ ^ｊ，ｚ_ｊ＝ｋ，Υ^ｋ）、又はｐ（ｔ^ｊ｜ｙ_２ ^ｊ−１，ｙ_２ ^ｊ，ｚ_ｊ＝ｋ，Υ^ｋ）は任意の指数分布族である。入力部１０によって受け付けたモデルタイプと指数分布族とに応じて、どのモデルタイプ及び指数分布族が使用されるかが決定される。 P (t ^j | z _j = k, Υ ^k ), p (t ^j | y ₂ ^j−1 , z _j = k, Υ ^k ), p (t ^j | y ₂ ^j , z _j = k , ^{Ｋ k} ) or p (t ^j | y ₂ ^j−1 , y ₂ ^j , z _j = k, Υ ^k ) is an arbitrary exponential family. Which model type and exponential distribution family is used is determined according to the model type and exponential distribution family received by the input unit 10.

次に、上記式（１）〜（５）に現れる、トピック比率θとトピックを特徴付けるテンソルパラメータφ^ｋ，Υ^ｋが、それぞれ共役な事前分布ｐ（θ｜α），ｐ（φ^ｋ｜β），ｐ（Υ^ｋ｜γ）から生成されると仮定し、以下の式（７）に示すように、データ点ごとの所属トピックｚ≡（ｚ_１，ｚ_２，…ｚ_Ｎ）及び観測される選択内容ｙ≡（ｙ^１，ｙ^２，…ｙ^Ｎ）と選択タイミングｔ≡（ｔ^１，ｔ^２，…ｔ^Ｎ）の同時確率を得る。 Next, the topic ratio θ and the tensor parameters φ ^k and Υ ^k characterizing the topic appearing in the above formulas (1) to (5) are conjugate prior distributions p (θ | α) and p (φ ^k | β), respectively. , P (Υ ^k | γ), and the assigned topic z≡ (z ₁ , z ₂ ,... Z _N ) for each data point and observed as shown in the following equation (7): The simultaneous probability of the selection content y≡ (y ¹ , y ² ,... Y ^N ) and the selection timing t≡ (t ¹ , t ² ,... T ^N ) is obtained.

ただし、上記式（７）におけるα，β，γはデータから推定されるハイパーパラメータである。参考のため、四つのモデルタイプのそれぞれのグラフィカルモデルを、図３〜図６に示す。なお、図３〜図６における、θ_ｕは個体ｕのトピック比率、インデックスｉは個体ｕの選択行動データの中でｉ番目に生成されたデータ点を表す。 However, α, β, and γ in the above equation (7) are hyperparameters estimated from the data. For reference, the graphical models of each of the four model types are shown in FIGS. 3 to 6, θ _u represents the topic ratio of the individual u, and the index i represents the i-th generated data point in the selection behavior data of the individual u.

上記（１）〜（５）式によって本実施の形態における生成モデルが計算できる。以上が本実施の形態における生成モデルの原理である。 The generation model in the present embodiment can be calculated by the above equations (1) to (5). The above is the principle of the generation model in the present embodiment.

演算部２０は、選択行動データ記憶部２２と、トピック推定部２４と、トピック分布パラメータ推定部２６と、トピック比率推定部２８と、パラメータ記憶部３０と、トピック比率記憶部３２とを含んで構成されている。 The calculation unit 20 includes a selection behavior data storage unit 22, a topic estimation unit 24, a topic distribution parameter estimation unit 26, a topic ratio estimation unit 28, a parameter storage unit 30, and a topic ratio storage unit 32. Has been.

選択行動データ記憶部２２には、入力部１０によって受け付けた選択行動データの集合が格納される。 The selection behavior data storage unit 22 stores a set of selection behavior data received by the input unit 10.

トピック推定部２４は、選択行動データ記憶部２２に記憶された選択行動データの集合に含まれる各選択行動データについて、各トピックに対する、トピックｋを条件とした選択内容の確率を表す条件付き多項分布ｐ（ｙ^ｊ｜ｚ_ｊ＝ｋ，φ^ｋ）、及びトピックｋを条件とした選択タイミングの確率密度分布を用いて表される、各トピックに対する、トピックｋを条件とした、選択内容ｙ^ｊ及び選択タイミングｔ^ｊの同時確率分布ｐ（ｙ^ｊ，ｔ^ｊ｜ｚ_ｊ＝ｋ，φ^ｋ）に従って、選択行動データが所属するトピックｋを計算する。また、トピック推定部２４は、各選択行動データについてのトピックｋの計算結果に基づいて、各個体ｕ及び各トピックｋに対する重みを表すトピック比率θ_ｕの分布を表すハイパーパラメータαと、トピックｋにおける、選択内容及び選択タイミングを特徴付けるトピック分布のパラメータφ^ｋ，Υ^ｋの分布を表すハイパーパラメータβ，γを更新する。トピック推定部２４は、上記のトピックの計算及びハイパーパラメータの更新を繰り返し行う。 The topic estimation unit 24 is a conditional multinomial distribution that represents the probability of selection content for each topic with respect to the topic k for each selection behavior data included in the set of selection behavior data stored in the selection behavior data storage unit 22. p (y ^j | z _j = k, φ ^k ) and a selection density y ^j and a topic k as a condition for each topic, expressed using a probability density distribution at a selection timing with the topic k as a condition. The topic k to which the selected action data belongs is calculated according to the joint probability distribution p (y ^j , t ^j | z _j = k, φ ^k ) at the selection timing t ^j . Further, the topic estimation unit 24, based on the calculation result of the topic k for each selected action data, the hyper parameter α representing the distribution of the topic ratio θ _u representing the weight for each individual u and each topic k, and the topic k The hyperparameters β and γ representing the distribution of topic distribution parameters φ ^k and Υ ^k that characterize the selection contents and selection timing are updated. The topic estimation unit 24 repeatedly performs the above topic calculation and hyperparameter update.

トピック推定部２４で行われる具体的な計算方法について以下に詳述する。トピック推定部２４は、選択行動データ毎の所属トピックｚ≡（ｚ_１，ｚ_２，…ｚ_Ｎ）を事後確率から推定する。本実施の形態では、トピック推定部２４による所属トピックの推定は、以下の式（８）に示すギブスサンプリング式に基づき、ｚ≡（ｚ_１，ｚ_２，…ｚ_Ｎ）のＰ回分のサンプル（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））を生成することで実行する。 A specific calculation method performed by the topic estimation unit 24 will be described in detail below. The topic estimation unit 24 estimates the belonging topic z≡ (z ₁ , z ₂ ,... Z _N ) for each selected action data from the posterior probability. In the present embodiment, the topic estimation unit 24 estimates the affiliated topic based on the Gibbs sampling formula shown in the following formula (8), and P samples (z≡ (z ₁ , z ₂ ,... Z _N )) ( z ⁽¹⁾ , z ⁽²⁾ ,..., z ^(P) ).

ただし、上記式（８）に現れる−ｊは、総数Ｎの全データ集合からｊ番目のデータを除いた部分集合を表す。また、トピック推定部２４によって計算された、各選択行動データについてのトピックｋの計算結果（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））は、後述するトピック分布パラメータ推定部２６及びトピック比率推定部２８における処理のために、メモリ（図示省略）に記憶される。 However, -j appearing in the above equation (8) represents a subset obtained by excluding the jth data from the total number N of data sets. The topic k calculation results (z ⁽¹⁾ , z ⁽²⁾ ,..., Z ^(P) ) calculated by the topic estimation unit 24 for each selected action data are the topic distribution parameter estimation unit 26 described later. And stored in a memory (not shown) for processing in the topic ratio estimation unit 28.

また、トピック推定部２４は、Ｐ回分のサンプルを生成するのと同時に、確率的ＥＭ法（例えば、非特許文献３を参照）に基づき、以下の式（９）に従って、未知のハイパーパラメータα，β，γの値をＰ回更新する。そして、トピック推定部２４は、Ｐ回更新後のパラメータ値α^（Ｐ），β^（Ｐ），γ^（Ｐ）をそれぞれの推定値とする。 Further, the topic estimation unit 24 generates P samples at the same time, and based on the probabilistic EM method (see, for example, Non-Patent Document 3), according to the following equation (9), the unknown hyperparameter α, Update the values of β and γ P times. Then, the topic estimation unit 24 sets the parameter values α ^(P) , β ^(P) , and γ ^(P) after updating P times as estimated values.

非特許文献３：C. Bishop.“Pattern Recognition and Machine Learning”, Springer, New York, 2006. Non-Patent Document 3: C. Bishop. “Pattern Recognition and Machine Learning”, Springer, New York, 2006.

トピック分布パラメータ推定部２６は、トピック推定部２４によって更新されたパラメータ｛φ^ｋ｝，｛Υ^ｋ｝の分布を表すハイパーパラメータβ，γ、及び繰り返し毎に得られた、各選択行動データについてのトピックの計算結果（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））に基づく、トピック分布のパラメータの事後確率に従って、トピック分布のパラメータ｛φ^ｋ｝，｛Υ^ｋ｝を推定する。 The topic distribution parameter estimation unit 26 performs hyperparameters β and γ representing the distribution of the parameters {φ ^k } and {Υ ^k } updated by the topic estimation unit 24, and each selection action data obtained for each iteration. Estimate topic distribution parameters {φ ^k }, {Υ ^k } according to the posterior probabilities of the topic distribution parameters based on the topic calculation results (z ⁽¹⁾ , z ⁽²⁾ ,..., Z ^(P) ). .

具体的には、トピック分布パラメータ推定部２６は、トピック推定部２４によって得られたβ，γ，（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））を用いて、以下の式（１０），（１１）に従って、トピック分布のパラメータ｛φ^ｋ｝，｛Υ^ｋ）を推定する。なお、本実施の形態では、トピック分布のパラメータの推定値として、事後確率の平均を採用する。 Specifically, the topic distribution parameter estimation unit 26 uses the following formula using β, γ, (z ⁽¹⁾ , z ⁽²⁾ ,..., Z ^(P) ) obtained by the topic estimation unit 24. According to (10) and (11), the topic distribution parameters {φ ^k }, {Υ ^k ) are estimated. In the present embodiment, the average of the posterior probabilities is adopted as the estimated value of the topic distribution parameter.

ただし、ｋ＝１，２，３，…，Ｋであり、事後確率はそれぞれ以下の式（１１）で与えられる。 However, k = 1, 2, 3,..., K, and the posterior probabilities are respectively given by the following formula (11).

トピック比率推定部２８は、トピック推定部２４によって更新されたトピック比率の分布を表すハイパーパラメータα、及び繰り返し毎に得られた、各選択行動データについてのトピックの計算結果（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））に基づく、トピック比率の事後確率に従って、各個体ｕについて、トピック比率θ_ｕを推定する。 The topic ratio estimation unit 28 calculates the hyperparameter α representing the distribution of the topic ratio updated by the topic estimation unit 24 and the topic calculation results (z ⁽¹⁾ , z ⁾ obtained for each selection behavior data obtained for each iteration. ^(2), ^..., based on the ^{z (P)),} according to the posterior probability of the topic proportions, for each individual u, to estimate the topic ratio θ _u.

具体的には、トピック比率推定部２８は、トピック推定部２４によって得られたハイパーパラメータα，トピックの計算結果（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））を用いて、以下の式（１２），（１３）に従って、個体ごと（ｕ＝１，２，…，Ｕ）のトピック比率θ_ｕを推定する。なお、本実施の形態では、トピック比率の推定値として、事後確率の平均を採用する。 Specifically, the topic ratio estimation unit 28 uses the hyperparameter α obtained by the topic estimation unit 24 and the topic calculation results (z ⁽¹⁾ , z ⁽²⁾ ,..., Z ^(P) ), The topic ratio θ _u for each individual (u = 1, 2,..., U) is estimated according to the following equations (12) and (13). In the present embodiment, the average of the posterior probabilities is adopted as the estimated value of the topic ratio.

ただし、事後確率は、以下の式（１３）で与えられる。 However, the posterior probability is given by the following equation (13).

パラメータ記憶部３０には、トピック分布パラメータ推定部２６によって推定されたトピック分布のパラメータの値が格納される。トピック分布のパラメータの例を表３に示す。なお、以下の表３に示す例は、モデルタイプがタイプＩＶであり、トピック数Ｋ＝２、選択肢数Ｎ_ｃ＝３、選択タイミング分布のパラメータ数が２である場合である。 The parameter storage unit 30 stores topic distribution parameter values estimated by the topic distribution parameter estimation unit 26. Table 3 shows examples of topic distribution parameters. The example shown in Table 3 below is a case where the model type is type IV, the number of topics K = 2, the number of options N _c = 3, and the number of parameters of the selection timing distribution is 2.

トピック比率記憶部３２には、トピック比率推定部２８によって推定された個体ごとのトピック比率の値が格納される。トピック比率の例を表４に示す。なお、以下の表４に示す例は、トピック数Ｋ＝２の場合である。 The topic ratio storage unit 32 stores the topic ratio value for each individual estimated by the topic ratio estimation unit 28. Table 4 shows an example of the topic ratio. The example shown in Table 4 below is a case where the number of topics K = 2.

出力部４０は、パラメータ記憶部３０に記憶されたトピック分布のパラメータと、トピック比率記憶部３２に記憶された個体毎のトピック比率とを結果として出力する。 The output unit 40 outputs the topic distribution parameters stored in the parameter storage unit 30 and the topic ratios for each individual stored in the topic ratio storage unit 32 as a result.

＜本発明の実施の形態に係る予測装置の構成＞ <Configuration of prediction apparatus according to embodiment of the present invention>

次に、本発明の実施の形態に係る予測装置の構成について説明する。図７に示すように、本発明の実施の形態に係る予測装置２００は、ＣＰＵと、ＲＡＭと、後述する予測処理ルーチンを実行するためのプログラムや各種データを記憶したＲＯＭと、を含むコンピュータで構成することが出来る。この予測装置２００は、機能的には図７に示すように入力部５０と、演算部６０と、出力部７０とを備えている。予測装置２００は、予測対象の個体の選択行動を予測する。 Next, the configuration of the prediction device according to the embodiment of the present invention will be described. As shown in FIG. 7, a prediction device 200 according to an embodiment of the present invention is a computer including a CPU, a RAM, and a ROM that stores a program for executing a prediction processing routine described later and various data. Can be configured. The prediction device 200 functionally includes an input unit 50, a calculation unit 60, and an output unit 70 as shown in FIG. The prediction device 200 predicts the selection behavior of the individual to be predicted.

入力部５０は、予測対象の個体に対して予め推定装置１００によって推定された、各トピックに対する重みを表すトピック比率と、予め求められたトピックにおける、選択内容及び選択タイミングを特徴付けるトピック分布のパラメータとを受け付ける。 The input unit 50 includes a topic ratio representing a weight for each topic, which is estimated in advance by the estimation device 100 with respect to an individual to be predicted, and a topic distribution parameter that characterizes selection content and selection timing in a previously obtained topic. Accept.

演算部６０は、予測部６２を含んで構成されている。 The calculation unit 60 includes a prediction unit 62.

予測部６２は、入力部５０により受け付けたトピック比率｛θ_ｕ｝及びトピック分布のパラメータ｛φ^ｋ｝，｛Υ^ｋ｝に基づいて、各トピックに対する、トピックを条件とした選択内容の確率を表す条件付き多項分布、及びトピックを条件とした、前回の選択行動からの間隔である選択タイミングの確率密度分布を用いて表される、各トピックに対する、トピックを条件とした、個体の選択行動の選択内容、及び選択タイミングの同時確率分布に従って、予測対象の個体についての、個体の選択行動の選択内容ｙ^＊及び選択タイミングｔ^＊を予測する。 Based on the topic ratio {θ _u } and the topic distribution parameters {φ ^k }, {Υ ^k } received by the input unit 50, the prediction unit 62 represents the probability of selection content on the topic as a condition for each topic. Selection of individual selection behavior by topic for each topic expressed using conditional multinomial distribution and probability density distribution of selection timing, which is the interval from the previous selection behavior, with topic as a condition According to the simultaneous probability distribution of the content and the selection timing, the selection content y ^* and the selection timing t ^* of the individual selection behavior for the individual to be predicted are predicted.

具体的には、予測部６２は、推定装置１００によって推定されたトピック分布のパラメータ値｛φ^ｋ｝，｛Υ^ｋ｝とトピック比率｛θ_ｕ｝、及び各個体の過去の選択行動データ（ｙ，ｔ）を用いて、個体ｕ（ｕ＝１，２，…，Ｕ）の次回（将来）の選択行動（ｙ^＊，ｔ^＊）を予測する。 Specifically, the prediction unit 62 uses the topic distribution parameter values {φ ^k }, {Υ ^k } and the topic ratio {θ _u } estimated by the estimation apparatus 100, and past selection behavior data (y , T) is used to predict the next (future) selection action (y ^* , t ^* ) of the individual u (u = 1, 2,..., U).

予測の対象は（ａ）選択タイミングｔ^＊を与えた場合の選択内容ｙ^＊、（ｂ）選択内容ｙ^＊を与えた場合の選択タイミングｔ^＊、（ｃ）選択内容ｙ^＊と選択タイミングｔ^＊の両方、の三種類である。予測部６２は、それぞれの推定を以下の手順で行う。 The target of prediction is (a) selection content y ^* when selection timing t ^* is given, (b) selection timing t ^* when selection content y ^* is given, (c) selection content y ^* and selection timing t ^*. There are three types of both. The prediction unit 62 performs each estimation in the following procedure.

（ａ）選択タイミングｔ^＊を与えた場合の選択内容ｙ^＊を予測する場合
予測部６２は、選択内容ｙ^＊の各々について、以下の式（１４）に従って、選択タイミングｔ^＊を条件とした当該選択内容ｙ^＊の確率分布ｐ（ｙ^＊｜ｔ^＊）を計算し、確率の高さにおいて上位Ｒ個の選択内容を選び出し、予測値とする。 (A) When the selection content y ^* is predicted when the selection timing t ^* is given The prediction unit 62 uses the selection timing t ^* as a condition for each selection content y ^* according to the following equation (14). The probability distribution p (y ^* | t ^* ) of the selection content y ^* is calculated, and the top R selection content is selected at the probability level and set as the predicted value.

このときタイミングｔ^＊によって、すなわち直前の選択行動発生時刻からの経過時間によって予測結果は異なりうるため、推薦あるいはナビゲーションする内容をタイミングによって変更することができる。なお、個数Ｒは１からＮ_ｃまでの値を取ることができ、予測装置２００の使用者が使用形態に応じて任意に決めるものである。 At this time, since the prediction result may vary depending on the timing t ^* , that is, the elapsed time from the immediately preceding selected action occurrence time, the content to be recommended or navigated can be changed depending on the timing. The number R can take a value from 1 to _Nc , and is arbitrarily determined by the user of the prediction device 200 according to the usage pattern.

（ｂ）選択内容ｙ^＊を与えた場合の選択タイミングｔ^＊を予測する場合
予測部６２は、選択タイミングｔ^＊の各々について、以下の式（１５）に従って、選択内容ｙ^＊を条件とした選択タイミングｔ^＊の確率分布ｐ（ｔ^＊｜ｙ^＊）を計算し、ｐ（ｔ^＊｜ｙ^＊）の最大値（モード）あるいはｔ^＊の平均値を計算し、予測値とする。 (B) When predicting the selection timing t ^* when the selection content y ^* is given The prediction unit 62 selects each of the selection timings t ^{* based on} the selection content y ^* according to the following equation (15). The probability distribution p (t ^* | y ^* ) at timing t ^* is calculated, and the maximum value (mode) of p (t ^* | y ^* ) or the average value of t ^* is calculated as a predicted value.

上記式（１５）に従って予測が行われることにより、選択内容 By selecting according to the above formula (15), the selected content

ごとに選択タイミングの予測値が得られる。 A prediction value of the selection timing is obtained for each time.

（ｃ）選択内容ｙ^＊と選択タイミングｔ^＊の両方を予測する場合
予測部６２は、選択内容ｙ^＊と選択タイミングｔ^＊の両方の組み合わせ毎に、選択内容ｙ^＊とタイミングｔ^＊の同時確率分布ｐ（ｔ^＊，ｙ^＊｜｛φ^ｋ｝，｛Υ^ｋ｝）を計算し、ｐ（ｔ^＊，ｙ^＊｜｛φ^ｋ｝，｛Υ^ｋ｝）の最大値（モード）、又はｔ^＊の平均値及びｙ^＊の最大値を計算し、予測値とする。 (C) When predicting both selections ^{y *} and the selected timing ^{t *} prediction unit 62, selection ^{y *} both for each combination of the selection timing ^{t *,} choice ^{y *} and the timing ^{t *} joint probability of The distribution p (t ^* , y ^* | {φ ^k }, {Υ ^k }) is calculated, and the maximum value (mode) of p (t ^* , y ^* | {φ ^k }, {Υ ^k }), or t ^* a calculated average value and maximum value of y ^*, and predicted values.

そして、予測部６２は、上記（ａ）で計算された選択内容の予測値ｙ^＊、上記（ｂ）で計算された選択タイミングの予測値ｔ^＊、又は上記（ｃ）で計算された選択内容と選択タイミングの予測値（ｔ^＊，ｙ^＊）を出力する。 Then, the predicting unit 62 predicts the predicted value y ^* of the selected content calculated in (a), the predicted value t ^* of the selected timing calculated in (b), or the selected content calculated in (c). And the predicted value (t ^* , y ^* ) of the selection timing are output.

出力部７０は、選択内容の予測値ｙ^＊、選択タイミングの予測値ｔ^＊、又は選択内容と選択タイミングとの予測値（ｔ^＊，ｙ^＊）を結果として出力する。 The output unit 70 outputs the predicted value y ^* of the selected content, the predicted value t ^{* of} the selected timing, or the predicted value (t ^* , y ^* ) of the selected content and the selected timing as a result.

＜本発明の実施の形態に係る推定装置の作用＞ <Operation of Estimation Device according to Embodiment of the Present Invention>

次に、本発明の実施の形態に係る推定装置１００の作用について説明する。入力部１０において個体の個体ＩＤが付与された選択行動データの集合と、トピック数Ｋと、モデルタイプと、選択タイミング分布とを受け付けると、推定装置１００は、図８に示す推定処理ルーチンを実行する。 Next, the operation of the estimation apparatus 100 according to the embodiment of the present invention will be described. When receiving the set of selected behavior data to which the individual ID of the individual is assigned, the number of topics K, the model type, and the selection timing distribution in the input unit 10, the estimation apparatus 100 executes an estimation processing routine shown in FIG. To do.

まず、ステップＳ１００では、入力部１０において受け付けた選択行動データの集合と、トピック数Ｋと、モデルタイプと、選択タイミング分布とを取得する。そして、選択行動データの集合が選択行動データ記憶部２２に格納される。 First, in step S100, a set of selected action data received by the input unit 10, the number of topics K, a model type, and a selection timing distribution are acquired. A set of selected action data is stored in the selected action data storage unit 22.

次に、ステップＳ１０２において、トピック推定部２４は、選択行動データ記憶部２２に記憶された選択行動データの集合に含まれる各選択行動データについて、α、β、γの初期値又は前回のステップＳ１０４で推定されたα、β、γに基づいて、上記式（８）に従って、選択行動データが所属するトピックｋを計算する。 Next, in step S102, the topic estimation unit 24 sets initial values of α, β, and γ for each selection behavior data included in the set of selection behavior data stored in the selection behavior data storage unit 22 or the previous step S104. The topic k to which the selected action data belongs is calculated according to the above equation (8) based on α, β, and γ estimated in step (1).

ステップＳ１０４において、トピック推定部２４は、上記ステップＳ１０２で得られた各選択行動データについてのトピックｋの計算結果に基づいて、各個体ｕ及び各トピックｋに対する重みを表すトピック比率θ_ｕの分布を表すハイパーパラメータαと、トピックｋにおける、選択内容及び選択タイミングを特徴付けるトピック分布のパラメータφ^ｋ，Υ^ｋの分布を表すハイパーパラメータβ，γを、上記式（９）に従って更新する。 In step S104, the topic estimation unit 24 calculates the distribution of the topic ratio θ _u representing the weight for each individual u and each topic k based on the calculation result of the topic k for each selection behavior data obtained in step S102. The hyperparameters α and the hyperparameters β and γ representing the distribution of topic distribution parameters φ ^k and Υ ^k that characterize the selection content and selection timing in the topic ^k are updated according to the above equation (9).

ステップＳ１０６において、ステップＳ１０２〜Ｓ１０４の処理が予め定められたＰ回繰り返したかを判定し、予め定められた回数繰り返していなければステップＳ１０２へ戻ってステップＳ１０２〜Ｓ１０４の処理を繰り返し、予め定められた回数繰り返していればステップＳ１０８へ移行する。 In step S106, it is determined whether the process of steps S102 to S104 has been repeated P times. If the process has not been repeated a predetermined number of times, the process returns to step S102 and the processes of steps S102 to S104 are repeated. If it has been repeated a number of times, the process proceeds to step S108.

ステップＳ１０８において、トピック分布パラメータ推定部２６は、上記ステップＳ１０４で更新されたハイパーパラメータβ，γ、及び繰り返し毎に得られた、上記ステップＳ１０２で計算された各選択行動データについてのトピックの計算結果（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））に基づく、上記式（１１）に示すトピック分布のパラメータの事後確率に従って、トピック分布のパラメータφ^ｋ，Υ^ｋを推定する。また、トピック分布パラメータ推定部２６は、トピック分布のパラメータφ^ｋ，Υ^ｋをパラメータ記憶部３０に格納する。 In step S108, the topic distribution parameter estimation unit 26 calculates the hyperparameters β and γ updated in step S104 and the topic calculation result for each selected behavior data calculated in step S102 obtained for each iteration. Based on (z ⁽¹⁾ , z ⁽²⁾ ,..., Z ^(P) ), the topic distribution parameters φ ^k and Υ ^k are estimated according to the posterior probabilities of the topic distribution parameters shown in the above equation (11). The topic distribution parameter estimation unit 26 stores the topic distribution parameters φ ^k and Υ ^k in the parameter storage unit 30.

ステップＳ１１０において、トピック比率推定部２８は、上記ステップＳ１０４で更新されたトピック比率の分布を表すハイパーパラメータα、及び繰り返し毎に得られた、上記ステップＳ１０２で計算された各選択行動データについてのトピックの計算結果（ｚ^（１），ｚ^（２），…，ｚ^（Ｐ））に基づく、上記式（１３）に示すトピック比率の事後確率に従って、各個体について、トピック比率θ_ｕを推定する。トピック比率推定部２８は、各個体についてのトピック比率θ_ｕをトピック比率記憶部３２に格納する。 In step S110, the topic ratio estimation unit 28 performs the hyper parameter α representing the distribution of the topic ratio updated in step S104 and the topic for each selected behavior data calculated in step S102 obtained for each iteration. The topic ratio θ _u is estimated for each individual according to the posterior probabilities of the topic ratio shown in the above equation (13) based on the calculation results (z ⁽¹⁾ , z ⁽²⁾ ,..., Z ^(P) ). The topic ratio estimation unit 28 stores the topic ratio θ _u for each individual in the topic ratio storage unit 32.

ステップＳ１１２において、出力部４０は、パラメータ記憶部３０に記憶されたトピック分布のパラメータと、トピック比率記憶部３２に記憶された個体毎のトピック比率とを結果として出力して、推定処理ルーチンを終了する。 In step S112, the output unit 40 outputs the topic distribution parameter stored in the parameter storage unit 30 and the topic ratio for each individual stored in the topic ratio storage unit 32 as a result, and ends the estimation processing routine. To do.

＜本発明の実施の形態に係る予測装置の作用＞ <Operation of Prediction Device according to Embodiment of the Present Invention>

次に、本発明の実施の形態に係る予測装置２００の作用について説明する。入力部５０において、予測対象の個体ｕについて推定装置１００により推定されたトピック比率θ_ｕと、トピック分布のパラメータφ^ｋ，Υ^ｋとを受け付けると、予測装置２００は、図９に示す予測処理ルーチンを実行する。 Next, the operation of the prediction device 200 according to the embodiment of the present invention will be described. When the input unit 50 receives the topic ratio θ _u estimated by the estimation device 100 for the individual u to be predicted and the topic distribution parameters φ ^k and Υ ^k , the prediction device 200 performs the prediction processing routine shown in FIG. Execute.

ステップＳ２００において、トピック比率θ_ｕとトピック分布のパラメータφ^ｋ，Υ^ｋとを取得する。 In step S200, the topic ratio θ _u and the topic distribution parameters φ ^k and Υ ^k are acquired.

ステップＳ２０２において、選択タイミングｔ^＊を与えた場合の選択内容ｙ^＊を予測する場合には、予測部６２は、入力部５０において受け付けたトピック比率θ_ｕとトピック分布のパラメータφ^ｋ，Υ^ｋとに基づいて、上記式（１４）に従って、選択タイミングｔ^＊を条件とした選択内容ｙ^＊の確率分布ｐ（ｙ^＊｜ｔ^＊）を計算し、確率の高さにおいて上位Ｒ個の選択内容を選び出し、予測値とする。
また、選択内容ｙ^＊を与えた場合の選択タイミングｔ^＊を予測する場合には、予測部６２は、入力部５０において受け付けたトピック比率θ_ｕとトピック分布のパラメータ（φ^ｋ，Υ^ｋ）とに基づいて、上記式（１５）に従って、選択内容ｙ^＊を条件とした選択タイミングｔ^＊の確率分布ｐ（ｔ^＊｜ｙ^＊）を計算し、ｐ（ｔ^＊｜ｙ^＊）の最大値（モード）あるいはｔ^＊の平均値を計算し、予測値とする。
また、選択内容ｙ^＊と選択タイミングｔ^＊の両方を予測する場合には、予測部６２は、選択内容ｙ^＊とタイミングｔ^＊の同時確率分布ｐ（ｔ^＊，ｙ^＊｜｛φ^ｋ｝，｛Υ^ｋ｝）を計算し、ｐ（ｔ^＊，ｙ^＊｜｛φ^ｋ｝，｛Υ^ｋ｝）の最大値（モード）、又はｔ^＊の平均値及びｙ^＊の最大値を計算し、予測値とする。 In step S202, when the selection content y ^* is predicted when the selection timing t ^* is given, the prediction unit 62 determines the topic ratio θ _u received by the input unit 50 and the topic distribution parameters φ ^k and Υ ^k . Based on the above, the probability distribution p (y ^* | t ^* ) of the selection content y ^* is calculated according to the above equation (14) with the selection timing t ^* as a condition. Select and use as predicted value.
When predicting the selection timing t ^* when the selection content y ^* is given, the prediction unit 62 uses the topic ratio θ _u received by the input unit 50 and the topic distribution parameters (φ ^k , ^{ｋ k} ). Based on the above, the probability distribution p (t ^* | y ^* ) of the selection timing t ^* on the condition of the selection content y ^* is calculated according to the above equation (15), and the maximum value of p (t ^* | y ^* ) Mode) or an average value of t ^* is calculated as a predicted value.
Further, in the case of predicting both selections ^{y *} as selection timing ^{t *,} the prediction unit 62, selection ^{y *} and the timing ^{t *} joint probability distribution ^{^{p (t *, y * |}} {φ k}, {Υ ^k }) and calculate the maximum value (mode) of p (t ^* , y ^* | {φ ^k }, {Υ ^k }), or the average value of t ^* and the maximum value of y ^* , Estimated value.

そして、ステップＳ２０４において、出力部７０は、上記ステップＳ２０２で得られた、選択内容の予測値ｙ^＊、選択タイミングの予測値ｔ^＊、又は選択内容と選択タイミングとの予測値（ｔ^＊，ｙ^＊）を結果として出力して、予測処理ルーチンを終了する。 In step S204, the output unit 70 calculates the predicted value y ^* of the selection content, the predicted value t ^{* of} the selection timing, or the predicted value of the selection content and the selection timing (t ^* , y) obtained in step S202. ^* ) Is output as a result, and the prediction processing routine is terminated.

以上説明したように、本実施の形態に係る推定装置によれば、各選択行動データについて、各トピックに対する、トピックを条件とした選択内容の確率を表す条件付き多項分布、及びトピックを条件とした選択タイミングの確率密度分布を用いて表される、選択内容及び選択タイミングの同時確率分布に従って、選択行動データが所属するトピックを計算し、各選択行動データについてのトピックの計算結果に基づいて、トピック比率の分布を表すハイパーパラメータとトピック分布のパラメータの分布を表すハイパーパラメータとを更新することを繰り返し、更新されたトピック分布のパラメータの分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについてのトピックの計算結果に基づく、トピック分布のパラメータの事後確率に従って、トピック分布のパラメータを推定し、更新されたトピック比率の分布を表すハイパーパラメータ、及び繰り返し毎に得られた、各選択行動データについてのトピックの計算結果に基づく、トピック比率の事後確率に従って、各個体について、トピック比率を推定することにより、個体の選択内容及び選択タイミングを精度よく予測するためのパラメータを推定することができる。 As described above, according to the estimation apparatus according to the present embodiment, for each selected action data, a conditional multinomial distribution that represents the probability of selection content on a topic for each topic, and a topic on a condition The topic to which the selected behavior data belongs is calculated according to the selection content and the simultaneous probability distribution of the selection timing expressed using the probability density distribution of the selection timing, and the topic is calculated based on the calculation result of the topic for each selected behavior data. Repeating the update of the hyperparameter representing the distribution of the ratio and the hyperparameter representing the distribution of the parameter of the topic distribution, the hyperparameter representing the parameter distribution of the updated topic distribution, and each selection obtained at each iteration The topic distribution parameters based on the topic calculation results for behavioral data. Estimate the topic distribution parameters according to the posterior probability of the meter, and update the topic ratios based on the hyperparameters that represent the updated topic ratio distribution and the topic calculation results for each selected behavior data obtained for each iteration. By estimating the topic ratio for each individual according to the posterior probability, it is possible to estimate parameters for accurately predicting the selection content and selection timing of the individual.

また、本実施の形態に係る予測装置によれば、予測対象の個体に対して予め求められた、各トピックに対する重みを表すトピック比率とトピック分布のパラメータとに基づいて、各トピックに対する、トピックを条件とした選択内容の確率を表す条件付き多項分布、及びトピックを条件とした、選択タイミングの確率密度分布を用いて表される、各トピックに対する、トピックを条件とした、個体の選択行動の選択内容及び選択タイミングの同時確率分布に従って、予測対象の個体についての、選択内容及び選択タイミングを予測することにより、個体の選択内容及び選択タイミングを精度よく予測することができる。 Further, according to the prediction device according to the present embodiment, the topic for each topic is calculated based on the topic ratio and the parameter of the topic distribution representing the weight for each topic, which is obtained in advance for the individual to be predicted. Selection of individual selection behavior based on topic for each topic, expressed using conditional multinomial distribution representing the probability of selection content as a condition, and probability density distribution of selection timing based on topic By predicting the selection content and the selection timing for the individual to be predicted according to the simultaneous probability distribution of the content and the selection timing, it is possible to accurately predict the selection content and the selection timing of the individual.

なお、本発明は、上述した実施の形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made without departing from the gist of the present invention.

例えば、上記実施の形態に係る予測装置では、個体の選択内容及び選択タイミングを予測する場合を例に説明したが、これに限定されるものではなく、選択内容及び選択タイミングの何れか一方を予測してもよい。 For example, in the prediction device according to the above-described embodiment, the case where the selection content and the selection timing of an individual are predicted has been described as an example. However, the present invention is not limited to this, and either the selection content or the selection timing is predicted. May be.

また、上述の推定装置１００は、選択行動データ記憶部２２、パラメータ記憶部３０、及びトピック比率記憶部３２を備えている場合について説明したが、例えば選択行動データ記憶部２２、パラメータ記憶部３０、及びトピック比率記憶部３２の少なくとも１つが推定装置１００の外部装置に設けられ、推定装置１００は、外部装置と通信手段を用いて通信することにより、選択行動データ記憶部２２、パラメータ記憶部３０、及びトピック比率記憶部３２の少なくとも１つを参照するようにしてもよい。 Moreover, although the above-mentioned estimation apparatus 100 demonstrated the case where the selection action data storage part 22, the parameter storage part 30, and the topic ratio storage part 32 were provided, for example, the selection action data storage part 22, the parameter storage part 30, And at least one of the topic ratio storage unit 32 is provided in an external device of the estimation device 100, and the estimation device 100 communicates with the external device using a communication unit, whereby the selection behavior data storage unit 22, the parameter storage unit 30, Also, at least one of the topic ratio storage unit 32 may be referred to.

また、上記実施の形態では、推定装置１００と予測装置２００とを別々の装置として構成する場合を例に説明したが、推定装置１００と予測装置２００とを１つの装置として構成してもよい。 Moreover, although the case where the estimation apparatus 100 and the prediction apparatus 200 are configured as separate apparatuses has been described as an example in the above embodiment, the estimation apparatus 100 and the prediction apparatus 200 may be configured as one apparatus.

また、上述の推定装置及び予測装置は、内部にコンピュータシステムを有しているが、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。 In addition, the estimation device and the prediction device described above have a computer system inside, but the “computer system” includes a homepage provision environment (or display environment) if a WWW system is used. Shall be.

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、当該プログラムを、コンピュータ読み取り可能な記録媒体に格納して提供することも可能である。 In the present specification, the embodiment has been described in which the program is installed in advance. However, the program can be provided by being stored in a computer-readable recording medium.

１０，５０入力部
２０，６０演算部
２２選択行動データ記憶部
２４トピック推定部
２６トピック分布パラメータ推定部
２８トピック比率推定部
３０パラメータ記憶部
３２トピック比率記憶部
４０，７０出力部
６２予測部
１００推定装置
２００予測装置 DESCRIPTION OF SYMBOLS 10,50 Input part 20,60 Operation part 22 Selection action data storage part 24 Topic estimation part 26 Topic distribution parameter estimation part 28 Topic ratio estimation part 30 Parameter storage part 32 Topic ratio storage part 40,70 Output part 62 Prediction part 100 Estimation Device 200 prediction device

Claims

Each set included in the set of selected behavior data is input with a set of selection behavior data that represents the selection content of the selection behavior of the individual and a selection timing that is an interval from the previous selection behavior, observed in a plurality of individuals. For the selected behavior data, for each topic, expressed using a conditional multinomial distribution representing the probability of selection content with each topic as a condition for each topic, and a probability density distribution of selection timing with the topic as a condition, The topic to which the selected action data belongs is calculated according to the simultaneous probability distribution of the selection contents and the selection timing, with the topic as a condition,
Based on the calculation result of the topic for each selection behavior data, a hyper parameter representing a distribution of topic ratios representing weights for each individual and each topic, and a topic distribution characterizing the selection content and the selection timing in the topic A topic estimator that repeats updating hyperparameters representing the distribution of parameters;
A posterior probability of the parameter of the topic distribution based on the hyper parameter representing the distribution of the parameter of the topic distribution updated by the topic estimation unit and the calculation result of the topic for each selection behavior data obtained for each iteration A topic distribution parameter estimation unit for estimating a parameter of the topic distribution according to
Each individual according to the posterior probability of the topic ratio based on the hyperparameter representing the distribution of the topic ratio updated by the topic estimation unit and the calculation result of the topic for each selection behavior data obtained for each iteration. A topic ratio estimation unit for estimating the topic ratio;
Including the estimation device.

The estimation apparatus according to claim 1, wherein the probability density distribution of the selection timing with the topic as a condition is determined such that the selection timing is generated independently of the selection content.

The estimation apparatus according to claim 1, wherein the probability density distribution of the selection timing with the topic as a condition is determined such that the selection timing is generated depending on the selection content immediately before.

The estimation apparatus according to claim 1, wherein the probability density distribution of the selection timing with the topic as a condition is determined so as to be generated depending on the selection content immediately after the selection timing.

A prediction device for predicting the selection behavior of an individual to be predicted,
The topic ratio that represents the weight for each topic, obtained in advance for the individual to be predicted, the selection content of the individual's selection behavior and the interval from the previous selection behavior of the individual in the topic obtained in advance And a conditional multinomial distribution representing the probability of selection content on the basis of the topic and a probability density of the selection timing on the topic based on a topic distribution parameter characterizing the selection timing. The selection content for the individual to be predicted according to the selection content of the selection behavior of the individual and the simultaneous probability distribution of the selection timing for each topic represented by the distribution according to the topic. A prediction device including a prediction unit that predicts at least one of the selection timings.

The topic estimation unit receives as input a selection content of the selection behavior of the individual observed in a plurality of individuals and a selection behavior data representing a selection timing that is an interval from the previous selection behavior. Each selection behavior data included in the set is expressed by using a conditional multinomial distribution representing the probability of selection contents with the topic as a condition and a probability density distribution of selection timing with the topic as a condition for each topic. , For each topic, calculate the topic to which the selected behavior data belongs, according to the simultaneous probability distribution of the selection content and the selection timing, with the topic as a condition,
Based on the calculation result of the topic for each selection behavior data, a hyper parameter representing a distribution of topic ratios representing weights for each individual and each topic, and a topic distribution characterizing the selection content and the selection timing in the topic Repeatedly updating the hyperparameters representing the distribution of parameters,
The topic distribution parameter estimation unit is based on the hyperparameter representing the distribution of the topic distribution parameter updated by the topic estimation unit, and the calculation result of the topic for each selection behavior data obtained for each iteration, Estimating the topic distribution parameters according to the posterior probabilities of the topic distribution parameters;
The topic ratio estimator is a hyperparameter representing the distribution of the topic ratio updated by the topic estimator and the calculation result of the topic for each selected behavior data obtained for each iteration. An estimation method for estimating the topic ratio for each individual according to a posteriori probability.

A prediction method for predicting the selection behavior of an individual to be predicted,
The prediction unit is obtained in advance for the individual to be predicted, the topic ratio representing the weight for each topic, the selection content of the individual selection behavior in the topic obtained in advance, and the previous selection of the individual Based on the topic distribution parameters that characterize the selection timing, which is the interval from the action, a conditional multinomial distribution that represents the probability of selection content on the topic for each topic, and the selection on the topic According to the selection content of the selection action of the individual, with the topic as a condition, and the simultaneous probability distribution of the selection timing for each topic, expressed using a probability density distribution of timing, A prediction method for predicting at least one of the selection content and the selection timing.

The program for functioning a computer as each part which comprises the estimation apparatus of any one of Claims 1-4, or the prediction apparatus of Claim 5.