JP6358581B2

JP6358581B2 - Model estimation method, optimization method, apparatus, and program

Info

Publication number: JP6358581B2
Application number: JP2015130345A
Authority: JP
Inventors: 秀明金; 典子高屋; 熊谷　雄介; 雄介熊谷; 伸彦照井; 石垣　司; 司石垣
Original assignee: Tohoku University NUC; Nippon Telegraph and Telephone Corp
Current assignee: Tohoku University NUC; Nippon Telegraph and Telephone Corp
Priority date: 2015-06-29
Filing date: 2015-06-29
Publication date: 2018-07-18
Anticipated expiration: 2035-06-29
Also published as: JP2017016273A

Description

本発明は、モデル推定方法、最適化方法、装置、及びプログラムに係り、特に、刺激に接触した個体の行動を予測するためのモデル推定方法、最適化方法、装置、及びプログラムに関する。 The present invention relates to a model estimation method, an optimization method, an apparatus, and a program, and more particularly, to a model estimation method, an optimization method, an apparatus, and a program for predicting the behavior of an individual that has touched a stimulus.

人間に対して、ある意思決定（行動）を引き起こさせることを目的に刺激を与える様々な状況が存在する。なお、意思決定（行動）を引き起こさせることを目的に刺激を与える対象は人間以外の生物（動物等）でもよいが、以下では、主に人間を例にとって説明する。 There are various situations that stimulate humans for the purpose of causing a certain decision (action). Note that the target to be stimulated for the purpose of causing decision making (behavior) may be a living organism (animal or the like) other than a human being.

上記の状況としては、例えば、ある商品のテレビ／ウェブ広告と接触させることでその商品を購入させる、ある地域への旅行広告に接触させることでその地域への旅行を決断させる、ダイレクトメールと接触させることで特定の企業のホームページアドレスをクリックさせる、などがある。また、参加者で混雑した複合型イベントの会場において、参加者個々人をイベント情報配信によって各自誘導し、会場の混雑を解消させる場合も考えられる。あるいは、目覚ましアラームによって睡眠から覚醒させる、ソフトウェアエージェントの提案によって望ましい食材を摂取させる、といった人間を好ましい行動に誘導するような例もある。 For example, contact with direct mail, contact with a TV / web advertisement for a product, purchase the product, contact with a travel advertisement for a region, and decide to travel to the region. To click on the homepage address of a specific company. In addition, in a complex event venue that is crowded with participants, it may be possible to induce individual participants by event information distribution to eliminate congestion in the venue. Alternatively, there is an example in which a human being is guided to a desirable behavior, such as awakening from sleep by an alarm clock or ingesting a desired food by a software agent proposal.

刺激接触に対して人間がどの程度の確率で行動を起こすかを表す刺激接触効果に関する種々の研究が行われている。このような研究において、刺激接触効果には、閾値性、ウェアアウト（アバージョン）、忘却性といった性質があることがわかっている。 Various studies have been conducted on the effect of stimulating contact that expresses the probability of human action with respect to stimulating contact. In such studies, it has been found that the stimulus contact effect has properties such as threshold value, wearout (aversion), and forgetting property.

刺激接触効果の閾値性とは、刺激接触によってその効果を得るためには一定量以上の刺激が必要である、という性質のことである。また、刺激接触効果のウェアアウトあるいはアバージョンとは、過度の刺激接触が刺激接触効果を減退させる性質のことである。広告研究において（刺激＝広告、効果＝売上あるいは市場占有率）、これら二点の性質は以前よりその存在が指摘されている（例えば、非特許文献１、２）。 The threshold property of the stimulation contact effect is a property that a certain amount or more of stimulation is necessary to obtain the effect by stimulation contact. Further, the wear-out or aversion of the stimulation contact effect is a property that excessive stimulation contact reduces the stimulation contact effect. In advertisement research (stimulus = advertisement, effect = sales or market share), the existence of these two properties has been pointed out for some time (for example, Non-Patent Documents 1 and 2).

刺激接触効果の忘却性とは、ある時刻に接触した刺激の効果が時間と共に減衰していく性質のことである。この性質は広告効果（例えば、非特許文献３、４）、購買行動やメールの送信行動における他者からの影響（例えば、非特許文献５、６）などで広く認められている性質である。動物の選択行動でもしばしば仮定される（例えば、非特許文献７）。心理学実験でも確かめられている（例えば、非特許文献８）。 The forgetting property of the stimulus contact effect is a property in which the effect of the stimulus touched at a certain time attenuates with time. This property is widely recognized as an advertising effect (for example, Non-Patent Documents 3 and 4), an influence from others (for example, Non-Patent Documents 5 and 6) in purchasing behavior and email transmission behavior. It is often assumed even in animal selection behavior (eg, Non-Patent Document 7). It has also been confirmed by psychological experiments (for example, Non-Patent Document 8).

また、変動因子としてＥメール、ダイレクトメール、ＴＶ広告のマーケティング刺激と、顧客間の相互作用を採用し、購買時刻の予測を行う技術が存在する（例えば、上記非特許文献２）。ただし、顧客間の相互作用の大きさはデータから推定され、購買履歴の依存性は考慮していない。 In addition, there is a technique for predicting purchase time by adopting marketing stimulus of e-mail, direct mail, TV advertisement and interaction between customers as a variable factor (for example, Non-Patent Document 2). However, the magnitude of interaction between customers is estimated from the data and does not take into account the dependence of purchase history.

N. Terui, M. Ban, G. M. Allenby,“The effect of Media Advertising on Brand Consideration and Choice”, Marketing Science, Vol. 30, No. 1, pp.74-91, 2011.N. Terui, M. Ban, G. M. Allenby, “The effect of Media Advertising on Brand Consideration and Choice”, Marketing Science, Vol. 30, No. 1, pp.74-91, 2011. C. Pechmann, D. W. Stewart, “Advertising Repetition: A Critical Review of Wearin and Wearout”, Current Issues and Research in Advertising, Vol. 11, Issue 1-2, pp.285-329, 1988.C. Pechmann, D. W. Stewart, “Advertising Repetition: A Critical Review of Wearin and Wearout”, Current Issues and Research in Advertising, Vol. 11, Issue 1-2, pp.285-329, 1988. D. G. Clarke, “Econometric Measurement of the Duration of Advertising Effect on Sales”, Journal of Advertising Research, Vol. 13, No. 4, pp.345-357, 1976.D. G. Clarke, “Econometric Measurement of the Duration of Advertising Effect on Sales”, Journal of Advertising Research, Vol. 13, No. 4, pp.345-357, 1976. P. Manchanda, J. Dube, K. Y. Goh, R. K. Chintagunta, “The Effect of Banner Advertising on Internet Purchasing”, Journal of Marketing Research, Vol. 43, No. 1, pp. 98-108, 2006.P. Manchanda, J. Dube, K. Y. Goh, R. K. Chintagunta, “The Effect of Banner Advertising on Internet Purchasing”, Journal of Marketing Research, Vol. 43, No. 1, pp. 98-108, 2006. Rikiya Takahashi, Hideyuki Mizuta, Naoki Abe, Ruby L. Kennedy, Vincent J. Jeffs, Ravi Shah, Robert H. Crites, “Collective Response Spike Prediction for Mutually Interacting Consumers”, Data Mining (ICDM), 2013 IEEE 13th International Conference on, pp.727-736, 2013.Rikiya Takahashi, Hideyuki Mizuta, Naoki Abe, Ruby L. Kennedy, Vincent J. Jeffs, Ravi Shah, Robert H. Crites, “Collective Response Spike Prediction for Mutually Interacting Consumers”, Data Mining (ICDM), 2013 IEEE 13th International Conference on , pp.727-736, 2013. C. Blundell, J. Beck, K. A. Heller, “Modelling reciprocating relationships with Hawkes processes”, Advances in Neural Information Processing Systems, pp.2600-2608, 2012.C. Blundell, J. Beck, K. A. Heller, “Modeling reciprocating relationships with Hawkes processes”, Advances in Neural Information Processing Systems, pp. 2600-2608, 2012. R. Ratcliff, G. McKoon, “The diffusion decision model: theory and data for two-choice decision tasks”, Neural Computation, Vol. 20, No. 4, pp.873-922, 2008.R. Ratcliff, G. McKoon, “The diffusion decision model: theory and data for two-choice decision tasks”, Neural Computation, Vol. 20, No. 4, pp.873-922, 2008. J. T. Wixted, “Analyzing the empirical course of forgetting”, Journal of Experimental Psychology, Vol. 16, No. 5, pp.927-935, 1990.J. T. Wixted, “Analyzing the empirical course of forgetting”, Journal of Experimental Psychology, Vol. 16, No. 5, pp.927-935, 1990. R. E. Bucklin, S. Gupta, “Brand choice, Purchase incidence, and segmentation: an integrated modeling approach”, Journal of Marketing Research, Vol. 29, pp.201-215, 1992.R. E. Bucklin, S. Gupta, “Brand choice, Purchase incidence, and segmentation: an integrated modeling approach”, Journal of Marketing Research, Vol. 29, pp. 201-215, 1992.

しかし、意思決定の発生に対する刺激接触効果を測定する装置である非特許文献４，５，６では、刺激接触効果の閾値性、ウェアアウト、忘却性のすべてを同時には考慮していないため、正確に刺激接触効果を測定または最適化できない。例えばウェアアウトを無視すると、刺激接触は多ければ多いほど大きな効果が期待されるという誤った結論（例えば、上記非特許文献１）を得ることになり、無駄に多くの刺激接触を与えることにつながってしまう、という問題がある。 However, in Non-Patent Documents 4, 5, and 6, which are devices that measure the stimulus contact effect on the occurrence of decision-making, all of threshold value, wearout, and forgetting property of the stimulus contact effect are not considered at the same time. The stimulus contact effect cannot be measured or optimized. For example, ignoring wear-out results in an erroneous conclusion (for example, Non-Patent Document 1) that a greater effect is expected as the number of stimulation contacts increases, leading to a lot of useless stimulation contacts. There is a problem that.

上記、意思決定の発生に対する刺激接触効果を測定する装置は、人間の意思決定の「発生の有無」に対する刺激接触効果を測定するための装置である。つまり、意思決定の選択肢は一択（特定の商品を購買する等）でなくてはならない。よってこの装置単独では、意思決定発生時に二つ以上の選択肢から意思決定内容が選択される状況を扱うことができないという問題がある。 The apparatus for measuring the stimulus contact effect on the occurrence of decision making is a device for measuring the stimulus contact effect on “occurrence / non-occurrence” of human decision making. In other words, decision-making options must be one choice (such as purchasing a specific product). Therefore, this apparatus alone has a problem that it is not possible to handle a situation in which decision making content is selected from two or more options when decision making occurs.

上記の状況としては、例えば、ある特定の商品カテゴリ（牛乳やコーヒーなど）の購買の有無に対する広告接触効果を測定することはできるが、その商品カテゴリを購入する際どのブランド（多くの場合二つ以上の選択肢）を購買するかに関する広告接触効果を測定することができない、というものが挙げられる。仮に、意思決定の発生に対する刺激接触効果を測定する装置を用いて、牛乳の三つのブランドＡ、Ｂ、Ｃそれぞれに対して広告接触効果を測定する装置を独立に用意したとしても、実際にある時点で購買されるのはＡ、Ｂ、Ｃのいずれか一つであり、その選択の排他性を考慮することができない。事業者が広告接触効果を測定する場合、事業者が知りたいのは自社ブランドの他社ブランドに対する優位性であり、その点で意思決定の発生に対する刺激接触効果を測定する装置は、単独では広告接触効果測定装置として不十分である。 As an example of the above situation, it is possible to measure the effect of advertising contact on the purchase / non-purchase of a specific product category (such as milk and coffee). It is possible to measure the effect of advertising contact on whether to purchase the above option). Even if a device that measures the advertising contact effect for each of the three brands A, B, and C of milk is prepared using a device that measures the stimulating contact effect on the occurrence of decision making, it is actually present. Any one of A, B, and C is purchased at the time, and the exclusivity of the selection cannot be considered. When an operator measures the advertising contact effect, the operator wants to know the superiority of its own brand over other brands, and in that respect, the device that measures the stimulating contact effect on the occurrence of decision-making is not an ad contact alone. It is insufficient as an effect measuring device.

また、意思決定内容の選択に対する刺激接触効果の測定装置はすでに提案されているが（非特許文献１）、刺激接触効果の閾値性、ウェアアウト、忘却性のすべてを同時には考慮していないため、正確に刺激接触効果を測定または最適化できないという問題が存在する。 Moreover, although the measurement apparatus of the stimulation contact effect with respect to selection of the decision-making content has already been proposed (Non-Patent Document 1), it does not consider all of the threshold property, wearout, and forgetting property of the stimulation contact effect at the same time. There is a problem that the stimulus contact effect cannot be accurately measured or optimized.

上記の問題を解決するためには、まず閾値性、ウェアアウト、忘却性のすべての刺激接触効果を同時に考慮した、意思決定の発生に対する刺激接触効果測定装置を構築し、その測定装置に、（二つ以上の選択肢からの）選択行動に対する刺激接触効果の測定装置を組み込めばよい。選択行動に対する刺激接触効果測定装置は閾値性、ウェアアウト、忘却性のすべての刺激接触効果を同時に考慮した場合と、いずれか一つだけあるいは二つだけを考慮した場合が考えられる。意思決定の発生の有無と意思決定内容の選択、その両方を組み込んだ装置（以後統合モデルと呼ぶ）自体はすでに存在するが（非特許文献９）、刺激接触の三つの性質である閾値性、ウェアアウト、忘却性の全てを考慮した統合モデルは未だ提案されていない。 In order to solve the above problems, firstly, a stimulus contact effect measuring device for occurrence of decision making, considering all the stimulus contact effects of threshold value, wearout, and forgetting property at the same time, is constructed. A measuring device for stimulating contact effects on selected behavior (from two or more options) may be incorporated. The stimulating contact effect measuring device for the selected behavior can be considered when considering all stimulating contact effects of threshold value, wearout and forgetting simultaneously, and considering only one or only two. Although there is already an apparatus (hereinafter referred to as an integrated model) that incorporates both the presence / absence of decision-making and selection of decision-making contents (hereinafter referred to as an integrated model), the threshold property, which is the three properties of stimulation contact, An integrated model that considers both wear-out and forgetting has not yet been proposed.

本発明は、上記事情に鑑みて成されたものであり、刺激接触効果の閾値性、刺激接触効果のウェアアウト、及び刺激接触効果の忘却性を考慮した、選択行動に対する刺激接触効果を推定するためのパラメータを得ることができるモデル推定方法、装置、及びプログラムを提供することを目的とする。 The present invention has been made in view of the above circumstances, and estimates the stimulus contact effect on the selected action in consideration of the threshold property of the stimulus contact effect, the wearout of the stimulus contact effect, and the forgetting property of the stimulus contact effect. It is an object of the present invention to provide a model estimation method, apparatus, and program capable of obtaining parameters for the above.

また、刺激接触効果の閾値性、刺激接触効果のウェアアウト、及び刺激接触効果の忘却性を考慮して、個体に特定の選択行動を起こさせるための刺激を最適化することができる最適化方法、装置、及びプログラムを提供することを目的とする。 In addition, an optimization method capable of optimizing a stimulus for causing an individual to perform a specific selection behavior in consideration of a threshold property of the stimulus contact effect, wearout of the stimulus contact effect, and forgetting property of the stimulus contact effect An object is to provide a device and a program.

上記目的を達成するために、本発明に係るモデル推定方法は、入力部及びパラメータ推定部を含むモデル推定装置におけるモデル推定方法であって、前記入力部が、複数の個体の各々について観測された、前記個体が行動を起こした時刻の系列を表す行動時刻系列データと、前記個体が行動したときの行動の選択内容の系列を表す選択系列データとを含む行動データ、及び複数の個体の各々について観測され、かつ行動の選択肢毎に観測された、前記個体が刺激に接触した時刻の系列を表す刺激データを受け付けるステップと、前記パラメータ推定部が、複数の個体の各々についての、前記個体が予め定められた第１の量以上の刺激に接触した場合に、前記個体が刺激との接触によって行動を起こす度合いを表す刺激接触効果が高くなり、かつ、前記個体が予め定められた第２の量以上の刺激に接触した場合に前記刺激接触効果が低くなり、かつ、前記個体の刺激との接触の経過時間に応じて前記刺激接触効果が減衰するように、時刻ｔにおける単位時間当たりに前記個体が行動を起こすレートλ（ｔ）をモデル化した、個体の行動の発生に関するモデルと、時刻ｔにおいて前記個体が行動を起こしたときに各選択肢ｚ_ｌを選択する選択確率ｐ_ｌ（ｔ）をモデル化した、個体の行動が発生したときの選択に関するモデルと、前記複数の個体の各々についての、前記個体の行動の発生に関するモデルに関するパラメータと前記個体の行動が発生したときの選択に関するモデルに関するパラメータとを表す個体パラメータが、全体パラメータが表す分布に従って生成されることをモデル化した、個体パラメータの生成過程に関するモデルと、前記入力部によって受け付けた前記行動データ及び前記刺激データとに基づいて、前記全体パラメータと、前記複数の個体の各々についての前記個体パラメータとを推定するステップと、を含んで構成されている。 To achieve the above object, a model estimation method according to the present invention is a model estimation method in a model estimation apparatus including an input unit and a parameter estimation unit, wherein the input unit is observed for each of a plurality of individuals. Behavior data including behavior time series data representing a sequence of times when the individual has acted, and selection sequence data representing a sequence of behavior selection contents when the individual acts, and each of the plurality of individuals Receiving stimulus data representing a sequence of times when the individual contacts the stimulus, and observed for each action option; and the parameter estimation unit is configured to preliminarily store the individual for each of a plurality of individuals. When contacted with a stimulus greater than a predetermined first amount, the stimulus contact effect representing the degree to which the individual takes action by contact with the stimulus is increased. The stimulus contact effect is reduced when the individual comes into contact with a predetermined amount of stimulus or more, and the stimulus contact effect is attenuated according to the elapsed time of contact with the individual's stimulus. Thus, a model relating to the occurrence of an individual's behavior modeling the rate at which the individual takes action per unit time at time t, and each option z when the individual takes action at time t a model relating to selection when an individual action occurs, modeling a selection probability p _l (t) for selecting _l , a parameter relating to a model relating to the occurrence of the individual action for each of the plurality of individuals, and A model that individual parameters that represent parameters related to selection when individual behavior occurs are generated according to the distribution represented by the overall parameters Based on the model related to the individual parameter generation process and the behavior data and the stimulus data received by the input unit, the overall parameter and the individual parameter for each of the plurality of individuals are estimated. And a step.

また、本発明に係るモデル推定装置は、複数の個体の各々について観測された、前記個体が行動を起こした時刻の系列を表す行動時刻系列データと、前記個体が行動したときの行動の選択内容の系列を表す選択系列データと、前記個体の観測期間とを含む行動データ、及び複数の個体の各々について観測され、かつ行動の選択肢毎に観測された、前記個体が刺激に接触した時刻の系列を表す刺激データを受け付ける入力部と、複数の個体の各々についての、前記個体が予め定められた第１の量以上の刺激に接触した場合に、前記個体が刺激との接触によって行動を起こす度合いを表す刺激接触効果が高くなり、かつ、前記個体が予め定められた第２の量以上の刺激に接触した場合に前記刺激接触効果が低くなり、かつ、前記個体の刺激との接触の経過時間に応じて前記刺激接触効果が減衰するように、時刻ｔにおける単位時間当たりに前記個体が行動を起こすレートλ（ｔ）をモデル化した、個体の行動の発生に関するモデルと、時刻ｔにおいて前記個体が行動を起こしたときに各選択肢ｚ_ｌを選択する選択確率ｐ_ｌ（ｔ）をモデル化した、個体の行動が発生したときの選択に関するモデルと、前記複数の個体の各々についての、前記個体の行動の発生に関するモデルに関するパラメータと前記個体の行動が発生したときの選択に関するモデルに関するパラメータとを表す個体パラメータが、全体パラメータが表す分布に従って生成されることをモデル化した、個体パラメータの生成過程に関するモデルと、前記入力部によって受け付けた前記行動データ及び前記刺激データとに基づいて、前記全体パラメータと、前記複数の個体の各々についての前記個体パラメータとを推定するパラメータ推定部と、を含んで構成されている。 In addition, the model estimation apparatus according to the present invention includes behavior time series data representing a series of times at which the individual has acted, which is observed for each of a plurality of individuals, and action selection contents when the individual has acted Action data including selected series data representing a series of the above and the observation period of the individual, and a series of times when the individual contacts the stimulus, observed for each of the plurality of individuals and observed for each action option An input unit that accepts stimulus data representing the degree to which the individual causes a behavior by contact with the stimulus when the individual comes into contact with a stimulus greater than or equal to a predetermined first amount for each of the plurality of individuals When the individual touches a stimulus greater than or equal to a predetermined second amount, the stimulus contact effect decreases and the contact with the individual's stimulus is increased. A model relating to the occurrence of an individual's action, in which a rate λ (t) at which the individual takes action per unit time at time t is modeled so that the stimulation contact effect attenuates according to the elapsed time of A selection probability p _l (t) for selecting each option z _l when the individual takes action in the model for selection when an individual action occurs, and for each of the plurality of individuals An individual parameter that models that the parameter related to the model related to the occurrence of the behavior of the individual and the parameter related to the model related to the selection when the behavior of the individual occurs is generated according to the distribution represented by the overall parameter On the basis of the model related to the generation process and the action data and the stimulus data received by the input unit. Accordingly, the parameter estimation unit is configured to estimate the overall parameter and the individual parameter for each of the plurality of individuals.

また、本発明に係るモデル推定方法において、前記個体パラメータの生成過程に関するモデルは、１階層の階層ベイズモデルに基づきモデル化され、前記パラメータ推定部がパラメータを推定するステップは、前記入力部によって受け付けた前記行動データ及び前記刺激データに基づいて、前記全体パラメータを推定し、前記入力部によって受け付けた前記行動データ及び前記刺激データと、推定された前記全体パラメータとに基づいて、前記複数の個体の各々について前記個体パラメータを推定するようにしてもよい。 In the model estimation method according to the present invention, the model related to the individual parameter generation process is modeled based on a one-level hierarchical Bayesian model, and the step of the parameter estimation unit estimating the parameter is accepted by the input unit. Based on the behavior data and the stimulus data, the overall parameter is estimated, and based on the behavior data and the stimulus data received by the input unit and the estimated overall parameter, the plurality of individuals The individual parameters may be estimated for each.

また、本発明に係るモデル推定方法において、前記個体パラメータの生成過程に関するモデルは、２階層の階層ベイズモデルに基づきモデル化され、前記パラメータ推定部がパラメータを推定するステップは、前記入力部によって受け付けた前記行動データ及び前記刺激データに基づいて、前記全体パラメータと前記複数の個体の各々についての前記個体パラメータとを推定するようにしてもよい。 In the model estimation method according to the present invention, the model related to the individual parameter generation process is modeled based on a two-level hierarchical Bayesian model, and the step of estimating the parameter by the parameter estimation unit is accepted by the input unit. Further, based on the behavior data and the stimulus data, the overall parameter and the individual parameter for each of the plurality of individuals may be estimated.

また、本発明に係るモデル推定方法において、前記入力部が受け付けるステップは、複数の個体の各々についての属性を表す属性データを更に受け付け、前記個体パラメータの生成過程に関するモデルは、前記複数の個体の各々についての前記個体パラメータが、前記個体の属性及び前記全体パラメータが表す分布に従って生成されることをモデル化したものであって、前記パラメータ推定部がパラメータを推定するステップは、前記個体の行動が発生したときの選択に関するモデル及び前記個体パラメータの生成過程に関するモデルと、前記入力部によって受け付けた前記行動データ、前記刺激データ、及び前記属性データとに基づいて、前記全体パラメータと複数の個体の各々についての前記個体パラメータとを推定するようにしてもよい。 Further, in the model estimation method according to the present invention, the step of accepting the input unit further accepts attribute data representing an attribute for each of the plurality of individuals, and the model relating to the process of generating the individual parameters includes the plurality of individuals. The individual parameter for each is modeled that the individual parameter is generated according to the distribution represented by the attribute of the individual and the global parameter, and the step of estimating the parameter by the parameter estimation unit includes the behavior of the individual Based on the model related to the selection when generated and the model related to the generation process of the individual parameter, and the behavior data, the stimulus data, and the attribute data received by the input unit, each of the overall parameter and the plurality of individuals The individual parameters for may be estimated .

また、本発明に係る最適化方法は、入力部及び最適化部を含み、上記モデル推定方法によって推定された前記個体の個体パラメータに基づいて、前記個体に与える予定の前記選択肢ｚ_ｌに関する刺激の時刻の系列を最適化する最適化装置における最適化方法であって、前記入力部が、前記個体に与える刺激に関する拘束条件と、前記個体の前記行動データと、前記選択肢ｚ_ｌについて観測された前記個体の前記刺激データとを受け付けるステップと、前記最適化部が、前記入力部によって受け付けた、前記拘束条件、前記個体の前記行動データ、及び前記選択肢ｚ_ｌについて観測された前記個体の前記刺激データと、前記個体の個体パラメータとに基づいて、前記拘束条件を満たし、予め定められた期間までの間に、前記個体が行動を起こし、かつ、前記選択肢ｚ_ｌを選択する回数を最大化するように、前記個体に与える予定の前記選択肢ｚ_ｌに関する刺激の時刻の系列を最適化するステップと、を含んで構成されている。 In addition, the optimization method according to the present invention includes an input unit and an optimization unit, and based on the individual parameters of the individual estimated by the model estimation method, the stimulation z _{1 to} be given to the individual An optimization method in an optimization device for optimizing a time series, wherein the input unit is observed with respect to a constraint condition related to a stimulus given to the individual, the behavior data of the individual, and the option z _l Receiving the stimulus data of the individual, and the stimulus data of the individual observed for the constraint condition, the behavior data of the individual, and the option z _l received by the input unit by the optimization unit And the individual parameters of the individual satisfy the constraint condition and cause the individual to take action during a predetermined period. And said to maximize the number of times to select the option z _l, it is configured to include the steps of: optimizing the choice z _l regarding sequence of time stimulation scheduled to be given to the individual.

また、本発明に係る最適化装置は、上記モデル推定装置によって推定された前記個体の個体パラメータに基づいて、前記個体に与える予定の前記選択肢ｚ_ｌに関する刺激の時刻の系列を最適化する最適化装置であって、前記個体に与える刺激に関する拘束条件と、前記個体の前記行動データと、前記選択肢ｚ_ｌについて観測された前記個体の前記刺激データとを受け付ける入力部と、前記入力部によって受け付けた、前記拘束条件、前記個体の前記行動データ、及び前記選択肢ｚ_ｌについて観測された前記個体の前記刺激データと、前記個体の個体パラメータとに基づいて、前記拘束条件を満たし、予め定められた期間までの間に、前記個体が行動を起こし、かつ、前記選択肢ｚ_ｌを選択する回数を最大化するように、前記個体に与える予定の前記選択肢ｚ_ｌに関する刺激の時刻の系列を最適化する最適化部と、を含んで構成されている。 Further, the optimization device according to the present invention optimizes a sequence of stimulation times related to the option z _{l to} be given to the individual based on the individual parameters of the individual estimated by the model estimation device. An apparatus for receiving a constraint condition relating to a stimulus to be given to the individual, the behavior data of the individual, and the stimulus data of the individual observed with respect to the option z _l , and an input unit received by the input unit , Satisfying the constraint condition based on the individual condition parameters of the individual and the stimulus data observed for the individual, the behavior data of the individual, and the choice z _l of the individual, and a predetermined period Until the individual takes action and maximizes the number of times the option z _l is selected. And an optimizing unit that optimizes a time series of stimuli related to the predetermined option z ₁ .

本発明に係るプログラムは、上記モデル推定方法又は上記最適化方法の各ステップをコンピュータに実行させるためのプログラムである。 The program according to the present invention is a program for causing a computer to execute each step of the model estimation method or the optimization method.

本発明のモデル推定方法、装置、及びプログラムによれば、個体が予め定められた第１の量以上の刺激に接触した場合に刺激接触効果が高くなり、かつ、個体が予め定められた第２の量以上の刺激に接触した場合に刺激接触効果が低くなり、かつ、個体の刺激との接触の経過時間に応じて刺激接触効果が減衰するようにモデル化した個体の行動の発生に関するモデルと、個体が行動を起こしたときに各選択肢を選択する選択確率をモデル化した個体の行動が発生したときの選択に関するモデルと、個体の行動の発生に関するモデルに関するパラメータと個体の行動が発生したときの選択に関するモデルに関するパラメータとを表す個体パラメータが、全体パラメータが表す分布に従って生成されることをモデル化した個体パラメータの生成過程に関するモデルと、複数の個体の各々について観測された行動データ及び刺激データとに基づいて、複数の個体の各々についての個体パラメータとを推定することにより、刺激接触効果の閾値性、刺激接触効果のウェアアウト、及び刺激接触効果の忘却性を考慮した、選択行動に対する刺激接触効果を推定するためのパラメータを得ることができる、という効果が得られる。 According to the model estimation method, apparatus, and program of the present invention, the stimulus contact effect is enhanced when the individual contacts a stimulus of a predetermined first amount or more, and the individual is determined in the second A model related to the occurrence of individual behavior that is modeled so that the stimulus contact effect is low when contacted with a stimulus greater than or equal to the amount, and the stimulus contact effect is attenuated according to the elapsed time of contact with the individual stimulus. When the individual takes action, the selection probability of selecting each option is modeled. The model related to selection when the individual's action occurs, the parameter related to the model related to the occurrence of the individual action, and the action of the individual occur. The generation of individual parameters that modeled that individual parameters that represent parameters related to the selection of models are generated according to the distribution represented by the overall parameters. And the individual parameters for each of the plurality of individuals based on the behavior data and the stimulation data observed for each of the plurality of individuals, thereby determining the threshold characteristics of the stimulation contact effect and the stimulus contact effect. An effect is obtained that it is possible to obtain a parameter for estimating the stimulus contact effect on the selected action in consideration of the wearout and forgetting property of the stimulus contact effect.

また、本発明の最適化方法、装置、及びプログラムによれば、個体に与える刺激に関する拘束条件、個体の行動データ、及び選択肢ｚ_ｌについて観測された個体の刺激データと、モデル推定装置によって推定された個体の個体パラメータとに基づいて、拘束条件を満たし、予め定められた期間までの間に、個体が行動を起こし、かつ、選択肢ｚ_ｌを選択する回数を最大化するように、個体に与える予定の選択肢ｚ_ｌに関する刺激の時刻の系列を最適化することにより、刺激接触効果の閾値性、刺激接触効果のウェアアウト、及び刺激接触効果の忘却性を考慮して、個体に特定の選択行動を起こさせるための刺激を最適化することができる、という効果が得られる。 Further, according to the optimization method, apparatus, and program of the present invention, the constraint condition related to the stimulus given to the individual, the behavior data of the individual, and the individual stimulation data observed for the option z _l are estimated by the model estimation device. Based on the individual parameters of the individual, the individual is satisfied so that the number of times that the individual takes action and selects the choice z _l during a predetermined period is maximized. By optimizing the sequence of stimuli time for the scheduled option z _l , the selection behavior specific to the individual is taken into account in consideration of the threshold value of the stimulus contact effect, the wear out of the stimulus contact effect, and the forgetting property of the stimulus contact effect It is possible to optimize the stimulus for causing the problem.

本発明の実施の形態に係るモデル推定装置の主要構成を示すブロック図である。It is a block diagram which shows the main structures of the model estimation apparatus which concerns on embodiment of this invention. 個体の行動の発生に関するモデルの一例を示す図である。It is a figure which shows an example of the model regarding generation | occurrence | production of an individual's action. 個体の行動の発生に関するモデルの一例を示す図である。It is a figure which shows an example of the model regarding generation | occurrence | production of an individual's action. 個体の行動が発生したときの選択に関するモデルの一例を示す図である。It is a figure which shows an example of the model regarding selection when an individual's action generate | occur | produces. モデル推定装置によって推定された個体パラメータの一例を示す図である。It is a figure which shows an example of the individual parameter estimated by the model estimation apparatus. 本発明の実施の形態に係る最適化装置の主要構成を示すブロック図である。It is a block diagram which shows the main structures of the optimization apparatus which concerns on embodiment of this invention. 最適化装置によって出力された所定の選択肢に関する刺激の時刻の系列の一例を示す図である。It is a figure which shows an example of the series of the time of the stimulus regarding the predetermined option output by the optimization apparatus. 本発明の実施の形態におけるモデル推定処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the model estimation process routine in embodiment of this invention. 本発明の実施の形態における最適化処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the optimization process routine in embodiment of this invention.

以下、図面を参照して本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜本発明の実施の形態の概要＞ <Outline of Embodiment of the Present Invention>

まず、本発明の実施の形態の概要を説明する。 First, an outline of an embodiment of the present invention will be described.

本発明の実施の形態では、人間や動物等の個体に対し、ある意思決定（行動）を引き起こさせることを目的に、刺激を与える様々な状況を想定する。上記の状況としては、例えば、ある商品のテレビ/ウェブ広告と接触させることでその商品を購入させる、ある地域への旅行広告に接触させることでその地域への旅行を決断させる、ダイレクトメールと接触させることで特定の企業のホームページアドレスをクリックさせる、などがある。 In the embodiment of the present invention, various situations in which stimulation is given are assumed for the purpose of causing a certain decision (action) to an individual such as a human being or an animal. For example, contact with direct mail, contact with a TV / web advertisement for a product, purchase the product, contact with a travel advertisement for a region, and decide to travel to the region. To click on the homepage address of a specific company.

本発明の実施の形態では、意思決定の発生と意思決定内容の選択との両方をモデルに組み込み、かつ刺激接触効果の閾値性、ウェアアウト、忘却性を同時に考慮した高精度な刺激接触効果の推定を個体単位で実現することを可能とする技術を提供することを目的とする。 In the embodiment of the present invention, both the generation of decision making and the selection of decision making contents are incorporated into the model, and the high-precision stimulation contact effect considering the threshold property, wearout, and forgetting property of the stimulation contact effect at the same time. An object of the present invention is to provide a technique that makes it possible to realize estimation on an individual basis.

本発明の実施の形態では、モデル推定装置と最適化装置とに本発明を適用した場合を例に説明する。まず、モデル推定装置においてデータから個体パラメータが推定される。個体パラメータは、個体毎の広告接触効果の評価値である。次の最適化装置においては、推定された個体パラメータに基づき広告効果を最大化する刺激接触が決定される。 In the embodiment of the present invention, a case where the present invention is applied to a model estimation device and an optimization device will be described as an example. First, an individual parameter is estimated from data in a model estimation device. The individual parameter is an evaluation value of the advertisement contact effect for each individual. In the next optimizing device, the stimulation contact that maximizes the advertising effect is determined based on the estimated individual parameters.

＜本発明の実施の形態に係るモデル推定装置の構成＞
本発明の実施の形態に係るモデル推定装置１００の構成について説明する。 <Configuration of Model Estimation Device According to Embodiment of the Present Invention>
The configuration of model estimation apparatus 100 according to the embodiment of the present invention will be described.

図１に示すように、本実施の形態に係るモデル推定装置１００は、ＣＰＵと、ＲＡＭと、後述するモデル推定処理ルーチンを実行するためのプログラムや各種データを記憶したＲＯＭと、を含むコンピュータで構成することが出来る。このモデル推定装置１００は、機能的には図１に示すように入力部１０と、演算部２０と、出力部５０とを備えている。 As shown in FIG. 1, a model estimation apparatus 100 according to the present embodiment is a computer that includes a CPU, a RAM, and a ROM that stores a program and various data for executing a model estimation processing routine described later. Can be configured. Functionally, the model estimation apparatus 100 includes an input unit 10, a calculation unit 20, and an output unit 50 as shown in FIG.

入力部１０は、行動データ、刺激データ、及び属性データを受け付ける。 The input unit 10 receives behavior data, stimulus data, and attribute data.

行動データは、複数の個体の各々について観測された、個体が行動を起こした時刻の系列を表す行動時刻系列データと、個体が行動したときの行動の選択内容の系列を表す選択系列データと、観測期間とを含んでいる。行動データは、解析対象である複数の個体（例えば、人間または動物を表すものとする）の行動の時系列データを表す。 Behavior data is observed for each of a plurality of individuals, behavior time series data representing a series of times when an individual took action, selection series data representing a series of action selection contents when an individual acts, Including the observation period. The action data represents time-series data of actions of a plurality of individuals to be analyzed (for example, humans or animals).

具体的には、行動データは、複数の個体の各々についての、個体ＩＤｕ、個体ｕの行動を起こした時刻ｔ_ｊ ^ｕの系列を表す行動時刻系列データ（｛ｔ_ｊ ^ｕ｝＝（ｔ_０ ^ｕ，ｔ_１ ^ｕ，ｔ_２ ^ｕ，…））、個体ｕが行動したときの行動の選択内容ｖ_ｊ ^ｕの系列を表す選択系列データ（｛ｖ_ｊ ^ｕ｝＝（ｖ_０ ^ｕ，ｖ_１ ^ｕ，ｖ_２ ^ｕ，…））、観測期間（開始時刻をＴ_ｓ、終了時刻をＴ_ｅとして[Ｔ_ｓ，Ｔ_ｅ]と表記）とを含んで構成される。ただし、行動の選択肢はｖ∈｛ｚ_１，ｚ_２，ｚ_３，…｝とする。また、ｊは行動の順番を表すインデックスである。 Specifically, the action data is action time series data ({t _j ^u } = (t ₀ ^u ) representing a series of the individual ID u and the time t _j ^u at which the action of the individual u occurred for each of a plurality of individuals. _{^{_{^{, t 1 u, t 2 u}}}} , ...)), selection of actions when the individual u acted _v ^j selected sequence data representing a sequence of ^{_{^{_{^{u ({v j u} =}}}}} (v 0 u, v 1 u, v ₂ ^u, ...)), the observation period (start time _{T s,} constituted the end time _[T s, _{T e]} as _{T e} the drawing) and contain. However, the action options are v∈ {z ₁ , z ₂ , z ₃ ,. J is an index representing the order of actions.

刺激データは、複数の個体ｕの各々について観測され、かつ行動の選択肢ｚ_ｌ毎に観測された、個体ｕが刺激ｚ_ｌに接触した時刻 Stimulation data is observed for each of a plurality of individuals u and observed for each action option z _{l, the} time when the individual u contacts the stimulus z _l

の系列を表すデータである。本実施の形態では、刺激データを This data represents a series of In this embodiment, the stimulus data is

と表す。例えば、個体をユーザ、刺激をウェブ広告とすれば、各ユーザが広告を見た時刻の列が刺激データとして入力される。 It expresses. For example, if an individual is a user and a stimulus is a web advertisement, a sequence of times when each user views the advertisement is input as stimulus data.

属性データは、複数の個体ｕの各々についての属性を表す。本実施の形態では、属性データをｄ^ｕと表す。属性データとしては、例えばユーザの性別、年齢、収入などを用いる。ただし、本実施の形態に係るモデル推定装置において属性情報は必須ではない。 The attribute data represents an attribute for each of a plurality of individuals u. In this embodiment, it represents the attribute data and d ^u. As the attribute data, for example, the gender, age, income, etc. of the user are used. However, attribute information is not essential in the model estimation apparatus according to the present embodiment.

また、入力部１０は、後述する選択部３２において用いる階層ベイズモデルの選択に関する情報を受け付ける。 The input unit 10 also receives information related to selection of a hierarchical Bayesian model used in the selection unit 32 described later.

演算部２０は、記憶部２２と、推定部３０とを含んで構成されている。 The calculation unit 20 includes a storage unit 22 and an estimation unit 30.

記憶部２２には、個体の行動の発生に関するモデルと、個体の行動が発生したときの選択に関するモデルと、個体パラメータの生成過程に関するモデルとが格納されている。個体の行動の発生に関するモデルについては、後述する［例１−１］〜［例１−５］の少なくとも１つを組み合わせたモデルが格納されている。また、個体の行動が発生したときの選択に関するモデルについては、後述する［例２−１］〜［例２−４］の少なくとも１つを組み合わせたモデルが格納されている。なお、個体パラメータの生成過程に関するモデルについては、後述する１階層のベイズモデルによってモデル化されたモデルと、２階層のベイズモデルによってモデル化されたモデルとが格納されている。 The storage unit 22 stores a model related to the occurrence of individual behavior, a model related to selection when an individual behavior occurs, and a model related to the generation process of individual parameters. As a model relating to the occurrence of individual behavior, a model in which at least one of [Example 1-1] to [Example 1-5] described later is combined is stored. In addition, as a model related to selection when an individual's action occurs, a model in which at least one of [Example 2-1] to [Example 2-4] described later is combined is stored. Note that the model related to the individual parameter generation process stores a model modeled by a one-level Bayesian model, which will be described later, and a model modeled by a two-layer Bayesian model.

ここで、本実施の形態におけるモデル化の原理について説明する。 Here, the principle of modeling in this embodiment will be described.

本発明の実施の形態で仮定される数理モデルは、個体の行動の発生に関するモデルに、個体の行動が発生したときの選択に関するモデルを統合したモデルである。以下、個体の行動の発生に関するモデルと、個体の行動が発生したときの選択に関するモデルと、個体パラメータの生成過程に関するモデルとについて、具体的に説明する。 The mathematical model assumed in the embodiment of the present invention is a model in which a model relating to the occurrence of an individual's behavior is integrated with a model relating to selection when the behavior of the individual occurs. Hereinafter, a model relating to the occurrence of individual behavior, a model relating to selection when an individual behavior occurs, and a model relating to the process of generating individual parameters will be described in detail.

［個体の行動の発生に関するモデル］ [Model of behavior of individuals]

まず、個体の行動の発生に関するモデルについて説明する。
本実施の形態では、個体の行動の発生に関するモデルを、複数の個体の各々についての、個体が予め定められた第１の量以上の刺激に接触した場合に、個体が刺激との接触によって行動を起こす度合いを表す刺激接触効果が高くなり、かつ、個体が予め定められた第２の量以上の刺激に接触した場合に刺激接触効果が低くなり、かつ、個体の刺激との接触の経過時間に応じて刺激接触効果が減衰するように、時刻ｔにおける単位時間当たりに個体が行動を起こす確率λ（ｔ）をモデル化する。 First, a model relating to the occurrence of individual behavior will be described.
In the present embodiment, a model relating to the occurrence of an individual's behavior is obtained by selecting the behavior of the individual by contact with the stimulus when the individual contacts a predetermined amount or more of the stimulus for each of the plurality of individuals. Stimulus contact effect indicating the degree of occurrence of the stimulation is increased, and when the individual contacts a predetermined amount or more of the stimulus, the stimulation contact effect is decreased, and the elapsed time of contact with the individual stimulus The probability λ (t) that the individual takes an action per unit time at time t is modeled so that the stimulus contact effect is attenuated according to.

本実施の形態では、個体の行動の発生に関するモデルを、個体が予め定められた第１の量以上の刺激に接触した場合に、個体が刺激との接触によって行動を起こす度合いを表す刺激接触効果が高くなるようにモデル化し、刺激接触効果の閾値性を表現する。
また、個体の行動の発生に関するモデルを、個体が予め定められた第２の量以上の刺激に接触した場合に刺激接触効果が低くなるようにモデル化し、刺激接触効果のウェアアウトを表現する。
また、個体の行動の発生に関するモデルを、個体の刺激との接触の経過時間に応じて刺激接触効果が減衰するようにモデル化し、刺激接触効果の忘却性を表現する。 In the present embodiment, a stimulus contact effect representing a degree of an individual's behavior when contacted with a stimulus when the individual contacts a stimulus of a predetermined first amount or more as a model related to the occurrence of the behavior of the individual. Is modeled so that the threshold value of the stimulus contact effect is expressed.
In addition, a model relating to the occurrence of the behavior of the individual is modeled so that the stimulus contact effect is lowered when the individual contacts a predetermined amount or more of the stimulus, thereby expressing the wear-out of the stimulus contact effect.
In addition, a model relating to the occurrence of the individual's behavior is modeled so that the stimulus contact effect is attenuated according to the elapsed time of contact with the individual's stimulus, thereby expressing the forgetting property of the stimulus contact effect.

また、個体の行動の発生に関するモデルは、「点過程」と呼ばれる確率理論の下で、個体の行動は単位時間当りに個体が行動を起こす確率λ（ｔ）によって表現される。ここでｔは時刻を表す。λ（ｔ）はレートと称される。刺激接触効果の閾値性、ウェアアウト、忘却性それぞれのモデル化は、このレートを通じて実現される。以下、モデル化の例をいくつか挙げる。ただし、閾値性、ウェアアウト、忘却性それぞれのモデル化は以下の例に限定されるものではない。 In addition, a model relating to the occurrence of an individual's behavior is expressed by a probability λ (t) of the individual's behavior per unit time under a probability theory called “point process”. Here, t represents time. λ (t) is called a rate. Modeling of the threshold property, the wearout, and the forgetting property of the stimulus contact effect is realized through this rate. Here are some examples of modeling. However, the modeling of the threshold value, the wearout, and the forgetting property is not limited to the following examples.

また、本実施の形態で用いる刺激データにおいて、各個体の刺激接触時刻は行動内容の選択肢ごとに与えられるが（ In addition, in the stimulus data used in the present embodiment, the stimulus contact time of each individual is given for each action content option (

）、いずれの選択肢に対する刺激も、行動の発生そのものに対する刺激となりうる。そこで、以下のモデル化の例では行動の発生に対する刺激｛ｓ_ｊ ^ｕ｝を ), A stimulus for any option can be a stimulus for the occurrence of behavior itself. Therefore, in the following modeling example, the stimulus {s _j ^u } for the occurrence of the action is

と定義する。すなわち、いずれの選択肢に対する刺激も等しく行動の発生に対する刺激と見なす。無論、各選択肢に対する刺激を区別して考えてもよい。 It is defined as That is, a stimulus for any option is equally regarded as a stimulus for the occurrence of behavior. Of course, the stimulus for each option may be distinguished.

また、以下の数式は個体ｕごとの確率の議論であり、簡単のため｛ｔ_ｊ ^ｕ｝，｛ｖ_ｊ ^ｕ｝，｛ｓ_ｊ ^ｕ｝を単に｛ｔ_ｊ｝，｛ｖ_ｊ｝，｛ｓ_ｊ｝を表記するものとする。 Further, the following equation is discussion of probability for each individual u, easy for _{^{_{^{{t j u}, {v}}}} j u}, {s j u} simply _{_{{t j}, {v j}} }, {s _j }.

［例１−１：ウェアアウトを強度の減衰としてモデル化する場合］
まず刺激接触の度に刺激の効果が蓄積され、その効果が時間経過と共に指数的に忘却されると仮定すると、刺激効果の蓄積（以下、刺激ストックと称する）ｎ（ｔ）は、以下の式（１）で表現される。 [Example 1-1: Modeling wearout as intensity attenuation]
First, assuming that a stimulus effect is accumulated every time a stimulus is touched, and that the effect is exponentially forgotten over time, the accumulation of stimulus effect (hereinafter referred to as stimulus stock) n (t) is expressed by the following equation: It is expressed by (1).

ただし、上記式（１）のａ_ｓは一度当りの刺激接触効果の強度、 However, a _s is the intensity of the stimulation contact effect of per once in the above formula (1),

は刺激接触効果の持続時間、ｓ_ｉはｉ番目の刺激接触時刻を表す。次に、刺激ストックがある閾値ｎ_ｔｈを超える期間のみレートλ（ｔ）に刺激接触の効果が現れると仮定すると、レートλ（ｔ）は以下の式（２）に示すように表現することができる。 Represents the duration of the stimulation contact effect, and s _i represents the i-th stimulation contact time. Next, assuming that the effect of stimulation contact appears at the rate λ (t) only during a period when the stimulation stock exceeds a certain threshold value n _th , the rate λ (t) can be expressed as shown in the following equation (2). it can.

ただし、上記式（２）のλ_０は基礎レート、λ_１は励起レート、ｎ_ｔｈは刺激接触効果の閾値、関数Ｇ（ｙ）は非線形関数を表す。非線形関数Ｇは刺激接触の閾値性をモデル化するものであり、例えば、以下の式（３）に示すようなシグモイド型の関数を用いれば良い。 In the above formula (2), λ ₀ represents a basal rate, λ ₁ represents an excitation rate, n _th represents a threshold value of a stimulation contact effect, and a function G (y) represents a nonlinear function. The nonlinear function G models the threshold value of the stimulation contact. For example, a sigmoid function as shown in the following formula (3) may be used.

さらに、上記式（１）の強度ａ_ｓが、以下の式（４）に示すようなダイナミクスを持つと仮定する。 Furthermore, it is assumed that the intensity a _s of the above formula (1) has dynamics as shown in the following formula (4).

ただし、上記式（４）のａ_０は基礎強度、ｓ_ｉはｉ番目の刺激接触時刻、 However, _{a 0} is basic strength of the above formula (4), _{s i} is the i-th stimulation contact time,

は一度の刺激接触によるウェアアウトの強さ、そして Is the strength of wear-out due to a single stimulus contact, and

はウェアアウトの持続時間である。上記式（３）の一つ目の関数を採用したときのレートの挙動例を図２に示す。図２に示すように、個々の刺激接触の効果が減衰する（忘却性）、その効果が閾値を超えない限りレートへの影響が現れない（閾値性）様子が確認できる。また、過度に頻繁な刺激接触は刺激接触強度ａ_ｓの減退を招き、刺激接触強度ａ_ｓの減退が招かれる結果、レートの減退を生じさせる。レートの減退は、例えば過度なウェブ広告により購買意欲が減退する状況を表現している。 Is the wearout duration. An example of rate behavior when the first function of the above equation (3) is employed is shown in FIG. As shown in FIG. 2, it can be confirmed that the effect of individual stimulation is attenuated (forgetting), and the effect on the rate does not appear (thresholding) unless the effect exceeds the threshold. Also, too frequent stimulation contact leads to decline in stimulating contact strength a _s, a result of a decline in stimulating contact strength a _s are invited, cause decline of the rate. The decline in the rate expresses a situation in which the willingness to purchase is reduced due to, for example, excessive web advertisement.

また、以下の式（５）に示すように、忘却性は指数的な減衰の他に、ベキ的な減衰 In addition, as shown in the following formula (5), forgetting is not only exponential attenuation but also power attenuation.

を用いてモデル化することもできる。 Can also be used to model.

［例１−２：ウェアアウトを刺激ストックと独立にモデル化する場合］
上記例１−１では、ウェアアウトは強度ａ_ｓの減衰として表現されたが、ここでは以下の式（６）及び式（７）に示すように、刺激ストックｎ（ｔ）とは別にウェアアウトの状態ｍ（ｔ）を考える。 [Example 1-2: Modeling wearout independently of stimulus stock]
In the example 1-1, but wear-out was expressed as the attenuation of the intensity a _s, here, as shown in the following equation (6) and (7), apart from wear-out and stimulus stock n (t) Consider state m (t).

ただし、ａ_ｗは一度の刺激接触によるウェアアウトの増加量、 However, _aw is the increase in wearout due to a single stimulus contact,

はウェアアウトの持続時間、ｍ_ｔｈはウェアアウトの閾値である。上記式（６）において、刺激ストックｎ（ｔ）が閾値ｎ_ｔｈ以上かつウェアアウトの状態ｍ（ｔ）が閾値ｍ_ｔｈ以下のときレートは励起状態λ_１となる。また、上記式（７）において、刺激ストックｎ（ｔ）とウェアアウトの状態ｍ（ｔ）は共に刺激への接触の度に増加し、接触が無ければ時間とともにゼロに漸近する。そのため、過少な刺激接触では刺激ストックが閾値を下回りレートが上昇せず、また過度に頻繁な刺激接触でもウェアアウト状態が閾値を上回りレートが上昇しない。図３にウェアアウトを刺激ストックと独立にモデル化した場合の挙動例を示す。 Is the wearout duration and mth is the _wearout threshold. In the above formula (6), stimulation stock n (t) is the threshold value _{n th} or more and wear out the state m (t) is the threshold value _{m th} following time rate is excited lambda _1. In the above equation (7), both the stimulus stock n (t) and the wear-out state m (t) increase every time they come into contact with the stimulus, and gradually approach zero with time if there is no contact. For this reason, the stimulation stock does not increase below the threshold when the stimulation is too low, and the rate does not increase when the wearout state exceeds the threshold even when the stimulation is excessively frequent. FIG. 3 shows an example of behavior when wearout is modeled independently of the stimulus stock.

［例１−３：刺激データが期間で与えられた場合］
上記例１−１及び上記例１−２では、刺激接触時刻の情報しか考慮しなかったが、例えば１分間と１０分間のウェブＣＭでは１度の接触で与えるインパクトは異なりうる。その差異をモデルに取り入れる方法の１つとして、以下の式（８）に示すような方法が考えられる。 [Example 1-3: When stimulus data is given by period]
In the example 1-1 and the example 1-2, only the information on the stimulation contact time is considered. However, for example, the impact given by a single contact may be different in the web CM of 1 minute and 10 minutes. As one method for incorporating the difference into the model, a method shown in the following formula (8) is conceivable.

上記式（８）のｗ_ｋ ^{ｓｔａｒｔ}、ｗ_ｋ ^ｅｎｄはそれぞれｋ番目の刺激の接触開始時刻と接触終了時刻である。接触時間が長いとそれだけ受ける影響が大きくなる。ｇ（ｔ−ｔ’）は畳み込み関数であり、上記例１−１及び上記例１−２と同様に、指数関数やベキ関数などを当てはめることができる。上記例１−２の場合はウェアアウト状態ｍ（ｔ）についても上記式（７）と同様のモデルを考えてもよい。 In the above equation (8), w _k ^start and w _k ^end are the contact start time and the contact end time of the k-th stimulus, respectively. The longer the contact time, the greater the impact. g (t−t ′) is a convolution function, and an exponential function, a power function, or the like can be applied in the same manner as in Example 1-1 and Example 1-2. In the case of Example 1-2, a model similar to the above equation (7) may be considered for the wear-out state m (t).

［例１−４：過去の行動履歴からの影響を考慮する場合］
上記例１−１〜上記例１−３では、刺激接触の効果のみを考慮していたが、過去の自身の行動によって現在の行動が影響を受ける状況も考えられる。例えば、コーヒーやタバコのような嗜好品では自身の購買が次回の購買を促す場合がある。その場合、Ｈａｗｋｅｓ過程（例えば、以下の非特許文献１０を参照）を用いてレートは、以下の式（９）の形でモデル化することができる。 [Example 1-4: Considering influence from past action history]
In Example 1-1 to Example 1-3 described above, only the effect of stimulating contact was considered, but a situation in which the current behavior is affected by the past own behavior is also conceivable. For example, in the case of a luxury item such as coffee or tobacco, the purchase of the user may prompt the next purchase. In that case, the rate can be modeled in the form of the following equation (9) using the Hawkes process (see, for example, Non-Patent Document 10 below).

（非特許文献１０）
A. Hawkes, "Spectra of some self-exciting and mutually exciting point processes", Biometrika, Vol. 58, No. 1, pp. 83-90, 1971. (Non-Patent Document 10)
A. Hawkes, "Spectra of some self-exciting and mutually exciting point processes", Biometrika, Vol. 58, No. 1, pp. 83-90, 1971.

ただし、上記式（９）のｔ_ｊは個体のｊ番目の行動時刻、 Where t _j in the above equation (9) is the j-th action time of the individual,

はインパクトの強さ、 Is the strength of impact,

は効果の持続時間を表す。個体によっては刺激接触ではなく自身の行動リズムに基づいて行動を起こしている場合もある。刺激の影響と行動履歴からの影響を同時に考慮することで、刺激の効果をより正しく推定することができると期待される。ただし、行動履歴のモデル化は上記式（９）に限るものではない。 Represents the duration of the effect. Some individuals may behave based on their own behavioral rhythm rather than stimulating contact. It is expected that the effect of the stimulus can be estimated more correctly by simultaneously considering the effect of the stimulus and the effect from the action history. However, the behavior history modeling is not limited to the above equation (9).

［例１−５：刺激以外の共変量の影響を考慮する場合］
刺激以外の共変量Ｙ（ｔ）の影響を受けてレートが変動する状況を考えることもできる。例えば、消費財の購買行動における家庭内在庫が挙げられる。家庭内在庫の減少と共に購買確率は上昇し、購買後、在庫は回復し購買確率が下降するという状況は、これまでの例で定義されたレートλ（ｔ）を [Example 1-5: Considering the influence of covariates other than stimuli]
A situation where the rate fluctuates under the influence of a covariate Y (t) other than the stimulus can also be considered. For example, in-house inventory in consumer goods purchasing behavior. The situation in which the purchase probability rises with a decrease in domestic stock, the stock recovers after purchase and the purchase probability declines. The rate λ (t) defined in the previous examples is

と修正することで実現される。ただし、共変量の影響のモデル化は上記に限るものではない。 It is realized by correcting. However, the modeling of covariate effects is not limited to the above.

［個体の行動が発生したときの選択に関するモデル］
次に、個体の行動が発生したときの選択に関するモデルについて説明する。
本実施の形態では、時刻ｔにおいて個体が行動を起こしたときに各選択肢ｚ_ｌを選択する選択確率ｐ_ｌ（ｔ）をモデル化して、個体の行動が発生したときの選択に関するモデルとする。 [Model for selection when individual behavior occurs]
Next, a model related to selection when an individual's action occurs will be described.
In the present embodiment, the selection probability p _l (t) for selecting each option z _l when the individual takes action at time t is modeled, and the model is related to selection when the individual action occurs.

具体的には、上記で構築された個体の行動の発生に関するモデルに基づき、個体の行動の発生時刻が決定されると、行動が発生した時刻ｔでの選択内容は離散分布ｐ_{ｃｈｏｉｃｅ}（ｚ｜ｔ）に基づき決定される。ただし、行動の選択肢はｚ∈｛ｚ_１，ｚ_２，ｚ_３，…｝とする。以後、ｐ_{ｃｈｏｉｃｅ}（ｚ｜ｔ）を選択確率と表記する。 Specifically, when the occurrence time of an individual's action is determined based on the model regarding the occurrence of the action of the individual constructed as described above, the selection content at the time t when the action occurs is determined by the discrete distribution _pchoice (z | t). However, the action options are z∈ {z ₁ , z ₂ , z ₃ ,. Hereinafter, p _choice (z | t) is expressed as a selection probability.

選択確率ｐ_{ｃｈｏｉｃｅ}（ｚ｜ｔ）を以下の式（１０）で表現することができる。 The selection probability p _choice (z | t) can be expressed by the following equation (10).

ただし、ｐ_ｌ（ｔ）は個体が時刻ｔに行動を起こした時にｚ_ｌを選択する確率を表し、０から１の間の値を取る。例えば、ｚ_ｌはある商品カテゴリ（バッグなど）のｌ番目のブランドを表す。上記で構築されたレートλ（ｔ）と上記式（１０）で表される選択確率を統合することで、個体が単位時間当りに行動を起こしかつｚ_ｌを選択する確率λ_ｚ（ｔ）は、以下の式（１１）によって表現することができる。 Here, p _l (t) represents the probability of selecting z _l when the individual takes action at time t, and takes a value between 0 and 1. For example, z _l represents the l-th brand of a certain product category (such as a bag). By integrating the rate λ (t) constructed above and the selection probability expressed by the above equation (10), the probability λ _z (t) that the individual takes action per unit time and selects z _l is And can be expressed by the following equation (11).

なお、上記式（１１）のλ_ｚ（ｔ）は選択ｚ_ｌに対するレートと解釈できる。 Note that λ _z (t) in the above equation (11) can be interpreted as a rate for the selection z _l .

選択確率のモデル化に際しては、刺激接触効果を全く含まないモデル、刺激接触効果の忘却性のみを考慮したモデル、忘却性と閾値性を考慮したモデル、そして閾値性、ウェアアウト、忘却性のすべてを同時に考慮したモデル、のいずれのモデルを採用してもよい。以下、それぞれのモデル化の例を挙げるが、選択確率のモデル化は以下に限るものではない。 When modeling selection probabilities, a model that does not include stimulus contact effects at all, a model that only considers forgetting of stimulus contact effects, a model that considers forgetting and thresholding, and threshold, wearout, and forgetting Any model may be adopted that takes into account simultaneously. Hereinafter, examples of each modeling will be described, but the modeling of the selection probability is not limited to the following.

［例２−１：刺激接触効果を含まないモデル］
個体の行動内容の選択が外的な要因の影響を一切受けない場合、選択確率ｐ_{ｃｈｏｉｃｅ}（ｚ）は上記式（１０）を、以下の式（１２）として表現することができる。 [Example 2-1: Model not including stimulus contact effect]
When the selection of the action content of the individual is not affected by any external factor, the selection probability _pchoice (z) can express the above equation (10) as the following equation (12).

上記式（１２）のｚ_ｌを選択する確率ｐ_ｌは個体毎に行動データから推定されることになる。 The probability p _l of selecting z _l in the above equation (12) is estimated from the behavior data for each individual.

行動内容の選択が各選択肢｛ｚ_１，ｚ_２，ｚ_３，…｝の付随情報｛Ｘ_１（ｔ），Ｘ_２（ｔ），Ｘ_３（ｔ）,…｝から影響を受ける場合、選択確率ｐ_{ｃｈｏｉｃｅ}（ｚ）は多項ロジットモデルを用いて、以下の式（１３）のように表現することができる。 If the selection of action content is affected by the accompanying information {X ₁ (t), X ₂ (t), X ₃ (t),...} Of each option {z ₁ , z ₂ , z ₃ ,. The probability p _choice (z) can be expressed as in the following equation (13) using a multinomial logit model.

ただし、上記式（１３）のξは付随情報の重み係数（反応係数）、Ｕ_ｌ（ｔ）は個体が選択ｚ_ｌを選択する確率を決める関数（以後、効用関数と呼ぶ）である。付随情報はデータとして個体毎に与えられ、ｑ_ｌとξは個体毎に行動データから推定されることになる。 In the above equation (13), ξ is a weighting coefficient (response coefficient) of the accompanying information, and U _l (t) is a function (hereinafter referred to as utility function) that determines the probability that the individual selects the selection z _l . The accompanying information is given as data for each individual, and q _l and ξ are estimated from the behavior data for each individual.

例えば、ある商品カテゴリのブランド選択の場合、付随情報とは各ブランドの値段や値引き率、（ブランド）ロイヤルティなどが挙げられる。付随情報が手に入らない場合は、Ｘ_ｌ（ｔ）＝０かつｅｘｐ（ｑ_ｌ）＝ｐ_ｌとして上記式（１２）に帰着される。 For example, in the case of brand selection for a certain product category, the accompanying information includes the price and discount rate of each brand, (brand) loyalty, and the like. When the accompanying information is not available, X ₁ (t) = 0 and exp (q ₁ ) = p ₁ are used, resulting in the above equation (12).

［例２−２：刺激接触効果の忘却性のみを考慮したモデル］ [Example 2-2: Model that considers only forgettingness of stimulus contact effect]

上記式（１０）の選択確率は、以下の式（１４）で表現することができる。 The selection probability of the above equation (10) can be expressed by the following equation (14).

ただし、上記式（１４）のｎ_ｌ（ｔ）は選択肢ｚ_ｌに対して蓄積された刺激量（刺激ストック）、 However, n _l (t) in the above formula (14) is the amount of stimulation (stimulation stock) accumulated for the choice z _l ,

と When

はそれぞれ選択肢ｚ_ｌに関する刺激接触効果の強度と持続時間（忘却の時定数）である。刺激接触効果の忘却性のみを考慮したモデルの一例を図４に示す。 Are the intensity and duration of the stimulus contact effect for each option z _l (forgetting time constant). An example of a model that considers only the forgettingness of the stimulus contact effect is shown in FIG.

［例３：刺激接触効果の忘却性と閾値性を考慮したモデル］
上記式（１０）の選択確率は、以下の式（１４−２）で表現することができる。 [Example 3: Model considering forgetting property and threshold value of stimulus contact effect]
The selection probability of the above equation (10) can be expressed by the following equation (14-2).

ただし、上記式（１４−２）のｑ_ｌ’は選択肢ｚ_ｌの刺激ストックが閾値ｎ_ｔｈを上回った時の効用関数の上昇分を表す。Ｇ（ｙ）は上記式（３）での定義と同じである。 However, q _l ′ in the above formula (14-2) represents an increase in the utility function when the stimulus stock of the option z _l exceeds the threshold value n _th . G (y) is the same as defined in the above formula (3).

あるいは、上記式（１４−２）における効用関数を、以下の式（１５）に差し替えたモデルを考えることもできる。 Or the model which replaced the utility function in the said Formula (14-2) by the following formula | equation (15) can also be considered.

上記式（１５）に差し替えた場合、刺激ストックが閾値を上回っていない選択肢が選ばれる確率は０となる。なお、上記式（１５）と同等のモデルが上記非特許文献１で提案されている。 When the formula (15) is replaced, the probability that an option whose stimulation stock does not exceed the threshold is selected is zero. Note that a model equivalent to the above equation (15) is proposed in Non-Patent Document 1.

［例２−４：刺激接触効果の閾値性、ウェアアウト、忘却性を考慮したモデル］
上記式（１０）の選択確率は、以下の式（１６）で表現することができる。 [Example 2-4: Model considering the threshold value, wearout, and forgetting property of the stimulation contact effect]
The selection probability of the above equation (10) can be expressed by the following equation (16).

上記式（１４−２）とは異なり、刺激ストックｎ_ｌ（ｔ）が閾値を上回りかつウェアアウトの状態ｍ_ｌ（ｔ）が閾値を下回るときのみ刺激接触の効果ｑ_ｌ’が効用関数に現れる。 Unlike the above equation (14-2), only when the stimulus stock n _l (t) exceeds the threshold and the wear-out state m _l (t) falls below the threshold, the effect of the stimulation contact q _l ′ appears in the utility function. .

また、効果は反応係数ξにも現われると考えてもよい。例えば、刺激接触効果が効用関数に現れている間は価格感度が弱まるとすると、反応係数の価格成分ξ_{ｐｒｉｃｅ}は、以下の式（１７）で表現することができる。 It may be considered that the effect also appears in the reaction coefficient ξ. For example, if the price sensitivity is weakened while the stimulus contact effect appears in the utility function, the price component ξ _price of the response coefficient can be expressed by the following equation (17).

ただし反応係数の定義の仕方は上記式（１７）に限るものではない。上記式（１４−２）においても同様である。 However, the method of defining the reaction coefficient is not limited to the above equation (17). The same applies to the above formula (14-2).

［個体パラメータの生成過程に関するモデル］
次に、個体パラメータの生成過程に関するモデルについて説明する。
本実施の形態では、個体パラメータの生成過程に関するモデルについて、複数の個体の各々についての、個体の行動の発生に関するモデルに関するパラメータと個体の行動が発生したときの選択に関するモデルに関するパラメータとを表す個体パラメータが、全体パラメータが表す分布に従って生成されることをモデル化する。 [Model for individual parameter generation process]
Next, a model relating to an individual parameter generation process will be described.
In the present embodiment, for a model relating to an individual parameter generation process, an individual representing a parameter relating to a model relating to generation of an individual behavior and a parameter relating to a model relating to selection when the behavior of the individual occurs for each of a plurality of individuals. Model that the parameters are generated according to the distribution represented by the global parameters.

個体の行動の発生に関するモデル及び個体の行動が発生したときの選択に関するモデルのモデル化では、モデル化に関するパラメータ群（上述の例ではａ_ｓ，ｂ_ｓ，ａ_０，ａ_１，λ_０，λ_１，ξ，ｑ_１など）が定義される。モデル化に関するパラメータ群をまとめて個体パラメータθ≡（ａ_ｓ，ｂ_ｓ，ａ_０，ａ_１，λ_０，λ_１，ξ，ｑ_１…)と表記する。 In modeling a model relating to the occurrence of an individual behavior and a model relating to selection when an individual behavior occurs, a parameter group relating to modeling (in the above example, a _s , b _s , a ₀ , a ₁ , λ ₀ , λ ₁ , ξ, q _1, etc.) are defined. A group of parameters related to modeling is collectively expressed as an individual parameter θ≡ (a _s , b _s , a ₀ , a ₁ , λ ₀ , λ ₁ , ξ, q ₁ ...).

個体パラメータθは個体ごとに異なりうるので、個体ｕのパラメータを特にθ^ｕと表記する。本処理ブロックでは、個体間でのパラメータの違いを説明するため、個体パラメータの生成過程を階層ベイズモデルに基づきモデル化する。ただし、階層の数が１の場合と２の場合で以後の処理が異なる。 Since the individual parameter θ can be different for each individual, the parameter of the individual ^u is expressed as θ ^u in particular. In this processing block, the individual parameter generation process is modeled based on a hierarchical Bayesian model in order to explain the difference in parameters between individuals. However, the subsequent processing differs depending on whether the number of layers is 1 or 2.

［１階層の階層ベイズモデルの場合］
個体間での個体パラメータの違いをモデル化するため、個体パラメータθ^ｕが以下の式（１８）の正規分布から生成されると仮定する。 [One-level hierarchical Bayesian model]
In order to model the difference in individual parameter between individuals, it is assumed that the individual parameter θ ^u is generated from the normal distribution of the following equation (18).

ただし、上記式（１８）のｄ^ｕは第一成分が１（回帰の切片を表す）、第二成分以降が個体ｕの属性変数である縦ベクトル、βは行数がｄ^ｕのランク、列数がθ^ｕのランクに一致する行列である。また、β’はβの転置行列、Ｎ（θ^ｕ｜ｘ，ｙ）は平均と分散がそれぞれｘとｙである正規分布を表す。 However, d ^u in the formula (18) (representing intercept of the regression) 1 first component, the vertical vector is the second component after an attribute variable individual u, rank β is the number of rows is d ^u, column it is a matrix that number is equal to the rank of θ ^u. Β ′ represents a transpose matrix of β, and N (θ ^u | x, y) represents a normal distribution in which the mean and variance are x and y, respectively.

βとΛは個体全体におけるθ^ｕの分布形状を決めるパラメータであるので、ここではその２種類のパラメータを全体パラメータと呼ぶこととする。全体パラメータは、後述するパラメータ推定部３４によって、データから推定されることになる。 Since β and Λ are parameters that determine the distribution shape of θ ^{u in} the whole individual, the two types of parameters are referred to as global parameters here. The overall parameters are estimated from the data by the parameter estimation unit 34 described later.

［２階層の階層ベイズモデルの場合］
１階層の階層ベイズモデルの場合と同様に、個体パラメータθ^ｕが以下の式（１９）の正規分布から生成されると仮定する。 [Two-level hierarchical Bayesian model]
As in the case of the one-level hierarchical Bayesian model, it is assumed that the individual parameter θ ^u is generated from the normal distribution of the following equation (19).

さらに、全体パラメータΛとβがそれぞれ以下の式（２０）の確率分布から生成されると仮定する。 Furthermore, it is assumed that global parameters Λ and β are generated from the probability distribution of the following equation (20).

ただし、上記式（２０）のＷｉ（θ^ｕ｜ｘ，ｙ）は自由度ｘ、尺度パラメータｙのウィッシャート分布、 Where Wi (θ ^u | x, y) in the above equation (20) is the degree of freedom x, the Wishart distribution of the scale parameter y,

はクロネッカー積を表す。全体パラメータの分布形状を決めるパラメータν_０，Λ_０，β_０，Ω_０は予め定められ、例えばモデル推定装置１００の使用者が適切に指定する。例としては上記非特許文献１の付録を参照のこと。 Represents the Kronecker product. Parameters ν ₀ , Λ ₀ , β ₀ , and Ω ₀ that determine the distribution shape of the overall parameters are determined in advance, and are appropriately specified by, for example, the user of the model estimation apparatus 100. For an example, see the appendix of Non-Patent Document 1 above.

２階層の階層ベイズモデルの場合、全体パラメータΛとβとは個体パラメータθ^ｕと同時に後述する全体及び個体パラメータ推定部４０にて推定されることになる。 For two-tier hierarchical Bayesian model, so that the entire parameter Λ and β are estimated by overall and individual parameter estimation unit 40 to the individual parameter theta ^u simultaneously below.

推定部３０は、入力部１０によって受け付けた行動データ、刺激データ、及び属性データに基づいて、記憶部２２に格納された各モデルのパラメータを推定する。推定部３０は、選択部３２と、パラメータ推定部３４とを含んで構成されている。 The estimation unit 30 estimates the parameters of each model stored in the storage unit 22 based on the behavior data, stimulus data, and attribute data received by the input unit 10. The estimation unit 30 includes a selection unit 32 and a parameter estimation unit 34.

選択部３２は、入力部１０によって受け付けた階層ベイズモデルの選択に関する情報に基づいて、後述するパラメータ推定部３４で用いる階層ベイズモデルを選択する。具体的には、選択部３２は、入力部１０によって受け付けた階層ベイズモデルの選択に関する情報に基づいて、１階層の階層ベイズモデル又は２階層の階層ベイズモデルを選択する。 The selection unit 32 selects a hierarchical Bayes model to be used by the parameter estimation unit 34 described later based on the information related to the selection of the hierarchical Bayes model received by the input unit 10. Specifically, the selection unit 32 selects a hierarchical Bayes model of one hierarchy or a hierarchical Bayes model of two hierarchies based on information regarding selection of the hierarchical Bayes model received by the input unit 10.

パラメータ推定部３４は、選択部３２によって選択された階層ベイズモデルの種類に応じて、記憶部２２に格納された個体の行動の発生に関するモデル、個体の行動が発生したときの選択に関するモデル、及び個体パラメータの生成過程に関するモデルと、入力部１０によって受け付けた行動データ、刺激データ、及び属性データとに基づいて、全体パラメータと、複数の個体の各々についての個体パラメータとを推定する。パラメータ推定部３４は、全体パラメータ推定部３６と、個体パラメータ推定部３８と、全体及び個体パラメータ推定部４０とを含んで構成されている。 The parameter estimation unit 34, according to the type of the hierarchical Bayes model selected by the selection unit 32, a model related to the occurrence of individual behavior stored in the storage unit 22, a model related to selection when an individual behavior occurs, and Based on the model related to the generation process of the individual parameters and the behavior data, stimulus data, and attribute data received by the input unit 10, the overall parameters and the individual parameters for each of the plurality of individuals are estimated. The parameter estimation unit 34 includes an overall parameter estimation unit 36, an individual parameter estimation unit 38, and an overall and individual parameter estimation unit 40.

全体パラメータ推定部３６は、選択部３２によって選択された階層ベイズモデルの種類が、１階層の階層ベイズモデルである場合に、入力部１０によって受け付けた行動データ、刺激データ、及び属性データに基づいて、全体パラメータを推定する。 When the type of the hierarchical Bayes model selected by the selector 32 is a one-level hierarchical Bayes model, the overall parameter estimation unit 36 is based on the behavior data, stimulus data, and attribute data received by the input unit 10. Estimate the overall parameters.

具体的には、全体パラメータ推定部３６は、１階層のベイズモデルを仮定する場合、経験ベイズ法に基づき、全体パラメータΛ，βを入力部１０によって受け付けた行動データ、刺激データ、及び属性データから推定する。 Specifically, when assuming a one-layer Bayes model, the overall parameter estimation unit 36 is based on the experience data, stimulus data, and attribute data received from the input unit 10 based on the experience Bayes method. presume.

まず、各個体ｕの個体パラメータθ^ｕの尤度関数ｐ（｛ｔ_ｊ ^ｕ｝,｛ｖ_ｊ ^ｕ｝｜θ^ｕ）の計算は、記憶部２２に格納された個体の行動の発生に関するモデル、個体の行動が発生したときの選択に関するモデルにおいてパラメータθ^ｕの関数としてモデル化されたレートλ（ｔ|θ^ｕ）およびλ_ｚ（ｔ|θ^ｕ）を用いて、以下の式（２１）に示すように実行される。 First, the calculation of the likelihood function p ({t _j ^u }, {v _j ^u } | θ ^u ) of the individual parameter θ ^u of each individual u is a model relating to the occurrence of the behavior of the individual stored in the storage unit 22, Using the rates λ (t | θ ^u ) and λ _z (t | θ ^u ) modeled as a function of the parameter θ ^{u in} the model related to selection when an individual's action occurs, the following equation (21) is used: Run as shown.

ただし、λ_ｚ（ｔ）=ｐ_{ｃｈｏｉｃｅ}（ｚ｜ｔ）・λ（ｔ）である。上記式（２１）は点過程と呼ばれる確率理論（例えば、非特許文献１１を参照）から導かれる。この尤度関数と個体パラメータの生成過程に関するモデルで選択されたθ^ｕの確率分布ｐ（θ^ｕ|β，Λ）から、ベイズの定理に基づき（β，Λ）に対する周辺尤度関数が次の式（２２）のようにして計算される。 Here, λ _z (t) = p _choice (z | t) · λ (t). The above equation (21) is derived from a probability theory called a point process (for example, see Non-Patent Document 11). From the probability distribution p (θ ^u | β, Λ) of θ ^u selected by this likelihood function and the model relating to the generation process of individual parameters, the marginal likelihood function for (β, Λ) based on Bayes' theorem is It is calculated as shown in Equation (22).

（非特許文献１１）
D. L. Snyder, "Random Point Processes", Wiley New York, 1975. (Non-Patent Document 11)
DL Snyder, "Random Point Processes", Wiley New York, 1975.

ただし上記式（２２）のＤは全個体のデータを表す。全体パラメータ（β，Λ）はこの周辺尤度関数ｐ（Ｄ│β，λ）を最大化するように推定される。このように、周辺尤度関数などを用いてパラメータをすべてデータから推定する方法を経験ベイズ法と呼ぶ。 However, D in the above formula (22) represents data of all individuals. The global parameter (β, Λ) is estimated to maximize this marginal likelihood function p (D | β, λ). In this way, a method for estimating all parameters from data using a marginal likelihood function or the like is called an experience Bayesian method.

周辺尤度関数ｐ（Ｄ│β，λ）の最大化は解析的には不可能であるので、本実施の形態では、全体パラメータ推定部３６は、ＥＭ法を用いて周辺尤度関数ｐ（Ｄ│β，λ）の最大化を実行する。ＥＭ法の下、周辺尤度関数を最大化する（β，λ）は次の式（２３）の更新ルールを繰り返し実行することで得られる。 Since maximization of the marginal likelihood function p (D | β, λ) is impossible analytically, in this embodiment, the overall parameter estimation unit 36 uses the EM method to calculate the marginal likelihood function p ( D│β, λ) is maximized. Under the EM method, (β, λ) that maximizes the marginal likelihood function is obtained by repeatedly executing the update rule of the following equation (23).

ただし、 However,

である。上記式（２３）及び（２４）の（β^（ｐ），Λ^（ｐ））はｐ回目の繰り返しにおける更新値を表し、Ｅ［Ｚ（θ^ｕ）｜｛ｔ_ｊ ^ｕ｝,｛ｖ_ｊ ^ｕ｝,β^（ｐ），Λ^（ｐ）］は以下の式（２５）で定義されるθ^ｕについての事後平均を表す。また、上記式（２４）の「’」は転置を表す。 It is. In the above equations (23) and (24), (β ^(p) , Λ ^(p) ) represents an updated value in the p-th iteration, and E [Z (θ ^u ) | {t _j ^u }, {v _j ^u }, Β ^(p) , Λ ^(p) ] represents the posterior average for θ ^u defined by the following equation (25). In addition, “′” in the above formula (24) represents transposition.

ここで上記式（２５）のＺ（θ^ｕ）は任意のθ^ｕの関数である。事後平均はマルコフ連鎖モンテカルロ法を用いて実行される。 Here, Z (θ ^u ) in the above equation (25) is an arbitrary function of θ ^u . A posteriori averaging is performed using the Markov chain Monte Carlo method.

個体パラメータ推定部３８は、入力部１０によって受け付けた行動データ、刺激データ、及び属性データと、全体パラメータ推定部３６によって推定された全体パラメータとに基づいて、複数の個体の各々について個体パラメータを推定する。 The individual parameter estimation unit 38 estimates individual parameters for each of a plurality of individuals based on the behavior data, stimulus data, and attribute data received by the input unit 10 and the overall parameters estimated by the overall parameter estimation unit 36. To do.

具体的には、個体パラメータ推定部３８は、全体パラメータ推定部３６で推定された全体パラメータ＾β，＾Λを用いて、以下の式（２６）に従って、各個体の個体パラメータθ^ｕを事後平均として推定する。 Specifically, the individual parameter estimation unit 38 uses the overall parameters ^ β and ^ Λ estimated by the overall parameter estimation unit 36, and calculates the individual parameter θ ^u of each individual according to the following equation (26). Estimate as

全体及び個体パラメータ推定部４０は、個体パラメータの生成過程に関するモデルが、２階層の階層ベイズモデルに基づきモデル化さている場合、入力部１０によって受け付けた行動データ、及び刺激データに基づいて、全体パラメータと複数の個体の各々についての個体パラメータとを推定する。 When the model related to the generation process of the individual parameters is modeled based on the two-level hierarchical Bayesian model, the whole and individual parameter estimation unit 40 determines the whole parameter based on the behavior data and the stimulus data received by the input unit 10. And individual parameters for each of the plurality of individuals.

具体的には、全体及び個体パラメータ推定部４０は、個体パラメータの生成過程に関するモデルを２階層のベイズモデルを仮定する場合、以下の式（２７）に従って、個体パラメータθ^ｕだけでなく全体パラメータβ，Λも同時に事後平均として推定する。 Specifically, when assuming that the model relating to the generation process of the individual parameters is a two-layer Bayesian model, the entire and individual parameter estimation unit 40 not only determines the individual parameter θ ^u but also the entire parameter β according to the following equation (27). , Λ are also estimated as posterior averages at the same time.

なお、上記式（２７）の事後平均は以下の式で定義される。 In addition, the posterior average of the above formula (27) is defined by the following formula.

出力部５０は、個体パラメータ推定部３８又は全体及び個体パラメータ推定部４０によって推定された複数の個体の各々についての個体パラメータ＾θ^ｕを結果として出力する。個体パラメータは刺激接触効果を数値的に評価したものである。 The output unit 50 outputs the individual parameter {circumflex over (θ) ^{} u} for each of a plurality of individuals estimated by the individual parameter estimation unit 38 or the whole and the individual parameter estimation unit 40 as a result. The individual parameter is a numerical evaluation of the stimulus contact effect.

上記個体の行動の発生に関するモデルとして［例１−２］を用い、かつ上記個体の行動が発生したときの選択に関するモデルとして［例２−２］を用いた場合の個体パラメータの出力例を図５に示す。ただし図５の選択肢は３つ（３種類のブランド、など）、付随情報は２つ（商品価格とロイヤリティ、など）とする。 FIG. 11 is an example of output of individual parameters when [Example 1-2] is used as a model related to the occurrence of the individual's behavior and [Example 2-2] is used as a model related to selection when the behavior of the individual occurs. As shown in FIG. However, it is assumed that there are three options (three types of brands, etc.) and two accompanying information (product price and royalty, etc.) in FIG.

＜本発明の実施の形態に係る最適化装置の構成＞ <Configuration of Optimization Device According to Embodiment of the Present Invention>

次に、本発明の実施の形態に係る最適化装置について説明する。本発明の実施の形態に係る最適化装置は、モデル推定装置１００によって推定された各個体の個体パラメータに基づいて、当該個体に与える予定の選択肢に関する刺激の時刻の系列を最適化する。図６に示すように、本実施の形態に係る最適化装置２００は、ＣＰＵと、ＲＡＭと、後述する最適化処理ルーチンを実行するためのプログラムや各種データを記憶したＲＯＭと、を含むコンピュータで構成することが出来る。この最適化装置２００は、機能的には図７に示すように入力部２１０と、演算部２２０と、出力部２５０とを備えている。 Next, an optimization apparatus according to an embodiment of the present invention will be described. The optimization apparatus according to the embodiment of the present invention optimizes a series of stimulation times related to options to be given to an individual based on the individual parameters of each individual estimated by the model estimation apparatus 100. As shown in FIG. 6, the optimization apparatus 200 according to the present embodiment is a computer that includes a CPU, a RAM, and a ROM that stores a program for executing an optimization processing routine described later and various data. Can be configured. Functionally, the optimization apparatus 200 includes an input unit 210, a calculation unit 220, and an output unit 250 as shown in FIG.

入力部２１０は、個体に与える刺激に関する拘束条件と、個体の行動データと、選択肢について観測された個体の刺激データとを受け付ける。刺激に関する拘束条件は、刺激接触の最適化を図る際の拘束条件として入力される。例えば広告刺激であれば、広告に割くことのできる予算（あるいは接触回数）、接触させることのできる刺激の最小の間隔、広告の期間などである。 The input unit 210 receives constraint conditions related to the stimulus given to the individual, individual behavior data, and individual stimulus data observed for the options. The constraint condition related to the stimulus is input as a constraint condition when the stimulus contact is optimized. For example, in the case of an advertisement stimulus, the budget (or the number of contacts) that can be allocated to the advertisement, the minimum interval of the stimulus that can be contacted, the period of the advertisement, and the like.

演算部２２０は、記憶部２２２と、最適化部２２４とを含んで構成されている。 The calculation unit 220 includes a storage unit 222 and an optimization unit 224.

記憶部２２２には、モデル推定装置１００により推定された複数の個体各々についての個体パラメータと、各個体パラメータを推定する際に用いられた個体の行動の発生に関するモデル及び個体の行動が発生したときの選択に関するモデルとが格納されている。 In the storage unit 222, individual parameters for each of a plurality of individuals estimated by the model estimation apparatus 100, a model relating to the occurrence of individual behavior used when estimating each individual parameter, and the behavior of the individual have occurred. And a model related to the selection.

最適化部２２４は、複数の個体の各々について、入力部２１０によって受け付けた、刺激に関する拘束条件、当該個体の行動データ、及び行動の選択肢ｚ_ｌについて観測された当該個体の刺激データと、当該個体の個体パラメータとに基づいて、刺激に関する拘束条件を満たし、予め定められた期間までの間に、当該個体が行動を起こし、かつ、選択肢ｚ_ｌを選択する回数を最大化するように、当該個体に与える予定の選択肢ｚ_ｌに関する刺激の時刻の系列を最適化する。 For each of the plurality of individuals, the optimization unit 224 receives the constraint conditions relating to the stimulus, the behavior data of the individual, and the stimulation data of the individual observed for the action option z _l and the individual Based on the individual parameters of the individual, so that the constraint on the stimulus is satisfied, the individual takes action during a predetermined period, and the number of times the option z _l is selected is maximized. Optimize the stimulus time series for the option z _{l to} be given to.

具体的には、最適化部２２４は、当該個体の過去の行動データ、当該個体の過去の刺激データ、モデル推定装置１００により推定された当該個体の個体パラメータ、そして刺激に関する拘束条件の下で個体ｕ毎に接触させる刺激Ｉ_ｚ ^ｕ（ｔ）を最適化する。ここで最適化とは刺激接触により、特定の選択行動の発生数を最大化することを意味する。 Specifically, the optimization unit 224 performs the individual behavior under the constraint condition related to the individual's past behavior data, the individual's past stimulus data, the individual parameter of the individual estimated by the model estimation apparatus 100, and the stimulus. The stimulus I _z ^u (t) to be contacted for each u is optimized. Here, “optimization” means maximizing the number of occurrences of a specific selection action by stimulus contact.

まず最適化部２２４は、当該個体ｕに対して、過去の行動データ｛ｔ_ｊ ^ｕ｝｛ｖ_ｊ ^ｕ｝、過去の刺激データＩ_ｚ ^ｕ（Ｔ_ｓ＜ｔ＜Ｔ_ｅ）、推定された個体パラメータ＾θ^ｕ、そして接触させる予定の刺激Ｉ_ｚｕ（ｔ＞Ｔ_ｅ）を用いて、選択ｚのレートの予測値＾λ_ｚ（ｔ｜＾θ^ｕ，｛ｔ_ｊ ^ｕ｝，{ｖ_ｊ ^ｕ}，Ｉ_ｚ ^ｕ（ｔ））を計算する。次に、最適化部２２４は、予測されたレートの積分値を以下の式（２８）に従って計算する。 First, the optimization unit 224 performs the past behavior data {t _j ^u } {v _j ^u }, the past stimulus data I _z ^u (T _s <t <T _e ), and the estimated individual for the individual ^u. Using the parameter ^ θ ^u and the stimulus I _z u (t> T _e ) to be contacted, the predicted rate of the selected z ^ λ _z (t | ^ θ ^u , {t _j ^u }, {v _j ^u }, I _z ^u (t)). Next, the optimization unit 224 calculates an integrated value of the predicted rate according to the following equation (28).

ただし、上記式（２８）のＴは刺激を与える予定の期間を表す。レートは単位時間当りに個体が選択行動ｚ_ｌを起こす確率であるので、上記式（２８）で計算される積分値Ｑ（Ｉ_ｚ ^ｕ（ｔ））は刺激接触によって引き起こされる行動の回数の期待値である。具体的には、以下の式に従って、この期待値Ｑ（Ｉ_ｚ ^ｕ（ｔ））を刺激に関する拘束条件の下で最大化するような刺激^〜Ｉ_ｚ ^ｕ（ｔ＞Ｔ_ｅ）を求めることで刺激接触の最適化が実現される。 However, T in the above equation (28) represents a period for which stimulation is to be applied. Since the rate is the probability that the individual will cause the selected action z _l per unit time, the integral value Q (I _z ^u (t)) calculated by the above equation (28) is the expectation of the number of actions caused by the stimulus contact. Value. Specifically, according to the following formula, by determining a stimulus ^to I _z ^u (t> T _e ) that maximizes the expected value Q (I _z ^u (t)) under the constraint condition related to the stimulus. Optimization of stimulation contact is realized.

出力部２５０は、最適化部２２４によって最適化された、複数の個体各々についての、当該個体に与える予定の選択肢ｚ_ｌに関する刺激の時刻の系列を出力する。出力部２５０では、最適化された刺激接触が個体毎に出力される。出力例を図７に示す。ただし図７は刺激接触回数が１０回の場合の例を示す。 The output unit 250 outputs, for each of the plurality of individuals optimized by the optimization unit 224, a series of stimulation times related to the option z _{l to} be given to the individual. In the output unit 250, the optimized stimulation contact is output for each individual. An output example is shown in FIG. However, FIG. 7 shows an example in which the number of stimulation contacts is 10.

＜本発明の実施の形態に係るモデル推定装置の作用＞ <Operation of Model Estimation Device According to Embodiment of the Present Invention>

次に、本発明の実施の形態に係るモデル推定装置１００の作用について説明する。複数の個体各々についての行動データ、刺激データ、及び属性データと、階層ベイズモデルの選択に関する情報とが入力されると、モデル推定装置１００は、図８に示すモデル推定処理ルーチンを実行する。 Next, the operation of the model estimation apparatus 100 according to the embodiment of the present invention will be described. When behavior data, stimulus data, and attribute data for each of a plurality of individuals and information related to selection of a hierarchical Bayes model are input, the model estimation device 100 executes a model estimation processing routine shown in FIG.

まず、ステップＳ１００において、入力部１０は、複数の個体各々についての行動データ、刺激データ、及び属性データと、階層ベイズモデルの選択に関する情報とを受け付ける。 First, in step S100, the input unit 10 receives behavior data, stimulus data, and attribute data for each of a plurality of individuals, and information related to selection of a hierarchical Bayesian model.

次に、ステップＳ１０２において、選択部３２は、上記ステップＳ１００で受け付けた階層ベイズモデルの選択に関する情報に基づいて、後述するパラメータ推定部３４で用いる階層ベイズモデルを選択する。具体的には、選択部３２は、上記ステップＳ１００で受け付けた階層ベイズモデルの選択に関する情報が１階層の階層ベイズモデルを表している場合には、ステップＳ１０６へ進む。一方、上記ステップＳ１００で受け付けた階層ベイズモデルの選択に関する情報が２階層の階層ベイズモデルを表している場合には、ステップＳ１１０へ進む。 Next, in step S102, the selection unit 32 selects a hierarchical Bayes model to be used by the parameter estimation unit 34 described later based on the information regarding the selection of the hierarchical Bayes model received in step S100. Specifically, the selection unit 32 proceeds to step S106 when the information regarding the selection of the hierarchical Bayes model accepted in Step S100 represents a hierarchical Bayes model of one hierarchy. On the other hand, if the information related to the selection of the hierarchical Bayes model received in Step S100 represents a hierarchical Bayes model of the second hierarchy, the process proceeds to Step S110.

ステップＳ１０６において、全体パラメータ推定部３６は、上記ステップＳ１００で受け付けた複数の個体各々についての行動データ、刺激データ、及び属性データに基づいて、上記式（２３）の更新ルールを繰り返し実行することにより、全体パラメータ＾β、＾Λを推定する。 In step S106, the overall parameter estimation unit 36 repeatedly executes the update rule of the formula (23) based on the behavior data, stimulus data, and attribute data for each of the plurality of individuals received in step S100. Estimate the overall parameters ^ β, ^ Λ.

そして、Ｓ１０８において、個体パラメータ推定部３８は、上記ステップＳ１００で受け付けた複数の個体各々についての行動データ及び刺激データと、上記ステップＳ１０６で推定された全体パラメータ＾β、＾Λとに基づいて、上記式（２６）に従って、複数の個体の各々について個体パラメータ＾θ^ｕを推定する。 Then, in S108, the individual parameter estimation unit 38, based on the behavior data and stimulus data for each of the plurality of individuals received in Step S100, and the overall parameters ^ β and ^ Λ estimated in Step S106, According to the above equation (26), the individual parameter ^ θ ^u is estimated for each of the plurality of individuals.

ステップＳ１１０において、全体及び個体パラメータ推定部４０は、上記ステップＳ１００で受け付けた複数の個体各々についての行動データ及び刺激データに基づいて、上記式（２７）に従って、全体パラメータ＾β、＾Λと、複数の個体の各々についての個体パラメータ＾θ^ｕとを推定する。 In step S110, the whole and individual parameter estimation unit 40, based on the behavior data and stimulus data for each of the plurality of individuals received in step S100, according to the above equation (27), the whole parameters ^ β, ^ Λ, An individual parameter {circumflex over (θ) ^{} u} for each of a plurality of individuals is estimated.

そして、ステップＳ１１２において、出力部５０は、上記ステップＳ１０８又は上記ステップＳ１１０で推定された複数の個体の各々についての個体パラメータ＾θ^ｕを結果として出力して、モデル推定処理ルーチンを終了する。 Then, in step S112, the output unit 50 outputs the individual parameters ^ theta ^u for each of a plurality of individuals estimated in step S108 or step S110 as a result, and ends the model estimation process routine.

＜本発明の実施の形態に係る最適化装置の作用＞ <Operation of the optimization device according to the embodiment of the present invention>

次に、本発明の実施の形態に係る最適化装置２００の作用について説明する。まず、モデル推定装置１００により出力された複数の個体各々についての個体パラメータ＾θ^ｕが、最適化装置２００に入力されると、記憶部２２２に格納される。そして、最適化対象の個体の行動データ、特定の選択肢について観測された当該個体の刺激データと、刺激に関する拘束条件とが入力されると、最適化装置２００は、図９に示す最適化処理ルーチンを実行する。最適化処理ルーチンは、各個体を最適化対象として繰り返し実行される。 Next, the operation of the optimization apparatus 200 according to the embodiment of the present invention will be described. First, when an individual parameter {circumflex over (θ)} ^u for each of a plurality of individuals output by the model estimation apparatus 100 is input to the optimization apparatus 200, it is stored in the storage unit 222. Then, when the behavior data of the individual to be optimized, the stimulus data of the individual observed for a specific option, and the constraint conditions related to the stimulus are input, the optimization apparatus 200 performs the optimization processing routine shown in FIG. Execute. The optimization processing routine is repeatedly executed with each individual as an optimization target.

まず、ステップＳ２００において、入力部２１０は、個体に与える刺激に関する拘束条件と、最適化定対象の個体の行動データと、特定の選択肢について観測された当該個体の刺激データとを受け付ける。 First, in step S200, the input unit 210 receives constraint conditions related to stimulation given to an individual, behavior data of an optimization target individual, and stimulation data of the individual observed for a specific option.

次に、ステップＳ２０２において、最適化部２２４は、最適化対象の個体ｕについて、上記ステップＳ１００で受け付けた、刺激に関する拘束条件、個体の行動データ、及び特定の選択肢ｚ_ｌについて観測された当該個体の刺激データと、記憶部２２２に格納された当該個体の個体パラメータ＾θ^ｕとに基づいて、上記式（２８）及び式（２９）に従って、刺激に関する拘束条件を満たし、予め定められた期間までの間に、個体が行動を起こし、かつ、選択肢ｚ_ｌを選択する回数を最大化するように、個体に与える予定の選択肢ｚ_ｌに関する刺激の時刻の系列を最適化する。 Next, in step S202, the optimization unit 224, with respect to the individual u to be optimized, the individual observed for the constraint condition related to the stimulus, the behavior data of the individual, and the specific option z _l received in step S100. On the basis of the stimulation data and the individual parameter {circumflex over (θ) ^u } of the individual stored in the storage unit 222, satisfying the constraint condition regarding stimulation according to the above formula (28) and formula (29), and up to a predetermined period during, individuals take action, and to maximize the number of times to select the option z _l, optimizing the sequence of time of stimulation related choices z _l scheduled to be given to the individual.

次に、ステップＳ２０４において、出力部２５０は、上記ステップＳ２０２で最適化された、最適化対象の個体に与える予定の選択肢ｚ_ｌに関する刺激の時刻の系列を出力して、最適化処理ルーチンを終了する。 Next, in step S204, the output unit 250 outputs the stimulus time series related to the option z _{1 to} be given to the optimization target individual optimized in step S202, and ends the optimization processing routine. To do.

以上説明したように、本実施の形態に係るモデル推定装置によれば、個体が予め定められた第１の量以上の刺激に接触した場合に刺激接触効果が高くなり、かつ、個体が予め定められた第２の量以上の刺激に接触した場合に刺激接触効果が低くなり、かつ、個体の刺激との接触の経過時間に応じて刺激接触効果が減衰するようにモデル化した個体の行動の発生に関するモデルと、個体が行動を起こしたときに各選択肢を選択する選択確率をモデル化した個体の行動が発生したときの選択に関するモデルと、個体の行動の発生に関するモデルに関するパラメータと個体の行動が発生したときの選択に関するモデルに関するパラメータとを表す個体パラメータが、全体パラメータが表す分布に従って生成されることをモデル化した個体パラメータの生成過程に関するモデルと、複数の個体の各々について観測された行動データ及び刺激データとに基づいて、複数の個体の各々についての個体パラメータとを推定することにより、刺激接触効果の閾値性、刺激接触効果のウェアアウト、及び刺激接触効果の忘却性を考慮した、選択行動に対する刺激接触効果を推定するためのパラメータを得ることができる。 As described above, according to the model estimation device according to the present embodiment, the stimulus contact effect is enhanced when an individual contacts a stimulus of a predetermined first amount or more, and the individual is predetermined. Of the behavior of the individual modeled so that the stimulus contact effect is reduced when contacted with a stimulus of the second amount or more, and the stimulus contact effect is attenuated according to the elapsed time of contact with the individual stimulus. A model related to occurrence, a model related to selection when an individual's behavior occurs modeling the selection probability of selecting each option when the individual takes action, a parameter related to the model related to occurrence of the individual's behavior, and the individual's behavior An individual parameter that models that an individual parameter that represents a parameter related to a model related to selection when an error occurs is generated according to the distribution represented by the overall parameter Based on the model related to the generation process and the behavior data and stimulus data observed for each of the plurality of individuals, by estimating the individual parameters for each of the plurality of individuals, the threshold property of the stimulus contact effect, the stimulus contact It is possible to obtain a parameter for estimating the stimulation contact effect on the selected action in consideration of the effect wearout and the forgetting property of the stimulation contact effect.

また、本実施の形態に係る行動予測装置によれば、個体に与える刺激に関する拘束条件、個体の行動データ、及び選択肢ｚ_ｌについて観測された個体の刺激データと、モデル推定装置によって推定された個体の個体パラメータとに基づいて、拘束条件を満たし、予め定められた期間までの間に、個体が行動を起こし、かつ、選択肢ｚ_ｌを選択する回数を最大化するように、個体に与える予定の選択肢ｚ_ｌに関する刺激の時刻の系列を最適化することにより、刺激接触効果の閾値性、刺激接触効果のウェアアウト、及び刺激接触効果の忘却性を考慮して、個体に特定の選択行動を起こさせるための刺激を最適化することができる。 In addition, according to the behavior prediction device according to the present embodiment, the constraint conditions related to the stimulus given to the individual, the behavior data of the individual, the individual stimulation data observed for the option z _l , and the individual estimated by the model estimation device Based on the individual parameters of the individual to be given to the individual so as to satisfy the constraints and to maximize the number of times that the individual takes action and selects the choice z _l during a predetermined period of time. By optimizing the time series of stimuli with respect to option z _l , a specific selection action is caused to an individual in consideration of the threshold property of the stimulus contact effect, the wear out of the stimulus contact effect, and the forgetting property of the stimulus contact effect. Can be optimized.

また、本実施の形態に係るモデル推定装置及び最適化装置によれば、意思決定の発生と意思決定内容の選択、その両方を組み込み、かつ閾値性、ウェアアウト、及び忘却性を考慮した高精度な刺激接触効果の推定を個体単位で実現することを可能とする技術が提供される。また、各個体に対する最適な刺激の与え方（量とタイミング）を導出することも可能となる． In addition, according to the model estimation device and the optimization device according to the present embodiment, high accuracy in consideration of threshold generation, wearout, and forgetting, incorporating decision generation and selection of decision content, and both Provided is a technique that makes it possible to realize a simple stimulus contact effect estimation for each individual. It is also possible to derive the optimal method (quantity and timing) for each individual.

また、各個体に対する最適な刺激の与え方の情報により、例えば、事業者は各個体に対して広告接触が有効かどうかを判断することができる。あるいは、最小の予算で最大の広告効果を図ることが可能となる。また、広告業者は、正しい広告効果に基づいた適正な広告料金を算定することが可能となる． Further, for example, the business operator can determine whether or not the advertisement contact is effective for each individual based on the information on how to give the optimum stimulus to each individual. Alternatively, the maximum advertising effect can be achieved with the minimum budget. In addition, the advertising agency can calculate an appropriate advertising fee based on the correct advertising effect.

なお、本発明は、上述した実施形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 Note that the present invention is not limited to the above-described embodiment, and various modifications and applications are possible without departing from the gist of the present invention.

例えば、上記実施の形態では、全体パラメータ及び個体パラメータを推定する際に、属性データｄ^ｕを用いる場合を例に説明したが、これに限定されるものではなく、属性データｄ^ｕを用いずに全体パラメータ及び個体パラメータを推定してもよい。属性データｄ^ｕを用いずに全体パラメータ及び個体パラメータを推定する場合には、属性データｄ^ｕを要素１のベクトルとして設定し、かつ当該要素の値を１に設定すればよい。
また、１階層の階層ベイズモデル又は２階層の階層ベイズモデルを選択する場合を例に説明したが、これに限定されるものではなく、１階層の階層ベイズモデル及び２階層の階層ベイズモデルの何れか一方のみを用いるように構成してもよい。１階層の階層ベイズモデルを用いる場合には、選択部３２、及び全体及び個体パラメータ推定部４０が不要であり、２階層の階層ベイズモデルを用いる場合には、選択部３２、全体パラメータ推定部３６、及び個体パラメータ推定部３８が不要である。 For example, in the above embodiment, when estimating the overall parameters and individual parameters, a case of using the attribute data d ^u has been described as an example, but the invention is not limited to this, without using the attribute data d ^u Global parameters and individual parameters may be estimated. When estimating the overall parameters and individual parameters without using the attribute data d ^u sets the attribute data d ^u as a vector of elements 1, and may be set to the value of the element 1.
Moreover, although the case where the one-level hierarchical Bayes model or the two-level hierarchical Bayes model is selected has been described as an example, the present invention is not limited to this, and any one of the one-level hierarchical Bayes model and the two-level hierarchical Bayes model is used. You may comprise only one of them. When a one-level hierarchical Bayes model is used, the selection unit 32 and the entire and individual parameter estimation unit 40 are not necessary. When a two-level hierarchical Bayes model is used, the selection unit 32 and the overall parameter estimation unit 36 are used. , And the individual parameter estimation unit 38 is unnecessary.

また、上述のモデル推定装置は、記憶部２２を備えている場合について説明したが、例えば記憶部２２がモデル推定装置の外部装置に設けられ、モデル推定装置は、外部装置と通信手段を用いて通信することにより、記憶部２２を参照するようにしてもよい。 Moreover, although the above-mentioned model estimation apparatus demonstrated the case where the memory | storage part 22 was provided, the memory | storage part 22 is provided in the external apparatus of the model estimation apparatus, for example, and a model estimation apparatus uses an external apparatus and a communication means. You may make it refer to the memory | storage part 22 by communicating.

また、上述の最適化装置は、記憶部２２２を備えている場合について説明したが、例えば記憶部２２２が最適化装置の外部装置に設けられ、最適化装置は、外部装置と通信手段を用いて通信することにより、記憶部２２２を参照するようにしてもよい。 Moreover, although the above-mentioned optimization apparatus demonstrated the case where the memory | storage part 222 was provided, the memory | storage part 222 is provided in the external apparatus of the optimization apparatus, for example, and an optimization apparatus uses an external apparatus and a communication means. You may make it refer to the memory | storage part 222 by communicating.

また、上記実施の形態では、モデル推定装置と最適化装置とを別々の装置として構成する場合を例に説明したが、モデル推定装置と最適化装置とを１つの装置として構成してもよい。 Moreover, although the case where the model estimation apparatus and the optimization apparatus are configured as separate apparatuses has been described as an example in the above embodiment, the model estimation apparatus and the optimization apparatus may be configured as one apparatus.

また、上述のモデル推定装置及び最適化装置は、内部にコンピュータシステムを有しているが、コンピュータシステムは、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。 In addition, the model estimation device and the optimization device described above have a computer system inside, but if the computer system uses a WWW system, the homepage providing environment (or display environment) is also included. Shall be.

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、当該プログラムを、コンピュータ読取り可能な記録媒体に格納して提供することも可能である。 In the present specification, the embodiment has been described in which the program is installed in advance. However, the program can be provided by being stored in a computer-readable recording medium.

１０入力部
２０演算部
２２記憶部
３０推定部
３２選択部
３４パラメータ推定部
３６全体パラメータ推定部
３８個体パラメータ推定部
４０全体及び個体パラメータ推定部
１００モデル推定装置
２００最適化装置
２１０入力部
２２０演算部
２２２記憶部
２２４最適化部
２５０出力部 DESCRIPTION OF SYMBOLS 10 Input part 20 Operation part 22 Storage part 30 Estimation part 32 Selection part 34 Parameter estimation part 36 Overall parameter estimation part 38 Individual parameter estimation part 40 Whole and individual parameter estimation part 100 Model estimation apparatus 200 Optimization apparatus 210 Input part 220 Operation part 222 Storage unit 224 Optimization unit 250 Output unit

Claims

A model estimation method in a model estimation device including an input unit and a parameter estimation unit,
Action time series data representing a time series when the individual took action observed by each of the plurality of individuals, and a selection series representing a series of action selection contents when the individual acted Receiving behavior data including data, and stimulating data that is observed for each of a plurality of individuals and that is observed for each action option, and that represents a sequence of times when the individual contacts the stimulus;
The parameter estimation unit,
For each of multiple individuals,
When the individual comes into contact with a stimulus of a predetermined first amount or more, a stimulus contact effect representing a degree of the individual to take action by contact with the stimulus becomes high, and the individual is predetermined. A unit time at time t so that the stimulus contact effect is reduced when contacted with a stimulus of a second amount or more, and the stimulus contact effect is attenuated according to the elapsed time of contact with the individual's stimulus. A model relating to the occurrence of an individual's behavior that models the rate λ (t) at which the individual takes action, and a selection probability p _l for selecting each option z _l when the individual takes action at time t A model for selecting when an individual's behavior occurs, modeling (t);
For each of the plurality of individuals, an individual parameter that represents a parameter related to the model related to the occurrence of the behavior of the individual and a parameter related to the model related to the selection when the behavior of the individual occurs is generated according to the distribution represented by the overall parameter. Based on the model related to the generation process of the individual parameters and the behavior data and the stimulus data received by the input unit,
Estimating the global parameters and the individual parameters for each of the plurality of individuals;
A model estimation method including:

The model related to the generation process of the individual parameters is modeled based on a one-level hierarchical Bayesian model,
The step of estimating the parameter by the parameter estimating unit estimates the overall parameter based on the behavior data and the stimulus data received by the input unit, and the behavior data and the stimulus data received by the input unit; The model estimation method according to claim 1, wherein the individual parameter is estimated for each of the plurality of individuals based on the estimated overall parameter.

The model related to the generation process of the individual parameters is modeled based on a two-level hierarchical Bayesian model,
The step of estimating a parameter by the parameter estimation unit estimates the overall parameter and the individual parameter for each of the plurality of individuals based on the behavior data and the stimulus data received by the input unit. 2. The model estimation method according to 1.

The step of receiving by the input unit further receives attribute data representing an attribute for each of a plurality of individuals,
The model related to the generation process of the individual parameter is a model in which the individual parameter for each of the plurality of individuals is generated according to a distribution represented by the attribute of the individual and the overall parameter,
The step of estimating the parameter by the parameter estimation unit includes a model relating to selection when the behavior of the individual occurs and a model relating to the generation process of the individual parameter, the behavior data received by the input unit, the stimulation data, and The model estimation method according to claim 1, wherein the overall parameter and the individual parameter for each of a plurality of individuals are estimated based on the attribute data.

The option z _{l to} be given to the individual based on the individual parameters of the individual estimated by the model estimation method according to any one of claims 1 to 4 including an input unit and an optimization unit An optimization method in an optimization device for optimizing a sequence of stimulation times,
Receiving the constraint condition relating to the stimulus given to the individual by the input unit, the behavior data of the individual, and the stimulus data of the individual observed for the option z _l ;
Based on the constraint condition, the behavior data of the individual, the stimulation data of the individual observed for the option z _l , and the individual parameters of the individual, which the optimization unit has received by the input unit The option z to be given to the individual so as to maximize the number of times that the individual takes action and selects the option z _l during a predetermined period of time, satisfying the constraint conditions optimizing the sequence of stimulus times for _l ;
Optimization method including.

Observed for each of a plurality of individuals, behavior time series data representing a series of times when the individual took action, selection series data representing a series of action selection contents when the individual acted, and the individual And an input unit for receiving stimulus data representing a series of times when the individual contacts the stimulus, which is observed for each of the plurality of individuals and observed for each action option,
For each of multiple individuals,
When the individual comes into contact with a stimulus of a predetermined first amount or more, a stimulus contact effect representing a degree of the individual to take action by contact with the stimulus becomes high, and the individual is predetermined. A unit time at time t so that the stimulus contact effect is reduced when contacted with a stimulus of a second amount or more, and the stimulus contact effect is attenuated according to the elapsed time of contact with the individual's stimulus. A model relating to the occurrence of an individual's behavior that models the rate λ (t) at which the individual takes action, and a selection probability p _l for selecting each option z _l when the individual takes action at time t A model for selecting when an individual's behavior occurs, modeling (t);
For each of the plurality of individuals, an individual parameter that represents a parameter related to the model related to the occurrence of the behavior of the individual and a parameter related to the model related to the selection when the behavior of the individual occurs is generated according to the distribution represented by the overall parameter. Based on the model related to the generation process of the individual parameters and the behavior data and the stimulus data received by the input unit,
A parameter estimator that estimates the overall parameters and the individual parameters for each of the plurality of individuals;
Model estimation apparatus including

An optimization device for optimizing a sequence of stimulation times related to the option z _{l to} be given to the individual based on the individual parameters of the individual estimated by the model estimation device according to claim 6,
An input unit for receiving a constraint condition relating to a stimulus given to the individual, the behavior data of the individual, and the stimulus data of the individual observed with respect to the option z _l ;
The constraint condition is satisfied based on the constraint condition, the behavior data of the individual, the stimulus data of the individual observed for the option z _l , and the individual parameter of the individual, which is received by the input unit. The time of stimulation for the option z _{l to} be given to the individual so as to maximize the number of times that the individual takes action and selects the option z _l during a predetermined period of time An optimization unit for optimizing the series;
Including an optimization device.

The program for making a computer perform each step of the model estimation method of any one of Claims 1-4, or the optimization method of Claim 5.