JP2024002569A

JP2024002569A - Information processing device, information processing system, information processing method, and program

Info

Publication number: JP2024002569A
Application number: JP2022101837A
Authority: JP
Inventors: 琢人杉崎; Takuto Sugisaki
Original assignee: Japan Data Science Consortium Co Ltd JDSC
Current assignee: Japan Data Science Consortium Co Ltd JDSC
Priority date: 2022-06-24
Filing date: 2022-06-24
Publication date: 2024-01-11

Abstract

PROBLEM TO BE SOLVED: To improve prediction accuracy of an intervention effect in direct marketing.

SOLUTION: An information processing device includes: an acquisition part for acquiring customer information and purchase information; a learning part which calculates a predicted value of a conditional intervention effect for each user and a predicted value of a future intervention effect by inputting the customer information and purchase information acquired by the acquisition part into a learning model in which data showing conditional intervention effects for each user due to campaign intervention and data showing future intervention effects due to the campaign intervention is learned as a learning data set; and a prediction unit for correcting the prediction value of the future intervention effect by using a trend.

SELECTED DRAWING: Figure 2

Description

本発明は、情報処理装置、情報処理システム、情報処理方法、及びプログラムに関する。 The present invention relates to an information processing device, an information processing system, an information processing method, and a program.

ダイレクトマーケティングとは、企業が顧客と直接コミュニケーションを図りながら購入を働きかける手法である。例えば、ある百貨店が顧客にカタログ発行し、カタログ経由での注文を受注する。この場合、百貨店は、カタログ作成やその配布に必要とするコストと、カタログ経由で発生した売上、利益を比較することで、そのマーケティング施策に投下したコストに対してどの程度のリターンがあったかを定量的に計測することが可能である。 Direct marketing is a method in which companies communicate directly with customers to encourage them to make purchases. For example, a department store issues a catalog to customers and accepts orders via the catalog. In this case, the department store can quantify the return on costs invested in marketing measures by comparing the costs required to create and distribute the catalog with the sales and profits generated through the catalog. It is possible to measure the

ダイレクトマーケティング活動における損益分岐点の構造は、固定生産費用、変動生産費用、およびプロモーション費用に関連する。ダイレクトマーケティングにおける意思決定では、固定生産費と変動生産費との構造が所与である場合、自らの利益を最大化するようにプロモーション費用を決定する。一方で、プロモーション費用に関する正確な意思決定のためには、プロモーション費用が需要に及ぼす影響を把握している必要がある。プロモーション費用が需要に及ぼす影響は、そのキャンペーンにおけるターゲット、タイミング、クリエイティブ、オファー内容などにより変動する。そのため、ダイレクトマーケティング全体としては、プロモーション費用あたりの需要獲得を最大化するようなキャンペーンを企画しつつ、最終利益が最大化するようなプロモーション費用の総額を決定するという２つの意思決定要素から成り立つ。 The structure of break-even in direct marketing activities is related to fixed production costs, variable production costs, and promotional costs. When making decisions in direct marketing, given the structure of fixed production costs and variable production costs, promotion costs are determined to maximize one's profits. On the other hand, in order to make accurate decisions regarding promotion costs, it is necessary to understand the impact of promotion costs on demand. The impact of promotional costs on demand varies depending on the campaign's targets, timing, creative, offer content, etc. Therefore, direct marketing as a whole consists of two decision-making elements: planning a campaign that maximizes demand acquisition per promotional cost, and determining the total promotional cost that maximizes final profit.

ここで、キャンペーンの対象とした場合のｉＲＯＡＳ（ｉｎｃｒｅｍｅｎｔａｌＲｅｔｕｒｎｏｎＡｄＳｐｅｎｄ：広告支出によって増加する収入）が顧客ごとに正確に算定できると仮定する。この場合、意思決定要素のうち、最適なターゲティングとプロモーション費用との総額は、一度に求めることが可能である。例えば、ｉＲＯＡＳが正となる顧客全てをキャンペーンの対象とし、その場合に必要な費用が最適なプロモーション費用の総額となる。
しかしながら、現実のダイレクトマーケティングにおいては、必ずしもそのような理想的意思決定が行われていない。主な要因としては次のようなものがある。 Here, it is assumed that iROAS (incremental return on ad spend: revenue increased by advertising expenditure) can be accurately calculated for each customer when targeted by a campaign. In this case, among the decision-making factors, the total amount of optimal targeting and promotion costs can be determined at once. For example, all customers whose iROAS is positive are targeted for a campaign, and the cost required in that case becomes the optimal total promotional cost.
However, in actual direct marketing, such ideal decision-making is not always carried out. The main factors include:

（１）Ｕｐｌｉｆｔの予測
因果推論の根本的問題により、キャンペーンによる増幅効果（Ｕｐｌｉｆｔ）を推定、予測するには困難が伴う。 (1) Prediction of Uplift Due to fundamental problems in causal inference, it is difficult to estimate and predict the amplification effect (Uplift) of a campaign.

（２）消費者異質性の考慮
顧客一人一人によってキャンペーンによるＵｐｌｉｆｔ効果は異なるが、これを高精度で予測するためには、機械学習等の手法が必要となる。機械学習を活用する組織的ノウハウの無い場合は、ＲＦＭ（ＲｅｃｅｎｃｙＦｒｅｑｕｅｎｃｙＭｏｎｅｔａｒｙ）ベースでのセグメンテーションなどを用いて消費者異質性を加味する方法が採用されていることが多い。ただし、この場合も多くはＲｅｓｐｏｎｓｅベースでの異質性の予測に止まっている。ダイレクトマーケティングにＵｐｌｉｆｔＭｏｄｅｌｉｎｇを活用し、Ｕｐｌｉｆｔ効果における消費者異質性を加味した意思決定を行う先行研究は幾つか存在する。 (2) Consideration of consumer heterogeneity The Uplift effect of a campaign differs depending on each customer, but in order to predict this with high accuracy, a method such as machine learning is required. If there is no organizational know-how to utilize machine learning, a method is often adopted in which consumer heterogeneity is taken into account using segmentation based on RFM (Recency Frequency Monetary). However, in most cases, heterogeneity is only predicted based on the response. There are several previous studies that utilize Uplift Modeling in direct marketing to make decisions that take into account consumer heterogeneity in the Uplift effect.

（３）時間的異質性の考慮
顧客毎にＵｐｌｉｆｔ効果が異なるだけでなく、キャンペーンの実施タイミングによってもＵｐｌｉｆｔ効果は異なる。介入時と非介入時におけるアウトカムの数値的変化を元に介入効果を推定する技術は、経済学分野を中心に数多く存在する。しかしながら、Ｕｐｌｉｆｔ効果自体が時間とともに変動することを前提として、将来のＵｐｌｉｆｔ効果の予測からダイレクトマーケティングの意思決定を行う技術は、存在しない。
現実のダイレクトマーケティングにおいては、ＵｐｌｉｆｔではなくＲｅｓｐｏｎｓｅベースで季節ごとのキャンペーン結果の傾向を定量化し、季節ごとにＲＦＭセグメントの異なる閾値を用いたり、前年同時期のキャンペーン結果を元にターゲットを選定するなどの方法により季節性やトレンドを極力反映させるように工夫したりする技術も存在する。 (3) Consideration of temporal heterogeneity Not only does the Uplift effect differ for each customer, but it also differs depending on the timing of campaign implementation. There are many techniques, mainly in the field of economics, for estimating intervention effects based on numerical changes in outcomes between the intervention and non-intervention periods. However, there is no technology for making direct marketing decisions based on predictions of future Uplift effects, assuming that Uplift effects themselves vary over time.
In actual direct marketing, trends in seasonal campaign results are quantified based on Response rather than Uplift, and different thresholds for RFM segments are used for each season, and targets are selected based on campaign results from the same period the previous year. There is also a technique for devising ways to reflect seasonality and trends as much as possible.

（４）クリエイティブ・オファー内容によるＵｐｌｉｆｔへの影響の定量化
マーケティングの効果を事前に予測するには、ターゲットやそのタイミングだけでなく、どのようなオファー、クリエイティブが提示されるかによる影響も考慮する必要がある。一方で、顧客の反応をモデリングするには、オファー、クリエイティブなどの要素を定量的に示すことが難しい。オファーにおける対象商品が毎回異なる、クリエイティブが毎回異なるなどの性質を持つキャンペーンの予測は、モデルに含めることができないオファー・クリエイティブによる変動要因が大きいほど難易度が増大する。 (4) Quantifying the impact of creative offer content on Uplift To predict marketing effectiveness in advance, consider not only the target and its timing, but also the impact of what kind of offer and creative will be presented. There is a need. On the other hand, when modeling customer reactions, it is difficult to quantify elements such as offers and creatives. The difficulty of predicting campaigns that have characteristics such as the target product in an offer being different each time or the creative being different each time increases as the variation factors due to offers and creatives that cannot be included in the model are large.

例えば、非特許文献１に記載の技術では、シミュレーションデータが一定の精度を示す。 For example, in the technique described in Non-Patent Document 1, simulation data exhibits a certain degree of accuracy.

ＳｈｕＬｉａｎｄＰｅｔｅｒＢｕｈｌｍａｎｎ，Ｅｓｔｉｍａｔｉｎｇｈｅｔｅｒｏｇｅｎｅｏｕｓｔｒｅａｔｍｅｎｔｅｆｆｅｃｔｓｉｎｎｏｎ－ｓｔａｔｉｏｎａｒｙｔｉｍｅｓｅｒｉｅｓｗｉｔｈｓｔａｔｅ－ｓｐａｃｅｍｏｄｅｌｓ，Ｄｅｃｅｍｂｅｒ，２０１８．Shu Li and Peter Buhlmann, Estimating heterogeneous treatment effects in non-stationary time series with state-space models, December, 2018.

しかしながら、非特許文献１に記載の技術では、実データを用いた検証した場合に、将来トレンドの推定をする際に季節成分が消失してしまうことがある。このため、ダイレクトマーケティングにおけるトレンドを算出できないことがあった。また、季節成分の消失により介入効果の予測ができないことがあった。このように、ダイレクトマーケティングにおける介入効果の予測精度が十分でないという課題があった。 However, in the technique described in Non-Patent Document 1, when verified using actual data, seasonal components may disappear when estimating future trends. For this reason, trends in direct marketing could not be calculated. In addition, the effect of the intervention could not be predicted due to the disappearance of the seasonal component. As described above, there has been a problem in that the accuracy of predicting intervention effects in direct marketing is not sufficient.

本発明は、上記の課題を解決すべくなされたもので、その目的は、ダイレクトマーケティングにおける介入効果の予測精度を向上させることができる情報処理装置、情報処理システム、情報処理方法、およびプログラムを提供することにある。 The present invention has been made to solve the above problems, and its purpose is to provide an information processing device, an information processing system, an information processing method, and a program that can improve the prediction accuracy of intervention effects in direct marketing. It's about doing.

本発明の一態様は、上記課題を鑑みてなされたものであり、顧客情報および購買情報を取得する取得部と、キャンペーン介入によるユーザごとの条件付き介入効果を示すデータと、キャンペーン介入による将来の介入効果を示すデータとを学習データセットとして学習された学習モデルに、前記取得部が取得した顧客情報および購買情報を入力することで、ユーザごとの条件付き介入効果の予測値と、将来の介入効果の予測値とを算出する学習部と、前記将来の介入効果の予測値を、トレンドを用いて補正する予測部と、を備える情報処理装置である。 One aspect of the present invention has been made in view of the above problems, and includes an acquisition unit that acquires customer information and purchase information, data indicating conditional intervention effects for each user due to campaign intervention, and future predictions due to campaign intervention. By inputting the customer information and purchase information acquired by the acquisition unit into a learning model that has been trained using the data indicating the intervention effect as a learning data set, the predicted value of the conditional intervention effect for each user and the future intervention can be calculated. The information processing apparatus includes a learning unit that calculates a predicted value of an effect, and a prediction unit that corrects the predicted value of the future intervention effect using a trend.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、キャンペーン介入した期間と、前記キャンペーン介入しない期間とを少なくとも含む所定期間におけるユーザごとの条件付き介入効果を示すデータと、将来の介入効果を示すデータとを学習データセットとして学習する学習モデルによって、将来の介入効果の予測値を算出する学習部と、前記キャンペーン介入しない期間における前記ユーザごとの条件付き介入効果を示すデータによる影響を抑制して前記将来の介入効果の予測値を補正するためのトレンドを算出する予測部と、を備える情報処理装置である。 Further, one aspect of the present invention has been made in view of the above problems, and includes data indicating a conditional intervention effect for each user in a predetermined period including at least a period in which the campaign intervenes and a period in which the campaign does not intervene; a learning unit that calculates a predicted value of a future intervention effect using a learning model that learns data indicating a future intervention effect as a learning data set, and data indicating a conditional intervention effect for each user during a period in which the campaign does not intervene. and a prediction unit that calculates a trend for correcting the predicted value of the future intervention effect by suppressing the influence of the intervention effect.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、顧客情報および購買情報を取得する取得部と、キャンペーン介入によるユーザごとの条件付き介入効果を示すデータと、キャンペーン介入による将来の介入効果を示すデータとを学習データセットとして学習された学習モデルに、前記取得部が取得した顧客情報および購買情報を入力することで、ユーザごとの条件付き介入効果の予測値と、将来の介入効果の予測値とを算出する学習部と、前記将来の介入効果の予測値を、トレンドを用いて補正する予測部と、を備える情報処理システムである。 Further, one aspect of the present invention has been made in view of the above problems, and includes an acquisition unit that acquires customer information and purchase information, data indicating a conditional intervention effect for each user due to campaign intervention, and data indicating the conditional intervention effect for each user due to campaign intervention. By inputting the customer information and purchase information acquired by the acquisition unit into a learning model trained using data indicating future intervention effects as a learning data set, the predicted value of the conditional intervention effect for each user and the future The information processing system includes a learning unit that calculates a predicted value of the intervention effect in the future, and a prediction unit that corrects the predicted value of the future intervention effect using a trend.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、キャンペーン介入した期間と、前記キャンペーン介入しない期間とを少なくとも含む所定期間におけるユーザごとの条件付き介入効果を示すデータと、将来の介入効果を示すデータとを学習データセットとして学習する学習モデルによって、将来の介入効果の予測値を算出する学習部と、前記キャンペーン介入しない期間における前記ユーザごとの条件付き介入効果を示すデータによる影響を抑制して前記将来の介入効果の予測値を補正するためのトレンドを算出する予測部と、を備える情報処理システムである。 Further, one aspect of the present invention has been made in view of the above problems, and includes data indicating a conditional intervention effect for each user in a predetermined period including at least a period in which the campaign intervenes and a period in which the campaign does not intervene; a learning unit that calculates a predicted value of a future intervention effect using a learning model that learns data indicating a future intervention effect as a learning data set, and data indicating a conditional intervention effect for each user during a period in which the campaign does not intervene. and a prediction unit that calculates a trend for correcting the predicted value of the future intervention effect by suppressing the influence of the intervention effect.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、情報処理装置のコンピュータが、顧客情報および購買情報を取得する取得過程と、キャンペーン介入によるユーザごとの条件付き介入効果を示すデータと、キャンペーン介入による将来の介入効果を示すデータとを学習データセットとして学習された学習モデルに、前記取得部が取得した顧客情報および購買情報を入力することで、ユーザごとの条件付き介入効果の予測値と、将来の介入効果の予測値とを算出する学習過程と、前記将来の介入効果の予測値を、トレンドを用いて補正する予測過程と、を有する情報処理方法である。 Further, one aspect of the present invention has been made in view of the above problems, and includes an acquisition process in which a computer of an information processing device acquires customer information and purchase information, and a conditional intervention effect for each user due to campaign intervention. By inputting the customer information and purchase information acquired by the acquisition unit into a learning model that has been trained using data indicating the future intervention effect of campaign intervention as a learning data set, the data indicating the future intervention effect of campaign intervention can be input into the learning model. This information processing method includes a learning process of calculating a predicted value of an effect and a predicted value of a future intervention effect, and a prediction process of correcting the predicted value of the future intervention effect using a trend.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、情報処理装置のコンピュータが、キャンペーン介入した期間と、前記キャンペーン介入しない期間とを少なくとも含む所定期間におけるユーザごとの条件付き介入効果を示すデータと、将来の介入効果を示すデータとを学習データセットとして学習する学習モデルによって、将来の介入効果の予測値を算出する学習過程と、前記キャンペーン介入しない期間における前記ユーザごとの条件付き介入効果を示すデータによる影響を抑制して前記将来の介入効果の予測値を補正するためのトレンドを算出する予測過程と、を有する情報処理方法である。 Further, one aspect of the present invention has been made in view of the above-mentioned problems, and the computer of the information processing device sets conditions for each user during a predetermined period that includes at least a period in which a campaign intervenes and a period in which the campaign does not intervene. A learning process that calculates a predicted value of the future intervention effect using a learning model that learns data showing the intervention effect and data showing the future intervention effect as a learning data set, and An information processing method comprising: a prediction step of calculating a trend for suppressing the influence of data indicating a conditional intervention effect and correcting a predicted value of the future intervention effect.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、情報処理装置のコンピュータに、顧客情報および購買情報を取得する取得ステップと、キャンペーン介入によるユーザごとの条件付き介入効果を示すデータと、キャンペーン介入による将来の介入効果を示すデータとを学習データセットとして学習された学習モデルに、前記取得部が取得した顧客情報および購買情報を入力することで、ユーザごとの条件付き介入効果の予測値と、将来の介入効果の予測値とを算出する学習ステップと、前記将来の介入効果の予測値を、トレンドを用いて補正する予測ステップと、を実行させるためのプログラムである。 Further, one aspect of the present invention has been made in view of the above problems, and includes an acquisition step of acquiring customer information and purchase information, and a conditional intervention effect for each user due to campaign intervention, in a computer of an information processing device. By inputting the customer information and purchase information acquired by the acquisition unit into a learning model that has been trained using data indicating the future intervention effect of campaign intervention as a learning data set, the data indicating the future intervention effect of campaign intervention can be input into the learning model. This is a program for executing a learning step of calculating a predicted value of an effect and a predicted value of a future intervention effect, and a prediction step of correcting the predicted value of the future intervention effect using a trend.

また、本発明の一態様は、上記課題を鑑みてなされたものであり、情報処理装置のコンピュータに、キャンペーン介入した期間と、前記キャンペーン介入しない期間とを少なくとも含む所定期間におけるユーザごとの条件付き介入効果を示すデータと、将来の介入効果を示すデータとを学習データセットとして学習する学習モデルによって、将来の介入効果の予測値を算出する学習ステップと、前記キャンペーン介入しない期間における前記ユーザごとの条件付き介入効果を示すデータによる影響を抑制して前記将来の介入効果の予測値を補正するためのトレンドを算出する予測ステップと、を実行させるためのプログラムである。 Further, one aspect of the present invention has been made in view of the above-mentioned problems, and is configured to set a condition for each user in a predetermined period including at least a period in which a campaign intervenes and a period in which the campaign does not intervene in the computer of the information processing device. a learning step of calculating a predicted value of the future intervention effect by a learning model that learns data showing the intervention effect and data showing the future intervention effect as a learning data set; This is a program for executing a prediction step of calculating a trend for suppressing the influence of data indicating a conditional intervention effect and correcting the predicted value of the future intervention effect.

上記少なくとも一つの態様によれば、ダイレクトマーケティングにおける介入効果の予測精度を向上させることができる。 According to at least one aspect described above, it is possible to improve the prediction accuracy of intervention effects in direct marketing.

第１実施形態に係る情報処理システムの構成の一例を示す概略図である。1 is a schematic diagram showing an example of the configuration of an information processing system according to a first embodiment. 本実施形態に係る情報処理装置の機能構成を示す概略ブロック図である。FIG. 1 is a schematic block diagram showing the functional configuration of an information processing device according to the present embodiment. 本実施形態に係るＣＣＥＵＴの一例を示す概略図である。It is a schematic diagram showing an example of CCEUT concerning this embodiment. 本実施形態に係るＣＣＥＰＴＡの一例を示す概略図である。It is a schematic diagram showing an example of CCEPTA concerning this embodiment. 本実施形態に係る顧客情報の構成の一例を示すテーブルである。It is a table showing an example of the structure of customer information according to the present embodiment. 本実施形態に係る購買情報の構成の一例を示すテーブルである。It is a table showing an example of the structure of purchasing information according to the present embodiment. 本実施形態に係るＤＭ情報の構成の一例を示すテーブルである。It is a table showing an example of the structure of DM information according to the present embodiment. 本実施形態に係る送付先情報の構成の一例を示すテーブルである。It is a table showing an example of the structure of destination information according to the present embodiment. 本実施形態に係る情報処理装置のハードウェア構成を示す概略ブロック図である。1 is a schematic block diagram showing a hardware configuration of an information processing device according to an embodiment. FIG. 本実施形態に係る学習処理の一例を示すフローチャートである。7 is a flowchart illustrating an example of learning processing according to the present embodiment. 本実施形態に係る予測処理の一例を示すフローチャートである。It is a flow chart which shows an example of prediction processing concerning this embodiment. 本実施形態に係る予測処理の一例を示すフローチャートである。It is a flow chart which shows an example of prediction processing concerning this embodiment. 本実施形態に係る検証データの一例を示す図である。It is a figure showing an example of verification data concerning this embodiment. 本実施形態に係る各共変量の分布の一例を示す図である。It is a figure showing an example of distribution of each covariate concerning this embodiment. 本実施形態に係る各共変量の分布の他の一例を示す図である。It is a figure showing another example of distribution of each covariate concerning this embodiment. 本実施形態に係る各共変量の分布の他の一例を示す図である。It is a figure showing another example of distribution of each covariate concerning this embodiment. 本実施形態に係る各共変量と潜在アウトカムの関係性の一例を示す図である。FIG. 2 is a diagram showing an example of the relationship between each covariate and a potential outcome according to the present embodiment. 本実施形態に係る各個体の真の介入効果の時系列プロットの一例を示す図である。FIG. 7 is a diagram showing an example of a time-series plot of the true intervention effect for each individual according to the present embodiment. 本実施形態に係る学習・予測期間のデータ諸元の一例を示す図である。It is a figure showing an example of data specifications of a learning/prediction period concerning this embodiment. 本実施形態に係る各Ｐｏｌｉｃｙでサンプリングされたデータの一例を示す図である。It is a figure which shows an example of the data sampled by each policy based on this embodiment. 本実施形態に係る各Ｐｏｌｉｃｙでのトレンドの一例を示す図である。It is a figure showing an example of a trend in each policy concerning this embodiment. 本実施形態に係るサンプリングの検証結果の一例を示す図である。It is a figure showing an example of the verification result of sampling concerning this embodiment. 本実施形態に係るＣａｌｉｂｒａｔｉｏｎ方式の検証結果の一例を示す図である。FIG. 3 is a diagram illustrating an example of a verification result of the Calibration method according to the present embodiment.

（第１実施形態）
以下、図面を参照して、第１実施形態について詳しく説明する。 (First embodiment)
Hereinafter, the first embodiment will be described in detail with reference to the drawings.

＜情報処理システム＞
図１は、第１実施形態に係る情報処理システムＳｙｓの一例を示す概略図である。
情報処理システムＳｙｓは、情報処理装置１と、企業Ｂと、ユーザＵ１、ユーザＵ２とを含む。情報処理装置１は、企業Ｂから顧客情報、購買情報、ダイレクトメッセージ（ＤＭ）情報、その他情報などの各種情報ＩＤを受け付ける。情報処理装置１は、受け付けた各種情報ＩＤに対応する情報に基づいて機械学習し、学習結果として、ダイレクトメッセージ等の送信先などの予測情報ＯＤを出力する。 <Information processing system>
FIG. 1 is a schematic diagram showing an example of an information processing system Sys according to the first embodiment.
The information processing system Sys includes an information processing device 1, a company B, a user U1, and a user U2. The information processing device 1 receives various information IDs from company B, such as customer information, purchase information, direct message (DM) information, and other information. The information processing device 1 performs machine learning based on information corresponding to the received various information IDs, and outputs predicted information OD, such as the destination of a direct message, etc., as a learning result.

企業Ｂは、予測情報ＯＤに基づいて、ユーザにダイレクトメッセージを送信する。例えば、企業Ｂは、ユーザＵ１に対して商品紹介などのダイレクトメッセージＤＭ１を送信する。また、企業ＢはユーザＵ２に対して商品紹介などのダイレクトメッセージＤＭ２を送信する。これに対して、ユーザＵ１は、応答Ｒ１として商品の購入などを行わなかったとする。一方、ユーザＵ２は、応答Ｒ２として商品紹介のダイレクトメッセージＤＭ２を受けて、商品の購入を行ったとする。
ダイレクトマーケティングにおいて、企業Ｂは、ダイレクトメッセージに対して購買などのアクションにつながるユーザＵ２のようなユーザに対してより多くのダイレクトメッセージを送付できれば、高い広告効果が得られる。 Company B sends a direct message to the user based on the prediction information OD. For example, company B sends a direct message DM1 such as product introduction to user U1. Further, company B sends a direct message DM2 such as product introduction to user U2. In contrast, it is assumed that the user U1 does not purchase a product as a response R1. On the other hand, it is assumed that the user U2 receives the direct message DM2 introducing the product as a response R2, and purchases the product.
In direct marketing, if company B can send more direct messages to users, such as user U2, who take actions such as purchasing in response to direct messages, high advertising effectiveness can be achieved.

情報処理装置１は、広告費用に対する需要獲得を最大化するようなキャンペーンを企画しつつ、最終利益が最大化するような広告費用の総額を決定する。また、情報処理装置１は、トレンドを考慮したユーザごとの条件付き介入効果の予測と将来の介入効果の予測とを行う。ここで、トレンドには、トレンド成分と季節成分とが含まれる。また、トレンドを考慮するとは、トレンドを用いて予測値の補正を行うことである。情報処理装置１は、介入効果の側に基づいて、ダイレクトメッセージの送付先やダイレクトメッセージの送付タイミングを予測する。換言すれば、情報処理装置１は、ダイレクトマーケティングにおける意思決定の一要素であるターゲティングの最適化を行い、広告効果が高い、つまり広告の内容への親和性が高いと見込まれるユーザを抽出するために、ユーザごとの条件付き介入効果の予測を行う。また、情報処理装置１は、親和性が高いと見込まれるユーザのみがダイレクトメッセージを受けられるように、ダイレクトメッセージの送付先を決定する。このため、情報処理装置１は、この最適化の核となるプロセスである介入効果の推定（ＵｐｌｉｆｔＭｏｄｅｌｉｎｇ）において、個人ごとの違い（消費者異質性）とタイミングによる違い（時間的異質性）の両者を共に考慮することで、より精度の高いＵｐｌｉｆｔＭｏｄｅｌｉｎｇを行う。 The information processing device 1 plans a campaign that maximizes the acquisition of demand for advertising costs, and determines the total amount of advertising costs that will maximize the final profit. Further, the information processing device 1 performs prediction of conditional intervention effects for each user and prediction of future intervention effects in consideration of trends. Here, the trend includes trend components and seasonal components. Moreover, considering the trend means correcting the predicted value using the trend. The information processing device 1 predicts the destination of the direct message and the timing of sending the direct message based on the intervention effect. In other words, the information processing device 1 optimizes targeting, which is an element of decision-making in direct marketing, and extracts users who are expected to have high advertising effectiveness, that is, high affinity with the advertising content. Next, we predict the conditional intervention effect for each user. Furthermore, the information processing device 1 determines the destination of the direct message so that only users with whom it is expected to have a high affinity can receive the direct message. Therefore, in estimating the intervention effect (Uplift Modeling), which is the core process of this optimization, the information processing device 1 takes into account differences between individuals (consumer heterogeneity) and differences due to timing (temporal heterogeneity). By considering both of them together, more accurate Uplift Modeling can be performed.

ここで、個人別に広告によるＵｐｌｉｆｔの多寡を予測した場合のＵｐｌｉｆｔＭｏｄｅｌの精度は、理想的には、真のＵｐｌｉｆｔ値と予測値の二乗誤差などで測定することが可能である。しかしながら、実キャンペーンにおいてＵｐｌｉｆｔＭｏｄｅｌの精度を評価・測定しようとする場合は、先述の因果推論の根本問題により、直接的に二乗誤差などを測定することが不可能である。このような実キャンペーンにおけるＵｐｌｉｆｔＭｏｄｅｌの精度評価には、ＡＵＵＣ（ＡｒｅａＵｎｄｅｒｔｈｅＵｐｌｉｆｔＣｕｒｖｅ）などを用いることがある。しかしながら、本願における検証には、直接的に真のＵｐｌｉｆｔ値を知ることが可能な人工データを利用するため、通常の機械学習の評価に使われるようなＭＳＥ、ＭＡＥなどの指標を利用する。この場合、予想されるＵｐｌｉｆｔ値が、人工データの実際のＵｐｌｉｆｔ値に近いほど精度が高いとして評価する。 Here, the accuracy of the Uplift Model when predicting the amount of Uplift due to advertisements for each individual can ideally be measured by the square error between the true Uplift value and the predicted value. However, when trying to evaluate and measure the accuracy of the Uplift Model in an actual campaign, it is impossible to directly measure the squared error etc. due to the fundamental problem of causal inference described above. AUUC (Area Under the Uplift Curve) or the like may be used to evaluate the accuracy of the Uplift Model in such an actual campaign. However, since the verification in this application uses artificial data that allows one to directly know the true Uplift value, indicators such as MSE and MAE that are used in normal evaluation of machine learning are used. In this case, the closer the predicted Uplift value is to the actual Uplift value of the artificial data, the higher the accuracy is evaluated.

ここで、顧客情報は、ユーザ名、氏名、住所、電話番号、メールアドレス、性別、年齢、世帯構成などを含むデータである。購買情報は、ユーザごとの購買履歴のデータであり、購入日、ジャンル、商品名、購買店などを含むデータである。ＤＭ情報は、ダイレクトメッセージごとのＤＭ識別情報、ジャンル、依頼元、ターゲット、種別などを含むデータである。なお、顧客情報は、顧客の属性や顧客ごとの購買履歴等によってあらかじめグルーブ、サブグループに分類されていてもよい。 Here, the customer information is data including user name, name, address, telephone number, email address, gender, age, household composition, etc. The purchase information is data on the purchase history of each user, and includes the date of purchase, genre, product name, store of purchase, and the like. The DM information is data including DM identification information, genre, request source, target, type, etc. for each direct message. Note that the customer information may be classified in advance into groups and subgroups based on customer attributes, purchase history of each customer, and the like.

また、ダイレクトメッセージは、宛名を指定して送付するメール、カタログや手紙・はがきなどの郵便送付、営業電話などである。不特定多数を対象とする新聞折り込みチラシ、無記名ダイレクトメール、エリアポスティング、屋外広告、雑誌広告、新聞広告、ラジオコマーシャル、テレビコマーシャル、インターネットコマーシャルなどはダイレクトメッセージに含まれないものとする。また、トレンドは、季節要素、時期要素、流行要素などの時限的傾向要素（成分）である。 Furthermore, direct messages include e-mails sent by specifying an addressee, mailings such as catalogs, letters, and postcards, and sales calls. Direct messages do not include newspaper insert flyers, anonymous direct mail, area postings, outdoor advertisements, magazine advertisements, newspaper advertisements, radio commercials, television commercials, Internet commercials, etc. that are targeted at an unspecified number of people. Moreover, a trend is a time-limited trend element (component) such as a seasonal element, a time element, or a fashionable element.

＜情報処理装置＞
図２は、本実施形態に係る情報処理装置１の一例を示す機能ブロック図である。
情報処理装置１は、制御部１１０と、記憶部１３０と、入力部１４０と、出力部１５０と、を含んで構成される。 <Information processing device>
FIG. 2 is a functional block diagram showing an example of the information processing device 1 according to the present embodiment.
The information processing device 1 includes a control section 110, a storage section 130, an input section 140, and an output section 150.

記憶部１３０は、例えば、ハードディスクドライブ、ＳＳＤ、メモリなどの記憶装置である。記憶部１３０は、ファームウェアやアプリケーションプログラムなど、制御部１１０が実行するための各種プログラム、及び、制御部１１０が実行した処理の結果などを記憶する。 The storage unit 130 is, for example, a storage device such as a hard disk drive, SSD, or memory. The storage unit 130 stores various programs to be executed by the control unit 110, such as firmware and application programs, and results of processing executed by the control unit 110.

制御部１１０は、中央演算装置（ＣＰＵ）などのプロセッサである。制御部１１０は、例えば、入力部１４０から入力された入力情報と記憶部１３０が記憶する情報に基づいて、入力情報に対する出力情報を生成する。
具体的には、制御部１１０は、記憶部１３０に記憶された、購買情報と、ＤＭ情報と、送付先情報とを読み出し、読み出した情報に基づいて学習データセットを生成する。制御部１１０は、生成した学習データセットに基づいて個人別の条件付き介入効果を学習する。当該学習モデルは、例えば、ＵｐｌｉｆｔＭｏｄｅｌｉｎｇを用いる。詳細は後述する。 The control unit 110 is a processor such as a central processing unit (CPU). The control unit 110 generates output information for the input information, for example, based on the input information input from the input unit 140 and the information stored in the storage unit 130.
Specifically, the control unit 110 reads purchase information, DM information, and destination information stored in the storage unit 130, and generates a learning data set based on the read information. The control unit 110 learns individual conditional intervention effects based on the generated learning data set. The learning model uses Uplift Modeling, for example. Details will be described later.

制御部１１０は、顧客情報、購買情報、ＤＭ情報、送付先情報などの統計に基づいて、ユーザごと、すなわち、個人別の条件付き介入効果（ＣＡＴＥ）を推定する。また、制御部１１０は、Ｕｐｌｉｆｔ値と類似の傾向を示すと考えられる指標の季節成分・トレンド成分をモデル化し、Ｕｐｌｉｆｔ値の予測値を補正する。つまり、制御部１１０は、個人別の条件付き介入効果を推定し、また、将来の介入効果をＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅとして予測することで、個人別のＡｄｊｕｓｔｅｄＣＡＴＥを予測する。制御部１１０は、予測したＡｄｊｕｓｔｅｄＣＡＴＥに基づいて、ダイレクトメッセージの送付先を決定する。 The control unit 110 estimates the conditional intervention effect (CATE) for each user, that is, for each individual, based on statistics such as customer information, purchase information, DM information, and destination information. Further, the control unit 110 models the seasonal component/trend component of an index that is considered to show a similar trend to the Uplift value, and corrects the predicted value of the Uplift value. That is, the control unit 110 estimates the conditional intervention effect for each individual and predicts the future intervention effect as a Calibration Score, thereby predicting the Adjusted CATE for each individual. The control unit 110 determines the destination of the direct message based on the predicted Adjusted CATE.

入力部１４０は、例えば、キーボードやタッチパネルなどの入力装置である。入力部１４０は、ユーザ操作に基づく入力情報を受け付ける。入力部１４０は、受け付けた入力情報を制御部１１０に出力する。 The input unit 140 is, for example, an input device such as a keyboard or a touch panel. The input unit 140 receives input information based on user operations. The input unit 140 outputs the received input information to the control unit 110.

出力部１５０は、例えば表示部であり、制御部１１０から入力された出力情報を表示する。 The output unit 150 is, for example, a display unit, and displays output information input from the control unit 110.

なお、入力部１４０、出力部１５０に代えて、または加えて通信部（不図示）によって、外部装置からの入力情報を受け付け、また、外部装置への出力情報を出力してもよい。 Note that instead of or in addition to the input section 140 and the output section 150, a communication section (not shown) may accept input information from an external device and output information to the external device.

制御部１１０について、より詳細に説明する。 The control unit 110 will be explained in more detail.

制御部１１０は、取得部１１１と、学習部１１２と、予測部１１３と、出力処理部１１４と、を含んで構成される。
取得部１１１は、入力部１４０を介して入力情報を取得する。また、取得部１１１は、記憶部１３０に記憶された各種情報を読み出して取得する。取得部１１１は、取得した情報を、学習部１１２、予測部１１３に出力する。 The control unit 110 includes an acquisition unit 111, a learning unit 112, a prediction unit 113, and an output processing unit 114.
The acquisition unit 111 acquires input information via the input unit 140. Further, the acquisition unit 111 reads and acquires various information stored in the storage unit 130. The acquisition unit 111 outputs the acquired information to the learning unit 112 and the prediction unit 113.

学習部１１２は、取得部１１１が取得した情報、および記憶部１３０に記憶された情報に基づいて、機械学習する。具体的には、学習部１１２は、記憶部１３０に記憶された購買情報と、ＤＭ情報と、送付先情報とを読み出し、読み出した情報に基づいて学習データセットを生成する。学習部１１２は、生成した学習データセットに基づいて個人別の条件付き介入効果を学習する。ここで、個人別の条件付き介入効果の学習モデルには、ＵｐｌｉｆｔＭｏｄｅｌｉｎｇを用いる。 The learning unit 112 performs machine learning based on the information acquired by the acquisition unit 111 and the information stored in the storage unit 130. Specifically, the learning unit 112 reads purchase information, DM information, and destination information stored in the storage unit 130, and generates a learning data set based on the read information. The learning unit 112 learns individual conditional intervention effects based on the generated learning data set. Here, Uplift Modeling is used as a learning model for individual conditional intervention effects.

ＵｐｌｉｆｔＭｏｄｅｌｉｎｇは、ｉＲＯＡＳを最大化するために個人ごとの介入効果をモデル化・予測するモデルである。本実施形態では、消費者異質性及び時間的異質性を要素として加味し、状態空間モデルのような複雑なモデル設定を行うことなく将来の個人ごとの介入効果である個人別Ｕｐｌｉｆｔ値を予測する。 Uplift Modeling is a model that models and predicts intervention effects for each individual in order to maximize iROAS. In this embodiment, consumer heterogeneity and temporal heterogeneity are taken into consideration as factors, and the individual Uplift value, which is the future intervention effect for each individual, is predicted without setting up a complicated model such as a state space model. .

本実施形態では、このような性質を満たすモデルとして、ＣＡＬＩＢＲＡＴＥＤＣＡＴＥＥｓｔｉｍａｔｏｒを用いる。ＣＡＬＩＢＲＡＴＥＤＣＡＴＥＥｓｔｉｍａｔｏｒには、２つのモデルが含まれる。１つ目のモデルは、過去データから消費者異質性を学習するＣＡＴＥＥｓｔｉｍａｔｏｒである。２つ目のモデルは、過去データにおけるＵｐｌｉｆｔの時間経過に伴う振る舞いの変化を捉え、将来の予測したい時点におけるＵｐｌｉｆｔの水準を示すＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒである。制御部１１０は、ＣＡＴＥＥｓｔｉｍａｔｏｒの個人別Ｕｐｌｉｆｔ値を、ＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒによって得られるＣＡＴＥＥｓｔｉｍａｔｏｒの学習時点と予測時点とにおけるＵｐｌｉｆｔ水準の差である、ＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅによって補正する。これにより、制御部１１０は、ＣＡＴＥＥｓｔｉｍａｔｏｒの学習時点から予測時点における時間的異質性を加味した予測を行う。ここで、ＣＡＴＥは、ＣｏｎｄｉｔｉｏｎａｌＡｖｅｒａｇｅＴｒｅａｔｍｅｎｔＥｆｆｅｃｔ、すなわち条件付き介入効果のことであり、時間的異質性を含む。 In this embodiment, a CALIBRATED CATE Estimator is used as a model that satisfies these properties. The CALIBRATED CATE Estimator includes two models. The first model is CATE Estimator, which learns consumer heterogeneity from past data. The second model is Timeseries Forecaster, which captures changes in Uplift behavior in past data over time and indicates the level of Uplift at a desired point in the future. The control unit 110 corrects the individual Uplift value of the CATE Estimator using a Calibration Score, which is the difference in the Uplift level between the CATE Estimator's learning time and the prediction time obtained by the Timeseries Forecaster. Thereby, the control unit 110 performs prediction that takes into account temporal heterogeneity from the CATE Estimator learning time point to the prediction time point. Here, CATE refers to Conditional Average Treatment Effect, ie, conditional intervention effect, and includes temporal heterogeneity.

消費者異質性を加味したＵｐｌｉｆｔＭｏｄｅｌｉｎｇは、その精度やデータに対する制約は様々ながらも多くの手法が存在する。本実施形態では、Ｍｅｔａ－Ｌｅａｒｎｅｒの一種であるＸ－ＬｅａｒｎｅｒをＣＡＴＥＥｓｔｉｍａｔｏｒとして用い、消費者異質性を推定する。なお、本発明は、適用範囲をＸ－Ｌｅａｒｎｅｒに限定するものではなく、条件付き介入効果（ＣＡＴＥ）を推定するあらゆるアルゴリズムを利用することが可能である。この性質により、分析者は、自らが利用するデータセットの量は複雑さに応じて、或いは自身の慣れ親しんだアルゴリズムを選択して利用することができる。 There are many methods for Uplift Modeling that takes into account consumer heterogeneity, although their accuracy and constraints on data vary. In this embodiment, consumer heterogeneity is estimated using X-Learner, which is a type of Meta-Learner, as a CATE Estimator. Note that the scope of application of the present invention is not limited to X-Learner, and any algorithm for estimating conditional intervention effect (CATE) can be used. Due to this property, analysts can use the amount of data set they use depending on its complexity, or select and use an algorithm with which they are familiar.

ＭｅｔａＬｅａｒｎｅｒは、単一のアルゴリズムの中で介入効果を推定するのではなく、通常の機械学習モデルの結果に対してメタ的な操作を加えることで、介入効果を推定する。ＭｅｔａＬｅａｒｎｅｒには、Ｔ－ｌｅａｒｎｅｒ、Ｓ－ｌｅａｒｎｅｒ、Ｘ－ｌｅａｒｎｅｒなどがある。Ｔ－ｌｅａｒｎｅｒは、介入を受けたグループだけで学習されたモデルと、介入を受けないグループだけで学習されたモデルの出力を比較することで介入効果を推定する。Ｓ－ｌｅａｒｎｅｒは、対象群と処置群双方を同時に学習し、介入条件ｗをｗ＝０とした時の出力とｗ＝１とした時の出力を比較する。Ｘ－ｌｅａｒｎｅｒは、Ｔ－ｌｅａｒｎｅｒの方に介入グループごとにモデリングをした後、介入グループごとに処置効果を求め、予測したい個体のＣＡＴＥの算出のために、両モデルのＣＡＴＥを、それぞれのグループに対する傾向スコアで重み付けを行う。 Meta Learner does not estimate the intervention effect within a single algorithm, but rather estimates the intervention effect by adding meta operations to the results of a regular machine learning model. Meta learners include T-learner, S-learner, and X-learner. T-learner estimates the intervention effect by comparing the outputs of a model learned only by a group that received the intervention and a model learned only by a group that did not receive the intervention. S-learner simultaneously learns both the target group and the treatment group, and compares the output when the intervention condition w is set to w=0 and the output when w=1. X-learner models the T-learner for each intervention group, then calculates the treatment effect for each intervention group, and calculates the CATE of the individual you want to predict by combining the CATE of both models for each group. Weight by propensity score.

Ｍｅｔａ－Ｌｅａｒｎｅｒの特徴は、介入群と非介入群のデータ量が不均衡な場合であっても良好な推定精度が期待できる点である。また、Ｍｅｔａ－Ｌｅａｒｎｅｒの特徴は、ベースとなる学習器を自由に指定できるため、データの特性に合ったアルゴリズムを利用することができる点にもある。後者は、例えば線形性が仮定できるデータ量が少ないケースでは線形モデルを用い、逆に非線形な関係が期待されデータ量が多い場合は勾配ブースティングなどの表現力の高いアルゴリズムを利用するなどの使い分けができる。 A feature of Meta-Learner is that good estimation accuracy can be expected even when the amount of data between the intervention group and the non-intervention group is imbalanced. Another feature of Meta-Learner is that since the base learning device can be freely specified, an algorithm that matches the characteristics of the data can be used. The latter can be used in different ways, such as using a linear model when there is a small amount of data where linearity can be assumed, and conversely, using a highly expressive algorithm such as gradient boosting when a nonlinear relationship is expected and the amount of data is large. Can be done.

時間的異質性の推定は、一般的な時系列データ解析、ＳＡＲＩＭＡ、ＥＴＳ、Ｐｒｏｐｈｅｔなどを用いることが可能である。本実施形態では、季節トレンドや外生変数の取り込みやその解釈が容易で、精度も良好なＰｒｏｐｈｅｔをＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒとして用いる場合について説明する。 Temporal heterogeneity can be estimated using general time series data analysis, SARIMA, ETS, Prophet, etc. In this embodiment, a case will be described in which Prophet is used as the Timeseries Forecaster because it is easy to capture and interpret seasonal trends and exogenous variables, and has good accuracy.

ＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒにおいてトレンドや季節性を予測するためには、予測対象の過去の一定期間のデータが必要である。この時、過去データには介入があった期間と介入がなかった期間が入り交じることになる。ＣＣＥＰＴＡでは、並行トレンドの仮定を置く指標が介入の影響を受ける場合、本来のトレンドとは関係ない、介入による指標の変動がトレンド推定に含まれてしまうことがある。対応策としては、トレンド推定時に介入の有無やこの規模を外生変数として加えることも可能である。しかしながら、介入回数が１～２回しかなかったり、特定の曜日などに集中していたりした場合、介入による影響を正確に本来のトレンド変動から排除するのは困難である。また、介入を受けていないデータのみでトレンドを推定する場合には、介入がない期間で全データの平均を用いることになるが、介入があった期間は、介入がなかったデータの平均となってしまう。このため、介入前後で平均を算出するための前提となる母集団の分布が変化することによるバイアスを生じる。 In order to predict trends and seasonality in Timeseries Forecaster, data for a certain period of time in the past to be predicted is required. At this time, past data will include periods in which there was an intervention and periods in which there was no intervention. In CCEPTA, if an indicator on which parallel trends are assumed is affected by an intervention, the trend estimation may include changes in the indicator due to the intervention that are unrelated to the original trend. As a countermeasure, it is also possible to add the presence or absence of intervention and its scale as exogenous variables when estimating trends. However, if there are only one or two interventions, or if they are concentrated on a specific day of the week, it is difficult to accurately exclude the influence of the intervention from the original trend fluctuations. Furthermore, when estimating a trend using only data that did not receive an intervention, the average of all data during the period without the intervention is used, but during the period with the intervention, the average of the data without the intervention is used. I end up. For this reason, bias occurs due to changes in the distribution of the population, which is the basis for calculating the average, before and after the intervention.

このバイアスを回避するために、トレンドや季節性の予測には、ＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒの学習に用いるデータに対するＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇ（式（１））を用いる。介入が発生していないタイムウィンドウにおいては、介入を受ける確率であるｐｒｏｐｅｎｓｉｔｙｓｃｏｒｅ，ｅ（Ｘ）を１から減じた確率でサンプリングする。介入期間においては、介入を受けなかった全ての個体をサンプルリングする。これにより、情報処理装置１は、学習期間に介入がある期間と介入がない期間が混在している場合においてもバイアスのないトレンドの推定が可能となる。 In order to avoid this bias, Propensity Based Sampling (Equation (1)) is used for the data used for learning the Timeseries Forecaster to predict trends and seasonality. In a time window in which no intervention occurs, sampling is performed with a probability that is the probability of receiving an intervention, propensity score, e(X), which is subtracted from 1. During the intervention period, all individuals who did not receive the intervention will be sampled. Thereby, the information processing device 1 can estimate trends without bias even when the learning period includes periods with intervention and periods without intervention.

ＣＡＴＥＥｓｔｉｍａｔｏｒにより一時点で学習された予測値を、時間的異質性を加味する形で変換することは、ＤａｔａｓｅｔＳｈｉｆｔの一種であるＣｏｎｃｅｐｔＳｈｉｆｔの問題として捉えることができる。ＣｏｎｃｅｐｔＳｈｉｆｔの定義は式（２）の通り、学習時と予測時とにおいて、ｘ（共変量）で条件付けた時のｙ（ターゲット）の分布、またｙで条件付けた時のｘの分布が変化している状況である。 Converting the predicted value learned at one point by the CATE Estimator in a manner that takes into account temporal heterogeneity can be considered as a problem of Concept Shift, which is a type of Dataset Shift. Concept Shift is defined as Equation (2), where the distribution of y (target) when conditioned by x (covariate) and the distribution of x when conditioned by y change between learning and prediction. This is the situation.

ＤａｔａｓｅｔＳｈｉｆｔが発生している際には、一般的にはモデルの再トレーニングを頻繁に実施したり、直近の学習サンプルにより強い重みを付けて学習させたりする。本実施形態では、将来時点であるＰｔｅｓｔ（ｙ｜ｘ）の水準を予測することで、Ｐｔｒａｉｎｉｎｇ（ｙ｜ｘ）を変換してＰｔｅｓｔ（ｙ｜ｘ）と一致させるスコアｚを算出することで、時間的異質性を考慮する際のＣｏｎｃｅｐｔＳｈｉｆｔの問題を対処する。 When a Dataset Shift occurs, the model is generally retrained frequently or the most recent training samples are given stronger weight for learning. In this embodiment, by predicting the level of Ptest(y|x) at a future time, Ptraining(y|x) is converted to calculate a score z that matches Ptest(y|x) Addressing Concept Shift issues when considering temporal heterogeneity.

なお、式（３）では、予測結果を補正するためにＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを利用しているが、それ以外にも過去データの学習時にＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅによって補正した値を学習することも可能である。 Note that in equation (3), Calibration Score is used to correct the prediction result, but it is also possible to learn a value corrected by Calibration Score when learning past data.

この場合、スコアｚの算出に用いるｘ_ｔｓと、Ｙの予測に用いるｘ_ｃａｔｅが異なるものである。個人別の細かなｕｐｌｉｆｔ値（効果）を予測するためのｘ_ｃａｔｅは、通常の機械学習のプロセスのように、有効と考えられる特徴量は基本的に全て含めることが望ましい。ＵｎｃｏｎｆｏｕｎｄｅｄｎｅｓｓをＣＡＴＥ算出の前提としている場合は、未観測の交絡因子が無いことを担保する上でも、特徴量の範囲は広い方が望ましい。 In this case, x _ts used to calculate the score z and x _cate used to predict Y are different. It is desirable that the x _cate for predicting detailed uplift values (effects) for each individual basically include all feature quantities considered to be effective, as in a normal machine learning process. If Unconfoundedness is a premise for CATE calculation, it is desirable that the range of feature values be wide in order to ensure that there are no unobserved confounding factors.

一方で、ＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅ（スコアｚ）を算出する際のｘ_ｔｓは、「どの単位で時系列トレンドが異なるか」というグルーピングの単位に過ぎない。ここで、ｘ_ｔｓを必要以上に細かくしても、スコアｚの推定精度には寄与せず、過学習を引き起こす可能性もある。ｘ_ｔｓの例としては、旅行関連のマーケティングの場合には夏休み、冬休みに対する時間的異質性が異なるグルーピングとして、「小中学生の子供の有無」などの変数を含めることも可能である。 On the other hand, x _ts when calculating the Calibration Score (score z) is only a grouping unit that indicates "in which unit the time series trends differ." Here, even if x _ts is made smaller than necessary, it will not contribute to the estimation accuracy of the score z and may cause overfitting. As an example of x _ts , in the case of travel-related marketing, it is also possible to include a variable such as "presence or absence of children in elementary and junior high school" as a grouping with different temporal heterogeneity with respect to summer vacation and winter vacation.

本実施形態で用いる時間的異質性の推定には、利用可能なデータの制約からＣＡＬＩＢＲＡＴＥＤＣＡＴＥＥｓｔｉｍａｔｏｒｗｉｔｈＵｐｌｉｆｔＴｒｅｎｄ（ＣＣＥＵＴ）と、ＣＡＬＩＢＲＡＴＥＤＣＡＴＥＥｓｔｉｍａｔｏｒｗｉｔｈＰａｒａｌｌｅｌＴｒｅｎｄＡｓｓｕｍｐｔｉｏｎ（ＣＣＥＰＴＡ）を用いる。 To estimate the temporal heterogeneity used in this embodiment, due to the constraints of available data, we use CALIBRATED CATE Estimator with Uplift Trend (CCEUT) and CALIBRATED CATE Estimator with Parallel Trend Assump. tion (CCEPTA).

ＣＣＥＵＴは、Ｕｐｌｉｆｔ値自体の推移を時系列データとして扱い、そのトレンドをモデル化することでＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを算出する手法である。ＣＣＥＵＴは、Ｕｐｌｉｆｔ値の推定に足るデータが、ある程度複数期間にまたがって利用可能な場合に適用可能である。 CCEUT is a method of calculating a Calibration Score by treating the transition of the Uplift value itself as time series data and modeling the trend. CCEUT is applicable when data sufficient for estimating the Uplift value is available over multiple periods to some extent.

一方、ＣＣＥＰＴＡは、Ｕｐｌｉｆｔ値自体とは別の観察可能な指標（Ｃｏｎｔｒｏｌ群の売上水準など）がＵｐｌｉｆｔ値のトレンドと並行トレンドを持つと仮定し、当該指標の推移を時系列データとしてトレンドをモデル化することでＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを算出する手法である。 On the other hand, CCEPTA assumes that an observable indicator other than the Uplift value itself (such as the sales level of the Control group) has a parallel trend with the Uplift value trend, and models the trend using the changes in the indicator as time series data. This is a method of calculating the Calibration Score by

ＣＣＥＵＴについてより詳細に説明する。
図３は、本実施形態で用いるＣＣＥＵＴの一例を示す概略図である。
ＣＣＥＵＴは、ＣＡＴＥＥｓｔｉｍａｔｏｒとして用いるＭｅｔａＬｅａｒｎｅｒによって推定される個人別の条件付き介入効果（ＣＡＴＥ）に対する補正値であるＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを算出するため手法である。ＣＣＥＵＴは、過去のキャンペーンのＵｐｌｉｆｔ実績から、Ｕｐｌｉｆｔの季節成分、トレンド成分をモデル化することで、個人別のＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅ、すなわち、未来の介入効果を予測する。ＭｅｔａＬｅａｒｎｅｒによって推定されるＣＡＴＥに対して、ＣＣＥＵＴによって算出されるＣｌｉｂｒａｔｉｏｎＳｃｏｒｅを乗算することでＡｄｊｕｓｔｅｄＣＡＴＥを得る。 CCEUT will be explained in more detail.
FIG. 3 is a schematic diagram showing an example of a CCEUT used in this embodiment.
CCEUT is a method for calculating a Calibration Score, which is a correction value for the individual conditional intervention effect (CATE) estimated by Meta Learner used as a CATE Estimator. CCEUT predicts individual Calibration Scores, that is, future intervention effects, by modeling Uplift seasonal components and trend components from Uplift results of past campaigns. Adjusted CATE is obtained by multiplying the CATE estimated by Meta Learner by the Clibration Score calculated by CCEUT.

次いで、ＣＣＥＰＴＡについてより詳細に説明する。
図４は、本実施形態で用いるＣＣＥＰＴＡの一例を示す概略図である。
ＣＣＥＰＴＡは、ＣＡＴＥＥｓｔｉｍａｔｏｒとして用いるＭｅｔａＬｅａｒｎｅｒによって推定される個人別の条件付き介入効果（ＣＡＴＥ）に対する補正値であるＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを算出するため手法である。ＣＣＥＰＴＡは、Ｕｐｌｉｆｔ値と類似の傾向を示すと考えられる指標の季節成分、トレンド成分をモデル化することで、個人別のＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅ、すなわち、未来の介入効果を予測する。ＭｅｔａＬｅａｒｎｅｒによって推定されるＣＡＴＥに対して、ＣＣＥＰＴＡによって算出されるＣｌｉｂｒａｔｉｏｎＳｃｏｒｅを乗算することでＡｄｊｕｓｔｅｄＣＡＴＥを得る。 Next, CCEPTA will be explained in more detail.
FIG. 4 is a schematic diagram showing an example of CCEPTA used in this embodiment.
CCEPTA is a method for calculating a Calibration Score, which is a correction value for the individual conditional intervention effect (CATE) estimated by Meta Learner used as a CATE Estimator. CCEPTA predicts an individual's Calibration Score, that is, the future intervention effect, by modeling the seasonal component and trend component of an index that is considered to show a similar trend to the Uplift value. Adjusted CATE is obtained by multiplying the CATE estimated by Meta Learner by the Citation Score calculated by CCEPTA.

ＣＣＥＰＴＡは、Ｕｐｌｉｆｔ値自体の推移を時系列データとして扱えるほど潤沢に過去データが存在しない場合にも適用可能である。並行トレンドの仮定は、一見強い制約となり得る。しかしながら、例えば製品のプロモーション効果を考えた場合、全く需要がないタイミングでプロモーションを行ってもその効果は限られる。一方で、一定の需要があるタイミングでプロモーションを行った場合にはより大きなＵｐｌｉｆｔ効果を見込むことができる。 CCEPTA can be applied even when there is not enough past data to treat the transition of the Uplift value itself as time series data. The assumption of parallel trends can be a seemingly strong constraint. However, when considering the promotional effects of a product, for example, the effect will be limited even if the promotion is carried out at a time when there is no demand at all. On the other hand, if the promotion is carried out at a time when there is a certain level of demand, a larger uplift effect can be expected.

このようなケースを考えると、例えば、プロモーションを除いた素の需要トレンドは、プロモーション効果のトレンドとある程度は正の相関を持つことが予想される。ドメイン知識をもつ実務家であれば、そのようなＣＣＥＰＴＡに好ましい特性を持つ観測可能な指標を見つけることは、ＣＣＥＵＴのために反復的なキャンペーンの効果検証とデータ蓄積を行うことよりも容易である。ただし、例外として、素の需要トレンドが高まり過ぎて、もはやプロモーションによるＵｐｌｉｆｔの余地がなくなるような局面においては、素の需要トレンドとプロモーション効果が負の相関関係となるような可能性もある。このように、一定条件下で並行トレンドの仮定をすることが困難な状況もあるため、並行トレンドの仮定が本当に成り立つのかを慎重に判断する必要がある。
本実施形態では、ＣＣＥＰＴＡを用いる場合の一例について説明する。 Considering such a case, for example, it is expected that the raw demand trend excluding promotions will have a positive correlation to some extent with the trend of promotion effects. For practitioners with domain knowledge, finding observable indicators with favorable characteristics for such CCEPTA is easier than iteratively testing the effectiveness of campaigns and accumulating data for CCEUT. . However, as an exception, in a situation where the raw demand trend has increased so much that there is no longer any room for uplift through promotions, there is a possibility that the raw demand trend and the promotion effect will have a negative correlation. In this way, there are situations in which it is difficult to assume parallel trends under certain conditions, so it is necessary to carefully judge whether the assumption of parallel trends really holds true.
In this embodiment, an example in which CCEPTA is used will be described.

図２に戻って、学習部１１２は、顧客情報、購買情報、ＤＭ情報、送付先情報に基づいて機械学習し、個人別の条件付き介入効果を学習する。学習部１１２は、学習結果を記憶部１３０に記憶させる。また、学習部１１２は、過去のキャンペーンのＵｐｌｉｆｔ実績からＵｐｌｉｆｔの季節成分、トレンド成分を学習することでモデル化し、個人別のＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを学習する。 Returning to FIG. 2, the learning unit 112 performs machine learning based on customer information, purchase information, DM information, and destination information to learn conditional intervention effects for each individual. The learning unit 112 causes the storage unit 130 to store the learning results. Further, the learning unit 112 models Uplift by learning seasonal components and trend components from Uplift results of past campaigns, and learns individual Calibration Scores.

予測部１１３は、学習部１１２による学習結果に基づいて、個人別ＣＡＴＥおよびＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅによってＡｄｊｕｓｔｅｄＣＡＴＥを算出する。予測部１１３は、ＡｄｊｕｓｔｅｄＣＡＴＥに基づいて、ダイレクトメッセージの送付先を決定し、記憶部１３０に記憶させる。 The prediction unit 113 calculates Adjusted CATE based on the learning result by the learning unit 112 using the individual CATE and the Calibration Score. The prediction unit 113 determines the destination of the direct message based on the Adjusted CATE, and stores it in the storage unit 130.

出力処理部１１４は、記憶部１３０に記憶された送付先情報を読み出して、出力部１５０に出力させる。 The output processing unit 114 reads out destination information stored in the storage unit 130 and causes the output unit 150 to output the destination information.

次いで、記憶部１３０について、より詳細に説明する。 Next, the storage unit 130 will be explained in more detail.

記憶部１３０は、顧客情報記憶部１３１と、購買情報記憶部１３２と、ＤＭ情報記憶部１３３と、送付先情報記憶部１３４と、学習結果記憶部１３５と、を含んで構成される。
顧客情報記憶部１３１は、顧客情報を記憶する。顧客情報は、識別情報、ユーザ名、氏名、住所、電話番号、メールアドレス、性別、年齢、世帯構成などが対応付けられた情報である。図５を参照して説明する。 The storage unit 130 includes a customer information storage unit 131, a purchase information storage unit 132, a DM information storage unit 133, a destination information storage unit 134, and a learning result storage unit 135.
The customer information storage unit 131 stores customer information. The customer information is information in which identification information, user name, name, address, telephone number, email address, gender, age, household structure, etc. are associated. This will be explained with reference to FIG.

図５は、顧客情報記憶部１３１が記憶する顧客情報の構成の一例を示すテーブルである。
図示する例では、識別情報に、ユーザ名、氏名、住所、電話番号、メールアドレス、性別、年齢、世帯構成が対応付けられている。
識別情報は、ユーザを識別するための情報である。ユーザ名は、ユーザの登録名である。氏名は、ユーザの氏名である。住所は、ユーザの住所である。電話番号は、ユーザの電話番号である。メールアドレスは、ユーザの電子メールアドレスである。性別、年齢は、ユーザの性別および年齢である。世帯構成は、ユーザが属する世帯の世帯構成である。
これらの各項目の情報は、例えば、各ユーザによって予め登録された情報である。 FIG. 5 is a table showing an example of the structure of customer information stored in the customer information storage unit 131.
In the illustrated example, the identification information is associated with a user name, name, address, telephone number, email address, gender, age, and household structure.
Identification information is information for identifying a user. The user name is the user's registered name. The name is the user's name. The address is the user's address. The telephone number is the user's telephone number. The email address is the user's email address. Gender and age are the user's gender and age. The household composition is the household composition of the household to which the user belongs.
Information on each of these items is, for example, information registered in advance by each user.

図２に戻って、購買情報記憶部１３２は、購買情報を記憶する。購買情報は、顧客ごとの購買履歴を示す情報であり、購入日、ジャンル、商品名、購入店などの情報が対応付けられた情報である。図６を参照して説明する。 Returning to FIG. 2, the purchase information storage unit 132 stores purchase information. Purchase information is information indicating the purchase history of each customer, and is information in which information such as purchase date, genre, product name, and store of purchase are associated with each other. This will be explained with reference to FIG.

図６は、購買情報記憶部１３２が記憶する購買情報の構成の一例を示すテーブルである。
図示する例では、ユーザを識別する識別情報ごとに、購入日、ジャンル、商品名、購入店が対応付けられている。
購入日は、商品を購入した日付である。ジャンルは、商品のジャンルである。商品名は、商品の名称である。購入店は、商品を購入した店舗などである。
これらの各項目の情報は、各ユーザによって予め登録されてもよいし、決済履歴情報等に基づいて作成されてもよい。 FIG. 6 is a table showing an example of the configuration of purchase information stored in the purchase information storage unit 132.
In the illustrated example, each piece of identification information that identifies a user is associated with a purchase date, genre, product name, and store of purchase.
The purchase date is the date on which the product was purchased. The genre is the genre of the product. The product name is the name of the product. The purchase store is the store where the product was purchased.
Information on each of these items may be registered in advance by each user, or may be created based on payment history information or the like.

図２に戻って、ＤＭ情報記憶部１３３は、ＤＭ情報を記憶する。ＤＭ情報は、ダイレクトメッセージの情報であり、ＤＭ識別情報、ジャンル、依頼元、ターゲット、種別などの情報が対応付けられた情報である。図７を参照して説明する。 Returning to FIG. 2, the DM information storage unit 133 stores DM information. DM information is direct message information, and is information in which DM identification information, genre, request source, target, type, and other information are associated with each other. This will be explained with reference to FIG.

図７は、ＤＭ情報記憶部１３３が記憶するＤＭ情報の構成の一例を示すテーブルである。
図示する例では、ＤＭ識別情報に、ジャンル、依頼元、ターゲット、種別が対応づけられている。
ＤＭ識別情報は、ダイレクトメッセージを識別する識別情報である。ジャンルは、ダイレクトメッセージが関連するジャンルである。依頼元は、ダイレクトメッセージの送付を依頼する依頼元である。ターゲットは、ダイレクトメッセージを送付するターゲットである。種別は、ダイレクトメッセージの送付方法である。
これらの各項目の情報は、依頼元からの依頼によって作成される。 FIG. 7 is a table showing an example of the structure of DM information stored in the DM information storage section 133.
In the illustrated example, genre, request source, target, and type are associated with DM identification information.
DM identification information is identification information that identifies a direct message. The genre is the genre to which the direct message is related. The request source is a request source that requests sending of a direct message. The target is the target to whom the direct message is sent. The type is the direct message sending method.
Information on each of these items is created based on a request from the requester.

図２に戻って、送付先情報記憶部１３４は、送付先情報を記憶する。送付先情報は、ＤＭ識別情報ごとに、送付先となるユーザの識別情報が対応付けられた情報である。図８を参照して説明する。 Returning to FIG. 2, destination information storage section 134 stores destination information. The destination information is information in which identification information of a user who is a destination is associated with each DM identification information. This will be explained with reference to FIG.

図８は、送付先情報記憶部１３４が記憶する送付先情報の構成の一例を示すテーブルである。
図示する例では、ＤＭ識別情報ごとに、送付先となるユーザの識別情報が対応付けられている。
当該送付先情報は、予測部１１３によってダイレクトメッセージの送付先候補として決定されることで生成される。 FIG. 8 is a table showing an example of the structure of the destination information stored in the destination information storage unit 134.
In the illustrated example, each piece of DM identification information is associated with the identification information of the user who is the destination.
The destination information is generated by being determined by the prediction unit 113 as a destination candidate for the direct message.

図２に戻って、学習結果記憶部１３５は、学習部１１２による学習結果を記憶する。 Returning to FIG. 2, the learning result storage unit 135 stores the learning results by the learning unit 112.

＜ハードウェア構成＞
次いで、情報処理装置１のハードウェア構成について説明する。図９は、本実施形態に係る情報処理装置１のハードウェア構成の一例を示すブロック図である。
情報処理装置１は、ＣＰＵ１１と、ドライブ部１２と、記憶媒体１３と、入力部１４と、出力部１５と、ＲＯＭ１６（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）と、ＲＡＭ１７（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）と、補助記憶部１８と、インターフェース部１９と、を備える。ＣＰＵ１１と、ドライブ部１２と、入力部１４と、出力部１５と、ＲＯＭ１６と、ＲＡＭ１７と、補助記憶部１８と、インターフェース部１９とは、バスを介して相互に接続される。 <Hardware configuration>
Next, the hardware configuration of the information processing device 1 will be explained. FIG. 9 is a block diagram showing an example of the hardware configuration of the information processing device 1 according to the present embodiment.
The information processing device 1 includes a CPU 11, a drive section 12, a storage medium 13, an input section 14, an output section 15, a ROM 16 (Read Only Memory), a RAM 17 (Random Access Memory), and an auxiliary storage section 18. , and an interface section 19. The CPU 11, the drive section 12, the input section 14, the output section 15, the ROM 16, the RAM 17, the auxiliary storage section 18, and the interface section 19 are interconnected via a bus.

なお、ここで言うＣＰＵ１１は、プロセッサ一般のことを示すものであって、狭義のいわゆるＣＰＵと呼ばれるデバイスのことだけではなく、例えばＧＰＵやＤＳＰ等も含む。また、ここで言うＣＰＵ１１は、一つのプロセッサで実現されることに限られず、同じ、または異なる種類の複数のプロセッサを組み合わせることで実現されてもよい。 Note that the CPU 11 here refers to a processor in general, and includes not only a device called a CPU in a narrow sense, but also includes, for example, a GPU, a DSP, and the like. Furthermore, the CPU 11 referred to herein is not limited to being implemented by one processor, but may be implemented by combining multiple processors of the same or different types.

ＣＰＵ１１は、補助記憶部１８、ＲＯＭ１６およびＲＡＭ１７が記憶するプログラムを読み出して実行し、また、補助記憶部１８、ＲＯＭ１６およびＲＡＭ１７が記憶する各種データを読み出し、補助記憶部１８、ＲＡＭ１７に対して各種データを書き込むことにより、情報処理装置１を制御する。また、ＣＰＵ１１は、ドライブ部１２を介して記憶媒体１３が記憶する各種データを読み出し、また、記憶媒体１３に各種データを書き込む。記憶媒体１３は、光磁気ディスク、フレキシブルディスク、フラッシュメモリなどの可搬記憶媒体であり、各種データを記憶する。 The CPU 11 reads and executes programs stored in the auxiliary storage unit 18, ROM 16, and RAM 17, reads out various data stored in the auxiliary storage unit 18, ROM 16, and RAM 17, and writes various data to the auxiliary storage unit 18 and RAM 17. By writing , the information processing device 1 is controlled. Further, the CPU 11 reads various data stored in the storage medium 13 via the drive section 12, and writes various data to the storage medium 13. The storage medium 13 is a portable storage medium such as a magneto-optical disk, a flexible disk, or a flash memory, and stores various data.

ドライブ部１２は、光ディスクドライブ、フレキシブルディスクドライブなどの記憶媒体１３の読み出し装置である。 The drive section 12 is a reading device for a storage medium 13 such as an optical disk drive or a flexible disk drive.

入力部１４は、マウス、キーボード、タッチパネル、電源ボタン、設定ボタンなどの入力装置である。 The input unit 14 is an input device such as a mouse, a keyboard, a touch panel, a power button, and a setting button.

出力部１５は、表示部、スピーカなどの出力装置である。 The output unit 15 is an output device such as a display unit or a speaker.

ＲＯＭ１６、ＲＡＭ１７は、表示装置１０の各機能部を動作させるためのプログラムや各種データを記憶する。 The ROM 16 and the RAM 17 store programs and various data for operating each functional section of the display device 10.

補助記憶部１８は、ハードディスクドライブ、フラッシュメモリなどであり、情報処理装置１の各機能部を動作させるためのプログラム、各種データを記憶する。 The auxiliary storage unit 18 is a hard disk drive, a flash memory, or the like, and stores programs and various data for operating each functional unit of the information processing device 1.

インターフェース部１９は、通信インターフェースを有し、有線または無線によりネットワークＮＷに接続される。 The interface unit 19 has a communication interface and is connected to the network NW by wire or wirelessly.

例えば、後述する図２における情報処理装置１の機能構成における制御部１１０は、図９におけるＣＰＵ１１に対応する。また、図２における入力部１４０は、図９における入力部１４に対応する。また、図２における出力部１５０は、図９における出力部１５に対応する。また、図２における記憶部１３０は、図９における記憶媒体１３に対応する。 For example, the control unit 110 in the functional configuration of the information processing device 1 in FIG. 2, which will be described later, corresponds to the CPU 11 in FIG. Further, the input section 140 in FIG. 2 corresponds to the input section 14 in FIG. 9. Further, the output section 150 in FIG. 2 corresponds to the output section 15 in FIG. 9. Furthermore, the storage unit 130 in FIG. 2 corresponds to the storage medium 13 in FIG. 9.

＜動作について＞
図１０は、本実施形態に係る情報処理装置１の動作の一例を示すフローチャートである。この図は、学習段階における情報処理装置１の動作を示す。 <About operation>
FIG. 10 is a flowchart illustrating an example of the operation of the information processing device 1 according to this embodiment. This figure shows the operation of the information processing device 1 in the learning stage.

（ステップＳ１００）取得部１１１は、記憶部１３０から顧客情報、ＤＭ情報、購買情報、送付先情報、学習結果、を読み出す。取得部１１１は処理が終了すると処理をステップＳ１０２に進める。
（ステップＳ１０２）取得部１１１は、読み出した各情報に基づいて、学習データセットを作成し、記憶部１３０に記憶させる。取得部１１１は処理が終了すると、処理をステップＳ１０４に進める。
（ステップＳ１０４）学習部１１２は、記憶部１３０から学習データセットを読み出し、読み出した情報に基づいて学習を行う。学習部１１２は、学習が終了すると、学習結果を記憶部１３０に記憶させる。その後、本図の動作を終了する。 (Step S100) The acquisition unit 111 reads customer information, DM information, purchase information, destination information, and learning results from the storage unit 130. Upon completion of the process, the acquisition unit 111 advances the process to step S102.
(Step S102) The acquisition unit 111 creates a learning data set based on each piece of read information, and stores it in the storage unit 130. When the acquisition unit 111 completes the process, it advances the process to step S104.
(Step S104) The learning unit 112 reads the learning data set from the storage unit 130 and performs learning based on the read information. When the learning is completed, the learning unit 112 causes the storage unit 130 to store the learning result. After that, the operation shown in this figure ends.

図１１は、本実施形態に係る情報処理装置１の動作の別の一例を示すフローチャートである。この図は、ＣＡＴＥ、ＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅ推定（算出）段階における情報処理装置１の動作を示す。 FIG. 11 is a flowchart showing another example of the operation of the information processing device 1 according to this embodiment. This figure shows the operation of the information processing device 1 in the CATE and Calibration Score estimation (calculation) stages.

（ステップＳ２００）学習部１１２は、顧客情報記憶部１３１から顧客情報を取得する。ＤＭ情報記憶部１３３からＤＭ情報を取得する。学習部１１２は、処理が終了すると、処理をステップＳ２０２に進める。ステップＳ２０２～ステップＳ２０４までは、個人ごと（ユーザごと）に行う。
（ステップＳ２０２）学習部１１２は、顧客情報に含まれる顧客の識別情報に対応する購買情報を、購買情報記憶部１３２から取得する。学習部１１２は、処理が終了すると、処理をステップＳ２０４に進める。 (Step S200) The learning unit 112 acquires customer information from the customer information storage unit 131. DM information is acquired from the DM information storage unit 133. Upon completion of the process, the learning unit 112 advances the process to step S202. Steps S202 to S204 are performed for each individual (for each user).
(Step S202) The learning unit 112 acquires purchase information corresponding to the customer identification information included in the customer information from the purchase information storage unit 132. Upon completion of the process, the learning unit 112 advances the process to step S204.

（ステップＳ２０４）学習部１１２は、学習結果と購買履歴とに基づいて、ＣＡＴＥおよびＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを予測する。学習部１１２は、学習結果と予測結果を記憶部１３０に記憶させる。学習部１１２は、全てのユーザについて、処理が終了すると、本図の動作を終了する。 (Step S204) The learning unit 112 predicts CATE and Calibration Score based on the learning result and purchase history. The learning unit 112 causes the storage unit 130 to store the learning results and prediction results. When the learning unit 112 completes the processing for all users, the learning unit 112 ends the operation shown in this figure.

図１２は、本実施形態に係る情報処理装置１の動作の別の一例を示すフローチャートである。
（ステップＳ３００）予測部１１３は、顧客情報に含まれる顧客の識別情報を顧客情報記憶部１３１から取得する。予測部１１３は、顧客の識別情報に対応する購買情報を購買情報記憶部１３２から取得する。予測部１１３は、ＤＭ情報を、ＤＭ情報記憶部１３３から取得する。予測部１１３は、処理が終了すると、処理をステップＳ３０２に進める。
（ステップＳ３０２）予測部１１３は、ＤＭ情報と購買情報都に基づいて、顧客ごとのＣＡＴＥを算出する。予測部１１３は、処理が終了すると、処理をステップＳ３０４に進める。 FIG. 12 is a flowchart showing another example of the operation of the information processing device 1 according to this embodiment.
(Step S300) The prediction unit 113 acquires customer identification information included in the customer information from the customer information storage unit 131. The prediction unit 113 acquires purchase information corresponding to the customer's identification information from the purchase information storage unit 132. The prediction unit 113 acquires DM information from the DM information storage unit 133. Upon completion of the process, the prediction unit 113 advances the process to step S302.
(Step S302) The prediction unit 113 calculates CATE for each customer based on the DM information and the purchase information. Upon completion of the process, the prediction unit 113 advances the process to step S304.

（ステップＳ３０４）予測部１１３は、ＤＭ情報と購買情報都に基づいて、顧客ごとのＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを算出する。予測部１１３は、処理が終了すると、処理をステップＳ３０６に進める。
（ステップＳ３０６）予測部１１３は、ＣＡＴＥおよびＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅに基づいて、ＡｄｊｕｓｔｅｄＣＡＴＥを算出する。情報処理装置１は、処理が終了すると、処理をステップＳ３０８に進める。 (Step S304) The prediction unit 113 calculates a Calibration Score for each customer based on the DM information and the purchase information. Upon completion of the process, the prediction unit 113 advances the process to step S306.
(Step S306) The prediction unit 113 calculates Adjusted CATE based on CATE and Calibration Score. When the processing is completed, the information processing device 1 advances the processing to step S308.

（ステップＳ３０８）予測部１１３は、顧客情報に含まれる識別情報と、算出したＡｄｊｕｔｅｄＣＡＴＥに基づいて、送付先を予測する。情報処理装置１は、処理が終了すると、処理をステップＳ３１０に進める。 (Step S308) The prediction unit 113 predicts the destination based on the identification information included in the customer information and the calculated Adjusted CATE. When the processing is completed, the information processing device 1 advances the processing to step S310.

（ステップＳ３１０）出力処理部１１４は、送付先情報記憶部１３４から送付先情報を取得し、出力画像を生成する。出力処理部１１４は、生成した画像を、出力部１５０に出力させる。情報処理装置１は、処理が終了すると、本図の処理を終了する。 (Step S310) The output processing unit 114 acquires destination information from the destination information storage unit 134 and generates an output image. The output processing unit 114 causes the output unit 150 to output the generated image. When the processing is completed, the information processing device 1 ends the processing shown in this figure.

このように、本実施形態に係る情報処理装置１は、顧客情報および購買情報を取得する取得部１１１と、キャンペーン介入によるユーザごとの条件付き介入効果を示すデータと、キャンペーン介入による将来の介入効果を示すデータとを学習データセットとして学習された学習モデルに、取得部１１１が取得した顧客情報および購買情報を入力することで、ユーザごとの条件付き介入効果の予測値と、将来の介入効果の予測値とを算出する学習部１１２と、将来の介入効果の予測値を、トレンドを用いて補正する予測部１１３と、を備える。 In this way, the information processing device 1 according to the present embodiment includes the acquisition unit 111 that acquires customer information and purchase information, data indicating conditional intervention effects for each user due to campaign intervention, and future intervention effects due to campaign intervention. By inputting the customer information and purchase information acquired by the acquisition unit 111 into a learning model that has been trained using the data indicating the learning data as a learning data set, the predicted value of the conditional intervention effect for each user and the predicted value of the future intervention effect can be calculated. The learning unit 112 includes a learning unit 112 that calculates a predicted value, and a prediction unit 113 that corrects a predicted value of a future intervention effect using a trend.

これにより、情報処理装置１は、将来の介入効果の予測値を、トレンドを用いて補正することで、ダイレクトマーケティングにおける意思決定の一要素であるターゲティングの最適化を行うことができる。このため、ユーザごとの条件付き介入効果の予測値と、将来の介入効果の予測値とに基づいて、広告効果が高い、つまり広告の内容への親和性が高いと見込まれる人物を抽出することできる。また、ダイレクトマーケティングにおける介入効果の予測精度を向上させることができる。 Thereby, the information processing device 1 can optimize targeting, which is one element of decision-making in direct marketing, by correcting the predicted value of future intervention effects using trends. Therefore, based on the predicted value of the conditional intervention effect for each user and the predicted value of the future intervention effect, it is possible to extract people who are expected to have a high advertising effect, that is, a high affinity with the content of the advertisement. can. Furthermore, the prediction accuracy of intervention effects in direct marketing can be improved.

また、本実施形態に係る情報処理装置１は、キャンペーン介入した期間と、キャンペーン介入しない期間とを少なくとも含む所定期間におけるユーザごとの条件付き介入効果を示すデータと、将来の介入効果を示すデータとを学習データセットとして学習する学習モデルによって、将来の介入効果の予測値を算出する学習部１１２と、キャンペーン介入しない期間における前記ユーザごとの条件付き介入効果を示すデータによる影響を抑制して前記将来の介入効果の予測値を補正するためのトレンドを算出する予測部１１３と、を備える。 Furthermore, the information processing device 1 according to the present embodiment stores data indicating a conditional intervention effect for each user in a predetermined period including at least a period in which the campaign intervenes and a period in which the campaign does not intervene, and data indicating the future intervention effect. A learning unit 112 calculates a predicted value of the future intervention effect using a learning model that learns as a learning data set, and a learning unit 112 that calculates the predicted value of the future intervention effect by suppressing the influence of data indicating the conditional intervention effect for each user during the period in which the campaign does not intervene. and a prediction unit 113 that calculates a trend for correcting the predicted value of the intervention effect.

これにより、情報処理装置１は、学習期間にキャンペーン介入がある期間とキャンペーン介入がない期間が混在している場合であっても、介入前後で平均を算出するための前提となる母集団の分布が変化を抑制することができる。そのため、母集団の分布が変化することによるバイアスがないトレンドを推定することができる。そのため、ダイレクトマーケティングにおける介入効果の予測精度を向上させることができる。 As a result, even if the learning period includes a period in which there is a campaign intervention and a period in which there is no campaign intervention, the information processing device 1 can calculate the distribution of the population, which is the premise for calculating the average before and after the intervention, even if there is a mixture of periods with campaign intervention and periods without campaign intervention in the learning period. can suppress change. Therefore, it is possible to estimate trends without bias due to changes in population distribution. Therefore, it is possible to improve the prediction accuracy of intervention effects in direct marketing.

次いで、上記で説明した内容についての検証結果について説明する。 Next, the verification results for the contents explained above will be explained.

ＣａｌｉｂｒａｔｅｄＣＡＴＥＥｓｔｉｍａｔｏｒを利用する前提としては、異質性の推定に足るだけの個体数と、時系列予測が行えるだけの期間に亘り利用可能なパネルデータ形式で、且つ、推定されたＵｐｌｉｆｔ値の精度を評価するために、ＲＣＴまたは反実アウトカムが計測可能な人工データを利用する必要がある。しかしながら、そのような理想的な条件で実施され、且つ検証に利用可能な形でオープンにされているデータは存在しない。そのため、本検証では人工データを用いて検証を行う。 The prerequisites for using the Calibrated CATE Estimator are that the number of individuals is sufficient to estimate heterogeneity, the panel data format is available over a period long enough to perform time-series prediction, and the accuracy of the estimated Uplift value is For evaluation, it is necessary to use RCTs or artificial data with measurable counterfactual outcomes. However, there is no data that has been conducted under such ideal conditions and that has been made available for verification. Therefore, in this verification, we will use artificial data.

一方で、理想的な条件下で生成された人工データでの時系列予測の精度が高かったとしても、現実のマーケティングにおいて、複雑なトレンドの変化を捉えることができるとは必ずしも限らない。このことから、本検証では、実際のマーケティングキャンペーンに対する反応データをトレンド成分のベースとして用い、そこに人工的に生成した個体レベルの異質性を加えた、実データと人工データとのハイブリッド形式のパネルデータを分析対象として用いる。これにより、人工データの長所である、反実アウトカムを観測できることからなる個体レベルでのＵｐｌｉｆｔ値の検証と、時系列成分の予測に関してある程度実データと同等の条件での検証することができる。 On the other hand, even if the accuracy of time-series prediction using artificial data generated under ideal conditions is high, it is not necessarily possible to capture changes in complex trends in actual marketing. For this reason, in this verification, we used response data to actual marketing campaigns as the base for trend components, and added artificially generated individual-level heterogeneity to it, creating a hybrid panel of real data and artificial data. Use the data as an analysis target. This makes it possible to verify the Uplift value at the individual level, which is an advantage of artificial data in that counterfactual outcomes can be observed, and to verify the prediction of time series components under conditions that are to some extent equivalent to real data.

時系列成分のベースとなるデータには、ＧｅｏｅｘｐｅｒｉｍｅｎｔｓＲｅｓｅａｒｃｈｖｅｒｓｉｏｎ１．０．３を用いる。図１３を参照して説明する。図１３は、本実施形態に係る検証データの一例を示す図である。この検証では、１００地域を売上の合計が概ね等しくなるようＴｒｅａｔｍｅｎｔ群とＣｏｎｔｒｏｌ群にランダムに割り振り、Ｐｒｅｔｅｓｔ、ｉｎｔｅｒｖｅｎｔｉｏｎ、ｃｏｏｌｄｏｗｎの３つの期間において各地域の日次の売上げを観測したものである。Ｔｒｅａｔｍｅｎｔ群のｉｎｔｅｒｖｅｎｔｉｏｎ期間のみオンライン広告が配信され、それ以外の対象、期間における介入はないものとする。当該データは、２０１５年１月１日から２０１５年４月７日という比較的長期間におけるＲＣＴのパネルデータである。当該データは、本検証において理想的なデータに近いが、１００地域しか存在しないために異質性をモデル化するためのサンプル数としては十分ではない。Ｐｒｅｔｅｓｔ期間においては、Ｔｒｅａｔｍｅｎｔ群とＣｏｎｔｒｏ群のＳａｌｅｓの動きが極めて高い精度で一致しているが、２月１５日から始まるＩｎｔｅｒｖｅｎｔｉｏｎ期間において、Ｔｒｅａｔｍｅｎｔ群のＳａｌｅｓが上昇し、Ｃｏｏｌｄｏｗｎ期間にまたＣｏｎｔｒｏｌ群と同等の水準に戻る。 Geoexperiments Research version 1.0.3 is used as the base data for the time series components. This will be explained with reference to FIG. FIG. 13 is a diagram showing an example of verification data according to this embodiment. In this verification, 100 regions were randomly assigned to a treatment group and a control group so that the total sales were approximately equal, and daily sales in each region were observed in three periods: pretest, intervention, and cooldown. It is assumed that online advertisements are distributed only during the intervention period of the treatment group, and there is no intervention for other targets or periods. The data is panel data from an RCT over a relatively long period from January 1, 2015 to April 7, 2015. This data is close to ideal data in this verification, but since there are only 100 regions, it is not a sufficient number of samples to model heterogeneity. During the Pretest period, the Sales movements of the Treatment group and Control group matched with extremely high accuracy, but during the Intervention period starting from February 15th, the Sales of the Treatment group increased, and during the Cooldown period, the Sales movements of the Control group and Return to parity.

また、Ｓａｌｅｓの水準が週次と思われる周期性を持ちながら、データ収集期間の後半に掛けて緩やかに上昇トレンドを持つ。このように時系列データにおいて、時間の経過とともに観察データの平均や分散が変化するデータは、非定常データと呼ばれ、分析の前提として独立同時分布（ｉ．ｉ．ｄ）を仮定している通常の機械学習等では分析が難しい。このため、分析前に差分変換、対数差分変換などを用いて定常性を持たせるよう変換することが多い。 Furthermore, while the level of Sales has a periodicity that appears to be weekly, it shows a gradual upward trend towards the latter half of the data collection period. In time-series data, data in which the mean and variance of observed data change over time is called non-stationary data, and independent and simultaneous distribution (i.i.d.) is assumed as a premise for analysis. It is difficult to analyze using normal machine learning. For this reason, before analysis, it is often transformed to provide stationarity using difference transformation, logarithmic difference transformation, or the like.

ＧｅｏｅｘｐｅｒｉｍｅｎｔｓＲｅｓｅａｒｃｈｖｅｒｓｉｏｎ１．０．３のデータのみでは、異質性の推定のために利用可能な個体が１００地域（１００通り）しか存在しない。ここでは、１００００人の顧客がランダムに１００地域のいずれかに属し、各顧客がランダム生成された共変量を持つようなデータを式（４）、式（５）により生成する。生成した各共変量の分布は、図１４～図１６の通りである。図１４～図１６は、本実施形態に係る各共変量の分布の一例を示す図である。 With only the data of Geoexperiments Research version 1.0.3, there are only 100 regions (100 types) of individuals that can be used for estimation of heterogeneity. Here, data in which 10,000 customers randomly belong to one of 100 regions and each customer has a randomly generated covariate is generated using equations (4) and (5). The distribution of each generated covariate is shown in FIGS. 14 to 16. FIGS. 14 to 16 are diagrams showing examples of distributions of each covariate according to this embodiment.

各顧客は、それぞれ潜在アウトカムＹ_（０）とＹ_（１）とを持ち、実際に観測されるのは介入条件と一致するＹ_（ｗ）＝Ｙと設定する。潜在アウトカムは、それぞれ式（６）、式（７）、式（８）のように生成する。なお、ｂａｓｅ_ｉｔは、図１３のデータにおいて個体ｉが所属するグループのその時点ｔにおけるＳａｌｅｓの値であり、時間的異質性を表現しているｗはｘ_ｉｔが実際に受けた介入の有無を表す。 Each customer has potential outcomes Y ₍₀₎ and Y ₍₁₎ , respectively, and what is actually observed is set to Y _(w) = Y, which matches the intervention condition. The latent outcomes are generated as shown in Equation (6), Equation (7), and Equation (8), respectively. Note that base _it is the value of Sales at time t of the group to which individual i belongs in the data in Figure 13, and w, which expresses temporal heterogeneity, indicates the presence or absence of the intervention that x _it actually received. represent.

各共変量と潜在アウトカムの関係性をＤＡＧ（ＤｉｒｅｃｔｅｄＡｃｙｃｌｉｃＧｒａｐｈ）で示した結果は、図１７のようになる。図１７は、本実施形態に係る各共変量と潜在アウトカムの関係性の一例を示す図である。 The relationship between each covariate and the latent outcome is shown in a DAG (Directed Acyclic Graph) as shown in FIG. 17. FIG. 17 is a diagram showing an example of the relationship between each covariate and a potential outcome according to this embodiment.

このＤＡＧから生成された各個体の真の介入効果τ_ｉｔの時系列プロットを図１８に示す。図１８は、本実施形態に係る各個体の真の介入効果の時系列プロットの一例を示す図である。
それぞれの個体ごとに異なる介入効果の水準（消費者異質性）があり、かつ曜日によって大きく介入効果が変動する個体から、ほぼ介入効果がない個体まで、様々な時間的異質性を持っている。 FIG. 18 shows a time series plot of the true intervention effect τ _it for each individual generated from this DAG. FIG. 18 is a diagram showing an example of a time-series plot of the true intervention effect for each individual according to the present embodiment.
Each individual has a different level of intervention effect (consumer heterogeneity), and there is a variety of temporal heterogeneity, ranging from individuals whose intervention effects vary greatly depending on the day of the week to individuals who have almost no intervention effect.

本検証で用いた学習・予測期間のデータ諸元は、図１９の通りである。図１９は、本実施形態に係る検証で用いた学習・予測期間のデータ諸元の一例を示す図である。
介入については、各個体が紐づく地域ごとにＲＣＴで割付けた場合と、ｘ_ｉｕを、介入を受ける確率（ｐｒｏｐｅｎｓｉｔｙｓｃｏｒｅ）とした場合と、のそれぞれで検証を行った。 The data specifications of the learning/prediction period used in this verification are as shown in FIG. FIG. 19 is a diagram showing an example of data specifications of the learning/prediction period used in the verification according to this embodiment.
Regarding the intervention, we verified two cases: one in which each individual was assigned to each region in an RCT, and the other in which x _iu was used as the probability of receiving the intervention (propensity score).

ＣａｌｉｂｒａｔｅｄＣＡＴＥＥｓｔｉｍａｔｏｒは、メタ的な学習アルゴリズムであり、ＣＡＴＥＥｓｔｉｍａｔｏｒとＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒとのそれぞれに、任意のアルゴリズムを利用することが可能である。
本検証では、実施形態と同様にＭｅｔａＬｅａｒｎｅｒの一種であるＸ－ＬｅａｒｎｅｒをＣＡＴＥＥｓｔｉｍａｔｏｒとして用いる。Ｘ－ＬｅａｒｎｅｒのアウトカムモデルおよびＣＡＴＥモデルには、Ｍｉｃｒｏｓｏｆｔ社（登録商標）が開発するＬｉｇｈｔｇｂｍのｖｅｒｓｉｏｎ３．２．１を用いた。 Calibrated CATE Estimator is a meta-learning algorithm, and any algorithm can be used for each of CATE Estimator and Timeseries Forecaster.
In this verification, as in the embodiment, X-Learner, which is a type of Meta Learner, is used as a CATE Estimator. Lightgbm version 3.2.1 developed by Microsoft (registered trademark) was used for the X-Learner outcome model and CATE model.

また、本検証では、Ｆａｃｅｂｏｏｋ社（登録商標）が開発するＰｒｏｐｈｅｔのｖｅｒｓｉｏｎ０．７．１をＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒとして用いた。なお、Ｐｒｏｐｈｅｔの学習に用いるデータは、Ｐｒｏｐｅｎｓｉｔｙｂａｓｅｄｓａｍｐｌｉｎｇ後のものを用いたが、Ｐｒｏｐｅｎｓｉｔｙｂａｓｅｄｓａｍｐｌｉｎｇ自体の有効性も検証をする。 Further, in this verification, version 0.7.1 of Prophet developed by Facebook Inc. (registered trademark) was used as Timeseries Forecaster. Note that although the data used for learning Prophet was obtained after propensity-based sampling, the effectiveness of propensity-based sampling itself will also be verified.

検証における評価は、以下の３つのパターンにおいて学習および予測を行い、介入効果の予測と介入効果の真値とを比較した次の３つの指標により評価を行った。 Evaluation in the verification was performed by learning and predicting in the following three patterns, and using the following three indicators to compare the predicted intervention effect with the true value of the intervention effect.

（１）ＭｅａｎＡｂｓｏｌｕｔｅＥｒｒｏｒ（ＭＡＥ）
ＭＡＥは、式（９）で表される。ＭＡＥは、誤差の絶対値の平均であるため、精度として解釈しやすい指標である。 (1) Mean Absolute Error (MAE)
MAE is expressed by equation (9). Since MAE is the average of the absolute values of errors, it is an index that can be easily interpreted as accuracy.

（２）ＭｅａｎＳｑｕａｒｅｄＥｒｒｏｒ（ＭＳＥ）
ＭＳＥは、式（１０）で表される。ＭＳＥは、誤差を二乗した値の平均であるため、ＭＡＥと比較して、外れ値の影響を受けやすい指標である。本検証では、大きく外したケースが他の試行と比べて極端に多くないかどうかを確認する意味合いで、補助的な指標として用いている。 (2) Mean Squared Error (MSE)
MSE is expressed by equation (10). Since MSE is the average of squared errors, it is an index that is more susceptible to outliers than MAE. In this verification, we use it as an auxiliary indicator to check whether the number of cases in which we missed the mark is extremely large compared to other trials.

（３）ＭｅａｎＥｒｒｏｒ（ＭＥ）
ＭＥは、式（１１）で表される。ＭＥは、通常あまり使われることがないが、今回のケースではＴｉｍｅｓｅｒｉｅｓＦｏｒｅｃａｓｔｅｒによって時間的異質性によって起こるバイアスを軽減できるかどうかがを評価することから、予測値にバイアスがどの程度残っているかを判断するための指標として用いる。誤差が正規分布に従っている場合、０に近いほどバイアスなく予測できていると解釈できる。 (3) Mean Error (ME)
ME is expressed by equation (11). ME is not usually used much, but in this case, we are evaluating whether the bias caused by temporal heterogeneity can be reduced by Timeseries Forecaster, and we are determining how much bias remains in the predicted values. Used as an indicator for If the error follows a normal distribution, it can be interpreted that the closer it is to 0, the more unbiased the prediction is.

ここではまず、ＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇの有効性を検証する。以下４通りのパターンでデータのサンプリングおよびＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅの算定を実施した。 First, the effectiveness of Propensity Based Sampling will be verified. Data sampling and Calibration Score calculation were performed using the following four patterns.

（１）ＧｒｏｕｎｄＴｒｕｔｈ
ＧｒｏｕｎｄＴｒｕｔｈは、シミュレーションデータでのみ利用可能な、全個体に対して、実際の観測値Ｙ_ｉｔではなく、介入が一切発生しなかった場合のデータＹ_{（０）ｉｔ}を集計したものである。 (1) Ground truth
The Ground Truth is not the actual observed value Y _it but is aggregation of data Y _{(0) it} when no intervention occurs for all individuals, which is available only as simulation data.

（２）ＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇ
ＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇは、実施形態と同様にしてサンプリングされたデータである。 (2) Propensity Based Sampling
Propensity Based Sampling is data sampled in the same manner as in the embodiment.

（３）ＡｌｌＳａｍｐｌｅｓ
ＡｌｌＳａｍｐｌｅｓは、介入を受けた／受けなかったかを問わず、全てのデータを抽出したものである。 (3) All Samples
All Samples is an extraction of all data regardless of whether the intervention was received or not.

（４）ＵｎｔｒｅａｔｅｄＳａｍｐｌｅｓ
ＵｎｔｒｅａｔｅｄＳａｍｐｌｅｓは、介入ありの期間／なしの期間を問わず、介入がなかったＹ_ｉｔの全てを抽出したデータである。 (4) Untreated Samples
Untreated Samples is data extracted from all Y _it in which there was no intervention, regardless of the period with or without intervention.

（１）～（４）のそれぞれのＰｏｌｉｃｙでサンプリングされたデータを可視化した。図２０は、本実施形態に係る各Ｐｏｌｉｃｙでサンプリングされたデータの一例を示す図である。
図示する例では、ＡｌｌＳａｍｐｌｅｓは、非介入期間においてはＧｒｏｕｎｄＴｒｕｔｈと完全に一致しているものの、介入期間に入るとＧｒｏｕｎｄＴｒｕｔｈに比べて上方乖離する。これは、単純に介入の効果によって実際のトレンドよりも強い数字が観測されたために生じている。
一方、ＵｎｔｒｅａｔｅｄＳａｍｐｌｅｓは、同じく非介入期間においてはＧｒｏｕｎｄＴｒｕｔｈと完全に一致しているものの、介入期間に入るとＧｒｏｕｎｄＴｒｕｔｈに比べて下方乖離する。これは、今回の検証データにおいて、売上期待値が高い個体ほど介入を受けやすいような傾向があるため、介入期間において介入を受けていない個体の売上平均は、全サンプルの平均である介入前の売上平均を下回ってしまうことから生じるバイアスである。 Data sampled under each of Policies (1) to (4) was visualized. FIG. 20 is a diagram showing an example of data sampled by each policy according to the present embodiment.
In the illustrated example, All Samples completely matches the Ground Truth during the non-intervention period, but deviates upward compared to the Ground Truth when entering the intervention period. This is simply due to the effect of the intervention, which causes the observed numbers to be stronger than the actual trend.
On the other hand, although Untreated Samples completely agrees with the Ground Truth during the non-intervention period, it deviates downward from the Ground Truth when entering the intervention period. This is because in the current verification data, individuals with higher sales expectations tend to be more likely to receive intervention, so the average sales of individuals who did not receive intervention during the intervention period was the average of all samples before the intervention. This is a bias that arises from falling below the sales average.

ＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇのデータは、同様の理由から非介入期間、介入期間のいずれもＧｒｏｕｎｄＴｒｕｔｈに比べて下方乖離する。しかしながら、上記とは異なり、介入期間に入った際にもトレンドに変化がなく、可視化の結果からは安定したトレンドが抽出可能となる。 For the same reason, the data of Propensity Based Sampling deviates downward from the Ground Truth in both the non-intervention period and the intervention period. However, unlike the above, there is no change in the trend even when the intervention period begins, and a stable trend can be extracted from the visualization results.

図２１は、本実施形態に係る各Ｐｏｌｉｃｙでのトレンドの一例を示す図である。
図２１の通り、（３）及び（４）のデータにより抽出されたトレンドは、真のデータから抽出されたトレンド１と比較して、上方および下方乖離する。（２）のデータから抽出されたトレンドは、ここでも真のトレンドと並行なトレンドを構成している。 FIG. 21 is a diagram showing an example of trends in each policy according to the present embodiment.
As shown in FIG. 21, the trends extracted using the data in (3) and (4) deviate upward and downward compared to trend 1 extracted from the true data. The trend extracted from the data in (2) also constitutes a trend parallel to the true trend.

図２２は、本実施形態に係るサンプリングの検証結果の一例を示す図である。
上記のように、それぞれ抽出されたトレンドに基づいて、学習期間および予測期間でＣａｌｉｂｒａｔｉｏｎＳｃｏｒｅを計算すると、ＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇが最もＧｒｏｕｎｄＴｒｕｔｈに近く、誤差にして０．００１７と、他手法に比べて１０分の１以下の誤差を達成した。 FIG. 22 is a diagram illustrating an example of sampling verification results according to this embodiment.
As mentioned above, when calculating the Calibration Score in the learning period and prediction period based on the extracted trends, Propensity Based Sampling is closest to the Ground Truth, with an error of 0.0017, which is 10 minutes faster than other methods. An error of less than 1 was achieved.

このことから、ＰｒｏｐｅｎｓｉｔｙＢａｓｅｄＳａｍｐｌｉｎｇは今回比較対象とした（１）、（３）、（４）の他のサンプリングと比べて、介入期間と非介入期間が入り交じるような状況におけるトレンド推定に効果的な手法である。 From this, Propensity Based Sampling is more effective in estimating trends in situations where intervention periods and non-intervention periods intersect, compared to other samplings (1), (3), and (4) that were compared this time. This is a great method.

次いで、以下の検証の前提の２通りとＣａｌｉｂｒａｔｉｏｎの３通りとの組み合わせによる６通りのパターンにより予測期間における個人別の介入効果を推定し、その結果を正の値と比較した。 Next, the intervention effect for each individual in the prediction period was estimated using six patterns of combinations of the following two verification assumptions and three calibrations, and the results were compared with positive values.

検証の前提
（１）完全ランダムな介入割付であるＲＴＣデータ
（２）各個体に対する割付確率だけが既知である準実験データ
Ｃａｌｉｂｒａｔｉｏｎ方式
（１）予測結果をＣａｌｉｂｒａｔｉｏｎする（Ｐｏｓｔ－Ｔｒａｉｎ）
（２）学習データをＣａｌｉｂｒａｔｉｏｎする（Ｐｒｅ－Ｔｒａｉｎ）
（３）Ｃａｌｉｂｒａｔｉｏｎを行わない（ＮｏＣａｌｉｂｒａｔｉｏｎ） Verification assumptions (1) RTC data with completely random intervention allocation (2) Quasi-experimental data where only the allocation probability for each individual is known Calibration method (1) Calibrate the predicted results (Post-Train)
(2) Calibrate the learning data (Pre-Train)
(3) No Calibration

図２３は、本実施形態に係るＣａｌｉｂｒａｔｉｏｎの検証結果の一例を示す図である。
図示するように、検証の前提におけるＲＣＴデータ、準実験データのいずれにおいても、ＣａｌｉｂｒａｔｅｄＣＡＴＥＥｓｔｉｍａｔｏｒが、Ｃａｌｉｂｒａｔｉｏｎを行わない通常のモデルＮｏＣａｌｉｂｒａｔｉｏｎや、学習データをＣａｌｉｂｒａｔｉｏｎするＰｒｅ－ｔｒａｉｎと比べて精度が向上していることが示されている。 FIG. 23 is a diagram showing an example of a verification result of Calibration according to this embodiment.
As shown in the figure, Calibrated CATE Estimator has improved accuracy compared to the normal model No Calibration that does not perform calibration and Pre-train that calibrates learning data in both RCT data and quasi-experimental data under the verification premise. It is shown that

このように、上記実施形態で説明した内容の有効性検証のために人工データを用いて有効性を検証した結果、特徴量と学習・予測データの指定のみで精度高い消費者異質性の推定が可能という機械学習ベースの利点を残しながらも、目的変数の非定常性から生じる予測値のバイアスを、通常の機械学習ベースと比較して抑えることができ、また、高精度な予測が実現できることを確認した。 In this way, as a result of verifying the effectiveness of the content explained in the above embodiment using artificial data, it was found that consumer heterogeneity can be estimated with high accuracy just by specifying feature values and training/prediction data. While retaining the advantage of machine learning-based methods that it is possible, we have demonstrated that bias in predicted values caused by non-stationarity of the objective variable can be suppressed compared to normal machine learning-based methods, and that highly accurate predictions can be achieved. confirmed.

以上、図面を参照してこの発明の一実施形態について詳しく説明してきたが、具体的な構成は上述のものに限られることはなく、この発明の要旨を逸脱しない範囲内において様々な設計変更等をすることが可能である。 Although one embodiment of the present invention has been described above in detail with reference to the drawings, the specific configuration is not limited to that described above, and various design changes etc. may be made without departing from the gist of the present invention. It is possible to

例えば、上述した各実施形態では、情報処理装置１が学習、予測する例を説明したが、これには限られない。例えば、情報処理装置１のうち、学習を行う学習装置と、予測を行う予測装置とが個別のものであってもよい。 For example, in each of the embodiments described above, an example in which the information processing device 1 learns and predicts is explained, but the present invention is not limited to this. For example, in the information processing device 1, a learning device that performs learning and a prediction device that performs prediction may be separate devices.

また、上述した各実施形態における情報処理装置１の一部、例えば、制御部１１０などをコンピュータで実現するようにしてもよい。その場合、この機能を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行することによって実現してもよい。なお、ここでいう「コンピュータシステム」とは、情報処理装置１に内蔵されたコンピュータシステムであって、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）や周辺機器等のハードウェアを含むものとする。 Furthermore, a part of the information processing device 1 in each of the embodiments described above, for example, the control unit 110, may be realized by a computer. In that case, a program for realizing this function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read into a computer system and executed. Note that the "computer system" herein refers to a computer system built into the information processing device 1, and includes an OS (Operating System) and hardware such as peripheral devices.

また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ－ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間、動的にプログラムを保持するもの、その場合のサーバ装置やクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含んでもよい。また上記プログラムは、前述した機能の一部を実現するためのものであってもよく、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであってもよい。 Furthermore, the term "computer-readable recording medium" refers to portable media such as flexible disks, magneto-optical disks, ROMs, and CD-ROMs, and storage devices such as hard disks built into computer systems. Furthermore, a "computer-readable recording medium" refers to a medium that dynamically stores a program for a short period of time, such as a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, it may also include a device that retains a program for a certain period of time, such as a volatile memory inside a computer system that is a server device or a client. Further, the above-mentioned program may be one for realizing a part of the above-mentioned functions, or may be one that can realize the above-mentioned functions in combination with a program already recorded in the computer system.

また、上述した実施形態における情報処理装置１の一部、または全部を、ＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｉｏｎ）等の集積回路として実現してもよい。サーバ装置２０（２０ａ）の各機能部は個別にプロセッサ化してもよいし、一部、または全部を集積してプロセッサ化してもよい。また、集積回路化の手法はＬＳＩに限らず専用回路、または汎用プロセッサで実現してもよい。また、半導体技術の進歩によりＬＳＩに代替する集積回路化の技術が出現した場合、当該技術による集積回路を用いてもよい。 Further, part or all of the information processing device 1 in the embodiment described above may be realized as an integrated circuit such as an LSI (Large Scale Integration). Each functional unit of the server device 20 (20a) may be made into a processor individually, or some or all of them may be integrated into a processor. Further, the method of circuit integration is not limited to LSI, but may be implemented using a dedicated circuit or a general-purpose processor. Further, if an integrated circuit technology that replaces LSI emerges due to advances in semiconductor technology, an integrated circuit based on this technology may be used.

Ｓｙｓ情報処理システム
１情報処理装置
Ｂ企業
Ｕ１、Ｕ２ユーザ
１１０制御部
１１１取得部
１１２学習部
１１３予測部
１１４出力処理部
１３０記憶部
１３１顧客情報記憶部
１３２購買情報記憶部
１３３ＤＭ情報記憶部
１３４送付先情報記憶部
１３５学習結果記憶部
１４０入力部
１５０出力部
１１ＣＰＵ
１２ドライブ部
１３記憶媒体
１４入力部
１５出力部
１６ＲＯＭ
１７ＲＡＭ
１８補助記憶部
１９インターフェース部 Sys Information processing system 1 Information processing device B Company U1, U2 User 110 Control unit 111 Acquisition unit 112 Learning unit 113 Prediction unit 114 Output processing unit 130 Storage unit 131 Customer information storage unit 132 Purchase information storage unit 133 DM information storage unit 134 Sending Ahead information storage unit 135 Learning result storage unit 140 Input unit 150 Output unit 11 CPU
12 Drive section 13 Storage medium 14 Input section 15 Output section 16 ROM
17 RAM
18 Auxiliary storage section 19 Interface section

Claims

an acquisition unit that acquires customer information and purchasing information;
The customer information and purchase information acquired by the acquisition unit are applied to a learning model that has been trained using data indicating a conditional intervention effect for each user due to campaign intervention and data indicating future intervention effects due to campaign intervention as a learning data set. a learning unit that calculates a predicted value of a conditional intervention effect and a predicted value of a future intervention effect for each user by inputting the information;
a prediction unit that corrects the predicted value of the future intervention effect using a trend;
An information processing device comprising:

A learning model that learns data indicating the conditional intervention effect for each user during a predetermined period including at least the period in which the campaign intervened and the period in which the campaign did not intervene, and data indicating the future intervention effect as a learning dataset, a learning department that calculates the predicted value of the intervention effect;
a prediction unit that calculates a trend for correcting the predicted value of the future intervention effect by suppressing the influence of data indicating the conditional intervention effect for each user during the period in which the campaign is not intervened;
An information processing device comprising:

The prediction unit changes the number of samples of data indicating the conditional intervention effect for each user in the period in which the campaign is not intervened, and the number of samples of data indicating the conditional intervention effect for each user in the period in which the campaign intervenes. to suppress the said influence,
The information processing device according to claim 2.

The trend is a predicted value obtained by modeling similar results,
The information processing device according to claim 1 or claim 2.

The trend is a predicted value obtained by modeling indicators showing similar trends;
The information processing device according to claim 1 or claim 2.

an acquisition unit that acquires customer information and purchasing information;
The customer information and purchase information acquired by the acquisition unit are applied to a learning model that has been trained using data indicating a conditional intervention effect for each user due to campaign intervention and data indicating future intervention effects due to campaign intervention as a learning data set. a learning unit that calculates a predicted value of a conditional intervention effect and a predicted value of a future intervention effect for each user by inputting the information;
a prediction unit that corrects the predicted value of the future intervention effect using a trend;
An information processing system equipped with.

A learning model that learns data indicating the conditional intervention effect for each user during a predetermined period including at least the period in which the campaign intervened and the period in which the campaign did not intervene, and data indicating the future intervention effect as a learning dataset, a learning department that calculates the predicted value of the intervention effect;
a prediction unit that calculates a trend for correcting the predicted value of the future intervention effect by suppressing the influence of data indicating the conditional intervention effect for each user during the period in which the campaign is not intervened;
An information processing system equipped with.

The computer of the information processing device is
an acquisition process for acquiring customer information and purchasing information;
The customer information and purchase information acquired through the acquisition process are added to a learning model that has been trained using data indicating the conditional intervention effect for each user due to the campaign intervention and data indicating the future intervention effect due to the campaign intervention as a learning data set. A learning process that calculates a predicted value of a conditional intervention effect and a predicted value of a future intervention effect for each user by inputting
a prediction process of correcting the predicted value of the future intervention effect using a trend;
An information processing method having

The computer of the information processing device is
A learning model that learns data indicating the conditional intervention effect for each user during a predetermined period including at least the period in which the campaign intervened and the period in which the campaign did not intervene, and data indicating the future intervention effect as a learning dataset, a learning process that calculates a predicted value of the intervention effect;
a prediction step of calculating a trend for correcting the predicted value of the future intervention effect by suppressing the influence of data indicating the conditional intervention effect for each user during the period in which the campaign does not intervene;
An information processing method having

In the computer of the information processing device,
an acquisition step of acquiring customer information and purchasing information;
The customer information and purchase information obtained in the acquisition step are added to the learning model that has been trained using data indicating the conditional intervention effect for each user due to the campaign intervention and data indicating the future intervention effect due to the campaign intervention as a learning data set. a learning step of calculating a predicted value of a conditional intervention effect and a predicted value of a future intervention effect for each user by inputting the information;
a prediction step of correcting the predicted value of the future intervention effect using a trend;
A program to run.

In the computer of the information processing device,
A learning model that learns data indicating the conditional intervention effect for each user during a predetermined period including at least the period in which the campaign intervened and the period in which the campaign did not intervene, and data indicating the future intervention effect as a learning dataset, a learning step of calculating a predicted value of the intervention effect;
a prediction step of calculating a trend for correcting the predicted value of the future intervention effect by suppressing the influence of data indicating the conditional intervention effect for each user during the period in which the campaign is not intervened;
A program to run.