JP2016071881A

JP2016071881A - Item recommendation system

Info

Publication number: JP2016071881A
Application number: JP2015185334A
Authority: JP
Inventors: 祐樹渡辺; Yuki Watanabe; 卓岡本; Taku Okamoto
Original assignee: Colorful Board Inc; Chiba University NUC
Current assignee: Colorful Board Inc; Chiba University NUC
Priority date: 2014-09-22
Filing date: 2015-09-18
Publication date: 2016-05-09

Abstract

PROBLEM TO BE SOLVED: To recommend an item which is more adaptable to the taste of a user.SOLUTION: An item featured value vector calculation part 24 extracts item information from an item information table 32, and calculates an item featured value vector. A prediction score calculation part 26 derives an appropriate function F on the basis of the score of an actual score table 38 and the corresponding item featured value vector for each user, and calculates prediction scores for all item information by applying the derived approximate function F. A Web server 30 presents the item information having the highest prediction score among item information which has not been presented to a user terminal 12. A user inputs the evaluation of likes/dislikes for the item information presented to the user terminal 12, and the Web server 30 stores the evaluation result as a score in the actual score table 38. Thus, it is possible to recommend an item which is more adaptable to the taste of the user.SELECTED DRAWING: Figure 1

Description

本発明は、アイテムリコメンドシステムに関し、詳しくは、ユーザ端末にアイテムをリコメンドするアイテムリコメンドシステムに関する。 The present invention relates to an item recommendation system, and more particularly to an item recommendation system for recommending an item to a user terminal.

従来、この種のアイテムリコメンドシステムでは、たとえば、下記の特許文献１および特許文献２の背景技術で説明されているように、ユーザ自身或いはユーザと似た属性をもつ他者の過去の行動履歴に基づいて協調フィルタリングやベイジアンネットワークなどの技術を用いてアイテムをリコメンデーションしている。 Conventionally, in this type of item recommendation system, for example, as described in the background art of Patent Document 1 and Patent Document 2 below, the past action history of the user himself or others having similar attributes to the user is used. Based on this, we recommend items using technologies such as collaborative filtering and Bayesian networks.

また、コンテンツベースフィルタリングとよばれる方法も提案されている。このコンテンツベースフィルタリングでは、事前にアイテムがもつ複数の特徴を数値的に解析しておき、これをアイテム特徴量ベクトルとして保存する一方で、いくつかのアイテムに対してユーザから「好き」か「好きではない（嫌い）」の２値データとしての評価データを収集する。続いて、ユーザが好むアイテム群を集め、その特徴量ベクトルなどの平均を取るなどの方法を用いて、ユーザの嗜好を代表して表現するユーザ特徴量ベクトルを算出する。そして、アイテム特徴量ベクトルとユーザ特徴量ベクトルの類似度を算出し、類似度が大きい順にアイテムを推薦する。類似度の算出には、コサイン類似度とよばれる類似度を用いることが一般的であるが、類似度算出にニューラルネットワーク等の機械学習器を用いることも可能である。また、類似度を算出することなく、好きか好きでないかを判別するために、サポートベクターマシン等の識別器を用いることもできる。 A method called content-based filtering has also been proposed. In this content-based filtering, a number of features of an item are numerically analyzed in advance and stored as an item feature quantity vector, while the user likes or dislikes some items. Evaluation data is collected as binary data of “not (dislike)”. Subsequently, a group of items preferred by the user is collected, and a user feature vector representing the user's preference is calculated using a method such as averaging the feature vectors. Then, the similarity between the item feature vector and the user feature vector is calculated, and items are recommended in descending order of similarity. The similarity is generally calculated using a similarity called cosine similarity, but a machine learning device such as a neural network can also be used to calculate the similarity. In addition, a discriminator such as a support vector machine can be used to discriminate whether the user likes or dislikes without calculating the similarity.

特開２００７−４１９７１号公報JP 2007-41971 A 特開２０１２−１９００６１号公報JP 2012-190061 A

前述したユーザ自身或いはユーザと似た属性をもつ他者の過去の行動履歴に基づいたリコメンデーションは、膨大なユーザ情報を事前に収集できることを前提としているため、膨大なユーザ情報を事前に収集することができない場合には信頼性の高いリコメンデーションを実現することが困難となる。また、対象ユーザから得た直接的な情報ではなく行動履歴を経由して結びつく他者の嗜好情報に基づいて対象ユーザの嗜好を予測しているため、対象ユーザが必要とするリコメンデーションを実現できない場合が生じる。 The above-mentioned recommendation based on the past behavior history of the user himself / herself or others having similar attributes is based on the premise that a large amount of user information can be collected in advance, so a large amount of user information is collected in advance. If this is not possible, it will be difficult to achieve highly reliable recommendations. In addition, because the target user's preference is predicted based on the preference information of others connected via the action history instead of direct information obtained from the target user, the recommendation required by the target user cannot be realized. Cases arise.

また、コンテンツベースフィルタリングでは、一般に「好き」か「好きでない」の２値識別を行なうので、多段階評価（４段階評価や５段階評価など）を行なうシステムとの親和性が良くない。仮に多段階評価結果を用いてコンテンツベースフィルタリングを行なう場合には、評価１を代表するユーザ特徴量ベクトルに対する類似度、評価２を代表するユーザ特徴量ベクトルに対する類似度といったように、評価基準ごとに類似度、ないしはこれに得点に応じた重みを勘案した類似度を算出して、一番大きい類似度をもつアイテムを推薦するという仕組みを考えることも可能であるが、ユーザごとに異なる特徴量に対する重要度を十分に表現できない。 In addition, since content-based filtering generally performs binary identification of “like” or “not like”, the compatibility with a system that performs multi-stage evaluation (such as 4-stage evaluation or 5-stage evaluation) is not good. If content-based filtering is performed using the multi-step evaluation result, the similarity to the user feature vector representing the evaluation 1 and the similarity to the user feature vector representing the evaluation 2 are set for each evaluation criterion. It is possible to consider a mechanism to calculate the degree of similarity or the degree of similarity considering the weight according to the score, and recommend the item with the largest degree of similarity. The importance cannot be expressed sufficiently.

本発明のアイテムリコメンドシステムは、ユーザの嗜好により適合するアイテムをリコメンドすることを主目的とする。 The item recommendation system of the present invention is mainly intended to recommend items that are more suitable for the user's preference.

本発明のアイテムリコメンドシステムは、上述の主目的を達成するために以下の手段を採った。 The item recommendation system of the present invention employs the following means in order to achieve the main object described above.

本発明のアイテムリコメンドシステムは、
ユーザ端末にアイテムをリコメンドするアイテムリコメンドシステムであって、
ＥＣサイトからアイテム情報を取得し、該アイテム情報とアイテム識別子とを関連つけてアイテム情報テーブルに保存するアイテム情報取得手段と、
前記アイテム情報テーブルのアイテム情報に基づいて予め定めたアイテム特徴量に対するアイテム特徴量ベクトルを計算し、該アイテム特徴量ベクトルとアイテム識別子とを関連つけてアイテム特徴量ベクトルテーブルに保存するアイテム特徴量ベクトル計算手段と、
前記アイテム情報テーブルにおけるアイテム情報のユーザ端末への提示に対して入力された多段階の評価値としてのスコアを取得し、該スコアとアイテム識別子とユーザ識別子とを関連つけて実スコアテーブルに保存する実スコア取得手段と、
ユーザ毎に、前記実スコアテーブルのスコアと該スコアに対応する前記アイテム特徴量ベクトルテーブルにおけるアイテム特徴量ベクトルとに基づいてアイテム特徴量ベクトルの入力に対してスコアを計算する近似関数を導出すると共に該近似関数に対して前記アイテム特徴量ベクトルテーブルにおける全アイテム特徴量ベクトルを適用して全アイテム情報に対する予測スコアを計算し、該予測スコアとアイテム識別子とユーザ識別子とを関連つけて予測スコアテーブルに保存する予測スコア計算手段と、
ユーザ端末からのリクエストに対して前記アイテム情報テーブルの全アイテム情報のうちユーザ端末に提示していないアイテム情報であって、少なくとも該ユーザ端末のユーザの予測スコアが最も高いアイテム情報を提示する提示手段と、
を備えることを要旨とする。 The item recommendation system of the present invention is:
An item recommendation system for recommending items to a user terminal,
Item information acquisition means for acquiring item information from an EC site and associating the item information with an item identifier and storing them in an item information table;
Item feature amount vector for calculating an item feature amount vector for a predetermined item feature amount based on item information in the item information table, and storing the item feature amount vector in association with the item identifier in the item feature amount vector table Calculation means;
A score as a multi-stage evaluation value input for presentation of item information in the item information table to the user terminal is acquired, and the score, the item identifier, and the user identifier are associated and stored in the actual score table. Real score acquisition means;
For each user, an approximate function for calculating a score for the input of the item feature quantity vector is derived based on the score of the actual score table and the item feature quantity vector in the item feature quantity vector table corresponding to the score. A prediction score for all item information is calculated by applying all item feature vectors in the item feature vector table to the approximate function, and the prediction score, the item identifier, and the user identifier are associated with each other in the prediction score table. A predicted score calculation means to be stored;
Presenting means for presenting item information that is not presented to the user terminal among all item information in the item information table in response to a request from the user terminal and has the highest predicted user score of the user terminal at least When,
It is a summary to provide.

この本発明のアイテムリコメンドシステムでは、アイテム情報取得手段が、ＥＣ（electronic commerce）サイトからアイテム情報を取得し、アイテム情報とアイテム識別子とを関連つけてアイテム情報テーブルに保存する。アイテム情報としては、例えば、アイテム名，ＥＣサイトのショップ名，ブランド名，アイテム画像のＵＲＬ（Uniform Resource Locator），カテゴリ，性別，定価，実売価格，実売価格の割引率，展開されているサイズ，素材の情報，原産国，紹介文などを挙げることができる。アイテム特徴量ベクトル計算手段は、アイテム情報テーブルのアイテム情報に基づいて予め定めたアイテム特徴量に対するアイテム特徴量ベクトルを計算し、アイテム特徴量ベクトルとアイテム識別子とを関連つけてアイテム特徴量ベクトルテーブルに保存する。ここで、アイテム特徴量としては、例えば、アイテム画像に含まれる色やその占有率，ブランド，カテゴリ，同一カテゴリでの実売価格，価格の割引率などを挙げることができる。アイテム特徴量ベクトルの成分としては、上述のアイテム特徴量であり、アイテム画像に含まれる代表色の色ベクトルや代表色の占有率，ブランドのＮ次空間における座標（ブランド座標），カテゴリのＮ次空間における座標（カテゴリ座標），同一カテゴリでの実売価格の偏差値，実売価格の割引率などの要素を挙げることができる。実スコア取得手段は、アイテム情報テーブルにおけるアイテム情報のユーザ端末への提示に対して入力された多段階の評価値としてのスコアを取得し、スコアとアイテム識別子とユーザ識別子とを関連つけて実スコアテーブルに保存する。ここで、多段階としては、２段階（２値）も含むが、基本的には３段階以上を意味している。予測スコア計算手段は、ユーザ毎に、実スコアテーブルのスコアとこのスコアに対応するアイテム特徴量ベクトルテーブルにおけるアイテム特徴量ベクトルとに基づいてアイテム特徴量ベクトルの入力に対してスコアを計算する近似関数を導出すると共に近似関数に対してアイテム特徴量ベクトルテーブルにおける全アイテム特徴量ベクトルを適用して全アイテム情報に対する予測スコアを計算する。そして、予測スコアとアイテム識別子とユーザ識別子とを関連つけて予測スコアテーブルに保存する。提示手段は、ユーザ端末からのリクエストに対してアイテム情報テーブルの全アイテム情報のうちユーザ端末に提示していないアイテム情報であって、少なくともユーザ端末のユーザの予測スコアが最も高いアイテム情報を提示する。 In the item recommendation system of the present invention, the item information acquisition means acquires item information from an EC (electronic commerce) site, associates the item information with the item identifier, and stores them in the item information table. Item information includes, for example, item name, EC site shop name, brand name, item image URL (Uniform Resource Locator), category, gender, list price, actual selling price, discount price of selling price, and so on. You can list size, material information, country of origin, and introductory text. The item feature quantity vector calculating means calculates an item feature quantity vector for a predetermined item feature quantity based on the item information in the item information table, and associates the item feature quantity vector with the item identifier in the item feature quantity vector table. save. Here, examples of the item feature amount include a color included in the item image, an occupation ratio thereof, a brand, a category, an actual selling price in the same category, a price discount rate, and the like. The component of the item feature vector is the item feature described above, the color vector of the representative color included in the item image, the occupancy of the representative color, the coordinates in the Nth-order space of the brand (brand coordinates), the Nth order of the category Factors such as coordinates in space (category coordinates), deviation value of actual sales price in the same category, discount rate of actual sales price, and the like can be mentioned. The actual score acquisition means acquires a score as a multi-stage evaluation value input for the presentation of the item information in the item information table to the user terminal, and associates the score, the item identifier, and the user identifier with the actual score Save to table. Here, the multistage includes two stages (binary values), but basically means three or more stages. The predicted score calculation means is an approximation function that calculates a score for an input of an item feature vector based on the score of the actual score table and the item feature vector in the item feature vector table corresponding to the score for each user. And a prediction score for all item information is calculated by applying all item feature vectors in the item feature vector table to the approximation function. Then, the prediction score, the item identifier, and the user identifier are associated with each other and stored in the prediction score table. The presenting means presents item information that is not presented to the user terminal among all item information in the item information table in response to a request from the user terminal, and that has at least the highest user prediction score of the user terminal. .

このように、ユーザ毎に実スコアテーブルのスコアと対応するアイテム特徴量ベクトルとに基づいて近似関数を導出し、導出した近似関数を適用して全アイテム情報に対して予測スコアを計算し、ユーザ端末に対してそれまでに表示されていないアイテム情報のうち最も高い予測スコアのアイテム情報を提示するから、ユーザの嗜好により適合したアイテムをリコメンドすることができる。特に、スコアとして３段階以上の多段階の評価値をもちいることにより、よりユーザの嗜好に適合したアイテムをリコメンドすることができる。 In this way, an approximate function is derived for each user based on the score of the actual score table and the corresponding item feature vector, and the predicted score is calculated for all item information by applying the derived approximate function, and the user Since the item information having the highest prediction score among the item information that has not been displayed so far is presented to the terminal, it is possible to recommend an item that is more suitable for the user's preference. In particular, by using a multi-stage evaluation value of three or more levels as a score, it is possible to recommend an item more suitable for the user's preference.

こうした本発明のアイテムリコメンドシステムにおいて、前記実スコア取得手段は、前記提示手段によってユーザ端末に提示されたアイテム情報に対して入力されたスコアを取得する手段であるものとすることもできる。こうすれば、それまでに表示されていないアイテム情報のうち最も高い予測スコアのアイテム情報に対して実スコアを取得することができる。この場合、前記提示手段は、前記実スコア取得手段がスコアを取得したときにはユーザ端末に提示していないアイテム情報のうち該スコアに対応するアイテム情報の次に高い予測スコアのアイテム情報を提示する手段であるものとすることもできる。こうすれば、スコアの取得に連動して次に予測スコアの高いアイテム情報を提示することができる。 In such an item recommendation system of the present invention, the actual score acquisition means may be means for acquiring a score input to the item information presented to the user terminal by the presentation means. If it carries out like this, an actual score can be acquired with respect to the item information of the highest prediction score among the item information which has not been displayed so far. In this case, the presenting means presents the item information having the next highest predicted score after the item information corresponding to the item information that is not presented to the user terminal when the actual score obtaining means obtains the score. It can also be assumed. In this way, item information with the next highest predicted score can be presented in conjunction with the acquisition of the score.

また、本発明のアイテムリコメンドシステムにおいて、前記スコアは、アイテム情報のうちのアイテム画像の提示に対して他の情報の提示を求めることなく直ちに入力された適否と前記他の情報の提示後に入力された適否とが段階的な数値として定められた評価値であるものとすることもできる。例えば、アイテム情報のうちのアイテム画像の提示に対して他の情報の提示を求めることなく直ちに入力された適および否として４および１、他の情報の提示後に入力された適および否として３および２とすれば１〜４の４段階の数値の評価値となる。この場合、前記スコアは、アイテム情報の提示の程度に応じた段階的な数値として定められた評価値であるものとすることもできるし、前記スコアは、アイテム情報の提示の程度と提示開始から入力までの時間に応じた段階的な数値として定められた評価値であるものとすることもできる。こうすれば、スコアを５段階以上の数値の評価値とすることができ、ユーザの嗜好をより適切に反映した予測スコアを設定することができる。この結果、ユーザの嗜好により適正なアイテムをリコメンドすることができる。 Further, in the item recommendation system of the present invention, the score is input immediately after the presentation of the suitability and the other information that are input immediately without requesting the presentation of other information with respect to the presentation of the item image of the item information. It can be assumed that the suitability is an evaluation value determined as a stepwise numerical value. For example, for the presentation of the item image of the item information, 4 and 1 as the right and wrong input immediately without requesting the presentation of other information, 3 and as the right and wrong input after the presentation of other information If it is 2, it will be a four-stage numerical evaluation value of 1-4. In this case, the score may be an evaluation value determined as a stepwise numerical value according to the degree of presentation of item information, and the score is determined from the degree of presentation of item information and the start of presentation. It can also be an evaluation value defined as a stepwise numerical value according to the time until input. In this way, the score can be an evaluation value of a numerical value of five or more levels, and a prediction score that more appropriately reflects the user's preference can be set. As a result, an appropriate item can be recommended according to the user's preference.

本発明のアイテムリコメンドシステムにおいて、前記予測スコア計算手段は、全予測スコアのうちスコアの高いものから所定件数の予測スコアをアイテム識別子とユーザ識別子とを関連つけて予測スコアテーブルに保存する手段であるものとすることもできる。こうすれば、予測スコアテーブルに保存する予測スコアの件数を少なくすることができ、処理を迅速にすることができる。 In the item recommendation system according to the present invention, the prediction score calculation means is a means for storing a predetermined number of prediction scores in the prediction score table in association with item identifiers and user identifiers from the highest scores among all prediction scores. It can also be. In this way, the number of prediction scores stored in the prediction score table can be reduced, and the processing can be speeded up.

本発明のアイテムリコメンドシステムにおいて、前記予測スコア計算手段は、前記実スコアテーブルのスコアのうちユーザが指定したスコアを用いて前記近似関数を導出する手段であるものとすることもできる。こうすれば、ユーザの意向を反映した嗜好に沿ったアイテムをリコメンドすることができる。この場合、前記予測スコア計算手段は、前記実スコアテーブルのスコアのうちユーザのスコアとユーザが指定した他のユーザのスコアとを用いて前記近似関数を導出する手段であるものとすることもできる。こうすれば、ユーザは、自分の嗜好だけでなく、指定した他のユーザの嗜好に沿ったアイテムの提示を受けることができる。更に、前記予測スコア計算手段は、ユーザによって入力された重みによりユーザのスコアとユーザが指定した他のユーザのスコアとを用いて前記近似関数を導出する手段であるものとすることもできる。 In the item recommendation system of the present invention, the predicted score calculation means may be means for deriving the approximate function using a score designated by a user among the scores of the actual score table. By doing this, it is possible to recommend an item in accordance with the preference reflecting the user's intention. In this case, the predicted score calculation means may be means for deriving the approximation function using a user score and a score of another user specified by the user among the scores of the actual score table. . In this way, the user can receive the presentation of items not only according to his / her preference but also according to the preferences of other designated users. Further, the predicted score calculation means may be a means for deriving the approximation function using a user score and a score of another user specified by the user based on a weight input by the user.

本発明のアイテムリコメンドシステムにおいて、前記アイテム特徴量ベクトルはアイテム画像に含まれる代表色を成分として含み、前記アイテム画像に含まれる代表色はアイテム画像に対してアイテムのカテゴリ毎の所定領域の各画素の色情報におけるＬａｂ色空間の座標の分布に基づいて設定される予め定めた成分数の色ベクトルであるものとすることもできる。こうすれば、ユーザの色に対する嗜好をより反映した予測スコアを計算することができ、ユーザの色に対する嗜好をより反映したアイテムをリコメンドすることができる。この場合、前記アイテム特徴量ベクトルは前記代表色の占有率を成分として含み、前記代表色の占有率は、前記所定領域の各画素を該画素の色に直近の前記色ベクトルにおける成分により区分し、前記所定領域の全画素数に対する各区分の画素数として得られるベクトルであるものとすることもできる。こうすれば、更にユーザの色に対する嗜好をより反映した予測スコアを計算してアイテムをリコメンドすることができる。 In the item recommendation system of the present invention, the item feature quantity vector includes a representative color included in the item image as a component, and the representative color included in the item image is a pixel in a predetermined region for each category of the item with respect to the item image. It is also possible to use a color vector with a predetermined number of components set based on the distribution of coordinates in the Lab color space in the color information. If it carries out like this, the prediction score which reflected the user's preference with respect to a color can be calculated, and the item which reflected the user's preference with respect to a color more can be recommended. In this case, the item feature vector includes the occupancy of the representative color as a component, and the occupancy of the representative color classifies each pixel in the predetermined area by a component in the color vector closest to the color of the pixel. Further, it can be a vector obtained as the number of pixels in each section with respect to the total number of pixels in the predetermined area. In this way, the item can be recommended by calculating a prediction score that further reflects the user's preference for color.

本発明のアイテムリコメンドシステムにおいて、前記アイテム特徴量ベクトルはブランド座標および／またはカテゴリ座標を含み、前記ブランド座標は予め定めた各ブランド間の距離に基づいて同定されるＮ次元空間の座標であり、前記カテゴリ座標は予め定めた各カテゴリ間の距離に基づいて同定されるＮ次元空間の座標であるものとすることもできる。こうすれば、ユーザのブランドやカテゴリに対する嗜好をより反映した予測スコアを計算することができ、ユーザのブランドやカテゴリに対する嗜好をより反映したアイテムをリコメンドすることができる。 In the item recommendation system of the present invention, the item feature quantity vector includes brand coordinates and / or category coordinates, and the brand coordinates are coordinates in an N-dimensional space identified based on a predetermined distance between each brand, The category coordinates may be coordinates in an N-dimensional space identified based on predetermined distances between the categories. In this way, a prediction score that more reflects the user's preference for the brand and category can be calculated, and an item that more reflects the user's preference for the brand and category can be recommended.

本発明のアイテムリコメンドシステムにおいて、前記実スコアテーブルの同一ユーザのスコアのうち第１所定数の訓練データに対して、前記アイテム特徴量ベクトルを計算するのに必要なパラメータの１つを変化させてアイテム特徴量ベクトルを訓練用ベクトルとして計算し、前記実スコアテーブルの前記同一ユーザのスコアのうち第２所定数の評価データに対応するアイテム特徴量ベクトルと前記訓練用ベクトルとに基づいて前記パラメータの最適値を計算するパラメータ最適値計算手段を備え、前記アイテム特徴量ベクトル計算手段は、前記パラメータとして前記パラメータ最適値計算手段により計算された最適値を用いてアイテム特徴量ベクトルを計算する手段であるものとすることもできる。こうすれば、アイテム特徴量ベクトルを計算するのに必要なパラメータをより適正なものとすることができる。この結果、ユーザの嗜好をより適正に反映した予測スコアを計算することができ、ユーザの嗜好をより適正に反映したアイテムをリコメンドすることができる。この場合、前記パラメータ最適値計算手段は、アイテム特徴量ベクトルのうち前記パラメータにより影響を受ける成分と影響を受けない成分とを用いて最適化問題を解くことにより前記パラメータの最適値を計算するものとすることもできる。こうすれば、アイテム特徴量のすべてがパラメータによって影響を受けるとして計算する場合に比して、パラメータの最適値を求める計算を簡易なものとすることができる。 In the item recommendation system according to the present invention, one of the parameters required to calculate the item feature vector is changed for a first predetermined number of training data among the scores of the same user in the actual score table. An item feature quantity vector is calculated as a training vector, and based on the item feature quantity vector corresponding to a second predetermined number of evaluation data of the same user score in the actual score table and the training vector, Parameter optimum value calculating means for calculating an optimum value is provided, and the item feature quantity vector calculating means is means for calculating an item feature quantity vector using the optimum value calculated by the parameter optimum value calculating means as the parameter. It can also be. This makes it possible to make the parameters necessary for calculating the item feature vector more appropriate. As a result, it is possible to calculate a prediction score that more appropriately reflects the user's preference, and to recommend an item that more appropriately reflects the user's preference. In this case, the parameter optimum value calculation means calculates the optimum value of the parameter by solving the optimization problem using the component affected by the parameter and the component not affected by the parameter of the item feature quantity vector. It can also be. In this way, the calculation for obtaining the optimum value of the parameter can be simplified as compared with the case where the calculation is performed assuming that all the item feature amounts are affected by the parameter.

本発明のアイテムリコメンドシステムにおいて、特定のアイテム情報に対してアイテム特徴量ベクトルを計算すると共に指定したユーザに対する前記近似関数を用いて予測スコアを計算して第１スコアとして出力し、前記特定のアイテム情報のアイテム特徴量の一部を変更した情報に対してアイテム特徴量ベクトルを計算すると共に前記指定したユーザに対する前記近似関数を用いて予測スコアを計算して第２スコアとして出力するスコア変動計算手段を備えるものとすることもできる。こうすれば、特定のアイテム情報のアイテム特徴量の一部を変更したときに予測スコアがどのようになるかを調べることができる。この場合、前記スコア変動計算手段は、前記特定のアイテム情報のアイテム特徴量の一部を順次変更して複数の予測スコアを計算し、前記計算した複数の予測スコアを前記第２スコアとして対応するアイテム特徴量と共に出力する手段であるものとすることもできる。こうすれば、アイテム特徴量の変動に対して予測スコアの変化を容易に見いだすことができる。また、前記スコア変動計算手段は、前記特定のアイテム情報のアイテム特徴量の一部を順次変更して複数の予測スコアを計算し、前記計算した複数の予測スコアのうち最も高い予測スコアを含む少なくとも１つ以上の予測スコアを前記第２スコアとして対応するアイテム特徴量と共に出力する手段であるものとすることもできる。こうすれば、最も高い予測スコアに対するアイテム特徴量を容易に見いだすことができる。 In the item recommendation system of the present invention, an item feature quantity vector is calculated for specific item information, a predicted score is calculated using the approximate function for a specified user, and is output as a first score. A score fluctuation calculating means for calculating an item feature amount vector for information obtained by changing a part of the item feature amount of information, and calculating a predicted score using the approximation function for the specified user and outputting it as a second score It can also be provided. In this way, it is possible to check what the prediction score will be when a part of the item feature amount of the specific item information is changed. In this case, the score variation calculation means sequentially changes a part of the item feature amount of the specific item information to calculate a plurality of prediction scores, and the calculated plurality of prediction scores corresponds to the second score. It may be a means for outputting together with the item feature amount. In this way, it is possible to easily find a change in the prediction score with respect to a change in the item feature amount. In addition, the score variation calculation means calculates a plurality of prediction scores by sequentially changing a part of the item feature amount of the specific item information, and includes at least a highest prediction score among the calculated plurality of prediction scores It may be a means for outputting one or more prediction scores together with the corresponding item feature amount as the second score. In this way, it is possible to easily find the item feature amount for the highest prediction score.

本発明の一実施例としてのアイテムリコメンドシステム２０の概略を示す概略構成図である。It is a schematic block diagram which shows the outline of the item recommendation system 20 as one Example of this invention. 切り抜きパラメータの一例を示す説明図である。It is explanatory drawing which shows an example of a clipping parameter. カテゴリがアウターの場合の切り抜きパラメータによるアイテム画像中の切り抜き領域の一例を示す説明図である。It is explanatory drawing which shows an example of the clipping area | region in the item image by the clipping parameter in case a category is an outer. 支配ベクトルの決定の様子の一例を示す説明図である。It is explanatory drawing which shows an example of the mode of determination of a dominance vector. ブランド間の距離の一例を示す説明図である。It is explanatory drawing which shows an example of the distance between brands. ユーザ端末１２にアイテム画像を提示する提示画面の一例を示す説明図である。It is explanatory drawing which shows an example of the presentation screen which presents an item image to the user terminal.

次に、本発明を実施するための形態を実施例を用いて説明する。 Next, the form for implementing this invention is demonstrated using an Example.

図１は、本発明の一実施例としてのアイテムリコメンドシステム２０の概略を示す概略構成図である。実施例のアイテムリコメンドシステム２０は、図示するように、ＥＣ（electronic commerce）サイト１０やユーザ端末１２と接続可能に構成されており、アイテム情報取得部２２と、アイテム特徴量ベクトル計算部２４と、予測スコア計算部２６と、パラメータ最適値計算部２８と、Ｗｅｂサーバ３０と、アイテム情報テーブル３２と、アイテム特徴量ベクトルテーブル３４と、予測スコアテーブル３６と、実スコアテーブル３８と、ユーザ情報テーブル４０と、を備える。アイテム情報取得部２２やアイテム特徴量ベクトル計算部２４，予測スコア計算部２６，パラメータ最適値計算部２８，Ｗｅｂサーバ３０は、複数のコンピュータにより構成してもよいし、単一のコンピュータにより構成してもよい。アイテム情報テーブル３２やアイテム特徴量ベクトルテーブル３４，予測スコアテーブル３６，実スコアテーブル３８，ユーザ情報テーブル４０は、複数の記憶装置により構成してもよいし、単一の記憶装置により構成してもよい。 FIG. 1 is a schematic configuration diagram showing an outline of an item recommendation system 20 as an embodiment of the present invention. The item recommendation system 20 of the embodiment is configured to be connectable to an EC (electronic commerce) site 10 and a user terminal 12, as shown in the figure, and includes an item information acquisition unit 22, an item feature quantity vector calculation unit 24, Prediction score calculation unit 26, parameter optimum value calculation unit 28, Web server 30, item information table 32, item feature quantity vector table 34, prediction score table 36, actual score table 38, and user information table 40 And comprising. The item information acquisition unit 22, the item feature quantity vector calculation unit 24, the prediction score calculation unit 26, the parameter optimum value calculation unit 28, and the Web server 30 may be configured by a plurality of computers or a single computer. May be. The item information table 32, the item feature vector table 34, the prediction score table 36, the actual score table 38, and the user information table 40 may be configured by a plurality of storage devices, or may be configured by a single storage device. Good.

アイテム情報取得部２２は、ＥＣサイト１０から新規のアイテム情報を取得し、取得したアイテム情報をアイテム識別子に関連つけてアイテム情報テーブル３２に保存する。アイテム情報としては、以下のものを挙げることができる。
(a) アイテム名（〇〇Ｔシャツ，△△パンツ，□□スーツ等）
(b) ＥＣサイトのショップ名（〇〇ストア，〇〇Ｗｅｂ等）
(c) ブランド名
(d) アイテム画像ＵＲＬ（Uniform Resource Locator）
(e) カテゴリ（トップス，アウター，スーツ等）
(f) 性別（メンズ，ウィメンズ等）
(g) 定価
(h) 実売価格
(i) 実売価格の割引率（３０％オフ等）
(j) 展開されているサイズ（Ｓ，Ｍ，Ｌ，ＬＬ等）
(k) 素材の情報
(l) 原産国
(m) 紹介文 The item information acquisition unit 22 acquires new item information from the EC site 10 and stores the acquired item information in the item information table 32 in association with the item identifier. The item information can include the following.
(a) Item name (00 T-shirt, △△ pants, □□ suit, etc.)
(b) EC site shop name (00 store, 00 web, etc.)
(c) Brand name
(d) Item image URL (Uniform Resource Locator)
(e) Category (tops, outerwear, suits, etc.)
(f) Gender (Men, Women, etc.)
(g) List price
(h) Sales price
(i) Discount rate of actual sales price (30% off, etc.)
(j) Expanded size (S, M, L, LL, etc.)
(k) Material information
(l) Country of origin
(m) Intro

アイテム特徴量ベクトル計算部２４は、アイテム情報テーブル３２からアイテム特徴量ベクトルが計算されていないアイテム情報をアイテム識別子を用いて取り出し、取り出したアイテム情報に基づいてアイテム特徴量ベクトルを計算する。そして、計算したアイテム特徴量ベクトルをアイテム識別子に関連つけてアイテム特徴量ベクトルテーブル３４に保存する。アイテム特徴量ベクトルとしては、実施例では以下の成分を有するものとした。
(a) アイテム画像に含まれる代表色情報（代表色ベクトル）とその占有率
(b) ブランド座標
(c) カテゴリ座標
(d) 同一カテゴリ内での実売価格の偏差値
(e) 実売価格の割引率 The item feature vector calculation unit 24 extracts item information for which no item feature vector has been calculated from the item information table 32 using the item identifier, and calculates an item feature vector based on the extracted item information. Then, the calculated item feature quantity vector is stored in the item feature quantity vector table 34 in association with the item identifier. The item feature vector has the following components in the embodiment.
(a) Representative color information (representative color vector) included in the item image and its occupancy
(b) Brand coordinates
(c) Category coordinates
(d) Deviation value of the actual sales price within the same category
(e) Sales price discount rate

(a)アイテム画像に含まれる代表色情報（代表色ベクトル）は、例えば、以下のように計算することができる。まず、アイテム情報に含まれるアイテム画像ＵＲＬを用いてアイテム画像をダウンロードし、アイテムのカテゴリに対応した切り抜きパラメータにしたがって、ダウンロードしたアイテム画像の一部分を切り抜く。切り抜きパラメータの一例を図２に示す。図３にカテゴリがアウターの場合の切り抜きパラメータによるアイテム画像中の切り抜き領域の一例を示す。図３中、幅が（Width=100）、高さが（height=100）で示された矩形領域がアイテム画像であり、このアイテム画像中に幅が（１００×０．２５＝２５）、高さが（１００×０．２５×１．０＝２５）で示された矩形領域が切り抜き領域である。図２中、「w#offset」は切り抜き領域の中心のアイテム画像における左側からの位置を意味している。アウターの場合、「０．６」であるから、アイテム画像の左側からアイテム画像の幅に「０．６」を乗じた位置（Width×0.6）となる。図２中、「h#offset」は切り抜き領域の中心のアイテム画像における上側からの位置を意味している。アウターの場合、「０．４７」であるから、アイテム画像の左側からアイテム画像の幅に「０．４７」を乗じた位置（height×0.47）となる。図２中、「h#ratio」は切り抜き領域の高さのアイテム画像の高さに対する比を意味している。アウターの場合、「０．２５」であるから、アイテム画像の高さ（height=100）に「０．２５」を乗じた値（height×0.25）となる。図２中、「w#h#ratio」は切り抜き領域の幅の切り抜き領域の高さに対する比を意味している。アウターの場合、「１．０」であるから、切り抜き領域の高さ（height×0.25）に「１．０」を乗じた値（height×0.25×1.0）となる。切り抜きパラメータは、初期値として予め定められた値、例えばカテゴリー毎に画像の特徴をよく表わしている部分であると想定される領域として予め設定した値を用い、実施例では、その後にパラメータ最適値計算部２８により計算された値に更新するものとした。 (a) The representative color information (representative color vector) included in the item image can be calculated as follows, for example. First, the item image is downloaded using the item image URL included in the item information, and a part of the downloaded item image is cut out according to the cutting parameters corresponding to the category of the item. An example of the clipping parameters is shown in FIG. FIG. 3 shows an example of a cutout area in the item image based on the cutout parameter when the category is outer. In FIG. 3, a rectangular area having a width (Width = 100) and a height (height = 100) is an item image, and the item image has a width (100 × 0.25 = 25) and a height. A rectangular region indicated by (100 × 0.25 × 1.0 = 25) is a cutout region. In FIG. 2, “w # offset” means the position from the left side of the item image at the center of the cutout area. In the case of the outer, since it is “0.6”, the position (Width × 0.6) is obtained by multiplying the width of the item image by “0.6” from the left side of the item image. In FIG. 2, “h # offset” means the position from the upper side in the item image at the center of the cutout region. In the case of the outer, since it is “0.47”, the position (height × 0.47) is obtained by multiplying the width of the item image by “0.47” from the left side of the item image. In FIG. 2, “h # ratio” means the ratio of the height of the cutout area to the height of the item image. In the case of the outer, since it is “0.25”, the height (height = 100) of the item image is multiplied by “0.25” (height × 0.25). In FIG. 2, “w # h # ratio” means the ratio of the width of the cutout area to the height of the cutout area. In the case of the outer, since it is “1.0”, it becomes a value (height × 0.25 × 1.0) obtained by multiplying the height (height × 0.25) of the cutout region by “1.0”. The clipping parameter is a value that is set in advance as an initial value, for example, a value that is set in advance as a region that is assumed to be a part that well represents the image characteristics for each category. It is assumed that the value calculated by the calculation unit 28 is updated.

次に、切り抜き領域の画像の各画素の色情報をＬａｂ色空間（ＣＩＥＬａｂ表色系）に変換し、切り抜き領域の左上の画素から順にｉ番目の画素のＬ値がＬ、ａ値がａ、ｂ値がｂとしたときにｉ番目の画素の色情報ｘ_i＝（Ｌ，ａ，ｂ）としてメモリに保存する。そして、色情報ｘ_i（ｉ＝１，…，画素数）をニューラルガス（ＮＧ）に入力し、色情報ｘ_i（ｉ＝１，…，画素数）の分布を最もうまく代表するｋ個の代表色ベクトルｙ₁，ｙ₂，…，ｙ_kを算出する。ここで、「ｋ」は予め設定しておくパラメータである。実施例ではｋ＝４として代表色を４色とした。また、ニューラルガスは、多数の入力ベクトルの集合を比較的少ないユニット（参照ベクトル）で近似するベクトル量子化に用いられるネットワークであり、自己組織化マップ（ＳＯＭ）と同様に教師なし学習によって自己組織的にベクトル量子化を行なう。ニューラルガスは、入力ベクトルが与えられると入力ベクトルに最も近い参照ベクトルを持つユニットを勝者ユニットとするから、実施例では、この勝者ユニットとして代表色ベクトルｙ₁，ｙ₂，…，ｙ_kを算出するものとした。なお、実施例ではニューラルガスを用いたが、k-means法などのベクトル量子化を実現する他の方法を用いるものとしてもよい。 Next, the color information of each pixel of the image in the cutout region is converted into a Lab color space (CIE Lab color system), and the L value of the i-th pixel in order from the upper left pixel of the cutout region is L, and the a value is a When the b value is b, the color information x _i of the i th pixel is stored in the memory as (L, a, b). Then, the color information x _i (i = 1,..., The number of pixels) is input to the neural gas (NG), and the k number of colors that best represent the distribution of the color information x _i (i = 1,..., The number of pixels). Representative color vectors y ₁ , y ₂ ,..., Y _k are calculated. Here, “k” is a parameter set in advance. In the embodiment, k = 4 and four representative colors. A neural gas is a network used for vector quantization that approximates a set of a large number of input vectors with relatively few units (reference vectors). Like a self-organizing map (SOM), a neural gas is self-organized by unsupervised learning. Vector quantization is performed. In the embodiment, the neural gas calculates a representative color vector y ₁ , y ₂ ,..., Y _k as the winner unit since the unit having the reference vector closest to the input vector is given as the winner unit. To do. Although the neural gas is used in the embodiment, other methods for realizing vector quantization such as the k-means method may be used.

代表色の占有率は、色情報ｘ_i（ｉ＝１，…，画素数）と代表色ベクトルｙ₁，ｙ₂，…，ｙ_kを用いて算出する。具体的には、ｉ番目の画素の色情報ｘ_iと代表色ベクトルｙ₁，ｙ₂，…，ｙ_kとの差を計算し、この差が最も小さい代表色ベクトルをｉ番目の画素の色情報ｘ_iの支配ベクトルとし、各代表色ベクトルが支配ベクトルとなった画素数の全画素数に対する割合として占有率を算出する。例えば、３×３の画素で色ベクトルの成分が１つの場合を考える。図４は、３×３の画素で色ベクトルの成分が１つで代表色ベクトルｙ₁＝０．２，ｙ₂＝０．４，ｙ₃＝０．６，ｙ₄＝０．８の場合の支配ベクトルの決定の様子の一例を示す説明図である。図中、上段左の画素では色ベクトルが「０．８」であるから、代表色ベクトルｙ₁，ｙ₂，ｙ₃，ｙ₄との差は、０．６，０．４，０．２，０となり、差の最も小さい代表色ベクトルｙ₄が支配ベクトルとなる。上段中の画素では色ベクトルが「０．６」であるから、代表色ベクトルｙ₁，ｙ₂，ｙ₃，ｙ₄との差は、０．４，０．２，０，０．２となり、差の最も小さい代表色ベクトルｙ₃が支配ベクトルとなる。以下、同様に支配ベクトルを決定することができる。代表色ベクトルｙ₁，ｙ₂，ｙ₃，ｙ₄の各占有率は、全画素数に対する代表色ベクトルが支配ベクトルの画素数の割合であるから、３／９，２／９，２／９，２／９として算出することができる。 The occupation ratio of the representative color is calculated using the color information x _i (i = 1,..., The number of pixels) and the representative color vectors y ₁ , y ₂ _,. Specifically, the difference between the color information x _i of the i th pixel and the representative color vectors y ₁ , y ₂ ,..., Y _k is calculated, and the representative color vector with the smallest difference is determined as the color of the i th pixel. The occupation ratio is calculated as the ratio of the number of pixels in which the representative color vector becomes the dominant vector to the total number of pixels as the dominant vector of the information x _i . For example, consider a case where there are 3 × 3 pixels and one color vector component. FIG. 4 shows a case where a 3 × 3 pixel has one color vector component and a representative color vector y ₁ = 0.2, y ₂ = 0.4, y ₃ = 0.6, and y ₄ = 0.8. It is explanatory drawing which shows an example of the mode of determination of the dominance vector. In the figure, since the color vector is “0.8” in the upper left pixel, the difference from the representative color vectors y ₁ , y ₂ , y ₃ , y ₄ is 0.6, 0.4, 0.2. , 0, and the representative color vector y ₄ having the smallest difference becomes the dominant vector. Since the color vector is “0.6” in the pixels in the upper stage, the difference from the representative color vectors y ₁ , y ₂ , y ₃ , y ₄ is 0.4, 0.2, 0, 0.2. The representative color vector y ₃ having the smallest difference becomes the dominant vector. Hereinafter, the dominant vector can be determined in the same manner. The occupation ratios of the representative color vectors y ₁ , y ₂ , y ₃ , and y ₄ are 3/9, 2/9, and 2/9 because the representative color vector is the ratio of the number of pixels of the dominant vector to the total number of pixels. , 2/9.

(b) ブランド座標は、以下のように算出することができる。まず、ブランド間の相性などを距離として設定する。例えば、ブランド１〜４に対して図５に示すようにブランド間の距離を設定する。この距離の設定は感覚的なものであるため、人によって予め定められる。図５の例では、ブランド間の最大距離を「１．０」として示している。この例では、ブランド１とブランド２との距離が０．２１であり、ブランド１とブランド３との距離が０．４５であり、ブランド１とブランド４との距離が０．５３であるから、ブランド１に対する相性の良さはブランド２，ブランド３，ブランド４の順となる。そして、ブランド間の距離をＮ次元空間内でもっともうまく説明できる座標を同定する。座標を同定するための最適化問題は、次式（１）により定式化される。この問題を最適化手法（例えば、粒子群最適化法 (Particle Swarm Optimization) や差分進化法 (Differential Evolution) 等）を用いて解くことで、Ｎ次元空間でのブランド座標を決定する。 (b) Brand coordinates can be calculated as follows. First, the compatibility between brands is set as the distance. For example, the distance between brands is set for brands 1 to 4 as shown in FIG. Since the setting of this distance is sensuous, it is predetermined by a person. In the example of FIG. 5, the maximum distance between brands is indicated as “1.0”. In this example, the distance between brand 1 and brand 2 is 0.21, the distance between brand 1 and brand 3 is 0.45, and the distance between brand 1 and brand 4 is 0.53. Brand 2, Brand 3, Brand 4 are in order of good compatibility with Brand 1. Then, the coordinates that can best explain the distance between the brands in the N-dimensional space are identified. The optimization problem for identifying the coordinates is formulated by the following equation (1). By solving this problem using an optimization method (for example, particle swarm optimization method, differential evolution method, etc.), brand coordinates in the N-dimensional space are determined.

つぎに、上述のブランド座標同定作業を次元数Ｎを変更しながら繰り返し行ない、ブランド座標を表現するための最適な空間次元を決定する。即ち、次式（２）の最適化問題を解いてブランド座標を決定する。 Next, the above-described brand coordinate identification operation is repeated while changing the number of dimensions N, and the optimum spatial dimension for expressing the brand coordinates is determined. That is, the brand coordinates are determined by solving the optimization problem of the following equation (2).

(c) カテゴリ座標は、上述のブランド座標同定作業とほぼ同様に行う。即ち、式（１）のｐをカテゴリ番号、ｘｐをｐ番目のカテゴリのＮ次元空間上の座標、Ｄｐｑをカテゴリｐ、ｑ間の距離として、式（２）の最適化問題を解いてカテゴリ座標を決定する。 (c) Category coordinates are performed in substantially the same manner as the above-described brand coordinate identification work. That is, by solving the optimization problem of Equation (2), p in Equation (1) is the category number, xp is the coordinate in the Nth space of the pth category, and Dpq is the distance between categories p and q. To decide.

(d) 同一カテゴリ内での実売価格の偏差値は、解析対象アイテムの実売価格が同じカテゴリ内のアイテムと比較して、どの程度の位置（高価格帯，中価格帯，低価格帯）にあるかを偏差値 (T-Score) を用いて表現するために用いる。まず、アイテム情報テーブルから、対象とするカテゴリに属する全アイテム（アイテム数をＮとする）の実売価格情報を取得する。取得した価格情報のうちｉ番目のアイテムの価格をｘ_iとしてＸ＝｛ｘ₁，ｘ₂，…，ｘ_N｝とする。このとき、対象とするアイテムの実売価格をｘとすると、その偏差値Ｔ（ｘ）は次式（３）により計算される。 (d) The deviation value of the actual selling price within the same category is compared to the item within the same category where the actual selling price of the analysis target item is at what position (high price range, medium price range, low price range). ) Is used to express the deviation value (T-Score). First, the actual sale price information of all items (the number of items is N) belonging to the target category is acquired from the item information table. X = {x ₁ , x ₂ ,..., X _N } where x _i is the price of the i-th item in the acquired price information. At this time, assuming that the actual selling price of the target item is x, the deviation value T (x) is calculated by the following equation (3).

Ｗｅｂサーバ３０は、アイテム情報テーブル３２からアイテム情報を取り出してユーザ端末１２に提示する提示部として機能する。また、Ｗｅｂサーバ３０は、ユーザ端末１２に提示されたアイテム画像等のアイテム情報に基づいてアイテムに対してユーザが入力した多段階の評価値としてのスコアを取得すると共に、取得したスコアをアイテム識別子とユーザ識別子とに関連つけて実スコアテーブル３８に保存する実スコア取得部としても機能する。図６は、ユーザ端末１２にアイテム画像を提示する提示画面の一例を示す説明図である。図６の例では、中央にアイテム画像が表示される。また、アイテム画像の左上にカテゴリ（「パンツ」）が表示され、アイテム画像の下に性別（「MEN'S」）が表示される。アイテム画像の左側にはハートマークが破れた「ＤＩＳＬＩＫＥ」のボタンが表示され、アイテム画像の右側にはハートマークの「ＬＩＫＥ」のボタンが表示される。アイテム画像の下には、この提示画面に提示されていないアイテム情報を表示するための「More」ボタンが設けられている。スコアは、実施例では、提示画面に提示されていないアイテム情報を表示することなく「ＬＩＫＥ」のボタンを押したときにはユーザの嗜好として最も好ましいと判断して「４」、提示画面に提示されていないアイテム情報を表示することなく「ＤＩＳＬＩＫＥ」のボタンを押したときにはユーザの嗜好として最も好ましくない（嫌い）と判断して「１」、提示画面に提示されていないアイテム情報を表示した後に「ＬＩＫＥ」のボタンを押したときにはユーザは迷いを感じながらも好ましいと判断して「３」、提示画面に提示されていないアイテム情報を表示した後に「ＤＩＳＬＩＫＥ」のボタンを押したときにはユーザは迷いを感じながらも好ましくない（嫌い）と判断した「２」、の４段階の評価値を用いた。このように多段階の評価値とすることにより、ユーザの嗜好をより適正に反映することができる。Ｗｅｂサーバ３０は、ユーザ端末１２に提示するアイテム情報としては、アイテム情報テーブル３２のアイテム情報のうちそれまでに提示していないアイテム情報であって、後述する予測スコアが最も高いアイテム情報から順に「ＬＩＫＥ」のボタン又は「ＤＩＳＬＩＫＥ」のボタンが操作される毎に提示される。ユーザ端末１２に提示したか否かの判定は、実スコアテーブル３８にユーザ識別子とアイテム識別子とに関連つけてスコアが保存されているか否かにより行なうことができる。なお、Ｗｅｂサーバ３０は、ユーザ端末１２から取得したユーザ情報をユーザ識別子に関連つけてユーザ情報テーブルに登録するユーザ登録部としての機能も有する。 The Web server 30 functions as a presentation unit that extracts item information from the item information table 32 and presents the item information to the user terminal 12. Further, the Web server 30 acquires a score as a multi-stage evaluation value input by the user for an item based on item information such as an item image presented on the user terminal 12, and the acquired score is used as an item identifier. It also functions as an actual score acquisition unit that stores the actual score table 38 in association with the user identifier. FIG. 6 is an explanatory diagram illustrating an example of a presentation screen for presenting an item image on the user terminal 12. In the example of FIG. 6, the item image is displayed in the center. In addition, a category (“pants”) is displayed at the upper left of the item image, and a sex (“MEN'S”) is displayed below the item image. On the left side of the item image, a “DISLIKE” button with a broken heart mark is displayed, and on the right side of the item image, a “LIKE” button with a heart mark is displayed. Below the item image, a “More” button for displaying item information not presented on the presenting screen is provided. In the embodiment, the score is determined to be the most preferable user preference when the “LIKE” button is pressed without displaying item information that is not presented on the presentation screen, and the score is presented on the presentation screen. When the “DISLINK” button is pressed without displaying any item information, “1” is determined as the most unfavorable (disliked) user preference, and “LIKE” is displayed after displaying item information that is not presented on the presentation screen. When the user presses the “DISLINK” button after displaying item information that is not presented on the presentation screen, the user feels lost. However, a four-stage evaluation value of “2” judged to be unfavorable (dislike) was used. Thus, by setting it as a multistage evaluation value, a user's preference can be reflected more appropriately. The item information to be presented to the user terminal 12 by the Web server 30 is item information that has not been presented so far, among the item information in the item information table 32, and the item information having the highest predicted score, which will be described later. It is presented each time the “LIKE” button or the “DISLIKE” button is operated. It can be determined whether or not the user terminal 12 has been presented based on whether or not a score is stored in the actual score table 38 in association with the user identifier and the item identifier. The Web server 30 also has a function as a user registration unit that registers the user information acquired from the user terminal 12 in the user information table in association with the user identifier.

予測スコア計算部２６は、ユーザ毎に、実スコアテーブル３８のスコアとこのスコアに対応するアイテム特徴量ベクトルテーブル３４におけるアイテム特徴量ベクトルとに基づいて、アイテム特徴量ベクトルの入力に対してスコアを計算する近似関数を導出する。具体的には、まず、対象ユーザに対して、ユーザ識別子をキーとして実スコアテーブル３８からスコアとアイテム識別子とを１組とする情報をランダムに予め設定した数（例えば１００件など）だけ取得する。情報をランダムに取得するのは、ランダムに取得した情報に基づいて予測スコアの計算を行なうと、スコア計算の度に異なる結果が得られ、リコメンドする際の多様性や意外性を担保することができるからである。続いて、取得した情報のアイテム識別子をキーとして、アイテム特徴量ベクトルテーブル３４からアイテム特徴量ベクトルを取得し、アイテムｐのアイテム特徴量ベクトルｘ^pとアイテムｐのスコアｓ^pとに基づいて訓練データ（ｘ^p，ｓ^p），ｐ＝１，…，Ｐを次式（４）のように準備する。 For each user, the predicted score calculation unit 26 calculates a score for the input of the item feature vector based on the score of the actual score table 38 and the item feature vector in the item feature vector table 34 corresponding to this score. The approximate function to be calculated is derived. Specifically, first, for a target user, a set number (for example, 100) of information that sets a score and an item identifier as one set is acquired from the actual score table 38 using the user identifier as a key. . Information is obtained at random. When the predicted score is calculated based on the randomly acquired information, a different result is obtained each time the score is calculated, and it is possible to ensure diversity and unexpectedness when making recommendations. Because it can. Subsequently, as a key item identifier of the acquired information, and acquires the item characteristic amount vector from the item characteristic amount vector table 34, the training data based on the scores s ^p item feature vector x ^p and item p items p ^{^{(x p, s p),}} p = 1, ..., be prepared as following equation P (4).

次に、訓練データの（ｘ^p，ｓ^p），ｐ＝１，…，Ｐのすべてのアイテム特徴量ベクトルｘ^pの入力に対してスコアｓ^pが出力されるように、即ちすべてのアイテムｐに対して次式（５）となるように、機械学習法を用いて近似関数Ｆを構築する。ここで、近似関数Ｆの構築（これを学習とも称する。）とは、関数構造Ｆおよびそのパラメータを決定することを意味する。機械学習法としては、ニューラルネットワークを始めとしてさまざまな方法を適用することができるが、実施例では、ラジアル基底関数ネットワークを用いた。ラジアル基底関数ネットワークを用いるのは、行列演算を１度行なうだけで高速に学習を行なうことができ、連続関数近似を得意としているため、実施例で想定している数万ユーザ数の予測スコア計算に向いているからである。 Next, the training data (x ^p, s ^p), p = 1, ..., as the score s ^p for the input of all the items feature vectors x ^p of P is output, i.e., all the items p , An approximate function F is constructed using a machine learning method so that the following equation (5) is obtained. Here, the construction of the approximate function F (also referred to as learning) means that the function structure F and its parameters are determined. As a machine learning method, various methods such as a neural network can be applied. In the embodiment, a radial basis function network is used. The use of the radial basis function network enables high-speed learning with only one matrix operation, and is good at continuous function approximation. Therefore, the prediction score calculation of tens of thousands of users assumed in the embodiment is performed. Because it is suitable for.

ラジアル基底関数ネットワークでは、複数のガウス関数の重ね合わせで近似関数Ｆを構成する。ラジアル基底関数の近似関数Ｆは次式（６）として構成される。 In the radial basis function network, the approximate function F is configured by superimposing a plurality of Gaussian functions. The approximate function F of the radial basis function is configured as the following equation (6).

ラジアル基底関数ネットワークの学習では、ｘの成分数をＮとして、パラメータｗ，ｒ，ｖ₁，…ｖ_Mを次式（７）〜（１０）のように算出することができる。ここで、「λ」は予め設定しておく定数であり、「Ｉ」はＰ×Ｐの単位行列である。 In learning of the radial basis function network, the number of components of x is N, and the parameters w, r, v ₁ ,... V _M can be calculated as in the following equations (7) to (10). Here, “λ” is a preset constant, and “I” is a P × P unit matrix.

実施例では、構築した近似関数Ｆを用いて、全アイテムを評価する方法またはクラスタ上の代表点を評価する方法のいずれかの方法を用いて、アイテムの予測スコアを算出し、その結果を予測スコアテーブル３６に保存する。 In the embodiment, by using the constructed approximate function F, the prediction score of the item is calculated using either the method of evaluating all items or the method of evaluating the representative points on the cluster, and the result is predicted. Save in the score table 36.

全アイテムを評価する方法では、まず、アイテム特徴量ベクトルテーブル３４の全アイテムに対するアイテム特徴量ベクトルを順次取得し、取得したアイテム特徴量ベクトルを近似関数Ｆに入力して全アイテムに対する予測スコアを算出する。ｉ番目のアイテムのアイテム特徴量ベクトルをｘｉとすると、ｉ番目のアイテムの予測スコアｓｉは、次式（１１）として算出することができる。こうして算出した予測スコアを大きい順に並び替え、スコアが大きい予測スコアから予め定めた設定数（例えば１０００件など）だけ予測スコアにアイテム識別子とユーザ識別子とに関連つけて予測スコアテーブル３６に保存する。 In the method for evaluating all items, first, item feature vectors for all items in the item feature vector table 34 are sequentially acquired, and the obtained item feature vectors are input to the approximation function F to calculate a prediction score for all items. To do. If the item feature quantity vector of the i-th item is xi, the prediction score si of the i-th item can be calculated as the following equation (11). The prediction scores calculated in this way are rearranged in descending order, and a preset number (for example, 1000) of the prediction scores having the highest score is stored in the prediction score table 36 in association with the item identifier and the user identifier.

クラスタ上の代表点を評価する方法では、まず、事前にアイテム特徴量ベクトルを特徴量として、公知のベクトル量子化器を用いて予測対象アイテムをクラスタリングし、各クラスタの代表ベクトルを保存しておく。次に、各クラスタの代表ベクトルを近似関数Ｆに入力して、各クラスタの代表ベクトルの予測スコアを算出する。ｉ番目のクラスタの代表ベクトルをｘｉとすると、ｉ番目のクラスタの代表ベクトルの予測スコアｓｉは、上述の式（１１）として算出することができる。すべてのクラスタの代表ベクトルの予測スコアを算出すると、代表ベクトルの予測スコアが大きい順にクラスタを並び替え、１番目のクラスタに含まれるアイテムでスコアリングが行われていないアイテムについて、上述の式（１１）を用いて予測スコアを算出し、そのスコアが大きい順に予測スコアをアイテム識別子とユーザ識別子とに関連つけて予測スコアテーブル３６に保存する。１番目のクラスタに含まれるアイテムの評価が終了したら、２番目のクラスタに移行し、これを順次繰り返す。この操作の過程で設定数（例えば１０００件）まで予測スコアテーブル３６に保存した時点で処理を終了する。 In the method of evaluating representative points on a cluster, first, the item feature quantity vector is used as a feature quantity, the prediction target items are clustered using a known vector quantizer, and the representative vector of each cluster is stored. . Next, the representative vector of each cluster is input to the approximation function F, and the prediction score of the representative vector of each cluster is calculated. Assuming that the representative vector of the i-th cluster is xi, the prediction score si of the representative vector of the i-th cluster can be calculated as the above equation (11). When the prediction scores of the representative vectors of all the clusters are calculated, the clusters are rearranged in descending order of the prediction scores of the representative vectors, and the items in the first cluster that have not been scored are subjected to the above equation (11). ) Is used to calculate the prediction score, and the prediction score is stored in the prediction score table 36 in association with the item identifier and the user identifier in descending order. When the evaluation of the items included in the first cluster is completed, the process moves to the second cluster, and this is sequentially repeated. In the course of this operation, the process is terminated when the set number (for example, 1000) is stored in the prediction score table 36.

予測スコア計算部２６は、上述したように、ユーザ毎にそのユーザの実スコアに基づいて近似関数Ｆを導出するが、ユーザが指定した実スコアに基づいて近似関数Ｆを導出するものとしてもよい。例えば、ユーザが、自分自身と他のユーザ（例えば、雑誌モデルなど）を指定したときには、ユーザの実スコアと指定した他のユーザ（以下、「お気に入りユーザ」と称する。）の実スコアとに基づいて近似関数Ｆを導出する。具体的には、実スコアテーブル３８からユーザのスコアを件数Ｐ１だけランダムに取得すると共にお気に入りユーザのスコアを件数Ｐ２だけランダムに取得する。そして、取得した全件数Ｐ（＝Ｐ１＋Ｐ２）のスコアを用いて上述した手法により近似関数Ｆを導出する。ここで、件数Ｐ１，Ｐ２は、予め設定したものとしてもよいし、ユーザが設定するものとしてもよい。ユーザが設定する場合、件数Ｐ１，Ｐ２はユーザとお気に入りユーザとに対する嗜好の重みを設定するものと解することができる。このように、ユーザとユーザが指定した他のユーザのスコアに基づいて近似関数Ｆを導出することにより、ユーザの嗜好だけでなく、ユーザの嗜好と指定した他のユーザの嗜好とを融合した嗜好に沿った予測スコアを計算することができる。 As described above, the predicted score calculation unit 26 derives the approximate function F for each user based on the actual score of the user. However, the predicted score calculation unit 26 may derive the approximate function F based on the actual score designated by the user. . For example, when the user designates himself and another user (for example, a magazine model), based on the actual score of the user and the actual score of the designated other user (hereinafter referred to as “favorite user”). Thus, an approximate function F is derived. Specifically, the user's score is randomly acquired from the actual score table 38 by the number P1, and the favorite user's score is randomly acquired by the number P2. And the approximate function F is derived | led-out with the method mentioned above using the score of the acquired total number P (= P1 + P2). Here, the number of cases P1 and P2 may be set in advance, or may be set by the user. When the user sets, the number of cases P1 and P2 can be understood as setting the preference weight for the user and the favorite user. Thus, by deriving the approximation function F based on the score of the user and the other user specified by the user, not only the user preference but also the preference that fuses the user preference and the specified other user preference. A predicted score along can be calculated.

上述したように、Ｗｅｂサーバ３０は、ユーザ端末１２から実施例のアイテムリコメンドシステムへのアクセスを受けると、アイテム情報の提示のリクエストと判断し、ユーザ識別子をキーとして予測スコアテーブル３６の予測スコアのアイテム情報のうち、それまでに提示していないアイテム情報であって予測スコアが最も高いアイテム情報をユーザ端末１２に提示する。予測スコア計算部２６では、ユーザの嗜好に沿った予測スコアを計算したり、ユーザの嗜好とユーザが指定した他のユーザの嗜好とを融合した嗜好に沿った予測スコアを計算するから、ユーザの意向に沿ったアイテム情報を提示することができる。 As described above, when receiving the access to the item recommendation system of the embodiment from the user terminal 12, the Web server 30 determines that it is a request for presentation of item information, and uses the user identifier as a key for the prediction score of the prediction score table 36. Among the item information, item information that has not been presented so far and has the highest predicted score is presented to the user terminal 12. The predicted score calculation unit 26 calculates a predicted score according to the user's preference, or calculates a predicted score according to the preference that fuses the user's preference with the preferences of other users specified by the user. Item information according to the intention can be presented.

パラメータ最適値計算部２８は、アイテム特徴量ベクトル計算部２４でアイテム特徴量ベクトルを計算するのに必要なパラメータの最適値を計算する。この計算は、まず、実スコアテーブル３８の対象ユーザのスコアをランダムに所定件数Ｐ（例えば２００件）だけ取得し、取得した全スコアのうち所定件数Ｐ１（例えば１００件）を訓練データとし、残余の件数Ｐ２（＝Ｐ−Ｐ１、例えば１００件）を評価データとする。次に、訓練データを用いてアイテム特徴量ベクトルを計算するのに必要なパラメータの１つを変化させたアイテム特徴量ベクトルを訓練用ベクトルとして計算する。そして、評価データに対応するアイテム特徴量ベクトルと訓練用ベクトルとに基づいてパラメータの最適値を計算する。具体例として、以下に、アイテム画像に含まれる代表色情報を計算する際に用いられる切り抜きパラメータ「w#offset」，「h#offset」，「h#ratio」，「w#h#ratio」の最適値を計算する様子について説明する。切り抜きパラメータＹを次式（１２）のように表わし、式（１３）に示すように機械学習機を構築する。そして、式（１４）に示す最適化問題を解いて切り抜きパラメータＹを決定する。この最適化問題は、種々の連続変数最適化手法（例えば、粒子群最適化法や、差分進化法など）で解くことができる。こうして得られた切り抜きパラメータＹは、アイテム特徴量ベクトル計算部２４に送られて用いられる。 The parameter optimum value calculation unit 28 calculates the optimum parameter values necessary for the item feature vector calculation unit 24 to calculate the item feature vector. In this calculation, first, a predetermined number P (for example, 200) of the target user's score in the actual score table 38 is randomly acquired, and the predetermined number P1 (for example, 100) of all the acquired scores is used as training data, and the remainder The number of cases P2 (= P-P1, for example, 100 cases) is used as evaluation data. Next, an item feature amount vector obtained by changing one of the parameters necessary for calculating the item feature amount vector using the training data is calculated as a training vector. Then, the optimum parameter value is calculated based on the item feature vector and the training vector corresponding to the evaluation data. As specific examples, the clipping parameters “w # offset”, “h # offset”, “h # ratio”, “w # h # ratio” used when calculating the representative color information included in the item image are as follows: How to calculate the optimum value will be described. The clipping parameter Y is expressed as in the following equation (12), and a machine learning machine is constructed as shown in the equation (13). Then, the optimization parameter shown in Expression (14) is solved to determine the clipping parameter Y. This optimization problem can be solved by various continuous variable optimization methods (for example, particle swarm optimization method, differential evolution method, etc.). The clipping parameter Y obtained in this way is sent to the item feature vector calculation unit 24 for use.

以上説明した実施例のアイテムリコメンドシステムでは、ユーザ毎に実スコアテーブル３８のスコアと対応するアイテム特徴量ベクトルとに基づいて近似関数Ｆを導出し、導出した近似関数Ｆを適用して全アイテム情報に対して予測スコアを計算し、ユーザ端末１２に対してそれまでに表示されていないアイテム情報のうち最も高い予測スコアのアイテム情報を提示する。このため、ユーザの嗜好により適合したアイテムをリコメンドすることができる。 In the item recommendation system of the embodiment described above, an approximate function F is derived for each user based on the score of the actual score table 38 and the corresponding item feature quantity vector, and all item information is applied by applying the derived approximate function F. The prediction score is calculated, and the item information of the highest prediction score is presented to the user terminal 12 among the item information not displayed so far. For this reason, it is possible to recommend an item that is more suitable for the user's preference.

また、実施例のアイテムリコメンドシステムでは、実スコアテーブル３８のスコアのうちユーザのスコアを用いて近似関数Ｆを導出したり、ユーザのスコアとユーザが指定した他のユーザのスコアとを用いて近似関数Ｆを導出したりすることができるから、ユーザは、自分の嗜好だけでなく、指定した他のユーザの嗜好に沿ったアイテムをリコメンドすることができる。 Further, in the item recommendation system of the embodiment, the approximation function F is derived using the user's score among the scores of the actual score table 38, or approximated using the user's score and another user's score designated by the user. Since the function F can be derived, the user can recommend items not only according to his / her preference but also in accordance with the preferences of other designated users.

実施例のアイテムリコメンドシステムでは、アイテム画像に対してアイテムのカテゴリ毎の所定領域の各画素の色情報におけるＬａｂ色空間の座標の分布に基づいてアイテム画像に含まれる代表色情報（代表色ベクトル）を計算し、これをアイテム特徴量ベクトルの成分とする。これにより、ユーザの色に対する嗜好をより反映したアイテムをリコメンドすることができる。 In the item recommendation system of the embodiment, the representative color information (representative color vector) included in the item image based on the distribution of the coordinates of the Lab color space in the color information of each pixel in the predetermined area for each item category with respect to the item image. Is used as a component of the item feature vector. Thereby, the item which reflected the user's preference for the color can be recommended.

実施例のアイテムリコメンドシステムでは、予め定めた各ブランド間の距離に基づいて同定されるＮ次元空間の座標としてブランド座標とカテゴリ座標とを計算し、これをアイテム特徴量ベクトルの成分とする。これにより、ユーザのブランドやカテゴリに対する嗜好をより反映した予測スコアを計算することができ、ユーザのブランドやカテゴリに対する嗜好をより反映したアイテムをリコメンドすることができる。 In the item recommendation system of the embodiment, brand coordinates and category coordinates are calculated as coordinates in an N-dimensional space identified based on a predetermined distance between each brand, and are used as components of the item feature vector. Accordingly, a prediction score that more reflects the user's preference for the brand or category can be calculated, and an item that more reflects the user's preference for the brand or category can be recommended.

実施例のアイテムリコメンドシステムでは、アイテム特徴量ベクトル計算部２４でアイテム特徴量ベクトルを計算するのに必要なパラメータの最適値を計算し、計算したパラメータを用いてアイテム特徴量ベクトルを計算する。これにより、アイテム特徴量ベクトルを計算するのに必要なパラメータをより適正なものとすることができる。しかも、パラメータの最適値の計算において、アイテム特徴量ベクトルのうちパラメータにより影響を受ける成分と影響を受けない成分とを用いて最適化問題を解くから、より容易に且つ適正にパラメータの最適値を計算するものとすることもできる。 In the item recommendation system of the embodiment, the item feature value vector calculation unit 24 calculates the optimum value of the parameter necessary for calculating the item feature value vector, and calculates the item feature value vector using the calculated parameter. Thereby, the parameter required for calculating the item feature vector can be made more appropriate. In addition, since the optimization problem is solved using the component affected by the parameter and the component not affected by the parameter in the item feature vector in the calculation of the optimum value of the parameter, the optimum value of the parameter can be determined more easily and appropriately. It can also be calculated.

実施例のアイテムリコメンドシステムでは、アイテム情報テーブル３２のアイテム情報のうちそれまでに提示していないアイテム情報であって、予測スコアが最も高いアイテムをユーザ端末１２に提示し、「ＬＩＫＥ」のボタン又は「ＤＩＳＬＩＫＥ」のボタンが操作される毎に次に予測スコアの高いアイテムをユーザ端末１２に提示するものとした。しかし、予測スコアが最も高いアイテムから順に所定数（例えば、５つ）をユーザ端末１２に一覧で提示するものとしてもよい。この場合、「ＬＩＫＥ」のボタンや「ＤＩＳＬＩＫＥ」のボタンなどの評価値を設定するためのボタンを各アイテムの左右の両方或いは一方に配置すればよい。 In the item recommendation system of the embodiment, the item information that has not been presented so far among the item information in the item information table 32 and that has the highest predicted score is presented to the user terminal 12, and the “LIKE” button or Each time the “DISLINK” button is operated, an item with the next highest predicted score is presented to the user terminal 12. However, a predetermined number (for example, five) may be presented in a list to the user terminal 12 in order from the item with the highest predicted score. In this case, a button for setting an evaluation value, such as a “LIKE” button or a “DISLIKE” button, may be arranged on both the left and right sides of each item.

実施例のアイテムリコメンドシステムでは、提示画面に提示されていないアイテム情報を表示することなく「ＬＩＫＥ」のボタンを押したときの「４」、提示画面に提示されていないアイテム情報を表示することなく「ＤＩＳＬＩＫＥ」のボタンを押したときの「１」、提示画面に提示されていないアイテム情報を表示した後に「ＬＩＫＥ」のボタンを押したときの「３」、提示画面に提示されていないアイテム情報を表示した後に「ＤＩＳＬＩＫＥ」のボタンを押したときの「２」、の１〜４の４段階の評価値としてスコアを取得するものとした。しかし、提示画面に提示されていないアイテム情報を表示した後の評価値としてはアイテム情報の提示の程度に応じた段階的な評価値としてもよい。例えば、アイテム情報の提示画面が最初のアイテム画像の提示画面を含めて３つある場合、最初のアイテム画像の提示画面の表示だけで「ＬＩＫＥ」「ＤＩＳＬＩＫＥ」のボタンを操作したときの「６」「１」、最初のアイテム画像の提示画面と次のアイテム情報の提示画面の２つの提示画面の表示に対して「ＬＩＫＥ」「ＤＩＳＬＩＫＥ」のボタンを操作したときの「５」「２」、３つの提示画面のすべての表示に対して「ＬＩＫＥ」「ＤＩＳＬＩＫＥ」のボタンを操作したときの「４」「３」の６段階としてもよい。また、アイテム画像の提示画面の提示を開始してから「ＬＩＫＥ」「ＤＩＳＬＩＫＥ」のボタンを操作するまでの時間に応じた段階的な数値を評価値としてもよい。例えば、アイテム画像の提示画面の提示を開始してから「ＬＩＫＥ」「ＤＩＳＬＩＫＥ」のボタンを操作するまでの時間が３秒以内のときの「４」「１」、３秒を超える時間を要したときの「３」「２」の４段階としてもよい。更に、アイテム情報の提示の程度とアイテム画像の提示画面の提示を開始してから「ＬＩＫＥ」「ＤＩＳＬＩＫＥ」のボタンを操作するまでの時間に基づいて多段階の評価値としてもよい。これらの評価値をスコアとして用いれば、ユーザの嗜好をより適切に反映した予測スコアを設定することができ、ユーザの嗜好により適正なアイテムをリコメンドすることができる。 In the item recommendation system of the embodiment, “4” when the “LIKE” button is pressed without displaying item information that is not presented on the presentation screen, without displaying item information that is not presented on the presentation screen. "1" when the "DISLIKE" button is pressed, "3" when the "LIKE" button is pressed after displaying item information not presented on the presentation screen, item information not presented on the presentation screen It is assumed that the score is acquired as four-level evaluation values of 1 to 4 of “2” when the “DISLIKE” button is pressed after displaying “”. However, the evaluation value after displaying the item information not presented on the presentation screen may be a stepwise evaluation value corresponding to the degree of presentation of the item information. For example, when there are three item information presentation screens including the first item image presentation screen, “6” when the “LIKE” and “DISLINK” buttons are operated only by displaying the first item image presentation screen. “1”, “5”, “2”, and “3” when the “LIKE” and “DISLIKE” buttons are operated for the display of the two presentation screens of the first item image presentation screen and the next item information presentation screen. Six steps of “4” and “3” when the buttons of “LIKE” and “DISLIKE” are operated for all displays of one presentation screen may be used. Further, a stepwise numerical value corresponding to the time from the start of the presentation of the item image presentation screen to the operation of the “LIKE” or “DISLIKE” button may be used as the evaluation value. For example, it took more than “4”, “1”, and 3 seconds when the time from starting the presentation of the item image presentation screen to operating the “LIKE” and “DISLIKE” buttons was within 3 seconds. It is good also as four steps of "3" and "2" of time. Furthermore, it is good also as a multi-stage evaluation value based on the time from the start of presentation of the item information and the presentation of the item image presentation screen to the operation of the “LIKE” and “DISLIKE” buttons. If these evaluation values are used as scores, it is possible to set a prediction score that more appropriately reflects the user's preference, and it is possible to recommend an appropriate item according to the user's preference.

実施例のアイテムリコメンドシステムでは、アイテム情報として、アイテム名，ＥＣサイトのショップ名，ブランド名，アイテム画像ＵＲＬ，カテゴリ，性別，定価，実売価格，実売価格の割引率，展開されているサイズ，素材の情報，原産国，紹介文などとしたが、アイテム情報はこれらに限定されるものではなく、これらの一部としてもよいし、他の情報を含めるものとしてもよい。基本的にアイテム情報に含まれる情報として好ましい情報としては、アイテム名，ＥＣサイトのショップ名，ブランド名，アイテム画像ＵＲＬ，カテゴリ，性別などを挙げることができる。 In the item recommendation system of the embodiment, as item information, item name, EC site shop name, brand name, item image URL, category, gender, list price, sale price, sale price discount rate, deployed size , Material information, country of origin, introductory text, etc., but the item information is not limited to these, and may be a part of them or may include other information. Basically, information preferable as information included in the item information includes item name, EC site shop name, brand name, item image URL, category, gender, and the like.

実施例のアイテムリコメンドシステムでは、アイテム特徴量ベクトルの成分として、アイテム画像に含まれる代表色情報とその占有率，ブランド座標，カテゴリ座標，同一カテゴリ内での実売価格の偏差値，実売価格の割引率を有するものとした。しかし、アイテム特徴量ベクトルの成分としては、これらに限定されるものではなく、これらの一部としてもよいし、他の成分を含めるものとしてもよい。基本的にアイテム特徴量ベクトルの成分として含まれるのが好ましい成分としては、アイテム画像に含まれる代表色情報，ブランド座標，カテゴリ座標などを挙げることができる。 In the item recommendation system of the embodiment, as the component of the item feature vector, the representative color information included in the item image and its occupancy, brand coordinates, category coordinates, the deviation value of the selling price within the same category, the selling price Discount rate. However, the component of the item feature vector is not limited to these, and may be a part of these, or may include other components. Basically, the components preferably included as the component of the item feature vector include representative color information, brand coordinates, category coordinates and the like included in the item image.

実施例のアイテムリコメンドシステムでは、単一ユーザのスコアリング結果あるいは複数ユーザのスコアリング結果から、訓練データをランダムに予め設定した数だけ取得する方法を用いるものとした。さらに、単一ユーザのスコアリング結果のクラスタを利用する場合や、複数ユーザのスコアリング結果のクラスタを利用する場合を挙げることができる。これらについて以下に簡単に説明する。 In the item recommendation system of the embodiment, a method of acquiring a predetermined number of training data randomly from the scoring result of a single user or the scoring result of a plurality of users is used. Furthermore, the case where the cluster of the scoring result of a single user is used, and the case of using the cluster of the scoring result of a plurality of users can be mentioned. These are briefly described below.

単一ユーザのスコアリング結果のクラスタを利用する場合では、まず、実スコアテーブル３８から予測対象ユーザのスコアリング結果のすべて（全スコア）を取得する。次に、全スコアを公知のベクトル量子化器を用いてアイテム特徴量ベクトルを特徴量として、スコアリングされたアイテムをスコア毎にクラスタリングし、各クラスタの代表ベクトルを算出する。算出された代表ベクトルとスコアの情報を用いてデータセットを次式（１５）のように生成する。これらをまとめてＰ＝Ｐ１＋Ｐ２＋Ｐ３＋Ｐ４個の訓練データとし、上述した方法を適用して、予測対象ユーザの予測スコアを算出する。 In the case of using a cluster of scoring results of a single user, first, all of the scoring results of the prediction target user (all scores) are acquired from the actual score table 38. Next, the scored items are clustered for each score by using the item feature quantity vector as a feature quantity by using a known vector quantizer for all scores, and a representative vector of each cluster is calculated. Using the calculated representative vector and score information, a data set is generated as in the following equation (15). These are combined into P = P1 + P2 + P3 + P4 pieces of training data, and the prediction score of the prediction target user is calculated by applying the method described above.

複数ユーザのスコアリング結果のクラスタを利用する場合では、まず、ユーザ情報テーブルから予測対象ユーザのお気に入りユーザのユーザ識別子を取得する。次に、実スコアテーブル３８から予測対象ユーザのスコアリング結果のすべて（全スコア）とお気に入りユーザのスコアリング結果のすべて（全スコア）を取得する。これらを用いて、上述の単一ユーザのスコアリング結果のクラスタを利用する場合と同様にして、対象ユーザの予測スコアを算出する。 When using a cluster of scoring results of a plurality of users, first, the user identifier of the favorite user of the prediction target user is acquired from the user information table. Next, all of the scoring results of the prediction target user (all scores) and all of the scoring results of the favorite users (all scores) are acquired from the actual score table 38. Using these, the predicted score of the target user is calculated in the same manner as in the case of using the cluster of scoring results of the single user described above.

実施例のアイテムリコメンドシステムは、アイテム情報取得部２２やアイテム特徴量ベクトル計算部２４，予測スコア計算部２６，パラメータ最適値計算部２８の他に、アイテム特徴量の一部が変更された場合に、対象アイテムに対する予測スコアがどのように変化するかを計算するスコア変動計算部を備えるものとしてもよい。スコア変動計算部は、対象アイテムのアイテム情報に対してアイテム特徴量ベクトルを計算し、対象ユーザに対する近似関数Ｆを用いて予測スコアを計算して第１スコアとする。次に対象アイテムのアイテム情報のアイテム特徴量（例えば価格など）の一部を変更した情報に対してアイテム特徴量ベクトルを計算し、対象ユーザに対する近似関数Ｆを用いて予測スコアを計算して第２スコアとする。これを全ユーザまたは指定したユーザに対して行ない、第１スコアと第２スコアとを出力する。こうすれば、アイテム特徴量の変動に対して予測スコアの変化を容易に見いだすことができる。この場合、対象アイテムのアイテム情報のアイテム特徴量の一部を順次変更して複数の予測スコアを計算し、計算した複数の予測スコアのうち最も高い予測スコアを含む所定数の予測スコアを第２スコアとし、この第２スコアとアイテム特徴量とを出力するものとしてもよい。こうすれば、最も高い予測スコアに対するアイテム特徴量を容易に見いだすことができる。 In the item recommendation system of the embodiment, in addition to the item information acquisition unit 22, the item feature quantity vector calculation unit 24, the prediction score calculation unit 26, and the parameter optimum value calculation unit 28, when a part of the item feature quantity is changed, A score fluctuation calculation unit that calculates how the predicted score for the target item changes may be provided. The score variation calculation unit calculates an item feature vector for the item information of the target item, calculates a predicted score using the approximation function F for the target user, and sets it as the first score. Next, an item feature quantity vector is calculated for information obtained by changing a part of the item feature quantity (for example, price) of the item information of the target item, and a prediction score is calculated using an approximation function F for the target user. 2 scores. This is performed for all users or designated users, and the first score and the second score are output. In this way, it is possible to easily find a change in the prediction score with respect to a change in the item feature amount. In this case, a part of the item feature amount of the item information of the target item is sequentially changed to calculate a plurality of prediction scores, and a predetermined number of prediction scores including the highest prediction score among the calculated plurality of prediction scores It is good also as what outputs this 2nd score and item feature-value as a score. In this way, it is possible to easily find the item feature amount for the highest prediction score.

以上、本発明を実施するための形態について実施例を用いて説明したが、本発明はこうした実施例に何等限定されるものではなく、本発明の要旨を逸脱しない範囲内において、種々なる形態で実施し得ることは勿論である。 As mentioned above, although the form for implementing this invention was demonstrated using the Example, this invention is not limited at all to such an Example, In the range which does not deviate from the summary of this invention, it is with various forms. Of course, it can be implemented.

本発明は、アイテムリコメンドシステムの製造産業などに利用可能である。 The present invention can be used in the manufacturing industry of item recommendation systems.

１０ＥＣサイト、１２ユーザ端末、２０アイテムリコメンドシステム、２２アイテム情報取得部、２４アイテム特徴量ベクトル計算部、２６予測スコア計算部、２８パラメータ最適値計算部、３０Ｗｅｂサーバ、３２アイテム情報テーブル、３４アイテム特徴量ベクトルテーブル、３６予測スコアテーブル、３８実スコアテーブル、４０ユーザ情報テーブル。 10 EC site, 12 User terminal, 20 Item recommendation system, 22 Item information acquisition unit, 24 Item feature vector calculation unit, 26 Predictive score calculation unit, 28 Parameter optimum value calculation unit, 30 Web server, 32 Item information table, 34 Item feature vector table, 36 prediction score table, 38 actual score table, 40 user information table.

Claims

An item recommendation system for recommending items to a user terminal,
Item information acquisition means for acquiring item information from an EC site and associating the item information with an item identifier and storing them in an item information table;
Item feature amount vector for calculating an item feature amount vector for a predetermined item feature amount based on item information in the item information table, and storing the item feature amount vector in association with the item identifier in the item feature amount vector table Calculation means;
A score as a multi-stage evaluation value input for presentation of item information in the item information table to the user terminal is acquired, and the score, the item identifier, and the user identifier are associated and stored in the actual score table. Real score acquisition means;
For each user, an approximate function for calculating a score for the input of the item feature quantity vector is derived based on the score of the actual score table and the item feature quantity vector in the item feature quantity vector table corresponding to the score. A prediction score for all item information is calculated by applying all item feature vectors in the item feature vector table to the approximate function, and the prediction score, the item identifier, and the user identifier are associated with each other in the prediction score table. A predicted score calculation means to be stored;
Presenting means for presenting item information that is not presented to the user terminal among all item information in the item information table in response to a request from the user terminal and has the highest predicted user score of the user terminal at least When,
Item recommendation system with.

The item recommendation system according to claim 1,
The actual score acquisition means is means for acquiring a score input to the item information presented to the user terminal by the presentation means.
Item recommendation system.

An item recommendation system according to claim 1 or 2,
The score is determined as a stepwise numerical value for whether or not the item image of the item information is input immediately without requesting other information to be displayed and whether or not the item is input after the other information is presented. Is the evaluated value,
Item recommendation system.

An item recommendation system according to any one of claims 1 to 3,
The predicted score calculation means is a means for deriving the approximate function using a score designated by a user among the scores of the actual score table.
Item recommendation system.

The item recommendation system according to claim 4,
The predicted score calculation means is a means for deriving the approximation function using a user score and a score of another user specified by the user among the scores of the actual score table.
Item recommendation system.

An item recommendation system according to any one of claims 1 to 5,
The item feature vector includes a representative color included in the item image as a component,
The representative color included in the item image is a color vector having a predetermined number of components set based on the distribution of coordinates in the Lab color space in the color information of each pixel in a predetermined area for each item category with respect to the item image. Is,
Item recommendation system.

The item recommendation system according to claim 6,
The item feature vector includes an occupancy of the representative color as a component,
The occupancy rate of the representative color is a vector obtained by dividing each pixel in the predetermined area by a component in the color vector closest to the color of the pixel and obtaining the number of pixels in each section with respect to the total number of pixels in the predetermined area. ,
Item recommendation system.

The item recommendation system according to any one of claims 1 to 7,
The item feature quantity vector includes brand coordinates and / or category coordinates as components,
The brand coordinates are coordinates in an N-dimensional space identified based on a predetermined distance between each brand,
The category coordinates are coordinates in an N-dimensional space identified based on a predetermined distance between each category.
Item recommendation system.