JP6167029B2

JP6167029B2 - RECOMMENDATION INFORMATION GENERATION DEVICE AND RECOMMENDATION INFORMATION GENERATION METHOD

Info

Publication number: JP6167029B2
Application number: JP2013249172A
Authority: JP
Inventors: 佑介深澤; 優也立花
Original assignee: NTT Docomo Inc
Current assignee: NTT Docomo Inc
Priority date: 2013-12-02
Filing date: 2013-12-02
Publication date: 2017-07-19
Anticipated expiration: 2033-12-02
Also published as: JP2015106346A

Description

本発明は、レコメンド情報を生成するレコメンド情報生成装置およびレコメンド情報生成方法に関する。 The present invention relates to a recommendation information generating apparatus and a recommendation information generating method for generating recommendation information.

通信販売サイトでは、他のユーザの購入履歴に基づいて、ある商品の紹介を行っている。このようなサイトでは、バッチ処理により、事前にすべての商品に対して、他のユーザの購入履歴を参照して、その商品に関連する商品を紹介するためのレコメンド情報を生成している。一般的には、コンテンツ特徴ベクトル同士の内積を計算することにより、その類似度を把握しておき、類似度の高い順にコンテンツが並ぶようレコメンド情報を生成している。 A mail order site introduces a product based on the purchase history of other users. In such a site, recommendation information for introducing a product related to the product is generated by batch processing with reference to the purchase history of other users in advance for all the products. Generally, by calculating the inner product of content feature vectors, the similarity is grasped, and recommendation information is generated so that the contents are arranged in descending order of similarity.

しかしながら、一般的には、商品等は膨大に存在しており、このようなバッチ処理には時間がかかる。例えば、１００万件のコンテンツがあった場合、１００万×１００万の演算処理をする必要が有り、その処理は膨大なものとなる。 However, generally, there are a large number of products and the like, and such batch processing takes time. For example, when there are 1 million contents, it is necessary to perform 1 million × 1 million arithmetic processing, and the processing becomes enormous.

また、通信販売サイト以外の一般的なデジタルコンテンツ、例えばニュース記事などでは、賞味期限が早く関連コンテンツとして提供するコンテンツを常時更新する必要がある。また、一般論として、デジタルコンテンツでは、ロングテールが大きく、また誰にも見られていないコンテンツがほとんどである。よって、そのようなコンテンツに対してもあらかじめ計算しておくことは計算リソースの無駄ということに他ならない。 In addition, in general digital content other than mail order sites, such as news articles, it is necessary to constantly update content provided as related content with an early expiration date. In general, digital content has a large long tail and is not seen by anyone. Therefore, calculation for such content in advance is a waste of calculation resources.

ここで、レコメンド情報を提供するに際して、バッチ処理部分とリアルタイム処理部分とに分けて提供することが考えられている。下記特許文献１は、バッチ処理とリアルタイム処理とでレコメンド情報の生成を行っている技術について開示している。 Here, when providing the recommendation information, it is considered that the recommendation information is provided separately in a batch processing portion and a real-time processing portion. The following Patent Document 1 discloses a technique that generates recommendation information by batch processing and real-time processing.

特開２０１２−２１６１７５号公報JP 2012-216175 A

しかしながら、特許文献１に記載の技術においては、バッチ処理時に、ユーザ間の相関に基づいたレコメンド情報を生成しようとしたものであり、アクセス予定のないユーザ分のレコメンドまで生成してしまうため、無駄な計算処理を行うことがあり得る。よって、計算リソースの無駄はやはり存在している。 However, in the technique described in Patent Document 1, it is intended to generate recommendation information based on the correlation between users at the time of batch processing, and generates recommendations for users who are not scheduled to access. It is possible to perform a complicated calculation process. Thus, there is still a waste of computing resources.

そこで、本発明においては、上述の課題を解決するために、バッチ処理を行う上で無駄な計算リソースを費やすことなくレコメンド情報を生成することができるレコメンド情報生成装置およびレコメンド情報生成方法を提供することを目的とする。 Therefore, in order to solve the above-described problems, the present invention provides a recommendation information generation device and a recommendation information generation method that can generate recommendation information without spending unnecessary calculation resources in performing batch processing. For the purpose.

上述の課題を解決するために、本発明のレコメンド情報生成装置は、レコメンド対象となるコンテンツのコンテンツ特徴ベクトルに基づいて、当該コンテンツ特徴ベクトルを構成する特徴要素ごとに、一または複数のコンテンツを対応付けた転置インデックステーブルを生成する転置インデックステーブル生成手段と、前記転置インデックステーブル生成手段により生成された転置インデックステーブルを記憶する転置インデックス記憶手段と、ユーザ端末からレコメンド要求が受け付けられると、前記転置インデックス記憶手段に記憶されている転置インデックステーブルに基づいて決定されたユーザ端末におけるコンテンツの表示順位に従ったレコメンド情報を生成するレコメンド情報生成手段と、前記レコメンド情報生成手段により生成されたレコメンド情報を送信する送信手段と、を備え、前記転置インデックステーブル生成手段は、あらかじめ定められた条件を満たした場合に転置インデックステーブルの生成処理を実行し、前記レコメンド情報生成手段は、ユーザ端末からのレコメンド要求が受け付けられるたびに、表示順位の決定処理を実行する。 In order to solve the above-described problem, the recommendation information generation apparatus according to the present invention supports one or a plurality of contents for each feature element constituting the content feature vector based on the content feature vector of the content to be recommended. An inverted index table generating means for generating an attached inverted index table, an inverted index storage means for storing the inverted index table generated by the inverted index table generating means, and when a recommendation request is received from a user terminal, the inverted index Recommendation information generating means for generating recommendation information in accordance with the display order of contents at the user terminal determined based on the transposed index table stored in the storage means; and the recommendation information generating means Transmitting means for transmitting the recommended information, wherein the transposed index table generating means executes a transposed index table generating process when a predetermined condition is satisfied, and the recommended information generating means is a user Each time a recommendation request from a terminal is received, display order determination processing is executed.

また、本発明のレコメンド情報生成方法は、レコメンド情報生成装置におけるレコメンド情報生成方法において、レコメンド対象となるコンテンツのコンテンツ特徴ベクトルに基づいて、当該コンテンツ特徴ベクトルを構成する特徴要素ごとに、一または複数のコンテンツを対応付けた転置インデックステーブルを生成する転置インデックステーブル生成ステップと、前記転置インデックステーブル生成ステップにより生成された転置インデックステーブルを転置インデックス記憶手段に記憶する記憶ステップと、ユーザ端末からレコメンド要求が受け付けられると、前記転置インデックス記憶手段に記憶されている転置インデックステーブルに基づいて決定されたユーザ端末におけるコンテンツの表示順位に従ったレコメンド情報を生成するレコメンド情報生成ステップと、前記レコメンド情報生成ステップにより生成されたレコメンド情報を送信する送信ステップと、を備え、前記転置インデックステーブル生成ステップは、あらかじめ定められた条件を満たした場合に転置インデックステーブルの生成処理を実行し、前記レコメンド情報生成ステップは、ユーザ端末からのレコメンド要求が受け付けられるたびに、表示順位の決定処理を実行する。 Also, the recommendation information generation method of the present invention is a recommendation information generation method in the recommendation information generation apparatus, wherein one or a plurality of feature information constituting the content feature vector is provided for each feature element based on the content feature vector of the content to be recommended. A transposed index table generating step for generating a transposed index table associated with the content, a storing step for storing the transposed index table generated by the transposed index table generating step in a transposed index storage unit, and a recommendation request from the user terminal. When accepted, it generates recommendation information according to the display order of the content on the user terminal determined based on the transposed index table stored in the transposed index storage means. A recommended information generating step, and a transmitting step for transmitting the recommended information generated by the recommended information generating step. The transposed index table generating step generates a transposed index table when a predetermined condition is satisfied. A process is executed, and the recommendation information generation step executes a display order determination process each time a recommendation request from a user terminal is received.

この発明によれば、レコメンド対象となるコンテンツのコンテンツ特徴ベクトルに基づいて、当該コンテンツ特徴ベクトルを構成する特徴要素ごとに、一または複数のコンテンツを対応付けた転置インデックステーブルを生成し、この転置インデックステーブルを記憶する。そして、ユーザ端末からレコメンド要求が受け付けられると、転置インデックステーブルに基づいて決定されたユーザ端末におけるコンテンツの表示順位に従ったレコメンド情報を生成し、レコメンド情報を送信する。これにより、あらかじめ定められた条件を満たした場合に転置インデックステーブルの生成処理を実行し、ユーザ端末からのレコメンド要求が受け付けられるたびに、表示順位の決定処理を実行することになる。したがって、計算時間がかかる処理については事前に終わらすことで、高速な更新処理が求められるサービスに対してもレコメンド情報の提供が可能となる。そして、一部のコンテンツのみが集中的にレコメンド対象となる場合であっても、全部のコンテンツに対して計算することはしないため、その計算リソースが無駄になるということはない、という効果を奏する。 According to this invention, based on the content feature vector of the content to be recommended, a transposed index table in which one or a plurality of contents are associated with each feature element constituting the content feature vector is generated, and this transposed index is generated. Remember the table. Then, when a recommendation request is received from the user terminal, recommendation information is generated according to the display order of contents in the user terminal determined based on the transposed index table, and the recommendation information is transmitted. Thus, the transposed index table generation process is executed when a predetermined condition is satisfied, and the display order determination process is executed every time a recommendation request from the user terminal is received. Therefore, it is possible to provide recommendation information even for a service that requires a high-speed update process, by ending the process that requires calculation time in advance. Even if only some contents are intensively targeted for recommendation, calculation is not performed for all the contents, so that the calculation resource is not wasted. .

また、本発明のレコメンド情報生成装置において、前記レコメンド情報生成手段は、転置インデックステーブルに基づいて、特徴要素を変えて重複して記述されているコンテンツの数である重複数を、コンテンツごとに集計する集計手段と、前記集計手段により集計された重複数を、当該コンテンツのコンテンツ特徴ベクトルを構成する特徴要素のうち任意の特徴要素に対応付けた管理リストを生成する生成手段と、前記生成手段により生成された管理リストに基づいて、ユーザ端末にて重複数の多い順にコンテンツを表示可能なようにコンテンツを抽出することでレコメンド情報を生成する抽出手段と、から構成される。 Further, in the recommendation information generating apparatus of the present invention, the recommendation information generating means aggregates, for each content, a duplication number that is the number of contents that are described in duplicate by changing the feature elements based on the transposed index table. And a generating unit that generates a management list in which the duplication numbers totaled by the totaling unit are associated with arbitrary feature elements among the characteristic elements constituting the content feature vector of the content. Extraction means for generating recommendation information by extracting the content so that the content can be displayed in the order of multiple duplications on the user terminal based on the generated management list.

この発明によれば、転置インデックステーブルに基づいて、特徴要素を変えて重複して記述されているコンテンツの数である重複数を、コンテンツごとに集計し、集計された重複数を、当該コンテンツのコンテンツ特徴ベクトルを構成する特徴要素のうち任意の特徴要素に対応付けた管理リストを生成し、そして、生成された管理リストに基づいて、ユーザ端末にて重複数の多い順にコンテンツを表示可能なようにコンテンツを抽出することでレコメンド情報を生成する。このような、重複数を計数するなど簡易な演算処理を採用していることから、リアルタイム処理に適したものとなる。よって、事前に転置インデックステーブルを生成しておくことで、リアルタイム処理によりレコメンド情報を生成することができる。 According to this invention, based on the transposed index table, the duplication number, which is the number of contents described in duplicate by changing the characteristic elements, is tabulated for each content, and the tabulated duplication number of the content is counted. A management list associated with an arbitrary feature element among the feature elements constituting the content feature vector is generated, and content can be displayed in the order of multiple overlaps on the user terminal based on the generated management list Recommendation information is generated by extracting content. Since simple arithmetic processing such as counting the number of overlaps is employed, it is suitable for real-time processing. Therefore, by generating an inverted index table in advance, recommendation information can be generated by real-time processing.

また、本発明のレコメンド情報生成装置は、あらかじめ定められた特徴要素に基づいてコンテンツの特徴が表現されるコンテンツ特徴ベクトルを生成するコンテンツ特徴ベクトル生成手段と、前記コンテンツ特徴ベクトル生成手段により生成されたコンテンツ特徴ベクトルを、コンテンツごとに記憶するコンテンツ特徴ベクトル記憶手段と、をさらに備え、前記転置インデックステーブル生成手段は、前記コンテンツ特徴ベクトル記憶手段に記憶されているコンテンツ特徴ベクトルを用いて転置インデックステーブルを生成する。 Further, the recommendation information generating apparatus according to the present invention includes a content feature vector generating unit that generates a content feature vector in which content features are expressed based on predetermined feature elements, and the content feature vector generating unit. Content feature vector storage means for storing a content feature vector for each content, and the transposed index table generating means uses the content feature vector stored in the content feature vector storage means to generate a transposed index table. Generate.

この発明によれば、あらかじめ定められた特徴要素に基づいてコンテンツの特徴が表現されるコンテンツ特徴ベクトルを生成し、生成されたコンテンツ特徴ベクトルを、コンテンツごとに記憶する。そして、記憶されているコンテンツ特徴ベクトルを用いて転置インデックステーブルを生成する。これにより、計算時間がかかる処理については事前に終わらすことで、高速な更新処理が求められるサービスに対してもレコメンド情報の提供が可能となる。 According to the present invention, a content feature vector in which content features are expressed based on predetermined feature elements is generated, and the generated content feature vector is stored for each content. Then, a transposed index table is generated using the stored content feature vector. As a result, it is possible to provide recommendation information even for services that require high-speed update processing by ending processing that takes a long time in advance.

また、本発明のレコメンド情報生成装置において、前記転置インデックステーブル生成手段は、前記コンテンツ特徴ベクトル記憶手段に記憶されているコンテンツ特徴ベクトルの数が所定件数に達した時に、バッチ処理により所定件数に対してまとめて実行する。 In the recommendation information generating apparatus according to the present invention, the transposed index table generating unit may perform batch processing on a predetermined number of content feature vectors when the number of content feature vectors stored in the content feature vector storage unit reaches a predetermined number. And execute together.

この発明によれば、転置インデックステーブルを生成するまでの処理を、あらかじめまとまったコンテンツ特徴ベクトルに対して行うといったバッチ処理により実行することになるため、転置インデックステーブルの生成効率を向上させることができ、結果的に計算リソースを効率的に利用することができる。 According to the present invention, since the processing up to the generation of the inverted index table is performed by batch processing such as performing on the content feature vectors that have been collected in advance, the generation efficiency of the inverted index table can be improved. As a result, computational resources can be used efficiently.

本発明によれば、計算時間がかかる処理については事前に終わらすことで、高速な更新処理が求められるサービスに対してもレコメンド情報の提供が可能となる。そして、一部のコンテンツのみが集中的にレコメンド対象となる場合であっても、全部のコンテンツに対して計算することはしないため、その計算リソースが無駄になるということはない、という効果を奏する。 According to the present invention, it is possible to provide recommendation information even for a service that requires high-speed update processing, by ending processing that takes a long time in advance. Even if only some contents are intensively targeted for recommendation, calculation is not performed for all the contents, so that the calculation resource is not wasted. .

情報配信サーバ１００を用いたシステム構成図である。1 is a system configuration diagram using an information distribution server 100. FIG. 情報配信サーバ１００の機能構成を示すブロック図である。2 is a block diagram showing a functional configuration of an information distribution server 100. FIG. 情報配信サーバ１００のハードウェア構成図である。2 is a hardware configuration diagram of an information distribution server 100. FIG. 特徴辞書記憶部１０７に記憶されている特徴辞書の具体例を示す説明図である。5 is an explanatory diagram illustrating a specific example of a feature dictionary stored in a feature dictionary storage unit 107. FIG. メタデータおよびコンテンツ特徴ベクトルを示す説明図である。It is explanatory drawing which shows metadata and a content feature vector. 評価値ファイルを示す説明図である。It is explanatory drawing which shows an evaluation value file. 転置インデックステーブルの具体例である。It is a specific example of an inverted index table. コンテンツＩＤリストの説明図である。It is explanatory drawing of a content ID list. 中間リストの説明図である。It is explanatory drawing of an intermediate | middle list. 中間リストから最終リストへの生成過程、および最終結果であるレコメンドコンテンツのリストの生成過程を示す説明図である。It is explanatory drawing which shows the production | generation process from the intermediate | middle list to the last list, and the production | generation process of the list of the recommendation content which is a final result. 携帯端末２００の機能を示すブロック図である。4 is a block diagram illustrating functions of the mobile terminal 200. FIG. 情報配信サーバ１００の処理を示すフローチャートである。4 is a flowchart showing processing of the information distribution server 100.

添付図面を参照しながら本発明の実施形態を説明する。可能な場合には、同一の部分には同一の符号を付して、重複する説明を省略する。 Embodiments of the present invention will be described with reference to the accompanying drawings. Where possible, the same parts are denoted by the same reference numerals, and redundant description is omitted.

図１は、本実施形態におけるレコメンド情報生成装置である情報配信サーバ１００を用いたシステム構成図である。図１に示される通り、情報配信サーバ１００は、携帯端末２００とネットワークを介して通信接続して、携帯端末２００のユーザの好み等に沿ったコンテンツを送信することができる。なお、携帯端末２００は、携帯電話やスマートフォンであるが、これに限るものではなく、パソコンや、そのほかタブレット端末などであってもよい。 FIG. 1 is a system configuration diagram using an information distribution server 100 which is a recommendation information generation apparatus according to the present embodiment. As shown in FIG. 1, the information distribution server 100 can communicate with the mobile terminal 200 via a network and transmit content in accordance with user preferences of the mobile terminal 200. The mobile terminal 200 is a mobile phone or a smartphone, but is not limited to this, and may be a personal computer or a tablet terminal.

図２は、本実施形態の情報配信サーバ１００の機能構成を示すブロック図である。この情報配信サーバ１００は、コンテンツ特徴ベクトル生成部１０１（コンテンツ特徴ベクトル生成手段）、転置インデックス生成部１０２（転置インデックステーブル生成手段）、コンテンツ特徴ベクトル取得部１０３、レコメンドコンテンツ取得部１０４（レコメンド情報生成手段、集計手段、生成手段、抽出手段）、レコメンドコンテンツ送信部１０５（送信手段）、メタデータ記憶部１０６、特徴辞書記憶部１０７、コンテンツ特徴ベクトル記憶部１０８（コンテンツ特徴ベクトル記憶手段）、および転置インデックス記憶部１０９（転置インデックス記憶手段）を含んで構成されている。 FIG. 2 is a block diagram showing a functional configuration of the information distribution server 100 of the present embodiment. The information distribution server 100 includes a content feature vector generation unit 101 (content feature vector generation unit), a transposed index generation unit 102 (transposition index table generation unit), a content feature vector acquisition unit 103, a recommended content acquisition unit 104 (recommendation information generation). Means, aggregation means, generation means, extraction means), recommended content transmission section 105 (transmission means), metadata storage section 106, feature dictionary storage section 107, content feature vector storage section 108 (content feature vector storage means), and transposition. The index storage unit 109 (transposed index storage means) is included.

図３は、情報配信サーバ１００のハードウェア構成図である。図２に示される情報配信サーバ１００は、物理的には、図３に示すように、一または複数のＣＰＵ１１、主記憶装置であるＲＡＭ１２及びＲＯＭ１３、入力デバイスであるキーボード及びマウス等の入力装置１４、ディスプレイ等の出力装置１５、ネットワークカード等のデータ送受信デバイスである通信モジュール１６、ハードディスクまたは半導体メモリ等の補助記憶装置１７などを含むコンピュータシステムとして構成されている。図２における各機能は、図３に示すＣＰＵ１１、ＲＡＭ１２等のハードウェア上に所定のコンピュータソフトウェアを読み込ませることにより、ＣＰＵ１１の制御のもとで入力装置１４、出力装置１５、通信モジュール１６を動作させるとともに、ＲＡＭ１２や補助記憶装置１７におけるデータの読み出し及び書き込みを行うことで実現される。以下、図２に示す機能ブロックに基づいて、各機能ブロックを説明する。 FIG. 3 is a hardware configuration diagram of the information distribution server 100. As shown in FIG. 3, the information distribution server 100 shown in FIG. 2 physically includes one or more CPUs 11, a RAM 12 and a ROM 13 that are main storage devices, and an input device 14 such as a keyboard and a mouse that are input devices. The computer system includes an output device 15 such as a display, a communication module 16 that is a data transmission / reception device such as a network card, and an auxiliary storage device 17 such as a hard disk or a semiconductor memory. Each function in FIG. 2 operates the input device 14, the output device 15, and the communication module 16 under the control of the CPU 11 by reading predetermined computer software on the hardware such as the CPU 11 and the RAM 12 shown in FIG. 3. In addition, it is realized by reading and writing data in the RAM 12 and the auxiliary storage device 17. Hereinafter, each functional block will be described based on the functional blocks shown in FIG.

コンテンツ特徴ベクトル生成部１０１は、メタデータ記憶部１０６に記憶されている各コンテンツのメタデータおよび特徴辞書記憶部１０７に記憶されている特徴辞書に基づいて、コンテンツ特徴ベクトルを生成する部分である。生成したコンテンツ特徴ベクトルは、コンテンツ特徴ベクトル記憶部１０８に記憶される。なお、このコンテンツ特徴ベクトル生成部１０１は、コンテンツの登録がコンテンツ提供者からなされると、すなわち、コンテンツ提供者からコンテンツ及びそのメタデータを受信すると、特徴辞書に基づいてコンテンツ特徴ベクトルを生成することになる。 The content feature vector generation unit 101 is a part that generates content feature vectors based on the metadata of each content stored in the metadata storage unit 106 and the feature dictionary stored in the feature dictionary storage unit 107. The generated content feature vector is stored in the content feature vector storage unit 108. The content feature vector generation unit 101 generates a content feature vector based on a feature dictionary when content registration is performed by a content provider, that is, when content and its metadata are received from the content provider. become.

ここで、図面を用いて、コンテンツ特徴ベクトルの生成について説明する。図４は、特徴辞書記憶部１０７に記憶されている特徴辞書の具体例を示す説明図である。この特徴辞書は、特徴要素、マッチング対象、およびマッチング情報から構成されている。ここで特徴要素は、特徴ベクトルで表現した時の各要素を表すものであり、そのＩＤとその特徴を表す内容とからなる。図４では、特徴要素を表す内容である“メジャー”に対してＩＤ：１が割り振られている。マッチング対象は、コンテンツの特徴を表現する際のマッチングの対象となるコンテンツ内の情報について規定するものであり、コンテンツのうちどの部分をマッチング情報のマッチング対象とするかを示す情報である。図４においては、“メジャー”という特徴要素のマッチング対象として“説明文”が記述されており、メタデータの“説明文”欄に“メジャー”という単語が記述されているか否かが判断されることになる。 Here, generation of content feature vectors will be described with reference to the drawings. FIG. 4 is an explanatory diagram showing a specific example of the feature dictionary stored in the feature dictionary storage unit 107. This feature dictionary is composed of feature elements, matching targets, and matching information. Here, the feature element represents each element when expressed by a feature vector, and includes an ID and contents representing the feature. In FIG. 4, ID: 1 is assigned to “major” which is the content representing the feature element. The matching target defines information in the content to be matched when expressing the feature of the content, and is information indicating which part of the content is the matching target of the matching information. In FIG. 4, “description” is described as a matching target of the feature element “major”, and it is determined whether or not the word “major” is described in the “description” column of the metadata. It will be.

さらに、マッチング情報は、マッチングする対象のキーワードと、このキーワードと特徴要素との間の重要度を示す値とを示している。このマッチング情報は、いわゆる表記ゆれを調整するための情報であり、例えば、野球用語において“メジャー”には、“大リーグ”という言い方もあるが、実質的に同じ意味である。このような同じ意味を指す単語をマッチング情報において対応付けておき、その重要度の値を若干変えることにより、その表記ゆれをベクトルでの表現に反映させることができる。図４では、“メジャー”に対して値：１対応付けられているが、“大リーグ”に対しては、値：０．５が対応付けられている。ここから分かるように特徴要素そのものではない単語については、その重要度の値を下げることにより、コンテンツ特徴ベクトルにおける表現を弱めている。 Further, the matching information indicates a keyword to be matched and a value indicating the importance between the keyword and the feature element. This matching information is information for adjusting so-called notation fluctuations. For example, in the baseball terminology, “major” may be referred to as “major league”, but has substantially the same meaning. By associating such words indicating the same meaning in the matching information and slightly changing the importance value, the notation fluctuation can be reflected in the vector expression. In FIG. 4, the value “1” is associated with “major”, but the value “0.5” is associated with “major league”. As can be seen from this, for the word that is not the feature element itself, the expression in the content feature vector is weakened by lowering the importance value.

つぎに、コンテンツのメタデータについて説明する。図５（ａ）は、コンテンツのメタデータを示す説明図であり、図５（ｂ）および図５（ｃ）は、メタデータに基づいて生成されたコンテンツ特徴ベクトルの一例を示す説明図である。 Next, content metadata will be described. FIG. 5A is an explanatory diagram showing content metadata, and FIG. 5B and FIG. 5C are explanatory diagrams showing examples of content feature vectors generated based on the metadata. .

図５（ａ）に示されるようにメタデータは、コンテンツＩＤ、カテゴリおよび説明文から構成されている。ここでコンテンツＩＤは、コンテンツを識別するための識別番号であり、カテゴリは、コンテンツの種別を示し、説明文は、コンテンツの概要を示している。 As shown in FIG. 5A, the metadata includes a content ID, a category, and an explanatory text. Here, the content ID is an identification number for identifying the content, the category indicates the type of content, and the explanatory text indicates the outline of the content.

コンテンツ特徴ベクトル生成部１０１は、このメタデータと特徴辞書とを用いて、図５（ｂ）に示されるコンテンツ特徴ベクトルを生成する。図５（ｂ）では、コンテンツＩＤごとにその特徴ベクトルを構成する数値が記述されており、ここでは、メジャー、不動産、ドラマ・・・などを特徴要素とするコンテンツ特徴ベクトルを生成することになる。 The content feature vector generation unit 101 generates the content feature vector shown in FIG. 5B using the metadata and the feature dictionary. In FIG. 5B, numerical values constituting the feature vector are described for each content ID. Here, a content feature vector having a major, real estate, drama, etc. as a feature element is generated. .

図５（ｃ）は、図５（ｂ）で示されているコンテンツ特徴ベクトルに対して正規化したコンテンツ特徴ベクトルを示す。正規化処理は、一般的に行われているものを流用することができる。このコンテンツ特徴ベクトル生成部１０１は、生成したコンテンツ特徴ベクトルに対してさらに正規化処理を施したコンテンツ特徴ベクトルを生成し、これをコンテンツ特徴ベクトル記憶部１０８に記憶する。なお、正規化処理は必須の処理ではなく、省略してもよい。 FIG. 5C shows a content feature vector normalized with respect to the content feature vector shown in FIG. As the normalization process, a general process can be used. The content feature vector generation unit 101 generates a content feature vector obtained by further normalizing the generated content feature vector, and stores the content feature vector in the content feature vector storage unit 108. The normalization process is not an essential process and may be omitted.

転置インデックス生成部１０２は、コンテンツ特徴ベクトル記憶部１０８に記憶されている任意またはすべてのコンテンツ特徴ベクトルから、特徴要素ごとに、コンテンツを関連付けた転置インデックステーブルを生成する部分である。ここで、コンテンツ特徴ベクトルは、数百から数千次元からなる特徴要素で構成されている。各コンテンツは、各特徴要素が示す性質が表現される場合には、それら特徴要素には０より大きい数値が記述されることになる。上述転置インデックス生成部１０２は、特徴要素ごとに、その特徴要素に０より大きい数値が記述されているコンテンツのコンテンツＩＤを対応付けた転置インデックステーブルを生成して、これを転置インデックス記憶部１０９に記憶する。 The transposed index generation unit 102 is a part that generates a transposed index table in which content is associated with each feature element from any or all content feature vectors stored in the content feature vector storage unit 108. Here, the content feature vector is composed of feature elements having hundreds to thousands of dimensions. In the case of each content, when the property indicated by each feature element is expressed, a numerical value greater than 0 is described in each feature element. The above-described transposed index generation unit 102 generates a transposed index table in which the content ID of content in which a numerical value greater than 0 is described for each feature element is associated with each feature element, and this is stored in the transposed index storage unit 109. Remember.

図を用いて転置インデックステーブルの生成方法について説明する。まず、その生成方法に先立って、転置インデックステーブルを生成するために使用される評価値ファイルについて説明する。図６は、評価値ファイルを示す説明図である。図６に示される通り、この評価値ファイルは、特徴ＩＤ、コンテンツＩＤおよび評価値を関連付けたものである。この評価値ファイルは、特徴ＩＤとコンテンツＩＤとの組み合わせに対する評価値を示すものであって、転置インデックステーブルにおけるコンテンツの優先順位を決定する際に使用されるものである。例えば、ある特徴ＩＤに対して、コンテンツＩＤを対応付ける際、その評価値に基づいてコンテンツＩＤで示されるコンテンツの優先順位を決定するのに用いられる。この評価値ファイルは、あらかじめコンテンツの提供者からの情報に基づいて、レコメンドサービスのオペレータにより作成される。 A method for generating an inverted index table will be described with reference to the drawings. First, an evaluation value file used for generating an inverted index table will be described prior to the generation method. FIG. 6 is an explanatory diagram showing an evaluation value file. As shown in FIG. 6, this evaluation value file associates a feature ID, a content ID, and an evaluation value. This evaluation value file indicates an evaluation value for the combination of the feature ID and the content ID, and is used when determining the priority order of the content in the transposed index table. For example, when a content ID is associated with a certain feature ID, it is used to determine the priority order of the content indicated by the content ID based on the evaluation value. This evaluation value file is created in advance by the operator of the recommendation service based on information from the content provider.

そして、転置インデックス生成部１０２は、特徴要素ごとに、コンテンツの対応付けを行うことにより転置インデックステーブルを生成する。すなわち、転置インデックス生成部１０２は、特徴ＩＤごとに、その特徴ＩＤで示される特徴要素が０より大きい値が設定されているコンテンツ特徴ベクトルのコンテンツＩＤを抽出し、これを評価値ファイルで示されている評価値順に対応付ける処理を行う。 Then, the transposed index generation unit 102 generates a transposed index table by associating contents for each feature element. That is, the transposed index generation unit 102 extracts, for each feature ID, the content ID of the content feature vector in which the feature element indicated by the feature ID is set to a value greater than 0, and this is indicated in the evaluation value file. Corresponding processing is performed in the order of evaluation values.

図７は、転置インデックステーブルの具体例である。図７に示される通り、この転置インデックステーブルは、特徴要素を識別する特徴要素ごとに、一または複数のコンテンツのコンテンツＩＤを対応付けたものである。コンテンツＩＤ欄においては、複数のコンテンツＩＤが記述されており、これは、評価値ファイルで示される評価値に基づいた順番で記述されている。例えば、評価値ファイルにおいて、特徴ＩＤ：１に対応するコンテンツＩＤ：１１の評価値を０．２５とし、特徴ＩＤ：１に対応するコンテンツＩＤ：２１の評価値を０．２０とした場合、コンテンツＩＤ：１１の評価値の方が高いため、コンテンツＩＤ：２１より前に記述されることになる。 FIG. 7 is a specific example of an inverted index table. As shown in FIG. 7, the transposed index table associates content IDs of one or more contents with each feature element that identifies the feature element. In the content ID column, a plurality of content IDs are described, which are described in the order based on the evaluation values indicated in the evaluation value file. For example, in the evaluation value file, when the evaluation value of the content ID: 11 corresponding to the feature ID: 1 is 0.25 and the evaluation value of the content ID: 21 corresponding to the feature ID: 1 is 0.20, the content Since the evaluation value of ID: 11 is higher, it is described before the content ID: 21.

転置インデックス生成部１０２は、このような処理を経て転置インデックステーブルを生成することになる。生成された転置インデックステーブルは、転置インデックス記憶部１０９に記憶される。なお、転置インデックス生成部１０２は、バッチ処理により、事前にその転置インデックステーブルを生成しておくものである。例えば、コンテンツ特徴ベクトル記憶部１０８に記憶されるコンテンツ特徴ベクトルが所定件数に達した場合や、月に１回などの所定の条件を満たした場合に転置インデックステーブルを生成し、転置インデックス記憶部１０９の転置インデックステーブルを更新する。 The inverted index generation unit 102 generates an inverted index table through such processing. The generated inverted index table is stored in the inverted index storage unit 109. Note that the inverted index generation unit 102 generates the inverted index table in advance by batch processing. For example, when the content feature vector stored in the content feature vector storage unit 108 reaches a predetermined number or when a predetermined condition such as once a month is satisfied, a transposed index table is generated, and the transposed index storage unit 109 Update the inverted index table.

引き続き図２における各構成について説明する。コンテンツ特徴ベクトル取得部１０３は、ユーザからのレコメンド要求（すなわち検索要求）がなされると、そのレコメンド要求に応じたレコメンドコンテンツを生成するために、コンテンツ特徴ベクトル記憶部１０８から所定の規則に基づいて定められたコンテンツのコンテンツ特徴ベクトルを取得する部分である。例えば、あるサイトにあるコンテンツを閲覧する際、そのコンテンツに基づいたレコメンドコンテンツを生成するため、当該コンテンツのコンテンツ特徴ベクトルを取得する。なお、ここでは、一のコンテンツ特徴ベクトルを取得することとするが、例えば、そのコンテンツに類似する一または複数のコンテンツのコンテンツ特徴ベクトルを合わせて取得してもよい。類似の判断は、例えばコンテンツ特徴ベクトルの内積をとることにより行うことができる。 Next, each configuration in FIG. 2 will be described. When a recommendation request (that is, a search request) is made from the user, the content feature vector acquisition unit 103 generates a recommended content corresponding to the recommendation request based on a predetermined rule from the content feature vector storage unit 108. This is a part for acquiring the content feature vector of the determined content. For example, when browsing content in a certain site, in order to generate recommended content based on the content, a content feature vector of the content is acquired. Although one content feature vector is acquired here, for example, content feature vectors of one or a plurality of contents similar to the content may be acquired together. Similar determinations can be made, for example, by taking the inner product of content feature vectors.

レコメンドコンテンツ取得部１０４は、コンテンツ特徴ベクトル取得部１０３により取得されたコンテンツ特徴ベクトルに基づいて、特徴要素の識別番号である特徴ＩＤを抽出する。この特徴ＩＤの抽出の仕方としては、コンテンツ特徴ベクトルにおいて０より大きい数値が記述されている特徴ＩＤを抽出する。そして、この特徴ＩＤをキーにして、転置インデックス記憶部１０９からレコメンド用コンテンツを取得するために、特徴ＩＤに対応する一または複数のコンテンツＩＤを抽出する。すなわち、抽出された特徴ＩＤに対応付けられているコンテンツＩＤをさらに抽出して、特徴ＩＤとコンテンツＩＤとを対応付けたコンテンツＩＤリストを生成する。 The recommended content acquisition unit 104 extracts a feature ID that is an identification number of a feature element based on the content feature vector acquired by the content feature vector acquisition unit 103. As a method of extracting the feature ID, a feature ID in which a numerical value greater than 0 is described in the content feature vector is extracted. Then, using this feature ID as a key, one or more content IDs corresponding to the feature ID are extracted in order to acquire recommendation content from the transposed index storage unit 109. That is, the content ID associated with the extracted feature ID is further extracted to generate a content ID list in which the feature ID and the content ID are associated with each other.

さらに、レコメンドコンテンツ取得部１０４は、携帯端末２００において表示する一または複数のコンテンツを、生成されたコンテンツＩＤリストから順に抽出する。すなわち、レコメンドコンテンツ取得部１０４は、コンテンツＩＤリストから所定の規則に従った順序でコンテンツを抽出する。 Further, the recommended content acquisition unit 104 sequentially extracts one or more contents to be displayed on the mobile terminal 200 from the generated content ID list. That is, the recommended content acquisition unit 104 extracts content from the content ID list in an order according to a predetermined rule.

この抽出処理について以下に詳細に説明する。まず、レコメンドコンテンツ取得部１０４は、図８に示されるように、転置インデックステーブルに基づいて特徴ＩＤごとに対応するコンテンツＩＤのリストを生成する。 This extraction process will be described in detail below. First, the recommended content acquisition unit 104 generates a list of content IDs corresponding to each feature ID based on the transposed index table, as shown in FIG.

図８は、転置インデックステーブルに基づいて生成された特徴ＩＤごとに対応するコンテンツＩＤを示すコンテンツＩＤリストの説明図である。上述した通り、このコンテンツＩＤリストは、レコメンドの元となったコンテンツ（例えば閲覧されたコンテンツ）の特徴要素に基づいて生成されたものであり、図８においては、説明を簡易にするために、特徴ＩＤとして、featureId_a〜featureId_cの３つが抽出されたものとしており、その順番でリストが形成されている。ここでは、便宜上、特徴ＩＤ：featureId_aが、重要度が高く、featureId_b、featureId_cになるにつれて、重要度が低く設定されているものとする。この重要度は、それぞれの特徴ＩＤに関連付けられている。 FIG. 8 is an explanatory diagram of a content ID list indicating the content ID corresponding to each feature ID generated based on the transposed index table. As described above, this content ID list is generated based on the characteristic elements of the content that is the source of the recommendation (for example, the browsed content). In FIG. As feature IDs, three featureId_a to featureId_c are extracted, and a list is formed in that order. Here, for the sake of convenience, it is assumed that the feature ID: featureId_a has a higher importance level, and the importance level is set lower as the feature ID_b and featureId_c become. This importance is associated with each feature ID.

この図８において、特徴ＩＤ：featureId_aには、コンテンツＩＤ：contentsId_2、contentsId_5、contentsId_6が対応付けられている。また、同様に、特徴ＩＤ：featureId_bには、コンテンツＩＤ：contentsId_3、contentsId_5、contentsId_1が対応付けられており、特徴ＩＤ：featureId_cには、コンテンツＩＤ：contentsId_5、contentsId_1、contentsId_4が対応付けられている。いずれのコンテンツＩＤも、転置インデックステーブルに基づいて抽出されたものである。 In FIG. 8, content IDs: contentsId_2, contentsId_5, and contentsId_6 are associated with feature ID: featureId_a. Similarly, the content ID: featuresId_b is associated with content IDs: contentsId_3, contentsId_5, and contentsId_1, and the feature ID: featureId_c is associated with content IDs: contentsId_5, contentsId_1, and contentsId_4. Each content ID is extracted based on the transposed index table.

つぎに、レコメンドコンテンツ取得部１０４は、図８におけるコンテンツＩＤリストに、同一のコンテンツＩＤが複数出現している場合、その数を重複数として計数する。そして、そのコンテンツＩＤリストに重複数欄を追加した中間リストを生成する。 Next, when the same content ID appears in the content ID list in FIG. 8, the recommended content acquisition unit 104 counts the number as the overlapping number. Then, an intermediate list is generated by adding duplicate columns to the content ID list.

図９は、その中間リストの具体例を示す説明図である。この中間リストは、特徴ＩＤとコンテンツＩＤとの管理のための管理リストであって、特徴ＩＤ、コンテンツＩＤ、および重複数を対応付けて記憶している。 FIG. 9 is an explanatory diagram of a specific example of the intermediate list. This intermediate list is a management list for managing feature IDs and content IDs, and stores feature IDs, content IDs, and overlapping numbers in association with each other.

図９において、コンテンツＩＤ：contentsId_5の重複数は３である。ここでは、特徴ＩＤ：featureId_a〜featureId_cのそれぞれにコンテンツＩＤ：contentsId_5が出現していることから、それぞれを計数することにより重複数を３と算出することができる。 In FIG. 9, the overlap number of content ID: contentsId_5 is 3. Here, since the content ID: contentsId_5 appears in each of the feature IDs: featureId_a to featureId_c, the overlap number can be calculated as 3 by counting each.

そして、レコメンドコンテンツ取得部１０４は、この中間リストをさらに加工して最終リストを生成する。図１０は、中間リストから最終リストへの生成過程、および最終結果であるレコメンドコンテンツのリストの生成過程を示す。 Then, the recommended content acquisition unit 104 further processes this intermediate list to generate a final list. FIG. 10 shows a generation process from the intermediate list to the final list, and a process of generating a list of recommended contents as the final result.

図１０（ａ）は、中間リストにおいて、同一コンテンツＩＤを削除したことを示す説明図である。図に示される通り、コンテンツＩＤ：contentsId_5が、特徴ＩＤ：featureId_a、featureId_bおよびfeatureId_cのいずれに出現していることから上位の特徴ＩＤ：featureId_aのコンテンツＩＤ：contentuId_5にのみ重複数として３を対応付けるように記述し、特徴ＩＤ：featureId_bおよびfeatureId_cの欄は削除される。図１０（ａ）においては、取り消し線によって、削除状態を示している。 FIG. 10A is an explanatory diagram showing that the same content ID is deleted from the intermediate list. As shown in the figure, since the content ID: contentsId_5 appears in any of the feature IDs: featureId_a, featureId_b, and featureId_c, 3 is associated with only the content ID: contentuId_5 of the upper feature ID: featureId_a as a duplication number. The field of feature ID: featureId_b and featureId_c is deleted. In FIG. 10A, the deletion state is indicated by a strikethrough.

図１０（ｂ）は、その削除処理がなされたコンテンツＩＤリストの最終リストである。図から明らかなとおり、コンテンツＩＤ：contentsId_5やcontentsId_1が、それぞれ特徴ＩＤ:featureId_bや、特徴ＩＤ:featureId_cとの対応付から削除されている。 FIG. 10B is the final list of the content ID list that has been deleted. As is apparent from the figure, the content IDs: contentsId_5 and contentsId_1 are deleted from the correspondence with the feature ID: featureId_b and the feature ID: featureId_c, respectively.

そして、レコメンドコンテンツ取得部１０４は、図１０（ｂ）のコンテンツＩＤリストの最終リストに基づいて、レコメンドコンテンツとしてのコンテンツを順に選択することにより、携帯端末２００における表示順位を決定する。すなわち、この最終リストは、特徴ＩＤの上位順にコンテンツＩＤが並べられるとともに、さらに同一特徴ＩＤ欄内においては、その重複数の多い順にコンテンツＩＤが並べられている。レコメンドコンテンツ取得部１０４は、あらかじめ定められた上位所定件数のコンテンツＩＤを選択することにより、その表示順を決定することができる。 And the recommendation content acquisition part 104 determines the display order in the portable terminal 200 by selecting the content as recommendation content in order based on the last list of the content ID list | wrist of FIG.10 (b). That is, in this final list, the content IDs are arranged in order from the top of the feature ID, and further, in the same feature ID column, the content IDs are arranged in the descending order of the overlapping number. The recommended content acquisition unit 104 can determine the display order by selecting a predetermined upper number of content IDs.

なお、最終リストからのコンテンツの抽出方法は、上述以外に、重複数の多いコンテンツＩＤを優先してもよい。すなわち。重複数の多い順にコンテンツＩＤを抽出し、その後、特徴ＩＤが上位のものを順に抽出するようにしてもよい。 In addition to the above, the content extraction method from the final list may prioritize multiple content IDs. That is. The content IDs may be extracted in descending order of the number of duplicates, and then the higher-ranked feature IDs may be extracted in order.

図１０（ｂ）においては、最上位に位置する特徴ＩＤ：featureId_aに対応するコンテンツＩＤ：contentsId_5の重複数が一番多く、その次に並んでいるものはコンテンツＩＤ：contentsId_2およびcontentsId_6であり、重複数は１である。 In FIG. 10B, the overlapping number of the content ID: contentsId_5 corresponding to the topmost feature ID: featureId_a is the largest, and the next ones are the content IDs: contentsId_2 and contentsId_6. The plurality is 1.

また、特徴ＩＤ：featureId_aの次の序列にあるものは、特徴ＩＤ：featureId_bであり、その同一特徴ＩＤ欄内においては、コンテンツＩＤ：contentsId_1およびcontentsId_3が重複数の多い順に並んでいる。さらにその次の序列の特徴ＩＤについても同様である。 In the next order of the feature ID: featureId_a is the feature ID: featureId_b, and in the same feature ID column, the content IDs: contentsId_1 and contentsId_3 are arranged in the descending order. The same applies to the feature ID of the next rank.

レコメンドコンテンツ取得部１０４は、このように特徴ＩＤ順および重複数順に並んでいるコンテンツの上位所定件数を、順に選択することにより、レコメンドコンテンツとしての表示順を決定することができる。本実施形態では、図１０（ｃ）に示される通り、上位５件分のコンテンツＩＤが選択される。そして、レコメンドコンテンツ取得部１０４は、このように選択されたコンテンツＩＤに対応するコンテンツをその表示順に、レコメンドコンテンツとして取得する。 The recommended content acquisition unit 104 can determine the display order as the recommended content by sequentially selecting the upper predetermined number of the contents arranged in the order of the feature ID and the overlapping order. In the present embodiment, as shown in FIG. 10C, content IDs for the top five items are selected. Then, the recommended content acquisition unit 104 acquires the content corresponding to the content ID selected in this way as recommended content in the display order.

レコメンドコンテンツ送信部１０５は、レコメンドコンテンツ取得部１０４により抽出されたコンテンツを順に、レコメンドコンテンツとして携帯端末２００に送信する部分である。 The recommended content transmission unit 105 is a part that sequentially transmits the content extracted by the recommended content acquisition unit 104 to the mobile terminal 200 as recommended content.

つぎに、上述情報配信サーバ１００に対してレコメンドコンテンツの要求を行う携帯端末２００について説明する。図１１は、携帯端末２００の機能を示すブロック図である。図１１に示される通り、携帯端末２００は、ブラウザ２０１、履歴送信部２０２、レコメンド要求送信部２０３、および配信情報表示部２０４を含んで構成されている。この携帯端末２００は、図３に示されるハードウェア構成により実現されるものであり、ＣＰＵ、ＲＡＭ、ＲＯＭ等を備えたコンピュータシステムにより実現されるものである。以下、各構成要素について説明する。 Next, the portable terminal 200 that makes a request for recommended content to the information distribution server 100 will be described. FIG. 11 is a block diagram illustrating functions of the mobile terminal 200. As shown in FIG. 11, the mobile terminal 200 includes a browser 201, a history transmission unit 202, a recommendation request transmission unit 203, and a distribution information display unit 204. The portable terminal 200 is realized by the hardware configuration shown in FIG. 3, and is realized by a computer system including a CPU, a RAM, a ROM, and the like. Hereinafter, each component will be described.

ブラウザ２０１は、インターネット等を介して各種コンテンツを閲覧するための部分である。一般的にはアプリケーションにより構成されているものである。 The browser 201 is a part for browsing various contents via the Internet or the like. Generally, it is configured by an application.

履歴送信部２０２は、ブラウザ２０１により実行された検索要求または閲覧要求からなる履歴情報を外部に送信する部分である。 The history transmission unit 202 is a part that transmits history information including a search request or a browsing request executed by the browser 201 to the outside.

レコメンド要求送信部２０３は、情報配信サーバ１００に対してレコメンド要求を送信する部分である。レコメンド要求そのものを送信してもよいし、ブラウザ２０１においてあるコンテンツが閲覧された際に、そのコンテンツに関連するレコメンド要求を送信するようにしてもよい。 The recommendation request transmission unit 203 is a part that transmits a recommendation request to the information distribution server 100. The recommendation request itself may be transmitted, or when a certain content is browsed in the browser 201, a recommendation request related to the content may be transmitted.

配信情報表示部２０４は、レコメンド要求送信部２０３により送信されたレコメンド要求に対するレコメンド情報を受信して表示する部分である。ここでは、上述した通りにコンテンツの表示順位についての処理がなされたレコメンドコンテンツをその表示順位で表示する。 The distribution information display unit 204 is a part that receives and displays recommendation information for the recommendation request transmitted by the recommendation request transmission unit 203. Here, the recommended content that has been processed for the display order of the content as described above is displayed in the display order.

つぎに、このように構成された情報配信サーバ１００および携帯端末２００の処理について説明する。図１２は、情報配信サーバ１００の処理を示すフローチャートである。 Next, processing of the information distribution server 100 and the mobile terminal 200 configured as described above will be described. FIG. 12 is a flowchart showing processing of the information distribution server 100.

情報配信サーバ１００において、コンテンツの提供者からコンテンツの登録がなされると（Ｓ１０１）、コンテンツ特徴ベクトル生成部１０１によりコンテンツ特徴ベクトルの生成が行われる（Ｓ１０２）。そして、コンテンツ特徴ベクトル記憶部１０８に、生成されたコンテンツ特徴ベクトルが記憶される（Ｓ１０３）。 In the information distribution server 100, when content is registered by the content provider (S101), the content feature vector generation unit 101 generates a content feature vector (S102). Then, the generated content feature vector is stored in the content feature vector storage unit 108 (S103).

つぎに、転置インデックス生成部１０２により、転置インデックステーブルが生成され（Ｓ１０４）、転置インデックス記憶部１０９に、転置インデックステーブルが記憶される（Ｓ１０５）。このＳ１０４の処理は、Ｓ１０３からの一連の処理としてもよいし、ある程度コンテンツ特徴ベクトルが蓄積された後に、転置インデックステーブルが生成されるようにしてもよい。 Next, a transposed index table is generated by the transposed index generation unit 102 (S104), and the transposed index table is stored in the transposed index storage unit 109 (S105). The process of S104 may be a series of processes from S103, or the transposed index table may be generated after content feature vectors are accumulated to some extent.

このＳ１０１からＳ１０５までの処理は、レコメンドコンテンツの生成処理の前に行われるものであり、いわゆるバッチ処理により実行される。 The processing from S101 to S105 is performed before the recommendation content generation processing, and is executed by so-called batch processing.

つぎに、このようにして生成された転置インデックステーブルに基づいて、レコメンドコンテンツが生成される。 Next, recommended content is generated based on the transposed index table generated in this way.

まず、携帯端末２００からのレコメンドコンテンツ要求が受信されると（Ｓ２０１）、コンテンツ特徴ベクトル取得部１０３により、コンテンツ特徴ベクトル記憶部１０８から一または複数のコンテンツ特徴ベクトルが取得される（Ｓ２０２）。そして、レコメンドコンテンツ取得部１０４により、転置インデックステーブルに基づいてコンテンツＩＤリストが生成される（Ｓ２０３）。ここでは、図８に示されるコンテンツＩＤリストが取得される。 First, when a recommended content request is received from the mobile terminal 200 (S201), the content feature vector acquisition unit 103 acquires one or more content feature vectors from the content feature vector storage unit 108 (S202). Then, the recommended content acquisition unit 104 generates a content ID list based on the transposed index table (S203). Here, the content ID list shown in FIG. 8 is acquired.

そして、レコメンドコンテンツ取得部１０４により、コンテンツの抽出処理が実行される（Ｓ２０４）。すなわち、図８のコンテンツＩＤリストから、コンテンツＩＤの重複数などが求められ、その中間リストが生成される（図９参照）。つぎに、その重複数に基づいて整理されたコンテンツＩＤリストである最終リスト（図１０（ｂ）参照）が生成され、その最終リストに基づいて、レコメンドコンテンツとしてのコンテンツが抽出される（図１０（ｃ）参照）。 Then, the recommended content acquisition unit 104 executes content extraction processing (S204). That is, from the content ID list of FIG. 8, a plurality of content IDs are obtained, and an intermediate list is generated (see FIG. 9). Next, a final list (see FIG. 10B), which is a content ID list arranged based on the overlap, is generated, and content as recommended content is extracted based on the final list (FIG. 10). (See (c)).

そして、抽出されたコンテンツは、レコメンドコンテンツとしてレコメンドコンテンツ送信部１０５により携帯端末２００に対して送信される（Ｓ２０５）。 The extracted content is transmitted as recommended content to the mobile terminal 200 by the recommended content transmitting unit 105 (S205).

このようにして、バッチ処理により転置インデックステーブルを生成し、その後に、レコメンド要求が来るたびに、リアルタイム処理により、転置インデックステーブルを用いたレコメンド処理を行うことができる。 In this way, an inverted index table is generated by batch processing, and thereafter, every time a recommendation request is received, recommendation processing using the inverted index table can be performed by real-time processing.

つぎに、本実施形態の情報配信サーバ１００の作用効果について説明する。本実施形態の情報配信サーバ１００によれば、コンテンツ特徴ベクトル生成部１０１は、あらかじめ定められた特徴要素に基づいてコンテンツの特徴が表現されるコンテンツ特徴ベクトルを生成し、コンテンツ特徴ベクトル記憶部１０８は、生成されたコンテンツ特徴ベクトルを、コンテンツごとに記憶する。そして、転置インデックス生成部１０２は、コンテンツ特徴ベクトル記憶部１０８に記憶されたコンテンツ特徴ベクトルを構成する特徴要素ごとに、当該特徴要素で特徴付けられるコンテンツをそれぞれ選択し、選択されたコンテンツのそれぞれを当該特徴要素に対応付けた転置インデックステーブルを生成する。転置インデックス記憶部１０９は、生成された転置インデックステーブルを記憶する。このコンテンツ特徴ベクトルの生成から転置インデックステーブルの生成までの処理は、携帯端末２００からのレコメンド要求を受け付ける前に終わらせておく。 Next, operational effects of the information distribution server 100 of the present embodiment will be described. According to the information distribution server 100 of the present embodiment, the content feature vector generation unit 101 generates a content feature vector in which content features are expressed based on predetermined feature elements, and the content feature vector storage unit 108 The generated content feature vector is stored for each content. Then, the transposed index generation unit 102 selects, for each feature element constituting the content feature vector stored in the content feature vector storage unit 108, the content characterized by the feature element, and selects each selected content. A transposed index table associated with the feature element is generated. The transposed index storage unit 109 stores the generated transposed index table. The processing from the generation of the content feature vector to the generation of the transposed index table is terminated before the recommendation request from the portable terminal 200 is received.

つぎに、レコメンド要求受付部（図示せず）により、ユーザ端末からレコメンド要求が受け付けられると、レコメンドコンテンツ取得部１０４は、転置インデックス記憶部１０９に記憶されている転置インデックステーブルから、重複して記述されているコンテンツの数を、コンテンツごとに集計する。 Next, when a recommendation request is received from a user terminal by a recommendation request receiving unit (not shown), the recommended content acquisition unit 104 is described in duplicate from the transposed index table stored in the transposed index storage unit 109. The number of contents that have been added is aggregated for each content.

そして、レコメンドコンテンツ取得部１０４は、集計されたコンテンツの数を、当該コンテンツのコンテンツ特徴ベクトルを構成する特徴要素のうち任意の特徴要素（本実施形態では、最上位の特徴要素）に対応付ける中間リストを生成する。そして、レコメンドコンテンツ取得部１０４は、この中間リストから最終リストを生成して、ユーザ端末にて重複数が多い順にコンテンツを表示可能なように、コンテンツを抽出する。レコメンドコンテンツ送信部１０５は、抽出されたコンテンツを携帯端末２００において表示されるよう、コンテンツを送信する。 Then, the recommended content acquisition unit 104 associates the total number of contents with an arbitrary feature element (in this embodiment, the highest-order feature element) among the feature elements constituting the content feature vector of the content. Is generated. And the recommendation content acquisition part 104 produces | generates a final list from this intermediate list, and extracts a content so that a content can be displayed in order with many duplication in a user terminal. The recommended content transmission unit 105 transmits the content so that the extracted content is displayed on the mobile terminal 200.

これにより、計算時間がかかる処理については事前に終わらすことができ、高速な更新処理が求められるサービスに対しても利用することが可能となる。そして、一部のコンテンツのみが集中的にレコメンド対象となる場合であっても、全部のコンテンツに対して計算するわけではないため、その計算リソースが無駄になるということはない、という効果を奏する。 As a result, processing that takes a long time can be completed in advance, and can be used for services that require high-speed update processing. Even if only a part of the content is intensively targeted for recommendation, the calculation resource is not wasted because it is not calculated for all the content. .

また、本実施形態の情報配信サーバ１００において、転置インデックス生成部１０２は、コンテンツ特徴ベクトル記憶部１０８に記憶されているコンテンツ特徴ベクトルが所定件数に達した時に、バッチ処理により所定件数に対してまとめて実行する、ようにしてもよい。 Further, in the information distribution server 100 of the present embodiment, the transposed index generation unit 102 collects the predetermined number of content by batch processing when the content feature vectors stored in the content feature vector storage unit 108 reach the predetermined number. May be executed.

これにより、転置インデックステーブルを生成するまでの処理を、あらかじめまとまったコンテンツ特徴ベクトルに対して行うといったバッチ処理により実行することになるため、転置インデックステーブルの生成効率を向上させることができ、結果的に計算リソースを効率的に利用することができる。 As a result, the processing up to the generation of the inverted index table is executed by batch processing such as performing on the content feature vectors collected in advance, so that the generation efficiency of the inverted index table can be improved. In addition, computational resources can be used efficiently.

なお、上述以外の条件として、所定周期ごとや、所定時刻または所定日時、そのほか、本レコメンドサービスを提要するオペレータの指示に従って、転置インデックステーブルを生成するようにしてもよい。 Note that, as conditions other than those described above, the transposed index table may be generated in accordance with a predetermined cycle, a predetermined time or a predetermined date and time, or according to an instruction of an operator who provides this recommendation service.

１００…情報配信サーバ、１０１…コンテンツ特徴ベクトル生成部、１０２…転置インデックス生成部、１０３…コンテンツ特徴ベクトル取得部、１０４…レコメンドコンテンツ取得部、１０５…レコメンドコンテンツ送信部、１０６…メタデータ記憶部、１０７…特徴辞書記憶部、１０８…コンテンツ特徴ベクトル記憶部、１０９…転置インデックス記憶部、２００…携帯端末、２０１…ブラウザ、２０２…履歴送信部、２０３…レコメンド要求送信部、２０４…配信情報表示部。
DESCRIPTION OF SYMBOLS 100 ... Information delivery server, 101 ... Content feature vector production | generation part, 102 ... Transposition index production | generation part, 103 ... Content feature vector acquisition part, 104 ... Recommended content acquisition part, 105 ... Recommended content transmission part, 106 ... Metadata storage part, DESCRIPTION OF SYMBOLS 107 ... Feature dictionary memory | storage part, 108 ... Content feature vector memory | storage part, 109 ... Transposition index memory | storage part, 200 ... Mobile terminal, 201 ... Browser, 202 ... History transmission part, 203 ... Recommendation request transmission part, 204 ... Distribution information display part .

Claims

Based on the content feature vector of the content to be recommended, a transposed index table generating unit that generates a transposed index table in which one or more contents are associated with each feature element constituting the content feature vector;
A transposed index storage unit for storing the transposed index table generated by the transposed index table generating unit;
When a recommendation request is received from a user terminal, recommendation information generating means for generating recommendation information according to the display order of contents in the user terminal determined based on the transposed index table stored in the transposed index storage means;
Transmitting means for transmitting the recommendation information generated by the recommendation information generating means;
With
The transposed index table generating means executes a transposed index table generating process when a predetermined condition is satisfied,
The recommendation information generation unit is a recommendation information generation device that executes a display order determination process each time a recommendation request from a user terminal is received.

The recommendation information generation means includes:
Based on the transposed index table, a totaling unit that counts the duplication number, which is the number of contents that are duplicated by changing the characteristic elements, for each content;
Generating means for generating a management list in which the duplication numbers totaled by the totaling means are associated with arbitrary feature elements among the feature elements constituting the content feature vector of the content;
Based on the management list generated by the generating means, extracting means for generating recommendation information by extracting the content so that the content can be displayed in the order of multiple duplication at the user terminal;
The recommendation information generating apparatus according to claim 1, comprising:

Content feature vector generation means for generating a content feature vector in which the feature of the content is expressed based on a predetermined feature element;
Content feature vector storage means for storing the content feature vector generated by the content feature vector generation means for each content;
With
The recommendation information generating apparatus according to claim 1, wherein the transposed index table generating unit generates a transposed index table using a content feature vector stored in the content feature vector storage unit.

The recommendation information generating apparatus according to claim 3, wherein the transposed index table generating unit collectively executes the batch processing when the number of content feature vectors stored in the content feature vector storage unit reaches a predetermined number. .

In the recommendation information generation method in the recommendation information generation device,
Based on the content feature vector of the content to be recommended, a transposed index table generating step for generating a transposed index table in which one or a plurality of contents are associated with each feature element constituting the content feature vector;
A storage step of storing the inverted index table generated by the inverted index table generating step in an inverted index storage means;
When a recommendation request is received from a user terminal, a recommendation information generation step of generating recommendation information according to the display order of content in the user terminal determined based on the transposed index table stored in the transposed index storage unit;
A transmission step of transmitting the recommendation information generated by the recommendation information generation step;
With
The transposed index table generating step performs a transposed index table generating process when a predetermined condition is satisfied,
The recommendation information generation step is a recommendation information generation method for executing a display order determination process each time a recommendation request from a user terminal is received.