JP5732441B2

JP5732441B2 - Information recommendation method, apparatus and program

Info

Publication number: JP5732441B2
Application number: JP2012196469A
Authority: JP
Inventors: 藤田　将成; 将成藤田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-10-06
Filing date: 2012-09-06
Publication date: 2015-06-10
Anticipated expiration: 2032-09-06
Also published as: JP2013093015A

Description

この発明は、ユーザの嗜好や状況等に合わせた情報を推薦する情報推薦方法、装置及びプログラムに関する。 The present invention relates to an information recommendation method, an apparatus, and a program for recommending information according to a user's preference and situation.

近年、ユーザの嗜好や状況等に応じた情報を推薦するサービスが種々提案されている。例えば、ユーザが設定した嗜好情報等の検索条件とユーザの現在位置の情報から周辺の飲食店の情報を選択し、ユーザに提示する飲食店ガイドサービスが提供されている。しかし、ユーザが適切な情報を取得するために、ユーザ自身が設定した嗜好情報等の検索条件を手入力することは、インターネット上の情報が大量で詳細になるにつれ、ユーザに過大な負担を強いるだけでなく、潜在的な嗜好を反映できない問題がある。 In recent years, various services for recommending information in accordance with user preferences and situations have been proposed. For example, a restaurant guide service is provided in which peripheral restaurant information is selected from search conditions such as preference information set by the user and information on the current location of the user and presented to the user. However, manually inputting search conditions such as preference information set by the user himself / herself in order to acquire appropriate information imposes an excessive burden on the user as the amount of information on the Internet becomes more detailed. There is a problem that cannot reflect the potential taste as well.

また、モバイルインターネットの普及によりスマートホン等の携帯端末を使用した情報検索が増加しているが、携帯端末は画面に一度に表示できる情報量が少なく、また検索条件の入力に手間がかかることから、ユーザの嗜好を考慮して適切な情報を推薦する技術のニーズはますます高まっている。 In addition, information searches using mobile terminals such as smart phones are increasing due to the spread of mobile Internet, but mobile terminals have a small amount of information that can be displayed on the screen at one time, and it takes time to input search conditions. Therefore, there is a growing need for a technique for recommending appropriate information in consideration of user preferences.

そこで、ユーザのインターネット閲覧履歴等を分析することで適切な情報推薦を行う技術が種々研究されている。しかし、従来の技術ではユーザ履歴が一位量以上蓄積されるまで推薦精度が向上しない、いわゆるCold-startという問題が発生する。この問題により、ユーザのサービス利用初期に適切な推薦が行えず満足度が上がらないことから、継続利用されないため履歴の蓄積が進まない。この結果、推薦精度の向上が長期化しサービス普及が進まない。 Thus, various techniques for recommending appropriate information by analyzing a user's Internet browsing history and the like have been studied. However, in the conventional technique, there arises a problem of so-called Cold-start in which the recommendation accuracy is not improved until the user history is accumulated more than the first amount. Due to this problem, since the user cannot make an appropriate recommendation at the beginning of service use and the satisfaction level does not increase, the history is not accumulated because it is not continuously used. As a result, the improvement of recommendation accuracy will be prolonged and the service will not spread.

一方、閲覧履歴や検索履歴をもとにした情報推薦技術は一般にメモリベース法と呼ばれ、協調フィルタリングとコンテンツベースフィルタリングに大別される。この２つの方法のうち、アイテムにカテゴリ等を示すタグを付与することができる場合には、コンテンツベースフィルタリングがCold-start問題に強いといわれている。このため、履歴の蓄積状況等に応じてこの２つの技術を切り替えるか又は合成する方法が研究されている（例えば非特許文献１を参照）。 On the other hand, information recommendation technology based on browsing history and search history is generally called a memory-based method, and is roughly divided into collaborative filtering and content-based filtering. Of these two methods, when a tag indicating a category or the like can be attached to an item, content-based filtering is said to be strong against the Cold-start problem. For this reason, a method of switching or synthesizing these two techniques according to the history accumulation status or the like has been studied (for example, see Non-Patent Document 1).

R. Burke, "Hybrid recommender systems: surveyand, experiments, user-modeling and user-adapted inter-actions," User Modeling and User-Adapted Interaction, Vol. 12, No. 4, pp.331-370, 2002.R. Burke, "Hybrid recommender systems: surveyand, experiments, user-modeling and user-adapted inter-actions," User Modeling and User-Adapted Interaction, Vol. 12, No. 4, pp.331-370, 2002.

ところが、非特許文献１に記載された飲食店推薦システムには以下のような改善すべき課題がある。すなわち、このシステムは、最初の段階では価格帯やグルメジャンル等の希望条件をユーザが手入力し、コンテンツベースフィルタリングを用いて候補となるレストランを選び出す。そして、協調フィルタリングによって候補の順位付けをするものとなっている。このため、一定の推薦精度を早い段階で実現できる。しかしながら、候補の絞り込みのための飲食店の希望条件をユーザ手入力に依存している。このため、候補が多い場合の推薦精度を向上させるには、順位付けのため履歴の蓄積が必要となる。すなわち、この方法は多くの候補が存在するわりに履歴の蓄積が少ない、飲食店等の情報を推薦する場合には十分な効果が期待できない。 However, the restaurant recommendation system described in Non-Patent Document 1 has the following problems to be improved. That is, in this system, the user manually inputs desired conditions such as price range and gourmet genre at the first stage, and selects candidate restaurants using content-based filtering. The candidates are ranked by collaborative filtering. For this reason, a fixed recommendation accuracy can be realized at an early stage. However, the desired conditions of the restaurant for narrowing down candidates are dependent on user manual input. For this reason, in order to improve the recommendation accuracy when there are many candidates, it is necessary to accumulate history for ranking. That is, this method cannot be expected to have a sufficient effect when recommending information such as restaurants, where there are many candidates but there is little history accumulation.

この発明は上記事情に着目してなされたもので、その目的とするところは、ユーザの閲覧履歴実績が少ない状況下でも、ユーザによる条件の手入力に頼ることなくより高精度な情報推薦を可能にした情報推薦方法、装置及びプログラムを提供することにある。 The present invention has been made paying attention to the above circumstances, and its purpose is to make it possible to recommend information with higher accuracy without relying on manual input of conditions by the user even in situations where the user's browsing history is low. It is to provide an information recommendation method, apparatus and program.

上記目的を達成するためにこの発明の第１の観点は、コンピュータにより、地理的に散在する複数のスポットに対するユーザの興味を分析してその分析結果をもとに当該ユーザに対し推薦するスポットの情報を作成する情報推薦方法、装置及びプログラムにあって、
上記コンピュータが、上記複数のスポットに関する情報をそれぞれ掲示する閲覧サイトに対する上記ユーザの閲覧履歴を表す情報を取得して第１の記憶媒体に格納すると共に、上記ユーザの地理的な移動履歴を表す情報を取得して第２の記憶媒体に格納する。また、上記コンピュータが、上記第１の記憶媒体に記憶された閲覧履歴情報をもとに、当該閲覧履歴情報に含まれる複数のスポット閲覧情報を閲覧時間の近さを条件にクラスタ化して閲覧チャンクを生成すると共に、上記第２の記憶媒体に記憶された移動履歴情報、又はユーザが事前に入力した情報や無線基地局との接続情報をもとにユーザが訪問可能な参照エリアを決定して、予め記憶されたスポット情報の中から上記決定された参照エリアに含まれるスポット情報の集合を参照スポット集合として抽出する。そして、上記生成された閲覧チャンクに含まれるスポット閲覧情報の数ａと、上記閲覧チャンクに含まれるスポット閲覧情報群のうち所定の概念を有するスポット閲覧情報の数ｎと、上記参照スポット集合に含まれるスポット情報の数Ｓと、上記参照スポット集合に含まれるスポット情報群のうち上記概念が付与されたスポット情報の数Ｎをもとに、上記閲覧チャンクに上記概念が出現するスポット閲覧情報の数が、上記ｎ以上となる第１の確率及びｎ未満となる第２の確率をそれぞれ算出し、この算出された第１及び第２の確率をもとに標準正規分布の累積分布関数の逆関数により、前記概念に対する前記ユーザの興味の度合いを表す興味スコアｚを算出することを特徴とするものである。 In order to achieve the above object, a first aspect of the present invention is to analyze a user's interest in a plurality of geographically dispersed spots by a computer, and to recommend spots recommended to the user based on the analysis result. In an information recommendation method, apparatus and program for creating information,
Information that represents the user's browsing history for the browsing site where the computer posts information on the plurality of spots, stores the information in a first storage medium, and information that represents the geographical movement history of the user Is acquired and stored in the second storage medium. In addition, based on the browsing history information stored in the first storage medium, the computer clusters a plurality of spot browsing information included in the browsing history information on the condition that the browsing time is close, thereby browsing chunks. And a reference area that the user can visit is determined based on the movement history information stored in the second storage medium or information input in advance by the user or connection information with the radio base station. A set of spot information included in the determined reference area is extracted as a reference spot set from spot information stored in advance. And the number a of the spot browsing information included in the generated browsing chunk, the number n of the spot browsing information having a predetermined concept among the spot browsing information group included in the browsing chunk, and included in the reference spot set The number of spot browsing information in which the concept appears in the browsing chunk based on the number S of spot information and the number N of spot information to which the concept is given out of the spot information group included in the reference spot set Calculates a first probability that is equal to or greater than n and a second probability that is less than n, and an inverse function of the cumulative distribution function of the standard normal distribution based on the calculated first and second probabilities. by, it is characterized in Rukoto issuing calculation interest score z representing the degree of interest of the user with respect to the concepts.

また、この発明の第１の観点は以下のような態様を備えることを特徴とする。
第１の態様は、上記コンピュータが、上記第２の記憶媒体に記憶された移動履歴情報をもとに、当該移動履歴情報に含まれる複数の滞在情報を距離の近さを条件にクラスタ化して滞在チャンクを生成し、この滞在チャンクに対し距離の近さを条件にしてスポット群を関連付ける機能をさらに備え、上記生成された滞在チャンクに含まれるスポット情報の数を上記ａとすると共に、上記滞在チャンクに含まれるスポット情報群のうち所定の概念を有するスポット情報の数を上記ｎとして、上記興味スコアｚを算出するものである。 The first aspect of the present invention is characterized by comprising the following aspects.
In the first aspect, the computer clusters a plurality of pieces of stay information included in the movement history information on the basis of distance proximity based on the movement history information stored in the second storage medium. The function further includes a function of generating a stay chunk and associating a spot group with the stay chunk on condition that the distance is close. The number of pieces of spot information included in the generated stay chunk is set to a, and the stay is stored. The interest score z is calculated by setting the number of pieces of spot information having a predetermined concept in the spot information group included in the chunk as n.

第２の態様は、上記コンピュータが、上記第２の記憶媒体に記憶された移動履歴情報をもとに、当該移動履歴情報に含まれる複数の滞在情報を距離の近さを条件にクラスタ化して滞在チャンクを生成し、この滞在チャンクに対し距離の近さを条件にしてスポット群を関連付ける機能をさらに備え、上記閲覧チャンクに出現するスポット閲覧情報の数を上記Ｎとすると共に、上記閲覧チャンクの閲覧時刻から予め設定した一定時間内に存在する滞在チャンク群を共起チャンクとしてこの共起チャンク内の滞在チャンクに対し上記距離の近さを条件にして関連付けられたスポット群の数を上記ａとし、上記一定時間内の滞在チャンクに関連付けられたスポット群の中で、上記閲覧チャンクに出現したスポットの数をｎとして、上記興味スコアｚを算出するものである。 In the second aspect, the computer clusters a plurality of pieces of stay information included in the movement history information on the basis of the proximity of the distance based on the movement history information stored in the second storage medium. A function of generating a stay chunk and associating a spot group with the stay chunk on condition that the distance is close is further provided, and the number of pieces of spot browse information appearing in the browse chunk is N, and The number of spot groups associated with stay chunks in the co-occurrence chunk as a co-occurrence chunk as a co-occurrence chunk on the condition that the distance is close is defined as a. , among the group spots associated with the stay chunks within the predetermined time, the number of spots that appeared in the viewing chunk is n, the interest scores z It is intended to leave.

第３の態様は、上記コンピュータが、上記第２の記憶媒体に記憶された移動履歴情報をもとに、当該移動履歴情報に含まれる複数の滞在情報を距離の近さを条件にクラスタ化して滞在チャンクを生成し、この滞在チャンクに対し距離の近さを条件にしてスポット群を関連付ける機能をさらに備え、上記参照スポット集合に含まれるスポット情報の数Ｓとし、閲覧チャンク内のスポット情報数をａとしたうえで、上記閲覧チャンクの閲覧時刻から予め設定した一定時間内に存在する滞在チャンク群に出現するスポット情報の数をＮとし、上記閲覧チャンクに出現するスポット情報において、一定時間内に滞在開始時刻を持つ滞在チャンクに出現するスポット情報と共起するスポット情報群を共起チャンクとし、当該共起チャンク内のスポットの数をｎとして上記興味スコアｚを算出するものである。 According to a third aspect, the computer clusters a plurality of pieces of stay information included in the movement history information based on the proximity of the distance based on the movement history information stored in the second storage medium. A function of generating a stay chunk and associating a spot group with this stay chunk on condition of closeness is provided, and the number S of spot information included in the reference spot set is defined as S. The number of spot information appearing in the stay chunk group existing within a predetermined time set in advance from the browsing time of the browsing chunk is N, and the spot information appearing in the browsing chunk is within a certain time. The spot information group that co-occurs with the spot information that appears in the stay chunk having the stay start time is defined as a co-occurrence chunk, and the spots in the co-occurrence chunk are identified. And it calculates the interest scores z as n.

第４の態様は、上記コンピュータが、上記第２の記憶媒体に記憶された移動履歴情報をもとに、当該移動履歴情報に含まれる複数の滞在情報を距離の近さを条件にクラスタ化して滞在チャンクを生成し、この滞在チャンクに対し距離の近さを条件にしてスポット群を関連付ける機能をさらに備え、上記参照スポット集合に含まれるスポット情報の数Ｓとし、閲覧チャンク内のスポット情報数をａとしたうえで、上記滞在チャンクの滞在開始時刻から予め設定した一定時間内を閲覧時刻とする閲覧チャンク群に出現するスポット情報の数をＮとし、上記滞在チャンクに出現するスポット情報群において、一定時間内の閲覧チャンクに出現するスポット情報と共起するスポット情報群を共起チャンクとして、共起チャンク内のスポットの数をｎとして上記興味スコアｚを算出するものである。 According to a fourth aspect, the computer clusters a plurality of pieces of stay information included in the movement history information based on the proximity of the distance based on the movement history information stored in the second storage medium. A function of generating a stay chunk and associating a spot group with this stay chunk on condition of closeness is provided, and the number S of spot information included in the reference spot set is defined as S. In the spot information group appearing in the stay chunk, N is the number of spot information appearing in the browse chunk group whose browsing time is within a predetermined time from the stay start time of the stay chunk. A spot information group that co-occurs with spot information appearing in a browsing chunk within a certain time is defined as a co-occurrence chunk, and the number of spots in the co-occurrence chunk is n. Te and calculates the interest score z.

第５の態様は、上記閲覧チャンク、滞在チャンクまたは共起チャンクについて、ユーザの操作履歴、各チャンクの時刻及び状況、及び上記スポットと上記概念との関連度の少なくとも１つに応じて上記興味スコアｚを重み付けするものである。 In a fifth aspect, for the browsing chunk, stay chunk, or co-occurrence chunk, the interest score according to at least one of a user operation history, a time and situation of each chunk, and a degree of association between the spot and the concept z is weighted.

第６の態様は、上記第１の確率及び上記第２の確率を超幾何分布、二項分布または正規分布により算出するものである。 In the sixth aspect, the first probability and the second probability are calculated by hypergeometric distribution, binomial distribution or normal distribution.

第７の態様は、１つ以上の概念が出現するスポット情報について、当該スポット情報に出現する各概念の上記興味スコアｚを用いて、当該スポット情報に対するユーザの評価スコアを算出する機能をさらに備えたものである。 The seventh aspect further includes a function of calculating the user's evaluation score for the spot information using the interest score z of each concept that appears in the spot information for the spot information in which one or more concepts appear. It is a thing.

第８の態様は、上記スポット情報と各概念との関連度及び各概念の重要度の少なくとも１つに応じて上記興味スコアｚを重み付けするようにしたものである。 In the eighth aspect, the interest score z is weighted according to at least one of the degree of association between the spot information and each concept and the importance of each concept.

すなわちこの発明によれば、ユーザの閲覧履歴実績が少ない状況下でも、ユーザによる条件の手入力に頼ることなくより高精度な情報推薦を可能にした情報推薦方法、装置及びプログラムを提供することができる。 That is, according to the present invention, it is possible to provide an information recommendation method, apparatus, and program that enable highly accurate information recommendation without relying on manual input of conditions by a user even in a situation where the user's browsing history record is small. it can.

この発明の第１の実施形態に係る情報推薦装置を備えたシステムの全体構成を示す図。The figure which shows the whole structure of the system provided with the information recommendation apparatus which concerns on 1st Embodiment of this invention. 図１に示した情報推薦装置における嗜好学習処理部の機能構成を示すブロック図。The block diagram which shows the function structure of the preference learning process part in the information recommendation apparatus shown in FIG. 図１に示した情報推薦装置における飲食店推薦処理部の機能構成を示すブロック図。The block diagram which shows the function structure of the restaurant recommendation process part in the information recommendation apparatus shown in FIG. 図１に示した情報推薦装置による全体の処理手順と処理内容を示すフローチャート。The flowchart which shows the whole process sequence and process content by the information recommendation apparatus shown in FIG. 図４に示した全体処理のうち嗜好学習処理の処理手順と処理内容を示すフローチャート。The flowchart which shows the process sequence and process content of a preference learning process among the whole processes shown in FIG. 図４に示した全体処理のうち飲食店推薦処理の処理手順と処理内容を示すフローチャート。The flowchart which shows the process sequence and process content of a restaurant recommendation process among the whole processes shown in FIG. 図５に示した嗜好学習処理において閲覧チャンク及び滞在チャンクを抽出する処理を説明するための図。The figure for demonstrating the process which extracts a browsing chunk and a stay chunk in the preference learning process shown in FIG. 図５に示した嗜好学習処理において参照飲食店集合に対する閲覧チャンク及び滞在チャンクの乖離確率を算出する処理を説明するための図。The figure for demonstrating the process which calculates the deviation probability of the browsing chunk and stay chunk with respect to a reference restaurant set in the preference learning process shown in FIG. 図５に示した嗜好学習処理において、共起チャンクを抽出する処理と、参照飲食店集合に対する共起チャンクの乖離確率を算出する処理を説明するための図。The figure for demonstrating the process which calculates the deviation probability of the co-occurrence chunk with respect to a reference restaurant set in the preference learning process shown in FIG. 図１に示した情報推薦装置による推薦精度の評価結果を示す図。The figure which shows the evaluation result of the recommendation precision by the information recommendation apparatus shown in FIG.

以下、図面を参照してこの発明に係わる実施形態を説明する。
（第１の実施形態）
［構成］
図１はこの発明の第１の実施形態に係るシステムの概略構成図であり、ユーザそれぞれ所持するユーザ端末ＵＴ１〜ＵＴｎを通信ネットワークＮＷを介して情報推薦装置としての機能を有する情報サーバＳＶに接続可能としたものである。
通信ネットワークＮＷは、インターネットと、このインターネットにアクセスするためのアクセス網とから構成される。アクセス網としては、携帯電話網やホットスポット（登録商標）、WiMAX（登録商標）、無線ＬＡＮ等の移動通信網が用いられる。 Embodiments according to the present invention will be described below with reference to the drawings.
(First embodiment)
[Constitution]
FIG. 1 is a schematic configuration diagram of a system according to a first embodiment of the present invention, in which user terminals UT1 to UTn possessed by respective users are connected to an information server SV having a function as an information recommendation device via a communication network NW. It is possible.
The communication network NW includes the Internet and an access network for accessing the Internet. As the access network, a mobile communication network such as a mobile phone network, Hotspot (registered trademark), WiMAX (registered trademark), or wireless LAN is used.

ユーザ端末ＵＴ１〜ＵＴｎは、例えば携帯電話機やスマートホン、ＰＤＡ（Personal Digital Assistant）、タブレット型端末、ノート型のパーソナル・コンピュータ等の携帯端末からなり、ＧＰＳ（Global Positioning System）を利用して自端末の位置を計測する機能と、Ｗｅｂサイトにアクセスするためのブラウザ機能と、電子メールを送受信するメーラ機能を備えている。 The user terminals UT1 to UTn are, for example, mobile terminals such as mobile phones, smart phones, PDAs (Personal Digital Assistants), tablet terminals, notebook personal computers, and the like, and use the GPS (Global Positioning System). A function for measuring the position of the user, a browser function for accessing a Web site, and a mailer function for sending and receiving electronic mail.

情報サーバＳＶは例えばＷｅｂサーバからなり、図示しない通信インタフェースユニットと、制御ユニット１と、記憶ユニット２を備えている。
通信インタフェースユニットは、上記通信ネットワークＮＷを介してユーザ端末ＵＴ１〜ＵＴｎとの間で情報通信を行うもので、通信プロトコルとしては例えばＴＣＰ／ＩＰ（Transmission Control Protocol／Internet Protocol）が用いられる。 The information server SV is composed of, for example, a Web server, and includes a communication interface unit (not shown), a control unit 1, and a storage unit 2.
The communication interface unit performs information communication with the user terminals UT1 to UTn via the communication network NW. For example, TCP / IP (Transmission Control Protocol / Internet Protocol) is used as the communication protocol.

記憶ユニット２は、記憶媒体としてＨＤＤやＮＡＮＤ型フラッシュメモリ等の随時書込み読出しが可能な不揮発性メモリを使用したもので、この実施形態を実施するために必要な記憶機能として、ＧＰＳ移動履歴記憶部２１と、詳細閲覧履歴記憶部２２と、嗜好データベース２３と、飲食店データベース２４と、推薦リスト記憶部２５を備えている。 The storage unit 2 uses a non-volatile memory that can be written and read as needed, such as an HDD or a NAND flash memory, as a storage medium. As a storage function necessary to implement this embodiment, a GPS movement history storage unit 21, a detailed browsing history storage unit 22, a preference database 23, a restaurant database 24, and a recommendation list storage unit 25.

ＧＰＳ移動履歴記憶部２１は、ユーザ端末ＵＴ１〜ＵＴｎから定期的に収集されるユーザのＧＰＳ移動履歴データを蓄積するために用いられる。このＧＰＳ移動履歴データは、ユーザの滞在地を表す測位データの集合からなり、各測位データは測位時刻と測位座標とからなる。 The GPS movement history storage unit 21 is used for accumulating user's GPS movement history data periodically collected from the user terminals UT1 to UTn. This GPS movement history data consists of a set of positioning data representing the user's place of stay, and each positioning data consists of positioning time and positioning coordinates.

詳細閲覧履歴記憶部２２は、ユーザ端末ＵＴ１〜ＵＴｎから送信された飲食店のＵＲＬ（Uniform Resource Locator）に対する閲覧履歴を表すデータの集合を蓄積するために用いられる。各閲覧履歴データは、閲覧時刻と店ＩＤとで表される。 The detailed browsing history storage unit 22 is used for accumulating a set of data representing a browsing history with respect to a restaurant URL (Uniform Resource Locator) transmitted from the user terminals UT1 to UTn. Each browsing history data is represented by browsing time and store ID.

嗜好データベース（Preference DB）２３は、後述する嗜好学習処理部１３により計算された嗜好データを記憶するために用いられる。各嗜好データは、タグと嗜好スコアにより構成される。 A preference database (Preference DB) 23 is used to store preference data calculated by a preference learning processing unit 13 described later. Each preference data includes a tag and a preference score.

飲食店データベース２４には、システムのサービスエリアに存在する飲食店を表す情報が予め記憶されている。この飲食店を表す情報は、「店ＩＤ」と、店の概要を表す「概要」と、メニューや価格帯等を表す「詳細」、店のＷｅｂサイトへのリンク情報である「ＵＲＬ」と、移動・経度により表される店の「住所座標」と、「タグ」により構成される。「タグ」は、飲食店のグルメジャンルや駐車場の有無等の属性情報を示す。 In the restaurant database 24, information representing restaurants existing in the service area of the system is stored in advance. The information representing the restaurant includes a “store ID”, a “summary” representing a summary of the store, a “detail” representing a menu, a price range, etc., a “URL” which is link information to the store website, It is composed of “address coordinates” and “tag” of the store represented by movement / longitude. “Tag” indicates attribute information such as the gourmet genre of the restaurant and the presence / absence of a parking lot.

推薦リスト記憶部２５は、後述する飲食店推薦処理部１４により算出された飲食店の推薦リストを保存するために用いられる。推薦リスト（Recommend List）は、推薦候補となる飲食店データの集合を嗜好スコア順にソートしたもので、推薦順位、店ＩＤ、概要、詳細及びＵＲＬにより構成される。 The recommendation list storage unit 25 is used for storing a restaurant recommendation list calculated by the restaurant recommendation processing unit 14 described later. The recommendation list (Recommend List) is a collection of restaurant data serving as recommendation candidates, sorted in order of preference score, and includes a recommendation rank, a store ID, an outline, details, and a URL.

制御ユニット１は、中央処理ユニット（Central Processing Unit：ＣＰＵ）を備えたもので、この実施形態を実施するために必要な制御機能として、ＧＰＳ移動履歴収集部１１と、詳細閲覧履歴収集部１２と、嗜好学習処理部１３と、飲食店推薦処理部１４を備えている。 The control unit 1 includes a central processing unit (CPU). As a control function necessary to implement this embodiment, a GPS movement history collection unit 11, a detailed browsing history collection unit 12, The preference learning processing unit 13 and the restaurant recommendation processing unit 14 are provided.

ＧＰＳ移動履歴収集部１１は、定期的にユーザ端末ＵＴ１〜ＵＴｎから当該ユーザ端末ＵＴ１〜ＵＴｎにおいて計測されたＧＰＳ移動履歴データを通信ネットワークＮＷを介してそれぞれ収集し、収集したＧＰＳ移動履歴データをＧＰＳ移動履歴記憶部２１に記憶させる機能を有する。 The GPS movement history collection unit 11 periodically collects GPS movement history data measured at the user terminals UT1 to UTn from the user terminals UT1 to UTn via the communication network NW, and collects the collected GPS movement history data to the GPS. It has a function to be stored in the movement history storage unit 21.

詳細閲覧履歴収集部１２は、飲食店のＵＲＬに対するユーザ端末ＵＴ１〜ＵＴｎのアクセス履歴、つまり詳細閲覧履歴データを、当該ユーザ端末ＵＴ１〜ＵＴｎから通信ネットワークＮＷを介してそれぞれ収集する。そして、この収集した閲覧履歴データをＧＰＳ移動履歴記憶部２１に記憶させる機能を有する。 The detailed browsing history collection unit 12 collects the access history of the user terminals UT1 to UTn with respect to the URL of the restaurant, that is, detailed browsing history data, from the user terminals UT1 to UTn via the communication network NW. The collected browsing history data is stored in the GPS movement history storage unit 21.

嗜好学習処理部１３は、上記ＧＰＳ移動履歴記憶部２１に蓄積されたユーザのＧＰＳ移動履歴データと、上記詳細閲覧履歴記憶部２２に蓄積されている詳細閲覧履歴データと、上記飲食店データベース２４に記憶された飲食店データに基づいて、各飲食店に対するユーザの嗜好スコアを算出し、この算出された嗜好スコアを嗜好データベース２３に記憶させる機能を有するもので、具体的には以下のように構成される。図２はその機能構成を示すブロック図である。 The preference learning processing unit 13 stores the GPS movement history data of the user stored in the GPS movement history storage unit 21, the detailed browsing history data stored in the detailed browsing history storage unit 22, and the restaurant database 24. Based on the stored restaurant data, a user's preference score for each restaurant is calculated, and the calculated preference score is stored in the preference database 23. Specifically, the configuration is as follows. Is done. FIG. 2 is a block diagram showing the functional configuration.

すなわち嗜好学習処理部１３は、参照飲食店集合作成部１３１と、詳細閲覧履歴分析部１３２と、ＧＰＳ移動履歴分析部１３３と、飲食店共起分析部１３４と、嗜好スコア計算部１３５を備えている。 That is, the preference learning processing unit 13 includes a reference restaurant set creation unit 131, a detailed browsing history analysis unit 132, a GPS movement history analysis unit 133, a restaurant co-occurrence analysis unit 134, and a preference score calculation unit 135. Yes.

参照飲食店集合作成部１３１は、ユーザごとに、上記ＧＰＳ移動履歴記憶部２１に蓄積されたＧＰＳ移動履歴データをもとに当該ユーザの行動範囲を分析し、この行動範囲内に存在する飲食店を飲食店データベース２４から抽出して参照飲食店集合を作成する機能を有する。 The reference restaurant set creation unit 131 analyzes the user's action range based on the GPS movement history data stored in the GPS movement history storage unit 21 for each user, and the restaurants that exist within this action range. Is extracted from the restaurant database 24 to create a reference restaurant set.

詳細閲覧履歴分析部１３２は、ユーザごとに、上記詳細閲覧履歴記憶部２２に蓄積された詳細閲覧履歴データに出現する各店に関して、当該各店が保持するタグの合計数について、上記参照飲食店集合作成部１３１により作成された参照飲食店集合から無作為抽出した場合との乖離確率を算出する機能を有する。なお、後述するＧＰＳ移動履歴との統合を考慮した乖離確率の計算のために、上記詳細閲覧履歴から閲覧時刻の近さをもとに詳細閲覧のまとまりを抽出する。このまとまりを閲覧チャンクと呼ぶ。 For each user, the detailed browsing history analysis unit 132 creates the reference restaurant set creation for the total number of tags held by each store for each store that appears in the detailed browsing history data accumulated in the detailed browsing history storage unit 22. It has a function of calculating a deviation probability from the case of random extraction from the reference restaurant set created by the unit 131. In order to calculate a divergence probability in consideration of integration with a GPS movement history described later, a group of detailed browsing is extracted from the detailed browsing history based on the proximity of the browsing time. This group is called a browsing chunk.

ＧＰＳ移動履歴分析部１３３は、上記ユーザごとに、上記ＧＰＳ移動履歴記憶部２１に蓄積されたＧＰＳ移動履歴データをもとに、ユーザが一定時間、一定の範囲内に留まるＧＰＳ移動履歴を抽出する。この抽出されたＧＰＳ移動履歴を滞在チャンクと呼称する。そして、この抽出した滞在チャンクにおける訪問可能性飲食店を、参照飲食店集合からの標本と考え、タグの滞在チャンクにおける無作為抽出との乖離確率を算出する機能を有する。 The GPS movement history analysis unit 133 extracts, for each user, a GPS movement history in which the user stays within a certain range for a certain period of time based on the GPS movement history data stored in the GPS movement history storage unit 21. . This extracted GPS movement history is referred to as a stay chunk. Then, the visit possibility restaurant in the extracted stay chunk is considered as a sample from the reference restaurant set, and has a function of calculating a deviation probability from the random extraction in the tag stay chunk.

飲食店共起分析部１３４は、詳細閲覧後にユーザが実際に飲食店に訪問した可能性を分析に反映させるため、閲覧チャンクに出現した飲食店と、滞在チャンクで訪問可能性飲食店の共起について、これまでの各履歴への分析と同様に無作為抽出した場合との乖離確率を算出する機能を有する。 The restaurant co-occurrence analysis unit 134 reflects the possibility that the user has actually visited the restaurant after the detailed browsing in order to reflect in the analysis the co-occurrence of the restaurant that appeared in the browsing chunk and the restaurant that can be visited in the stay chunk As in the analysis to each history so far, it has a function of calculating the deviation probability from the case of random extraction.

嗜好スコア計算部１３５は、上記詳細閲覧履歴分析部１３２、ＧＰＳ移動履歴分析部１３３及び飲食店共起分析部１３４によりそれぞれ算出された乖離確率を統合し、これにより各タグに対するユーザの嗜好スコアを算出し、この算出された嗜好スコアを嗜好データベース２３に格納する機能を有する。 The preference score calculation unit 135 integrates the divergence probabilities calculated by the detailed browsing history analysis unit 132, the GPS movement history analysis unit 133, and the restaurant co-occurrence analysis unit 134, thereby calculating the user's preference score for each tag. A function of calculating and storing the calculated preference score in the preference database 23;

飲食店推薦処理部１４は、ユーザの要求に応じて推薦候補となる飲食店集合を作成し、この候補飲食店集合に含まれる各飲食店についてのスコアを計算して推薦リストを作成し、この推薦リストを要求元のユーザ端末ＵＴ１〜ＵＴｎへ返送する機能を有するもので、具体的には以下のように構成される。図３はその機能構成を示すブロック図である。 The restaurant recommendation processing unit 14 creates a restaurant set serving as a recommendation candidate in response to a user request, calculates a score for each restaurant included in the candidate restaurant set, creates a recommendation list, The recommendation list is returned to the requesting user terminals UT1 to UTn. Specifically, the recommendation list is configured as follows. FIG. 3 is a block diagram showing the functional configuration.

すなわち飲食店推薦処理部１４は、候補飲食店集合作成部１４１と、飲食店スコア計算部１４２と、推薦リスト作成部１４３を備えている。
候補飲食店集合作成部１４１は、ユーザ端末ＵＴ１〜ＵＴｎから飲食店の推薦を要求するユーザリクエストが送られた場合に、当該ユーザリクエストに含まれる地域とタグにより表される店舗条件に該当する飲食店の集合を飲食店データベース２４から検索し、これにより候補飲食店集合を作成する機能を有する。 That is, the restaurant recommendation processing unit 14 includes a candidate restaurant set creation unit 141, a restaurant score calculation unit 142, and a recommendation list creation unit 143.
The candidate restaurant set creation unit 141, when a user request for requesting a restaurant is sent from the user terminals UT1 to UTn, the restaurant corresponding to the store condition represented by the region and the tag included in the user request. It has a function of retrieving a set of shops from the restaurant database 24 and thereby creating a candidate restaurant set.

飲食店スコア計算部１４２は、上記嗜好データベース２３に記憶された、タグに対するユーザの嗜好スコアをもとに、上記候補飲食店集合に含まれる各飲食店のスコアを算出する機能を有する。この飲食店スコアは、上記ユーザの嗜好スコアとの合致度を表す情報となる。 The restaurant score calculation unit 142 has a function of calculating the score of each restaurant included in the candidate restaurant set based on the user's preference score for the tag stored in the preference database 23. This restaurant score is information representing the degree of coincidence with the user's preference score.

推薦リスト作成部１４３は、上記算出された各飲食店のスコアをもとに飲食店情報をその降順にソートし、これにより上記ユーザの嗜好スコアとの合致度が高い順に飲食店情報を並べ替えた推薦リストを作成する。そして、この作成された推薦リストを、推薦リスト記憶部２５に保存すると共に、通信ネットワークＮＷを介して要求元のユーザ端末ＵＴ１〜ＵＴｎへ返送する機能を有する。
なお、以上述べた制御ユニット１の各処理機能は、何れも図示しないプログラムメモリに格納されたアプリケーション・プログラムを上記ＣＰＵに実行させることにより実現される。 The recommendation list creation unit 143 sorts the restaurant information in descending order based on the calculated score of each restaurant, thereby rearranging the restaurant information in descending order of the degree of coincidence with the user preference score. Create a recommended list. The prepared recommendation list is stored in the recommendation list storage unit 25 and returned to the requesting user terminals UT1 to UTn via the communication network NW.
Each processing function of the control unit 1 described above is realized by causing the CPU to execute an application program stored in a program memory (not shown).

［動作］
次に、以上のように構成された情報推薦サーバＳＶによる情報推薦処理の動作を説明する。図４はその全体の処理手順と処理内容を示すフローチャートである。
（１）ＧＰＳ移動履歴及び閲覧履歴の収集処理
情報推薦サーバＳＶの制御ユニット１は、ステップＳ１及びステップ３によりそれぞれＧＰＳ移動履歴の収集タイミングと詳細閲覧履歴の収集タイミングを監視している。この状態で、例えば一定の時間間隔に設定されたＧＰＳ移動履歴の収集タイミングになると、ステップＳ２によりＧＰＳ移動履歴収集部１１を起動し、このＧＰＳ移動履歴収集部１１の制御の下で、各ユーザ端末ＵＴ１〜ＵＴｎに対し順次ＧＰＳ移動履歴の収集要求を送信する。 [Operation]
Next, an operation of information recommendation processing by the information recommendation server SV configured as described above will be described. FIG. 4 is a flowchart showing the overall processing procedure and processing contents.
(1) Collection processing of GPS movement history and browsing history The control unit 1 of the information recommendation server SV monitors the collection timing of the GPS movement history and the collection timing of the detailed browsing history at step S1 and step 3, respectively. In this state, for example, when the GPS movement history collection timing set at a certain time interval comes, the GPS movement history collection unit 11 is activated in step S2, and each user is controlled under the control of the GPS movement history collection unit 11. A GPS movement history collection request is sequentially transmitted to the terminals UT1 to UTn.

これに対し各ユーザ端末ＵＴ１〜ＵＴｎでは、前回の収集タイミングから今回の収集タイミングまでの期間に計測されたＧＰＳ測位データの集合が記憶されており、上記収集要求を受信すると各ユーザ端末ＵＴ１〜ＵＴｎは上記記憶されていたＧＰＳ測位データの集合を読み出して情報推薦サーバＳＶへ送信する。 On the other hand, each user terminal UT1 to UTn stores a set of GPS positioning data measured during the period from the previous collection timing to the current collection timing, and when receiving the collection request, each user terminal UT1 to UTn. Reads out the stored GPS positioning data set and transmits it to the information recommendation server SV.

情報推薦サーバＳＶの制御ユニット１は、ＧＰＳ移動履歴収集部１１の制御の下で、上記各ユーザ端末ＵＴ１〜ＵＴｎから送られたＧＰＳ測位データの集合を受信し、この受信されたＧＰＳ測位データの集合をＧＰＳ移動履歴データとしてＧＰＳ移動履歴記憶部２１に格納する。 The control unit 1 of the information recommendation server SV receives a set of GPS positioning data sent from each of the user terminals UT1 to UTn under the control of the GPS movement history collection unit 11, and receives the received GPS positioning data. The set is stored in the GPS movement history storage unit 21 as GPS movement history data.

一方、詳細閲覧履歴の収集タイミングになると、ステップＳ４により詳細閲覧履歴収集部１２を起動し、この詳細閲覧履歴収集部１２の制御の下で、先に述べたＧＰＳ移動履歴を収集する場合と同様に、各ユーザ端末ＵＴ１〜ＵＴｎに対し順次詳細閲覧履歴の収集要求を送信する。 On the other hand, when the detailed browsing history collection timing comes, the detailed browsing history collecting unit 12 is activated in step S4, and the GPS movement history is collected under the control of the detailed browsing history collecting unit 12 as described above. In addition, a detailed browsing history collection request is sequentially transmitted to each of the user terminals UT1 to UTn.

これに対し各ユーザ端末ＵＴ１〜ＵＴｎでは、前回の収集タイミングから今回の収集タイミングまでの期間にユーザが飲食店の詳細な情報を確認するために閲覧した当該飲食店のＵＲＬと閲覧時刻が記憶されている。そして、上記収集要求を受信すると、各ユーザ端末ＵＴ１〜ＵＴｎは上記記憶されていた飲食店のＵＲＬと閲覧時刻とからなる閲覧履歴データを読み出し、情報推薦サーバＳＶへ送信する。 On the other hand, in each user terminal UT1 to UTn, the URL and browsing time of the restaurant that the user browsed to confirm the detailed information of the restaurant during the period from the previous collection timing to the current collection timing are stored. ing. When receiving the collection request, each of the user terminals UT1 to UTn reads the browsing history data including the stored restaurant URL and browsing time, and transmits the reading history data to the information recommendation server SV.

情報推薦サーバＳＶの制御ユニット１は、詳細閲覧履歴収集部１２の制御の下で、上記各ユーザ端末ＵＴ１〜ＵＴｎから送られた閲覧履歴データを受信し、この受信された閲覧履歴データを詳細閲覧履歴記憶部２２に格納する。 The control unit 1 of the information recommendation server SV receives the browsing history data transmitted from each of the user terminals UT1 to UTn under the control of the detailed browsing history collection unit 12, and performs detailed browsing of the received browsing history data. Store in the history storage unit 22.

（２）嗜好学習処理（Learning Process）
情報推薦サーバＳＶの制御ユニット１は、上記ＧＰＳ移動履歴データ及び詳細閲覧履歴データの収集処理を定期的に実行しながら、ステップＳ５で嗜好学習タイミングになったか否かを監視している。この状態で予め定められた処理タイミングになると、ステップＳ６により嗜好学習処理部１３を起動し、この嗜好学習処理部１３の制御の下で以下のようにユーザの飲食店に対する嗜好を学習する。図５はその処理手順と処理内容を示すフローチャートである。 (2) Preference learning process (Learning Process)
The control unit 1 of the information recommendation server SV monitors whether or not the preference learning timing has come in step S5 while periodically collecting the GPS movement history data and the detailed browsing history data. When the predetermined processing timing is reached in this state, the preference learning processing unit 13 is activated in step S6, and the user's preference for the restaurant is learned as described below under the control of the preference learning processing unit 13. FIG. 5 is a flowchart showing the processing procedure and processing contents.

（２−１）参照飲食店集合の作成
情報推薦サーバＳＶの制御ユニット１は、先ずステップＳ６１により参照飲食店集合作成部１３１を起動する。そして、この参照飲食店集合作成部１３１の制御の下、ステップＳ６１１により学習対象のユーザに関するＧＰＳ移動履歴データをＧＰＳ移動履歴記憶部２１から読み出す。次にステップＳ６１２において、上記読み出されたＧＰＳ移動履歴データをもとにユーザの行動範囲を表す参照エリアを求め、この参照エリア内に存在する飲食店をステップＳ６１３により飲食店データベース２４から読み出して、参照飲食店集合とする。なお、参照エリアの決定については、ユーザへの行動範囲や居住地、勤務地に対するアンケート、プロフィール情報等による手入力で収集した情報を元に決定することもできる。さらに、ＧＰＳ移動履歴以外でも、位置情報が事前に把握されている無線基地局への接続履歴、前記無線基地局からの電波強度による位置測位情報等によって決定することもできる。 (2-1) Creation of reference restaurant set The control unit 1 of the information recommendation server SV first activates the reference restaurant set creation unit 131 in step S61. Then, under the control of the reference restaurant set creation unit 131, the GPS movement history data related to the learning target user is read from the GPS movement history storage unit 21 in step S <b> 611. Next, in step S612, a reference area representing the user's action range is obtained based on the read GPS movement history data, and restaurants existing in the reference area are read from the restaurant database 24 in step S613. The reference restaurant set. Note that the reference area can be determined based on information collected by manual input based on the user's range of action, residence, work location questionnaire, profile information, and the like. Further, other than the GPS movement history, it can be determined based on the connection history to the radio base station whose position information is known in advance, the position positioning information based on the radio wave intensity from the radio base station, and the like.

上記参照飲食店集合を作成する具体的な処理方法としては、以下の方法が考えられる。すなわち、先ずユーザのＧＰＳ移動履歴について全測位点の重心を計算し、この重心から各測位点の９５％が含まれる距離を計算する。９５％とする理由は、旅行等による移動による外れ値を考慮するためである。次に、この距離の定数倍を半径として上記計算された重心を中心とする円エリアを描き、この円エリアを参照エリアとしてこの参照エリア内に存在する飲食店を飲食店データベース２４から読み出す。 The following method can be considered as a specific processing method for creating the reference restaurant set. That is, first, the center of gravity of all positioning points is calculated for the user's GPS movement history, and the distance including 95% of each positioning point is calculated from this center of gravity. The reason for setting it to 95% is to consider outliers due to travel due to travel or the like. Next, a circle area centered on the calculated center of gravity is drawn with a constant multiple of this distance as a radius, and restaurants existing in this reference area are read from the restaurant database 24 using this circle area as a reference area.

（２−２）詳細閲覧履歴の分析
制御ユニット１は、次にステップＳ６２により詳細閲覧履歴分析部１３２を起動する。そして、この詳細閲覧履歴分析部１３２の制御の下、先ずステップＳ６２１により詳細閲覧履歴記憶部２２から学習対象ユーザの詳細閲覧履歴データを読出し、この詳細閲覧履歴から閲覧時刻の近さをもとに詳細閲覧のまとまりを抽出する。このまとまりを閲覧チャンクと呼ぶ。具体的には、あるユーザの詳細閲覧履歴が多数ある場合に、各履歴の前後一定時間（例えば３０分）以内に別の履歴がある場合には、これらの履歴を同じ閲覧チャンクとしてまとめる。この処理を時系列的に順次行うことで、詳細閲覧履歴データから閲覧チャンク群を抽出する。図７（ａ）はその抽出結果の一例を示すもので、この例では閲覧チャンク１、閲覧チャンク２、閲覧チャンク３が抽出された場合を示している。 (2-2) Analysis of Detailed Browsing History Next, the control unit 1 activates the detailed browsing history analyzing unit 132 in step S62. Then, under the control of the detailed browsing history analysis unit 132, first, the detailed browsing history data of the learning target user is read from the detailed browsing history storage unit 22 in step S621, and based on the closeness of the browsing time from the detailed browsing history. Extract a group of detailed browsing. This group is called a browsing chunk. Specifically, when there are many detailed browsing histories of a certain user and there are other histories within a certain time (for example, 30 minutes) before and after each history, these histories are collected as the same browsing chunk. By sequentially performing this process in time series, a browsing chunk group is extracted from the detailed browsing history data. FIG. 7A shows an example of the extraction result. In this example, browsing chunk 1, browsing chunk 2, and browsing chunk 3 are extracted.

なお、一般にユーザ情報閲覧は、一定の時間帯に集中して行われる。このとき、一連の情報閲覧の中で各詳細閲覧店を選択する際には、他の詳細閲覧店との関係も存在すると考えられる。このため、閲覧チャンク化して分析を行うことは、推薦精度の向上させる上で有効である。 In general, user information browsing is concentrated in a certain time zone. At this time, when selecting each detailed browsing store in a series of information browsing, it is considered that there is also a relationship with other detailed browsing stores. For this reason, it is effective to improve analysis by recommending browsing chunks for analysis.

次にステップＳ６２２において、詳細閲覧履歴データに出現する各店に関して、当該各店が保持するタグの合計数について、上記参照飲食店集合作成部１３１により作成された参照飲食店集合から無作為抽出した場合との乖離確率を算出する。
例えば、上記抽出された各閲覧チャンクを参照飲食店集合からの標本と捉え、タグｉの閲覧チャンクｊにおける無作為抽出との乖離確率ｐijを、

とする。 Next, in step S622, for each store appearing in the detailed browsing history data, the total number of tags held by each store is randomly extracted from the reference restaurant set created by the reference restaurant set creating unit 131. Is calculated.
For example, each of the extracted browsing chunks is regarded as a sample from the reference restaurant set, and the deviation probability pij from the random sampling in the browsing chunk j of the tag i is expressed as follows:

And

なお、ｒは参照飲食店集合内の飲食店数、ｄi は参照飲食店集合内のタグｉが付与された飲食店数である。また、ｖj は閲覧チャンクｊ内における出現飲食店数、ｎijは閲覧チャンクｊ内でタグｉが付与されている飲食店数である。これらのｒ、ｄi 、ｎijは飲食店データベース２４から読み出して算出する。 In addition, r is the number of restaurants in the reference restaurant set, and di is the number of restaurants to which the tag i in the reference restaurant set is assigned. Further, vj is the number of restaurants that appear in the browsing chunk j, and nij is the number of restaurants that are assigned the tag i in the browsing chunk j. These r, di and nij are read from the restaurant database 24 and calculated.

（２−３）ＧＰＳ移動履歴の分析
制御ユニット１は、次にステップＳ６３においてＧＰＳ移動履歴分析部１３３を起動し、このＧＰＳ移動履歴分析部１３３の制御の下、以下のようにＧＰＳ移動履歴の分析処理を実行する。
すなわち、先ずステップＳ６３１により、ユーザごとに、上記ＧＰＳ移動履歴記憶部２１に蓄積されたＧＰＳ移動履歴データをもとに、ユーザが一定時間、一定の範囲内に留まるＧＰＳ移動履歴を抽出し、これを滞在チャンクとする。図７（ｂ）はその抽出結果の一例を示すもので、この例では滞在チャンク１と滞在チャンク２が抽出された場合を示している。 (2-3) Analysis of GPS Movement History Next, the control unit 1 activates the GPS movement history analysis unit 133 in step S63, and under the control of the GPS movement history analysis unit 133, the GPS movement history analysis is performed as follows. Perform analysis processing.
That is, first, in step S631, for each user, a GPS movement history in which the user stays within a certain range for a certain period of time is extracted based on the GPS movement history data stored in the GPS movement history storage unit 21. Is a stay chunk. FIG. 7B shows an example of the extraction result. In this example, the stay chunk 1 and the stay chunk 2 are extracted.

この滞在チャンクの抽出には、ＧＰＳ測位点にしばしば現れる外れ値や測位結果の変動を一定程度排除するためにＤＢＳＣＡＮ法を用いる方法がある。他に、一定期間、一定範囲、一定数のＧＰＳ測位点が存在する場合を滞在チャンクとして取り出す方法もある。ＤＢＳＣＡＮ法は、分析対象店群について距離の閾値Ｅpsと、近接対象店数の閾値ＭinＰtsという２つのパラメータで決定される、接続関係（Directly Density Reachable）で連結される群を求める方法である。但し、ここでは接続関係の探索を時系列に限定し、連結が抽出できた場合には、始点と終了点との間の時間間隔を計算し、一定時間を超えた群を滞在チャンクとする。その際、初めの時刻を滞在開始時刻、終わりの時刻を滞在終了時刻とする。 For the extraction of the stay chunk, there is a method of using the DBSCAN method in order to exclude outliers often appearing at GPS positioning points and fluctuations in positioning results to a certain extent. In addition, there is a method of extracting a case where there are a certain number of GPS positioning points for a certain period, a certain range, as a stay chunk. The DBSCAN method is a method for obtaining a group connected by a connection relationship (Directly Density Reachable), which is determined by two parameters of a threshold value Eps of the distance to be analyzed and a threshold value MinPts of the number of adjacent stores. However, here, the connection relationship search is limited to the time series, and when the connection can be extracted, the time interval between the start point and the end point is calculated, and the group exceeding the predetermined time is set as the stay chunk. At that time, the first time is the stay start time and the last time is the stay end time.

続いてステップＳ６３２において、上記抽出された滞在チャンクに属するＧＰＳ測位点の重心を中心点とし、一定距離内に存在する飲食店を飲食店データベース２４から抽出する。この抽出された飲食店を滞在チャンクにおける「訪問可能性飲食店」とする。なお、一定距離のパラメータは、使用するＧＰＳ機器の測位精度や履歴の特徴等により決定される。 Subsequently, in step S632, restaurants that exist within a certain distance are extracted from the restaurant database 24 with the center of gravity of the GPS positioning points belonging to the extracted stay chunk as a central point. This extracted restaurant is referred to as “visitable restaurant” in the stay chunk. The constant distance parameter is determined by the positioning accuracy of the GPS device to be used, the history characteristics, and the like.

次にステップＳ６３３において、上記抽出した滞在チャンクにおける訪問可能性飲食店を参照飲食店集合からの標本と考え、タグの滞在チャンクにおける無作為抽出との乖離確率ｐikを以下のように算出する。
すなわち、タグｉの滞在チャンクをｋとしたとき、この滞在チャンクｋにおける無作為抽出との乖離確率ｐikを、

とする。 Next, in step S633, the visit possibility restaurant in the extracted stay chunk is considered as a sample from the reference restaurant set, and the deviation probability pik from the random extraction in the tag stay chunk is calculated as follows.
That is, assuming that the stay chunk of tag i is k, the divergence probability pik with random sampling in this stay chunk k is

And

なお、Ｓk は滞在チャンクｋの訪問可能性飲食店数、ｎikは滞在チャンクｋにおいてタグｉが付与されている訪問可能性飲食店数である。
なお、各滞在チャンクについて訪問店は原則１店あると想定し、式（５）では、タグｉが訪問可能性飲食店のどれか１つに付与されているものとして、無作為抽出した場合との乖離確率ｐikを計算する。 Note that Sk is the number of restaurants that can be visited by stay chunk k, and nik is the number of restaurants that can be visited by tag i in stay chunk k.
Assuming that there is one store in principle for each stay chunk, in equation (5), if tag i is assigned to any one of the visitable restaurants The divergence probability pik of is calculated.

次にステップＳ６３４において、時刻による飲食行動可能性を計算する。すなわち、より高精度の店舗推薦を行うには、推薦対象の飲食店に訪問している可能性が高い滞在チャンクの影響を強くすることが適切である。各滞在チャンクは、滞在開始時刻と滞在終了時刻の情報を持つため、時刻を考慮した重要度を算出し分析に利用する。 Next, in step S634, the possibility of eating and drinking behavior according to time is calculated. In other words, in order to recommend a store with higher accuracy, it is appropriate to increase the influence of stay chunks that are highly likely to be visiting a restaurant to be recommended. Since each stay chunk has information on a stay start time and a stay end time, the importance considering the time is calculated and used for analysis.

第１の実施形態では、特に飲食店を推薦することを目的とすることから、時間帯別の食事行動率のデータを予め取得し記憶しておき、各滞在チャンクの滞在開始時刻から滞在終了時刻までの期間における食事行動率の平均値を、滞在チャンクｋの重要度ｗk とする。なお、ｗk の算出には、例えば総務省統計局の社会生活時間調査の値を利用することができる。
図８は、以上のようにして計算された、閲覧チャンクｊにおける無作為抽出との乖離確率と、滞在チャンクｋにおける無作為抽出との乖離確率ｐikの概念を示す図である。 In the first embodiment, in particular, for the purpose of recommending restaurants, meal activity rate data for each time zone is acquired and stored in advance, and the stay end time from the stay start time of each stay chunk. The average value of the meal action rate in the period until is set as the importance wk of the stay chunk k. For the calculation of wk, for example, the value of the social life time survey of the Statistics Bureau of the Ministry of Internal Affairs and Communications can be used.
FIG. 8 is a diagram showing the concept of the divergence probability pik between the random extraction in the browsing chunk j and the random extraction in the stay chunk k, calculated as described above.

（２−４）飲食店共起分析
制御ユニット１は、次にステップＳ６４において飲食店共起分析部１３４を起動し、この飲食店共起分析部１３４の制御の下、以下のように詳細訪問後の訪問分析を行う。すなわち、先ずステップＳ６４１において、上記詳細履歴分析部１３２により抽出された閲覧チャンクと、ＧＰＳ移動履歴分析部１３３により抽出された滞在チャンクに基づいて、各閲覧チャンクに出現した飲食店が、その閲覧時刻から滞在開始時刻が一定時間以内（例えば２４時間）の滞在チャンクにおいて訪問の可能性がある飲食店として出現した場合に、この飲食店を共起飲食店として抽出する。この共起飲食店は、ユーザがＷｅｂサイトを詳細に閲覧した飲食店の情報を契機として訪問とした可能性があると推測される。 (2-4) Restaurant Co-occurrence Analysis Next, the control unit 1 activates the restaurant co-occurrence analysis unit 134 in step S64, and under the control of the restaurant co-occurrence analysis unit 134, a detailed visit is made as follows. Perform a later visit analysis. That is, first, in step S641, a restaurant that appears in each browsing chunk based on the browsing chunk extracted by the detailed history analysis unit 132 and the stay chunk extracted by the GPS movement history analysis unit 133 has its browsing time. The restaurant is extracted as a co-occurrence restaurant when it appears as a restaurant that may be visited in a stay chunk whose stay start time is within a certain time (for example, 24 hours). It is speculated that this co-occurring restaurant may have a visit based on information about the restaurant where the user browsed the website in detail.

但し、当該時間内に繁華街等で頻繁に滞留し、多数の飲食店に近接して、偶然共起した可能性を排除する必要がある。このため、各共起飲食店が一定期間内の滞在チャンクで訪問可能性飲食店となったことの珍しさを、無作為抽出からの乖離により以下のように算出する。 However, it is necessary to eliminate the possibility that it stays frequently in the downtown area during the time and closes to many restaurants and co-occurs. For this reason, the rarity that each co-occurrence restaurant has become a visitable restaurant in a stay chunk within a certain period is calculated as follows from the deviation from random sampling.

すなわち、ステップＳ６４２において、閲覧チャンクの閲覧時刻から一定時間内の滞在チャンクの訪問可能性飲食店の全てを、上記参照飲食店集合作成部１３１により作成された参照飲食店集合からの標本と捉える。そして、タグｉの閲覧チャンクｊにおける共起飲食店に対する無作為抽出との乖離確率ｐicj を、

として算出する。 That is, in step S642, all the visiting possibility restaurants of the stay chunk within a certain time from the browsing time of the browsing chunk are regarded as samples from the reference restaurant set created by the reference restaurant set creation unit 131. And the deviation probability picj with the random sampling for the co-occurrence restaurant in the browsing chunk j of the tag i,

Calculate as

なお、ｖijは閲覧チャンクｊにおけるタグｉが付与された詳細閲覧店数、ｏj は閲覧チャンクｊの閲覧時刻から一定時間内の全滞在チャンクの訪問可能性飲食店の合計、ｃijは閲覧チャンクｊにおけるタグｉが付与された詳細閲覧店で、閲覧時刻から一定時間内の滞在チャンクの訪問可能性飲食店として共起した数である。
図９は、上記共起飲食店の抽出処理及び乖離確率の算出処理の概念を示したものである。 Note that vij is the number of detailed browsing stores to which the tag i is assigned in the browsing chunk j, oj is the total number of restaurants that can be visited by all the staying chunks within a certain time from the browsing time of the browsing chunk j, and cij is the browsing chunk j It is the number that co-occurs as a restaurant that can visit a stay chunk within a certain period of time from the browsing time at the detailed browsing store to which the tag i is assigned.
FIG. 9 shows the concept of the co-occurrence restaurant extraction process and the divergence probability calculation process.

（２−５）嗜好スコアの計算
制御ユニット１は、最後にステップＳ６５において嗜好スコア計算部１３５を起動し、この嗜好スコア計算部１３５の制御の下、各タグに対するユーザの嗜好スコアを以下のように計算する。 (2-5) Calculation of Preference Score Finally, the control unit 1 activates the preference score calculation unit 135 in step S65. Under the control of the preference score calculation unit 135, the user preference score for each tag is as follows. To calculate.

すなわち、先ずステップＳ６５１において、上記詳細閲覧履歴分析部１３２、ＧＰＳ移動履歴分析部１３３及び飲食店共起分析部１３４によりそれぞれ計算された各乖離確率ｐij，ｐik，ｐicj を、

により統合し、各タグｉに対するユーザの嗜好スコアｚi を算出する。この嗜好スコアｚi が大きいほど店舗に対するユーザの嗜好が強いと言える。 That is, first, in step S651, the divergence probabilities pij, pic, picj calculated by the detailed browsing history analysis unit 132, the GPS movement history analysis unit 133, and the restaurant co-occurrence analysis unit 134, respectively,

And the user's preference score zi for each tag i is calculated. It can be said that the greater the preference score zi, the stronger the user's preference for the store.

なお、φ1(p) は標準正規分布の累積密度関数の逆数である。また、Ｊは閲覧チャンクの総数、Ｋは滞在チャンクの総数、ｗk は滞在チャンクｋの重要度である。なお、式（７）はStouffer’s Z-score methodに基づいており、各結合対象の重要度に対して、自然な形で重みを導入し確率を結合できる。すなわち、上記飲食店情報と各概念との関連度及び各概念の重要度の少なくとも１つに応じて上記嗜好スコアｚを重み付けする。 Φ1 (p) is the reciprocal of the cumulative density function of the standard normal distribution. J is the total number of browsing chunks, K is the total number of staying chunks, and wk is the importance of the staying chunk k. Expression (7) is based on the Stouffer's Z-score method, and the probability can be combined by introducing a weight in a natural manner with respect to the importance of each combination target. That is, the preference score z is weighted according to at least one of the degree of association between the restaurant information and each concept and the importance of each concept.

（３）飲食店推薦処理（Recommend process）
情報推薦サーバＳＶの制御ユニット１は、上記ＧＰＳ移動履歴データ及び詳細閲覧履歴データの収集処理を定期的に実行しながら、ステップＳ７でユーザからの情報推薦要求（ユーザリクエスト）の到来を監視している。この状態でユーザ端末ＵＴ１〜ＵＴｎから送信されたユーザリクエストを受信すると、ステップＳ８により飲食店推薦処理部１４を起動し、この飲食店推薦処理部１４の制御の下、以下のように飲食店の推薦処理を実行する。図６はその処理手順と処理内容を示すフローチャートである。 (3) Restaurant recommendation process (Recommend process)
The control unit 1 of the information recommendation server SV monitors the arrival of an information recommendation request (user request) from the user in step S7 while periodically performing the collection processing of the GPS movement history data and the detailed browsing history data. Yes. In this state, when the user request transmitted from the user terminals UT1 to UTn is received, the restaurant recommendation processing unit 14 is activated in step S8, and under the control of the restaurant recommendation processing unit 14, the restaurant's recommendation is as follows. Execute the recommendation process. FIG. 6 is a flowchart showing the processing procedure and processing contents.

（３−１）候補飲食店集合の作成
先ずステップＳ８１において、候補飲食店集合作成部１４１の制御の下、上記受信されたユーザリクエストに含まれる地域とタグにより表される店舗条件に基づいて、当該条件に該当する飲食店を飲食店データベース２４から抽出し、これにより候補飲食店集合（Candidate restaurant set）を作成する。候補飲食店集合は、ユーザリクエストにより指定された条件、例えば地域条件として「Ｘ駅から１km以内」に基づいて、飲食店データベース２４を検索して作成する。 (3-1) Creation of candidate restaurant set First, in step S81, under the control of the candidate restaurant set creation unit 141, based on the store conditions represented by the region and the tag included in the received user request, A restaurant corresponding to the condition is extracted from the restaurant database 24, thereby creating a candidate restaurant set. The candidate restaurant set is created by searching the restaurant database 24 based on the conditions specified by the user request, for example, “within 1 km from the X station” as the regional condition.

（３−２）飲食店スコアの計算
次にステップＳ８２において飲食店スコア計算部１４２を起動し、この飲食店スコア計算部１４２の制御の下、上記嗜好データベース２３に記憶された、タグｉに対するユーザの嗜好スコアｚi をもとに、上記候補飲食店集合に含まれる各飲食店のスコアRestaurant Score(r)を、

により算出する。 (3-2) Calculation of Restaurant Score Next, in step S82, the restaurant score calculation unit 142 is activated, and the user corresponding to the tag i stored in the preference database 23 under the control of the restaurant score calculation unit 142. Based on the preference score zi, the restaurant score (r) of each restaurant included in the candidate restaurant set is

Calculated by

なお、ｒは候補飲食店集合の要素である飲食店の識別子、Ｔr は飲食店ｒ２付与されたタグの集合であり、ｔr はＴr に含まれるタグの数を示す。なお、本数式によるRestaurant Scoreによる飲食店のソートは、嗜好スコアｚi と各飲食店が持つタグをそれぞれタグｉに対する行列と見なした場合の、嗜好スコアｚi のノルムを各計算時点で定数と考えたコサイン類似度による結果と同等である。 Note that r is an identifier of a restaurant that is an element of a candidate restaurant set, Tr is a set of tags assigned to the restaurant r2, and tr indicates the number of tags included in Tr. It should be noted that the sorting of restaurants by the Restaurant Score according to this formula considers the norm of the preference score zi as a constant at each calculation point when the preference score zi and the tag of each restaurant are regarded as a matrix for the tag i. The result is equivalent to the cosine similarity.

（３−３）推薦リストの作成と配信
次にステップＳ８３において推薦リスト作成部１４３を起動し、この推薦リスト作成部１４３の制御の下で、上記飲食店スコア計算部１４２により算出された各飲食店のスコアｚi をもとに、飲食店情報をスコアｚi の降順にソートする（ステップＳ８３１）。すなわち、飲食店情報をユーザの嗜好スコアとの合致度が高い順に並べ替える。これによりユーザに適した飲食店の推薦リストが作成される。そして、この作成された推薦リストを、ステップＳ８３２により推薦リスト記憶部２５に保存すると共に、ステップＳ８３３により通信ネットワークＮＷを介して推薦要求元のユーザ端末ＵＴ１〜ＵＴｎに向け返送する。 (3-3) Creation and Distribution of Recommendation List Next, in step S83, the recommendation list creation unit 143 is activated, and each food and drink calculated by the restaurant score calculation unit 142 under the control of the recommendation list creation unit 143. Based on the store scores zi, the restaurant information is sorted in descending order of the scores zi (step S831). That is, the restaurant information is sorted in descending order of the degree of coincidence with the user's preference score. Thereby, a recommendation list of restaurants suitable for the user is created. Then, the created recommendation list is stored in the recommendation list storage unit 25 in step S832, and is returned to the recommendation request source user terminals UT1 to UTn via the communication network NW in step S833.

（４）実施形態による推薦方法の評価
この実施形態による推薦方法による推薦精度を、Random Sort、及びCosine Similarityによる推薦精度と比較した。
Random Sortは、候補飲食店集合に含まれる各店舗を無作為にソートする方法である。例えば、「現在地から１km以内」等を条件とした検索で、ユーザによる嗜好や条件入力、他の推薦方法がない場合は、この方法と同じである。このRandom Sortの評価には期待値を利用する。 (4) Evaluation of Recommendation Method According to Embodiment The recommendation accuracy according to the recommendation method according to this embodiment was compared with the recommendation accuracy according to Random Sort and Cosine Similarity.
Random Sort is a method of randomly sorting each store included in the candidate restaurant set. For example, if there is no preference, condition input, or other recommendation method by the user in a search that is “within 1 km from the current location” or the like, this method is the same. The expected value is used for evaluation of this Random Sort.

Cosine Similarityは、メモリベースのコンテンツベースフィルタリングの従来方法として、詳細閲覧履歴に出現した各飲食店にタグが付与されていた場合に、タグ種別ごとに「１」を累計した結果をベクトルとして、評価対象の飲食店に付与されたタグによるベクトルとのコサイン類似度を飲食店スコアとする方法である。 Cosine Similarity is a conventional method of memory-based content-based filtering. When a tag is attached to each restaurant that appears in the detailed browsing history, the result of accumulating “1” for each tag type is evaluated as a vector. This is a method of setting a restaurant score as a cosine similarity with a vector by a tag given to a target restaurant.

精度評価の指標として、上記各方法による評価対象飲食店１１０件の推薦順に対して、情報検索分野で多く利用されている順位を考慮した適合指標であるＤＣＧ（Discounted Cumulative Gain）を算出した。ＤＣＧを採用した理由は、例えばこの発明技術の適用対象として想定される携帯端末（スマートホン等）向けのグルメレコメンドサービスにおいては、端末に一度に表示できる情報量が少ないため、嗜好との合致度順に提示できることが重要だからである。 As an accuracy evaluation index, DCG (Discounted Cumulative Gain), which is a conformity index that takes into consideration the rank that is frequently used in the information search field, was calculated with respect to the recommendation order of 110 restaurants to be evaluated by the above methods. The reason for adopting DCG is that, for example, in a gourmet recommendation service for portable terminals (smartphones, etc.) that is assumed as an application target of the present invention technique, the amount of information that can be displayed on the terminal at a time is small. It is important to be able to present in order.

ＤＣＧの算出に必要な入力値ｇ(i) は、評価対象飲食店（１１０店）に対する被験者アンケート回答の選択肢に対し、「是非行きたい」＝７、「行きたい」＝６、「まあまあ行きたい」＝５、「どちらでもない」＝２、「行きたくない」＝１のように割り当てた。このうち、「まあまあ行きたい」と、「どちらでもない」の間のＤＣＧ入力値ｇ(i) の値の開きは、事前の被験者ヒヤリングにより、飲食店推薦の満足度には「行きたくない」飲食店の推薦を如何に減らすかが重要であることが判明したため、評価の低い飲食店が上位になる場合のＤＣＧの減衰を大きくするためである。また、今回の評価のＤＣＧ算出範囲は、同様に被験者ヒヤリングにより得た推薦結果に対する平均的な確認範囲を反映して、上位３２位とした。 The input value g (i) required for the calculation of DCG is “I want to go” = 7, “I want to go” = 6, "I want" = 5, "Neither" = 2, "I don't want to go" = 1. Among these, the opening of the DCG input value g (i) between “Well I want to go” and “Neither” is “I don't want to go to the restaurant recommendation satisfaction due to subject hearing in advance” This is to increase the attenuation of DCG when a restaurant with a low evaluation is ranked higher because it has been found that how to reduce restaurant recommendations is important. In addition, the DCG calculation range of this evaluation was ranked in the top 32 reflecting the average confirmation range for the recommendation result obtained by subject hearing similarly.

以上のような条件の下で、本実施形態による推薦方法と、上記Random Sort及びCosine Similarityによる、被験者平均のＤＣＧ値と履歴収集期間との関係を求めると、図１０に示す結果が得られた。この図１０から明らかなように、詳細閲覧履歴とＧＰＳ移動履歴との統合分析により推薦精度が向上し、詳細閲覧履歴が少ない状況でも、より高い精度での推薦が可能であることが分かる。 Under the conditions as described above, when the relationship between the DCG value of the subject average and the history collection period was determined by the recommendation method according to the present embodiment and the Random Sort and Cosine Similarity, the result shown in FIG. 10 was obtained. . As can be seen from FIG. 10, the recommendation accuracy is improved by the integrated analysis of the detailed browsing history and the GPS movement history, and it is understood that the recommendation can be made with higher accuracy even in a situation where the detailed browsing history is small.

なお、Cosine Similarityについて、各タグｉの詳細閲覧履歴における出現しやすさの違いを考慮するために、タグ出現時に加算する値を重み付けし、１／ｌｏｇ(Ei)とする方法と、１／Ｅi とする方法についても確認したが、推薦精度は向上しなかった。Ｅi は飲食店データベース２４におけるタグｉが付与された飲食店の総数である。 For Cosine Similarity, in order to take into account the difference in the likelihood of appearance in the detailed browsing history of each tag i, the value added at the time of tag appearance is weighted to 1 / log (Ei), and 1 / Ei However, the recommendation accuracy was not improved. Ei is the total number of restaurants to which the tag i in the restaurant database 24 is assigned.

［第１の実施形態の効果］
以上詳述したように第１の実施形態では、飲食店のＷｅｂサイトに対する詳細な閲覧履歴データをもとに、閲覧時刻の近さに応じた詳細閲覧のまとまりを閲覧チャンクとして抽出すると共に、ＧＰＳ移動履歴データをもとに、ユーザが一定時間に一定の範囲内に留まっている場合にその滞在地情報の集合をユーザの行動範囲のまとまりを表す滞在チャンクとして抽出する。そして、これらのチャンクを参照飲食店集合からの標本と捉え、閲覧チャンクにおける無作為抽出との乖離確率ｐij、及び滞在チャンクにおける無作為抽出との乖離確率ｐikをそれぞれ計算し、これらの乖離確率を統合して飲食店のタグに対するユーザの嗜好スコアを計算し、この嗜好スコアを用いてユーザに対する飲食店の推薦リストを作成し配信するようにしている。 [Effect of the first embodiment]
As described in detail above, in the first embodiment, based on detailed browsing history data for a restaurant website, a collection of detailed browsing according to the proximity of the browsing time is extracted as a browsing chunk, and GPS Based on the movement history data, when the user stays within a certain range at a certain time, a set of the stay location information is extracted as a stay chunk that represents a group of the user's action range. Then, these chunks are regarded as samples from the reference restaurant set, and the deviation probability pij from the random extraction in the browsing chunk and the deviation probability pik from the random extraction in the stay chunk are calculated, respectively, and these deviation probabilities are calculated. The user's preference score for the restaurant tag is calculated in a unified manner, and the restaurant recommendation list for the user is created and distributed using this preference score.

すなわち、詳細閲覧履歴に含まれる飲食店と、ＧＰＳ移動履歴から抽出される近接店とを、ユーザの行動範囲内の飲食店の集合からのユーザによる標本抽出であると見なし、各店に付与されたタグの出現数と、無作為抽出による出現数との乖離を計算して、出現数の乖離が大きいタグはユーザの嗜好と関連していると見なし、推薦に利用するようにしている。 That is, the restaurants included in the detailed browsing history and the nearby stores extracted from the GPS movement history are regarded as sample extraction by the user from the set of restaurants within the user's behavior range, and are given to each store. The divergence between the number of appearances of tags and the number of appearances by random extraction is calculated, and a tag with a large divergence in the number of appearances is considered to be related to the user's preference and is used for recommendation.

したがって、ユーザによる詳細閲覧履歴とＧＰＳ移動履歴とを用いて高精度の情報推薦処理を行うことができる。このため、ユーザの閲覧履歴実績が少ない状況下でも、ユーザによる条件の手入力に頼ることなくより高精度な情報推薦を行うことが可能となる。しかも、上記推薦リストを得る過程において、詳細閲覧履歴及びＧＰＳ移動履歴を共通データ化し、共通の指標により分析し統合するようにしているので、詳細閲覧履歴とＧＰＳ移動履歴を適切に利用して高精度の推薦リストを作成することができる。 Therefore, highly accurate information recommendation processing can be performed using the detailed browsing history by the user and the GPS movement history. For this reason, it is possible to perform highly accurate information recommendation without depending on the user's manual input of conditions even in a situation where the user's browsing history record is small. In addition, in the process of obtaining the recommendation list, the detailed browsing history and the GPS movement history are converted into common data, and are analyzed and integrated using a common index. An accuracy recommendation list can be created.

［第２の実施形態］
この発明の第２の実施形態は、飲食店の共起分析において、各閲覧チャンクに出現した飲食店が、その閲覧時刻から滞在開始時刻が一定時間以内の滞在チャンクにおいて訪問の可能性がある飲食店として出現した場合に、この飲食店を共起飲食店として抽出すると共に、各滞在チャンクにおいて訪問の可能性がある飲食店として出現した飲食店が、その滞在開始時刻から閲覧時刻が一定時間以内の閲覧チャンクにおいて出現した場合に、この飲食店を対象としている滞在チャンクにおける共起飲食店として抽出するようにしたものである。
なお、第２の実施形態においても、情報推薦装置の基本的な構成及び処理動作は第１の実施形態と同一なので、図１乃至図５を使用して説明を行う。 [Second Embodiment]
According to the second embodiment of the present invention, in the co-occurrence analysis of restaurants, a restaurant that appears in each browsing chunk may be visited in a stay chunk whose stay start time is within a certain time from the browsing time. When it appears as a restaurant, this restaurant is extracted as a co-occurrence restaurant, and a restaurant that appears as a restaurant that can be visited in each stay chunk has a browsing time within a certain time from the stay start time When it appears in the browsing chunk, it is extracted as a co-occurrence restaurant in the stay chunk targeting this restaurant.
Also in the second embodiment, the basic configuration and processing operation of the information recommendation device are the same as those in the first embodiment, and therefore description will be made with reference to FIGS.

（２−４）′飲食店共起分析
制御ユニット１は、ステップＳ６４において飲食店共起分析部１３４を起動し、この飲食店共起分析部１３４の制御の下、以下のように訪問分析を行う。すなわち、先ずステップＳ６４１において、上記詳細履歴分析部１３２により抽出された閲覧チャンクと、ＧＰＳ移動履歴分析部１３３により抽出された滞在チャンクに基づいて、各閲覧チャンクに出現した飲食店が、その閲覧時刻から滞在開始時刻が一定時間以内（例えば一週間）の滞在チャンクにおいて訪問の可能性がある飲食店として出現した場合に、この飲食店を、対象としている閲覧チャンクにおける共起飲食店として抽出する。 (2-4) ′ Restaurant co-occurrence analysis The control unit 1 activates the restaurant co-occurrence analysis unit 134 in step S64, and performs the visit analysis as described below under the control of the restaurant co-occurrence analysis unit 134. Do. That is, first, in step S641, a restaurant that appears in each browsing chunk based on the browsing chunk extracted by the detailed history analysis unit 132 and the stay chunk extracted by the GPS movement history analysis unit 133 has its browsing time. When the stay start time within a certain time (for example, one week) appears as a restaurant that may be visited, the restaurant is extracted as a co-occurring restaurant in the target viewing chunk.

またそれと共に、各滞在チャンクにおいて訪問の可能性があるとして出現した飲食店について、当該飲食店が上記滞在チャンクにおける滞在開始時刻から閲覧時刻が一定時間以内（例えば一週間）の閲覧チャンクにおいて出現した場合に、この飲食店を対象としている滞在チャンクにおける共起飲食店として抽出する。
そして、この共起飲食店を、ユーザがＷｅｂサイトを詳細に閲覧した飲食店の情報を契機として訪問した、もしくは訪問を契機としてＷｅｂサイトを詳細に閲覧した可能性がある飲食店と推測する。このような飲食店はユーザの嗜好に強く関係すると仮定することができる。 At the same time, with regard to a restaurant that appeared as a possibility of visiting in each stay chunk, the restaurant appeared in a browse chunk whose browsing time was within a certain time (for example, one week) from the stay start time in the stay chunk. In this case, the restaurant is extracted as a co-occurrence restaurant in the stay chunk targeting this restaurant.
And this co-occurrence restaurant is guessed to be a restaurant which the user visited with the information of the restaurant which browsed the website in detail as a trigger, or the website with the possibility of browsing the website in detail with the visit as a trigger. It can be assumed that such restaurants are strongly related to user preferences.

但し、ユーザが一定時間内に繁華街等に頻繁に滞留し、多数の飲食店に近接した場合にも、上記共起飲食店は抽出される。この種の飲食店は、偶然共起した可能性があるものであり、排除する必要がある。
そこで、各共起飲食店が一定期間内の滞在チャンクで訪問可能性飲食店となったことの珍しさを、無作為抽出からの乖離により以下のように算出する。 However, the above-mentioned co-occurrence restaurants are also extracted when the user frequently stays in a downtown area within a certain time and approaches a large number of restaurants. This type of restaurant may have happened by chance and needs to be excluded.
Therefore, the rarity of each co-occurring restaurant becoming a visitable restaurant in a stay chunk within a certain period of time is calculated as follows from the deviation from random sampling.

すなわち、ステップＳ６４２において、閲覧チャンクの閲覧時刻から一定時間内の滞在チャンクの訪問可能性飲食店の全てを、上記参照飲食店集合作成部１３１により作成された参照飲食店集合からの標本と捉える。そして、タグｉの閲覧チャンクｊにおける共起飲食店による無作為抽出との乖離確率ｐijco を、

として算出する。 That is, in step S642, all the visiting possibility restaurants of the stay chunk within a certain time from the browsing time of the browsing chunk are regarded as samples from the reference restaurant set created by the reference restaurant set creation unit 131. And the deviation probability pijco from the random sampling by the co-occurrence restaurant in the browsing chunk j of the tag i,

Calculate as

なお、ｂj は閲覧チャンクｊに出現した詳細閲覧店数、ｒは参照飲食店集合の飲食店数、Ｓj は閲覧チャンクj の時刻から一定期間内の滞在チャンクを示す識別子の集合である。ｃijcoは閲覧チャンクj においてタグｉが付与された詳細閲覧店であり、閲覧時刻から一定時間内の滞在チャンク（Ｓj に含まれる滞在チャンク）群の訪問可能性飲食店と共起した数である。 In addition, bj is the number of detailed browsing stores that have appeared in the browsing chunk j, r is the number of restaurants in the reference restaurant set, and Sj is a set of identifiers indicating stay chunks within a certain period from the time of the browsing chunk j. cijco is a detailed browsing store to which the tag i is assigned in the browsing chunk j, and is the number that co-occurs with the visiting possibility restaurants of the stay chunks (stay chunks included in Sj) within a certain time from the browsing time.

また、同様に滞在チャンクの訪問可能性飲食店について、滞在時刻から一定時間内の閲覧チャンクに出現した飲食店との共起に対して同様の処理を行う。すなわち、タグｉの滞在チャンクｋにおける共起飲食店による無作為抽出との乖離確率ｐikcoを、

として算出する。 In the same manner, the same processing is performed for the co-occurrence of the staying possibility visitor restaurant with the restaurant appearing in the browsing chunk within a certain time from the stay time. In other words, the deviation probability pikco from the random sampling by the co-occurrence restaurant in the stay chunk k of tag i,

Calculate as

なお、ｓk は滞在チャンクk に出現した訪問可能性飲食店数、Ｂk は滞在チャンクk の時刻から一定期間内の閲覧チャンク群の識別子の集合である。また、ｃikcoは滞在チャンクk においてタグｉが付与された訪問可能性飲食店であり、閲覧時刻から一定時間内の閲覧チャンク（Ｂk に含まれる閲覧チャンク）の詳細閲覧飲食店と共起した数である。 Here, sk is the number of restaurants that can be visited in the stay chunk k, and Bk is a set of identifiers of browsing chunk groups within a certain period from the time of the stay chunk k. In addition, cikco is a visit possibility restaurant where tag i is given in stay chunk k, and is the number that co-occurs with the detailed browsing restaurants within a certain time from the browsing time (viewing chunks included in Bk). is there.

（２−５）′嗜好スコアの計算
制御ユニット１は、ステップＳ６５において嗜好スコア計算部１３５を起動し、この嗜好スコア計算部１３５の制御の下、各タグに対するユーザの嗜好スコアを以下のように計算する。 (2-5) ′ Preference Score Calculation The control unit 1 activates the preference score calculation unit 135 in step S65, and under the control of the preference score calculation unit 135, the user preference score for each tag is as follows. calculate.

すなわち、先ずステップＳ６５１において、上記詳細閲覧履歴分析部１３２、ＧＰＳ移動履歴分析部１３３及び飲食店共起分析部１３４によりそれぞれ計算された各乖離確率について、履歴が収集前の初期状態をｔ＝１とし、履歴チャンクの抽出順にｔをインクリメントした識別子を与えることとする。このとき、時点ｔにおいて抽出された滞在チャンクと閲覧履歴チャンクの乖離確率をｐit、ｐitcoとし、タグｉに対する嗜好スコアを、

とする。 That is, first, in step S651, for each divergence probability calculated by the detailed browsing history analysis unit 132, the GPS movement history analysis unit 133, and the restaurant co-occurrence analysis unit 134, the initial state before the history is collected is t = 1. And an identifier obtained by incrementing t in the order in which history chunks are extracted is given. At this time, the divergence probabilities of the stay chunk and the browsing history chunk extracted at time t are pit and pitco, and the preference score for the tag i is

And

この嗜好スコアＺitが大きいほど、タグｉに対するユーザの嗜好が強いと見なせる。なお、φ-1(p) は標準正規分布の累積密度関数の逆数、ｗt はチャンクｔの重要度を示している。また、式（１３）はStouffer’s Z-score methodに基づいており、各結合対象の重要度に対して自然な形で重みを導入し確率を結合できる。すなわち、上記飲食店情報と各概念との関連度及び各概念の重要度の少なくとも１つに応じて上記嗜好スコアｚを重み付けする。 It can be considered that the user's preference for the tag i is stronger as the preference score Zit is larger. Φ-1 (p) is the reciprocal of the cumulative density function of the standard normal distribution, and w t is the importance of the chunk t. Also, equation (13) is based on the Stouffer's Z-score method, and the probability can be combined by introducing a weight in a natural manner with respect to the importance of each combination target. That is, the preference score z is weighted according to at least one of the degree of association between the restaurant information and each concept and the importance of each concept.

さらに、この処理は滞在チャンクもしくは閲覧チャンクの追加に伴い逐次更新可能とし、計算リソースを効率化している。具体的には、逐次処理において保持が必要なデータは、各タグのｘit，ｙit、及び現時点から過去の一定時間までの閲覧履歴チャンクと滞在チャンクに出現した店舗ＩＤ群のみである。 Furthermore, this processing can be updated sequentially with the addition of stay chunks or browsing chunks, thereby increasing the computational resource efficiency. Specifically, the data that needs to be retained in the sequential processing is only the xit and yit of each tag and the store ID group that appears in the browsing history chunk and the stay chunk from the present time to the past certain time.

［第２の実施形態の効果］
以上述べたように第２の実施形態によれば、「店舗情報閲覧後に訪問した近接店舗」、及び「店舗訪問後の店舗情報閲覧」を別にモデルとして分析したので、店舗情報閲覧の前後の近接する店舗への訪問のみをモデルとして分析するようにした第１の実施形態に比べ、さらに詳細な分析を行うことが可能となる。また第１の実施形態では、店舗情報閲覧前の近接する店舗への訪問を分析するために、チャンクの追加毎の逐次更新ができなかったが、分析結果のチャンク追加ごとの逐次更新を可能としたことにより、計算コストやメモリ等のコンピュータリソースを低減することができる。 [Effects of Second Embodiment]
As described above, according to the second embodiment, “proximity store visited after browsing store information” and “store information browsing after store visit” are analyzed as separate models, so proximity before and after store information browsing Compared to the first embodiment in which only a visit to a store to be analyzed is used as a model, a more detailed analysis can be performed. Further, in the first embodiment, in order to analyze a visit to a neighboring store before browsing store information, sequential update for each addition of chunks could not be performed, but sequential update for each addition of chunks of analysis results can be performed. Thus, computer resources such as calculation cost and memory can be reduced.

（その他の実施形態）
なお、この発明は上記各実施形態に限定されるものではない。例えば、前記第１の実施形態では嗜好学習処理部１３の飲食店共起分析部１３４において、飲食店の推薦リストを提供する前の段階では、ユーザが任意に飲食店のＷｅｂサイトを閲覧して取得した詳細閲覧情報をもとに、当該ユーザが実際にその飲食店へ訪問した可能性を分析に反映するようにした。しかし、それに限らず飲食店推薦処理部１４により作成された推薦リストがユーザに配信され、ユーザがこの推薦リストをもとに飲食店のＷｅｂサイトに対しアクセスして取得した詳細閲覧情報を上記飲食店共起分析に反映するようにしてもよい。 (Other embodiments)
The present invention is not limited to the above embodiments. For example, in the first embodiment, in the restaurant co-occurrence analysis unit 134 of the preference learning processing unit 13, the user arbitrarily browses the restaurant website before providing the restaurant recommendation list. Based on the acquired detailed browsing information, the possibility that the user actually visited the restaurant was reflected in the analysis. However, the recommendation list created by the restaurant recommendation processing unit 14 is not limited to this, and the detailed browsing information acquired by accessing the website of the restaurant based on this recommendation list is distributed to the user. It may be reflected in the store co-occurrence analysis.

また、前記実施形態では推薦対象となるスポットとして飲食店を例にとって説明したが、情報閲覧して実際に訪問するスポットであれば他のスポットでもよい。例えば、週末等の余暇に訪問するスポットが挙げられる。その一例としては、公園、遊園地、動物園、各種シッピング施設、ゲームセンタ、カラオケハウスが考えられる。これらのスポットについて、その評価、属性、設備等をタグとしてデータベースに蓄積し、ユーザによる情報閲覧履歴とＧＰＳ移動履歴を分析してユーザの嗜好を学習し、推薦リストを作成することで、ユーザの好みにあったスポットをユーザに提供することが可能となる。その他、宿泊施設や旅行情報の検索、観光地ガイドサービス、不動産情報検索、結構式場検索などへの応用が考えられる。 In the above embodiment, a restaurant has been described as an example of a spot to be recommended. However, other spots may be used as long as the spot is visited by browsing information. For example, a spot visited during leisure time such as a weekend. As an example, a park, an amusement park, a zoo, various shipping facilities, a game center, and a karaoke house can be considered. About these spots, the evaluation, attributes, facilities, etc. are accumulated in the database as tags, the user's preference is analyzed by analyzing the information browsing history and GPS movement history by the user, and the recommendation list is created. It is possible to provide a spot that suits the user's preference. In addition, it can be applied to search for accommodation facilities and travel information, sightseeing spot guide service, real estate information search, and ceremonial hall search.

さらに、前記各乖離確率は、超幾何分布、二項分布または正規分布により算出するようにしてもよい。その他、推薦サーバＳＶの構成や、ＧＰＳ移動履歴及び閲覧履歴の収集処理、嗜好学習処理、飲食店推薦処理の処理手順と処理内容等についても、この発明の要旨を逸脱しない範囲で種々変形して実施可能である。 Further, each divergence probability may be calculated by a hypergeometric distribution, binomial distribution or normal distribution. In addition, the configuration of the recommendation server SV, the collection process of the GPS movement history and the browsing history, the preference learning process, the processing procedure and processing contents of the restaurant recommendation process, and the like are variously modified without departing from the gist of the present invention. It can be implemented.

要するにこの発明は、上記各実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記各実施形態に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、各実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。さらに、異なる実施形態に亘る構成要素を適宜組み合せてもよい。 In short, the present invention is not limited to the above-described embodiments as they are, and can be embodied by modifying the components without departing from the scope of the invention in the implementation stage. Moreover, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the above embodiments. For example, some components may be deleted from all the components shown in each embodiment. Furthermore, you may combine suitably the component covering different embodiment.

ＳＶ…推薦サーバ、ＵＴ１〜ＵＴｎ…ユーザ端末、ＮＷ…通信ネットワーク、１…制御ユニット、２…記憶ユニット、１１…ＧＰＳ移動履歴収集部、１２…詳細閲覧履歴収集部、１３…嗜好学習処理部、１４…飲食店推薦処理部、２１…ＧＰＳ移動履歴記憶部、２２…詳細閲覧履歴記憶部、２３…嗜好データベース、２４…飲食店データベース、２５…推薦リスト記憶部、１３１…参照飲食店集合作成部、１３２…詳細閲覧履歴分析部、１３３…ＧＰＳ移動履歴分析部、１３４…飲食店共起分析部、１３５…嗜好スコア計算部、１４１…候補飲食店集合作成部、１４２…飲食店スコア計算部、１４３…推薦リスト作成部。 SV: recommendation server, UT1 to UTn ... user terminal, NW ... communication network, 1 ... control unit, 2 ... storage unit, 11 ... GPS movement history collection unit, 12 ... detailed browsing history collection unit, 13 ... preference learning processing unit, DESCRIPTION OF SYMBOLS 14 ... Restaurant recommendation processing part, 21 ... GPS movement history storage part, 22 ... Detailed browsing history storage part, 23 ... Preference database, 24 ... Restaurant database, 25 ... Recommendation list storage part, 131 ... Reference restaurant group creation part 132 ... Detailed browsing history analysis unit, 133 ... GPS movement history analysis unit, 134 ... Restaurant co-occurrence analysis unit, 135 ... Preference score calculation unit, 141 ... Candidate restaurant set creation unit, 142 ... Restaurant score calculation unit, 143: A recommendation list creation unit.

Claims

An information recommendation method for analyzing a user's interest in a plurality of geographically dispersed spots by a computer and creating information on a spot recommended for the user based on the analysis result,
A first step in which the computer acquires information representing the browsing history of the user for a browsing site that posts information on the plurality of spots, and stores the information in a first storage medium;
A second process in which the computer acquires information representing a geographical movement history of the user and stores the information in a second storage medium;
Based on the browsing history information stored in the first storage medium, the computer generates a browsing chunk by clustering a plurality of spot browsing information included in the browsing history information on condition that the browsing time is close A third process to
The computer determines a reference area where the user can visit based on movement history information stored in the second storage medium, information input in advance by the user or connection information with a wireless base station, A fourth step of extracting, from the stored spot information, a set of spot information included in the determined reference area as a reference spot set;
The computer includes a number a of spot browsing information included in the generated browsing chunk, a number n of spot browsing information having a predetermined concept among the spot browsing information groups included in the browsing chunk, and the reference spot set. The spot browsing information in which the concept appears in the browsing chunk based on the number S of spot information included in the reference spot and the number N of spot information to which the concept is assigned in the spot information group included in the reference spot set The first probability that the number of is equal to or greater than n and the second probability that is less than n are calculated, respectively, and the cumulative distribution function of the standard normal distribution is calculated based on the calculated first and second probabilities. the inverse function, and a fifth step of calculating an interest score z representing the degree of interest of the user with respect to the concept
Information recommendation method characterized by comprising the.

Based on the movement history information stored in the second storage medium, the computer generates a stay chunk by clustering a plurality of stay information included in the movement history information on the condition of distance proximity, Further comprising a step of associating spots with the stay chunks on condition of proximity.
In the fifth step, the number of spot information included in the generated stay chunk is set to the a, and the number of spot information having a predetermined concept among the spot information groups included in the stay chunk is set to the n. The information recommendation method according to claim 1, wherein the interest score z is calculated.

Based on the movement history information stored in the second storage medium, the computer generates a stay chunk by clustering a plurality of stay information included in the movement history information on the condition of distance proximity, Further comprising a step of associating spots with the stay chunks on condition of proximity.
In the fifth step, the number of spot browsing information appearing in the browsing chunk is set as the N, and a stay chunk group existing within a predetermined time from the browsing time of the browsing chunk is set as a co-occurrence chunk. The number of spot groups associated with stay chunks in a co-occurrence chunk on condition that the distance is close is set as a, and among the spot groups associated with stay chunks within the predetermined time, the browsing chunks The information recommendation method according to claim 1, wherein the interest score z is calculated by setting n as the number of spots that appear on the screen.

Based on the movement history information stored in the second storage medium, the computer generates a stay chunk by clustering a plurality of stay information included in the movement history information on the condition of distance proximity, Further comprising the step of associating a spot group with this stay chunk on condition of close distance;
In the fifth process, the number S of spot information included in the reference spot set is set to a, and the number of spot information in the browsing chunk is set to a, and exists within a predetermined time from the browsing time of the browsing chunk. The number of spot information appearing in the stay chunk group is N, and in the spot information appearing in the browsing chunk, the spot information group co-occurring with the spot information appearing in the stay chunk having the stay start time within a certain time is shared. 2. The information recommendation method according to claim 1, wherein the interest score z is calculated as a starting chunk, where n is the number of spots in the co-occurring chunk.

Based on the movement history information stored in the second storage medium, the computer generates a stay chunk by clustering a plurality of stay information included in the movement history information on the condition of distance proximity, Further comprising the step of associating a spot group with this stay chunk on condition of close distance;
In the fifth step, the number S of spot information included in the reference spot set is set as described above, the number of spot information in the browsing chunk is a, and a predetermined time set in advance from the stay start time of the stay chunk. The number of spot information appearing in the browsing chunk group whose inside is the browsing time is N, and in the spot information group appearing in the stay chunk, the spot information group co-occurring with the spot information appearing in the browsing chunk within a certain time The information recommendation method according to claim 1, wherein the interest score z is calculated as a co-occurrence chunk, where n is the number of spots in the co-occurrence chunk.

In the fifth step, for the browsing chunk, stay chunk, or co-occurrence chunk, the interest according to at least one of a user operation history, a time and situation of each chunk, and a degree of association between the spot and the concept. 6. The information recommendation method according to claim 1, wherein the score z is weighted.

The information recommendation method according to claim 1, wherein the fifth step calculates the first probability and the second probability by a hypergeometric distribution, a binomial distribution, or a normal distribution.

For spot information in which one or more concepts appear, the method further includes a sixth step of calculating a user evaluation score for the spot information using the interest score z of each concept that appears in the spot information. The information recommendation method according to claim 1.

9. The information recommendation method according to claim 8, wherein in the sixth step, the interest score z is weighted according to at least one of the degree of association between the spot information and each concept and the importance of each concept. .

An information recommendation device that analyzes a user's interest in a plurality of geographically dispersed spots and creates information on spots recommended to the user based on the analysis result,
Means for acquiring and storing in a first storage medium information representing the user's browsing history for a browsing site that posts information relating to the plurality of spots, respectively.
Means for acquiring information representing the geographical movement history of the user and storing it in a second storage medium;
Based on the browsing history information stored in the first storage medium, a means for clustering a plurality of spot browsing information included in the browsing history information on the condition of near browsing time and generating a browsing chunk;
A reference area that the user can visit is determined based on the movement history information stored in the second storage medium, and the spot information included in the determined reference area is determined from the spot information stored in advance. Means for extracting the set as a reference spot set;
The number a of the spot browsing information included in the generated browsing chunk, the number n of the spot browsing information having a predetermined concept among the spot browsing information group included in the browsing chunk, and the spot included in the reference spot set Based on the number S of information and the number N of spot information to which the concept is given among the spot information groups included in the reference spot set, the number of spot browsing information in which the concept appears in the browsing chunk is: A first probability that is greater than or equal to n and a second probability that is less than n are calculated, respectively, and based on the calculated first and second probabilities, an inverse function of a cumulative distribution function of a standard normal distribution is obtained . Means for calculating an interest score z representing the degree of interest of the user with respect to the concept ;
Information recommendation apparatus characterized by comprising a.

The program which makes the said computer perform the process of each process with which the information recommendation method in any one of Claims 1 thru | or 9 is provided.