JP2008269427A

JP2008269427A - Information retrieval device, information retrieval method, information retrieval program, and computer readable recording medium recorded with information retrieval program

Info

Publication number: JP2008269427A
Application number: JP2007113475A
Authority: JP
Inventors: Makoto Muto; 誠武藤; Hiroshi Takeno; 浩竹野; Koichi Ushijima; 浩一牛島; Junji Tomita; 準二富田; Yuji Kaneda; 有二金田
Original assignee: NTT Resonant Inc
Current assignee: NTT Resonant Inc
Priority date: 2007-04-23
Filing date: 2007-04-23
Publication date: 2008-11-06
Anticipated expiration: 2027-04-23
Also published as: JP4991379B2

Abstract

<P>PROBLEM TO BE SOLVED: To display contents in an order reflecting a user's taste. <P>SOLUTION: The contents including a retrieval word for retrieving a content and a content identifier corresponding to the content are retrieved from a content storing part 301, a sentence adaptation degree indicating a frequency in which the content retrieved by the retrieval word appears is calculated, a content identifier having the same content identifier with the retrieved content identifier is retrieved from a browsing history log 302, a browsing popularity degree indicating the frequency of browsing content of the retrieved content identifier is calculated, and a content identifier retrieved on the basis of a result obtained by combining the calculated sentence adaptation degree and browsing popularity degree is outputted. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、コンテンツを検索する技術に関する。 The present invention relates to a technology for searching for content.

従来、インターネット上に設けられた情報検索システムでは、文章や図面等のコンテンツをデータベースに蓄積し、利用者により与えられた検索キーワードに関連するコンテンツをそのデータベースから取り出して、検索結果としてユーザに提示する技術が用いられていた。 Conventionally, in an information search system provided on the Internet, contents such as sentences and drawings are accumulated in a database, contents related to a search keyword given by a user are taken out from the database, and presented as search results to a user. Technology was used.

そして、検索対象であるコンテンツを提示する際には、通常、検索式に含まれる各検索語の使用頻度に基づいて、検索式と検索結果の各コンテンツとの文章関連度を求め、その文章関連度の高い順番でコンテンツを表示する方法が利用されている（非特許文献１参照）。 When presenting the content that is the search target, usually, the text relevance between the search formula and each content of the search result is obtained based on the frequency of use of each search term included in the search formula, and the text related A method of displaying contents in descending order is used (see Non-Patent Document 1).

しかしながら、検索語の使用頻度が高い場合であっても、利用者が期待するコンテンツであるとは限らない。逆に、使用頻度が低いコンテンツであっても、期待するコンテンツである可能性もある。故に、文章関連度を用いて順序付けられた検索結果は、利用者の所望する順番で表示された検索結果に一致するものではなく、所望するコンテンツに辿り着く迄には多くの試行錯誤や時間を利用者に与えるという問題があった。 However, even if the search term is frequently used, the content is not necessarily expected by the user. Conversely, even content that is used infrequently may be expected content. Therefore, the search results ordered using the text relevance level do not match the search results displayed in the order desired by the user, and it takes a lot of trial and error and time until the desired content is reached. There was a problem of giving to users.

そこで、特許文献１及び特許文献２では、利用者による閲覧要求の回数を記録し、この回数を用いて前述の文章関連度を補正することで、ユーザの求めるコンテンツをより上位に表示する技術が開示されている。これにより、閲覧要求の多いコンテンツはより上位に表示され、閲覧要求の少ないコンテンツはより下位に表示されることになる。
特開２００５−１９０４５３号公報特開２００６−３３８１００号公報北研二著、「情報検索アルゴリズム」、共立出版、2002年 Therefore, in Patent Document 1 and Patent Document 2, there is a technique for displaying the content requested by the user at a higher level by recording the number of browsing requests made by the user and correcting the above-described sentence relevance using this number of times. It is disclosed. As a result, content with a high browsing request is displayed at a higher level, and content with a low browsing request is displayed at a lower level.
JP 2005-190453 A JP 2006-338100 A Kitakenji, “Information Retrieval Algorithm”, Kyoritsu Shuppan, 2002

しかしながら、多様な言語が多く用いられており、且つ日常の生活に密着して頻繁に検索されるコンテンツ、例えば、天気予報や全国の地名などのコンテンツに、利用者が入力した検索語が一語でも含まれている場合には、それらのコンテンツが常に優先されて提示されるため、必ずしも利用者の期待するコンテンツが優先的に表示されるとは限らないという問題があった。 However, a variety of languages are often used, and content that is frequently searched in close contact with daily life, such as weather forecasts and national place names, is a single search term entered by the user. However, if the content is included, the content is always prioritized and presented, so that the content expected by the user is not always displayed preferentially.

また、特定の利用者が意図的に大量の閲覧を行った場合でも閲覧要求回数に反映されるため、本来の閲覧要求とは異なり、その意図的なコンテンツが優先的に表示されるという問題があった。 In addition, even if a specific user intentionally browses a large amount, it is reflected in the number of browsing requests, so unlike the original browsing request, the intentional content is displayed with priority. there were.

本発明は、上記を鑑みてなされたものであり、ユーザの趣向を反映したコンテンツを表示することを課題とする。 This invention is made in view of the above, and makes it a subject to display the content reflecting a user's taste.

請求項１に記載の本発明は、コンテンツと当該コンテンツを識別可能な第１コンテンツ識別子とを関連付けて格納しておくコンテンツ格納手段と、閲覧されたコンテンツの第２コンテンツ識別子を格納しておく閲覧履歴格納手段と、コンテンツを検索する検索語を受信する検索語受信手段と、前記コンテンツ格納手段から前記コンテンツと前記第１コンテンツ識別子とを読み出して、前記検索語を含むコンテンツと当該コンテンツに対応する第１コンテンツ識別子とを検索し、当該検索語が当該コンテンツに出現する頻度を示す文章適合度を算出するコンテンツ検索手段と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、検索された前記第１コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該第１コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する第１閲覧履歴検索手段と、前記文章適合度と前記閲覧人気度とを合成する適合度合成手段と、合成された結果に基づいて、検索された前記第１コンテンツ識別子を出力する検索結果出力手段と、を有することを要旨とする。 The present invention according to claim 1 is a content storage means for storing a content and a first content identifier that can identify the content in association with each other, and a browsing for storing a second content identifier of the browsed content. A history storage means, a search word receiving means for receiving a search word for searching for content, and the content and the first content identifier are read from the content storage means, and the content containing the search word and the content The first content identifier is searched, the content search means for calculating the sentence matching degree indicating the frequency at which the search term appears in the content, and the second content identifier is read from the browsing history storage means and searched. A second content identifier having the same content identifier as the first content identifier is detected. A first browsing history search unit that calculates a browsing popularity degree indicating a frequency of browsing the content of the first content identifier, a matching degree combining unit that combines the sentence matching degree and the browsing popularity degree, And a search result output means for outputting the searched first content identifier based on the result.

本発明にあっては、コンテンツを検索する検索語を含むコンテンツと該コンテンツに対応するコンテンツ識別子とをコンテンツ格納手段から検索して、この検索語により検索されたコンテンツに出現する頻度を示す文章適合度を算出すると共に、検索されたコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を閲覧履歴格納手段から検索し、検索されたコンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出し、算出された文章適合度と閲覧人気度とを合成した結果に基づいて検索されたコンテンツ識別子を出力するため、利用者の趣向を反映したコンテンツを表示することができる。 In the present invention, the text matching indicating the frequency of appearance in the content searched for by searching the content storing means for the content including the search word for searching the content and the content identifier corresponding to the content. A content identifier having the same content identifier as the searched content identifier is searched from the browsing history storage means, and a browsing popularity degree indicating the frequency of browsing the content of the searched content identifier is calculated and calculated. Since the retrieved content identifier is output based on the result of combining the sentence suitability and the browsing popularity, the content reflecting the user's preference can be displayed.

請求項２に記載の本発明は、前記閲覧履歴格納手段が、前記第２コンテンツ識別子を行単位で格納するものであって、前記閲覧人気度は、検索された前記第１コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子の行数を用いて計算されることを要旨とする。 In the present invention according to claim 2, the browsing history storage means stores the second content identifier in units of rows, and the browsing popularity is the same content as the searched first content identifier. The gist is that the calculation is performed using the number of rows of the second content identifier having the identifier.

請求項３に記載の本発明は、前記適合度合成手段が、前記文章適合度と前記閲覧人気度とを合成した得点を計算するものであって、前記検索結果出力手段は、当該得点の高い順番で前記第１コンテンツ識別子を出力することを要旨とする。 The present invention according to claim 3, wherein the matching degree synthesizing unit calculates a score obtained by synthesizing the sentence matching degree and the browsing popularity degree, and the search result output unit has a high score. The gist is to output the first content identifier in order.

本発明にあっては、適合度合成手段が、文章適合度と閲覧人気度とを合成した得点を計算するものであって、検索結果出力手段は、得点の高い順番でコンテンツ識別子を出力するため、利用者によって入力された検索語との適合度が大きく、且つ人気のあるコンテンツ識別子を優先して利用者に提供することができる。 In the present invention, the matching degree synthesizing unit calculates a score obtained by synthesizing the sentence matching degree and the browsing popularity degree, and the search result output unit outputs the content identifiers in the order of the highest score. Therefore, it is possible to preferentially provide a user with a popular content identifier that has a high degree of matching with a search term input by the user.

請求項４に記載の本発明は、前記閲覧履歴格納手段が、前記第２コンテンツ識別子と当該第２コンテンツ識別子のコンテンツが閲覧された閲覧時刻とを関連付けて格納しておくものであって、前記第１閲覧履歴検索手段は、前記第２コンテンツ識別子を検索した後、当該第２コンテンツ識別子に対応する前記閲覧時刻を当該閲覧履歴格納手段から読み出して、当該閲覧時刻から現在までの経過時間を示す閲覧新鮮度を算出する算出手段と、当該閲覧新鮮度を前記閲覧人気度に加算する加算手段と、を更に有することを要旨とする。 The present invention according to claim 4, wherein the browsing history storage means stores the second content identifier in association with a browsing time when the content of the second content identifier is browsed. The first browsing history search means, after searching for the second content identifier, reads the browsing time corresponding to the second content identifier from the browsing history storage means, and indicates the elapsed time from the browsing time to the present. The gist of the invention is to further include a calculating means for calculating the browsing freshness and an adding means for adding the browsing freshness to the browsing popularity.

本発明にあっては、検索語により検索されたコンテンツ識別子と同じコンテンツ識別子が閲覧された時刻から現在までの経過時間を示す閲覧新鮮度を閲覧人気度に加算するため、本来の人気の度合いを反映したコンテンツ識別子を利用者に提供することができる。 In the present invention, since the browsing freshness indicating the elapsed time from the time when the same content identifier as the content identifier searched by the search word is browsed to the present is added to the browsing popularity, the original popularity degree is set. The reflected content identifier can be provided to the user.

請求項５に記載の本発明は、前記閲覧履歴格納手段が、前記第２コンテンツ識別子と当該第２コンテンツ識別子のコンテンツが閲覧された閲覧時刻と当該コンテンツを閲覧した利用者の利用者識別子とを関連付けて格納しておくものであって、前記第１閲覧履歴検索手段は、前記第２コンテンツ識別子を検索した後、当該第２コンテンツ識別子に対応する前記閲覧時刻及び前記利用者識別子を当該閲覧履歴格納手段から読み出して、当該閲覧時刻と当該利用者識別子との関係を示すリストＡを作成する作成手段と、前記リストＡの各利用者識別子に対して、当該閲覧時刻から現在までの経過時間を示す閲覧新鮮度を算出する算出手段と、前記リストＡの全ての利用者識別子に対する前記閲覧新鮮度を合計し、合計された閲覧新鮮度を前記閲覧人気度に加算する加算手段と、を更に有することを要旨とする。 In the present invention described in claim 5, the browsing history storage means includes the second content identifier, the browsing time when the content of the second content identifier was browsed, and the user identifier of the user who browsed the content. The first browsing history search means searches for the second content identifier, and then stores the browsing time and the user identifier corresponding to the second content identifier in the browsing history. Read from the storage means, create a list A showing the relationship between the browsing time and the user identifier, and for each user identifier of the list A, the elapsed time from the browsing time to the present A calculation means for calculating the browsing freshness to be displayed, and the browsing freshness for all user identifiers in the list A are totaled, and the total browsing freshness is calculated as the browsing Adding means for adding the air permeability, it is summarized as further comprising a.

請求項６に記載の本発明は、前記算出手段が、前記第２コンテンツ識別子に対応する閲覧時刻のうち、最も新しい閲覧時刻を前記閲覧履歴格納手段から読み出して、前記閲覧新鮮度を算出することを要旨とする。 In the present invention according to claim 6, the calculating means reads the latest browsing time from the browsing history storage means among the browsing times corresponding to the second content identifier, and calculates the browsing freshness. Is the gist.

本発明にあっては、最も新しい閲覧時刻を用いて閲覧新鮮度を算出するため、本来の人気の度合いをより確実に反映したコンテンツ識別子を利用者に提供することができる。 In the present invention, since the freshness of browsing is calculated using the newest browsing time, it is possible to provide the user with a content identifier that more reliably reflects the original popularity.

請求項７に記載の本発明は、前記閲覧新鮮度の値が、前記経過時間が長いほど小さい値であり、前記経過時間が短いほど大きい値であることを要旨とする。 The gist of the present invention described in claim 7 is that the browsing freshness value is a smaller value as the elapsed time is longer, and a larger value as the elapsed time is shorter.

請求項８に記載の本発明は、利用者の利用者識別子と当該利用者の属性とを関連付けて格納しておく利用者属性格納手段を更に有し、前記閲覧履歴検索部は、利用者の属性を受信した場合に、前記利用者識別子と前記属性とを前記利用者属性格納手段から読み出して、受信した前記属性と同じ属性を持つ利用者識別子を検索し、検索された当該利用者識別子と当該属性との関係を示すリストＢを作成する作成手段と、前記リストＡから前記リストＢに格納されていない利用者識別子と当該利用者識別子に対応する閲覧時刻とを削除する削除手段と、を更に有し、前記算出手段は、削除後のリストＡを用いて前記閲覧新鮮度を算出することを要旨とする。 The present invention according to claim 8 further includes user attribute storage means for associating and storing a user identifier of the user and the attribute of the user, wherein the browsing history search unit includes When an attribute is received, the user identifier and the attribute are read from the user attribute storage means, a user identifier having the same attribute as the received attribute is searched, and the searched user identifier and Creating means for creating a list B indicating a relationship with the attribute; and deleting means for deleting a user identifier not stored in the list B and a browsing time corresponding to the user identifier from the list A. In addition, the gist is to calculate the browsing freshness using the list A after deletion.

本発明にあっては、受信した利用者の属性と同じ属性を持つ利用者識別子を利用者属性格納手段から検索し、検索された利用者識別子とその属性との関係を示すリストＢを作成し、既に作成されたリストＡからリストＢに格納されていない利用者識別子とこの利用者識別子に対応する閲覧時刻とを削除したリストＡを用いて閲覧新鮮度を算出するため、利用者の属性に近い利用者に人気のあるコンテンツを優先して提示することができる。 In the present invention, a user identifier having the same attribute as the received user attribute is searched from the user attribute storage means, and a list B indicating the relationship between the searched user identifier and the attribute is created. In order to calculate the browsing freshness using the list A in which the user identifiers not stored in the list B and the browsing time corresponding to the user identifiers are deleted from the list A that has already been created, Content that is popular with nearby users can be preferentially presented.

請求項９に記載の本発明は、前記第１閲覧履歴検索手段が、削除後における前記リストＡの利用者識別子の総数が所定の閾値よりも少ない場合に、検索された前記第１コンテンツ識別子の一部と一致するコンテンツ識別子に対応する利用者識別子を前記閲覧履歴格納手段から読み出して、読み出した利用者識別子と当該利用者識別子に対応する閲覧時刻とを前記リストＡに追加することを要旨とする。 The present invention according to claim 9 is the first browsing history search unit, when the total number of user identifiers in the list A after deletion is less than a predetermined threshold, A gist is that a user identifier corresponding to a content identifier that matches a part is read from the browsing history storage unit, and the read user identifier and a browsing time corresponding to the user identifier are added to the list A. To do.

本発明にあっては、リストＡの利用者識別子の総数が所定の閾値よりも少ない場合に、コンテンツ検索手段により検索されたコンテンツ識別子の一部と一致するコンテンツ識別子に対応する利用者識別子を閲覧履歴格納手段から読み出して、読み出した利用者識別子とこの利用者識別子に対応する閲覧時刻とをリストＡに追加するため、より多くの関連する閲覧情報を用いて閲覧新鮮度を計算することができる。 In the present invention, when the total number of user identifiers in the list A is smaller than a predetermined threshold, the user identifier corresponding to the content identifier that matches a part of the content identifier searched by the content search means is viewed. Since reading from the history storage means and adding the read user identifier and the browsing time corresponding to this user identifier to the list A, the browsing freshness can be calculated using more related browsing information. .

請求項１０に記載の本発明は、前記閲覧履歴格納手段が、前記第２コンテンツ識別子のコンテンツが閲覧された閲覧時間を更に格納しておくものであって、前記算出手段は、前記リストＡの各利用者識別子に対応する閲覧時間を前記閲覧履歴格納手段から読み出して、当該閲覧時間を更に用いて前記閲覧新鮮度を計算することを要旨とする。 According to a tenth aspect of the present invention, the browsing history storage means further stores a browsing time when the content of the second content identifier is browsed, and the calculation means stores the list A in the list A. The gist is to read the browsing time corresponding to each user identifier from the browsing history storage means and calculate the browsing freshness by further using the browsing time.

本発明にあっては、リストＡの各利用者識別子に対応する閲覧時間を閲覧履歴格納手段から読み出して、この閲覧時間を更に用いて閲覧新鮮度を計算するため、本来の人気の度合いを更に反映したコンテンツ識別子を利用者に提供することができる。 In the present invention, the browsing time corresponding to each user identifier in the list A is read from the browsing history storage means, and the browsing freshness is calculated by further using this browsing time. The reflected content identifier can be provided to the user.

請求項１１に記載の本発明は、出力された前記第１コンテンツ識別子が利用者によって選択された場合に、当該第１コンテンツ識別子を受信するコンテンツ識別子受信手段と、当該第１コンテンツ識別子のコンテンツを取得するコンテンツ取得手段と、取得した前記コンテンツから当該コンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、当該関連コンテンツの関連コンテンツ識別子を抽出するコンテンツ解析部と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、前記関連コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該関連コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する第２閲覧履歴検索手段と、当該閲覧人気度に基づいて、前記コンテンツを加工するコンテンツ加工手段と、加工された前記コンテンツを出力するコンテンツ出力手段と、を更に有することを要旨とする。 The present invention as set forth in claim 11, when the output first content identifier is selected by a user, content identifier receiving means for receiving the first content identifier, and content of the first content identifier From the acquired content acquisition means, the content analysis unit that analyzes the related content in the content directly related to the content from the acquired content, and extracts the related content identifier of the related content, and the browsing history storage means A second browsing history for reading the second content identifier, searching for a second content identifier having the same content identifier as the related content identifier, and calculating a browsing popularity indicating the frequency of browsing the content of the related content identifier Based on the search means and the browsing popularity, A content processing means for processing a serial content, and content output means for outputting processed the contents were, and summarized in that further comprising a.

本発明にあっては、利用者によって選択されたコンテンツ識別子のコンテンツに対し、そのコンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、解析した関連コンテンツに対する閲覧人気度を計算し、その閲覧人気度に基づいて加工したコンテンツを出力するため、出力されたコンテンツ内において人気のある関連コンテンツを容易に検索することができる。 In the present invention, for the content with the content identifier selected by the user, the related content in the content directly related to the content is analyzed, and the browsing popularity for the analyzed related content is calculated. Since the content processed based on the browsing popularity is output, it is possible to easily search for related content that is popular in the output content.

請求項１２に記載の本発明は、前記第２閲覧履歴検索手段が、前記閲覧人気度を算出すると共に、前記コンテンツに間接的に関連するコンテンツ外の関連コンテンツの関連コンテンツ識別子を抽出するものであって、前記コンテンツ出力手段は、当該コンテンツ外の関連コンテンツ識別子を更に出力することを要旨とする。 In the present invention according to claim 12, the second browsing history search means calculates the browsing popularity and extracts a related content identifier of related content outside the content indirectly related to the content. The content output means further outputs a related content identifier outside the content.

本発明にあっては、コンテンツに間接的に関連するコンテンツ外の関連コンテンツ識別子を更に出力するため、コンテンツの利便性を向上することができる。 In the present invention, since the related content identifier outside the content indirectly related to the content is further output, the convenience of the content can be improved.

請求項１３に記載の本発明は、前記閲覧履歴格納部は、当該閲覧履歴格納部に格納された第２コンテンツ識別子のコンテンツが閲覧される前に閲覧されていた遷移元コンテンツの遷移元コンテンツ識別子を更に格納しておくものであって、前記第２閲覧履歴検索手段は、当該閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、受信した第１コンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、当該コンテンツ識別子に対応する前記遷移元コンテンツ識別子と当該遷移元コンテンツ識別子の数とを関連付けた遷移元コンテンツ識別子リストを作成し、前記遷移元コンテンツ識別子リストの遷移元コンテンツ識別子と同じ遷移元コンテンツ識別子を有する遷移元コンテンツ識別子を前記閲覧履歴格納手段から検索し、検索された遷移元コンテンツ識別子と当該遷移元コンテンツ識別子の数とを関連付けた遷移先コンテンツ識別子リストを作成し、前記遷移元コンテンツ識別子リストと前記遷移先コンテンツ識別子リストを用いて、前記遷移先コンテンツ識別子の重要度を算出して、当該遷移先コンテンツ識別子を前記コンテンツ外の関連コンテンツ識別子として抽出することを要旨とする。 The present invention described in claim 13 is characterized in that the browsing history storage unit is a transition source content identifier of a transition source content that has been browsed before the content of the second content identifier stored in the browsing history storage unit is browsed. The second browsing history search unit reads the second content identifier from the browsing history storage unit, and stores a content identifier having the same content identifier as the received first content identifier. Search and create a transition source content identifier list in which the transition source content identifier corresponding to the content identifier is associated with the number of the transition source content identifiers, and the same transition source as the transition source content identifier in the transition source content identifier list Whether the transition source content identifier having the content identifier is the browsing history storage means A transition destination content identifier list is created by associating the searched transition source content identifiers with the number of the transition source content identifiers, and the transition is performed using the transition source content identifier list and the transition destination content identifier list. The gist is to calculate the importance of the destination content identifier and extract the transition destination content identifier as a related content identifier outside the content.

請求項１４に記載の本発明は、前記閲覧履歴格納部が、当該閲覧履歴格納部に格納された第２コンテンツ識別子のコンテンツが閲覧される前に閲覧されていた遷移元コンテンツの遷移元コンテンツ識別子を更に格納しておくものであって、前記第２閲覧履歴検索手段は、当該閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、受信した第１コンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、当該コンテンツ識別子に対応する前記遷移元コンテンツ識別子と当該遷移元コンテンツ識別子の数とを関連付けたリストを遷移先コンテンツ識別子リストとして作成し、当該遷移先コンテンツ識別子リストから前記コンテンツ内の関連コンテンツ識別子を削除し、削除後の遷移先コンテンツ識別子リストの遷移先コンテンツ識別子をコンテンツ外の関連コンテンツ識別子として抽出すると共に、当該遷移先コンテンツ識別子の数を重要度とすることを要旨とする。 The present invention according to claim 14 is the transition source content identifier of the transition source content browsed before the browsing history storage unit browses the content of the second content identifier stored in the browsing history storage unit. The second browsing history search unit reads the second content identifier from the browsing history storage unit, and stores a content identifier having the same content identifier as the received first content identifier. A list in which the transition source content identifier corresponding to the content identifier and the number of the transition source content identifiers are associated is created as a transition destination content identifier list, and the related content in the content is retrieved from the transition destination content identifier list. Delete the identifier and transition to the destination content identifier list after deletion Extracts a content identifier as the content out of the associated content identifier, and summarized in that the number of the transition destination content identifier and importance.

請求項１５に記載の本発明は、前記コンテンツ出力手段が、前記重要度に応じて及び／又は前記重要度と共に、前記コンテンツ外の関連コンテンツ識別子を出力することを要旨とする。 The gist of the present invention described in claim 15 is that the content output means outputs a related content identifier outside the content according to the importance and / or together with the importance.

本発明にあっては、重要度に応じて及び／又は重要度と共に、コンテンツ外の関連コンテンツが出力されるため、人気のあるコンテンツをより効率的に閲覧することができる。 In the present invention, the related content outside the content is output according to the importance and / or together with the importance, so that the popular content can be browsed more efficiently.

請求項１６に記載の本発明は、閲覧されたコンテンツの第２コンテンツ識別子を格納しておく閲覧履歴格納手段と、所望のコンテンツに対する第１コンテンツ識別子を受信するコンテンツ識別子受信手段と、当該第１コンテンツ識別子のコンテンツを取得するコンテンツ取得手段と、取得した前記コンテンツから当該コンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、当該関連コンテンツの関連コンテンツ識別子を抽出するコンテンツ解析部と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、前記関連コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該関連コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する閲覧履歴検索手段と、当該閲覧人気度に基づいて、前記コンテンツを加工するコンテンツ加工手段と、加工された前記コンテンツを出力するコンテンツ出力手段と、を有することを要旨とする。 According to a sixteenth aspect of the present invention, there is provided browsing history storage means for storing a second content identifier of browsed content, content identifier receiving means for receiving a first content identifier for desired content, and the first A content acquisition unit that acquires content with a content identifier, a content analysis unit that analyzes related content in content directly related to the content from the acquired content, and extracts a related content identifier of the related content; The second content identifier is read from the browsing history storage means, a second content identifier having the same content identifier as the related content identifier is searched, and a browsing popularity indicating the frequency of browsing the content of the related content identifier is calculated. Browsing history search means to Based on viewing popularity to a content processing means for processing the content, a content output means for outputting processed the contents were, summarized in that with.

本発明にあっては、所望のコンテンツに対するコンテンツ識別子を受信し、そのコンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、解析した関連コンテンツに対する閲覧人気度を計算し、その閲覧人気度に基づいて加工したコンテンツを出力するため、出力されたコンテンツ内において人気のある関連コンテンツを容易に検索することができる。 In the present invention, the content identifier for the desired content is received, the related content in the content directly related to the content is analyzed, the browsing popularity for the analyzed related content is calculated, and the browsing popularity Since the content processed based on is output, the related content that is popular in the output content can be easily searched.

請求項１７に記載の本発明は、前記閲覧履歴検索手段が、前記閲覧人気度を算出すると共に、前記コンテンツに間接的に関連するコンテンツ外の関連コンテンツの関連コンテンツ識別子を抽出するものであって、前記コンテンツ出力手段は、当該コンテンツ外の関連コンテンツ識別子を更に出力することを要旨とする。 In the present invention described in claim 17, the browsing history search means calculates the browsing popularity and extracts a related content identifier of related content outside the content indirectly related to the content. The content output means further outputs related content identifiers outside the content.

請求項１８に記載の本発明は、前記閲覧履歴格納部が、当該閲覧履歴格納部に格納された第２コンテンツ識別子のコンテンツが閲覧される前に閲覧されていた遷移元コンテンツの遷移元コンテンツ識別子を更に格納しておくものであって、前記閲覧履歴検索手段は、当該閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、受信した第１コンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、当該コンテンツ識別子に対応する前記遷移元コンテンツ識別子と当該遷移元コンテンツ識別子の数とを関連付けた遷移元コンテンツ識別子リストを作成し、前記遷移元コンテンツ識別子リストの遷移元コンテンツ識別子と同じ遷移元コンテンツ識別子を有する遷移元コンテンツ識別子を前記閲覧履歴格納手段から検索し、検索された遷移元コンテンツ識別子と当該遷移元コンテンツ識別子の数とを関連付けた遷移先コンテンツ識別子リストを作成し、前記遷移元コンテンツ識別子リストと前記遷移先コンテンツ識別子リストを用いて、前記遷移先コンテンツ識別子の重要度を算出して、当該遷移先コンテンツ識別子を前記コンテンツ外の関連コンテンツ識別子として抽出することを要旨とする。 The present invention according to claim 18 is the transition source content identifier of the transition source content that was browsed before the browsing history storage unit browsed the content of the second content identifier stored in the browsing history storage unit. The browsing history search unit reads the second content identifier from the browsing history storage unit and searches for a content identifier having the same content identifier as the received first content identifier. Creating a transition source content identifier list in which the transition source content identifier corresponding to the content identifier and the number of the transition source content identifiers are associated, and the same transition source content identifier as the transition source content identifier in the transition source content identifier list The transition source content identifier having And creating a transition destination content identifier list in which the retrieved transition source content identifiers and the number of the transition source content identifiers are associated, and using the transition source content identifier list and the transition destination content identifier list, The gist is to calculate the importance of the content identifier and extract the transition destination content identifier as a related content identifier outside the content.

請求項１９に記載の本発明は、前記閲覧履歴格納部が、当該閲覧履歴格納部に格納された第２コンテンツ識別子のコンテンツが閲覧される前に閲覧されていた遷移元コンテンツの遷移元コンテンツ識別子を更に格納しておくものであって、前記閲覧履歴検索手段は、当該閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、受信した第１コンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、当該コンテンツ識別子に対応する前記遷移元コンテンツ識別子と当該遷移元コンテンツ識別子の数とを関連付けたリストを遷移先コンテンツ識別子リストとして作成し、当該遷移先コンテンツ識別子リストから前記コンテンツ内の関連コンテンツ識別子を削除し、削除後の遷移先コンテンツ識別子リストの遷移先コンテンツ識別子をコンテンツ外の関連コンテンツ識別子として抽出すると共に、当該遷移先コンテンツ識別子の数を重要度とすることを要旨とする。 The present invention described in claim 19 is the transition source content identifier of the transition source content that was browsed before the browsing history storage unit browsed the content of the second content identifier stored in the browsing history storage unit. The browsing history search unit reads the second content identifier from the browsing history storage unit and searches for a content identifier having the same content identifier as the received first content identifier. A list in which the transition source content identifier corresponding to the content identifier and the number of the transition source content identifiers are associated is created as a transition destination content identifier list, and a related content identifier in the content is obtained from the transition destination content identifier list. Delete the transition destination code in the transition destination content identifier list after deletion. Extracts the Ceiling identifier as the content out of the associated content identifier, and summarized in that the number of the transition destination content identifier and importance.

請求項２０に記載の本発明は、前記コンテンツ出力手段が、前記重要度に応じて及び／又は前記重要度と共に、前記コンテンツ外の関連コンテンツ識別子を出力することを要旨とする。 The gist of the present invention described in claim 20 is that the content output means outputs a related content identifier outside the content according to the importance and / or together with the importance.

請求項２１に記載の本発明は、コンテンツと当該コンテンツを識別可能な第１コンテンツ識別子とを関連付けてコンテンツ格納手段に格納しておくステップと、閲覧されたコンテンツの第２コンテンツ識別子を閲覧履歴格納手段に格納しておくステップと、コンテンツを検索する検索語を受信するステップと、前記コンテンツ格納手段から前記コンテンツと前記第１コンテンツ識別子とを読み出して、前記検索語を含むコンテンツと当該コンテンツに対応する第１コンテンツ識別子とを検索し、当該検索語が当該コンテンツに出現する頻度を示す文章適合度を算出するステップと、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、検索された前記第１コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該第１コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出するステップと、前記文章適合度と前記閲覧人気度とを合成するステップと、合成された結果に基づいて、検索された前記第１コンテンツ識別子を出力するステップと、を有することを要旨とする。 According to a twenty-first aspect of the present invention, a step of associating a content with a first content identifier that can identify the content is stored in the content storage means, and a second content identifier of the browsed content is stored in a browsing history. Storing in the means, receiving a search word for searching for content, reading the content and the first content identifier from the content storage means, and corresponding to the content including the search word and the content Searching for the first content identifier, calculating a sentence suitability indicating the frequency with which the search term appears in the content, and reading the second content identifier from the browsing history storage means, Second content identification having the same content identifier as the first content identifier Based on the result of combining, the step of calculating the browsing popularity indicating the frequency of browsing the content of the first content identifier, the step of combining the sentence suitability and the browsing popularity And a step of outputting the searched first content identifier.

請求項２２に記載の本発明は、閲覧されたコンテンツの第２コンテンツ識別子を閲覧履歴格納手段に格納しておくステップと、所望のコンテンツに対する第１コンテンツ識別子を受信するステップと、当該第１コンテンツ識別子のコンテンツを取得するステップと、取得した前記コンテンツから当該コンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、当該関連コンテンツの関連コンテンツ識別子を抽出するステップと、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、前記関連コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該関連コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出するステップと、当該閲覧人気度に基づいて、前記コンテンツを加工するステップと、加工された前記コンテンツを出力するステップと、を有することを要旨とする。 According to a twenty-second aspect of the present invention, the step of storing the second content identifier of the browsed content in the browsing history storage means, the step of receiving the first content identifier for the desired content, and the first content From the acquired content, analyzing the related content in the content directly related to the content from the acquired content, extracting the related content identifier of the related content, from the browsing history storage means Reading the second content identifier, searching for a second content identifier having the same content identifier as the related content identifier, and calculating a browsing popularity indicating the frequency of browsing the content of the related content identifier; Based on browsing popularity, A step of processing the Ntsu, and outputting the processed the contents were, and summarized in that with.

請求項２３に記載の本発明は、コンテンツと当該コンテンツを識別可能な第１コンテンツ識別子とを関連付けてコンテンツ格納手段に格納しておく処理と、閲覧されたコンテンツの第２コンテンツ識別子を閲覧履歴格納手段に格納しておく処理と、コンテンツを検索する検索語を受信する処理と、前記コンテンツ格納手段から前記コンテンツと前記第１コンテンツ識別子とを読み出して、前記検索語を含むコンテンツと当該コンテンツに対応する第１コンテンツ識別子とを検索し、当該検索語が当該コンテンツに出現する頻度を示す文章適合度を算出する処理と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、検索された前記第１コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該第１コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する処理と、前記文章適合度と前記閲覧人気度とを合成する処理と、合成された結果に基づいて、検索された前記第１コンテンツ識別子を出力する処理と、をコンピュータに実行させることを要旨とする。 According to a twenty-third aspect of the present invention, a process of associating a content with a first content identifier that can identify the content and storing the content in a content storage unit, and storing a second content identifier of the browsed content in a browsing history A process for storing in the means, a process for receiving a search word for searching for content, and reading out the content and the first content identifier from the content storage means to correspond to the content including the search word and the content The first content identifier to be searched, a process for calculating a sentence suitability indicating the frequency of occurrence of the search term in the content, and the second content identifier is read from the browsing history storage means, and the searched Search for a second content identifier having the same content identifier as the first content identifier, and The process of calculating the browsing popularity indicating the frequency of browsing the content of one content identifier, the process of combining the sentence matching degree and the browsing popularity, and the search result based on the combined result The gist is to cause a computer to execute a process of outputting one content identifier.

請求項２４に記載の本発明は、閲覧されたコンテンツの第２コンテンツ識別子を閲覧履歴格納手段に格納しておく処理と、所望のコンテンツに対する第１コンテンツ識別子を受信する処理と、当該第１コンテンツ識別子のコンテンツを取得する処理と、取得した前記コンテンツから当該コンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、当該関連コンテンツの関連コンテンツ識別子を抽出する処理と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、前記関連コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該関連コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する処理と、当該閲覧人気度に基づいて、前記コンテンツを加工する処理と、加工された前記コンテンツを出力する処理と、をコンピュータに実行させることを要旨とする。 The present invention as set forth in claim 24, the process of storing the second content identifier of the browsed content in the browsing history storage means, the process of receiving the first content identifier for the desired content, and the first content A process of acquiring the content of the identifier, a process of analyzing the related content in the content directly related to the content from the acquired content, extracting the related content identifier of the related content, and the browsing history storage means A process of reading the second content identifier, searching for a second content identifier having the same content identifier as the related content identifier, and calculating a browsing popularity degree indicating a frequency of browsing the content of the related content identifier; Processing to process the content based on browsing popularity , And summarized in that to execute a process of outputting the processed the contents were, to the computer.

請求項２５に記載の本発明は、コンテンツと当該コンテンツを識別可能な第１コンテンツ識別子とを関連付けてコンテンツ格納手段に格納しておく処理と、閲覧されたコンテンツの第２コンテンツ識別子を閲覧履歴格納手段に格納しておく処理と、コンテンツを検索する検索語を受信する処理と、前記コンテンツ格納手段から前記コンテンツと前記第１コンテンツ識別子とを読み出して、前記検索語を含むコンテンツと当該コンテンツに対応する第１コンテンツ識別子とを検索し、当該検索語が当該コンテンツに出現する頻度を示す文章適合度を算出する処理と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、検索された前記第１コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該第１コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する処理と、前記文章適合度と前記閲覧人気度とを合成する処理と、合成された結果に基づいて、検索された前記第１コンテンツ識別子を出力する処理と、をコンピュータに実行させることを要旨とする。 The present invention described in claim 25 relates to a process of associating a content with a first content identifier that can identify the content and storing it in the content storage means, and storing a second content identifier of the browsed content as a browsing history. A process for storing in the means, a process for receiving a search word for searching for content, and reading out the content and the first content identifier from the content storage means to correspond to the content including the search word and the content The first content identifier to be searched, a process for calculating a sentence suitability indicating the frequency of occurrence of the search term in the content, and the second content identifier is read from the browsing history storage means, and the searched Search for a second content identifier having the same content identifier as the first content identifier, and The process of calculating the browsing popularity indicating the frequency of browsing the content of one content identifier, the process of combining the sentence matching degree and the browsing popularity, and the search result based on the combined result The gist is to cause a computer to execute a process of outputting one content identifier.

請求項２６に記載の本発明は、閲覧されたコンテンツの第２コンテンツ識別子を閲覧履歴格納手段に格納しておく処理と、所望のコンテンツに対する第１コンテンツ識別子を受信する処理と、当該第１コンテンツ識別子のコンテンツを取得する処理と、取得した前記コンテンツから当該コンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、当該関連コンテンツの関連コンテンツ識別子を抽出する処理と、前記閲覧履歴格納手段から前記第２コンテンツ識別子を読み出して、前記関連コンテンツ識別子と同じコンテンツ識別子を有する第２コンテンツ識別子を検索し、当該関連コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出する処理と、当該閲覧人気度に基づいて、前記コンテンツを加工する処理と、加工された前記コンテンツを出力する処理と、をコンピュータに実行させることを要旨とする。 The present invention as set forth in claim 26, a process of storing the second content identifier of the browsed content in the browsing history storage means, a process of receiving the first content identifier for the desired content, and the first content A process of acquiring the content of the identifier, a process of analyzing the related content in the content directly related to the content from the acquired content, extracting the related content identifier of the related content, and the browsing history storage means A process of reading the second content identifier, searching for a second content identifier having the same content identifier as the related content identifier, and calculating a browsing popularity degree indicating a frequency of browsing the content of the related content identifier; Processing to process the content based on browsing popularity , And summarized in that to execute a process of outputting the processed the contents were, to the computer.

本発明によれば、ユーザの趣向を反映したコンテンツを表示することができる。 According to the present invention, it is possible to display content reflecting the user's preferences.

〔第１の実施の形態〕
図１は、第１の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１は、ユーザ端末２と情報検索装置３とを備えた構成であり、互いにネットワーク４を介して接続されている。 [First Embodiment]
FIG. 1 is a configuration diagram showing a configuration of an information search system 1 according to the first embodiment. The information search system 1 according to the present embodiment includes a user terminal 2 and an information search device 3 and is connected to each other via a network 4.

ユーザ端末２は、モニタ，キーボード等の入力手段や、ネットワーク４に対する通信手段等を備え、利用者によるコンテンツの検索要求を受け付けて、その検索要求に格納された検索語を、ネットワーク４を介して後述する情報検索装置３の検索語受信部３１に送信する。また、ユーザ端末２は、ネットワーク４を介して後述する情報検索装置３の検索結果出力部３５から送信された検索結果を受信し、その検索結果を利用者に提示する。 The user terminal 2 includes input means such as a monitor and a keyboard, and communication means for the network 4. The user terminal 2 accepts a search request for content by a user and sends a search term stored in the search request via the network 4. It transmits to the search word receiving part 31 of the information search device 3 mentioned later. Further, the user terminal 2 receives a search result transmitted from the search result output unit 35 of the information search device 3 described later via the network 4 and presents the search result to the user.

情報検索装置３は、コンテンツを検索する検索語を受信する検索語受信部３１と、受信した検索語を含むコンテンツを検索するコンテンツ検索部３２と、検索されたコンテンツの閲覧履歴を検索する閲覧履歴検索部３３と、検索されたコンテンツの文章適合度と閲覧人気度とを合成する適合度合成部３４と、合成された結果に基づいてコンテンツの検索結果を出力する検索結果出力部３５とを備えた情報検索装置であって、次に説明する格納部を具備している。 The information search device 3 includes a search word receiving unit 31 that receives a search word for searching for content, a content search unit 32 that searches for content including the received search word, and a browsing history that searches the browsing history of the searched content A search unit 33, a fitness synthesis unit 34 that synthesizes the text suitability and browsing popularity of the searched content, and a search result output unit 35 that outputs a content search result based on the synthesized result. The information retrieval apparatus includes a storage unit described below.

コンテンツ格納部３０１は、コンテンツ検索部３２に接続されており、図２に示すように、コンテンツそのものを示すコンテンツ文章情報と、このコンテンツを識別可能なコンテンツ識別子と、このコンテンツが格納されているコンテンツ所在場所とが関連付けて格納されている。コンテンツ文章情報とは、コンテンツの内容を示す文章であり、例えば、コンテンツがウェブ上のサーバに格納されたページであれば、そのページに記された文章を意味する。コンテンツ識別子とは、あるコンテンツを他のコンテンツから識別可能とするものであり、例えば、コンテンツがウェブ上のサーバに格納されたページであれば、そのページのＵＲＬ（Uniform Resource Locator）を意味する。また、コンテンツ所在場所とは、コンテンツのファイルが格納されている場所であり、例えば、ローカルの計算機にコンテンツが格納されている場合には図２に示すようなパス情報が格納され、ウェブ上のコンテンツの場合にはＵＲＬが格納される。 The content storage unit 301 is connected to the content search unit 32, and as shown in FIG. 2, content text information indicating the content itself, a content identifier that can identify the content, and content in which the content is stored Stored in association with the location. The content sentence information is a sentence indicating the content of the content. For example, if the content is a page stored on a server on the web, it means a sentence written on the page. The content identifier makes it possible to identify a certain content from other content. For example, if the content is a page stored in a server on the web, it means a URL (Uniform Resource Locator) of the page. The content location is a location where content files are stored. For example, when content is stored in a local computer, path information as shown in FIG. In the case of content, a URL is stored.

閲覧履歴ログ３０２は、ログとしてのファイル形式で構成されるものであって、閲覧履歴検索部３３に接続されており、利用者によって過去に閲覧されたコンテンツの閲覧履歴が格納されている。例えば、図３に示すように、複数のコンテンツ識別子が１行に１回の閲覧要求に対応する形で格納されており、最初に「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／」のコンテンツが閲覧され、その次に「ｗｗｗ．ｎｔｔｒ．ｃｏ．ｊｐ／ｇｅｎｋｉ．ｈｔｍｌ」のコンテンツが閲覧されたことを意味している。なお、閲覧履歴ログ３０２に格納されるコンテンツ識別子は、任意の利用者からの閲覧要求があった都度、情報検索システム１とは別に更新されている。 The browsing history log 302 is configured in a file format as a log, is connected to the browsing history search unit 33, and stores browsing history of content browsed in the past by the user. For example, as shown in FIG. 3, a plurality of content identifiers are stored in a form corresponding to one browsing request per line, and the content “www.goo.ne.jp/” is browsed first, Next, it means that the content “www.nttr.co.jp/genki.html” has been browsed. The content identifier stored in the browsing history log 302 is updated separately from the information search system 1 every time a browsing request is received from an arbitrary user.

ネットワーク４は、例えば、ＬＡＮ（Local Area Network）、インターネット、公衆回線網、ケーブルテレビ網等を利用することができる。なお、インターネットとは、所定のプロトコルに基づいて相互リンクされたネットワークの集合体である。 As the network 4, for example, a LAN (Local Area Network), the Internet, a public line network, a cable television network, or the like can be used. The Internet is a collection of networks linked to each other based on a predetermined protocol.

続いて、本発明の実施の形態における情報検索装置３の処理について、図４を用いて説明する。図４は、第１の実施の形態における情報検索装置３の処理を示すフローチャートである。 Next, processing of the information search device 3 in the embodiment of the present invention will be described with reference to FIG. FIG. 4 is a flowchart showing processing of the information search device 3 in the first embodiment.

まず、検索語受信部３１は、ネットワーク４を介してユーザ端末２から送信された検索語を受信し、その検索語をコンテンツ検索部３２に送信する（Ｓ１０１）。 First, the search word receiving unit 31 receives a search word transmitted from the user terminal 2 via the network 4, and transmits the search word to the content search unit 32 (S101).

次に、コンテンツ検索部３２は、検索語受信部３１から送信された検索語を受け取り、コンテンツ格納部３０１からコンテンツとコンテンツ識別子とを読み出して、受け取った検索語を含む複数のコンテンツとその複数のコンテンツにそれぞれ対応する複数のコンテンツ識別子とを検索し、検索した各コンテンツの文章適合度をそれぞれ算出して、検索した各コンテンツ識別子と算出した各文章適合度とを関連付けたコンテンツ検索結果を閲覧履歴検索部３３及び適合度合成部３４に送信する（Ｓ１０２）。 Next, the content search unit 32 receives the search term transmitted from the search term receiving unit 31, reads the content and the content identifier from the content storage unit 301, a plurality of contents including the received search term, and the plurality of the content Search multiple content identifiers corresponding to each content, calculate the text suitability of each searched content, and browse the content search results that associate each searched content identifier with each calculated text suitability The data is transmitted to the search unit 33 and the fitness composition unit 34 (S102).

ここで、文章適合度とは、検索語がコンテンツに出現する頻度を意味するものであり、例えば、ＴＦ（Term Frequency）法や、ＴＦ／ＩＤＦ（Term Frequency／Inverse Document Frequency）法などが一般的に利用されている。 Here, the sentence suitability means the frequency with which a search word appears in the content, and for example, TF (Term Frequency) method, TF / IDF (Term Frequency / Inverse Document Frequency) method, etc. are common. Has been used.

続いて、閲覧履歴検索部３３は、コンテンツ検索部３２から送信されたコンテンツ検索結果を受け取り、閲覧履歴ログ３０２から過去に閲覧されたコンテンツ識別子を読み出して、受け取ったコンテンツ検索結果に含まれるコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、コンテンツ検索結果に含まれるコンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出して、適合度合成部３４に送信する（Ｓ１０３）。 Subsequently, the browsing history search unit 33 receives the content search result transmitted from the content search unit 32, reads the content identifier browsed in the past from the browsing history log 302, and the content identifier included in the received content search result A content identifier having the same content identifier is searched, a browsing popularity indicating the frequency of browsing the content of the content identifier included in the content search result is calculated, and transmitted to the matching level synthesizing unit 34 (S103).

ここで、本実施の形態における閲覧人気度の計算方法について説明する。閲覧人気度（Ｓ）は、コンテンツ検索部３２から受け取ったコンテンツ識別子と同じコンテンツ識別子が閲覧履歴ログ３０２に格納されている場合に、その閲覧履歴ログ３０２に格納されたコンテンツ識別子の行数を閲覧要求回数（Ｃ）として、例えば、式（１）で計算することができる。 Here, the browsing popularity calculation method in the present embodiment will be described. When the content identifier same as the content identifier received from the content search unit 32 is stored in the browsing history log 302, the browsing popularity (S) is the number of rows of the content identifier stored in the browsing history log 302. The number of requests (C) can be calculated by, for example, Expression (1).

Ｓ＝Ｃ・・・式（１）
式（１）を用いて閲覧人気度を計算することで、過去に多く閲覧されたコンテンツの閲覧人気度は大きくなり、後述するコンテンツの得点に合成することで、人気のあるコンテンツを優先して利用者に提供することができる。なお、式（１）に代えて、式（２）〜式（４）などの任意の関数を用いることも可能である。 S = C Formula (1)
By calculating the browsing popularity using equation (1), the browsing popularity of the content that has been browsed a lot in the past will increase, and by combining with the score of the content to be described later, the popular content will be given priority Can be provided to users. In addition, it is also possible to use arbitrary functions, such as Formula (2)-Formula (4), instead of Formula (1).

Ｓ＝ｌｏｇ（Ｃ）・・・式（２）
Ｓ＝ｅｘｐ（Ｃ）・・・式（３）
Ｓ＝ｆ（Ｃ）・・・式（４）
その後、適合度合成部３４は、コンテンツ検索部３２から送信されたコンテンツ検索結果と閲覧履歴検索部３３から送信された閲覧人気度とを受け取り、各コンテンツ識別子に対する文章適合度と閲覧人気度とを合成した得点を計算し、その得点を検索結果出力部３５に送信する（Ｓ１０４）。なお、この得点の計算方法は、加算や乗算等を用いて計算しても良いし、閲覧人気度と同様に任意の関数を用いてもよい。 S = log (C) (2)
S = exp (C) (3)
S = f (C) (4)
Thereafter, the matching level synthesis unit 34 receives the content search result transmitted from the content search unit 32 and the browsing popularity level transmitted from the browsing history search unit 33, and calculates the sentence matching level and the browsing popularity level for each content identifier. The combined score is calculated, and the score is transmitted to the search result output unit 35 (S104). The score calculation method may be calculated using addition, multiplication, or the like, or an arbitrary function may be used as in the case of browsing popularity.

（Ｓ１０３）及び（Ｓ１０４）の処理を、（Ｓ１０２）で検索された全てのコンテンツ識別子に対して繰り返し行う（Ｓ１０５）。 The processes of (S103) and (S104) are repeated for all content identifiers searched in (S102) (S105).

検索結果出力部３５は、適合度合成部３４から送信された各コンテンツ識別子に対する得点を受け取り、得点の高い順番に各コンテンツ識別子を並び替えた検索結果を、ネットワーク４を介してユーザ端末２に出力する（Ｓ１０６）。なお、この検索結果は、得点の高い順番で並べられたコンテンツ識別子のうち、例えば、上位Ｎ件（Ｎは１以上の自然数）のみを検索結果として出力することも可能である。 The search result output unit 35 receives the score for each content identifier transmitted from the fitness synthesis unit 34 and outputs the search result in which the content identifiers are rearranged in the order of the highest score to the user terminal 2 via the network 4. (S106). In addition, for example, only the top N items (N is a natural number of 1 or more) among the content identifiers arranged in descending order of the scores can be output as the search results.

以上の処理により、ユーザ端末２の画面を介して、例えば図５に示すような検索結果を利用者に提示することができる。 Through the above processing, for example, a search result as shown in FIG. 5 can be presented to the user via the screen of the user terminal 2.

最後に、文章適合度と閲覧人気度とを合成することにより、検索語と適合し、且つ人気のあるコンテンツが優先的に利用者に提供される理由について説明する。 Finally, the reason why the content that matches the search word and is popular is preferentially provided to the user by synthesizing the sentence matching degree and the browsing popularity degree will be described.

図６は、第１の実施の形態における効果を説明する説明図である。ここで、利用者が用いた検索語は「東京都」であると仮定する。また、検索結果として得られたコンテンツは、「東京都情報」，「東京都庁」，「東京の観光」，「東京大学」，「東京駅」，「天気予報」であり、利用者はこの順番の検索結果を望んでいるものとする。更に、それぞれのコンテンツに対する文章適合度は、１４７，１５０，１５２，１４５，１４２，３１であり、閲覧人気度は、３２，２７，２３，２１，１８，１００であると仮定する。 FIG. 6 is an explanatory diagram for explaining the effect of the first embodiment. Here, it is assumed that the search term used by the user is “Tokyo”. The contents obtained as a search result are “Tokyo information”, “Tokyo Metropolitan Government”, “Tokyo tourism”, “University of Tokyo”, “Tokyo station”, “weather forecast”. Suppose you want search results for. Furthermore, it is assumed that the sentence suitability for each content is 147, 150, 152, 145, 142, 31 and the browsing popularity is 32, 27, 23, 21, 18, 100.

ここで、文章適合度は、コンテンツ中に「東京都」という文字列が出現する頻度に基づくので、「東京の観光」などのように観光地の名称が多く使われているようなコンテンツに対しては大きな値となり、一方、「天気予報」のようにコンテンツの主題とは直接関係がないが、コンテンツ中に一度だけ出現するようなコンテンツに対しては小さな値となる。 Here, the text suitability is based on the frequency of occurrence of the character string “Tokyo” in the content. Therefore, for the content where the name of the tourist destination is often used such as “Sightseeing in Tokyo”. On the other hand, it is not directly related to the subject of the content such as “weather forecast”, but it is a small value for content that appears only once in the content.

古典的な手法の場合、図６（ａ）に示すように文章適合度を得点とするので、コンテンツ中の「東京都」の出現回数のみの観点により順序付けられ、利用者が求める順番で検索結果を出力することができない。 In the case of the classical method, as shown in FIG. 6 (a), the score is scored according to the sentence matching degree. Therefore, the search results are ordered in terms of only the number of appearances of “Tokyo” in the content, and in the order that the user seeks. Cannot be output.

また、従来の手法の場合、図６（ｂ）に示すように閲覧人気度を得点とするので、利用者にとって人気の高いコンテンツの順番で検索結果を提供することができる。しかし、その一方で、「天気予報」のような様々な検索語によって検索や閲覧されるコンテンツが上位に提示されるので、そのような人気の高いコンテンツが、利用者により与えられた検索語との適合度が大きいコンテンツとは限らない。 Further, in the case of the conventional method, as shown in FIG. 6B, the browsing popularity is scored, so that the search results can be provided in the order of contents popular to the user. However, on the other hand, content that is searched and viewed by various search terms such as “weather forecast” is presented at the top, so that such popular content is the search term given by the user. It is not necessarily content with a high degree of fitness.

それに対し、本実施の形態で説明した文章適合度と閲覧人気度とを合成する手法の場合、図６（ｃ）に示すように利用者が望む順番通りの検索結果を提供することが可能であり、検索語との適合度が大きく、且つ人気のあるコンテンツを上位に表示するように順位付けすることが可能となる。 On the other hand, in the case of the method of synthesizing the sentence matching degree and the browsing popularity degree described in the present embodiment, it is possible to provide search results in the order desired by the user as shown in FIG. In addition, it is possible to rank so that popular contents are highly displayed and have a high degree of matching with a search term.

本実施の形態によれば、コンテンツを検索する検索語を含むコンテンツと該コンテンツに対応するコンテンツ識別子とをコンテンツ格納部３０１から検索して、この検索語により検索されたコンテンツに出現する頻度を示す文章適合度を算出すると共に、検索されたコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を閲覧履歴ログ３０２から検索し、検索されたコンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出し、算出された文章適合度と閲覧人気度とを合成した結果に基づいて検索されたコンテンツ識別子を出力するので、利用者の趣向を反映したコンテンツを表示することができる。 According to the present embodiment, a content including a search word for searching for content and a content identifier corresponding to the content are searched from the content storage unit 301, and the frequency of appearance in the content searched by this search word is indicated. The sentence matching degree is calculated, and a content identifier having the same content identifier as the searched content identifier is searched from the browsing history log 302, and the browsing popularity degree indicating the frequency of browsing the content of the searched content identifier is calculated. Since the content identifier searched based on the result of combining the calculated sentence fitness and the browsing popularity is output, it is possible to display the content reflecting the user's preference.

本実施の形態によれば、適合度合成部３４が、文章適合度と閲覧人気度とを合成した得点を計算するものであって、検索結果出力部３５は、得点の高い順番でコンテンツ識別子を出力するため、利用者によって入力された検索語との適合度が大きく、且つ人気のあるコンテンツ識別子を優先して利用者に提供することができる。 According to the present embodiment, the matching degree synthesis unit 34 calculates a score obtained by synthesizing the sentence matching degree and the browsing popularity degree, and the search result output unit 35 selects the content identifiers in descending order of the score. Therefore, it is possible to preferentially provide a popular content identifier to the user with a high degree of matching with the search term input by the user.

〔第２の実施の形態〕
図７は、第２の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１の構成は、閲覧履歴格納部３０２を除き、第１の実施の形態と基本的には同様なので、ここでは重複した説明は省略する。 [Second Embodiment]
FIG. 7 is a configuration diagram showing the configuration of the information search system 1 in the second embodiment. The configuration of the information search system 1 in the present embodiment is basically the same as that of the first embodiment except for the browsing history storage unit 302, and therefore, a duplicate description is omitted here.

本実施の形態における閲覧履歴格納部３０２は、データベース形式で構成されるものであって、図８に示すように、閲覧されたコンテンツのコンテンツ識別子と、そのコンテンツが閲覧された閲覧時刻を示す閲覧要求日時と、そのコンテンツを閲覧した利用者の利用者識別子とが、１行に１回の閲覧要求に対応する形で関連付けて格納されている。図８の場合、最初に、「２００７年２月２７日の１２時３２分５４秒」の時刻に、「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／」のコンテンツ識別子を持つコンテンツが、「７５０２３９７０４７」の利用者識別子を持つ利用者により閲覧されたことを意味している。また、閲覧履歴格納部３０２に格納される閲覧要求日時，コンテンツ識別子，利用者識別子は、任意の利用者からの閲覧要求があった都度、情報検索システム１とは別に更新されている。 The browsing history storage unit 302 according to the present embodiment is configured in a database format. As shown in FIG. 8, the browsing history storage unit 302 is a browsing that shows the content identifier of the browsed content and the browsing time when the content was browsed. The request date and time and the user identifier of the user who has browsed the content are stored in association with each other in a form corresponding to a single browsing request. In the case of FIG. 8, the content having the content identifier “www.goo.ne.jp/” at the time “12:32:54 on Feb. 27, 2007” is first used as “75023997047”. It means that it was browsed by a user with a user identifier. Further, the browsing request date and time, the content identifier, and the user identifier stored in the browsing history storage unit 302 are updated separately from the information search system 1 every time a browsing request is received from an arbitrary user.

続いて、本発明の実施の形態における情報検索装置３の処理について、図９を用いて説明する。図９は、第２の実施の形態における情報検索装置３の処理を示すフローチャートである。 Next, processing of the information search device 3 in the embodiment of the present invention will be described with reference to FIG. FIG. 9 is a flowchart showing processing of the information search device 3 in the second embodiment.

まず、検索語受信部３１は、ネットワーク４を介してユーザ端末２から送信された検索語を受信し、その検索語をコンテンツ検索部３２に送信する（Ｓ２０１）。 First, the search word receiving unit 31 receives a search word transmitted from the user terminal 2 via the network 4 and transmits the search word to the content search unit 32 (S201).

次に、コンテンツ検索部３２は、検索語受信部３１から送信された検索語を受け取り、コンテンツ格納部３０１からコンテンツとコンテンツ識別子とを読み出して、受け取った検索語を含む複数のコンテンツとその複数のコンテンツにそれぞれ対応する複数のコンテンツ識別子とを検索し、検索した各コンテンツの文章適合度をそれぞれ算出して、検索した各コンテンツ識別子と算出した各文章適合度とを関連付けたコンテンツ検索結果を閲覧履歴検索部３３及び適合度合成部３４に送信する（Ｓ２０２）。なお、文章適合度の算出方法は、第１の実施の形態と同様である。 Next, the content search unit 32 receives the search term transmitted from the search term receiving unit 31, reads the content and the content identifier from the content storage unit 301, a plurality of contents including the received search term, and the plurality of the content Search multiple content identifiers corresponding to each content, calculate the text suitability of each searched content, and browse the content search results that associate each searched content identifier with each calculated text suitability The data is transmitted to the search unit 33 and the fitness composition unit 34 (S202). Note that the method for calculating the sentence matching degree is the same as that in the first embodiment.

続いて、閲覧履歴検索部３３は、検索語受信部３１から送信されたコンテンツ検索結果を受け取り、閲覧履歴格納部３０２から過去に閲覧されたコンテンツ識別子と閲覧要求日時と利用者識別子とを読み出して、受け取ったコンテンツ検索結果に含まれるコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、コンテンツ検索結果に含まれるコンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出して、適合度合成部３４に送信する（Ｓ２０３）。 Subsequently, the browsing history search unit 33 receives the content search result transmitted from the search word receiving unit 31, and reads the content identifier, the browsing request date and time, and the user identifier that were browsed in the past from the browsing history storage unit 302. And searching for a content identifier having the same content identifier as the content identifier included in the received content search result, calculating a browsing popularity indicating the frequency of browsing the content with the content identifier included in the content search result, The data is transmitted to the combining unit 34 (S203).

ここで、（Ｓ２０３）で説明した本実施の形態における閲覧人気度の算出方法について具体的に説明する。図１０は、第２の実施の形態における閲覧人気度を算出する処理を示すフローチャートである。 Here, the browsing popularity calculation method in the present embodiment described in (S203) will be specifically described. FIG. 10 is a flowchart illustrating a process of calculating the browsing popularity according to the second embodiment.

最初に、閲覧人気度の計算に用いる変数ａに０（ゼロ）の値を設定する（Ｓ２０３―１）。 First, a value of 0 (zero) is set to the variable a used for calculating the browsing popularity (S203-1).

次に、現在の時刻を取得する（Ｓ２０３―２）。 Next, the current time is acquired (S203-2).

続いて、閲覧履歴格納部３０２から過去に閲覧されたコンテンツ識別子と閲覧要求日時と利用者識別子とを読み出して、受け取ったコンテンツ検索結果に含まれるコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、検索されたコンテンツ識別子に対応する閲覧要求日時と利用者識別子とを取得する。そして、取得した各利用者識別子に対し、閲覧要求日時が最も新しいもののみを残し、図１１に示す利用者識別子と最終閲覧要求日時との関係を示すリストを作成する（Ｓ２０３―３）。 Subsequently, the content identifier browsed in the past, the browse request date and time, and the user identifier are read from the browse history storage unit 302, and a content identifier having the same content identifier as the content identifier included in the received content search result is searched. The browsing request date and time and the user identifier corresponding to the searched content identifier are acquired. Then, for each acquired user identifier, only the one with the newest browsing request date and time is left, and a list showing the relationship between the user identifier and the final browsing request date and time shown in FIG. 11 is created (S203-3).

その後、（Ｓ２０３―２）で取得した現在時刻と（Ｓ２０３―３）で作成したリストの最終閲覧要求日時との差を計算し、各利用者がコンテンツを最後に閲覧した時点から現在までの経過時間Ｅを算出する（Ｓ２０３―４）。 After that, the difference between the current time acquired in (S203-2) and the last browsing request date and time of the list created in (S203-3) is calculated, and the time from when each user last viewed the content to the present Time E is calculated (S203-4).

そして、算出された経過時間Ｅを式（５）に代入して閲覧新鮮度Ｓを計算し、閲覧人気度ａに加算する（Ｓ２０３―５）。 Then, the calculated fresh time S is calculated by substituting the calculated elapsed time E into the equation (5), and is added to the browsing popularity a (S203-5).

Ｓ＝ｅｘｐ（−Ｅ）・・・式（５）
式（５）により得られる閲覧新鮮度は、最後に閲覧した時点から現在までの経過時間Ｅが長いほど小さく、経過時間Ｅが短いほど大きい値になる。故に、頻繁に閲覧される人気のコンテンツの場合、経過時間Ｅが一般的に小さくなる傾向があるので、閲覧人気度の値が大きくなる。従って、この閲覧新鮮度を閲覧人気度に加えることで、利用者の閲覧の頻度に基づいた閲覧人気度を得ることが可能となる。なお、式（５）に代えて、式（６）〜式（１０）などの任意の関数を用いることも可能である。 S = exp (−E) (5)
The browsing freshness degree obtained by the equation (5) becomes smaller as the elapsed time E from the last browsing time to the present time becomes longer, and becomes larger as the elapsed time E becomes shorter. Therefore, in the case of popular content that is browsed frequently, the elapsed time E generally tends to be small, so that the value of the browsing popularity increases. Therefore, by adding this browsing freshness to the browsing popularity, it is possible to obtain the browsing popularity based on the browsing frequency of the user. In addition, it is also possible to use arbitrary functions, such as Formula (6)-Formula (10), instead of Formula (5).

Ｓ＝１／Ｅ・・・式（６）
Ｓ＝１／ｌｏｇ（Ｅ）・・・式（７）
Ｓ＝ｃ×ｅｘｐ（−ｂＥ／）・・・式（８）
Ｓ＝ｃ／ｂＥ・・・式（９）
Ｓ＝ｃ／ｌｏｇ（ｂＥ）・・・式（１０）
但し、ｃ，ｂは定数である。 S = 1 / E (6)
S = 1 / log (E) (7)
S = c × exp (−bE /) (8)
S = c / bE Formula (9)
S = c / log (bE) (10)
However, c and b are constants.

（Ｓ２０３−４）及び（Ｓ２０３−５）の処理を、（Ｓ２０３−３）で検索された全ての利用者識別子に対して繰り返し行う（Ｓ２０３−６）。 The processes of (S203-4) and (S203-5) are repeated for all user identifiers searched in (S203-3) (S203-6).

最終的に求められた変数ａの値をコンテンツの閲覧人気度とし、適合度合成部３４に送信する（Ｓ２０３−７）。なお、変数ａの値をそのまま閲覧人気度する代わりに、ｌｏｇ（ａ）として計算された値を閲覧人気度とすることも可能である。 The value of the variable a finally obtained is used as the browsing popularity of the content, and is transmitted to the fitness composition unit 34 (S203-7). Note that the value calculated as log (a) can be used as the browsing popularity instead of the value of the variable a as it is.

図９に戻り、適合度合成部３４は、コンテンツ検索部３２から送信されたコンテンツ検索結果と、閲覧履歴検索部３３から送信された閲覧人気度とを受け取り、各コンテンツ識別子に対する文章適合度と閲覧人気度とを合成した得点を計算し、その得点を検索結果出力部３５に送信する（Ｓ２０４）。 Returning to FIG. 9, the matching level synthesis unit 34 receives the content search result transmitted from the content search unit 32 and the browsing popularity level transmitted from the browsing history search unit 33, and the sentence matching level and browsing for each content identifier. A score obtained by combining the popularity is calculated, and the score is transmitted to the search result output unit 35 (S204).

（Ｓ２０３）及び（Ｓ２０４）の処理を、（Ｓ２０２）で検索された全てのコンテンツ識別子に対して繰り返し行う（Ｓ２０５）。 The processes of (S203) and (S204) are repeated for all content identifiers searched in (S202) (S205).

検索結果出力部３５は、適合度合成部３４から送信された各コンテンツ識別子に対する得点を受け取り、得点の高い順番に各コンテンツ識別子を並び替えた検索結果を、ネットワーク４を介してユーザ端末２に出力する（Ｓ２０６）。 The search result output unit 35 receives the score for each content identifier transmitted from the fitness synthesis unit 34 and outputs the search result in which the content identifiers are rearranged in the order of the highest score to the user terminal 2 via the network 4. (S206).

最後に、閲覧新鮮度を考慮して閲覧人気度を計算することにより、一部の利用者の集中的な閲覧の影響を受けない理由について説明する。 Finally, the reason for not being influenced by the intensive browsing of some users by calculating the browsing popularity considering the browsing freshness will be described.

図１２は、第２の実施の形態における効果を説明する説明図である。ここで、ある検索要求に対する検索結果に、コンテンツＡとコンテンツＢとが含まれていたとする。また、閲覧履歴検索部３３が閲覧履歴格納部３０２を検索した過去の閲覧履歴は、図１２に示すように、コンテンツＡとコンテンツＢの両方共に、総閲覧数は１７であったが（図１２に示す「＋」で示す部分の総数）、各利用者の閲覧状況に大きな差があると仮定する。 FIG. 12 is an explanatory diagram for explaining the effect of the second embodiment. Here, it is assumed that content A and content B are included in a search result for a certain search request. Further, the past browsing history searched by the browsing history search unit 33 in the browsing history storage unit 302 has a total browsing number of 17 for both the content A and the content B as shown in FIG. 12 (FIG. 12). It is assumed that there is a large difference in the viewing status of each user.

コンテンツＡの場合、特定の利用者２によって連続的に閲覧されているが、他の利用者による閲覧は殆ど無いので、人気の無いコンテンツであると考えられるが、一部の利用者の悪戯による閲覧がなされたと考えることができる。一方、コンテンツＢの場合、複数の利用者が同じ頻度で比較的頻繁に閲覧されているので、このコンテンツＢは人気のあるコンテンツであり、多くの利用者により頻繁に閲覧されたと考えられる。 In the case of the content A, it is viewed continuously by a specific user 2, but since it is hardly viewed by other users, it is considered to be an unpopular content, but due to mischief of some users It can be considered that browsing was done. On the other hand, in the case of the content B, since a plurality of users are viewed relatively frequently with the same frequency, the content B is a popular content and is considered to be frequently viewed by many users.

ここで、従来の手法のように、コンテンツＡとコンテンツＢとの閲覧人気度を閲覧数で計算する場合、両者の閲覧人気度は共に１７で同じになり、閲覧人気度は本来の人気度を反映していない。一方、本実施の形態による閲覧人気度の計算によれば、図１２に示すように、コンテンツＡは０．９０，コンテンツＢは１．２７となり、実際と同じようにコンテンツＢの閲覧人気度の方が大きくなり、本来の人気度を反映した閲覧人気度を求めることができる。 Here, when the browsing popularity of the contents A and B is calculated by the number of browsing as in the conventional method, the browsing popularity of both is the same at 17, and the browsing popularity is the original popularity. Not reflected. On the other hand, according to the calculation of the browsing popularity according to the present embodiment, as shown in FIG. 12, the content A is 0.90 and the content B is 1.27. It becomes larger and it is possible to obtain the browsing popularity reflecting the original popularity.

故に、本実施の形態における閲覧人気度の計算により、一部の利用者の集中的な閲覧の影響を受けずに、実際の人気の度合いを忠実に反映した人気度が得られるので、閲覧人気度が加算された得点に基づいて順位付けた検索結果を利用者に提示することにより、利用者は人気のあるコンテンツを効率的に検索することができる。 Therefore, the calculation of the browsing popularity in this embodiment can obtain a popularity that faithfully reflects the actual popularity without being affected by the intensive browsing of some users. By presenting the search results ranked based on the scores with added degrees to the user, the user can efficiently search for popular content.

続いて、本実施の形態による閲覧人気度の計算により、一過性の集中的な閲覧の影響を受けない理由について説明する。 Subsequently, the reason why the calculation of the browsing popularity according to the present embodiment is not affected by the temporary intensive browsing will be described.

図１３は、第２の実施の形態における効果を説明する説明図である。図１２で説明した場合と同様に、ある検索要求に対する検索結果に、コンテンツＡとコンテンツＢとが含まれていたとする。また、閲覧履歴検索部３３が閲覧履歴格納部３０２を検索した過去の閲覧履歴は、図１３に示すように、コンテンツＡとコンテンツＢの両方共に、総閲覧数は１７であったが、各利用者の閲覧状況に大きな差があると仮定する。 FIG. 13 is an explanatory diagram for explaining the effect of the second embodiment. Assume that content A and content B are included in a search result for a certain search request, as in the case described with reference to FIG. In addition, as shown in FIG. 13, the past browsing history searched by the browsing history search unit 33 in the browsing history storage unit 302 has a total browsing count of 17 for both the content A and the content B. Suppose that there is a big difference in the browsing situation of the person.

コンテンツＡの場合、閲覧時期が過去のある一定期間の間に限られ、最近は殆ど閲覧されていないので、人気の無いコンテンツであると考えられるが、一過性の流行によって、一時期に限って多くの閲覧がなされたことを意味している。一方、コンテンツＢの場合、複数の利用者が同じような頻度で比較的頻繁に閲覧しているので、コンテンツＡよりも永続的に人気のあるコンテンツであり、多くの利用者が頻繁に閲覧を継続したことを意味している。 In the case of the content A, the browsing time is limited to a certain period in the past, and since it has not been browsed recently, it is considered to be an unpopular content. This means that a lot of browsing has been done. On the other hand, in the case of the content B, since a plurality of users browse relatively frequently with the same frequency, it is a content that is permanently more popular than the content A, and many users frequently view it. It means that it has continued.

ここで、従来の手法のように、コンテンツＡとコンテンツＢとの閲覧人気度を閲覧数で計算する場合、両者の閲覧人気度は共に１７で同じになり、閲覧人気度は本来の人気度を反映していない。一方、本実施の形態による閲覧人気度の計算によれば、図１３に示すように、コンテンツＡは０．０３，コンテンツＢは１．２７となり、実際と同じようにコンテンツＢの閲覧人気度の方が大きくなり、本来の人気度を反映した閲覧人気度を求めることができる。 Here, when the browsing popularity of the contents A and B is calculated by the number of browsing as in the conventional method, the browsing popularity of both is the same at 17, and the browsing popularity is the original popularity. Not reflected. On the other hand, according to the calculation of the browsing popularity according to the present embodiment, as shown in FIG. 13, the content A is 0.03 and the content B is 1.27. It becomes larger and it is possible to obtain the browsing popularity reflecting the original popularity.

故に、本実施の形態における閲覧人気度の計算により、一過性の集中的な閲覧の影響を受けずに、実際の人気の度合いをより忠実に反映した人気度が得られるので、閲覧人気度が加算された得点に基づいて順位付けた検索結果を利用者に提示することにより、利用者は人気のあるコンテンツを効率的に検索することができる。 Therefore, the calculation of the browsing popularity in the present embodiment can obtain a popularity that more faithfully reflects the actual popularity without being affected by the temporary intensive browsing. By presenting the search results ranked based on the score with the added to the user, the user can efficiently search for popular content.

本実施の形態によれば、検索語により検索されたコンテンツ識別子と同じコンテンツ識別子が閲覧された時刻から現在までの経過時間を示す閲覧新鮮度を閲覧人気度に加算するので、本来の人気の度合いを反映したコンテンツ識別子を利用者に提供することができる。 According to the present embodiment, since the browsing freshness indicating the elapsed time from the time when the same content identifier as the content identifier searched by the search word is browsed to the present is added to the browsing popularity, the original popularity degree Can be provided to the user.

本実施の形態によれば、最も新しい閲覧時刻を用いて閲覧新鮮度を算出するので、本来の人気の度合いをより確実に反映したコンテンツ識別子を利用者に提供することができる。 According to the present embodiment, since the freshness of browsing is calculated using the newest browsing time, it is possible to provide the user with a content identifier that more reliably reflects the original popularity.

〔第３の実施の形態〕
図１４は、第３の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１の構成は、閲覧履歴格納部３０２と利用者属性格納部３０３とを除き、第１の実施の形態と基本的には同様なので、ここでは重複した説明は省略する。 [Third Embodiment]
FIG. 14 is a configuration diagram illustrating a configuration of the information search system 1 according to the third embodiment. The configuration of the information search system 1 in the present embodiment is basically the same as that of the first embodiment except for the browsing history storage unit 302 and the user attribute storage unit 303, and therefore, a duplicate description is omitted here. To do.

本実施の形態における閲覧履歴格納部３０２は、データベース形式で構成されるものであって、図１５に示すように、閲覧されたコンテンツのコンテンツ識別子と、そのコンテンツが閲覧された閲覧時刻を示す閲覧要求日時と、そのコンテンツを閲覧した利用者の利用者識別子と、そのコンテンツが閲覧された閲覧時間とが、１行に１回の閲覧要求に対応する形で関連付けて格納されている。なお、この閲覧時間は、閲覧を開始した時刻から別のコンテンツの閲覧を開始した時刻、又は通信を遮断した時刻までの長さである。また、閲覧時間がある一定値を超えた場合には、利用者が端末を放置したとみなして、適当な値を閲覧時間として設定することも可能である。 The browsing history storage unit 302 in the present embodiment is configured in a database format, and as shown in FIG. 15, a browsing that indicates a content identifier of the browsed content and a browsing time when the content is browsed. The request date and time, the user identifier of the user who browsed the content, and the browse time when the content was browsed are stored in association with each other in a form corresponding to one browse request. This browsing time is the length from the time when browsing is started to the time when browsing of another content is started or when communication is interrupted. Further, when the browsing time exceeds a certain value, it is possible that the user has left the terminal and an appropriate value can be set as the browsing time.

利用者属性格納部３０３は、データベース形式で構成されるものであって、図１６に示すように、利用者の利用者識別子とその利用者の属性とが関連付けて格納されている。属性とは、例えば、性別，年齢，趣味，職業，国籍，住所などの利用者に関する固有の情報である。 The user attribute storage unit 303 is configured in a database format, and stores the user identifier of the user and the attribute of the user in association with each other as shown in FIG. The attribute is specific information about the user such as sex, age, hobby, occupation, nationality, address, and the like.

続いて、本発明の実施の形態における情報検索装置３の処理について、図１７を用いて説明する。図１７は、第３の実施の形態における情報検索装置３の処理を示すフローチャートである。 Next, processing of the information search device 3 in the embodiment of the present invention will be described with reference to FIG. FIG. 17 is a flowchart showing the processing of the information search device 3 in the third embodiment.

まず、検索語受信部３１は、ネットワーク４を介してユーザ端末２から送信された検索語と利用者の属性とを受信し、その検索語をコンテンツ検索部３２に送信する（Ｓ３０１）。なお、利用者の属性については、利用者が検索要求を発行する際に利用者に入力してもらうのでも良いし、ユーザ端末２に予め設定された属性情報を自動的に送信するようにしても良い。 First, the search word receiving unit 31 receives the search word and the user attribute transmitted from the user terminal 2 via the network 4, and transmits the search word to the content search unit 32 (S301). The user attributes may be input by the user when the user issues a search request, or attribute information set in advance in the user terminal 2 is automatically transmitted. Also good.

次に、コンテンツ検索部３２は、検索語受信部３１から送信された検索語と利用者の属性とを受け取り、コンテンツ格納部３０１からコンテンツとコンテンツ識別子とを読み出して、受け取った検索語を含む複数のコンテンツとその複数のコンテンツにそれぞれ対応する複数のコンテンツ識別子とを検索し、検索した各コンテンツの文章適合度をそれぞれ算出して、検索した各コンテンツ識別子と算出した各文章適合度とを関連付けたコンテンツ検索結果を閲覧履歴検索部３３及び適合度合成部３４に送信すると共に、利用者の属性についても閲覧履歴検索部３３及び適合度合成部３４に送信する（Ｓ３０２）。なお、文章適合度の算出方法は、第１の実施の形態と同様である。 Next, the content search unit 32 receives the search term and the user attribute transmitted from the search term receiving unit 31, reads the content and content identifier from the content storage unit 301, and includes a plurality of received search terms. The content and the plurality of content identifiers corresponding to the plurality of contents are searched, the sentence suitability of each searched content is calculated, and each searched content identifier is associated with each calculated sentence suitability The content search result is transmitted to the browsing history search unit 33 and the matching level synthesis unit 34, and the user attributes are also transmitted to the browsing history search unit 33 and the matching level synthesis unit 34 (S302). Note that the method for calculating the sentence matching degree is the same as that in the first embodiment.

続いて、閲覧履歴検索部３３は、検索語受信部３１から送信されたコンテンツ検索結果と利用者の属性とを受け取り、閲覧履歴格納部３０２から過去に閲覧されたコンテンツ識別子と閲覧要求日時と利用者識別子とを読み出して、受け取ったコンテンツ検索結果に含まれるコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、コンテンツ検索結果に含まれるコンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出して、適合度合成部３４に送信する（Ｓ３０３）。 Subsequently, the browsing history search unit 33 receives the content search result and the user attribute transmitted from the search word receiving unit 31, and the content identifier, the browsing request date and time, and the browsing request date and time used in the past from the browsing history storage unit 302. The content identifier having the same content identifier as the content identifier included in the received content search result is retrieved, and the browsing popularity indicating the frequency of browsing the content with the content identifier included in the content search result is obtained. It calculates and transmits to the fitness synthesis unit 34 (S303).

ここで、（Ｓ３０３）で説明した本実施の形態における閲覧人気度の算出方法について具体的に説明する。図１８は、第３の実施の形態における閲覧人気度の算出する処理を示すフローチャートである。 Here, the method for calculating the browsing popularity in the present embodiment described in (S303) will be specifically described. FIG. 18 is a flowchart illustrating a process of calculating the browsing popularity according to the third embodiment.

最初に、閲覧人気度の計算に用いる変数ａに０（ゼロ）の値を設定する（Ｓ３０３―１）。 First, a value of 0 (zero) is set to the variable a used for calculating the browsing popularity (S303-1).

次に、現在の時刻を取得する（Ｓ３０３―２）。 Next, the current time is acquired (S303-2).

続いて、閲覧履歴格納部３０２から過去に閲覧されたコンテンツ識別子と閲覧要求日時と利用者識別子とを読み出して、受け取ったコンテンツ検索結果に含まれるコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、検索されたコンテンツ識別子に対応する閲覧要求日時と利用者識別子とを取得する。そして、取得した各利用者識別子に対し、閲覧要求日時が最も新しいもののみを残し、第２の実施の形態で説明した図１１に示す利用者識別子と最終閲覧要求日時との関係を示すリストＡを作成する。また、リストＡを作成すると共に、利用者属性格納部３０３から利用者識別子と利用者の属性とを読み出して、受け取った利用者の属性と同じ属性を持つ利用者識別子を検索し、検索された利用者識別子と利用者の属性との関係を示すリストＢを作成する。その後、リストＡから、リストＢに含まれていない利用者識別子とこの利用者識別子に対応する閲覧時刻とを削除する（Ｓ３０３―３）。 Subsequently, the content identifier browsed in the past, the browse request date and time, and the user identifier are read from the browse history storage unit 302, and a content identifier having the same content identifier as the content identifier included in the received content search result is searched. The browsing request date and time and the user identifier corresponding to the searched content identifier are acquired. Then, for each acquired user identifier, only the one with the latest browsing request date and time is left, and the list A showing the relationship between the user identifier and the final browsing request date and time shown in FIG. 11 described in the second embodiment. Create In addition, the list A is created, and the user identifier and the user attribute are read from the user attribute storage unit 303, and the user identifier having the same attribute as the received user attribute is searched and searched. A list B indicating the relationship between user identifiers and user attributes is created. Thereafter, the user identifiers not included in the list B and the browsing time corresponding to the user identifiers are deleted from the list A (S303-3).

なお、リストＢの作成時における利用者の属性を検索する条件は、「性別のみ」，「性別の年齢」，「年齢の差が３歳以内」，「年齢と国籍」など、任意の組み合わせを設定することができる。 The search conditions for user attributes at the time of creating List B can be any combination such as “sex only”, “sex age”, “age difference within 3 years”, “age and nationality”, etc. Can be set.

これにより、コンテンツの閲覧履歴を検索する際に、利用者の属性と類似した利用者による過去の閲覧履歴のみを用いて閲覧人気度を計算するので、利用者の属性に近い利用者に人気のあるコンテンツを優先して提示することが可能となる。 As a result, when searching the browsing history of content, the browsing popularity is calculated using only the past browsing history by the user similar to the user's attribute, so it is popular with users close to the user's attribute It becomes possible to present certain content with priority.

リストＢの作成時において、受信した利用者の属性と同じ属性を持つ利用者のヒット件数、即ち、削除後におけるリストＡの利用者識別子の総数が、事前に設定された所定の閾値Ｈよりも多い場合、後述する（Ｓ３０３−６）の処理を開始し、所定の閾値Ｈよりも少ない場合、（Ｓ３０４−５）の処理を開始する（Ｓ３０４−４）。なお、所定の閾値とは、例えば、２０などの任意の値を設定することが可能である。 When creating the list B, the number of hits of users having the same attributes as the received user attributes, that is, the total number of user identifiers of the list A after deletion is greater than a predetermined threshold value H set in advance. If there are many, the processing of (S303-6) described later is started, and if it is less than the predetermined threshold H, the processing of (S304-5) is started (S304-4). Note that, for example, an arbitrary value such as 20 can be set as the predetermined threshold.

コンテンツ検索結果に含まれるコンテンツ識別子の一部と一致するコンテンツ識別子に対応する利用者識別子を閲覧履歴格納部３０２から読み出して、読み出した利用者識別子とこの利用者識別子に対応する閲覧時刻とをリストＡに追加する（Ｓ３０４−５）。 A user identifier corresponding to a content identifier that matches a part of the content identifier included in the content search result is read from the browsing history storage unit 302, and the read user identifier and the browsing time corresponding to this user identifier are listed. A is added to A (S304-5).

例えば、コンテンツ検索結果に含まれるコンテンツ識別子が、「ｈｔｔｐ：／／ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／〜ｔｏｋｋｙｏ／ｇａｉｄｅｎ／ｕｓｈｉ／ｇｕｓ．ｈｔｍｌ」の場合、このコンテンツ識別子からドメイン名「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ」のみを抽出し、このドメイン名と同じドメイン名をコンテンツ識別子とするコンテンツ識別子を閲覧履歴格納部３０２から検索する。 For example, when the content identifier included in the content search result is “http://www.goo.ne.jp/˜tokyo/gaiden/ushi/gus.html”, the domain name “www.goo. “ne.jp” is extracted, and a content identifier having the same domain name as this domain name as a content identifier is searched from the browsing history storage unit 302.

なお、ドメイン名ではなく、「／」，「．」，「〜」などの記号で区切った任意の要素の組み合わせを検索の条件とすることも可能である。例えば、「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／〜ｔｏｋｋｙｏ」，「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／〜ｔｏｋｋｙｏ／ｇａｉｄｅｎ」，「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／〜ｔｏｋｋｙｏ／ｇａｉｄｅｎ／ｕｓｈｉ」，「ｇｏｏ．ｎｅ．ｊｐ」，「ｔｏｋｋｙｏ」，「ｇｕｓ．ｈｔｍｌ」など、一部分を検索条件としてもよい。 It is also possible to use a combination of arbitrary elements separated by symbols such as “/”, “.”, “˜” instead of the domain name as a search condition. For example, “www.goo.ne.jp/˜tokyo”, “www.goo.ne.jp/˜tokyo/gaiden”, “www.goo.ne.jp/˜tokyo/gaiden/ushi”, “goo. A part of the search condition may be “ne.jp”, “tokyo”, “gus.html”, or the like.

これにより、過去の閲覧履歴の希少なコンテンツについては、コンテンツ検索結果に含まれるコンテンツ識別子と共通するコンテンツ識別子の要素を持つコンテンツに関しても検索対象とすることで、より多くの関連する閲覧検索情報を得ることができる。例えば、過去に閲覧履歴が無いコンテンツであっても、そのコンテンツとドメイン名が同じコンテンツに対する閲覧履歴が存在すれば、そのコンテンツの閲覧人気度を計算することができる。 As a result, content with a rare content in the past browsing history can also be searched for content having a content identifier element common to the content identifier included in the content search result, so that more related browsing search information can be obtained. Obtainable. For example, even if the content has no browsing history in the past, if there is a browsing history for content with the same domain name as that content, the browsing popularity of the content can be calculated.

その後、（Ｓ３０３―２）で取得した現在時刻と、（Ｓ３０３―３）又は（Ｓ３０３―５）で作成したリストＡの最終閲覧要求日時との差を計算し、各利用者がコンテンツを最後に閲覧した時点から現在までの経過時間Ｅを算出する（Ｓ３０３―６）。 After that, the difference between the current time acquired in (S303-2) and the last browse request date / time of list A created in (S303-3) or (S303-5) is calculated, and each user puts the content at the end. Elapsed time E from the point of view to the present is calculated (S303-6).

そして、各利用者識別子に対する閲覧時間を閲覧履歴格納部３０２から読み出して、読み出した閲覧時間Ｄと算出された経過時間Ｅを式（１１）に代入して閲覧新鮮度Ｓを計算し、閲覧人気度ａに加算する（Ｓ３０３―７）。 Then, the browsing time for each user identifier is read from the browsing history storage unit 302, and the browsing freshness S is calculated by substituting the read browsing time D and the calculated elapsed time E into the equation (11). Add to degree a (S303-7).

Ｓ＝ｌｏｇ（Ｄ）×ｅｘｐ（−Ｅ）・・・式（１１）
これにより、利用者の閲覧の頻度を反映した人気度であることに加え、利用者がコンテンツを閲覧した際の閲覧時間が長いほど閲覧人気度が大きくなり、逆に閲覧時間が短いほど閲覧人気度が小さくなるので、検索結果を閲覧人気度が加算された得点によって順位付けて利用者に提示することにより、利用者は人気のあるコンテンツを効率的に検索することができる。なお、式（１１）に代えて、式（１２）〜式（１５）などの任意の関数を用いることも可能である。 S = log (D) × exp (−E) (11)
As a result, in addition to the popularity reflecting the frequency of browsing by the user, the browsing popularity increases as the browsing time when the user browses the content, and conversely, the browsing popularity decreases as the browsing time decreases. Since the degree becomes small, the user can efficiently search for popular contents by ranking the search results according to the scores added with the browsing popularity and presenting them to the user. In addition, it is also possible to use arbitrary functions, such as Formula (12)-Formula (15), instead of Formula (11).

Ｓ＝Ｄ×ｅｘｐ（−Ｅ）・・・式（１２）
Ｓ＝ｃ×Ｄ×ｅｘｐ（−Ｅ）・・・式（１３）
Ｓ＝ｌｏｇ（ｃ×Ｄ）×ｅｘｐ（−Ｅ）・・・式（１４）
Ｓ＝ｆ（Ｄ）×ｆ（Ｅ）・・・式（１５）
但し、ｃは定数である。 S = D × exp (−E) (12)
S = c × D × exp (−E) (13)
S = log (c × D) × exp (−E) (14)
S = f (D) × f (E) (15)
However, c is a constant.

なお、閲覧時間Ｄを用いることなく、第２の実施の形態で説明したように、経過時間Ｅのみを用いて閲覧新鮮度Ｓを計算しても良い。 Note that the browsing freshness S may be calculated using only the elapsed time E without using the browsing time D, as described in the second embodiment.

（Ｓ３０３−６）及び（Ｓ３０３−７）の処理を、（Ｓ３０３−３）又は（Ｓ３０３−５）で検索された全ての利用者識別子に対して繰り返し行う（Ｓ３０３−８）。 The processes of (S303-6) and (S303-7) are repeated for all user identifiers searched in (S303-3) or (S303-5) (S303-8).

最終的に求められた変数ａの値をコンテンツの閲覧人気度とし、適合度合成部３４に送信する（Ｓ３０３−９）。なお、変数ａの値をそのまま閲覧人気度する代わりに、ｌｏｇ（ａ）として計算された値を閲覧人気度とすることも可能である。 The value of the variable a finally obtained is used as the browsing popularity of the content, and is transmitted to the fitness composition unit 34 (S303-9). Note that the value calculated as log (a) can be used as the browsing popularity instead of the value of the variable a as it is.

図１７に戻り、適合度合成部３４は、コンテンツ検索部３２から送信されたコンテンツ検索結果と、閲覧履歴検索部３３から送信された閲覧人気度とを受け取り、各コンテンツ識別子に対する文章適合度と閲覧人気度とを合成した得点を計算し、その得点を検索結果出力部３５に送信する（Ｓ３０４）。 Returning to FIG. 17, the matching level synthesis unit 34 receives the content search result transmitted from the content search unit 32 and the browsing popularity level transmitted from the browsing history search unit 33, and the sentence matching level and browsing for each content identifier. The score obtained by combining the popularity is calculated, and the score is transmitted to the search result output unit 35 (S304).

（Ｓ３０３）及び（Ｓ３０４）の処理を、（Ｓ３０２）で検索された全てのコンテンツ識別子に対して繰り返し行う（Ｓ３０５）。 The processes of (S303) and (S304) are repeated for all content identifiers searched in (S302) (S305).

検索結果出力部３５は、適合度合成部３４から送信された各コンテンツ識別子に対する得点を受け取り、得点の高い順番に各コンテンツ識別子を並び替えた検索結果を、ネットワーク４を介してユーザ端末２に出力する（Ｓ３０６）。 The search result output unit 35 receives the score for each content identifier transmitted from the fitness synthesis unit 34 and outputs the search result in which the content identifiers are rearranged in the order of the highest score to the user terminal 2 via the network 4. (S306).

本実施の形態によれば、受信した利用者の属性と同じ属性を持つ利用者識別子を利用者属性格納部３０３から検索し、検索された利用者識別子とその属性との関係を示すリストＢを作成し、既に作成されたリストＡからリストＢに格納されていない利用者識別子とこの利用者識別子に対応する閲覧時刻とを削除したリストＡを用いて閲覧新鮮度を算出するので、利用者の属性に近い利用者に人気のあるコンテンツを優先して提示することができる。 According to the present embodiment, a user identifier having the same attribute as the received user attribute is retrieved from the user attribute storage unit 303, and a list B indicating the relationship between the retrieved user identifier and the attribute is obtained. Since the freshness of browsing is calculated using the list A in which the created user A and the user identifiers not stored in the list B and the browsing time corresponding to the user identifier are deleted, the freshness of browsing is calculated. It is possible to preferentially present content that is popular with users close to the attribute.

本実施の形態によれば、リストＡの利用者識別子の総数が所定の閾値よりも少ない場合に、コンテンツ検索手段により検索されたコンテンツ識別子の一部と一致するコンテンツ識別子に対応する利用者識別子を閲覧履歴格納部３０２から読み出して、読み出した利用者識別子とこの利用者識別子に対応する閲覧時刻とをリストＡに追加するので、より多くの関連する閲覧情報を用いて閲覧新鮮度を計算することができる。 According to the present embodiment, when the total number of user identifiers in the list A is smaller than a predetermined threshold, the user identifier corresponding to the content identifier that matches a part of the content identifier searched by the content search means is Reading from the browsing history storage unit 302 and adding the read user identifier and the browsing time corresponding to this user identifier to the list A, so that the browsing freshness is calculated using more related browsing information Can do.

本実施の形態によれば、リストＡの各利用者識別子に対応する閲覧時間を閲覧履歴格納部３０２から読み出して、この閲覧時間を更に用いて閲覧新鮮度を計算するので、本来の人気の度合いを更に反映したコンテンツ識別子を利用者に提供することができる。 According to the present embodiment, the browsing time corresponding to each user identifier in the list A is read from the browsing history storage unit 302, and the browsing freshness is calculated by further using this browsing time. Can be provided to the user.

〔第４の実施の形態〕
第４の実施の形態では、第１〜第３の実施の形態により検索結果が利用者に提示され、その検索結果として出力されたコンテンツ識別子が利用者により選択された後、選択されたコンテンツ識別子のコンテンツを利用者に提示するまでの処理について説明する。 [Fourth Embodiment]
In the fourth embodiment, the search result is presented to the user according to the first to third embodiments, and the content identifier output as the search result is selected by the user, and then the selected content identifier is selected. The process until the content is presented to the user will be described.

図１９は、第４の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１は、ユーザ端末２と情報検索装置３とウェブサーバ５とを備えた構成であり、互いにネットワーク４を介して接続されている。 FIG. 19 is a configuration diagram illustrating a configuration of the information search system 1 according to the fourth embodiment. The information search system 1 according to the present embodiment includes a user terminal 2, an information search device 3, and a web server 5, which are connected to each other via a network 4.

情報検索装置３は、コンテンツ識別子を受信するコンテンツ識別子受信部３６と、そのコンテンツ識別子のコンテンツを取得するコンテンツ取得部３７と、取得したコンテンツの関連コンテンツを解析するコンテンツ解析部３８と、解析された関連コンテンツの閲覧履歴を検索する閲覧履歴検索部３９と、解析結果に基づいてコンテンツを加工するコンテンツ加工部４０と、加工されたコンテンツを出力するコンテンツ出力部４１とを更に備えた情報検索装置であって、第１〜第３の実施の形態で説明した閲覧履歴格納部３０２を具備している。 The information search device 3 is analyzed by a content identifier receiving unit 36 that receives a content identifier, a content acquisition unit 37 that acquires the content of the content identifier, and a content analysis unit 38 that analyzes related content of the acquired content. An information search apparatus further comprising a browsing history search unit 39 that searches a browsing history of related content, a content processing unit 40 that processes content based on an analysis result, and a content output unit 41 that outputs the processed content. Thus, the browsing history storage unit 302 described in the first to third embodiments is provided.

なお、本実施の形態で説明する閲覧履歴検索部３９の機能は、第１〜第３の実施の形態で説明した閲覧履歴検索部３３と同じ機能であり、閲覧履歴検索部３３が閲覧履歴検索部３９として動作することも可能である。 The function of the browsing history search unit 39 described in the present embodiment is the same function as the browsing history search unit 33 described in the first to third embodiments, and the browsing history search unit 33 searches the browsing history. It is also possible to operate as the unit 39.

続いて、本発明の実施の形態における情報検索装置３の処理について、図２０を用いて説明する。図２０は、第４の実施の形態における情報検索装置３の処理を示すフローチャートである。 Next, processing of the information search device 3 in the embodiment of the present invention will be described with reference to FIG. FIG. 20 is a flowchart showing the processing of the information search apparatus 3 in the fourth embodiment.

最初に、検索結果として利用者に提示された複数のコンテンツ識別子のうち、あるコンテンツ識別子が選択された場合に、コンテンツ識別子受信部３６は、ネットワーク４を介してその選択されたコンテンツ識別子を受信し、コンテンツ取得部３７に送信する（Ｓ４０１）。 First, when a certain content identifier is selected from among a plurality of content identifiers presented to the user as a search result, the content identifier receiving unit 36 receives the selected content identifier via the network 4. And transmitted to the content acquisition unit 37 (S401).

次に、コンテンツ取得部３７は、コンテンツ識別子受信部３６から送信されたコンテンツ識別子を受け取り、そのコンテンツ識別子を持つコンテンツをウェブサーバ５から取得して、コンテンツ解析部３８及びコンテンツ加工部４０に送信する（Ｓ４０２）。 Next, the content acquisition unit 37 receives the content identifier transmitted from the content identifier reception unit 36, acquires the content having the content identifier from the web server 5, and transmits it to the content analysis unit 38 and the content processing unit 40. (S402).

続いて、コンテンツ解析部３８は、コンテンツ取得部３７から送信されたコンテンツを受け取り、このコンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析して、解析した関連コンテンツの関連コンテンツ識別子を抽出し、抽出された関連コンテンツ識別子を閲覧履歴検索部３９に送信する（Ｓ４０３）。 Subsequently, the content analysis unit 38 receives the content transmitted from the content acquisition unit 37, analyzes the related content in the content directly related to the content, and extracts the related content identifier of the analyzed related content. Then, the extracted related content identifier is transmitted to the browsing history search unit 39 (S403).

ここで、関連コンテンツとは、例えば、送信されたコンテンツがウェブページであれば、そのウェブページ内のハイパーリンク先のコンテンツを意味するものであり、関連コンテンツ識別子とは、その関連コンテンツのＵＲＬを意味する。 Here, for example, if the transmitted content is a web page, the related content means a hyperlink destination content in the web page, and the related content identifier is a URL of the related content. means.

その後、閲覧履歴検索部３９は、コンテンツ解析部３８から送信された関連コンテンツ識別子を受け取り、閲覧履歴格納部３０２から過去に閲覧されたコンテンツ識別子を読み出して、受け取ったコンテンツ内の関連コンテンツ識別子と同じコンテンツ識別子を検索し、その関連コンテンツ識別子のコンテンツが閲覧された頻度を示す閲覧人気度を算出して、コンテンツ加工部４０に送信する（Ｓ４０４）。なお、本実施の形態における閲覧人気度の計算方法は、第１〜第３の実施の形態で説明したいずれの計算方法でもよい。 Thereafter, the browsing history search unit 39 receives the related content identifier transmitted from the content analysis unit 38, reads the content identifier browsed in the past from the browsing history storage unit 302, and is the same as the related content identifier in the received content. A content identifier is searched, a browsing popularity degree indicating the frequency of browsing the content of the related content identifier is calculated, and transmitted to the content processing unit 40 (S404). Note that the browsing popularity calculation method according to the present embodiment may be any of the calculation methods described in the first to third embodiments.

コンテンツ加工部４０は、コンテンツ取得部３７から送信されたコンテンツと、閲覧履歴検索部３９から送信された関連コンテンツ識別子の閲覧人気度とを受け取り、その閲覧人気度の値に基づいてコンテンツを加工する（Ｓ４０５）。 The content processing unit 40 receives the content transmitted from the content acquisition unit 37 and the browsing popularity of the related content identifier transmitted from the browsing history search unit 39, and processes the content based on the browsing popularity value. (S405).

例えば、閲覧人気度の値が高い場合には、該当するハイパーリンクのアンカーテキストの文字の大きさをより大きく加工し、閲覧人気度の値が低い場合には、より小さくように加工する。その大きさの程度は、閲覧人気度の値に比例するように変更してもよいし、所定の値からの差分を計算し、その差分がプラスの場合にはその大きさを１５ポイント，マイナスの場合には１０ポイントとしても良い。図２１の場合、「言語」及び「次へ」のアンカーテキストに対応するハイパーリンクのリンク先コンテンツの閲覧人気度は大きく、一方、「人間」についての閲覧人気度は小さい場合を示している。 For example, when the browsing popularity value is high, the character size of the anchor text of the corresponding hyperlink is processed to be larger, and when the browsing popularity value is low, it is processed to be smaller. The degree of the size may be changed so as to be proportional to the value of the browsing popularity, or a difference from a predetermined value is calculated, and when the difference is positive, the size is reduced by 15 points, minus In this case, 10 points may be used. In the case of FIG. 21, the browsing popularity of the link destination content of the hyperlink corresponding to the anchor texts of “language” and “next” is large, while the browsing popularity of “human” is small.

このように、閲覧人気度の大小に応じてアンカーテキストの表示の大きさを変えることで、利用者はより容易に人気のあるコンテンツを検索することができる。 In this way, by changing the display size of the anchor text according to the level of browsing popularity, the user can search for popular content more easily.

（Ｓ４０４）及び（Ｓ４０５）の処理を、（Ｓ４０３）で検索された全ての関連コンテンツ識別子に対して繰り返し行う（Ｓ４０６）。 The processes of (S404) and (S405) are repeated for all the related content identifiers searched in (S403) (S406).

最後に、コンテンツ出力部４１は、コンテンツ加工部４０から送信されたコンテンツを、ネットワーク４を介してユーザ端末２に出力する（Ｓ４０７）。 Finally, the content output unit 41 outputs the content transmitted from the content processing unit 40 to the user terminal 2 via the network 4 (S407).

以上、検索結果として出力されたコンテンツ識別子が利用者により選択された後に、該コンテンツを利用者に提示する場合について説明したが、本実施の形態は、検索結果として出力されたコンテンツを選択した場合に限らず、任意の経路に依って閲覧に至った場合についても同じように適用するようにしても良く、その場合であってもここで説明したのと同様の効果が得られる。 As described above, the case where the content identifier output as the search result is selected by the user and then the content is presented to the user has been described. However, in the present embodiment, the content output as the search result is selected. However, the present invention is not limited to this, and the same application may be applied to a case where browsing is reached through an arbitrary route, and even in that case, the same effect as described here can be obtained.

本実施の形態によれば、利用者によって選択されたコンテンツ識別子のコンテンツに対し、そのコンテンツに直接的に関連するコンテンツ内の関連コンテンツを解析し、解析した関連コンテンツに対する閲覧人気度を計算し、その閲覧人気度に基づいて加工したコンテンツを出力するので、出力されたコンテンツ内において人気のある関連コンテンツを容易に検索することができる。 According to the present embodiment, for the content with the content identifier selected by the user, the related content in the content directly related to the content is analyzed, the browsing popularity for the analyzed related content is calculated, Since the content processed based on the browsing popularity is output, it is possible to easily search for related content that is popular in the output content.

〔第５の実施の形態〕
図２２は、第５の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１は、第４の実施の形態で説明した構成に加えて、第３の実施の形態で説明した利用者属性格納部３０３を更に備えた構成である。 [Fifth Embodiment]
FIG. 22 is a configuration diagram showing the configuration of the information search system 1 in the fifth embodiment. The information search system 1 in the present embodiment has a configuration further including the user attribute storage unit 303 described in the third embodiment in addition to the configuration described in the fourth embodiment.

また、本実施の形態で説明する閲覧履歴格納部３０２には、第３の実施の形態で説明した構成の他に、図２３に示すように、閲覧履歴格納部３０２に格納されたコンテンツ識別子のコンテンツが閲覧される前に閲覧されていた遷移元コンテンツの遷移元コンテンツ識別子が更に格納されている。例えば、コンテンツＡを閲覧中に、コンテンツＡに表示されているハイパーリンクを辿ってコンテンツＢを閲覧した場合には、コンテンツＡが遷移元コンテンツであり、コンテンツＡのコンテンツ識別子が遷移元コンテンツ識別子となる。 In addition to the configuration described in the third embodiment, the browsing history storage unit 302 described in the present embodiment includes content identifiers stored in the browsing history storage unit 302 as shown in FIG. The transition source content identifier of the transition source content that was browsed before the content was browsed is further stored. For example, when browsing the content A while browsing the content A while tracing the hyperlink displayed on the content A, the content A is the transition source content, and the content identifier of the content A is the transition source content identifier. Become.

なお、遷移元コンテンツは、利用者が閲覧した全てのコンテンツを時間順に並べた記録から容易に取得することができる。例えば、携帯電話事業者の場合には、通信機器のログから利用者の全ての閲覧要求情報が得られるので、遷移元コンテンツの情報も得ることができる。また、ハイパーリンクによらない遷移であっても、時間的に直近に閲覧していたコンテンツを遷移元コンテンツとしても良く、更に、利用者がコンテンツ識別子を直接入力して閲覧要求を行った場合には、遷移元コンテンツを無しとしても良い。 The transition source content can be easily obtained from a record in which all the content viewed by the user is arranged in time order. For example, in the case of a mobile phone operator, all browsing request information of the user can be obtained from the log of the communication device, so that information on the transition source content can also be obtained. Moreover, even if the transition is not based on a hyperlink, the content that was browsed most recently in time may be used as the transition source content, and when the user makes a browsing request by directly entering the content identifier. May have no transition source content.

続いて、本発明の実施の形態における情報検索装置３の処理について、図２４を用いて説明する。図２４は、第５の実施の形態における情報検索装置３の処理を示すフローチャートである。 Next, processing of the information search device 3 in the embodiment of the present invention will be described with reference to FIG. FIG. 24 is a flowchart illustrating the processing of the information search device 3 in the fifth embodiment.

最初に、検索結果として利用者に提示された複数のコンテンツ識別子のうち、あるコンテンツ識別子が選択された場合に、コンテンツ識別子受信部３６は、ネットワーク４を介してその選択されたコンテンツ識別子を受信し、コンテンツ取得部３７に送信する（Ｓ５０１）。 First, when a certain content identifier is selected from among a plurality of content identifiers presented to the user as a search result, the content identifier receiving unit 36 receives the selected content identifier via the network 4. And transmitted to the content acquisition unit 37 (S501).

次に、コンテンツ取得部３７は、コンテンツ識別子受信部３６から送信されたコンテンツ識別子を受け取り、そのコンテンツ識別子を持つコンテンツをウェブサーバ５から取得して、コンテンツ解析部３８及びコンテンツ加工部４０に送信すると共に、受け取ったコンテンツ識別子をコンテンツ解析部３８に送信する（Ｓ５０２）。 Next, the content acquisition unit 37 receives the content identifier transmitted from the content identifier reception unit 36, acquires the content having the content identifier from the web server 5, and transmits it to the content analysis unit 38 and the content processing unit 40. At the same time, the received content identifier is transmitted to the content analysis unit 38 (S502).

続いて、コンテンツ解析部３８は、コンテンツ取得部３７から送信されたコンテンツとコンテンツ識別子とを受け取り、このコンテンツに直接関連するコンテンツ内の関連コンテンツを解析して、解析したコンテンツ内の関連コンテンツの関連コンテンツ識別子を抽出し、抽出したコンテンツ内の関連コンテンツ識別子と受け取ったコンテンツ識別子とを閲覧履歴検索部３９に送信する（Ｓ５０３）。 Subsequently, the content analysis unit 38 receives the content and the content identifier transmitted from the content acquisition unit 37, analyzes the related content in the content directly related to the content, and relates the related content in the analyzed content. The content identifier is extracted, and the related content identifier in the extracted content and the received content identifier are transmitted to the browsing history search unit 39 (S503).

次に、閲覧履歴検索部３９は、コンテンツ解析部３８から送信されたコンテンツ内の関連コンテンツ識別子とコンテンツ識別子とを受け取り、コンテンツに間接的に関連するコンテンツ外の関連コンテンツを解析して、解析したコンテンツ外の関連コンテンツの関連コンテンツ識別子を抽出し、抽出したコンテンツ外の関連コンテンツ識別子とこのコンテンツ外の関連コンテンツ識別子に対する重要度とをコンテンツ加工部４０に送信する（Ｓ５０４）。 Next, the browsing history search unit 39 receives the related content identifier and the content identifier in the content transmitted from the content analysis unit 38, analyzes and analyzes the related content outside the content indirectly related to the content. The related content identifier of the related content outside the content is extracted, and the extracted related content identifier outside the content and the importance with respect to the related content identifier outside the content are transmitted to the content processing unit 40 (S504).

（Ｓ５０４）におけるコンテンツ外の関連コンテンツ識別子の抽出方法とその重要度の算出方法とについて、（ｉ）受け取ったコンテンツ識別子と遷移元コンテンツ識別子とが同じコンテンツを「コンテンツ外の関連コンテンツ」として抽出する方法と、（ｉｉ）受け取ったコンテンツ識別子のコンテンツを遷移元コンテンツとするコンテンツを閲覧履歴から検索し、その中から受け取ったコンテンツ識別子のコンテンツのハイパーリンクに含まれないコンテンツを「コンテンツ外の関連コンテンツ」として抽出する方法とに分けて説明する。なお、受け取ったコンテンツ識別子は、「ＵＲＬ０」として以下説明する。 Regarding the extraction method of the related content identifier outside the content and the calculation method of the importance in (S504), (i) the content having the same content identifier and the transition source content identifier is extracted as “related content outside the content”. And (ii) searching the browsing history for content having the content of the received content identifier as the source content, and searching for content not included in the hyperlink of the content of the received content identifier from the related content outside the content "And the method of extraction as follows. The received content identifier will be described below as “URL0”.

最初に（ｉ）の方法について説明する。 First, the method (i) will be described.

閲覧履歴検索部３９は、閲覧履歴格納部３０２からコンテンツ識別子を読み出して、受け取ったコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、検索されたコンテンツ識別子に対応する遷移元コンテンツ識別子を閲覧履歴格納部３０２から読み出して、遷移元コンテンツ識別子とこの遷移元コンテンツ識別子の数とを関連付けた遷移元コンテンツ識別子のリストＣを作成する（Ｓ５０４−１ａ）。 The browsing history search unit 39 reads the content identifier from the browsing history storage unit 302, searches for a content identifier having the same content identifier as the received content identifier, and displays the transition source content identifier corresponding to the searched content identifier as the browsing history. A list C of transition source content identifiers that is read from the storage unit 302 and associates the transition source content identifiers with the number of the transition source content identifiers is created (S504-1a).

例えば、図２５（ａ）で示す遷移元コンテンツ識別子のリストＣの場合、受け取ったコンテンツ識別子「ＵＲＬ０」に対し、遷移元コンテンツ識別子が「ＵＲＬ１」の行数が５つであり、「ＵＲＬ２」の行数が７つであり、「ＵＲＬ３」の行数が２つであることを意味している。 For example, in the case of the transition source content identifier list C shown in FIG. 25A, the number of rows of the transition source content identifier “URL1” is five for the received content identifier “URL0”, and “URL2” This means that the number of lines is seven, and the number of lines of “URL3” is two.

また、（Ｓ５０４−１ａ）において、遷移元コンテンツ識別子を閲覧履歴格納部３０２から読み出す時に、更なる条件を与えることも可能である。例えば、閲覧履歴検索部３９は、利用者属性格納部３０３から利用者識別子と利用者の属性とを読み出して、ユーザ端末２から受け取った利用者の属性と同じ属性を持つ利用者識別子を検索し、検索された利用者識別子と利用者の属性との関係を示すリストＤを作成する。そして、このリストＤに含まれる利用者識別子と同じ利用者識別子を持つ遷移元コンテンツ識別子を閲覧履歴格納部３０２から読み出して、リストＣを作成する。 Further, when the transition source content identifier is read from the browsing history storage unit 302 in (S504-1a), further conditions can be given. For example, the browsing history search unit 39 reads the user identifier and the user attribute from the user attribute storage unit 303 and searches for a user identifier having the same attribute as the user attribute received from the user terminal 2. Then, a list D indicating the relationship between the searched user identifier and the user attribute is created. Then, the transition source content identifier having the same user identifier as the user identifier included in the list D is read from the browsing history storage unit 302 to create the list C.

なお、利用者識別子の検索結果の多少に応じて、リストＤの使用の可否を判断することもできる。また、リストＤの作成時に用いた利用者の属性は、（Ｓ５０１）において、利用者が検索要求を発行する際に利用者に入力してもらうのでも良いし、ユーザ端末２に予め設定された属性情報を自動的に送信するようにしても良い。更に、第３の実施の形態で用いた利用者識別子を流用しても良い。 It should be noted that whether or not the list D can be used can also be determined according to the search result of the user identifier. In addition, the user attributes used when creating the list D may be input by the user when the user issues a search request in (S501), or set in advance in the user terminal 2. The attribute information may be automatically transmitted. Furthermore, the user identifier used in the third embodiment may be used.

次に、（Ｓ５０４−１ａ）で作成した遷移元コンテンツ識別子のリストＣに含まれる各遷移元コンテンツ識別子に対し、その遷移元コンテンツ識別子と同じ遷移元コンテンツ識別子を有する遷移元コンテンツ識別子を閲覧履歴格納部３０２から検索し、検索された遷移元コンテンツ識別子に対応するコンテンツ識別子（以降、「遷移先コンテンツ識別子」と称する）とこの遷移先コンテンツ識別子の数とを関連付けた遷移先コンテンツ識別子のリストＥを作成する（Ｓ５０４−２ａ）。 Next, for each transition source content identifier included in the list C of transition source content identifiers created in (S504-1a), a transition source content identifier having the same transition source content identifier as the transition source content identifier is stored in the browsing history. A list E of transition destination content identifiers in which the content identifier corresponding to the searched transition source content identifier (hereinafter referred to as “transition destination content identifier”) and the number of the transition destination content identifiers are associated with each other is retrieved from the section 302. Create (S504-2a).

例えば、図２５（ｂ）の場合、遷移元コンテンツ識別子「ＵＲＬ１」に対し、遷移先コンテンツ識別子が「ＵＲＬ１１」の行数が３つであり、「ＵＲＬ１２」の行数が６つであることを意味している。 For example, in the case of FIG. 25B, the number of lines of the transition destination content identifier “URL11” is 3 and the number of lines of “URL12” is 6 with respect to the transition source content identifier “URL1”. I mean.

続いて、リストＣ及びリストＥを用いて、リストＥに含まれる各遷移先コンテンツ識別子の重要度を計算する（Ｓ５０４−３ａ）。 Subsequently, the importance of each transition destination content identifier included in the list E is calculated using the list C and the list E (S504-3a).

具体的には、図２５（ｃ）に示すように、例えば、遷移先コンテンツ「ＵＲＬ１１」については、遷移元コンテンツ「ＵＲＬ１」の５と遷移先コンテンツ「ＵＲＬ１１」の３とを乗算した１５と、遷移元コンテンツ「ＵＲＬ２」の７と遷移先コンテンツ「ＵＲＬ１１」の２とを乗算した１４とを加算し、加算した合計値２９を重要度とする。なお、重要度Ｇの計算方法は、行数を乗算する代わりに、式（１６）〜式（２０）などを用いることも可能である。 Specifically, as illustrated in FIG. 25C, for example, for the transition destination content “URL11”, 15 obtained by multiplying 5 of the transition source content “URL1” and 3 of the transition destination content “URL11”; 14 which is obtained by multiplying 7 of the transition source content “URL2” and 2 of the transition destination content “URL11” is added, and the added total value 29 is set as the importance. Note that the importance G can be calculated by using the equations (16) to (20) instead of multiplying the number of rows.

Ｇ＝ｌｏｇ（Ｃ）×ｌｏｇ（Ｅ）・・・式（１６）
Ｇ＝Ｃ×ｌｏｇ（Ｅ）・・・式（１７）
Ｇ＝ｌｏｇ（Ｃ）×Ｅ・・・式（１８）
Ｇ＝ｌｏｇ（Ｃ）＋ｌｏｇ（Ｅ）・・・式（１９）
Ｇ＝ｌｏｇ（Ｃ）＋Ｅ・・・式（２０）
なお、Ｃは遷移元コンテンツ識別子の行数、Ｅは遷移先コンテンツ識別子の行数である。 G = log (C) × log (E) (16)
G = C × log (E) (17)
G = log (C) × E (18)
G = log (C) + log (E) Expression (19)
G = log (C) + E (20)
Note that C is the number of rows of the transition source content identifier, and E is the number of rows of the transition destination content identifier.

そして、閲覧履歴検索部３９は、各遷移先コンテンツ識別子をコンテンツ外の関連コンテンツ識別子とし、このコンテンツ外の関連コンテンツ識別子と重要度とをコンテンツ加工部４０に送信する（Ｓ５０４−４ａ）。 Then, the browsing history search unit 39 sets each transition destination content identifier as a related content identifier outside the content, and transmits the related content identifier outside the content and the importance to the content processing unit 40 (S504-4a).

次に（ｉｉ）の方法について説明する。 Next, the method (ii) will be described.

閲覧履歴検索部３９は、閲覧履歴格納部３０２からコンテンツ識別子を読み出して、受け取ったコンテンツ識別子と同じコンテンツ識別子を有するコンテンツ識別子を検索し、検索されたコンテンツ識別子に対応する遷移元コンテンツ識別子を閲覧履歴格納部３０２から読み出して、図２６（ａ）に示すように、遷移元コンテンツ識別子とこの遷移元コンテンツ識別子の数とを関連付けたリストＣを遷移先コンテンツ識別子として作成する（Ｓ５０４−１ｂ）。なお、（ｉ）で説明した方法で用いたリストＤを使用して、遷移元コンテンツ識別子を閲覧履歴格納部３０２から読み出す時に、更なる条件を与えることも可能である。 The browsing history search unit 39 reads the content identifier from the browsing history storage unit 302, searches for a content identifier having the same content identifier as the received content identifier, and displays the transition source content identifier corresponding to the searched content identifier as the browsing history. As shown in FIG. 26A, the list C that is read from the storage unit 302 and associates the transition source content identifier with the number of the transition source content identifiers is created as the transition destination content identifier (S504-1b). It is also possible to give further conditions when reading the transition source content identifier from the browsing history storage unit 302 using the list D used in the method described in (i).

次に、図２６（ｂ）に示すように、（Ｓ５０３）で抽出されたコンテンツ内の関連コンテンツ識別子のリストＥを作成し、図２６（ｃ）に示すように、リストＣの中でリストＥに含まれていないリストＦを作成する（Ｓ５０４−２ｂ）。 Next, as shown in FIG. 26B, a list E of related content identifiers in the content extracted in (S503) is created, and as shown in FIG. A list F not included in the list is created (S504-2b).

そして、閲覧履歴検索部３９は、リストＦの各遷移先コンテンツ識別子をコンテンツ外の関連コンテンツ識別子とし、また、各遷移先コンテンツ識別子の数を重要度とて、このコンテンツ外の関連コンテンツ識別子と重要度とをコンテンツ加工部４０に送信する（Ｓ５０４−３ｂ）。 Then, the browsing history search unit 39 sets each transition destination content identifier in the list F as a related content identifier outside the content, and uses the number of each transition destination content identifier as the importance, and the related content identifier outside the content and the important content identifier. The degree is transmitted to the content processing unit 40 (S504-3b).

図２４に戻り、閲覧履歴検索部３９は、（Ｓ５０４）の処理と共に、閲覧履歴格納部３０２から過去に閲覧されたコンテンツ識別子を読み出して、受け取ったコンテンツ内の関連コンテンツ識別子と同じコンテンツ識別子を検索し、その関連コンテンツ識別子のコンテンツが閲覧された頻度を示すに対応する閲覧人気度を算出して、コンテンツ加工部４０に送信する（Ｓ５０５）。なお、第４の実施の形態と同様に、本実施の形態における閲覧人気度の計算方法は、第１〜第３の実施の形態で説明したいずれの計算方法でもよい。 Returning to FIG. 24, the browsing history search unit 39 reads the content identifier browsed in the past from the browsing history storage unit 302 and searches for the same content identifier as the related content identifier in the received content together with the processing of (S504). Then, the browsing popularity corresponding to the frequency of browsing the content of the related content identifier is calculated and transmitted to the content processing unit 40 (S505). As in the fourth embodiment, the calculation method of the browsing popularity in this embodiment may be any of the calculation methods described in the first to third embodiments.

コンテンツ加工部４０は、コンテンツ取得部３７から送信されたコンテンツと、閲覧履歴検索部３９から送信されたコンテンツ内の関連コンテンツ識別子に対する閲覧人気度とコンテンツ外の関連コンテンツ識別子及びその重要度とを受け取り、その閲覧人気度の値に基づいてコンテンツを加工すると共に、重要度に応じてコンテンツ外の関連コンテンツ識別子を加工する（Ｓ５０６）。 The content processing unit 40 receives the content transmitted from the content acquisition unit 37, the browsing popularity for the related content identifier in the content transmitted from the browsing history search unit 39, the related content identifier outside the content, and its importance. The content is processed based on the browsing popularity value, and the related content identifier outside the content is processed according to the importance (S506).

図２７の場合、「言語」及び「次へ」のアンカーテキストに対応するハイパーリンクのリンク先コンテンツの閲覧人気度は大きく、一方、「人間」についての閲覧人気度は小さい場合を示している。また、このコンテンツの下部には、別途設けたスペースにおいて、コンテンツ外の関連コンテンツの関連コンテンツ識別子とその重要度とが表示されている。コンテンツ内／コンテンツ外の関連コンテンツ識別子は、ハイパーリンクとしてコンテンツへのリンクとしても良い。また、アンカーテキストを関連コンテンツ識別子そのものとする代わりに、リンク先コンテンツのタイトル文字列等を表示するようにしてもよい。更に、それぞれのハイパーリンクの表示属性を、重要度の値に応じて変化させても良い。例えば、重要度の高い関連コンテンツのハイパーリンクは大きく、重要度の低い関連コンテンツのハイパーリンクは小さく表示するようにしてもよい。また、ハイパーリンクの表示の並び順を重要度の大小に応じて並べてもよい。例えば、ハイパーリンクを重要度の高い順に並べても良い。更に、コンテンツ外の関連コンテンツ識別子を表示する場所は、このコンテンツの右側，左側，上側でもよく、別窓に表示してもよい。 In the case of FIG. 27, the browsing popularity of the link destination content of the hyperlink corresponding to the anchor text of “language” and “next” is large, while the browsing popularity of “human” is small. Also, in the lower part of this content, the related content identifier and the importance level of the related content outside the content are displayed in a separately provided space. The related content identifier inside / outside the content may be a link to the content as a hyperlink. Further, instead of using the anchor text as the related content identifier itself, a title character string or the like of the link destination content may be displayed. Further, the display attribute of each hyperlink may be changed according to the importance value. For example, the hyperlink of related content with high importance may be large and the hyperlink of related content with low importance may be displayed small. Further, the display order of hyperlinks may be arranged according to the degree of importance. For example, hyperlinks may be arranged in descending order of importance. Furthermore, the location where the related content identifier outside the content is displayed may be on the right side, the left side, or the upper side of the content, or may be displayed in a separate window.

これにより、コンテンツＡ→コンテンツＢ→コンテンツＡ→コンテンツＣの順序でコンテンツが閲覧された場合、コンテンツＢを閲覧している利用者に対してコンテンツ外の関連コンテンツとして、コンテンツＣが表示されるので、コンテンツＡを経由することなくコンテンツＣを閲覧することができる。 As a result, when content is browsed in the order of content A → content B → content A → content C, content C is displayed as related content outside the content to the user browsing the content B. The content C can be browsed without going through the content A.

また、コンテンツＡ→コンテンツＢの順序で閲覧された場合、ある時にコンテンツＡ内のコンテンツＢへのハイパーリンクが削除された場合であっても、コンテンツ外の関連コンテンツとしてコンテンツＢが提示されるので、利用者はコンテンツＡにあるハイパーリンクに存在しないコンテンツＢを閲覧することができる。 In addition, when browsing in the order of content A → content B, even if a hyperlink to content B in content A is deleted at some time, content B is presented as related content outside the content. The user can browse the content B that does not exist in the hyperlink in the content A.

更に、コンテンツ外の閲覧コンテンツの重要性をその表示属性や表示順序により把握できるので、人気のあるコンテンツをより効率的に閲覧することができる。 Furthermore, since the importance of the browsing content outside the content can be grasped by the display attribute and display order, the popular content can be browsed more efficiently.

最後に、第１〜第５の実施の形態で説明した処理はプログラムにより実行され、そのプログラムは記録媒体に格納して再生可能に保管することが可能である。 Finally, the processing described in the first to fifth embodiments is executed by a program, and the program can be stored in a recording medium so as to be reproducible.

本実施の形態によれば、コンテンツに間接的に関連するコンテンツ外の関連コンテンツ識別子を更に出力するため、コンテンツの利便性を向上することができる。 According to the present embodiment, since the related content identifier outside the content indirectly related to the content is further output, the convenience of the content can be improved.

本実施の形態によれば、重要度に応じて及び／又は重要度と共に、コンテンツ外の関連コンテンツが出力されるので、人気のあるコンテンツをより効率的に閲覧することができる。 According to the present embodiment, related content outside the content is output according to the importance and / or together with the importance, so that popular content can be browsed more efficiently.

第１の実施の形態における情報検索システムの構成を示す構成図である。It is a block diagram which shows the structure of the information search system in 1st Embodiment. コンテンツ格納部に格納されたテーブルを示す図である。It is a figure which shows the table stored in the content storage part. 第１の実施の形態における閲覧履歴格納部に格納された閲覧履歴を示す図である。It is a figure which shows the browsing history stored in the browsing history storage part in 1st Embodiment. 第１の実施の形態における情報検索装置の処理を示すフローチャートである。It is a flowchart which shows the process of the information search device in 1st Embodiment. 第１の実施の形態における検索結果の一例を示す図である。It is a figure which shows an example of the search result in 1st Embodiment. 第１の実施の形態における効果を説明する説明図である。It is explanatory drawing explaining the effect in 1st Embodiment. 第２の実施の形態における情報検索システムの構成を示す構成図である。It is a block diagram which shows the structure of the information search system in 2nd Embodiment. 第２の実施の形態における閲覧履歴格納部に格納された閲覧履歴を示す図である。It is a figure which shows the browsing history stored in the browsing history storage part in 2nd Embodiment. 第２の実施の形態における情報検索装置の処理を示すフローチャートである。It is a flowchart which shows the process of the information search device in 2nd Embodiment. 第２の実施の形態における閲覧人気度を算出する処理を示すフローチャートである。It is a flowchart which shows the process which calculates the browsing popularity in 2nd Embodiment. 利用者識別子と最終閲覧要求日時との関係を示すリストである。It is a list | wrist which shows the relationship between a user identifier and the last browsing request date. 第２の実施の形態における効果を説明する説明図である。It is explanatory drawing explaining the effect in 2nd Embodiment. 第２の実施の形態における効果を説明する説明図である。It is explanatory drawing explaining the effect in 2nd Embodiment. 第３の実施の形態における情報検索システムの構成を示す構成図である。It is a block diagram which shows the structure of the information search system in 3rd Embodiment. 第２の実施の形態における閲覧履歴格納部に格納された閲覧履歴を示す図である。It is a figure which shows the browsing history stored in the browsing history storage part in 2nd Embodiment. 利用者属性格納部に格納されたテーブルを示す図である。It is a figure which shows the table stored in the user attribute storage part. 第３の実施の形態における情報検索装置３の処理を示すフローチャートである。It is a flowchart which shows the process of the information search device 3 in 3rd Embodiment. 第３の実施の形態における閲覧人気度の算出する処理を示すフローチャートである。It is a flowchart which shows the process which calculates the browsing popularity in 3rd Embodiment. 第４の実施の形態における情報検索システムの構成を示す構成図である。It is a block diagram which shows the structure of the information search system in 4th Embodiment. 第４の実施の形態における情報検索装置の処理を示すフローチャートである。It is a flowchart which shows the process of the information search device in 4th Embodiment. 第４の実施の形態における加工前及び加工後のコンテンツの一例を示す図である。It is a figure which shows an example of the content before and after a process in 4th Embodiment. 第５の実施の形態における情報検索システム１の構成を示す構成図である。It is a block diagram which shows the structure of the information search system 1 in 5th Embodiment. 第５の実施の形態における閲覧履歴格納部に格納された閲覧履歴を示す図である。It is a figure which shows the browsing history stored in the browsing history storage part in 5th Embodiment. 第５の実施の形態における情報検索装置３の処理を示すフローチャートである。It is a flowchart which shows the process of the information search device 3 in 5th Embodiment. （ｉ）の方法におけるコンテンツ外の関連コンテンツを抽出する流れを示す図である。It is a figure which shows the flow which extracts the related content outside a content in the method of (i). （ｉｉ）の方法におけるコンテンツ外の関連コンテンツを抽出する流れを示す図である。It is a figure which shows the flow which extracts the related content outside the content in the method of (ii). 第５の実施の形態における加工前及び加工後のコンテンツの一例を示す図である。It is a figure which shows an example of the content before the process in the 5th Embodiment, and after a process.

Explanation of symbols

１…情報検索システム
２…ユーザ端末
３…情報検索装置
４…ネットワーク
５…ウェブサーバ
３１…検索語受信部
３２…コンテンツ検索部
３３…閲覧履歴検索部
３４…適合度合成部
３５…検索結果出力部
３６…コンテンツ識別子受信部
３７…コンテンツ取得部
３８…コンテンツ解析部
３９…閲覧履歴検索部
４０…コンテンツ加工部
４１…コンテンツ出力部
３０１…コンテンツ格納部
３０２…閲覧履歴ログ，閲覧履歴格納部
３０３…利用者属性格納部 DESCRIPTION OF SYMBOLS 1 ... Information search system 2 ... User terminal 3 ... Information search device 4 ... Network 5 ... Web server 31 ... Search word receiving part 32 ... Content search part 33 ... Browsing history search part 34 ... Conformity synthesis part 35 ... Search result output part 36 ... Content identifier receiving unit 37 ... Content acquisition unit 38 ... Content analysis unit 39 ... Browsing history search unit 40 ... Content processing unit 41 ... Content output unit 301 ... Content storage unit 302 ... Browsing history log, browsing history storage unit 303 ... Usage Person attribute storage

Claims

Content storage means for storing the content and a first content identifier that can identify the content in association with each other;
Browsing history storage means for storing a second content identifier of the browsed content;
A search word receiving means for receiving a search word for searching for content;
The content and the first content identifier are read from the content storage means, the content including the search word and the first content identifier corresponding to the content are searched, and the frequency at which the search word appears in the content is determined. A content search means for calculating the sentence suitability shown,
The second content identifier is read from the browsing history storage means, a second content identifier having the same content identifier as the searched first content identifier is searched, and the frequency at which the content of the first content identifier is browsed is determined. A first browsing history search means for calculating the browsing popularity shown;
Relevance synthesis means for synthesizing the sentence relevance and the browsing popularity;
Search result output means for outputting the searched first content identifier based on the combined result;
An information retrieval apparatus comprising:

The browsing history storage means stores the second content identifier in units of rows,
The information search device according to claim 1, wherein the browsing popularity is calculated using the number of rows of the second content identifier having the same content identifier as the searched first content identifier.

The fitness synthesis means calculates a score obtained by synthesizing the text fitness and the browsing popularity,
The information search device according to claim 1 or 2, wherein the search result output means outputs the first content identifiers in the order of the highest score.

The browsing history storage means stores the second content identifier in association with the browsing time when the content of the second content identifier was browsed,
The first browsing history search means includes:
After searching for the second content identifier, calculation means for reading the browsing time corresponding to the second content identifier from the browsing history storage means and calculating a freshness of browsing indicating an elapsed time from the browsing time to the present When,
Adding means for adding the browsing freshness to the browsing popularity;
The information search apparatus according to claim 1, further comprising:

The browsing history storage means stores the second content identifier, the browsing time when the content of the second content identifier was browsed, and the user identifier of the user who browsed the content in association with each other. ,
The first browsing history search means includes:
After searching for the second content identifier, the browsing time and the user identifier corresponding to the second content identifier are read from the browsing history storage means, and a list showing the relationship between the browsing time and the user identifier Creating means for creating A;
For each user identifier in the list A, calculating means for calculating a browsing freshness indicating an elapsed time from the browsing time to the present time;
Summing up the browsing freshness for all user identifiers in the list A, and adding means for adding the total browsing freshness to the browsing popularity,
The information search apparatus according to claim 1, further comprising:

The said calculation means reads the newest browsing time from the said browsing history storage means among the browsing times corresponding to the said 2nd content identifier, and calculates the said browsing freshness degree. The information retrieval device described.

7. The information search apparatus according to claim 4, wherein the browsing freshness value is a smaller value as the elapsed time is longer, and is a larger value as the elapsed time is shorter. .

A user attribute storage means for storing the user identifier of the user and the attribute of the user in association with each other;
The browsing history search unit
When a user attribute is received, the user identifier and the attribute are read from the user attribute storage means, and a user identifier having the same attribute as the received attribute is searched, and the searched use Creating means for creating a list B indicating the relationship between the person identifier and the attribute;
Delete means for deleting a user identifier not stored in the list B from the list A and a browsing time corresponding to the user identifier;
The information search device according to claim 5, wherein the calculation unit calculates the browsing freshness using the list A after deletion.

The first browsing history search means corresponds to a content identifier that matches a part of the searched first content identifier when the total number of user identifiers in the list A after deletion is less than a predetermined threshold. 9. The information search apparatus according to claim 8, wherein a user identifier is read from the browsing history storage means, and the read user identifier and a browsing time corresponding to the user identifier are added to the list A. .

The browsing history storage means further stores a browsing time during which the content of the second content identifier was browsed,
The said calculating means reads the browsing time corresponding to each user identifier of the said list | wrist from the said browsing history storage means, and calculates the said browsing freshness further using the said browsing time. Or the information search device of 9.

Content identifier receiving means for receiving the first content identifier when the output first content identifier is selected by a user;
Content acquisition means for acquiring the content of the first content identifier;
A content analysis unit that analyzes the related content in the content directly related to the content from the acquired content and extracts the related content identifier of the related content;
The second content identifier is read from the browsing history storage means, the second content identifier having the same content identifier as the related content identifier is searched, and the browsing popularity indicating the frequency of browsing the content of the related content identifier is obtained. A second browsing history search means for calculating;
Content processing means for processing the content based on the browsing popularity;
Content output means for outputting the processed content;
The information search apparatus according to claim 1, further comprising:

The second browsing history search means calculates the browsing popularity and extracts a related content identifier of related content outside the content indirectly related to the content,
The information search apparatus according to claim 11, wherein the content output means further outputs a related content identifier outside the content.

The browsing history storage unit further stores a transition source content identifier of the transition source content that was browsed before the content of the second content identifier stored in the browsing history storage unit was browsed. ,
The second browsing history search means includes:
The second content identifier is read from the browsing history storage means, a content identifier having the same content identifier as the received first content identifier is searched, the transition source content identifier corresponding to the content identifier and the transition source content identifier Create a transition source content identifier list that correlates with the number of
A transition source content identifier having the same transition source content identifier as the transition source content identifier in the transition source content identifier list is searched from the browsing history storage means, and the searched transition source content identifier and the number of the transition source content identifiers are obtained. Create an associated transition destination content identifier list,
Using the transition source content identifier list and the transition destination content identifier list, the importance of the transition destination content identifier is calculated,
The information search apparatus according to claim 12, wherein the transition destination content identifier is extracted as a related content identifier outside the content.

The browsing history storage unit further stores a transition source content identifier of a transition source content that was browsed before the content of the second content identifier stored in the browsing history storage unit was browsed. ,
The second browsing history search means includes:
The second content identifier is read from the browsing history storage means, a content identifier having the same content identifier as the received first content identifier is searched, the transition source content identifier corresponding to the content identifier and the transition source content identifier Create a list that associates the number of
Delete the related content identifier in the content from the transition destination content identifier list,
13. The information search according to claim 12, wherein the transition destination content identifier in the transition destination content identifier list after deletion is extracted as a related content identifier outside the content, and the number of the transition destination content identifiers is set as the importance. apparatus.

The information search device according to claim 13 or 14, wherein the content output means outputs a related content identifier outside the content according to the importance and / or together with the importance.

Browsing history storage means for storing a second content identifier of the browsed content;
Content identifier receiving means for receiving a first content identifier for desired content;
Content acquisition means for acquiring the content of the first content identifier;
A content analysis unit that analyzes the related content in the content directly related to the content from the acquired content and extracts the related content identifier of the related content;
The second content identifier is read from the browsing history storage means, the second content identifier having the same content identifier as the related content identifier is searched, and the browsing popularity indicating the frequency of browsing the content of the related content identifier is obtained. Browsing history search means to calculate,
Content processing means for processing the content based on the browsing popularity;
Content output means for outputting the processed content;
An information retrieval apparatus comprising:

The browsing history search means calculates the browsing popularity and extracts a related content identifier of related content outside the content indirectly related to the content,
The information search apparatus according to claim 16, wherein the content output means further outputs a related content identifier outside the content.

The browsing history storage unit further stores a transition source content identifier of a transition source content that was browsed before the content of the second content identifier stored in the browsing history storage unit was browsed. ,
The browsing history search means includes:
The second content identifier is read from the browsing history storage means, a content identifier having the same content identifier as the received first content identifier is searched, the transition source content identifier corresponding to the content identifier and the transition source content identifier Create a transition source content identifier list that correlates with the number of
A transition source content identifier having the same transition source content identifier as the transition source content identifier in the transition source content identifier list is searched from the browsing history storage means, and the searched transition source content identifier and the number of the transition source content identifiers are obtained. Create an associated transition destination content identifier list,
Using the transition source content identifier list and the transition destination content identifier list, the importance of the transition destination content identifier is calculated,
The information search apparatus according to claim 17, wherein the transition destination content identifier is extracted as a related content identifier outside the content.

The browsing history storage unit further stores a transition source content identifier of a transition source content that was browsed before the content of the second content identifier stored in the browsing history storage unit was browsed. ,
The browsing history search means includes:
The second content identifier is read from the browsing history storage means, a content identifier having the same content identifier as the received first content identifier is searched, the transition source content identifier corresponding to the content identifier and the transition source content identifier Create a list that associates the number of
Delete the related content identifier in the content from the transition destination content identifier list,
18. The information search according to claim 17, wherein the transition destination content identifier in the transition destination content identifier list after deletion is extracted as a related content identifier outside the content, and the number of the transition destination content identifiers is set as the importance. apparatus.

The information search device according to claim 18 or 19, wherein the content output means outputs a related content identifier outside the content according to the importance and / or together with the importance.

Associating the content with a first content identifier that can identify the content and storing the content in the content storage means;
Storing the second content identifier of the browsed content in the browsing history storage means;
Receiving a search term to search for content;
The content and the first content identifier are read from the content storage means, the content including the search word and the first content identifier corresponding to the content are searched, and the frequency at which the search word appears in the content is determined. Calculating the sentence suitability shown;
The second content identifier is read from the browsing history storage means, a second content identifier having the same content identifier as the searched first content identifier is searched, and the frequency at which the content of the first content identifier is browsed is determined. Calculating the browsing popularity shown,
Synthesizing the sentence fitness and the browsing popularity;
Outputting the retrieved first content identifier based on the synthesized result;
A method for retrieving information, comprising:

Storing the second content identifier of the browsed content in the browsing history storage means;
Receiving a first content identifier for desired content;
Obtaining the content of the first content identifier;
Analyzing the related content in the content directly related to the content from the acquired content and extracting the related content identifier of the related content;
The second content identifier is read from the browsing history storage means, the second content identifier having the same content identifier as the related content identifier is searched, and the browsing popularity indicating the frequency of browsing the content of the related content identifier is obtained. A calculating step;
Processing the content based on the browsing popularity;
Outputting the processed content;
A method for retrieving information, comprising:

A process of associating the content with a first content identifier that can identify the content and storing the content in the content storage unit;
Processing for storing the second content identifier of the browsed content in the browsing history storage means;
Receiving a search term to search for content;
The content and the first content identifier are read from the content storage means, the content including the search word and the first content identifier corresponding to the content are searched, and the frequency at which the search word appears in the content is determined. A process of calculating the sentence suitability shown,
The second content identifier is read from the browsing history storage means, a second content identifier having the same content identifier as the searched first content identifier is searched, and the frequency at which the content of the first content identifier is browsed is determined. Processing to calculate the viewing popularity shown,
A process of combining the sentence suitability and the browsing popularity;
A process of outputting the searched first content identifier based on the synthesized result;
An information search program for causing a computer to execute the above.

Processing for storing the second content identifier of the browsed content in the browsing history storage means;
Receiving a first content identifier for the desired content;
Processing for acquiring the content of the first content identifier;
Analyzing the related content in the content directly related to the content from the acquired content, and extracting the related content identifier of the related content;
The second content identifier is read from the browsing history storage means, the second content identifier having the same content identifier as the related content identifier is searched, and the browsing popularity indicating the frequency of browsing the content of the related content identifier is obtained. Processing to calculate,
Processing the content based on the browsing popularity;
A process of outputting the processed content;
An information search program for causing a computer to execute the above.

A process of associating the content with a first content identifier that can identify the content and storing the content in the content storage unit;
Processing for storing the second content identifier of the browsed content in the browsing history storage means;
Receiving a search term to search for content;
The content and the first content identifier are read from the content storage means, the content including the search word and the first content identifier corresponding to the content are searched, and the frequency at which the search word appears in the content is determined. A process of calculating the sentence suitability shown,
The second content identifier is read from the browsing history storage means, a second content identifier having the same content identifier as the searched first content identifier is searched, and the frequency at which the content of the first content identifier is browsed is determined. Processing to calculate the viewing popularity shown,
A process of combining the sentence suitability and the browsing popularity;
A process of outputting the searched first content identifier based on the synthesized result;
A computer-readable recording medium on which is recorded an information search program characterized in that the computer is executed.

Processing for storing the second content identifier of the browsed content in the browsing history storage means;
Receiving a first content identifier for the desired content;
Processing for acquiring the content of the first content identifier;
Analyzing the related content in the content directly related to the content from the acquired content, and extracting the related content identifier of the related content;
The second content identifier is read from the browsing history storage means, the second content identifier having the same content identifier as the related content identifier is searched, and the browsing popularity indicating the frequency of browsing the content of the related content identifier is obtained. Processing to calculate,
Processing the content based on the browsing popularity;
A process of outputting the processed content;
A computer-readable recording medium on which is recorded an information search program characterized in that the computer is executed.