JP4818170B2

JP4818170B2 - Information search apparatus, information search method, information search program, and computer-readable recording medium recording the information search program

Info

Publication number: JP4818170B2
Application number: JP2007067239A
Authority: JP
Inventors: 誠武藤; 浩竹野; 浩一牛島; 準二富田; 有二金田
Original assignee: エヌ・ティ・ティレゾナント株式会社
Priority date: 2007-03-15
Filing date: 2007-03-15
Publication date: 2011-11-16
Anticipated expiration: 2027-03-15
Also published as: JP2008226156A

Description

The present invention relates to a technique for searching for content.

Conventionally, information search systems provided on the Internet have accumulated content such as text and drawings in a database, retrieved from that database the content related to a search keyword given by a user, and presented it to the user as search results.

When presenting the retrieved content, such systems usually compute a text relevance between the search expression and each content item in the search results, based on the frequency of use of each search term in the search expression, and display the content in descending order of text relevance (see Non-Patent Document 1).

However, a high frequency of use of a search term does not guarantee that the content is what the user expects. Conversely, content in which the search term is used infrequently may be exactly what the user expects. Search results ordered by text relevance therefore do not match the order the user desires, and the user has to spend considerable trial and error and time before reaching the desired content.

To address this, Patent Document 1 discloses a technique that records the number of browsing requests made by users and uses this count to correct the text relevance described above, so that the content users want is displayed higher. As a result, content with many browsing requests is displayed higher, and content with few browsing requests is displayed lower.
Patent Document 1: Japanese Patent Application No. Hei 11-29501
Non-Patent Document 1: Kenji Kita, "Information Retrieval Algorithms", Kyoritsu Shuppan, 2002
Non-Patent Document 2: Dean M.T.O' Brien, "Predictive modeling of first-click behavior in web-search, proceeding of WWW2006", 2006, p.1031-1032

However, it is known that when users select displayed content to request browsing, content displayed higher tends to be selected more often and content displayed lower tends to be selected less often (see Non-Patent Document 2). Because of this user habit at browsing-request time, ordering the search targets with the technique disclosed in Patent Document 1 still produces a so-called positive feedback effect, and the order of the content displayed as search results tends to become fixed.

The present invention has been made in view of the above, and its object is to display content in an order that reflects user preferences.

The invention according to claim 1 comprises: content storage means for storing content in association with an identifier of the content; browsing history storage means for storing, in association with one another, a search term used in a content search request, the identifier of content browsed on the basis of the search result for that search term, and a display rank indicating the position at which the identifier was displayed in the search result; search means which, upon receiving a search term included in a search request, reads a plurality of contents containing the search term from the content storage means, calculates, for each of those contents, a text match score indicating the frequency with which the search term appears in it, further reads the identifiers corresponding to those contents from the content storage means, for each identifier counts the matching identifiers stored in the browsing history storage means and takes the count as a browsing request count, calculates the average of the display ranks associated with the matching identifiers stored in the browsing history storage means, obtains, as a correction browsing request count, the ratio of the number of identifiers stored in the browsing history storage means with a display rank equal to that average to the number of identifiers stored in the browsing history storage means with a display rank higher than that average, and calculates a browsing importance by dividing the browsing request count by the correction browsing request count; score combining means for calculating, for each of the identifiers read from the content storage means by the search means, a score that combines the text match score and the browsing importance; and search result output means for outputting the plurality of identifiers on the basis of the scores.

According to the present invention, upon receiving a search term included in a search request, the search means reads a plurality of contents containing the search term from the content storage means, calculates for each of those contents a text match score indicating the frequency with which the search term appears in it, further reads the identifiers corresponding to those contents from the content storage means, for each identifier counts the matching identifiers stored in the browsing history storage means and takes the count as a browsing request count, calculates the average of the display ranks associated with the matching identifiers stored in the browsing history storage means, obtains, as a correction browsing request count, the ratio of the number of stored identifiers with a display rank equal to that average to the number of stored identifiers with a display rank higher than that average, and calculates a browsing importance by dividing the browsing request count by the correction browsing request count. The score combining means then calculates, for each of the identifiers read from the content storage means by the search means, a score that combines the text match score and the browsing importance, and the search result output means outputs the identifiers on the basis of the scores. This prevents the search results from being displayed in a fixed order and allows the content identifiers to be output in an order that reflects user preferences.
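The browsing importance calculation described above can be sketched in Python as follows. This is only an illustrative reading of the text, not the definitive implementation: the `HistoryRecord` layout, the function name, rounding the average to the nearest integer rank, counting over all records passed in, using rank 1 as the default "higher" rank, and the fallbacks for empty counts are all assumptions made for this sketch.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HistoryRecord:
    """One browsing-history entry (hypothetical layout for this sketch)."""
    search_term: str
    content_id: str
    display_rank: int  # 1 is the top of the result list


def browsing_importance(history, content_id, higher_rank=1):
    """Browsing request count divided by the correction browsing request count."""
    ranks = [r.display_rank for r in history if r.content_id == content_id]
    request_count = len(ranks)
    if request_count == 0:
        return 0.0  # assumption: content that was never browsed scores 0

    # Assumption: "equal to the average" is read as the average rounded
    # to the nearest integer rank.
    average_rank = round(sum(ranks) / request_count)

    # Counts are taken over all records passed in, regardless of content.
    equal_count = sum(1 for r in history if r.display_rank == average_rank)
    higher_count = sum(1 for r in history if r.display_rank == higher_rank)
    if equal_count == 0 or higher_count == 0:
        return float(request_count)  # assumption: skip the correction

    correction_count = equal_count / higher_count
    return request_count / correction_count


# Hypothetical history: six clicks at rank 1, three clicks at rank 3.
history = (
    [HistoryRecord("goo", "A", 1)] * 6
    + [HistoryRecord("goo", "B", 3)] * 2
    + [HistoryRecord("goo", "C", 3)] * 1
)
print(browsing_importance(history, "A"))  # 6 / (6 / 6)
print(browsing_importance(history, "B"))  # 2 / (3 / 6)
```

In this hypothetical data, content B was clicked only twice, but at a rank that is clicked half as often as rank 1, so dividing by the correction count raises its importance relative to its raw click count.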

The invention according to claim 2 is the invention according to claim 1, wherein the search means counts the search terms stored in the browsing history storage means that match the search term included in the search request and takes the count as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count and further dividing the result by the search term usage count.

According to the present invention, the search means counts the search terms stored in the browsing history storage means that match the search term included in the search request and takes the count as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count and further dividing the result by the search term usage count. This normalizes the bias in the magnitude of the browsing request count caused by differences between search terms. That is, whatever search term is given, the value obtained by dividing the browsing request count by the correction browsing request count can be kept within a fixed range.
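A minimal sketch of this normalization, assuming the three counts have already been obtained, might look like the following. The function names, the zero-count fallback, and the example numbers are hypothetical and chosen only to show the effect.

```python
def search_term_usage_count(history_terms, search_term):
    """Number of history entries recorded for exactly this search term."""
    return sum(1 for term in history_terms if term == search_term)


def normalized_browsing_importance(request_count, correction_count, usage_count):
    """(request count / correction count) / search term usage count."""
    if correction_count == 0 or usage_count == 0:
        return 0.0  # assumption: no usable history yields importance 0
    return request_count / correction_count / usage_count


# Hypothetical counts: a popular term and a rare term with the same click share.
popular = normalized_browsing_importance(600, 0.5, 1000)
rare = normalized_browsing_importance(6, 0.5, 10)
print(popular, rare)
```

Although the raw browsing request counts differ by a factor of 100, both values come out the same after dividing by the usage count, which is the kind of bias between search terms that the normalization is meant to remove.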

The invention according to claim 3 is the invention according to claim 1 or 2, wherein the display rank higher than the average is first place.

According to the present invention, because the display rank higher than the average is first place, the search results are more reliably prevented from being displayed in a fixed order, and the content identifiers can be output in an order that better reflects user preferences.

The invention according to claim 4 comprises: a first step of storing content in content storage means in association with an identifier of the content; a second step of storing in browsing history storage means, in association with one another, a search term used in a content search request, the identifier of content browsed on the basis of the search result for that search term, and a display rank indicating the position at which the identifier was displayed in the search result; a third step of, upon receiving a search term included in a search request, reading a plurality of contents containing the search term from the content storage means, calculating, for each of those contents, a text match score indicating the frequency with which the search term appears in it, further reading the identifiers corresponding to those contents from the content storage means, for each identifier counting the matching identifiers stored in the browsing history storage means and taking the count as a browsing request count, calculating the average of the display ranks associated with the matching identifiers stored in the browsing history storage means, obtaining, as a correction browsing request count, the ratio of the number of identifiers stored in the browsing history storage means with a display rank equal to that average to the number of identifiers stored in the browsing history storage means with a display rank higher than that average, and calculating a browsing importance by dividing the browsing request count by the correction browsing request count; a fourth step of calculating, for each of the identifiers read from the content storage means in the third step, a score that combines the text match score and the browsing importance; and a fifth step of outputting the plurality of identifiers on the basis of the scores.

The invention according to claim 5 is the invention according to claim 4, wherein the third step counts the search terms stored in the browsing history storage means that match the search term included in the search request and takes the count as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count and further dividing the result by the search term usage count.

The invention according to claim 6 is the invention according to claim 4 or 5, wherein the display rank higher than the average is first place.

The invention according to claim 7 causes a computer to execute: a first process of storing content in content storage means in association with an identifier of the content; a second process of storing in browsing history storage means, in association with one another, a search term used in a content search request, the identifier of content browsed on the basis of the search result for that search term, and a display rank indicating the position at which the identifier was displayed in the search result; a third process of, upon receiving a search term included in a search request, reading a plurality of contents containing the search term from the content storage means, calculating, for each of those contents, a text match score indicating the frequency with which the search term appears in it, further reading the identifiers corresponding to those contents from the content storage means, for each identifier counting the matching identifiers stored in the browsing history storage means and taking the count as a browsing request count, calculating the average of the display ranks associated with the matching identifiers stored in the browsing history storage means, obtaining, as a correction browsing request count, the ratio of the number of identifiers stored in the browsing history storage means with a display rank equal to that average to the number of identifiers stored in the browsing history storage means with a display rank higher than that average, and calculating a browsing importance by dividing the browsing request count by the correction browsing request count; a fourth process of calculating, for each of the identifiers read from the content storage means in the third process, a score that combines the text match score and the browsing importance; and a fifth process of outputting the plurality of identifiers on the basis of the scores.

The invention according to claim 8 is the invention according to claim 7, wherein the third process counts the search terms stored in the browsing history storage means that match the search term included in the search request and takes the count as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count and further dividing the result by the search term usage count.

The invention according to claim 9 is the invention according to claim 7 or 8, wherein the display rank higher than the average is first place.

The invention according to claim 10 causes a computer to execute: a first process of storing content in content storage means in association with an identifier of the content; a second process of storing in browsing history storage means, in association with one another, a search term used in a content search request, the identifier of content browsed on the basis of the search result for that search term, and a display rank indicating the position at which the identifier was displayed in the search result; a third process of, upon receiving a search term included in a search request, reading a plurality of contents containing the search term from the content storage means, calculating, for each of those contents, a text match score indicating the frequency with which the search term appears in it, further reading the identifiers corresponding to those contents from the content storage means, for each identifier counting the matching identifiers stored in the browsing history storage means and taking the count as a browsing request count, calculating the average of the display ranks associated with the matching identifiers stored in the browsing history storage means, obtaining, as a correction browsing request count, the ratio of the number of identifiers stored in the browsing history storage means with a display rank equal to that average to the number of identifiers stored in the browsing history storage means with a display rank higher than that average, and calculating a browsing importance by dividing the browsing request count by the correction browsing request count; a fourth process of calculating, for each of the identifiers read from the content storage means in the third process, a score that combines the text match score and the browsing importance; and a fifth process of outputting the plurality of identifiers on the basis of the scores.

The invention according to claim 11 is the invention according to claim 10, wherein the third process counts the search terms stored in the browsing history storage means that match the search term included in the search request and takes the count as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count and further dividing the result by the search term usage count.

The invention according to claim 12 is the invention according to claim 10 or 11, wherein the display rank higher than the average is first place.

According to the present invention, content can be displayed in an order that reflects user preferences.

[First Embodiment]
FIG. 1 is a block diagram showing the configuration of an information search system 1 according to the first embodiment. The information search system 1 of this embodiment comprises a user terminal 2 and an information search device 3, which are connected to each other via a network 4.

The user terminal 2 includes a monitor, a keyboard or other input means, communication means for connecting to the network 4, and the like. It accepts a search request from a user and transmits the search term included in that request, via the network 4, to the search term receiving unit 31 of the information search device 3 described later. The user terminal 2 also receives, via the network 4, the search results for the search request transmitted from the search result output unit 35 of the information search device 3, and presents them to the user by displaying them on the screen.

Furthermore, the user terminal 2 accepts a browsing request for the desired content selected by the user who has viewed the search results, and transmits the browsing request via the network 4 to the browsing request receiving unit 36 of the information search device 3. It also receives, via the network 4, the content for the browsing request transmitted from the content output unit 38 of the information search device 3, and displays that content on the screen.

The information search device 3 comprises a search term receiving unit 31, a content search unit 32, a browsing history search unit 33, a score combining unit 34, a search result output unit 35, a browsing request receiving unit 36, a browsing history update unit 37, a content output unit 38, a content storage unit 301 connected to the content search unit 32, and a browsing history storage unit 302 connected to the browsing history search unit 33. The functions of these units are described below.

The search term receiving unit 31 is connected to the network 4 and passes the search term transmitted from the user terminal 2 to the content search unit 32 and the browsing history search unit 33.

The content search unit 32 receives the search term transmitted from the search term receiving unit 31, searches the content storage unit 301 using that search term, calculates text match scores based on the search result, and transmits a search result containing the content identifiers and text match scores to the browsing history search unit 33 and the score combining unit 34. In addition, in response to a browsing request transmitted from the user terminal 2 via the browsing request receiving unit 36, the content search unit 32 looks up the location of the corresponding content in the content storage unit 301 and acquires the content stored at that location.

The browsing history search unit 33 receives the search term transmitted from the search term receiving unit 31 and the search result transmitted from the content search unit 32, calculates the browsing importance by referring to the browsing history storage unit 302, and transmits the browsing importance to the score combining unit 34.

The score combining unit 34 receives the search result transmitted from the content search unit 32 and the browsing importance transmitted from the browsing history search unit 33, calculates a final score from the text match score and the browsing importance, and transmits it to the search result output unit 35.

The search result output unit 35 receives the scores transmitted from the score combining unit 34 and outputs the content identifiers, sorted in order of score, to the user terminal 2 via the network 4.

The browsing request receiving unit 36 receives the browsing request transmitted from the user terminal 2 via the network 4 and passes it to the content search unit 32 and the browsing history update unit 37.

The browsing history update unit 37 adds the user's browsing history to the browsing history storage unit 302, thereby updating the browsing history.

The content output unit 38 receives the content acquired by the content search unit 32 and outputs it to the user terminal 2 via the network 4.

As shown in FIG. 2, the content storage unit 301 stores, in association with one another, content text information representing the content itself, a content identifier that identifies the content, and a content location where the content is stored. The content text information is the text that makes up the content; for example, if the content is a page stored on a web server, it is the text written on that page. The content identifier distinguishes one content item from the others; for example, if the content is a page stored on a web server, it is the URL (Uniform Resource Locator) of that page. The content location is the place where the content file is stored; for example, when the content is stored on a local computer, path information such as that shown in FIG. 2 is stored, and for content on the web, the URL is stored.
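As an illustration of the three associated fields, the content storage unit could be modeled roughly as below. The class and method names, the substring match used for searching, and the sample page text and file path are hypothetical; the contents of FIG. 2 are not reproduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContentRecord:
    content_id: str  # e.g. the URL of a web page
    text: str        # content text information
    location: str    # file path or URL where the content is stored


class ContentStore:
    """In-memory stand-in for the content storage unit (sketch only)."""

    def __init__(self):
        self._records = {}

    def add(self, record):
        self._records[record.content_id] = record

    def location_of(self, content_id):
        """Location used when serving a browsing request, or None if unknown."""
        record = self._records.get(content_id)
        return record.location if record else None

    def containing(self, search_term):
        """Records whose text contains the search term (plain substring match)."""
        return [r for r in self._records.values() if search_term in r.text]


store = ContentStore()
store.add(ContentRecord("www.goo.ne.jp/", "goo portal and search", "/data/contents/0001.html"))
store.add(ContentRecord("www.example.com/", "weather news", "http://www.example.com/"))
print([r.content_id for r in store.containing("goo")])
```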

The browsing history storage unit 302 stores the browsing history of past browsing requests. As shown in FIG. 3, it stores, in association with one another, a content identifier, the search term included in the search request, and a display rank indicating the position at which the content identifier was displayed in the search results. For example, the second row from the top of FIG. 3 means that, in the search results obtained for a search request with the search term "goo", a browsing request was made for "www.goo.ne.jp/", which was displayed at rank 4. Note that the browsing history storage unit 302 stores the content identifiers and display ranks keyed by search term.
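One possible way to hold such records keyed by search term is sketched below. The class and method names are hypothetical; the single sample entry is the "goo" example from the text, and the remaining rows of FIG. 3 are not reproduced.

```python
from collections import defaultdict


class BrowsingHistory:
    """In-memory stand-in for the browsing history storage unit (sketch only)."""

    def __init__(self):
        # search term -> list of (content identifier, display rank)
        self._by_term = defaultdict(list)

    def add(self, search_term, content_id, display_rank):
        """Record one browsing request, as the history update unit would."""
        self._by_term[search_term].append((content_id, display_rank))

    def entries(self, search_term):
        return list(self._by_term.get(search_term, []))

    def request_count(self, search_term, content_id):
        """How many times this content was browsed for this search term."""
        return sum(1 for cid, _ in self.entries(search_term) if cid == content_id)


history = BrowsingHistory()
history.add("goo", "www.goo.ne.jp/", 4)
print(history.entries("goo"))
```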

Note that the functions of the content search unit 32 and the browsing history search unit 33 described above may be provided by a single search unit, and the score combining unit 34 may further be incorporated into that search unit; neither arrangement affects the effects obtained.

The network 4 may be, for example, a LAN (Local Area Network), the Internet, a public line network, or a cable television network. The Internet here refers to a collection of networks interlinked on the basis of a predetermined protocol.

Next, the processing flow of the information search device 3 according to the embodiment of the present invention is described. The flow consists mainly of two stages: a search request stage, from the user making a search request to obtaining the search results, and a browsing request stage, from the user requesting the desired content based on the search results to browsing it. Each stage is described below with reference to a flowchart.

First, the processing flow of the search request stage is described with reference to FIG. 4.

First, the search term receiving unit 31 receives the search term included in the search request transmitted from the user terminal 2 via the network 4, and transmits the search term to the content search unit 32 and the browsing history search unit 33 (S101).

次に、コンテンツ検索部３２は、検索語受信部３１から送信された検索語を受信し、その検索語を含む複数のコンテンツ及びコンテンツ識別子をコンテンツ格納部３０１から読み出して、検索語が各コンテンツに出現する頻度を示す文章適合度をそれぞれ算出し、コンテンツ識別子と文章適合度とを含む検索結果を閲覧履歴検索部３３及び適合度合成部３４に送信する（Ｓ１０２）。 Next, the content search unit 32 receives the search term transmitted from the search term receiving unit 31, reads the plurality of contents containing that search term and their content identifiers from the content storage unit 301, calculates for each content a text fitness indicating how frequently the search term appears in it, and transmits a search result containing the content identifiers and text fitness values to the browsing history search unit 33 and the fitness synthesis unit 34 (S102).

ここで、文章適合度とは、検索語がコンテンツに出現する頻度を意味するものであり、例えば、ＴＦ（Term Frequency）法や、ＴＦ／ＩＤＦ（Term Frequency／Inverse Document Frequency）法などが一般的に利用されている。 Here, the text fitness represents how frequently the search term appears in the content; for example, the TF (Term Frequency) method and the TF/IDF (Term Frequency / Inverse Document Frequency) method are commonly used.
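As a rough illustration of a TF-style text fitness, the following sketch counts how often a term occurs relative to the length of the content. The function name, the whitespace tokenization, and the lowercasing are assumptions made for this example; the description does not specify how content is tokenized or normalized.

```python
from collections import Counter


def term_frequency(term: str, text: str) -> float:
    """Return the share of tokens in `text` that equal `term`.

    Whitespace tokenization and lowercasing are assumptions of this sketch;
    Japanese content would need a proper word segmenter instead.
    """
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    return Counter(tokens)[term.lower()] / len(tokens)
```

A TF/IDF variant would additionally scale this value by how rare the term is across all stored contents.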

続いて、閲覧履歴検索部３３は、検索語受信部３１から送信された検索語と、コンテンツ検索部３２から送信された検索結果とを受信して、各コンテンツ識別子に対する閲覧重要度をそれぞれ算出し、その閲覧重要度を適合度合成部３４に送信する（Ｓ１０３）。なお、閲覧重要度の算出方法については後述する。 Subsequently, the browsing history search unit 33 receives the search word transmitted from the search word receiving unit 31 and the search result transmitted from the content search unit 32, and calculates the browsing importance for each content identifier. Then, the browsing importance is transmitted to the fitness composition unit 34 (S103). The browsing importance calculation method will be described later.

適合度合成部３４は、コンテンツ検索部３２から送信された検索結果と、閲覧履歴検索部３３から送信された閲覧重要度とを受信し、文章重要度と閲覧重要度とを合成した得点を算出する（Ｓ１０４）。なお、得点の算出方法は、加算に限られるものではなく、乗算等であっても良い。 The fitness synthesis unit 34 receives the search result transmitted from the content search unit 32 and the browsing importance transmitted from the browsing history search unit 33, and calculates a score that combines the text fitness and the browsing importance (S104). The score need not be computed by addition; multiplication or another operation may be used.

（Ｓ１０３）及び（Ｓ１０４）の処理を、（Ｓ１０２）で検索された全てのコンテンツ識別子に対して繰り返し行う（Ｓ１０５）。 The processes of (S103) and (S104) are repeated for all content identifiers searched in (S102) (S105).

最後に、検索結果出力部３５は、適合度合成部３４から送信された各コンテンツ識別子に対する得点を受信し、そのコンテンツ識別子を得点が高い順番に並び替えて、ネットワーク４を介してユーザ端末２に出力する（Ｓ１０６）。 Finally, the search result output unit 35 receives the score for each content identifier transmitted from the fitness synthesis unit 34, sorts the content identifiers in descending order of score, and outputs them to the user terminal 2 via the network 4 (S106).
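Steps (S102) to (S106) can be sketched as a small scoring and sorting routine. This is a minimal sketch, not the device's actual implementation: the function name `rank_results`, the dictionary of text fitness values, and the callable that supplies the browsing importance are all hypothetical, and addition is used as the default combination only because the description names it first.

```python
from typing import Callable, Dict, List, Tuple


def rank_results(
    text_fitness: Dict[str, float],
    browsing_importance: Callable[[str], float],
    combine: Callable[[float, float], float] = lambda fit, imp: fit + imp,
) -> List[Tuple[str, float]]:
    """Score every content identifier and sort by descending score.

    text_fitness maps content identifier -> text fitness (result of S102).
    browsing_importance returns S for one content identifier (S103).
    combine merges the two values (S104); multiplication would also be allowed.
    """
    scored = [
        (content_id, combine(fit, browsing_importance(content_id)))
        for content_id, fit in text_fitness.items()  # loop of S105
    ]
    # S106: highest score first
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Passing `combine=lambda fit, imp: fit * imp` would give the multiplicative variant mentioned for (S104).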

その後、ユーザ端末２では、例えば図５に示すような検索結果が画面に出力される。 Thereafter, in the user terminal 2, for example, a search result as shown in FIG. 5 is output on the screen.

次に、（Ｓ１０３）における閲覧重要度の計算方法について説明する。図６は、閲覧重要度の計算方法を示すフローチャートである。閲覧重要度の計算は、（Ｓ１０２）で得られた各コンテンツ識別子について、閲覧履歴検索部３３が、閲覧履歴格納部３０２を参照して算出する。 Next, the browsing importance calculation method in (S103) will be described. FIG. 6 is a flowchart illustrating a browsing importance calculation method. The browsing importance level is calculated by the browsing history search unit 33 with reference to the browsing history storage unit 302 for each content identifier obtained in (S102).

最初に、閲覧履歴検索部３３は、閲覧履歴格納部３０２からコンテンツ識別子を読み出して、コンテンツ検索部３２から送信されたコンテンツ識別子に一致するコンテンツ識別子の数を算出し、そのコンテンツ識別子の数を閲覧要求回数（Ｃ）とする（Ｓ２０１）。 First, the browsing history search unit 33 reads the content identifiers from the browsing history storage unit 302, counts how many of them match the content identifier transmitted from the content search unit 32, and takes that count as the browsing request count (C) (S201).

例えば、コンテンツ検索部３２から送信されたコンテンツ識別子が「ｗｗｗ．ｇｏｏ．ｎｅ．ｊｐ／」であり、閲覧履歴格納部３０２に図３で示すコンテンツ識別子等を格納されている場合には、閲覧要求回数は３回（Ｃ＝３）となる。 For example, when the content identifier transmitted from the content search unit 32 is "www.goo.ne.jp/" and the browsing history storage unit 302 holds the content identifiers and other data shown in FIG. 3, the browsing request count is 3 (C = 3).
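The count in (S201) amounts to tallying matching rows in the browsing history. In the sketch below, the record type, the function name, and most of the sample rows are hypothetical: only the row for the search term "goo" at display rank 4 and the three ranks 4, 2, and 8 for "www.goo.ne.jp/" come from the description, since the full contents of FIG. 3 are not reproduced in the text.

```python
from typing import List, NamedTuple


class BrowseRecord(NamedTuple):
    content_id: str
    search_term: str
    display_rank: int


# Illustrative history; rows other than ("www.goo.ne.jp/", "goo", 4) are made up.
HISTORY: List[BrowseRecord] = [
    BrowseRecord("www.example.com/", "news", 1),
    BrowseRecord("www.goo.ne.jp/", "goo", 4),
    BrowseRecord("www.goo.ne.jp/", "goo", 2),
    BrowseRecord("www.goo.ne.jp/", "portal", 8),
]


def browse_request_count(history: List[BrowseRecord], content_id: str) -> int:
    """S201: number of history rows whose content identifier matches."""
    return sum(1 for record in history if record.content_id == content_id)
```

With this sample history, `browse_request_count(HISTORY, "www.goo.ne.jp/")` gives the C = 3 of the example.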

次に、閲覧履歴検索部３３は、そのコンテンツ識別子に対応する表示順位の平均値を算出し、その平均値よりも高い表示順位を有する閲覧履歴格納部３０２に格納された識別子の数に対して、その平均値に同等な表示順位を有する閲覧履歴格納部３０２に格納されたコンテンツ識別子の数の比率を求めて、その比率を補正用閲覧要求回数（Ｒ）とする（Ｓ２０２）。 Next, the browsing history search unit 33 calculates the average value of the display order corresponding to the content identifier, and the number of identifiers stored in the browsing history storage unit 302 having a display order higher than the average value. Then, the ratio of the number of content identifiers stored in the browsing history storage unit 302 having the display order equivalent to the average value is obtained, and the ratio is set as the correction browsing request count (R) (S202).

例えば、図３の場合には、平均値は（４＋２＋８）／３＝約４．６となる。ここで、小数点以下を四捨五入し、平均値を整数値である５とする。そして、この平均値である５よりも高い表示順位、例えば１位を有するコンテンツ識別子の数と、この平均値である５と同等な表示順位である５位を有するコンテンツ識別子の数とを求める。ここで、表示順位を１位とするコンテンツ識別子の数が１６０、表示順位を５位とするコンテンツ識別子の数が８０の場合、補正用閲覧要求回数は８０／１６０＝０．５（Ｒ＝０．５）となる。 For example, in the case of FIG. 3, the average is (4 + 2 + 8) / 3 = approximately 4.67. Rounding to the nearest integer gives an average of 5. Next, the number of content identifiers with a display rank higher than this average of 5 (for example, rank 1) and the number of content identifiers with a display rank equal to the average (rank 5) are obtained. If 160 content identifiers have display rank 1 and 80 have display rank 5, the correction browsing request count is 80 / 160 = 0.5 (R = 0.5).
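The arithmetic of (S202) can be checked with a short sketch. The function name and parameters are hypothetical, rank 1 is used as the "higher" reference rank as in the example, and rounding half up is an assumption about how the rounding is meant.

```python
import math
from collections import Counter
from typing import Sequence


def correction_count(
    all_ranks: Sequence[int],
    content_ranks: Sequence[int],
    reference_rank: int = 1,
) -> float:
    """S202: ratio of history rows at the content's average rank to rows at a higher rank.

    all_ranks      display ranks of every row in the browsing history
    content_ranks  display ranks of the rows for the content being scored
    reference_rank the higher rank used as the denominator (rank 1 in the example)
    """
    average = sum(content_ranks) / len(content_ranks)
    rounded = math.floor(average + 0.5)  # round half up, e.g. 4.67 -> 5
    counts = Counter(all_ranks)
    if counts[reference_rank] == 0:
        raise ValueError("no history rows at the reference rank")
    return counts[rounded] / counts[reference_rank]
```

For ranks 4, 2, and 8 and a history with 160 rows at rank 1 and 80 rows at rank 5, this returns 0.5, matching R = 0.5 above.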

続いて、閲覧履歴検索部３３は、検索語受信部３１から送信された検索語に一致する閲覧履歴格納部３０２に格納された検索語の数を求めて、検索語利用回数（Ｎ）とする（Ｓ２０３）。 Subsequently, the browsing history search unit 33 obtains the number of search terms stored in the browsing history storage unit 302 that matches the search term transmitted from the search term receiving unit 31 and sets the number of search terms used (N). (S203).

なお、（Ｓ２０１），（Ｓ２０２），（Ｓ２０３）における計算の順番は、上記に限られるものではなく、（Ｓ２０２）→（Ｓ２０１）→（Ｓ２０３）や、（Ｓ２０３）→（Ｓ２０１）→（Ｓ２０２）等、任意の順番であってもよく、何ら得られる効果に影響を与えるものではない。 Note that the order of the calculations in (S201), (S202), and (S203) is not limited to the above; any order, such as (S202) → (S201) → (S203) or (S203) → (S201) → (S202), may be used without affecting the result.

最後に、閲覧履歴検索部３３は、（Ｓ２０１）で算出された閲覧要求回数（Ｃ）と、（Ｓ２０２）で算出された補正用閲覧要求回数（Ｒ）と、（Ｓ２０３）で算出された検索語利用回数（Ｎ）とを用いて、次式に基づく閲覧重要度（Ｓ）を算出する（Ｓ２０４）。 Finally, the browsing history search unit 33 uses the browsing request count (C) calculated in (S201), the correction browsing request count (R) calculated in (S202), and the search term usage count (N) calculated in (S203) to calculate the browsing importance (S) according to the following equation (S204).

Ｓ＝Ｃ／Ｒ／Ｎ・・・式（１） S = C / R / N ... Equation (1)

式（１）で示す閲覧重要度（Ｓ）は、閲覧要求回数（Ｃ）に比例し、より多く閲覧要求されたコンテンツについてはその値が大きくなるので、ユーザの嗜好を考慮することができ、ユーザの所望するコンテンツをより上位に出力することができる。 The browsing importance (S) given by Equation (1) is proportional to the browsing request count (C), so its value is larger for content that has been requested for browsing more often. User preferences can therefore be taken into account, and the content the user wants can be output at a higher position.
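Equation (1) can be written as a one-line function. The function name is hypothetical, and the value N = 6 used in the usage note is invented for illustration, because the description gives example values for C and R but not for N.

```python
def browsing_importance(c: float, r: float, n: float) -> float:
    """Equation (1): S = C / R / N.

    c  browsing request count (C)
    r  correction browsing request count (R), must be non-zero
    n  search term usage count (N), must be non-zero
    """
    if r == 0 or n == 0:
        raise ValueError("R and N must be non-zero")
    return c / r / n
```

With C = 3 and R = 0.5 from the example above and an assumed N = 6, `browsing_importance(3, 0.5, 6)` evaluates to 1.0; doubling C doubles S, which is the proportionality noted in the text.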

また、同式における閲覧重要度（Ｓ）は、閲覧要求回数（Ｃ）を補正用閲覧要求回数（Ｒ）で除算するので、より上位に表示されたコンテンツ識別子に対する閲覧要求は閲覧重要度（Ｓ）に対してより少なく寄与し、より下位に表示されたコンテンツ識別子に対する閲覧要求はより大きく寄与することになる。即ち、同式で示す閲覧重要度（Ｓ）に基づいて順序付けがなされた検索結果が表示され、ユーザにより閲覧要求が行われることで閲覧履歴格納部３０２が更新された後に、他のユーザが同じ検索語で検索する等の複数の検索要求が行われる状況を想定した場合に、検索結果として表示されるコンテンツ識別子の順番がユーザの所望する順番ではなく、固定的に表示される要因を排除することができる。より具体的な説明は後述する。 Furthermore, because the browsing importance (S) in this equation divides the browsing request count (C) by the correction browsing request count (R), browsing requests for content identifiers displayed at higher positions contribute less to the browsing importance (S), and browsing requests for content identifiers displayed at lower positions contribute more. Consider a situation in which search results ordered by the browsing importance (S) are displayed, the browsing history storage unit 302 is updated by users' browsing requests, and further search requests follow, for example another user searching with the same search term. In that situation, the factor that would otherwise cause content identifiers to be displayed in a fixed order, rather than in the order the user wants, can be eliminated. A more concrete explanation is given later.

更に、同式における閲覧重要度（Ｓ）は、閲覧要求回数（Ｃ）を補正用閲覧要求回数（Ｒ）で除算した値を、更に検索語利用回数（Ｎ）で除算するので、検索語の違いによる閲覧要求回数（Ｃ）の大きさの偏りを正規化することが可能となる。即ち、どのような検索語が与えられた場合であっても、閲覧要求回数を補正用閲覧要求回数で除した値（Ｓ／Ｎ）の値を一定の値域に納めることが可能となる。 In addition, because the browsing importance (S) in this equation further divides the quotient of the browsing request count (C) and the correction browsing request count (R) by the search term usage count (N), differences in the magnitude of the browsing request count (C) between search terms can be normalized. That is, whatever search term is given, the value (S/N) obtained by dividing the browsing request count by the correction browsing request count can be kept within a fixed range.

なお、式（１）の変形例として、式（２）を用いることもできる。 Note that, as a modification of the formula (1), the formula (2) can also be used.

Ｓ＝Ｃ／Ｒ・・・式（２） S = C / R ... Equation (2)

また、閲覧要求回数（Ｃ），補正用閲覧要求回数（Ｒ），検索語利用回数（Ｎ）の値をそのまま用いることなく、式（３）〜式（５）で示すように、対数を用いた場合であっても、同様の効果を得ることができる。 The same effect can also be obtained when logarithms are used, as in Equations (3) to (5), instead of using the browsing request count (C), the correction browsing request count (R), and the search term usage count (N) as they are.

Ｓ＝ｌｏｇ（Ｃ）／Ｒ／Ｎ・・・式（３） S = log(C) / R / N ... Equation (3)
Ｓ＝ｌｏｇ（Ｃ）／ｌｏｇ（Ｒ）／ｌｏｇ（Ｎ）・・・式（４） S = log(C) / log(R) / log(N) ... Equation (4)
Ｓ＝Ｃ／Ｒ／ｌｏｇ（Ｎ）・・・式（５） S = C / R / log(N) ... Equation (5)

次に、閲覧要求段階の処理の流れについて、図７を用いて説明する。 Next, the flow of processing at the browsing request stage will be described with reference to FIG. 7.

まず、閲覧要求受信部３６が、ネットワーク４を介してユーザ端末２から送信された閲覧要求をコンテンツ検索部３２及び閲覧履歴更新部３７に送信し、閲覧履歴更新部３７は、その閲覧要求を閲覧履歴格納部３０２に追加して閲覧履歴を更新する（Ｓ３０１）。 First, the browsing request receiving unit 36 forwards the browsing request transmitted from the user terminal 2 via the network 4 to the content search unit 32 and the browsing history update unit 37, and the browsing history update unit 37 updates the browsing history by adding the browsing request to the browsing history storage unit 302 (S301).

次に、コンテンツ検索部３２は、閲覧要求に包含されたコンテンツ識別子に対応するコンテンツ所在場所をコンテンツ格納部３０１から検索し、検索された所在場所に格納されたコンテンツを取得する（Ｓ３０２）。 Next, the content search unit 32 searches the content storage unit 301 for the content location corresponding to the content identifier included in the browsing request, and acquires the content stored in the searched location (S302).

コンテンツ出力部３８は、コンテンツ検索部３２で取得されたコンテンツを受信して、ネットワーク４を介してユーザ端末２に出力する（Ｓ３０３）。 The content output unit 38 receives the content acquired by the content search unit 32 and outputs it to the user terminal 2 via the network 4 (S303).
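The browsing request stage (S301) to (S303) can be sketched as follows. The request tuple layout, the `locations` mapping, and the `fetch` callable are all assumptions of this example; the description only states that the request is appended to the history, the content location is looked up by content identifier, and the content at that location is returned.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical request layout: (content_id, search_term, display_rank)
BrowseRequest = Tuple[str, str, int]


def handle_browse_request(
    request: BrowseRequest,
    history: List[BrowseRequest],
    locations: Dict[str, str],
    fetch: Callable[[str], str],
) -> str:
    """Record the request, resolve the content location, and return the content."""
    history.append(request)            # S301: update the browsing history
    content_id = request[0]
    location = locations[content_id]   # S302: look up the content location
    return fetch(location)             # S302/S303: obtain the content for output
```

In a real deployment `fetch` would read from storage or a remote server; here it is left as a parameter so the sketch stays self-contained.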

最後に、式（１）〜式（５）で示す閲覧重要度（Ｓ）を用いることにより、検索結果の順番がユーザの所望する順番ではなく、固定的に表示される要因を排除することについて、図８を用いてより具体的に説明する。 Finally, the way in which using the browsing importance (S) given by Equations (1) to (5) eliminates the factor that causes search results to be displayed in a fixed order, rather than in the order the user wants, is explained more concretely with reference to FIG. 8.

例えば、４つのコンテンツが存在し、そのコンテンツにそれぞれ対応するコンテンツ識別子をＣ１，Ｃ２，Ｃ３，Ｃ４とし、ユーザが所望するコンテンツはＣ４であると仮定する。また、ユーザが、何らかの検索語を用いて検索したとし、その検索語に対するそれぞれのコンテンツの文章適合度は、図８（ａ）の初期状態に示すように、１０，９，８，７であるとする。この初期状態における各コンテンツの閲覧要求回数（Ｃ）はいずれも０件なので、算出される得点は文章適合度のみで決まり、当然ながら、検索結果の表示順位は１，２，３，４となる。 For example, suppose there are four contents with corresponding content identifiers C1, C2, C3, and C4, and that the content the user wants is C4. Suppose also that the user searches with some search term, and that the text fitness of the respective contents for that search term is 10, 9, 8, and 7, as shown in the initial state of FIG. 8(a). In this initial state the browsing request count (C) of every content is 0, so the calculated score is determined by the text fitness alone, and the display ranks of the search results are naturally 1, 2, 3, and 4.

この表示順位を有する検索結果をユーザに一定期間提示し、（Ｓ３０１）で追加される閲覧要求の数を観察した場合、ユーザが所望するＣ４に対する閲覧要求の数が最も多くなることが予想される。 When the search results having this display order are presented to the user for a certain period and the number of browsing requests added in (S301) is observed, the number of browsing requests for C4 desired by the user is expected to be the largest. .

しかしながら、検索結果は、図８（ａ）で示す順番で表示されるので、前述したように、より上位のコンテンツについては、例えば図９に示すように、自然と閲覧要求が多くなされる傾向となる。従い、実際の閲覧要求回数（Ｃ）は、例えば、１０，７，５，５となる。 However, because the search results are displayed in the order shown in FIG. 8(a), higher-ranked contents naturally tend to receive more browsing requests, as described above and as shown for example in FIG. 9. Accordingly, the actual browsing request counts (C) are, for example, 10, 7, 5, and 5.

これらの閲覧要求が閲覧履歴格納部３０２に格納された状態で、新たに同じ検索語によって検索要求がされた場合の検索結果について説明する。なお、以下では、閲覧要求回数を閲覧重要度とする場合（Ｓ＝Ｃ）と、閲覧要求回数を補正用閲覧要求回数（Ｒ）で除算した値を閲覧重要度とする場合（Ｓ＝Ｃ／Ｒ）と、で計算した場合の違いについて説明する。 The following describes the search results when a new search request is made with the same search term while these browsing requests are stored in the browsing history storage unit 302. Two cases are compared: using the browsing request count itself as the browsing importance (S = C), and using the browsing request count divided by the correction browsing request count (R) as the browsing importance (S = C/R).

Ｓ＝Ｃの場合、適合度合成部３４で算出される得点は、文章適合度と閲覧重要度とを加算した結果となるので、図８（ｂ）に示すように、２０，１６，１３，１２となり、検索結果の表示順位は１，２，３，４となる。即ち、ユーザが所望するＣ４の表示順位は、初期状態の表示順位と同じになる。つまり、検索結果は、固定的に表示されることになり、ユーザの所望するコンテンツを上位に浮上することができない。 When S = C, the score calculated by the fitness synthesis unit 34 is the sum of the text fitness and the browsing importance, so the scores are 20, 16, 13, and 12, as shown in FIG. 8(b), and the display ranks of the search results are 1, 2, 3, and 4. That is, the display rank of C4, which the user wants, is the same as in the initial state. The search results are thus displayed in a fixed order, and the content the user wants cannot rise to a higher position.

一方、Ｓ＝Ｃ／Ｒの場合、例えば、Ｒの値が図９に示すように、１位から順番に１，０．７，０．５，０．４であった場合に、Ｒの計算では検索語の差異によらずに閲覧要求の回数を集計するので、ユーザが所望するＣ４が４位にあるという現象を全体的に平準化することができる。故に、適合度合成部３４で算出される得点は、図８（ｃ）に示すように、２０，１９，１８，１９．５となり、検索結果の表示順位は１，３，４，２となる。このように、Ｓ＝Ｃ／Ｒの場合には、ユーザの所望するＣ４を初期状態の４位から２位に浮上することができ、ユーザは容易に所望のコンテンツに到達することが可能となる。 On the other hand, when S = C/R, suppose for example that the values of R are 1, 0.7, 0.5, and 0.4 in order from rank 1, as shown in FIG. 9. Because R is calculated by counting browsing requests regardless of the search term, the effect of the user's desired content C4 happening to sit at rank 4 is leveled out overall. The scores calculated by the fitness synthesis unit 34 are therefore 20, 19, 18, and 19.5, as shown in FIG. 8(c), and the display ranks of the search results are 1, 3, 4, and 2. In this way, when S = C/R, the user's desired content C4 rises from rank 4 in the initial state to rank 2, and the user can easily reach the desired content.
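The comparison between S = C and S = C/R can be reproduced numerically from the values stated above (text fitness 10, 9, 8, 7; browsing request counts 10, 7, 5, 5; R values 1, 0.7, 0.5, 0.4), using addition to combine the text fitness and the browsing importance. The variable and function names are hypothetical.

```python
from typing import Dict

TEXT_FITNESS = {"C1": 10, "C2": 9, "C3": 8, "C4": 7}
REQUESTS = {"C1": 10, "C2": 7, "C3": 5, "C4": 5}            # browsing request count C
CORRECTION = {"C1": 1.0, "C2": 0.7, "C3": 0.5, "C4": 0.4}   # correction count R


def display_ranks(scores: Dict[str, float]) -> Dict[str, int]:
    """Map each content identifier to its 1-based rank by descending score."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {content_id: ordered.index(content_id) + 1 for content_id in scores}


# Case S = C: score = text fitness + C
scores_plain = {cid: TEXT_FITNESS[cid] + REQUESTS[cid] for cid in TEXT_FITNESS}

# Case S = C / R: score = text fitness + C / R
scores_corrected = {
    cid: TEXT_FITNESS[cid] + REQUESTS[cid] / CORRECTION[cid] for cid in TEXT_FITNESS
}

ranks_plain = display_ranks(scores_plain)
ranks_corrected = display_ranks(scores_corrected)
```

In the uncorrected case C4 stays at rank 4, while in the corrected case it moves to rank 2, behind only C1. The corrected scores are floating-point values, so they may differ from 19 or 19.5 in the last digits.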

このように、閲覧重要度（Ｓ）の計算において、閲覧要求回数（Ｃ）を補正用閲覧要求回数（Ｒ）で除することにより、検索結果においてより上位に表示されたコンテンツ識別子に対する閲覧要求を相対的に低く評価し、逆に、より下位に表示されたコンテンツ識別子に対する閲覧要求を相対的に高く評価するので、表示順位の固定化を防止し、ユーザの所望コンテンツをより上位に浮上させることが可能となる。他のＣ１，Ｃ２，Ｃ３についても同様に、ユーザの所望する度合いに応じて、表示順位が浮上、又は沈下することになる。結果として、Ｓ＝Ｃ／Ｒの場合には、初期状態の如何に関わらず、検索結果の表示順位をユーザの所望する順序へと変化させることが可能となる。 In this way, in the calculation of the browsing importance (S), by dividing the number of browsing requests (C) by the number of browsing requests for correction (R), browsing requests for content identifiers displayed higher in the search results are made. Relatively low evaluation, and conversely, high evaluation of browsing requests for content identifiers displayed at lower levels prevents the display order from being fixed and raises the user's desired content to higher levels. Is possible. Similarly, the display order of other C1, C2, and C3 rises or falls depending on the degree desired by the user. As a result, in the case of S = C / R, it is possible to change the display order of the search results to the order desired by the user regardless of the initial state.

本実施の形態によれば、検索要求に包含された検索語を受信した場合に、この検索語を含む複数のコンテンツをコンテンツ格納部３０１から読み出して、この検索語が複数のコンテンツに出現する頻度を示す文章適合度をそれぞれ算出するコンテンツ検索部３２と、この複数のコンテンツにそれぞれ対応するコンテンツ識別子を受信して、このコンテンツ識別子に一致する閲覧履歴格納部３０２に格納されたコンテンツ識別子の数をそれぞれ求めて閲覧要求回数とし、このコンテンツ識別子に一致する閲覧履歴格納部３０２に格納されたコンテンツ識別子に対応する表示順位の平均値をそれぞれ算出し、この平均値よりも高い表示順位を有する閲覧履歴格納部３０２に格納されたコンテンツ識別子の数に対して、この平均値と同等な表示順位を有する閲覧履歴格納部３０２に格納されたコンテンツ識別子の数の比率をそれぞれ求めて補正用閲覧要求回数とし、閲覧要求回数を補正用閲覧要求回数で除した閲覧重要度をそれぞれ算出する閲覧履歴検索部３３と、この閲覧履歴検索部３３でコンテンツ格納部３０１から読み出された複数のコンテンツ識別子に対し、文章適合度と閲覧重要度とを合成した得点をそれぞれ算出する適合度合成部３４と、この複数のコンテンツ識別子を、この得点に基づいて出力する検索結果出力部３５とを有するので、検索結果が固定的に表示されることを防止し、ユーザの趣向を反映した順番でコンテンツのコンテンツ識別子を出力することができる。 According to the present embodiment, the information search device includes: a content search unit 32 that, when a search term included in a search request is received, reads a plurality of contents containing the search term from the content storage unit 301 and calculates, for each of them, a text fitness indicating how frequently the search term appears in it; a browsing history search unit 33 that receives the content identifiers corresponding to the plurality of contents, obtains for each the number of matching content identifiers stored in the browsing history storage unit 302 as the browsing request count, calculates the average of the display ranks associated with those matching content identifiers, obtains as the correction browsing request count the ratio of the number of stored content identifiers with a display rank equal to that average to the number of stored content identifiers with a display rank higher than that average, and calculates a browsing importance by dividing the browsing request count by the correction browsing request count; a fitness synthesis unit 34 that calculates, for each of the plurality of content identifiers read from the content storage unit 301, a score combining the text fitness and the browsing importance; and a search result output unit 35 that outputs the plurality of content identifiers on the basis of these scores. This prevents the search results from being displayed in a fixed order and allows the content identifiers to be output in an order that reflects user preferences.

本実施の形態によれば、閲覧履歴検索部３３が、検索要求に包含された検索語に一致する閲覧履歴格納部３０２に格納された検索語の数を求めて検索語利用回数とし、閲覧重要度は、閲覧要求回数を補正用閲覧要求回数で除した値を、更に検索語利用回数で除した値なので、検索語の違いによる閲覧要求回数の大きさの偏りを正規化することができる。即ち、どのような検索語が与えられた場合であっても、閲覧要求回数を補正用閲覧要求回数で除した値を一定の領域に納めることができる。 According to the present embodiment, the browsing history search unit 33 obtains the number of search terms stored in the browsing history storage unit 302 that match the search term included in the search request and takes it as the search term usage count, and the browsing importance is the browsing request count divided by the correction browsing request count and further divided by the search term usage count. Differences in the magnitude of the browsing request count between search terms can therefore be normalized. That is, whatever search term is given, the value obtained by dividing the browsing request count by the correction browsing request count can be kept within a fixed range.

本実施の形態によれば、平均値よりも高い表示順位が１位なので、検索結果が固定的に表示されることをより確実に防止し、ユーザの趣向をより反映した順番でコンテンツのコンテンツ識別子を出力することができる。 According to the present embodiment, since the display order higher than the average value is first, it is possible to more reliably prevent the search result from being displayed in a fixed manner, and the content identifiers of the contents in an order more reflecting the user's preference Can be output.

〔第２の実施の形態〕 [Second Embodiment]

図１０は、第２の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１は、第１の実施の形態と基本的には同様であり、情報検索装置３の備える構成が一部異なる構成である。 FIG. 10 is a configuration diagram showing the configuration of the information search system 1 in the second embodiment. The information search system 1 in this embodiment is basically the same as in the first embodiment, differing only in part of the configuration of the information search device 3.

情報検索装置３は、閲覧履歴一括更新部３９と、閲覧履歴検索部３３に接続された閲覧要求回数格納部３０３と、閲覧履歴更新部３７及び閲覧履歴一括更新部３９に接続された閲覧履歴一時格納部３０４とを更に備えた構成である。その他の構成については、第１の実施の形態で説明したものと同様なので、ここでは重複説明を省略する。 The information search device 3 further includes a browsing history batch update unit 39, a browsing request count storage unit 303 connected to the browsing history search unit 33, and a browsing history temporary storage unit 304 connected to the browsing history update unit 37 and the browsing history batch update unit 39. The rest of the configuration is the same as described in the first embodiment, so its description is not repeated here.

閲覧要求回数格納部３０３は、閲覧履歴検索部３３により参照され、図１１に示すように、表示順位と補正用閲覧要求回数（Ｒ）とを関連付ける第１テーブルと、検索語と検索語利用回数（Ｎ）とを関連付ける第２テーブルとが格納されている。 The browsing request count storage unit 303 is referred to by the browsing history search unit 33. As shown in FIG. 11, the first table that associates the display order with the correction browsing request count (R), the search term, and the search term use count A second table for associating (N) is stored.

閲覧履歴一時格納部３０４には、閲覧履歴更新部３７からの更新要求により、ユーザによる一定期間の閲覧要求の閲覧履歴を格納し、格納された閲覧要求は、閲覧履歴一括更新部３９により、閲覧履歴格納部３０２の閲覧履歴に反映される。 In response to update requests from the browsing history update unit 37, the browsing history temporary storage unit 304 stores the browsing history of users' browsing requests over a certain period; the stored browsing requests are then reflected in the browsing history of the browsing history storage unit 302 by the browsing history batch update unit 39.

閲覧履歴一括更新部３９は、閲覧履歴一時格納部３０４に格納された閲覧履歴を、閲覧履歴格納部３０２に反映する。 The browsing history batch update unit 39 reflects the browsing history stored in the browsing history temporary storage unit 304 in the browsing history storage unit 302.

次に、本実施の形態における情報検索装置３の処理の流れについて説明する。第１の実施の形態と同様に、検索要求段階と閲覧要求段階との２段階で構成されており、最初に検索要求段階の処理の流れについて説明する。 Next, the flow of processing of the information search device 3 in the present embodiment will be described. As in the first embodiment, the search request stage and the browse request stage are composed of two stages. First, the flow of processing in the search request stage will be described.

検索要求段階の処理の流れは、第１の実施の形態で説明した（Ｓ１０１）〜（Ｓ１０６）と基本的には同様であるが、（Ｓ１０３）における閲覧重要度の算出方法が異なるので、その算出方法について、図１２を用いて説明する。 The flow of processing at the search request stage is basically the same as (S101) to (S106) described in the first embodiment, but the browsing importance calculation method in (S103) is different. The calculation method will be described with reference to FIG.

最初に、閲覧履歴検索部３３は、閲覧履歴格納部３０２からコンテンツ識別子を読み出して、コンテンツ検索部３２から送信されたコンテンツ識別子に一致するコンテンツ識別子の数を算出し、そのコンテンツ識別子の数を閲覧要求回数（Ｃ）とする（Ｓ４０１）。 First, the browsing history search unit 33 reads the content identifiers from the browsing history storage unit 302, counts how many of them match the content identifier transmitted from the content search unit 32, and takes that count as the browsing request count (C) (S401).

次に、閲覧履歴検索部３３は、閲覧履歴格納部３０２を参照して、そのコンテンツ識別子に対応する表示順位の平均値を算出し、閲覧要求回数格納部３０３に格納された第１テーブルを参照して、その平均値に同等な表示順位に対応する補正用閲覧要求回数（Ｒ）を読み出す（Ｓ４０２）。 Next, the browsing history search unit 33 refers to the browsing history storage unit 302, calculates the average value of the display order corresponding to the content identifier, and refers to the first table stored in the browsing request count storage unit 303. Then, the correction browsing request count (R) corresponding to the display order equivalent to the average value is read (S402).

続いて、閲覧履歴検索部３３は、閲覧要求回数格納部３０３に格納された第２テーブルを参照して、検索語受信部３１から送信された検索語に一致する検索語の検索語利用回数（Ｎ）を読み出す（Ｓ４０３）。 Subsequently, the browsing history search unit 33 refers to the second table stored in the browsing request count storage unit 303 and reads out the search term usage count (N) of the search term that matches the search term transmitted from the search term receiving unit 31 (S403).

最後に、閲覧履歴検索部３３は、（Ｓ４０１）で算出された閲覧要求回数（Ｃ）と、（Ｓ４０２）で読み出した補正用閲覧要求回数（Ｒ）と、（Ｓ４０３）で読み出した検索語利用回数（Ｎ）とを用いて、第１の実施の形態に記載した式（１）に基づいて閲覧重要度（Ｓ）を算出する（Ｓ４０４）。 Finally, the browsing history search unit 33 uses the browsing request count (C) calculated in (S401), the correction browsing request count (R) read out in (S402), and the search term usage count (N) read out in (S403) to calculate the browsing importance (S) according to Equation (1) described in the first embodiment (S404).
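In the second embodiment R and N are looked up rather than recomputed, which the following sketch illustrates. The two dictionaries stand in for the first and second tables of the browsing request count storage unit 303; their contents, the function name, and the round-half-up rule are assumptions for illustration, since the values of FIG. 11 are not reproduced in the text.

```python
import math
from typing import Dict, Sequence


def importance_from_tables(
    c: int,
    content_ranks: Sequence[int],
    r_by_rank: Dict[int, float],
    n_by_term: Dict[str, int],
    search_term: str,
) -> float:
    """S402 to S404: look up R and N in precomputed tables, then apply S = C / R / N."""
    average = sum(content_ranks) / len(content_ranks)
    rounded_rank = math.floor(average + 0.5)  # round half up
    r = r_by_rank[rounded_rank]               # first table: display rank -> R
    n = n_by_term[search_term]                # second table: search term -> N
    return c / r / n
```

The only per-request work left is counting C and averaging the display ranks; both table lookups are constant-time dictionary accesses.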

次に、閲覧要求段階の処理の流れについて、図１３を用いて説明する。 Next, the flow of processing at the browsing request stage will be described with reference to FIG.

まず、閲覧要求受信部３６が、ネットワーク４を介してユーザ端末２から送信された閲覧要求をコンテンツ検索部３２及び閲覧履歴更新部３７に送信し、閲覧履歴更新部３７は、その閲覧要求を閲覧履歴一時格納部３０４に追加する（Ｓ５０１）。 First, the browsing request receiving unit 36 forwards the browsing request transmitted from the user terminal 2 via the network 4 to the content search unit 32 and the browsing history update unit 37, and the browsing history update unit 37 adds the browsing request to the browsing history temporary storage unit 304 (S501).

次に、閲覧履歴一括更新部３９は、所定の時期が経過した時に、閲覧履歴一時格納部３０４に格納された閲覧履歴を、閲覧履歴格納部３０２に追加し、閲覧履歴一時格納部３０４を空の状態にする（Ｓ５０２）。 Next, the browsing history batch update unit 39 adds the browsing history stored in the browsing history temporary storage unit 304 to the browsing history storage unit 302 when the predetermined time has elapsed, and the browsing history temporary storage unit 304 is emptied. (S502).

ここで、閲覧履歴一時格納部３０４に格納された閲覧履歴の全てを閲覧履歴格納部３０２に追加しても良いし、一部であっても良い。 Here, all of the browsing histories stored in the browsing history temporary storage unit 304 may be added to the browsing history storage unit 302 or may be a part thereof.
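The buffering in (S501) and the batch update in (S502) can be sketched with two lists. The class name, the tuple layout of a request, and the optional `keep` filter (which models adding only part of the buffered history) are assumptions of this example; when the flush is triggered is left to the caller, since the description only says it happens after a predetermined time has elapsed.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical request layout: (content_id, search_term, display_rank)
BrowseRequest = Tuple[str, str, int]


class BufferedHistory:
    """Browsing history with a temporary buffer that is flushed in batches."""

    def __init__(self) -> None:
        self.history: List[BrowseRequest] = []  # plays the role of storage unit 302
        self.pending: List[BrowseRequest] = []  # plays the role of temporary unit 304

    def record(self, request: BrowseRequest) -> None:
        """S501: add the request to the temporary store only."""
        self.pending.append(request)

    def flush(self, keep: Optional[Callable[[BrowseRequest], bool]] = None) -> None:
        """S502: move all (or a filtered part) of the buffer into the history, then empty it."""
        selected = self.pending if keep is None else [r for r in self.pending if keep(r)]
        self.history.extend(selected)
        self.pending = []
```

Until `flush` is called, new browsing requests do not influence ranking, which keeps the scores stable between batch updates.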

続いて、コンテンツ検索部３２は、閲覧要求に包含されたコンテンツ識別子に対応するコンテンツ所在場所をコンテンツ格納部３０１から検索し、検索された所在場所に格納されたコンテンツを取得する（Ｓ５０３）。 Subsequently, the content search unit 32 searches the content storage unit 301 for the content location corresponding to the content identifier included in the browsing request, and acquires the content stored in the searched location (S503).

コンテンツ出力部３８は、コンテンツ検索部３２で取得されたコンテンツを受信して、ネットワーク４を介してユーザ端末２に出力する（Ｓ５０４）。 The content output unit 38 receives the content acquired by the content search unit 32 and outputs it to the user terminal 2 via the network 4 (S504).

本実施の形態によれば、事前に提供された補正用閲覧要求回数（Ｒ）及び検索語利用回数（Ｎ）を閲覧要求回数格納部３０３を用いるので、閲覧重要度をより高速に算出することができ、検索結果をより速くユーザに提供することができる。 According to the present embodiment, the correction browsing request count (R) and the search term usage count (N) prepared in advance in the browsing request count storage unit 303 are used, so the browsing importance can be calculated faster and the search results can be provided to the user more quickly.

本実施の形態によれば、閲覧履歴一時格納部３０４及び閲覧履歴一括更新部３９を更に用いるので、閲覧履歴一時格納部３０４に格納された一定期間の閲覧履歴の一部を閲覧履歴格納部３０２に反映することができる。 According to the present embodiment, since the browsing history temporary storage unit 304 and the browsing history batch update unit 39 are further used, a part of the browsing history stored in the browsing history temporary storage unit 304 for a certain period is used as the browsing history storage unit 302. Can be reflected.

〔第３の実施の形態〕 [Third Embodiment]

図１４は、第３の実施の形態における情報検索システム１の構成を示す構成図である。本実施の形態における情報検索システム１は、第１の実施の形態と基本的には同様であり、情報検索装置３の備える構成が一部異なる構成である。 FIG. 14 is a configuration diagram showing the configuration of the information search system 1 in the third embodiment. The information search system 1 in this embodiment is basically the same as in the first embodiment, differing only in part of the configuration of the information search device 3.

情報検索装置３は、閲覧履歴検索部３３に代えて、コンテンツ格納部３０１及び閲覧履歴格納部３０２を参照可能な閲覧重要度更新部４０を更に備えた構成である。また、その他の構成については、第１の実施の形態で説明したものと同様なので、ここでは重複説明を省略する。 The information search device 3 is configured to further include a browsing importance level update unit 40 that can refer to the content storage unit 301 and the browsing history storage unit 302 in place of the browsing history search unit 33. Other configurations are the same as those described in the first embodiment, and a duplicate description is omitted here.

閲覧重要度更新部４０は、閲覧履歴格納部３０２を参照し、各コンテンツに対する閲覧重要度を算出し、コンテンツ格納部３０１に格納する。 The browsing importance level update unit 40 refers to the browsing history storage unit 302, calculates the browsing importance level for each content, and stores it in the content storage unit 301.

コンテンツ格納部３０１には、図１５に示すように、コンテンツ自身を示すコンテンツ文章情報と、このコンテンツを識別可能なコンテンツ識別子と、このコンテンツが格納されているコンテンツ所在場所と、閲覧重要度更新部４０により更新された各検索語に対する閲覧重要度とが関連付けて格納されている。 As shown in FIG. 15, the content storage unit 301 includes content text information indicating the content itself, a content identifier that can identify the content, a location where the content is stored, and a browsing importance level update unit. The browsing importance for each search word updated by 40 is stored in association with each other.

まず、検索語受信部３１は、ネットワーク４を介してユーザ端末２から送信された検索要求に包含された検索語を受信し、その検索語をコンテンツ検索部３２に送信する（Ｓ６０１）。 First, the search word receiving unit 31 receives a search word included in the search request transmitted from the user terminal 2 via the network 4, and transmits the search word to the content search unit 32 (S601).

次に、コンテンツ検索部３２は、検索語受信部３１から送信された検索語を受信し、その検索語を含む複数のコンテンツ及びコンテンツ識別子をコンテンツ格納部３０１から読み出して、検索語が各コンテンツに出現する頻度を示す文章適合度をそれぞれ算出し、コンテンツ識別子と文章適合度とを含む検索結果を適合度合成部３４に送信する（Ｓ６０２）。 Next, the content search unit 32 receives the search term transmitted from the search term receiving unit 31, reads the plurality of contents containing that search term and their content identifiers from the content storage unit 301, calculates for each content a text fitness indicating how frequently the search term appears in it, and transmits a search result containing the content identifiers and text fitness values to the fitness synthesis unit 34 (S602).

続いて、コンテンツ検索部３２は、各コンテンツ識別子に対し、送信された検索語に対応する閲覧重要度をコンテンツ格納部３０１から読み出して、その閲覧重要度を適合度合成部３４に送信する（Ｓ６０３）。 Subsequently, the content search unit 32 reads, for each content identifier, the browsing importance level corresponding to the transmitted search word from the content storage unit 301, and transmits the browsing importance level to the relevance level synthesis unit 34 (S603). ).

適合度合成部３４は、コンテンツ検索部３２から送信された検索結果及び閲覧重要度を受信し、文章重要度と閲覧重要度とを合成した得点を算出する（Ｓ６０４）。 The fitness synthesis unit 34 receives the search result and the browsing importance transmitted from the content search unit 32 and calculates a score that combines the text fitness and the browsing importance (S604).

（Ｓ６０３）及び（Ｓ６０４）の処理を、（Ｓ６０２）で検索された全てのコンテンツ識別子に対して繰り返し行う（Ｓ６０５）。 The processes of (S603) and (S604) are repeated for all content identifiers searched in (S602) (S605).

最後に、検索結果出力部３５は、適合度合成部３４から送信された各コンテンツ識別子に対する得点を受信し、そのコンテンツ識別子を得点が高い順番に並び替えて、ネットワーク４を介してユーザ端末２に出力する（Ｓ６０６）。 Finally, the search result output unit 35 receives the score for each content identifier transmitted from the fitness synthesis unit 34, sorts the content identifiers in descending order of score, and outputs them to the user terminal 2 via the network 4 (S606).

なお、閲覧重要度更新部４０は、所定の時期が経過した時点で、閲覧履歴格納部３０２を参照し、コンテンツ格納部３０１の閲覧重要度を更新する。閲覧重要度の計算方法については、第１の実施の形態で説明した計算方法と同様である。 The browsing importance level update unit 40 refers to the browsing history storage unit 302 and updates the browsing importance level of the content storage unit 301 when a predetermined time has elapsed. The calculation method of the browsing importance is the same as the calculation method described in the first embodiment.
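The third embodiment moves the importance calculation out of the request path: values are refreshed periodically and merely read at search time. The sketch below assumes a content table keyed by content identifier with a nested per-search-term importance map, history rows of the form (content identifier, search term, display rank), and a default of 0.0 when no value has been stored; none of these details are given in the description, and the `importance` callable stands in for the calculation of the first embodiment.

```python
from typing import Callable, Dict, List, Tuple

BrowseRequest = Tuple[str, str, int]  # hypothetical: (content_id, search_term, display_rank)
ImportanceFn = Callable[[str, str, List[BrowseRequest]], float]


def refresh_importance(
    content_table: Dict[str, dict],
    history: List[BrowseRequest],
    importance: ImportanceFn,
) -> None:
    """Periodic update: store S for every (content, search term) pair seen in the history."""
    pairs = {(content_id, term) for content_id, term, _rank in history}
    for content_id, term in pairs:
        per_term = content_table[content_id].setdefault("importance", {})
        per_term[term] = importance(content_id, term, history)


def lookup_importance(content_table: Dict[str, dict], content_id: str, term: str) -> float:
    """S603: read the stored value; 0.0 for unseen pairs is an assumption of this sketch."""
    return content_table[content_id].get("importance", {}).get(term, 0.0)
```

At search time only `lookup_importance` runs, so the cost per content identifier is two dictionary lookups regardless of how large the browsing history has grown.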

The flow of processing at the browsing request stage is the same as in the first embodiment, so its description is not repeated here.

According to this embodiment, because a browsing importance calculated in advance is used, the search result can be provided to the user more quickly.

A block diagram showing the configuration of the information search system in the first embodiment.
A diagram showing the table stored in the content storage unit.
A diagram showing the browsing history stored in the browsing history storage unit.
A flowchart showing the flow of processing at the search request stage.
A diagram showing an example of a search result.
A flowchart showing the method of calculating the browsing importance.
A flowchart showing the flow of processing at the browsing request stage.
A comparison diagram showing the display ranks in the initial state, when S = C, and when S = C/R.
A diagram showing an example of display ranks, browsing request counts, and correction browsing request counts.
A block diagram showing the configuration of the information search system in the second embodiment.
A diagram showing the first table and the second table stored in the browsing request count storage unit.
A flowchart showing the method of calculating the browsing importance.
A flowchart showing the flow of processing at the browsing request stage.
A block diagram showing the configuration of the information search system in the third embodiment.
A diagram showing the table stored in the content storage unit.
A flowchart showing the flow of processing at the search request stage.

Explanation of symbols

1 ... Information search system
2 ... User terminal
3 ... Information search apparatus
4 ... Network
31 ... Search term receiving unit
32 ... Content search unit
33 ... Browsing history search unit
34 ... Matching degree synthesis unit
35 ... Search result output unit
36 ... Browsing request receiving unit
37 ... Browsing history update unit
38 ... Content output unit
39 ... Browsing history batch update unit
40 ... Browsing importance update unit
301 ... Content storage unit
302 ... Browsing history storage unit
303 ... Browsing request count storage unit
304 ... Browsing history temporary storage unit

Claims

1. An information search apparatus comprising:
content storage means for storing content and an identifier of the content in association with each other;
browsing history storage means for storing, in association with one another, a search term used for a content search request, the identifier of content browsed on the basis of a search result for that search term, and a display rank indicating the rank at which the identifier was displayed as the search result;
search means for, when a search term included in a search request is received, reading a plurality of contents including the search term from the content storage means and calculating, for each of the plurality of contents, a sentence matching degree indicating how often the search term appears in that content,
further reading the identifier corresponding to each of the plurality of contents from the content storage means and determining, as a browsing request count, the number of identifiers stored in the browsing history storage means that match the identifier,
calculating the average value of the display ranks corresponding to the identifiers stored in the browsing history storage means that match the identifier, and determining, as a correction browsing request count, the number of identifiers stored in the browsing history storage means that have a display rank equivalent to the average value, relative to the number of identifiers stored in the browsing history storage means that have a display rank higher than the average value, and
calculating, for each identifier, a browsing importance obtained by dividing the browsing request count by the correction browsing request count;
matching degree synthesis means for calculating, for each of the plurality of identifiers read from the content storage means by the search means, a score obtained by combining the sentence matching degree and the browsing importance; and
search result output means for outputting the plurality of identifiers on the basis of the scores.

2. The information search apparatus according to claim 1, wherein the search means obtains, as a search term usage count, the number of search terms stored in the browsing history storage means that match the search term included in the search request, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count, further divided by the search term usage count.

3. The information search apparatus according to claim 1, wherein the display rank higher than the average value is the first rank.

4. An information search method comprising:
a first step of storing content and an identifier of the content in content storage means in association with each other;
a second step of storing in browsing history storage means, in association with one another, a search term used for a content search request, the identifier of content browsed on the basis of a search result for that search term, and a display rank indicating the rank at which the identifier was displayed as the search result;
a third step of, when a search term included in a search request is received, reading a plurality of contents including the search term from the content storage means and calculating, for each of the plurality of contents, a sentence matching degree indicating how often the search term appears in that content,
further reading the identifier corresponding to each of the plurality of contents from the content storage means and determining, as a browsing request count, the number of identifiers stored in the browsing history storage means that match the identifier,
calculating the average value of the display ranks corresponding to the identifiers stored in the browsing history storage means that match the identifier, and determining, as a correction browsing request count, the number of identifiers stored in the browsing history storage means that have a display rank equivalent to the average value, relative to the number of identifiers stored in the browsing history storage means that have a display rank higher than the average value, and
calculating, for each identifier, a browsing importance obtained by dividing the browsing request count by the correction browsing request count;
a fourth step of calculating, for each of the plurality of identifiers read from the content storage means in the third step, a score obtained by combining the sentence matching degree and the browsing importance; and
a fifth step of outputting the plurality of identifiers on the basis of the scores.

5. The information search method according to claim 4, wherein, in the third step, the number of search terms stored in the browsing history storage means that match the search term included in the search request is obtained as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count, further divided by the search term usage count.

6. The information search method according to claim 4, wherein the display rank higher than the average value is the first rank.

7. An information search program for causing a computer to execute:
a first process of storing content and an identifier of the content in content storage means in association with each other;
a second process of storing in browsing history storage means, in association with one another, a search term used for a content search request, the identifier of content browsed on the basis of a search result for that search term, and a display rank indicating the rank at which the identifier was displayed as the search result;
a third process of, when a search term included in a search request is received, reading a plurality of contents including the search term from the content storage means and calculating, for each of the plurality of contents, a sentence matching degree indicating how often the search term appears in that content,
further reading the identifier corresponding to each of the plurality of contents from the content storage means and determining, as a browsing request count, the number of identifiers stored in the browsing history storage means that match the identifier,
calculating the average value of the display ranks corresponding to the identifiers stored in the browsing history storage means that match the identifier, and determining, as a correction browsing request count, the number of identifiers stored in the browsing history storage means that have a display rank equivalent to the average value, relative to the number of identifiers stored in the browsing history storage means that have a display rank higher than the average value, and
calculating, for each identifier, a browsing importance obtained by dividing the browsing request count by the correction browsing request count;
a fourth process of calculating, for each of the plurality of identifiers read from the content storage means in the third process, a score obtained by combining the sentence matching degree and the browsing importance; and
a fifth process of outputting the plurality of identifiers on the basis of the scores.

8. The information search program according to claim 7, wherein, in the third process, the number of search terms stored in the browsing history storage means that match the search term included in the search request is obtained as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count, further divided by the search term usage count.

9. The information search program according to claim 7, wherein the display rank higher than the average value is the first rank.

10. A computer-readable recording medium on which is recorded an information search program for causing a computer to execute:
a first process of storing content and an identifier of the content in content storage means in association with each other;
a second process of storing in browsing history storage means, in association with one another, a search term used for a content search request, the identifier of content browsed on the basis of a search result for that search term, and a display rank indicating the rank at which the identifier was displayed as the search result;
a third process of, when a search term included in a search request is received, reading a plurality of contents including the search term from the content storage means and calculating, for each of the plurality of contents, a sentence matching degree indicating how often the search term appears in that content,
further reading the identifier corresponding to each of the plurality of contents from the content storage means and determining, as a browsing request count, the number of identifiers stored in the browsing history storage means that match the identifier,
calculating the average value of the display ranks corresponding to the identifiers stored in the browsing history storage means that match the identifier, and determining, as a correction browsing request count, the number of identifiers stored in the browsing history storage means that have a display rank equivalent to the average value, relative to the number of identifiers stored in the browsing history storage means that have a display rank higher than the average value, and
calculating, for each identifier, a browsing importance obtained by dividing the browsing request count by the correction browsing request count;
a fourth process of calculating, for each of the plurality of identifiers read from the content storage means in the third process, a score obtained by combining the sentence matching degree and the browsing importance; and
a fifth process of outputting the plurality of identifiers on the basis of the scores.

11. The computer-readable recording medium on which the information search program is recorded according to claim 10, wherein, in the third process, the number of search terms stored in the browsing history storage means that match the search term included in the search request is obtained as a search term usage count, and the browsing importance is the value obtained by dividing the browsing request count by the correction browsing request count, further divided by the search term usage count.

12. The computer-readable recording medium on which the information search program is recorded according to claim 10 or 11, wherein the display rank higher than the average value is the first rank.