JP2007183859A

JP2007183859A - Information retrieval system, information retrieval method, and information management apparatus

Info

Publication number: JP2007183859A
Application number: JP2006002286A
Authority: JP
Inventors: Koji Fujiwara; 浩次藤原
Original assignee: Konica Minolta Inc
Current assignee: Konica Minolta Inc
Priority date: 2006-01-10
Filing date: 2006-01-10
Publication date: 2007-07-19

Abstract

PROBLEM TO BE SOLVED: To provide an information retrieval system, an information retrieval method, and an information management apparatus that can refer to the using history of other user having a similar interest and concern without previously preparing data when a retrieval user retrieves desired information from information stored in a database, and can easily, efficiently, and accurately retrieve information of a further extended content base. SOLUTION: The using history of the retrieval user is compared with that of other user and the similarity is calculated based on the stored using history of each user, the relevance ratio between each information and retrieval condition presented by the retrieval user is calculated based on the using information of the similar users having high similarity, and information of high relevance ratio is retrieved. COPYRIGHT: (C)2007,JPO&INPIT

Description

複数のユーザがアクセスし、提示した検索条件に基づき、蓄積された情報からそれぞれのユーザの意図に沿った情報を検索するための情報検索システム、情報検索方法、及び情報管理装置に関する。 The present invention relates to an information search system, an information search method, and an information management device for searching information stored in accordance with each user's intention based on search conditions that are accessed and presented by a plurality of users.

インターネット利用が一般化し、Ｗｅｂのような大規模データベースから、一般ユーザが所望の情報を直接検索する機会が増えている。しかしＷｅｂ上で販売されている大量の商品データから好みに合う商品を探し出したり、Ｗｅｂ上または企業内文書データベースなどの大量の文書から目的の文書またはその関連の文書を探し出したりするのは、例え検索プログラムを使用したとしても大変な時間と労力が掛かる。 The use of the Internet has become common, and the opportunity for general users to directly search for desired information from a large-scale database such as the Web is increasing. However, searching for a product that suits your preference from a large amount of product data sold on the Web, or searching for a target document or related documents from a large amount of documents such as a Web or in-house document database is an example. Even using a search program takes a lot of time and effort.

通常、こうした大量の情報からの検索プログラムにはキーワード検索方式がよく用いられている。ところが、こうした検索効率の低さは、検索プログラムを使用する際にユーザが適切なキーワードを設定できない、あるいは設定キーワードが曖昧である、等に起因することも多い。キーワード検索は、文字列単位の検索であり、同義語まで範囲を広げたとしても、抜け落ちる情報は多い。それを防ぐために範囲を広げると所望の情報とは異なる情報が過大に入り込み、また絞り込みの必要が生じてくる。 Usually, a keyword search method is often used for such a search program from a large amount of information. However, such low search efficiency is often caused by the fact that the user cannot set an appropriate keyword when using a search program, or the set keyword is ambiguous. The keyword search is a character string search, and even if the range is expanded to synonyms, much information is missing. If the range is widened to prevent this, information different from the desired information will be excessive, and it will be necessary to narrow it down.

こういったキーワード方式の検索の課題に対して、検索者の望んでいる検索は、キーワードが一致するかどうかよりも、その情報の内容がジャンルとして一致あるいは類似しているかどうかであることから、カテゴリを重視する検索方法が開発されている（特許文献１参照）。 For these keyword-based search challenges, the search that the searcher wants is more about whether the content of the information matches or is similar to the genre, rather than whether the keywords match. Search methods that emphasize categories have been developed (see Patent Document 1).

特許文献１では、蓄積されている各情報に対し、予め内容が統計的に解析、分類され、対応するカテゴリ因子との関係がインデクス化されて記憶されており、検索時には、そのインデクスを参照し、検索要求に対応するカテゴリ因子と関連の強い情報を参照してくるという検索技術が提示されている。カテゴリを介在させて検索することにより、所望の情報と内容が近い情報を検索する精度を上げるというものである。 In Patent Document 1, the contents are statistically analyzed and classified in advance for each stored information, and the relationship with the corresponding category factor is indexed and stored. At the time of retrieval, the index is referred to. A search technique is proposed in which information strongly related to a category factor corresponding to a search request is referred to. By searching through categories, the accuracy of searching for information that is similar in content to the desired information is improved.

また、従来のキーワード一致ではなく、情報の内容を重視する検索方法として、その検索ユーザの履歴を利用し、そのユーザが過去にどういう情報を参照したかという点から類似の情報を見つけようという検索方法が開発されている（特許文献２参照）。 Also, as a search method that emphasizes the content of information rather than conventional keyword matching, a search that uses the search user's history to find similar information in terms of what information the user has referred to in the past A method has been developed (see Patent Document 2).

特許文献２では、検索ユーザの過去の閲覧サイトのアドレスを記憶しておき、インタネットのサイトを検索するに際し、過去に閲覧したサイトの情報内容を単語レベルに分解して統計的に出現頻度などを解析したデータを取得する。一方、記憶したアドレスから、リンクするサイトや関連サイトを調査し、データ解析し、先に取得した解析データと類似するサイトを見つけ出すという技術が提示されている。過去の参照情報を元に類似の情報を探すということで、所望の情報を検索する精度を上げるというものである。 In Patent Document 2, the address of a browsing user's past browsing site is stored, and when searching an Internet site, the information content of the site browsed in the past is decomposed into word levels, and the appearance frequency is statistically determined. Get the analyzed data. On the other hand, a technique has been proposed in which linked sites and related sites are investigated from stored addresses, data analysis is performed, and sites similar to the previously acquired analysis data are found. By searching for similar information based on past reference information, the accuracy of searching for desired information is improved.

また、興味の範囲や性別、年齢など、ユーザの属性が類似する他のユーザの参照結果を利用して、従来のキーワード検索の欠点を補おうという試みもなされてきている。例えばインターネットを通じての書籍購入の例では、ユーザが現在購入しようとしている本に対して、その本を購入した他のユーザが他にどんな本を購入しているかを提示することで、検索効率をアップするとともに潜在的なニーズの顕在化を図ろうという試みもなされている。 Attempts have also been made to make up for the shortcomings of conventional keyword searches using reference results of other users with similar user attributes, such as the range of interest, gender, and age. For example, in the case of purchasing books through the Internet, the search efficiency is improved by presenting what other users who have purchased the book purchase other books that the user is currently purchasing. At the same time, attempts have been made to realize potential needs.

また、特定の物品や情報だけでなく、それらに対するユーザの評価、すなわち嗜好をデータとして蓄積し、そのデータの他のユーザとの比較評価を元にして、興味の共通するような情報を検索しようという技術も開発されている（特許文献３）。 Also, not only specific articles and information but also user evaluations, that is, preferences are accumulated as data, and based on comparative evaluation with other users of the data, search for information that is of common interest A technique called “Patent Document 3” has also been developed.

特許文献３では、事前に各ユーザによる各情報に対する評価を蓄積記録しておき、検索時には、検索ユーザ評価とそれらのユーザ評価を比較し、興味の類似度を判別することで、検索時に類似度の重みを付けたフィルタリング処理を行い、同様の興味関心を持つ評価者が高く評価する情報が検索されるようにするという技術が提示されている。ユーザ評価を元に、嗜好の類似した他のユーザの参照結果を利用して、所望の情報を検索する精度を上げるというものである。
特開２００１−１８４３５８号公報特開２００１−１６００５６号公報特開平９−１９０４４３号公報 In Patent Document 3, evaluations for each piece of information by each user are accumulated and recorded in advance, and at the time of search, the search user evaluations are compared with those user evaluations, and the similarity of interest is discriminated. A technique has been proposed in which information that is highly evaluated by an evaluator having the same interest and interest is searched. Based on the user evaluation, the accuracy of searching for desired information is improved using the reference results of other users with similar preferences.
JP 2001-184358 A JP 2001-160056 A JP-A-9-190443

大量情報からの検索方法について、従来のキーワード検索方式は、上述のように様々な課題があった。特許文献１に記載の技術では、文字列ではなくカテゴリ因子を介在させ、内容ベースでの検索に近づいたものの、あくまで分類であり情報の内容自体を参照しているわけではない。特許文献２に記載の技術では、情報の内容を直接把握するのではなく、過去の利用履歴を参照することで、間接的に関心のある内容を介在させることができるが、あくまでそのユーザ自身が参照したもののデータに基づくものであり、発展性がない。書籍購入の例では、同じ本を購入した他のユーザの購入本を提示することで、関心の類似するであろう他のユーザを参照しているが、これもたまたま購入しようとしている本を介在させているだけで、十分な嗜好の類似性があるとはいえない。 As for the search method from a large amount of information, the conventional keyword search method has various problems as described above. In the technique described in Patent Document 1, although not a character string but a categorical factor is interposed to approach a content-based search, it is only a classification and does not refer to the information content itself. In the technique described in Patent Document 2, the content of the information is not directly grasped, but the content of interest can be interposed indirectly by referring to the past use history. It is based on the data of the reference, and has no expansibility. In the book purchase example, by presenting a purchase book of another user who purchased the same book, it refers to another user who may be interested in the same, but this also intervenes with the book that is about to be purchased It is not possible to say that there is sufficient preference similarity.

また特許文献３に記載の技術では、情報の内容に対するユーザ評価を介して興味関心の類似性を求め、同様のユーザ評価、すなわち同様の興味関心を有する他のユーザによる評価をフィルタリングに用いることで、より発展的に所望の情報を探索できるが、しかしこれも、予め各情報に対するユーザの評価を入力しておかなければならない。 Further, in the technique described in Patent Document 3, similarity of interest is obtained through user evaluation on the contents of information, and similar user evaluation, that is, evaluation by other users having similar interest is used for filtering. However, it is possible to search for desired information in a more expansive manner, but this also requires the user's evaluation for each information to be input in advance.

本発明の目的は、上記の課題を解決し、データベースに蓄積された情報から検索ユーザが所望の情報を検索するに当たり、事前にデータを準備する必要がなく、興味や関心の類似する他のユーザの利用履歴を参照することができ、簡単に、より広がりのある内容ベースの情報検索が、効率的に、且つ精度よく行える情報検索システム、情報検索方法、及び情報管理装置を提供することである。 The object of the present invention is to solve the above-mentioned problems, and when a search user searches for desired information from information stored in a database, there is no need to prepare data in advance, and other users who have similar interests and interests. It is possible to provide an information search system, an information search method, and an information management device that can refer to the usage history of the user and can easily and more broadly perform content-based information search efficiently and accurately. .

上記の課題を解決するために、本発明は以下の特徴を有するものである。 In order to solve the above problems, the present invention has the following features.

１．情報が蓄積されている情報記憶手段と、検索ユーザが所望の情報を検索するための検索条件を設定する検索条件設定手段と、前記検索条件設定手段により設定された検索条件に基づいて、前記情報記憶手段から情報の検索を行う情報検索手段と、前記情報検索手段により検索された情報から、前記検索ユーザに対して選択的に情報を利用させる情報提供手段と、前記情報提供手段に基づく情報の利用について、前記検索ユーザを含むユーザごとの利用履歴を蓄積する利用履歴記憶手段と、前記利用履歴記憶手段に蓄積された前記検索ユーザを含むユーザごとの利用履歴を管理する利用履歴管理手段と、を有する情報検索システムにおいて、前記利用履歴管理手段は、前記利用履歴記憶手段に蓄積された、前記検索ユーザの利用履歴と、前記検索ユーザ以外の他のユーザの利用履歴との類似度を算出する類似度算出手段と、前記類似度算出手段により算出された類似度に基づいて、前記検索ユーザと利用履歴の類似した前記検索ユーザ以外の他のユーザを検索する類似ユーザ検索手段と、を有し、前記情報検索手段は、前記類似ユーザ検索手段により検索された前記利用履歴の類似した前記検索ユーザ以外の他のユーザの情報利用履歴に基づいて、前記情報記憶手段に蓄積された各情報と前記検索条件との関連度を算出する関連度算出手段と、前記関連度算出手段により算出された関連度に基づいて、前記情報記憶手段から関連する情報を検索する関連情報検索手段と、を有する、ことを特徴とする情報検索システム。 1. Information storage means for storing information, search condition setting means for setting search conditions for a search user to search for desired information, and the information based on the search conditions set by the search condition setting means An information search unit that searches for information from a storage unit, an information providing unit that selectively uses information from the information searched by the information search unit, and information based on the information providing unit For usage, usage history storage means for storing a usage history for each user including the search user, usage history management means for managing a usage history for each user including the search user stored in the usage history storage means, The usage history management means includes the usage history of the search user stored in the usage history storage means, and the search. Similarity calculation means for calculating the similarity with the use history of other users other than the user, and other than the search user whose use history is similar to the search user based on the similarity calculated by the similarity calculation means Similar user search means for searching for other users, wherein the information search means is information use history of other users other than the search user similar to the use history searched by the similar user search means. Based on the relevance calculation means for calculating the relevance between the information stored in the information storage means and the search condition, and based on the relevance calculated by the relevance calculation means, the information storage means And an associated information retrieval means for retrieving associated information from the information retrieval system.

２．前記情報提供手段は、選択された情報の利用形態として、前記検索ユーザの指定する所定の媒体への情報の複写、或いは前記情報記憶手段とは異なる別の情報記憶手段への情報の複写、を前記検索ユーザに可能にさせる、ことを特徴とする１に記載の情報検索システム。 2. The information providing means may copy information to a predetermined medium designated by the search user, or copy information to another information storage means different from the information storage means, as a usage form of the selected information. 2. The information search system according to 1, wherein the search user is allowed.

３．前記情報提供手段に基づく前記検索ユーザによる情報の利用に対して、前記検索ユーザに課金する課金手段を有する、ことを特徴とする１または２に記載の情報検索システム。 3. 3. The information search system according to claim 1 or 2, further comprising charging means for charging the search user for use of information by the search user based on the information providing means.

４．前記利用履歴記憶手段に記憶された前記検索ユーザの利用履歴が所定の回数以下の場合は、前記利用履歴管理手段は、前記類似度算出手段による類似度の算出と、前記類似ユーザ検索手段による前記類似度に基づいた類似ユーザの検索を行わず、所定の他のユーザを類似度の高い類似ユーザとして設定する、ことを特徴とする１乃至３の何れか１項に記載の情報検索システム。 4). When the usage history of the search user stored in the usage history storage means is less than or equal to a predetermined number of times, the usage history management means calculates the similarity by the similarity calculation means and the similarity user search means 4. The information search system according to any one of claims 1 to 3, wherein a search for similar users based on similarity is not performed and a predetermined other user is set as a similar user with high similarity.

５．前記利用履歴記憶手段に記憶された前記検索ユーザの利用履歴が所定の回数以下の場合は、前記利用履歴管理手段は、前記検索ユーザに対する、類似度算出のための仮想の利用履歴を設定し、前記類似度算出手段は、設定された前記仮想の利用履歴に基づいて類似度を算出する、ことを特徴とする１乃至３の何れか１項に記載の情報検索システム。 5. When the usage history of the search user stored in the usage history storage means is a predetermined number or less, the usage history management means sets a virtual usage history for similarity calculation for the search user, The information search system according to any one of claims 1 to 3, wherein the similarity calculation unit calculates the similarity based on the set virtual usage history.

６．前記利用履歴記憶手段には、仮想のユーザの仮想の利用履歴が記憶されている、ことを特徴とする１乃至５の何れか１項に記載の情報検索システム。 6). 6. The information retrieval system according to any one of 1 to 5, wherein the usage history storage unit stores a virtual usage history of a virtual user.

７．前記類似度算出手段は、所定の期間以上利用履歴のない他のユーザは除外して、利用履歴の類似度を算出する、ことを特徴とする１乃至６の何れか１項に記載の情報検索システム。 7). 7. The information search according to any one of claims 1 to 6, wherein the similarity calculation means calculates a usage history similarity by excluding other users who have no usage history for a predetermined period of time. system.

８．前記関連度算出手段は、前記情報記憶手段に情報が記憶された日時に基づいて、関連度を算出する、ことを特徴とする１乃至７の何れか１項に記載の情報検索システム。 8). The information search system according to any one of claims 1 to 7, wherein the relevance calculation means calculates relevance based on a date and time when information is stored in the information storage means.

９．前記類似度算出手段は、利用履歴における利用日時に基づいて、類似度を算出する、ことを特徴とする１乃至８の何れか１項に記載の情報検索システム。 9. 9. The information search system according to any one of 1 to 8, wherein the similarity calculation unit calculates a similarity based on a use date and time in a use history.

１０．検索ユーザが所望の情報を検索するための検索条件を設定する検索条件設定工程と、前記検索条件設定工程において設定された検索条件に基づいて、情報が蓄積されている情報記憶手段から情報の検索を行う情報検索工程と、前記情報検索工程において検索された情報から、前記検索ユーザに対して選択的に情報を利用させる情報提供工程と、前記情報提供工程に基づく情報の利用について、前記検索ユーザを含むユーザごとの利用履歴を蓄積する利用履歴記録工程と、前記利用履歴記録工程において蓄積された前記検索ユーザを含むユーザごとの利用履歴を管理する利用履歴管理工程と、を備える情報検索方法において、前記利用履歴管理工程は、前記利用履歴記録工程において蓄積された、前記検索ユーザの利用履歴と、前記検索ユーザ以外の他のユーザの利用履歴との類似度を算出する類似度算出工程と、前記類似度算出工程において算出された類似度に基づいて、前記検索ユーザと利用履歴の類似した前記検索ユーザ以外の他のユーザを検索する類似ユーザ検索工程と、を備え、前記情報検索工程は、前記類似ユーザ検索工程において検索された前記利用履歴の類似した前記検索ユーザ以外の他のユーザの情報利用履歴に基づいて、前記情報記憶手段に蓄積された各情報と前記検索条件との関連度を算出する関連度算出工程と、前記関連度算出工程において算出された関連度に基づいて、前記情報記憶手段から関連する情報を検索する関連情報検索工程と、を備える、ことを特徴とする情報検索方法。 10. A search condition setting step for setting a search condition for a search user to search for desired information, and information search from information storage means in which information is stored based on the search condition set in the search condition setting step An information search step for performing search, an information providing step for selectively using information from the information searched in the information search step, and use of information based on the information providing step. In an information search method comprising: a usage history recording step for storing a usage history for each user including a usage history management step for managing a usage history for each user including the search user stored in the usage history recording step The use history management step includes the search user's use history accumulated in the use history recording step, and the search user. Based on the similarity calculated in the similarity calculation step for calculating the similarity with the usage history of other users outside, and other than the search user whose usage history is similar to the search user based on the similarity calculated in the similarity calculation step A similar user search step of searching for another user, wherein the information search step is based on information use history of other users other than the search user similar to the use history searched in the similar user search step. A relevance calculation step for calculating a relevance between each piece of information stored in the information storage means and the search condition, and a relevance from the information storage means based on the relevance calculated in the relevance calculation step. And a related information search step for searching for information to be performed.

１１．前記情報提供工程では、選択された情報の利用形態として、前記検索ユーザの指定する所定の媒体への情報の複写、或いは前記情報記憶手段とは異なる別の情報記憶手段への情報の複写、が前記検索ユーザに可能になる、ことを特徴とする１０に記載の情報検索方法。 11. In the information providing step, as a use form of the selected information, copying of information to a predetermined medium designated by the search user or copying of information to another information storage means different from the information storage means is performed. 11. The information search method according to 10, wherein the search user is enabled.

１２．前記情報提供工程に基づく前記検索ユーザによる情報の利用に対して、前記検索ユーザに課金する課金工程を備える、ことを特徴とする１０または１１に記載の情報検索方法。 12 The information search method according to claim 10 or 11, further comprising a charging step of charging the search user for use of information by the search user based on the information providing step.

１３．前記利用履歴記録工程において記録された前記検索ユーザの利用履歴が所定の回数以下の場合は、前記利用履歴管理工程では、前記類似度算出工程における類似度の算出と、前記類似ユーザ検索工程における前記類似度に基づいた類似ユーザの検索が行われず、所定の他のユーザが類似度の高い類似ユーザとして設定される、ことを特徴とする１０乃至１２の何れか１項に記載の情報検索方法。 13. When the usage history of the search user recorded in the usage history recording step is equal to or less than a predetermined number of times, the usage history management step calculates the similarity in the similarity calculation step and the similarity user search step. 13. The information search method according to any one of 10 to 12, wherein a search for similar users based on similarity is not performed, and a predetermined other user is set as a similar user with high similarity.

１４．前記利用履歴記録工程において記録された前記検索ユーザの利用履歴が所定の回数以下の場合は、前記利用履歴管理工程では、前記検索ユーザに対する、類似度算出のための仮想の利用履歴が設定され、前記類似度算出工程では、設定された前記仮想の利用履歴に基づいて類似度が算出される、ことを特徴とする１０乃至１２の何れか１項に記載の情報検索方法。 14 When the usage history of the search user recorded in the usage history recording step is less than or equal to a predetermined number of times, the usage history management step sets a virtual usage history for calculating the similarity for the search user, The information search method according to any one of claims 10 to 12, wherein, in the similarity calculation step, the similarity is calculated based on the set virtual use history.

１５．前記利用履歴記録工程において、仮想のユーザの仮想の利用履歴が記録されている、ことを特徴とする１０乃至１４の何れか１項に記載の情報検索方法。 15. 15. The information search method according to any one of 10 to 14, wherein a virtual usage history of a virtual user is recorded in the usage history recording step.

１６．前記類似度算出工程では、所定の期間以上利用履歴のない他のユーザを除外して、利用履歴の類似度が算出される、ことを特徴とする１０乃至１５の何れか１項に記載の情報検索方法。 16. The information according to any one of claims 10 to 15, wherein, in the similarity calculation step, the similarity of the usage history is calculated by excluding other users who have no usage history for a predetermined period or more. retrieval method.

１７．前記関連度算出工程では、前記情報記憶手段に情報が記憶された日時に基づいて、関連度が算出される、ことを特徴とする１０乃至１６の何れか１項に記載の情報検索方法。 17. 17. The information search method according to any one of 10 to 16, wherein in the relevance level calculating step, the relevance level is calculated based on a date and time when information is stored in the information storage unit.

１８．前記類似度算出工程では、利用履歴における利用日時に基づいて、類似度が算出される、ことを特徴とする１０乃至１７の何れか１項に記載の情報検索方法。 18. 18. The information search method according to any one of 10 to 17, wherein in the similarity calculation step, the similarity is calculated based on a use date and time in the use history.

１９．検索ユーザから所望の情報を検索するための検索条件を取得し、設定する検索条件設定手段と、前記検索ユーザに対して、検索された情報を選択的に利用させる情報提供手段と、前記情報提供手段に基づく情報の利用について、前記検索ユーザを含むユーザごとの利用履歴を蓄積する利用履歴記憶手段と、を有する情報管理装置において、前記利用履歴記憶手段に蓄積された、前記検索ユーザの利用履歴と、前記検索ユーザ以外の他のユーザの利用履歴との類似度を算出する類似度算出手段と、前記類似度算出手段により算出された類似度に基づいて、前記検索ユーザと利用履歴の類似した前記検索ユーザ以外の他のユーザを検索する類似ユーザ検索手段と、を有し、情報が蓄積されている情報記憶手段に対して、前記検索条件設定手段により設定された検索条件と、前記類似ユーザ検索手段により検索された前記利用履歴の類似した前記検索ユーザ以外の他のユーザの情報利用履歴に基づいて、情報検索を指示し、その検索結果を取得する、ことを特徴とする情報管理装置。 19. Search condition setting means for acquiring and setting search conditions for searching for desired information from a search user, information providing means for selectively using the searched information to the search user, and the information provision Use history storage means for accumulating use history for each user including the search user for use of information based on the means, in the information management apparatus, the use history of the search user stored in the use history storage means A similarity calculation unit that calculates a similarity between the usage history of other users other than the search user, and a similarity between the search user and the usage history based on the similarity calculated by the similarity calculation unit Similar user search means for searching for other users other than the search user, and for the information storage means for storing information, the search condition setting means Based on the set search condition and the information use history of other users other than the search user similar to the use history searched by the similar user search means, the information search is instructed and the search result is acquired. An information management apparatus characterized by that.

本発明によれば、データベースに蓄積された情報から検索ユーザが所望の情報を検索するに当たり、蓄積された各ユーザごとの利用履歴から、検索ユーザと他のユーザの利用履歴を比較して類似度を算出し、その類似度の高い類似ユーザを求めることができる。またその利用情報に基づき、各情報と検索ユーザの提示した検索条件との関連度を算出し、関連度の高い情報を検索することができる。 According to the present invention, when a search user searches for desired information from information stored in a database, the use history of each stored user is compared with the use history of the search user and other users. And a similar user having a high similarity can be obtained. Further, based on the usage information, the degree of association between each piece of information and the retrieval condition presented by the retrieval user can be calculated, and information with a high degree of association can be retrieved.

これにより、事前にデータを準備する必要なく、興味や関心の類似する他のユーザの利用履歴を参照することができ、簡単に、より広がりのある、内容をベースとした情報検索が、効率的に、且つ精度よく行える情報検索システム、情報検索方法、及び情報管理装置を提供することができる。 This makes it possible to refer to the usage history of other users with similar interests without having to prepare data in advance, making it easier and more efficient to search information based on content more easily. In addition, it is possible to provide an information search system, an information search method, and an information management device that can be performed with high accuracy.

以下、図面を参照して、本発明の実施形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

（情報検索システム１の構成）
まず、図１を参照して本発明の実施形態に係る情報検索システムの構成について説明する。図１には、情報検索システム１の構成例を示す。 (Configuration of information retrieval system 1)
First, the configuration of an information search system according to an embodiment of the present invention will be described with reference to FIG. FIG. 1 shows a configuration example of the information search system 1.

このシステム構成例では、検索され、利用される情報はコンテンツと呼称され、多くのコンテンツを有する情報検索システム１に、多数のユーザがアクセスする。各ユーザは検索条件を提示して、検索されたコンテンツの利用を要求し、システム管理者からの課金に応ずることによってコンテンツを利用するものである。 In this system configuration example, information to be searched for and used is referred to as content, and many users access the information search system 1 having a large amount of content. Each user presents a search condition, requests use of the searched content, and uses the content by responding to a charge from the system administrator.

コンテンツは、文書以外に音楽データや画像、動画など、形態は任意である。コンテンツの形態に応じて利用の形態も異なる。また商品を扱う場合には、コンテンツは商品に関する情報であるが、ユーザは課金に応ずることによって商品自体を入手する。 The content can be in any form other than documents, such as music data, images, and moving images. Depending on the form of content, the form of use is also different. When dealing with merchandise, the content is information about the merchandise, but the user obtains the merchandise by responding to the charge.

図１において、１１はコンテンツサーバであり、情報としてのコンテンツを記憶し、要求により検索処理を行い、ネットワークを通じてコンテンツの提供を行う。１２はユーザ情報管理サーバであり、コンテンツを利用するユーザとコンテンツの利用履歴に関する情報とを記憶し、管理する。１３は課金サーバであり、ユーザによるコンテンツの利用に対して、課金、及び決済の処理を行う。１５はユーザの端末であり、ユーザがコンテンツ利用のための入力操作、またコンテンツ利用のための表示、印刷などを行う。１６、１７は他のユーザの端末である。 In FIG. 1, 11 is a content server, which stores content as information, performs search processing upon request, and provides content through a network. A user information management server 12 stores and manages a user who uses the content and information regarding the usage history of the content. A billing server 13 performs billing and settlement processing for the use of content by the user. A user terminal 15 is used by the user to perform input operations for content use, display for content use, printing, and the like. Reference numerals 16 and 17 denote terminals of other users.

情報検索システム１としてコンテンツ提供事業者が設置したコンテンツサーバ１１、ユーザ情報管理サーバ、そして課金サーバは、ネットワーク１４を介して、ユーザの端末１５、あるいは１６、１７などと接続しており、各ユーザが各ユーザ端末１５などからネットワークを介してアクセスすることにより、コンテンツの検索、利用が行える。 The content server 11, user information management server, and billing server installed by the content provider as the information search system 1 are connected to the user terminal 15, 16 or 17 via the network 14, and each user Can be searched and used by accessing each user terminal 15 via the network.

コンテンツサーバ１１は、情報としてのコンテンツを記憶し、情報記憶手段として機能するコンテンツデータベース（コンテンツＤＢ）１１１と、コンテンツＤＢから情報を検索処理し、情報検索手段として機能する検索処理部１１２と、ネットワークを介して、検索と利用に関する要求や指示を送受信するためのネットワーク接続部１１３とからなる。 The content server 11 stores content as information, a content database (content DB) 111 that functions as information storage means, a search processing unit 112 that functions as information search means, searches information from the content DB, and a network And a network connection unit 113 for transmitting and receiving requests and instructions related to search and use.

また検索処理部１１２は、検索キーなどの検索条件を受けて、各情報コンテンツとの関連度を算出する関連度算出部１１２ａと、その関連度に基づいて関連する情報コンテンツを検索し、抽出する関連情報検索部１１２ｂとを含む。従って、関連度算出部１１２ａは関連度算出手段として機能し、関連情報検索部１１２ｂは関連情報検索手段として機能する。 In addition, the search processing unit 112 receives a search condition such as a search key, and searches for and extracts a relevance level calculation unit 112a that calculates a relevance level with each information content, and a related information content based on the relevance level. And a related information search unit 112b. Accordingly, the relevance calculation unit 112a functions as a relevance calculation unit, and the related information search unit 112b functions as a related information search unit.

ユーザ情報管理サーバ１２は、情報管理装置であり、ユーザーに関する情報を記憶し、情報コンテンツの利用履歴記憶手段として機能するユーザ管理データベース（ユーザ管理ＤＢ）１２１と、ユーザの情報コンテンツ利用履歴を管理し、利用履歴管理手段として機能する利用履歴管理部１２２と、ユーザを認証し、ユーザに対するコンテンツ利用の制御と管理を行うユーザ管理部１２３と、ネットワークを介して、検索と利用に関する要求や指示を送受信するためのネットワーク接続部１２４とからなる。 The user information management server 12 is an information management apparatus that stores information about users and manages a user management database (user management DB) 121 that functions as a usage history storage unit for information contents and a user information content usage history. , A usage history management unit 122 that functions as a usage history management unit, a user management unit 123 that authenticates a user and controls and manages content usage for the user, and transmits and receives requests and instructions related to search and usage via a network And a network connection unit 124.

利用履歴管理部１２２は、検索ユーザの利用履歴を参照し、他のユーザの利用履歴との類似度を算出する類似度算出部１２２ａと、その類似度に基づいて利用履歴の類似する他のユーザを検索し、抽出する類似ユーザ検索部１２２ｂとを含む。従って、類似度算出部１２２ａは類似度算出手段として機能し、類似ユーザ検索部１２２ｂは類似ユーザ検索手段として機能する。 The usage history management unit 122 refers to the usage history of the search user, calculates the similarity with other users' usage history, and another user whose usage history is similar based on the similarity And a similar user search unit 122b for searching for and extracting. Therefore, the similarity calculation unit 122a functions as a similarity calculation unit, and the similar user search unit 122b functions as a similar user search unit.

ユーザ管理部１２３は、後述のユーザ端末１５などと協働して、ユーザによるアクセスを受け付け、ユーザに関する情報や検索キーなどの検索に関する条件を取得する。従って検索条件設定手段として機能する。またユーザ管理部１２３は、ユーザのコンテンツ利用要求に対して、処理を行い、コンテンツ利用の許可を出力する。従って、情報提供手段としても機能する。 The user management unit 123, in cooperation with the user terminal 15 described later, accepts access by the user and acquires conditions related to the search such as information about the user and a search key. Therefore, it functions as search condition setting means. In addition, the user management unit 123 performs processing in response to the user's content use request and outputs a content use permission. Therefore, it also functions as information providing means.

課金サーバ１３は、課金手段として機能し、ユーザのコンテンツ利用要求に対して、課金処理を行う課金処理部１３２と、課金処理に対するユーザの応答に対して決済処理を行う決済処理部１３１と、課金に関する要求や指示を送受信するためのネットワーク接続部１３３とからなる。 The billing server 13 functions as a billing unit, and billing processing unit 132 that performs billing processing in response to a user's content usage request, settlement processing unit 131 that performs settlement processing in response to a user response to billing processing, and billing And a network connection unit 133 for transmitting and receiving requests and instructions related to.

ユーザ端末１５は、ネットワークに接続された、例えばパーソナルコンピュータなどの端末であり、情報検索システム１にアクセスするためのネットワーク接続部１５５、入力操作を行う入力部１５１、情報を表示するための表示部１５２、情報を媒体にプリントするための印刷部１５３、情報を記憶するための記憶部１５４、そしてそれらの動作を制御する制御部１５６などを備えている。 The user terminal 15 is a terminal connected to a network, such as a personal computer, for example, a network connection unit 155 for accessing the information search system 1, an input unit 151 for performing an input operation, and a display unit for displaying information. 152, a printing unit 153 for printing information on a medium, a storage unit 154 for storing information, and a control unit 156 for controlling operations thereof.

（情報検索システム１の動作）
図２は本実施形態に係る情報検索システム１の動作を示すシーケンス図である。図２を参照して情報検索システム１の動作について説明する。また適宜、図１や図３も参照する。 (Operation of information retrieval system 1)
FIG. 2 is a sequence diagram showing the operation of the information search system 1 according to this embodiment. The operation of the information search system 1 will be described with reference to FIG. Reference is also made to FIGS. 1 and 3 as appropriate.

図２はユーザ端末１５、コンテンツサーバ１１、課金サーバ１３、そしてユーザ情報管理サーバ１２の各動作について、図の上から下へ時系列的に動作の順序と流れを示したものである。そのシーケンスに沿って説明する。 FIG. 2 shows the order and flow of the operations of the user terminal 15, the content server 11, the billing server 13, and the user information management server 12 in time series from the top to the bottom of the drawing. It demonstrates along the sequence.

ユーザがユーザ端末１５から情報検索システム１にアクセスして、たとえば、ユーザ情報、検索情報２１１としてユーザＩＤ番号などユーザを特定する情報と検索キーなど所望の情報コンテンツを検索するための条件をユーザ情報管理サーバ１２に送信する。 When the user accesses the information search system 1 from the user terminal 15, for example, the user information, information for specifying the user such as the user ID number as the search information 211, and conditions for searching for the desired information content such as the search key are stored It transmits to the management server 12.

ユーザ情報管理サーバ１２では、ユーザ管理部１２３においてユーザ認証２２１が行われ、利用履歴管理部１２２により該当ユーザの利用履歴参照２２２が行われる。いずれもユーザ管理ＤＢ１２１が参照される。 In the user information management server 12, user authentication 221 is performed by the user management unit 123, and the usage history reference 222 of the corresponding user is performed by the usage history management unit 122. In either case, the user management DB 121 is referred to.

図３（ａ）には、ユーザ管理ＤＢ１２１の内部データ構造を示す。ユーザ認証２２１はユーザデータ４１（ＩＤ番号やパスワード、その他の個人情報が含まれる）を参照する。後述の課金処理などもユーザデータ４１を参照する。また利用履歴参照２２２では、利用履歴４２を参照する。利用履歴４２には、ユーザＩＤ以外に利用した各コンテンツのＩＤ番号と、各コンテンツに対する利用日付、課金情報、利用形態などの情報が含まれる。これらの利用履歴のデータを用いて、以下の類似ユーザ検索処理が行われる。 FIG. 3A shows the internal data structure of the user management DB 121. The user authentication 221 refers to user data 41 (including an ID number, a password, and other personal information). The user data 41 is also referred to for billing processing described later. In the usage history reference 222, the usage history 42 is referred to. The usage history 42 includes the ID number of each content used in addition to the user ID, and information such as the usage date, billing information, and usage mode for each content. The following similar user search process is performed using these usage history data.

引き続いて利用履歴管理部１２２で類似ユーザ検索２２３が行われ、他のユーザの利用履歴と比較することで、類似度の高いＮ人のユーザ（以後、Ｎユーザともいう）の抽出、すなわちＮユーザ抽出２２４を行い、Ｎユーザ利用コンテンツ情報２２５をコンテンツサーバ１１に送信する。類似ユーザ検索処理については後で詳細に説明する。 Subsequently, a similar user search 223 is performed in the usage history management unit 122, and by comparing with the usage history of other users, extraction of N users (hereinafter also referred to as N users) with high similarity, that is, N users Extraction 224 is performed, and N user use content information 225 is transmitted to the content server 11. The similar user search process will be described later in detail.

コンテンツサーバ１１では、検索処理部１１２がＮユーザ利用コンテンツ情報２２５に基づき重み設定２３１を行った上で、ユーザ提示の検索条件を用いて、キーワードを介した関連コンテンツ検索２３２により、関連度の高いＭコンテンツ抽出２３３を行い、ユーザに提示すべくユーザ端末１５に送信する。関連コンテンツ検索処理については、コンテンツＤＢ１１１が参照される。 In the content server 11, after the search processing unit 112 performs the weight setting 231 based on the N user use content information 225, the related content search 232 using the search condition presented by the user has a high degree of relevance. M content extraction 233 is performed and transmitted to the user terminal 15 for presentation to the user. For related content search processing, the content DB 111 is referred to.

図３（ｂ）には、コンテンツＤＢ１１１の内部データ構造を示す。ここではコンテンツとして書籍を例にとって示している。書籍コンテンツ３１は、情報としての書籍そのものの内容をデータ化したものであり、ユーザに提供するコンテンツ本体である。また検索用メタデータ３２は、コンテンツに関する情報であり、検索のために用いるデータである。具体的には、書籍ごとのＩＤ番号、タイトル、著者名、出版社発行年月日、分類、キーワードなどであり、ユーザの指定した検索条件に従い、これらのデータを用いて検索する。例えば検索キー（複数の単語からなる）がユーザ指定されれば、書籍の内容に従って付加されたキーワードと照合し検索処理を行う。 FIG. 3B shows the internal data structure of the content DB 111. Here, a book is shown as an example of content. The book content 31 is obtained by converting the content of the book itself as information into data, and is a content body provided to the user. The search metadata 32 is information relating to content and is data used for search. Specifically, the ID number, title, author name, publisher issue date, classification, keyword, and the like for each book are searched according to search conditions designated by the user. For example, if a search key (consisting of a plurality of words) is designated by the user, the search process is performed by matching with a keyword added according to the contents of the book.

関連コンテンツ検索処理については後で詳細に説明する。 The related content search process will be described later in detail.

ユーザ端末１５において、関連度の高いＭコンテンツが表示部１５２に表示され、ユーザは入力部１５１を用いて所望のコンテンツを選択し、コンテンツ指定、利用要求２１２をユーザ情報管理サーバ１２に送信する。 In the user terminal 15, M content having a high degree of relevance is displayed on the display unit 152, and the user selects a desired content using the input unit 151, and transmits a content designation / use request 212 to the user information management server 12.

ユーザ情報管理サーバ１２では、ユーザ管理部１２３がユーザによる利用要求を受けて課金処理要求２２６を課金サーバ１３に送信する。 In the user information management server 12, the user management unit 123 receives a usage request from the user and transmits a charging processing request 226 to the charging server 13.

課金サーバ１３では、課金処理部１３２において課金処理２４１が施され、ユーザ端末１５に課金内容が送信される。 In the billing server 13, billing processing 241 is performed in the billing processing unit 132, and billing details are transmitted to the user terminal 15.

ユーザ端末１５では、課金内容が表示部１５２に表示され、ユーザは入力部１５１を用いて、支払いの許諾と支払いのための情報を支払い処理２１３として課金サーバ１３に送信する。 In the user terminal 15, the charging contents are displayed on the display unit 152, and the user transmits the payment permission and information for payment to the charging server 13 as the payment process 213 using the input unit 151.

課金サーバ１３では、支払い処理２１３を受けて課金に対する決済処理２４２を行い、ユーザ情報管理サーバ１２に決済処理情報を送信する。 The billing server 13 receives the payment process 213, performs a settlement process 242 for the billing, and transmits the settlement process information to the user information management server 12.

ユーザ情報管理サーバ１２では、ユーザ管理部１２３が課金決済２４２を受けてコンテンツ提供許可２２７をコンテンツサーバ１１に送信する。 In the user information management server 12, the user management unit 123 receives the billing settlement 242 and transmits a content provision permission 227 to the content server 11.

コンテンツサーバ１１では、コンテンツ提供許可２２７を受けて、コンテンツをユーザに提供可能にし、ユーザ端末１５にコンテンツ提供２３４を送信する。 In response to the content provision permission 227, the content server 11 enables the content to be provided to the user, and transmits the content provision 234 to the user terminal 15.

ユーザ端末１５では、コンテンツ提供２３４が表示部１５２に表示され、ユーザは入力部１５１を用いて提供コンテンツを利用する。具体的には、コンテンツをダウンロードして記憶部１５４に保存する、印刷部１５３で印刷する、あるいは表示部１５２に表示するなどの利用形態がある。 In the user terminal 15, the content provision 234 is displayed on the display unit 152, and the user uses the provided content using the input unit 151. Specifically, there are usage forms such as downloading contents and storing them in the storage unit 154, printing them with the printing unit 153, or displaying them on the display unit 152.

一方、ユーザ情報管理サーバ１２では、利用履歴管理部１２２がユーザ管理ＤＢ１２１にアクセスし、該当ユーザの利用履歴４２を更新する。すなわち、新たに利用したコンテンツのＩＤ番号と日付、課金情報、利用形態などの情報が追加される。 On the other hand, in the user information management server 12, the usage history management unit 122 accesses the user management DB 121 and updates the usage history 42 of the corresponding user. That is, information such as the ID number and date of newly used content, billing information, and usage form is added.

（全体の処理フロー）
図４を用いて、情報検索システム１における情報検索方法、すなわちコンテンツの検索と利用の流れ全体の処理フローを説明する。図４は、コンテンツの検索と利用の流れを示すフローチャートである。 (Overall processing flow)
The information search method in the information search system 1, that is, the processing flow of the entire content search and use flow will be described with reference to FIG. FIG. 4 is a flowchart showing the flow of content search and use.

まず、ステップＳ１で、ユーザがユーザ端末１５から情報検索システム１にアクセスして、ユーザにより検索キーなど所望の情報コンテンツを検索するための条件が入力されると、ユーザ情報管理サーバ１２のユーザ管理部１２３は、ユーザＩＤ番号などユーザを特定する情報とともに検索キーなど所望の情報コンテンツを検索するための条件を取得し、設定する。すなわち、ステップＳ１は検索条件設定工程として機能する。 First, in step S1, when a user accesses the information search system 1 from the user terminal 15 and a condition for searching for desired information content such as a search key is input by the user, user management of the user information management server 12 is performed. The unit 123 acquires and sets a condition for searching for desired information content such as a search key together with information specifying the user such as a user ID number. That is, step S1 functions as a search condition setting step.

次にステップＳ２で、ユーザ情報管理サーバ１２のユーザ管理部１２３は、ユーザのＩＤ番号やパスワードを元に、ユーザ管理ＤＢ１２１のユーザデータ４１を参照し、ユーザ認証を行う。合わせて、利用履歴管理部１２２はユーザの利用履歴４２を参照し、次に行う類似ユーザ検索処理の基準とする。 Next, in step S2, the user management unit 123 of the user information management server 12 refers to the user data 41 of the user management DB 121 based on the user ID number and password, and performs user authentication. In addition, the usage history management unit 122 refers to the user usage history 42 and uses it as a reference for the similar user search process to be performed next.

ステップＳ３で、利用履歴管理部１２２はユーザ管理ＤＢ１２１を参照し、検索要求ユーザの利用履歴に対して他のユーザの利用履歴の類似度を算出し、類似ユーザの検出処理を行う。すなわち、ステップＳ３は利用履歴管理工程として機能する。このステップＳ３の工程の詳細については、図５の類似ユーザ検索処理のフローチャートを用いて後述する。 In step S 3, the usage history management unit 122 refers to the user management DB 121, calculates the similarity of the usage history of other users with respect to the usage history of the search requesting user, and performs a similar user detection process. That is, step S3 functions as a usage history management process. Details of the step S3 will be described later with reference to the flowchart of the similar user search process of FIG.

また本実施形態では、検索の度にこのステップＳ３の工程を実行しているが、定期的に、所定の時間ごとに実行し、各ユーザについて利用履歴の類似度が高い他のユーザを求めて、それを保持しておいてもよい。 In this embodiment, the process of step S3 is executed every time a search is performed. However, the process is periodically executed every predetermined time, and another user having a high usage history similarity is obtained for each user. You may keep it.

ステップＳ４で、コンテンツサーバ１１は類似度の高いＮユーザの利用コンテンツ情報を受け取り、検索処理部１１２がそれに基づく重みを設定し、コンテンツＤＢ１１１を参照し、検索キーと各コンテンツの関連度を算出し、関連コンテンツ検索処理を行う。すなわち、ステップＳ４は情報検索工程として機能する。このステップＳ４の工程の詳細については、図７の関連コンテンツ検索処理のフローチャートを用いて後述する。 In step S4, the content server 11 receives used content information of N users with high similarity, the search processing unit 112 sets weights based on the information, refers to the content DB 111, and calculates the degree of association between the search key and each content. Perform related content search processing. That is, step S4 functions as an information search process. Details of the step S4 will be described later with reference to the flowchart of related content search processing of FIG.

ステップＳ５では、関連度の高いＭコンテンツがユーザ端末１５の表示部１５２に表示され、ユーザは入力部１５１を用いて所望のコンテンツを選択、指定し、ユーザ管理サーバに利用要求を行う。ユーザ管理部１２３はユーザによる利用要求を受けて課金処理要求を課金サーバ１３に送信する。 In step S5, M content having a high degree of relevance is displayed on the display unit 152 of the user terminal 15, and the user selects and designates desired content using the input unit 151, and makes a use request to the user management server. The user management unit 123 receives a usage request from the user and transmits a charging process request to the charging server 13.

ステップＳ６では、課金サーバ１３の課金処理部１３２において課金処理が施され、ユーザ端末１５に課金情報が送信される。ステップＳ７では、課金内容がユーザ端末１５の表示部１５２に表示され、ユーザにより、入力部１５１を用いて、支払いの許諾と支払いのための情報が支払い処理として課金サーバ１３に送信されると、課金サーバ１３では、その支払い処理を受けて課金決済処理を行い、ユーザ情報管理サーバ１２に決済処理情報を送信する。すなわちステップＳ６とステップＳ７は課金工程として機能する。 In step S 6, billing processing is performed in the billing processing unit 132 of the billing server 13, and billing information is transmitted to the user terminal 15. In step S7, the contents of the charge are displayed on the display unit 152 of the user terminal 15, and when the user uses the input unit 151 to send the payment permission and information for payment to the charge server 13 as a payment process, The billing server 13 performs billing settlement processing in response to the payment processing, and transmits settlement processing information to the user information management server 12. That is, step S6 and step S7 function as a charging process.

ステップＳ８では、ユーザ情報管理サーバ１２のユーザ管理部１２３が課金決済情報を受けてコンテンツ提供許可をコンテンツサーバ１１に送信し、コンテンツサーバ１１は、コンテンツ提供許可を受けて、コンテンツをユーザに提供可能にし、ユーザ端末１５に該当コンテンツの提供を送信する。すなわち、ステップＳ８は情報提供工程として機能する。ユーザ端末１５では、コンテンツ提供が表示部１５２に表示され、ユーザは入力部１５１を用いて提供コンテンツを表示部１５２に表示する、印刷部１５３で印刷する、ダウンロードして記憶部１５４に保存する、などの利用形態で利用する。 In step S8, the user management unit 123 of the user information management server 12 receives the billing settlement information and transmits a content provision permission to the content server 11, and the content server 11 can receive the content provision permission and provide the content to the user. The provision of the corresponding content is transmitted to the user terminal 15. That is, step S8 functions as an information providing process. In the user terminal 15, content provision is displayed on the display unit 152, and the user displays the provided content on the display unit 152 using the input unit 151, prints it with the printing unit 153, downloads it, and stores it in the storage unit 154. It is used in the usage form such as.

ステップＳ９では、ユーザ情報管理サーバ１２の利用履歴管理部１２２がユーザ管理ＤＢ１２１にアクセスし、コンテンツを利用したユーザの利用履歴を更新する。すなわち、新たに利用したコンテンツのＩＤ番号と日付、課金情報、利用形態などの情報がユーザ管理ＤＢ１２１の利用履歴に追加される。すなわち、ステップＳ９は利用履歴記録工程として機能する。 In step S9, the usage history management unit 122 of the user information management server 12 accesses the user management DB 121 and updates the usage history of the user who used the content. That is, information such as the ID number and date of the newly used content, billing information, and usage mode is added to the usage history of the user management DB 121. That is, step S9 functions as a usage history recording process.

（類似ユーザ検索処理例１）
図５は、図４のステップＳ３、すなわち類似ユーザ検索処理の工程の詳細フローチャートである。図５を参照して類似ユーザ検索処理のフローを説明する。 (Similar user search processing example 1)
FIG. 5 is a detailed flowchart of the step S3 of FIG. 4, that is, the similar user search process. A flow of the similar user search process will be described with reference to FIG.

ステップＳ３１では、類似度算出部１２２ａで類似度算出処理が行われる。すなわち、各ユーザに対して利用履歴の類似度を算出し、類似度で順序づける。ステップＳ３１は類似度算出工程として機能する。 In step S31, similarity calculation processing is performed by the similarity calculation unit 122a. That is, the similarity of the usage history is calculated for each user, and the usage history is ordered. Step S31 functions as a similarity calculation step.

ステップＳ３２では、類似ユーザ検索部１２２ｂで、検索要求ユーザの利用履歴に対する各ユーザの類似度に基づいて、類似度の高いＮユーザが検索され、抽出される。ステップＳ３２は類似ユーザ検索工程として機能する。 In step S32, the similar user search unit 122b searches for and extracts N users with high similarity based on the similarity of each user to the usage history of the search requesting user. Step S32 functions as a similar user search step.

類似度算出工程について述べる前に、まず類似度の説明を行う。 Before describing the similarity calculation step, the similarity will be described first.

検索要求ユーザをｐとして、その他の任意のユーザをｑとする。いずれのユーザもユーザ管理ＤＢ１２１により利用履歴を参照することができる。 Let p be the search requesting user and q be any other user. Any user can refer to the use history by the user management DB 121.

ユーザｐのコンテンツ利用履歴に対する、任意の他のユーザｑのコンテンツ利用履歴の類似度を定義するために、まず任意のコンテンツｉについての類似の条件を次のように定める。 In order to define the similarity of the content usage history of any other user q with respect to the content usage history of the user p, first, a similar condition for an arbitrary content i is defined as follows.

類似の条件：任意のコンテンツｉがユーザｐに利用されており、かつ他のユーザｑにも利用されていること。 Similar conditions: Arbitrary content i is used by user p and also used by other users q.

この条件を満たすコンテンツでは数値が大きくなり、そうでなければ小さくなるようなｓを各コンテンツごとに求める、すなわち、
ユーザｑに対してコンテンツｉが上記類似の条件を満たすときｓｉ（ｐ，ｑ）＝１で、
ユーザｑに対してコンテンツｉが上記類似の条件を満たさないときｓｉ（ｐ，ｑ）＝０、
となるようにｓｉ（ｐ，ｑ）を定義する（表１参照）。 For content that satisfies this condition, the value is increased for each content, and s is determined for each content.
When content i satisfies the above similar condition for user q, si (p, q) = 1,
Si (p, q) = 0 when the content i does not satisfy the above similar condition for the user q
Si (p, q) is defined so that (see Table 1).

表１では、任意の他のユーザｑに対する任意のコンテンツｉについて、各ｓｉ（ｐ，ｑ）は、上記定義に従い１または０の値を有するものである。 In Table 1, for any content i for any other user q, each si (p, q) has a value of 1 or 0 according to the above definition.

すべてのコンテンツに対してｓｉ（ｐ，ｑ）の総和を取ることで、類似の条件を満たすコンテンツ数が多いほど数値が大きくなるように、つまり類似度が大きくなるように類似度Ｓ（ｐ，ｑ）を定義する。すなわち、 By taking the sum of si (p, q) for all the contents, the similarity S (p, q, so that the numerical value increases as the number of contents satisfying the similar condition increases, that is, the similarity increases. q) is defined. That is,

上記の式（１）に従って、すべての他のユーザｑについて、類似度Ｓ（ｐ，ｑ）を算出する。これにより利用履歴の類似度によるユーザの順序づけが可能になる。 The similarity S (p, q) is calculated for all other users q according to the above equation (1). Thereby, the user can be ordered according to the similarity of the usage history.

なお、本実施形態では類似度の高いＮユーザを求めるのに、すべての他のユーザについて類似度を算出しているが、探索用のインデクス構造を予め構築しておくことで、すべてのユーザではなく効率的に求めることも可能である。 In this embodiment, in order to obtain N users with high similarity, similarity is calculated for all other users. However, by building a search index structure in advance, It is also possible to obtain efficiently.

（類似度算出処理のフロー）
次に、具体的な類似度算出手順の例を説明する。 (Similarity calculation processing flow)
Next, an example of a specific similarity calculation procedure will be described.

図６は、図５のステップＳ３１、すなわち類似度算出処理の工程の詳細フローチャートである。図６を参照して類似度算出処理のフローを説明する。 FIG. 6 is a detailed flowchart of step S31 of FIG. 5, that is, a step of similarity calculation processing. A flow of similarity calculation processing will be described with reference to FIG.

ステップＳ３１０では、類似度の基準として検索要求ユーザｐの利用履歴を参照する。これが他のユーザの利用履歴の類似度を算出する基準となる。 In step S310, the use history of the search requesting user p is referred to as the similarity criterion. This is a reference for calculating the similarity of the usage history of other users.

ステップＳ３１１では、比較して類似度を求めるユーザとして任意の未設定の他のユーザｑを設定する。ここからステップＳ３１７までの工程が、ユーザｑの設定を変えて、すべての他のユーザに対して繰り返されることになる。すなわち、すべての他のユーザに対して類似度Ｓ（ｐ，ｑ）が算出されることになる。 In step S311, any other unset user q is set as a user who compares to obtain the similarity. The processes from here to step S317 are repeated for all other users while changing the setting of the user q. That is, the similarity S (p, q) is calculated for all other users.

ステップＳ３１２では、設定されたユーザｑに対して、初期設定として類似度Ｓ（ｐ，ｑ）が０に設定される。このＳ（ｐ，ｑ）に対して、すべてのコンテンツについてｓｉの加算を繰り返していくことにより、式（１）で表される類似度Ｓ（ｐ，ｑ）を算出するわけである。 In step S312, the similarity S (p, q) is set to 0 as an initial setting for the set user q. By repeating the addition of si for all contents with respect to this S (p, q), the similarity S (p, q) represented by the equation (1) is calculated.

ステップＳ３１３では、類似度算出のための参照コンテンツとして未設定の任意のコンテンツｉを設定する。ここからステップＳ３１５までの工程が、コンテンツｉの設定を変えて、すべてのコンテンツに対して繰り返されることになる。すなわち、すべてのコンテンツに対して類似の条件を満たすかどうかでｓｉ（ｐ，ｑ）が算出される（ステップＳ３１４）。また、すべてのコンテンツに対して、類似度Ｓ（ｐ，ｑ）に、参照したコンテンツｉのｓｉ（ｐ，ｑ）が加算され（ステップＳ３１５）、その結果すべてのコンテンツに対する繰り返しが終了すると、式（１）で表される、ｓｉ（ｐ，ｑ）の総和としての類似度Ｓ（ｐ，ｑ）が算出される（ステップＳ３１７）。 In step S313, an arbitrary content i that has not been set is set as the reference content for similarity calculation. The processes from here to step S315 are repeated for all contents by changing the setting of the contents i. That is, si (p, q) is calculated depending on whether similar conditions are satisfied for all contents (step S314). Further, for all contents, si (p, q) of the referenced content i is added to the similarity S (p, q) (step S315), and as a result, the repetition for all the contents is completed. The similarity S (p, q) represented by (1) as the sum of si (p, q) is calculated (step S317).

ステップＳ３１６では、未設定のコンテンツがないかどうか、すなわちすべてのコンテンツに対して処理を終了したかどうかを判定する。終了した場合（ステップＳ３１６：ＹＥＳ）は、ステップＳ３１７を実行する。終了していない場合（ステップＳ３１６：ＮＯ）は、ステップＳ３１３へもどり、未設定のコンテンツを設定してステップＳ３１５までの処理を繰り返す。 In step S316, it is determined whether there is no unset content, that is, whether the processing has been completed for all content. When the process is completed (step S316: YES), step S317 is executed. If not completed (step S316: NO), the process returns to step S313, sets unset content, and repeats the processing up to step S315.

ステップＳ３１７では、ユーザｑに対して算出された類似度Ｓ（ｐ，ｑ）を後で比較して順位づけるために一時保存しておく。 In step S317, the similarity S (p, q) calculated for the user q is temporarily stored for later comparison and ranking.

ステップＳ３１８では、未設定の他のユーザがいないかどうか、すなわちすべての他のユーザに対して処理を終了したかどうかを判定する。終了した場合（ステップＳ３１８：ＹＥＳ）は、ステップＳ３１９を実行する。終了していない場合（ステップＳ３１８：ＮＯ）は、ステップＳ３１１へもどり、未設定の他のユーザを設定してステップＳ３１７までの処理を繰り返す。 In step S318, it is determined whether there are no other users who have not been set, that is, whether the processing has been completed for all other users. When the process is completed (step S318: YES), step S319 is executed. If not completed (step S318: NO), the process returns to step S311 to set another user who has not been set and repeat the processing up to step S317.

ステップＳ３１９では、すべての他のユーザに対する類似度Ｓ（ｐ，ｑ）が出揃っており、一時保存されたそれらの類似度を比較し、類似度の高さで他のユーザの順序づけを行う。 In step S319, similarities S (p, q) for all other users are prepared, and the temporarily stored similarities are compared, and the other users are ordered according to the level of similarity.

これにより図５のステップＳ３２における類似度の高いＮユーザの抽出が行え、類似ユーザ検索処理を終えることができる。 Thereby, N users with high similarity in step S32 in FIG. 5 can be extracted, and the similar user search process can be completed.

（関連コンテンツ検索処理例１）
図７は、図４のステップＳ４、すなわち関連コンテンツ検索処理の工程の詳細フローチャートである。図７を参照して関連コンテンツ検索処理のフローを説明する。 (Related Content Search Processing Example 1)
FIG. 7 is a detailed flowchart of step S4 of FIG. 4, that is, the related content search process. The flow of related content search processing will be described with reference to FIG.

ステップＳ４１では、関連度算出部１１２ａで関連度算出処理が行われる。すなわち、各コンテンツに対して検索条件との関連度を算出し、関連度で順序づける。ステップＳ４１は関連度算出工程として機能する。 In step S41, the relevance calculation unit 112a performs relevance calculation processing. That is, the degree of association with the search condition is calculated for each content, and the order is based on the degree of association. Step S41 functions as an association degree calculation step.

ステップＳ４２では、関連情報検索部１１２ｂで、検索要求ユーザの指定した検索条件に対する各コンテンツの関連度に基づいて、関連度の高いＭ個のコンテンツが検索され、抽出される。ステップＳ４２は関連情報検索工程として機能する。 In step S42, the related information search unit 112b searches for and extracts M contents having a high degree of association based on the degree of association of each content with respect to the search condition designated by the search requesting user. Step S42 functions as a related information search process.

関連度算出工程について述べる前に、まず関連度の説明を行う。 Before describing the relevance level calculation step, the relevance level will be described first.

検索要求ユーザの指定した検索条件を、複数の単語からなる検索キーであるとする。各コンテンツには、既述したように検索用メタデータが付与されており、コンテンツＤＢ１１１により検索用メタデータを参照することができる。本実施形態では検索用メタデータとしてのキーワードを用いて関連度を算出する。 The search condition specified by the search requesting user is assumed to be a search key composed of a plurality of words. As described above, search metadata is assigned to each content, and the search metadata can be referred to by the content DB 111. In the present embodiment, the degree of association is calculated using a keyword as search metadata.

しかし、同様の手法によりキーワード以外の他のメタデータへも容易に拡張できる。また関連度は、メタデータではなく文書全文テキストを対象としてもよいし、文書に画像が含まれる場合は、画像との関連を算出し、関連度算出に加えてもよい。 However, it can be easily extended to metadata other than keywords by a similar method. The degree of relevance may be the full text text of the document instead of the metadata. If an image is included in the document, the relation with the image may be calculated and added to the relevance calculation.

ユーザｐの指定した検索キーと任意のコンテンツｉの関連度を定義するために、まず任意のキーワードｋについての関連の条件を次のように定める。 In order to define the degree of association between the search key designated by the user p and the arbitrary content i, first, the relationship condition for the arbitrary keyword k is determined as follows.

関連の条件：任意のキーワードｋが検索キーに含まれており、かつコンテンツｉのメタデータにも含まれていること。 Related conditions: An arbitrary keyword k is included in the search key and also included in the metadata of the content i.

本実施形態ではキーワードはコンテンツのメタデータに含まれるが、コンテンツ自身が文書でありコンテンツそのものにキーワードが含まれるような検索方法であってもよい。これを考慮して、以下、キーワードがメタデータに含まれる場合も含めて、キーワードがコンテンツに含まれるとか、コンテンツに現れるとかいう表現を用いる。 In this embodiment, the keyword is included in the metadata of the content, but a search method in which the content itself is a document and the keyword is included in the content itself may be used. Considering this, hereinafter, the expression that the keyword is included in the content or appears in the content is used, including the case where the keyword is included in the metadata.

この関連の条件を、次のように前半の条件ａと後半の条件ｂに分ける。 This related condition is divided into the first condition a and the second condition b as follows.

条件ａ：任意のキーワードｋが検索キーに含まれている
条件ｂ：任意のキーワードｋがコンテンツｉに含まれている
まず、この条件ａを満たすときには数値が大きくなり、そうでなければ小さくなるようなｘ（ｋ）を、各キーワードごとに求める、すなわち、
キーワードｋが上記条件ａを満たすときｘ（ｋ）＝１で、
キーワードｋが上記条件ａを満たさないときｘ（ｋ）＝０、
となるようにｘ（ｋ）を定義する（表２参照）。 Condition a: Arbitrary keyword k is included in the search key Condition b: Arbitrary keyword k is included in the content i First, the numerical value increases when this condition a is satisfied, and decreases otherwise. X (k) is determined for each keyword, that is,
When the keyword k satisfies the condition a, x (k) = 1,
When the keyword k does not satisfy the condition a, x (k) = 0,
X (k) is defined such that (see Table 2).

表２では、任意のキーワードｋについて、各ｘ（ｋ）は、上記定義に従い１または０の値を有するものである。 In Table 2, for any keyword k, each x (k) has a value of 1 or 0 according to the above definition.

また、上の条件ｂを満たすときには数値が大きくなり、そうでなければ小さくなるようなｄ（ｉ，ｋ）を、任意のコンテンツｉと任意のキーワードｋの組み合わせに対して求める、すなわち、
コンテンツｉとキーワードｋが上記条件ｂを満たすときｄ（ｉ，ｋ）＝１で、
コンテンツｉとキーワードｋが上記条件ｂを満たさないときｄ（ｉ，ｋ）＝０、
となるようにｄ（ｉ，ｋ）を定義する（表３参照）。 Further, d (i, k) that increases when the above condition b is satisfied, and decreases otherwise is obtained for a combination of an arbitrary content i and an arbitrary keyword k.
When content i and keyword k satisfy the above condition b, d (i, k) = 1,
When content i and keyword k do not satisfy the above condition b, d (i, k) = 0,
D (i, k) is defined so that (see Table 3).

表３では、任意のキーワードｋと任意のコンテンツｉとの組み合わせについて、各ｄ（ｉ，ｋ）は、上記定義に従い１または０の値を有するものである。 In Table 3, for a combination of an arbitrary keyword k and an arbitrary content i, each d (i, k) has a value of 1 or 0 according to the above definition.

ここで、コンテンツｉとキーワードｋの組み合わせに対して、ｘ（ｋ）×ｄ（ｉ，ｋ）を求めると、このｘ（ｋ）×ｄ（ｉ，ｋ）は、条件ａと条件ｂが同時に成立する、すなわち関連の条件を満たすときには数値が大きくなり（ｘ（ｋ）×ｄ（ｉ，ｋ）＝１）、そうでなければ小さくなる（ｘ（ｋ）×ｄ（ｉ，ｋ）＝０）。 Here, when x (k) × d (i, k) is obtained for the combination of the content i and the keyword k, the condition a and the condition b are the same for the x (k) × d (i, k). If the relation is satisfied, that is, the relation condition is satisfied, the numerical value becomes large (x (k) × d (i, k) = 1), and otherwise, the numerical value becomes small (x (k) × d (i, k) = 0). ).

すべてのキーワードに対してｘ（ｋ）×ｄ（ｉ，ｋ）の総和を取ることで、関連の条件を満たすキーワード数が多いほど数値が大きくなるように、つまり関連度が大きくなるように関連度Ｒ（ｉ）を定義する。すなわち、 By taking the sum of x (k) x d (i, k) for all keywords, the number increases as the number of keywords that satisfy the related condition increases, that is, the degree of relevance increases. A degree R (i) is defined. That is,

式（２）において、ｗ（ｋ）は、
ｗ（ｋ）＝１＋α・ｕ（ｋ）／ｖ（ｋ）（３）
である。 In equation (2), w (k) is
w (k) = 1 + α · u (k) / v (k) (3)
It is.

ｗ（ｋ）は、上記ｘ（ｋ）×ｄ（ｉ，ｋ）をキーワードについて総和して関連度Ｒ（ｉ）を求めるに際しての、重みとなる係数である。上記重みの算出式（３）におけるｕ（ｋ）／ｖ（ｋ）については表４参照。 w (k) is a coefficient that becomes a weight when the above-mentioned x (k) × d (i, k) is summed up for the keyword to obtain the relevance R (i). See Table 4 for u (k) / v (k) in the weight calculation formula (3).

表４においてｕ（ｋ）は類似度の高いＮユーザの利用したすべてのコンテンツにおけるキーワードｋの出現頻度であり、ｖ（ｋ）はすべてのコンテンツにおけるキーワードｋの出現頻度である。従って式（３）中のｕ（ｋ）／ｖ（ｋ）は、類似度の高いユーザが利用したコンテンツで出現頻度が高くなるようなキーワードが、そのコンテンツで現れると、関連度を高くするような因子である。これにより類似度の高いユーザの利用履歴を用いてコンテンツのより妥当な関連度を算出することができる。 In Table 4, u (k) is the appearance frequency of the keyword k in all contents used by N users with high similarity, and v (k) is the appearance frequency of the keyword k in all contents. Therefore, u (k) / v (k) in the expression (3) increases the relevance when a keyword that appears frequently in content used by a user with a high degree of similarity appears in the content. It is a serious factor. As a result, it is possible to calculate a more appropriate relevance level of the content using the use history of the user having a high degree of similarity.

類似度の高いユーザが利用したコンテンツで出現頻度が高いほど、関連度も高くなるが、どの程度その比率に効果を与えるかはｕ（ｋ）／ｖ（ｋ）に対する重み係数αで制御される。また、式（３）中の１は、類似度の高いユーザが利用したコンテンツにキーワードが全く現れない場合にも検索可能とするための項である。 The higher the appearance frequency of the content used by users with a high degree of similarity, the higher the degree of relevance, but the degree of effect is controlled by the weighting factor α for u (k) / v (k). . Further, 1 in the expression (3) is a term for enabling a search even when no keyword appears in the content used by a user having a high degree of similarity.

上記の式（２）に従って、すべてのコンテンツｉについて、関連度Ｒ（ｉ）を算出する。これにより検索キーとの関連度によるコンテンツの順序づけが可能になる。 Relevance R (i) is calculated for all contents i according to the above equation (2). As a result, the contents can be ordered by the degree of association with the search key.

なお、本実施形態では関連度の高いＭ個のコンテンツを求めるのに、すべてのコンテンツについて関連度を算出しているが、探索用のインデクス構造を予め構築しておくことで、すべてのコンテンツではなく効率的に求めることも可能である。 In this embodiment, in order to obtain M pieces of content having a high degree of relevance, the degree of relevance is calculated for all contents. However, by building a search index structure in advance, It is also possible to obtain efficiently.

（関連度算出処理のフロー）
次に、具体的な関連度算出手順の例を説明する。 (Relevance calculation process flow)
Next, an example of a specific relevance calculation procedure will be described.

図８は、図７のステップＳ４１、すなわち関連度算出処理の工程の詳細フローチャートである。図８を参照して関連度算出処理のフローを説明する。 FIG. 8 is a detailed flowchart of step S41 of FIG. 7, that is, a step of relevance calculation processing. With reference to FIG. 8, the flow of relevance calculation processing will be described.

ステップＳ４１０では、関連度の基準として検索要求ユーザｐの指定した検索条件、ここでは複数の単語を含む検索キーを参照する。これが任意のコンテンツの関連度を算出する基準となる。 In step S410, a search condition designated by the search request user p, here a search key including a plurality of words, is referred to as a criterion for the degree of association. This is a reference for calculating the relevance of any content.

ステップＳ４１１では、比較して関連度を求めるコンテンツとして任意の未設定のコンテンツｉを設定する。ここからステップＳ４１７までの工程が、コンテンツｉの設定を変えて、すべてのコンテンツに対して繰り返されることになる。すなわち、すべてのコンテンツに対して関連度Ｒ（ｉ）が算出されることになる。 In step S411, arbitrary unset content i is set as the content for which the relevance is obtained by comparison. The processes from here to step S417 are repeated for all contents by changing the setting of the contents i. That is, the relevance R (i) is calculated for all contents.

ステップＳ４１２では、設定されたコンテンツｉに対して、初期設定として関連度Ｒ（ｉ）が０に設定される。このＲ（ｉ）に対して、すべてのキーワードについて、前記ｘ（ｋ）×ｄ（ｉ，ｋ）に重み係数ｗ（ｋ）をかけたものの加算を繰り返していくことにより、式（２）で表される関連度Ｒ（ｉ）を算出するわけである。 In step S412, the degree of association R (i) is set to 0 as an initial setting for the set content i. For this R (i), by repeating the addition of x (k) × d (i, k) multiplied by the weighting factor w (k) for all keywords, The degree of association R (i) represented is calculated.

ステップＳ４１３では、関連度算出のための参照キーワードとして未設定の任意のキーワードｋを設定する。ここからステップＳ４１５までの工程が、キーワードｋの設定を変えて、すべてのキーワードに対して繰り返されることになる。すなわち、すべてのキーワードに対して関連の条件を満たすかどうかでｘ（ｋ）、ｄ（ｉ，ｋ）、重み係数ｗ（ｋ）が算出される（ステップＳ４１４）。また、すべてのキーワードに対して、関連度Ｒ（ｉ）に、参照したキーワードｋについてのｘ（ｋ）×ｄ（ｉ，ｋ）に重み係数ｗ（ｋ）をかけたものが加算され（ステップＳ４１５）、その結果すべてのキーワードに対する繰り返しが終了すると、式（２）で表される、ｘ（ｋ）×ｄ（ｉ，ｋ）に重み係数ｗ（ｋ）をかけたものの総和としての関連度Ｒ（ｉ）が算出される（ステップＳ４１７）。 In step S413, an arbitrary keyword k that has not been set is set as a reference keyword for calculating the degree of association. The processes from here to step S415 are repeated for all the keywords while changing the setting of the keyword k. That is, x (k), d (i, k), and weighting factor w (k) are calculated depending on whether the relevant conditions are satisfied for all keywords (step S414). In addition, for all keywords, the relevance R (i) is multiplied by x (k) × d (i, k) for the referenced keyword k multiplied by a weighting factor w (k) (step) S415) As a result, when the repetition for all keywords is completed, the degree of relevance as the sum of x (k) × d (i, k) multiplied by the weighting factor w (k) represented by equation (2) R (i) is calculated (step S417).

ステップＳ４１６では、未設定のキーワードがないかどうか、すなわちすべてのキーワードに対して処理を終了したかどうかを判定する。終了した場合（ステップＳ４１６：ＹＥＳ）は、ステップＳ４１７を実行する。終了していない場合（ステップＳ４１６：ＮＯ）は、ステップＳ４１３へもどり、未設定のキーワードを設定してステップＳ４１５までの処理を繰り返す。 In step S416, it is determined whether there are no unset keywords, that is, whether the processing has been completed for all keywords. When the process is completed (step S416: YES), step S417 is executed. If not completed (step S416: NO), the process returns to step S413, an unset keyword is set, and the process up to step S415 is repeated.

ステップＳ４１７では、コンテンツｉに対して算出された関連度Ｒ（ｉ）を後で比較して順位づけるために一時保存しておく。 In step S417, the degree of association R (i) calculated for the content i is temporarily stored for later comparison and ranking.

ステップＳ４１８では、未設定のコンテンツがないかどうか、すなわちすべてのコンテンツに対して処理を終了したかどうかを判定する。終了した場合（ステップＳ４１８：ＹＥＳ）は、ステップＳ４１９を実行する。終了していない場合（ステップＳ４１８：ＮＯ）は、ステップＳ４１１へもどり、未設定のコンテンツを設定してステップＳ４１７までの処理を繰り返す。 In step S418, it is determined whether there is no unset content, that is, whether processing has been completed for all content. If it has been completed (step S418: YES), step S419 is executed. If not completed (step S418: NO), the process returns to step S411, sets unset content, and repeats the processing up to step S417.

ステップＳ４１９では、すべてのコンテンツに対する関連度Ｒ（ｉ）が出揃っており、一時保存されたそれらの関連度を比較し、関連度の高さでコンテンツの順序づけを行う。 In step S419, the relevance levels R (i) for all the contents are gathered, the temporarily stored relevance levels are compared, and the contents are ordered according to the level of relevance.

これにより図７のステップＳ４２における関連度の高いＭコンテンツの抽出が行え、関連コンテンツ検索処理を終えることができる。 Accordingly, the M content having a high degree of relevance in step S42 in FIG. 7 can be extracted, and the related content search process can be completed.

（類似ユーザ検索処理例２）
図９は、図５を用いて説明した類似ユーザ検索処理の別の処理例を示すフローチャートである。図９を用いて、類似ユーザ検索処理例２を説明する。 (Similar user search processing example 2)
FIG. 9 is a flowchart illustrating another example of similar user search processing described with reference to FIG. A similar user search processing example 2 will be described with reference to FIG.

ステップＳ５１では、参照した検索ユーザの利用履歴について、類似度を算出するために十分かどうかを判定する。例えば、利用回数、あるいは利用コンテンツ数などが所定のしきい値以上であるかどうかによって、判定を行う。所定のしきい値以上であれば（ステップＳ５１：ＹＥＳ）、ステップＳ３１とステップＳ３２を実行する。ステップＳ３１とステップＳ３２の工程は、既に述べた図５と同様であり、説明は省略する。 In step S51, it is determined whether or not the usage history of the referred search user is sufficient to calculate the similarity. For example, the determination is performed based on whether the number of times of use or the number of used contents is equal to or greater than a predetermined threshold. If it is equal to or greater than the predetermined threshold (step S51: YES), steps S31 and S32 are executed. Steps S31 and S32 are the same as those already described with reference to FIG.

利用回数、あるいは利用コンテンツ数などが所定のしきい値以上でない場合（ステップＳ５１：ＮＯ）は、ステップＳ５２を実行し、通常のステップＳ３１とステップＳ３２の工程は実行しない。すなわち、ステップＳ５２では、予め定められた所定のユーザを類似度の高いユーザとして設定する。あるいは、仮想の利用履歴を有する仮想のユーザを設定してもよい。 If the number of times of use or the number of used contents is not equal to or greater than a predetermined threshold (step S51: NO), step S52 is executed, and the normal steps S31 and S32 are not executed. That is, in step S52, a predetermined user set in advance is set as a user having a high degree of similarity. Alternatively, a virtual user having a virtual usage history may be set.

このように、検索ユーザの利用履歴が十分でないときにも、ユーザのプロファイルを利用する、あるいは特定の傾向を想定するなどして予め代表的なユーザ、あるいは仮想のユーザを定めておき、類似ユーザとして設定することで、関連コンテンツの検索をより適切に行うことができる。 In this way, even when the search user's usage history is not sufficient, a representative user or a virtual user is determined in advance by using the user's profile or assuming a specific tendency, and similar users. By setting as, the related content can be searched more appropriately.

（類似ユーザ検索処理例３）
図１０は、図５を用いて説明した類似ユーザ検索処理のまた別の処理例を示すフローチャートである。図１０を用いて、類似ユーザ検索処理例３を説明する。 (Similar user search processing example 3)
FIG. 10 is a flowchart showing another example of similar user search processing described with reference to FIG. A similar user search processing example 3 will be described with reference to FIG.

利用回数、あるいは利用コンテンツ数などが所定のしきい値以上でない場合（ステップＳ５１：ＮＯ）は、ステップＳ５３を実行し、その後で通常のステップＳ３１とステップＳ３２の工程の実行へと戻る。ステップＳ５３では、予め定められた所定の利用履歴を検索ユーザの仮想の利用履歴として設定する。あるいは、アンケート方式などでユーザに仮想の利用履歴を入力させるようにしてもよい。この仮想の利用履歴をもとにして、次のステップＳ３１の類似度算出が行われる。 When the number of times of use or the number of contents used is not equal to or greater than a predetermined threshold (step S51: NO), step S53 is executed, and then the process returns to the normal execution of steps S31 and S32. In step S53, a predetermined predetermined usage history is set as a virtual usage history of the search user. Alternatively, the user may be caused to input a virtual usage history by a questionnaire method or the like. Based on this virtual usage history, similarity calculation in the next step S31 is performed.

このように、検索ユーザの利用履歴が十分でないときにも、仮想の利用履歴を設定することにより、あるいはユーザに入力させるなどして、嗜好の類似するユーザを検索し、類似ユーザとすることで、関連コンテンツの検索に利用することができ、より精度のよい類似ユーザの検索と、関連度の高いユーザ検索が可能になる。 In this way, even when the search user's usage history is not sufficient, by searching for a user with similar preferences by setting a virtual usage history or by allowing the user to input, the user can be a similar user. It is possible to search for related contents, and it is possible to search for similar users with higher accuracy and user searches with high relevance.

（類似ユーザ検索処理例４）
図１１は、図５を用いて説明した類似ユーザ検索処理のまた別の処理例を示すフローチャートである。図１１を用いて、類似ユーザ検索処理例４を説明する。 (Similar user search processing example 4)
FIG. 11 is a flowchart illustrating another example of the similar user search process described with reference to FIG. A similar user search processing example 4 will be described with reference to FIG.

ステップＳ５４では、予め定めた所定のコンテンツを上位に検索されるように望む設定かどうかを判定する。例えば、情報検索システム１の管理者が、所定のコンテンツが上位に検索されることを望むのであれば、仮想の利用履歴を有する仮想のユーザを登録することで、所定のコンテンツが上位に検索されるように導くこともできる。 In step S54, it is determined whether or not the predetermined setting is desired to be searched higher. For example, if the administrator of the information search system 1 desires that a predetermined content is searched for higher, the predetermined content is searched higher by registering a virtual user having a virtual usage history. Can also be guided.

ステップＳ５４で、上位検索を望む設定であれば（ステップＳ５４：ＹＥＳ）、ステップＳ５５を実行する。ステップＳ５５は、上位検索を望むコンテンツが同種のコンテンツとともに仮想的に利用されているような履歴を有する仮想のユーザを登録する。その後、通常のステップＳ３１とステップＳ３２の工程の実行へと戻ることで、登録した仮想ユーザが類似ユーザとして検索されやすくなり、しいては所定のコンテンツが関連コンテンツとして検索される可能性も増大する。 If it is a setting which desires a high-order search by step S54 (step S54: YES), step S55 will be performed. A step S55 registers a virtual user who has a history that the content for which a higher search is desired is virtually used together with the same type of content. Thereafter, by returning to the execution of the normal steps S31 and S32, the registered virtual user can be easily searched as a similar user, and the possibility that a predetermined content is searched as related content also increases. .

ステップＳ５４で、上位検索を望む設定でなければ（ステップＳ５４：ＮＯ）、そのまま通常のステップＳ３１とステップＳ３２の工程の実行が行われる。ステップＳ３１とステップＳ３２の工程は、既に述べた図５と同様であり、説明は省略する。 In step S54, if it is not a setting that desires a higher search (step S54: NO), the normal steps S31 and S32 are executed as they are. Steps S31 and S32 are the same as those already described with reference to FIG.

このように、予め定めた所定のコンテンツを上位に検索されるように望むような場合にも、仮想の利用履歴を有する仮想のユーザを登録することにより、登録した仮想ユーザが類似ユーザとして検索されやすくなり、しいては所定のコンテンツが関連コンテンツとして検索される可能性を上げることもできる。 In this way, even when it is desired to search for predetermined content that is determined in advance, by registering a virtual user having a virtual usage history, the registered virtual user is searched as a similar user. As a result, the possibility that a predetermined content is searched as related content can be increased.

（類似ユーザ検索処理例５）
図１２は、図５を用いて説明した類似ユーザ検索処理のまた別の処理例を示すフローチャートである。図１２を用いて、類似ユーザ検索処理例５を説明する。 (Similar user search processing example 5)
FIG. 12 is a flowchart illustrating another example of similar user search processing described with reference to FIG. A similar user search processing example 5 will be described with reference to FIG.

ステップＳ５６では、検索する他のユーザについて、類似度を算出するためのデータとしては古すぎる利用履歴を有する古いユーザを検索除外するかどうかを判定する。古いユーザを除外する場合（ステップＳ５６：ＹＥＳ）は、ステップＳ５７を実行する。古いユーザも除外しない場合（ステップＳ５６：ＮＯ）は、通常通りステップＳ３１とステップＳ３２を実行する。ステップＳ３１とステップＳ３２の工程は、既に述べた図５と同様であり、説明は省略する。 In step S56, for other users to be searched, it is determined whether or not to search for an old user having a usage history that is too old as data for calculating the similarity. If an old user is excluded (step S56: YES), step S57 is executed. If the old user is not excluded (step S56: NO), steps S31 and S32 are executed as usual. Steps S31 and S32 are the same as those already described with reference to FIG.

ステップＳ５７では、予め定められた所定の期間以上利用のないユーザを検索対象の他のユーザから除外する設定を行う。あるいは、所定の期間は入力して設定するようにしてもよい。例えば、最新の利用時期を参照して、それ以後アクセスしていない期間が所定の期間を越えるユーザについては、検索から除外すべき古いユーザと設定する。すなわち、この設定された期間に対してアクセスのあるユーザだけが次の類似度算出処理以後の対象ユーザと設定されるのである。 In step S57, a setting is made to exclude users who have not been used for a predetermined period of time from other users to be searched. Alternatively, a predetermined period may be input and set. For example, referring to the latest use time, a user who has not accessed after that exceeds a predetermined period is set as an old user to be excluded from the search. In other words, only users who have access during the set period are set as target users after the next similarity calculation process.

次いでステップＳ３１とステップＳ３２が実行される。ステップＳ３１とステップＳ３２の工程は、既に述べた図５の場合と、上記の古いユーザと設定されたユーザは検索から除外される点が異なる。それ以外は同様であり説明は省略する。 Next, step S31 and step S32 are executed. Steps S31 and S32 differ from the case of FIG. 5 described above in that the old user and the set user are excluded from the search. The rest is the same, and the description is omitted.

このように、検索対象の他のユーザの利用履歴が古くて、仮に類似ユーザであるとしても、その利用履歴が余り参考にならない場合は、検索の対象から除外することにより、関連コンテンツの検索において、古すぎるコンテンツが検索される可能性の増大することを防止でき、最新の類似ユーザの検索を通じて、関連度の高い最新のコンテンツ検索が可能になる。 In this way, even if the usage history of other users to be searched is old and the usage history is not very helpful even if it is a similar user, in the search for related content, it is excluded from the search target. Therefore, it is possible to prevent the possibility that content that is too old will be searched, and to search for the latest content with a high degree of relevance through the search for the latest similar user.

（関連コンテンツ検索処理例２）
図１３は、図７を用いて説明した関連コンテンツ検索処理のまた別の処理例を示すフローチャートである。図１３を用いて、関連コンテンツ検索処理例２を説明する。 (Related content search processing example 2)
FIG. 13 is a flowchart illustrating another example of the related content search process described with reference to FIG. The related content search processing example 2 will be described with reference to FIG.

ステップＳ６１では、検索するコンテンツについて、関連度の高いコンテンツとして新しいコンテンツをより重要視するかどうかを判定する。新しいコンテンツを重用する場合（ステップＳ６１：ＹＥＳ）は、ステップＳ６２を実行する。そうでない場合（ステップＳ６１：ＮＯ）は、通常通りステップＳ４１とステップＳ４２を実行する。ステップＳ４１とステップＳ４２の工程は、既に述べた図７と同様であり、説明は省略する。 In step S61, it is determined whether or not new content is regarded as more important as content with a high degree of relevance. If new content is to be used heavily (step S61: YES), step S62 is executed. When that is not right (step S61: NO), step S41 and step S42 are performed as usual. Steps S41 and S42 are the same as those already described with reference to FIG.

ステップＳ６２では、次の関連度算出処理に用いる重み係数ｗ（ｋ）の算出式を変更する。例えば、重み係数ｗ（ｋ）を次の式（４）のように、コンテンツｉの登録時期により異なるような算出式とする。 In step S62, the calculation formula of the weight coefficient w (k) used for the next relevance calculation process is changed. For example, the weighting coefficient w (k) is a calculation formula that varies depending on the registration time of the content i as in the following formula (4).

すなわちも式（２）におけるｗ（ｋ）は、
ｗ（ｋ，ｉ）＝１＋α・ｕ（ｋ）／ｖ（ｋ）＋β・１／ΔＴ（ｉ）（４）
に置き換えられる。 That is, w (k) in equation (2) is
w (k, i) = 1 + α · u (k) / v (k) + β · 1 / ΔT (i) (4)
Is replaced by

ここでΔＴ（ｉ）は、コンテンツｉの登録からの期間を示し、βは、それに対する重みの程度を表す所定の定数である。 Here, ΔT (i) indicates a period from the registration of the content i, and β is a predetermined constant indicating the degree of weighting for the period.

このように重み係数ｗ（ｋ）を変更した上で、次のステップＳ４１とステップＳ４２が実行される。ステップＳ４１とステップＳ４２の工程は、既に述べた図７の場合と、上記の関連度算出に利用される重み係数ｗ（ｋ）のみが異なる。それ以外は同様であり説明は省略する。 After changing the weighting coefficient w (k) in this way, the following steps S41 and S42 are executed. Steps S41 and S42 differ from the case of FIG. 7 described above only in the weight coefficient w (k) used for the above-described relevance calculation. The rest is the same, and the description is omitted.

すなわち、ステップＳ４１の関連度算出処理では、式（４）の重み係数ｗ（ｋ，ｉ）を用いることにより、最近登録された新しいコンテンツほど重み係数が大きくなり、新しいコンテンツほど関連度が高く算出されることになる。 That is, in the relevance calculation process in step S41, by using the weighting coefficient w (k, i) of Expression (4), the weighting coefficient increases for newly registered new contents, and the relevance degree increases for new contents. Will be.

このように、検索対象のコンテンツに、古くて余り参考にならないコンテンツが含まれる場合は、コンテンツの新しさに応じて重み係数を変更することにより、関連コンテンツの検索において、古すぎるコンテンツが検索される可能性が縮小され、最新のコンテンツが検索されやすくなり、関連度の高い最新のコンテンツ検索が可能になる。 In this way, when content to be searched includes content that is too old to be used for reference, content that is too old is searched for by searching for related content by changing the weighting factor according to the newness of the content. The possibility that the latest content is reduced, the latest content can be easily searched, and the latest content search with high relevance can be performed.

（類似ユーザ検索処理例６）
図１４は、図５を用いて説明した類似ユーザ検索処理のまた別の処理例を示すフローチャートである。図１４を用いて、類似ユーザ検索処理例６を説明する。 (Similar user search processing example 6)
FIG. 14 is a flowchart illustrating another example of the similar user search process described with reference to FIG. A similar user search processing example 6 will be described with reference to FIG.

ステップＳ５８では、類似ユーザを検索するについて、類似度を算出するためのデータとして、ユーザの最近の嗜好、すなわち新しい利用履歴をより重要視するかどうかを判定する。新しい利用履歴を重用する場合（ステップＳ５８：ＹＥＳ）は、ステップＳ５９を実行する。そうでない場合（ステップＳ５８：ＮＯ）は、通常通りステップＳ３１とステップＳ３２を実行する。ステップＳ３１とステップＳ３２の工程は、既に述べた図５と同様であり、説明は省略する。 In step S58, for searching for similar users, it is determined whether or not the user's recent preference, that is, a new usage history is more important as data for calculating similarity. If the new usage history is to be used heavily (step S58: YES), step S59 is executed. When that is not right (step S58: NO), step S31 and step S32 are performed as usual. Steps S31 and S32 are the same as those already described with reference to FIG.

ステップＳ５９では、次の類似度算出処理に用いるｓｉ（ｐ，ｑ）の定義を変更する。例えば、ｓｉ（ｐ，ｑ）を次のように、類似の条件を満たすときも必ずしも１ではなく、コンテンツｉの利用時期により異なるように定義する。 In step S59, the definition of si (p, q) used for the next similarity calculation process is changed. For example, si (p, q) is defined to be different depending on the use time of the content i, not necessarily 1, even when similar conditions are satisfied as follows.

ユーザｑに対してコンテンツｉが既述の類似の条件を満たすとき
ｓｉ（ｐ，ｑ）＝１／ΔＴ（ｉ）で、
ユーザｑに対してコンテンツｉが既述の類似の条件を満たさないとき
ｓｉ（ｐ，ｑ）＝０
と定義する。 When content i satisfies the similar condition described above for user q
si (p, q) = 1 / ΔT (i),
When content i does not satisfy the similar condition described above for user q
si (p, q) = 0
It is defined as

ここでΔＴ（ｉ）は、コンテンツｉの最新の利用日時からの期間を示す。最新の利用日時は、ユーザｐまたはユーザｑの利用履歴から算出される。 Here, ΔT (i) indicates a period from the latest use date and time of the content i. The latest usage date and time is calculated from the usage history of the user p or the user q.

このようにｓｉ（ｐ，ｑ）を変更した上で、次のステップＳ３１とステップＳ３２が実行される。ステップＳ３１とステップＳ３２の工程は、既に述べた図５の場合と、上記の類似度算出に利用されるｓｉ（ｐ，ｑ）のみが異なる。それ以外は同様であり説明は省略する。 After changing si (p, q) in this way, the following steps S31 and S32 are executed. Steps S31 and S32 differ from the case of FIG. 5 described above only in si (p, q) used for the above-described similarity calculation. The rest is the same, and the description is omitted.

すなわち、ステップＳ３１の類似度算出処理では、上記変更されたｓｉ（ｐ，ｑ）を用いることにより、コンテンツに対する最近の利用を類似度算出に重視することで、最近のユーザの嗜好が類似度算出に反映されることになる。 That is, in the similarity calculation process of step S31, the recent user's preference is calculated by using the changed si (p, q), with emphasis on the recent usage of the content in the similarity calculation. Will be reflected.

このように、類似ユーザ検索に用いる利用履歴に、古い利用履歴、すなわちユーザの古い嗜好を反映するコンテンツ利用が含まれる場合は、各コンテンツの利用時期の新しさに応じて類似度を算出するための定義を変更することにより、類似ユーザの検索において、ユーザの古すぎる利用履歴に基づき類似ユーザが検索される可能性が縮小され、最新の利用履歴に基づく検索が行われやすくなり、最新の利用履歴において類似度の高い類似ユーザ検索が可能になる。 In this way, when the usage history used for the similar user search includes an old usage history, that is, content usage reflecting the user's old preference, the similarity is calculated according to the new usage timing of each content. By changing the definition of, the possibility of searching for similar users based on the user's past usage history is reduced in the search for similar users, making it easier to perform searches based on the latest usage history and the latest usage. A similar user search having a high similarity in the history can be performed.

上述のように、本実施形態によれば、データベースに蓄積された情報から検索ユーザが所望の情報を検索するに当たり、蓄積された各ユーザごとの利用履歴から、検索ユーザと他のユーザの利用履歴を比較して類似度を算出し、その類似度の高い類似ユーザを求めることができる。またその利用情報に基づき、各情報と検索ユーザの提示した検索条件との関連度を算出し、関連度の高い情報を検索することができる。これにより、事前にデータを準備する必要なく、興味や関心の類似する他のユーザの利用履歴を参照することができ、簡単に、より広がりのある、内容をベースとした情報検索が、効率的に、且つ精度よく行える情報検索システム、情報検索方法、及び情報管理装置を提供することができる。 As described above, according to the present embodiment, when the search user searches for desired information from the information stored in the database, the use history of the search user and other users is determined from the stored use history for each user. Can be calculated by calculating the similarity, and a similar user having a high similarity can be obtained. Further, based on the usage information, the degree of association between each piece of information and the retrieval condition presented by the retrieval user can be calculated, and information with a high degree of association can be retrieved. This makes it possible to refer to the usage history of other users with similar interests without having to prepare data in advance, making it easier and more efficient to search information based on content more easily. In addition, it is possible to provide an information search system, an information search method, and an information management device that can be performed with high accuracy.

なお本発明の実施形態は、上記の実施形態に限定されるものではなく、本発明の趣旨に則る限り、様々な変更された形態もその範囲に含まれるものである。 The embodiment of the present invention is not limited to the above-described embodiment, and various modified forms are also included in the scope thereof as long as the gist of the present invention is met.

本実施形態に係る情報検索システム１の構成例を示すブロック図である。It is a block diagram which shows the structural example of the information search system 1 which concerns on this embodiment. 本実施形態に係る情報検索システム１の動作例を示すシーケンス図である。It is a sequence diagram which shows the operation example of the information search system 1 which concerns on this embodiment. 図１におけるユーザ管理ＤＢ１２１、及びコンテンツＤＢ１１１の内部データ構造の例を示す図である。It is a figure which shows the example of the internal data structure of the user management DB121 in FIG. 1, and content DB111. 情報検索システム１におけるコンテンツの検索と利用に関する全体の流れを示すフローチャートである。3 is a flowchart showing an overall flow regarding content search and use in the information search system 1. 図４におけるステップＳ３、すなわち類似ユーザ検索処理の工程を説明するフローチャートである。FIG. 5 is a flowchart for explaining step S 3 in FIG. 4, that is, a similar user search process. 図５におけるステップＳ３１、すなわち類似度算出処理の工程の詳細フローチャートである。6 is a detailed flowchart of step S31 in FIG. 5, that is, a step of similarity calculation processing. 図４におけるステップＳ４、すなわち関連コンテンツ検索処理の工程を説明するフローチャートである。FIG. 5 is a flowchart illustrating step S4 in FIG. 4, that is, a process of related content search processing. 図７におけるステップＳ４１、すなわち関連度算出処理の工程の詳細フローチャートである。It is a detailed flowchart of the process of step S41 in FIG. 図５における類似ユーザ検索処理の、別の処理例２を示すフローチャートである。It is a flowchart which shows another process example 2 of the similar user search process in FIG. 図５における類似ユーザ検索処理の、別の処理例３を示すフローチャートである。It is a flowchart which shows another process example 3 of the similar user search process in FIG. 図５における類似ユーザ検索処理の、別の処理例４を示すフローチャートである。It is a flowchart which shows another process example 4 of the similar user search process in FIG. 図５における類似ユーザ検索処理の、別の処理例５を示すフローチャートである。It is a flowchart which shows another process example 5 of the similar user search process in FIG. 図７における関連コンテンツ検索処理の、別の処理例２を示すフローチャートである。It is a flowchart which shows another process example 2 of the related content search process in FIG. 図５における類似ユーザ検索処理の、別の処理例６を示すフローチャートである。It is a flowchart which shows another process example 6 of the similar user search process in FIG.

Explanation of symbols

１情報検索システム
１１コンテンツサーバ
１２ユーザ情報管理サーバ
１３課金サーバ
１４コンピュータネットワーク
１５、１６、１７ユーザ端末
３１コンテンツ
３２検索用メタデータ
４１ユーザデータ
４２利用履歴
１１１コンテンツＤＢ
１１２検索処理部
１１２ａ関連度算出部
１１２ｂ関連情報検索部
１１３、１２４、１３３、１５５ネットワーク接続部
１２１ユーザ管理ＤＢ
１２２利用履歴管理部
１２２ａ類似度算出部
１２２ｂ類似ユーザ検索部
１２３ユーザ管理部
１３１決済処理部
１３２課金処理部
１５１入力部
１５２表示部
１５３印刷部
１５４記憶部
１５６制御部 DESCRIPTION OF SYMBOLS 1 Information search system 11 Content server 12 User information management server 13 Billing server 14 Computer network 15, 16, 17 User terminal 31 Content 32 Search metadata 41 User data 42 Usage history 111 Content DB
112 Search processing unit 112a Relevance calculation unit 112b Related information search unit 113, 124, 133, 155 Network connection unit 121 User management DB
122 Usage history management unit 122a Similarity calculation unit 122b Similar user search unit 123 User management unit 131 Settlement processing unit 132 Charge processing unit 151 Input unit 152 Display unit 153 Printing unit 154 Storage unit 156 Control unit

Claims

Information storage means for storing information; and
A search condition setting means for setting a search condition for a search user to search for desired information;
Information search means for searching for information from the information storage means based on the search conditions set by the search condition setting means;
Information providing means for selectively using information from the information searched by the information searching means to the search user;
Regarding the use of information based on the information providing means, usage history storage means for storing a usage history for each user including the search user;
In an information search system comprising: a usage history management unit that manages a usage history for each user including the search user stored in the usage history storage unit;
The usage history management means includes:
Similarity calculation means for calculating the similarity between the usage history of the search user stored in the usage history storage means and the usage history of other users other than the search user;
Similar user search means for searching for other users other than the search user whose use history is similar to the search user based on the similarity calculated by the similarity calculation means;
The information search means includes
Based on the information use history of other users other than the search user similar in the use history searched by the similar user search means, the degree of association between each information stored in the information storage means and the search condition is determined. Relevance calculating means for calculating;
Related information search means for searching related information from the information storage means based on the relevance calculated by the relevance calculation means;
An information retrieval system characterized by that.

The information providing means includes
The search user can copy the information to a predetermined medium designated by the search user or copy the information to another information storage means different from the information storage means as the use form of the selected information. Let
The information search system according to claim 1.

Charging means for charging the search user for use of information by the search user based on the information providing means;
The information search system according to claim 1 or 2, characterized in that

When the usage history of the search user stored in the usage history storage means is a predetermined number of times or less,
The usage history management means includes:
Without calculating the similarity by the similarity calculation means and searching for similar users based on the similarity by the similar user search means,
Set a predetermined other user as a similar user with a high similarity,
The information search system according to any one of claims 1 to 3, wherein

When the usage history of the search user stored in the usage history storage means is a predetermined number of times or less,
The usage history management means sets a virtual usage history for similarity calculation for the search user,
The similarity calculation means calculates the similarity based on the set virtual usage history;
The information search system according to any one of claims 1 to 3, wherein

The usage history storage means includes
The virtual user's virtual usage history is stored,
The information search system according to claim 1, wherein the information search system is an information search system.

The similarity calculation means includes:
Exclude other users who have no usage history for a certain period of time and calculate the similarity of usage history.
The information search system according to claim 1, wherein the information search system is an information search system.

The relevance calculation means includes:
Calculating the degree of relevance based on the date and time when the information was stored in the information storage means;
The information search system according to claim 1, wherein the information search system is an information search system.

The similarity calculation means includes:
Calculate similarity based on usage date and time in usage history,
The information search system according to claim 1, wherein the information search system is an information search system.

A search condition setting step for setting a search condition for the search user to search for desired information;
Based on the search condition set in the search condition setting step, an information search step of searching for information from the information storage means in which information is stored;
An information providing step for selectively using information to the search user from the information searched in the information search step;
Regarding the use of information based on the information providing step, a usage history recording step for accumulating a usage history for each user including the search user;
A usage history management step of managing a usage history for each user including the search user accumulated in the usage history recording step;
The usage history management step includes:
A similarity calculation step of calculating a similarity between the usage history of the search user and the usage history of other users other than the search user, accumulated in the usage history recording step;
A similar user search step of searching for other users other than the search user similar to the search user based on the similarity calculated in the similarity calculation step,
The information search step includes
Based on the information use history of other users other than the search user similar to the use history searched in the similar user search step, the degree of association between each information stored in the information storage means and the search condition is determined. Relevance calculation step to calculate,
A related information search step of searching for related information from the information storage means based on the relevance calculated in the relevance calculation step.
An information search method characterized by that.

In the information providing step,
As the usage form of the selected information, the search user can copy information to a predetermined medium designated by the search user or copy information to another information storage means different from the information storage means. Become,
The information search method according to claim 10.

A charging step of charging the search user for use of information by the search user based on the information providing step;
The information search method according to claim 10 or 11, wherein

When the usage history of the search user recorded in the usage history recording step is a predetermined number of times or less,
In the usage history management step,
Calculation of similarity in the similarity calculation step and search for similar users based on the similarity in the similar user search step are not performed,
A predetermined other user is set as a similar user having a high degree of similarity.
The information search method according to claim 10, wherein the information search method is a data search method.

When the usage history of the search user recorded in the usage history recording step is a predetermined number of times or less,
In the usage history management step, a virtual usage history for similarity calculation is set for the search user,
In the similarity calculation step, the similarity is calculated based on the set virtual usage history.
The information search method according to claim 10, wherein the information search method is a data search method.

In the usage history recording step,
The virtual usage history of the virtual user is recorded,
The information search method according to claim 10, wherein the information search method is a data search method.

In the similarity calculation step,
The usage history similarity is calculated by excluding other users who have no usage history for a predetermined period of time.
The information search method according to any one of claims 10 to 15, wherein

In the relevance calculation step,
The relevance is calculated based on the date and time when the information is stored in the information storage unit.
The information search method according to any one of claims 10 to 16, wherein

In the similarity calculation step,
The similarity is calculated based on the usage date and time in the usage history.
The information search method according to any one of claims 10 to 17, wherein:

Search condition setting means for acquiring and setting search conditions for searching for desired information from a search user;
Information providing means for selectively using the searched information to the search user;
Regarding the use of information based on the information providing means, usage history storage means for storing a usage history for each user including the search user;
In an information management device having
Similarity calculation means for calculating the similarity between the usage history of the search user stored in the usage history storage means and the usage history of other users other than the search user;
Similar user search means for searching for other users other than the search user whose use history is similar to the search user based on the similarity calculated by the similarity calculation means;
For the information storage means in which the information is stored, the search conditions set by the search condition setting means and the other users other than the search user similar in the usage history searched by the similar user search means Based on the information usage history, instruct information search and obtain the search results,
An information management apparatus characterized by that.