JP2002342379A

JP2002342379A - Method, system and program for ranking popularity of documents and recording medium with the program recorded thereon

Info

Publication number: JP2002342379A
Application number: JP2001145747A
Authority: JP
Inventors: Toshibumi Enomoto; 俊文榎本; Hiroyuki Toda; 浩之戸田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2001-05-16
Filing date: 2001-05-16
Publication date: 2002-11-29

Abstract

PROBLEM TO BE SOLVED: To perform a popularity ranking of documents, by which even documents having a little information can be highly accurately ranked while being hardly affected by a bias in utilization information. SOLUTION: A document retrieving part 10 stores the utilization information of the users of respective documents on a utilization information database 21. A document ranking system 30 calculates the result score of each of documents from the utilization information to the documents stored on the database, defines the average result score of the partial set of other documents having utilization information including the utilization information on each of documents as an expected score concerning that document, calculates the document score by combination weighting the result score and the expected score with respective significance degrees corresponding to the size of the partial set and stores it on a document score database 22. Document sets receiving a retrieval request from the user are ranked by using the document scores of the respective documents in the document sore database and transmitted to the user.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、コンピュータネッ
トワーク上の文書検索システムに係り、特に文書の人気
度を順位付けした検索結果リストを提示する文書の人気
度順位付けに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document search system on a computer network, and more particularly, to a document ranking system which presents a search result list in which the document popularity levels are ranked.

【０００２】[0002]

【従来の技術】従来、コンピュータネットワーク上の文
書検索において、検索文書の人気度を順位付ける方法と
して以下の方法が広く用いられている。2. Description of the Related Art Heretofore, in document search on a computer network, the following methods have been widely used as a method for ranking the popularity of search documents.

【０００３】（ａ）検索キーワードに含まれる単語が文
書中に出現する頻度を利用し、検索キーワードと類似度
が高い順に行う方法。(A) A method in which the frequency of occurrence of a word included in a search keyword in a document is used in the descending order of similarity with the search keyword.

【０００４】（ｂ）文書間の引用関係を利用し、多くの
文書からの引用や、多くの文書に引用されている文書か
らの引用が多い順に行う方法。(B) A method in which citations from many documents and citations from documents cited in many documents are used in descending order of citations using citation relationships between documents.

【０００５】しかしこれらの方法には以下の問題があ
る。However, these methods have the following problems.

【０００６】（ａ）実際に利用者が入力する検索キーワ
ードは数語程度のため、十分な検索精度が期待できな
い。(A) Since a search keyword actually input by a user is about several words, sufficient search accuracy cannot be expected.

【０００７】（ｂ）十分な量の文書間の引用関係が存在
していることが前提となるため、適用範囲が制限され
る。(B) It is assumed that there is a sufficient amount of citation relation between documents, so that the applicable range is limited.

【０００８】これらの問題や制限を受けない方法とし
て、（ａ）利用者が検索を行った際の各文書に対する参
照回数や、参照を行った利用者からの実際の評価といっ
た利用情報を使って、総合的な「人気度」を算出し、そ
れを基に行う方法が利用されている。[0008] As a method free from these problems and restrictions, (a) using the usage information such as the number of references to each document when the user performs a search and the actual evaluation from the user who made the search; A method of calculating an overall “popularity” and performing the calculation based on the calculated “popularity” is used.

【０００９】[0009]

【発明が解決しようとする課題】しかし、前記の従来技
術には、以下の問題がある。However, the above prior art has the following problems.

【００１０】・一般に、得られる利用情報の数は各文書
ごとに偏りが出る。また、新規文書が追加された場合、
他の文書との利用情報数の差が大きい。[0010] Generally, the number of pieces of usage information obtained is biased for each document. Also, when a new document is added,
The difference in the number of usage information from other documents is large.

【００１１】したがって、文書の人気度をその文書に対
する利用情報のみで決定すると精度が低くなる。Therefore, if the popularity of a document is determined based only on the usage information for the document, the accuracy will be reduced.

【００１２】本発明の目的は、従来の文書の利用情報か
らの人気度に加え、将来の人気度の期待値を考慮するこ
とにより、情報の偏りの影響を受けにくく、情報の少な
い文書に対しても精度の高い順位付けができる文書の人
気度順位付けの方法、装置、プログラム、および記録媒
体を提供することにある。[0012] An object of the present invention is to consider the expected value of the future popularity in addition to the popularity based on the usage information of the conventional document, so that the document is less susceptible to the bias of the information, and the document having less information is used. Another object of the present invention is to provide a method, an apparatus, a program, and a recording medium for ranking popularity of a document, which can rank the documents with high accuracy.

【００１３】[0013]

【課題を解決するための手段】本発明では、上記の課題
を解決するため、人気度（文書スコア）を、従来の文書
に対する利用情報から算出したスコア（実績スコア）に
加え、その文書が将来的に取り得るスコアを予測した期
待値（期待スコア）を以下に説明する方法で算出し、算
出した文書スコアに基づいて順位付けする。According to the present invention, in order to solve the above-mentioned problem, a popularity (document score) is added to a score (actual score) calculated from usage information for a conventional document, and the document is evaluated in the future. An expected value (expected score) that predicts a score that can be obtained in a predictable manner is calculated by the method described below, and ranking is performed based on the calculated document score.

【００１４】・文書スコアの算出法文書集合に対する利用情報の分布は、その文書集合の持
つ特性を表していると考えられるため、それぞれの文書
はその特性に沿った利用情報が将来与えられていく可能
性が高いと考えられる。すなわち、将来同じ利用情報を
持つ可能性のある他の文書の実績スコアを利用すること
で、精度の高い期待スコアを算出できる。Method of calculating document score Since the distribution of usage information for a document set is considered to represent the characteristics of the document set, usage information according to the characteristics is given to each document in the future. It is considered likely. That is, a highly accurate expected score can be calculated by using the actual score of another document that may have the same use information in the future.

【００１５】また、利用情報の多い文書に対しては、す
でに文書スコアは定まっていると考えられるため、期待
スコアよりも実績スコアを重視すべきであり、逆に利用
情報の少ない文書に対しては、実績スコアよりも期待ス
コアを重視すべきであると考えられる。It is considered that the document score has already been determined for a document having a large amount of use information. Therefore, the actual score should be more important than the expected score. It is considered that the expected score should be more important than the actual score.

【００１６】以上の考えを基に、本発明は、順位付けの
対象となる文書集合に対して、それぞれの文書に対する
利用情報から任意の方法で各文書の実績スコアを算出
し、各文書について、その文書に対する利用情報を包含
するような利用情報を持つ他の文書の部分集合を取り出
し、取り出した部分集合の平均実績スコアを期待スコア
とし、部分集合の大きさにより、実績スコアと期待スコ
アをそれぞれ重み付けした組み合わせで文書スコアを算
出するもので、以下の構成を特徴とする。Based on the above idea, according to the present invention, for a set of documents to be ranked, a performance score of each document is calculated from the usage information for each document by an arbitrary method. A subset of other documents that have usage information that includes the usage information for the document is extracted, and the average performance score of the extracted subset is used as the expected score. A document score is calculated by a weighted combination, and is characterized by the following configuration.

【００１７】（方法の発明）（１）コンピュータネットワーク上の文書検索に際し
て、文書の人気度を順位付けした検索結果リストを提示
する文書の人気度順位付け方法であって、各文書の利用
者の利用情報を利用情報データベースに格納しておき、
前記データベースに格納される順位付けの対象となる文
書集合に対して、それぞれの文書に対する利用情報から
任意の方法で各文書の実績スコアを算出し、各文書につ
いて、その文書に対する利用情報を包含するような利用
情報を持つ他の文書の部分集合の平均実績スコアを期待
スコアとし、前記部分集合の大きさにより前記実績スコ
アと期待スコアをそれぞれの重要度で重み付けした組み
合わせで文書スコアを算出して文書スコアデータベース
に格納し、利用者から検索要求を受けた文書集合に対し
て前記文書スコアデータベース中にある各文書の文書ス
コアを使って順位付けを行って利用者に送信することを
特徴とする。(Invention of Method) (1) A method of ranking the popularity of documents that presents a search result list in which the degrees of popularity of documents are ranked when searching for documents on a computer network. Store usage information in the usage information database,
For a set of documents to be ranked and stored in the database, a performance score of each document is calculated by an arbitrary method from usage information for each document, and each document includes usage information for the document. The average result score of a subset of other documents having such usage information is set as an expected score, and the document score is calculated by a combination of the size of the subset and the result score and the expected score weighted by their importance. The method is characterized in that the document set is stored in a document score database, the document set received a search request from the user is ranked using the document score of each document in the document score database, and transmitted to the user. .

【００１８】（２）前記文書スコアの算出は、前記利用
情報データベースから取り出した利用情報を使って任意
の手法で各文書の実績スコアを算出し、前記利用情報お
よび前記実績スコアを使って各文書の期待スコアおよび
重要度を算出し、前記実績スコアと期待スコアおよび重
要度から各文書の文書スコアを算出することを特徴とす
る。(2) In the calculation of the document score, the actual score of each document is calculated by an arbitrary method using the usage information extracted from the usage information database, and each document is calculated using the usage information and the actual score. Is calculated, and the document score of each document is calculated from the performance score, the expected score, and the importance.

【００１９】（３）前記期待スコアおよび重要度の算出
は、前記期待スコアの算出がなされていない任意の文書
を選択し、前記選択した文書が将来持つことができる利
用情報を持っている他の文書の集合を導出し、前記集合
に含まれる文書の平均実績スコアを算出して期待スコア
とし、前記集合の大きさから期待スコアの重要度を算出
することを特徴とする。(3) The calculation of the expected score and the importance is performed by selecting an arbitrary document for which the expected score has not been calculated, and selecting another document having usage information that the selected document can have in the future. A set of documents is derived, an average performance score of the documents included in the set is calculated as an expected score, and importance of the expected score is calculated from the size of the set.

【００２０】（システムの発明）（４）コンピュータネットワーク上の文書検索に際し
て、文書の人気度を順位付けした検索結果リストを提示
する文書の人気度順位付けシステムであって、各文書の
利用者の利用情報を格納しておく利用情報データベース
と、前記データベースに格納される順位付けの対象とな
る文書集合に対して、それぞれの文書に対する利用情報
から任意の方法で各文書の実績スコアを算出し、各文書
について、その文書に対する利用情報を包含するような
利用情報を持つ他の文書の部分集合の平均実績スコアを
期待スコアとし、前記部分集合の大きさにより前記実績
スコアと期待スコアをそれぞれの重要度で重み付けした
組み合わせで文書スコアを算出して文書スコアデータベ
ースに格納する文書ランキングシステムと、利用者から
検索要求を受けた文書集合に対して前記文書スコアデー
タベース中にある各文書の文書スコアを使って順位付け
を行って利用者に送信する文書検索部とを備えたことを
特徴とする。(System Invention) (4) A document popularity ranking system which presents a search result list in which document popularity is ranked when a document is searched on a computer network. A usage information database that stores usage information, and for a set of documents to be ranked stored in the database, an actual score of each document is calculated by an arbitrary method from usage information for each document, For each document, the expected score is the average performance score of a subset of other documents that have usage information that includes usage information for that document. A document ranking system that calculates a document score with a combination weighted by degrees and stores it in the document score database A document search unit that ranks a set of documents requested to be searched by a user using the document score of each document in the document score database and transmits the document set to the user. I do.

【００２１】（５）前記文書ランキングシステムは、前
記利用情報データベースから取り出した利用情報を使っ
て任意の手法で各文書の実績スコアを算出し、前記文書
スコア算出部へ返却する実績スコア算出部と、前記利用
情報および前記実績スコアを使って各文書の期待スコア
および重要度を算出して前記文書スコア算出部へ返却す
る期待スコア算出部と、前記実績スコアと期待スコアお
よび重要度から各文書の文書スコアを算出し、この算出
した文書スコアを前記文書スコアデータベースへ格納す
る文書スコア算出部とを備えたことを特徴とする。(5) The document ranking system calculates a performance score of each document by an arbitrary method using the usage information extracted from the usage information database, and returns a performance score calculation unit to the document score calculation unit. An expected score calculation unit that calculates an expected score and importance of each document using the use information and the performance score and returns the calculated score and importance to the document score calculation unit; A document score calculation unit that calculates a document score and stores the calculated document score in the document score database.

【００２２】（６）前記文書ランキングシステムは、前
記期待スコアの算出がなされていない任意の文書を選択
する手段と、前記選択した文書が将来持つことができる
利用情報を持っている他の文書の集合を導出する手段
と、前記集合に含まれる文書の平均実績スコアを算出し
て期待スコアとする手段と、前記集合の大きさから期待
スコアの重要度を算出する手段とを備えたことを特徴と
する。(6) The document ranking system includes a means for selecting an arbitrary document for which the expected score has not been calculated, and a method for selecting another document having use information that the selected document can have in the future. Means for deriving a set, means for calculating an average performance score of documents included in the set to obtain an expected score, and means for calculating importance of the expected score from the size of the set. And

【００２３】（プログラムの発明）（７）前記文書の人気度順位付け方法をコンピュータに
実行させるための処理手順を備えたことを特徴とする。(Invention of Program) (7) The program is characterized in that it has a processing procedure for causing a computer to execute the document popularity ranking method.

【００２４】（記録媒体の発明）（８）前記の文書の人気度順位付け方法における処理手
順をコンピュータに実行させるためのプログラムを、該
コンピュータが読み取り可能な記録媒体に記録したこと
を特徴とする。(Invention of Recording Medium) (8) A program for causing a computer to execute the processing procedure in the above-mentioned document popularity ranking method is recorded on a computer-readable recording medium. .

【００２５】[0025]

【発明の実施の形態】図１は、本発明の実施形態を示す
システム構成図である。同図において、本システムは、
文書検索部１０と、利用情報ＤＢ（データベース）２１
と、文書スコアＤＢ２２と、文書ランキングシステム３
０を持つ。FIG. 1 is a system configuration diagram showing an embodiment of the present invention. In this figure, the system is:
Document search unit 10 and usage information DB (database) 21
, Document score DB 22, and document ranking system 3
Has 0.

【００２６】文書ランキングシステム３０は、文書スコ
ア算出部３１と、実績スコア算出部３２と、期待スコア
算出部３３を持つ。The document ranking system 30 has a document score calculation unit 31, a performance score calculation unit 32, and an expected score calculation unit 33.

【００２７】文書検索部１０は、利用者から検索要求を
受け、その要求にあった文書集合を何からの手法により
取出し、その文書集合に対して文書スコアＤＢ２２中に
ある各文書の文書スコアを使って順位付けを行い、利用
者に送信する。また、検索の後、利用者からの文書への
参照や評価などの利用情報を取得し、利用情報ＤＢ２１
へ蓄積する。The document search unit 10 receives a search request from a user, extracts a set of documents that meet the request by any method, and calculates a document score of each document in the document score DB 22 for the set of documents. Use it to rank and send to users. Further, after the search, usage information such as reference to the document and evaluation from the user is acquired, and the usage information DB 21
To accumulate.

【００２８】利用情報ＤＢ２１は、各文書の利用者への
提示回数、利用者からの参照回数、評価回数、評価点な
どの利用情報を格納したデータベースである。文書スコ
アＤＢ２２は、各文書に対する文書スコアを格納したデ
ータベースである。The usage information DB 21 is a database storing usage information such as the number of presentations of each document to the user, the number of references from the user, the number of evaluations, and evaluation points. The document score DB 22 is a database that stores a document score for each document.

【００２９】文書ランキングシステム３０において、文
書スコア算出部３１は、利用情報ＤＢ２１に格納されて
いる、各文書の提示回数、参照回数、評価回数、評価点
など利用情報を取り出し、実績スコア算出部３２および
期待スコア算出部３３を使って各文書の実績スコアおよ
び期待スコアを取得し、それらから各文書の文書スコア
を算出し、文書スコアＤＢ２２に蓄積する。In the document ranking system 30, the document score calculation unit 31 extracts usage information such as the number of presentations, the number of references, the number of evaluations, and the evaluation points of each document stored in the usage information DB 21. Then, the actual score and the expected score of each document are obtained by using the expected score calculation unit 33, and the document score of each document is calculated from the obtained score and stored in the document score DB 22.

【００３０】実績スコア算出部３２は、文書スコア算出
部３１から呼び出され、各文書の提示回数、参照回数、
評価回数、評価点など利用情報から各文書の実績スコア
を算出し、文書スコア算出部３１へ返却する。The performance score calculation unit 32 is called from the document score calculation unit 31 and displays the number of presentations, the number of references,
The result score of each document is calculated from the usage information such as the number of evaluations and the evaluation points, and is returned to the document score calculation unit 31.

【００３１】期待スコア算出部３３は、文書スコア算出
部３１から呼び出され、各文書の提示回数、参照回数、
評価回数、評価点など利用情報および実績スコアから各
文書の期待スコアとその重要度を算出し、文書スコア算
出部３１へ返却する。The expected score calculation unit 33 is called from the document score calculation unit 31 and calculates the number of presentations, the number of references,
The expected score of each document and its importance are calculated from the usage information such as the number of evaluations and evaluation points and the performance score, and are returned to the document score calculation unit 31.

【００３２】図２は、文書ランキングシステム３０の動
作フローチャートを示す。同図において、ステップＳ４
１は、文書スコア算出部３１が、利用情報ＤＢ２１から
各文書の提示回数、参照回数、評価回数、評価点など利
用情報を取り出す。FIG. 2 is a flowchart showing the operation of the document ranking system 30. Referring to FIG.
In 1, the document score calculation unit 31 extracts usage information such as the number of presentations, the number of references, the number of evaluations, and the evaluation points of each document from the usage information DB 21.

【００３３】ステップＳ４２は、文書スコア算出部３１
が、取り出した利用情報を実績スコア算出部３２へ引き
渡し、実績スコア算出部３２が、それを使って任意の手
法で各文書の実績スコアを算出し、文書スコア算出部３
１へ返却する。In step S42, the document score calculation unit 31
Handed over the extracted usage information to the performance score calculation unit 32, and the performance score calculation unit 32 calculates the performance score of each document by using an arbitrary method using the obtained usage information.
Return to 1.

【００３４】ステップＳ４３は、文書スコア算出部３２
が、利用情報および実績スコアを期待スコア算出部へ引
き渡し、期待スコア算出部３３がそれらを使って各文書
の期待スコアおよび重要度を算出し、文書スコア算出部
３１へ返却する。期待スコアおよび重要度の算出法の一
例は、図３で説明する。In step S43, the document score calculation section 32
Handovers the usage information and the performance score to the expected score calculation unit, and the expected score calculation unit 33 calculates the expected score and the importance of each document by using them, and returns them to the document score calculation unit 31. An example of a method of calculating the expected score and the importance will be described with reference to FIG.

【００３５】ステップＳ４４は、文書スコア算出部３１
が、実績スコアと期待スコアおよび重要度から各文書の
文書スコアを算出する。例えば、重要度が０〜１の値を
とる場合、（１一重要度）×実績スコア＋重要度×期待スコアとする。In step S44, the document score calculation unit 31
Calculates the document score of each document from the actual score, the expected score, and the importance. For example, when the importance takes a value of 0 to 1, (1—one importance) × actual score + importance × expected score.

【００３６】ステップＳ４５は、文書スコア算出部３１
が、算出した各文書の文書スコアを文書スコアＤＢ２２
へ格納する。In step S45, the document score calculation unit 31
Stores the calculated document score of each document in the document score DB 22
To store.

【００３７】図３は、算出部３３が実行する期待スコア
およびその重要度の算出のフローチャートを示す。同図
において、ステップＳ５１は、期待スコアの算出がなさ
れていない任意の文書を選択する。FIG. 3 shows a flowchart of the calculation of the expected score and its importance performed by the calculation unit 33. In the figure, a step S51 selects an arbitrary document for which an expected score has not been calculated.

【００３８】ステップＳ５２は、選択した文書が将来持
つことができる利用情報を持っている他の文書の集合を
導出する。例えば、以下の条件を満たす文書の集合を導
出する。In step S52, a set of other documents having use information that the selected document can have in the future is derived. For example, a set of documents satisfying the following conditions is derived.

【００３９】・提示回数が大きい・参照回数が大きい・評価回数が大きい・「提示回数の差＞参照回数の差＞評価回数の差」ステップＳ５３は、ステップＳ５２で得た集合に含まれ
る文書の平均実績スコアを算出し、それを期待スコアと
する。The number of presentations is large. The number of references is large. The number of evaluations is large. “Difference in number of presentations> Difference in number of references> Difference in number of evaluations”. Calculate the average performance score and use it as the expected score.

【００４０】ステップＳ５４は、ステップＳ５２で得た
集合の大きさから期待スコアの重要度を算出する。例え
ば、ステップＳ５２で得た文書集合の文書数÷全文書数
とする。In step S54, the importance of the expected score is calculated from the size of the set obtained in step S52. For example, the number of documents in the document set obtained in step S52 ÷ the number of all documents.

【００４１】ステップＳ５５は分岐で、すべての文書に
対して期待スコアが算出し終われば終了し、そうでない
場合は、ステップＳ５１へ戻る。Step S55 is a branch. If the expected scores have been calculated for all the documents, the process ends. Otherwise, the process returns to step S51.

【００４２】なお、本発明は、図１に示した装置の一部
又は全部の処理機能をプログラムとして構成してコンピ
ュータを用いて実現すること、あるいは図２または図３
で示した処理手順をプログラムとして構成してコンピュ
ータに実行させることができる。また、コンピュータで
その各部の処理機能を実現するためのプログラム、ある
いはコンピュータにその処理手順を実行させるためのプ
ログラムを、そのコンピュータが読み取り可能な記録媒
体、例えば、ＦＤ（フロッピーディスク：登録商標）、
ＭＯ、ＲＯＭ、メモリカード、ＣＤ、ＤＶＤ、リムーバ
ブルディスクなどに記録して、保存したり、提供したり
することが可能であり、また、インターネットのような
通信ネットワークを介して配布したりすることが可能で
ある。According to the present invention, a part or all of the processing functions of the apparatus shown in FIG. 1 are implemented as a program and realized by using a computer, or FIG. 2 or FIG.
Can be configured as a program and executed by a computer. In addition, a program for realizing the processing function of each unit in the computer or a program for causing the computer to execute the processing procedure is stored in a recording medium readable by the computer, for example, FD (Floppy Disk: registered trademark),
It can be recorded on an MO, ROM, memory card, CD, DVD, removable disk, etc., stored or provided, and distributed via a communication network such as the Internet. It is possible.

【００４３】[0043]

【発明の効果】以上説明したように、本発明において
は、人気度（文書スコア）を、従来の文書に対する利用
情報から算出したスコア（実績スコア）に加え、その文
書が将来的に取りうるスコアを予測した期待値（期待ス
コア）を算出し、算出した文書スコアに基づいて順位づ
けすることにより、情報の偏りの影響を受けにくく、情
報の少ない文書に対しても精度の高い順位付けを行うこ
とができるという効果を奏する。As described above, according to the present invention, the degree of popularity (document score) is added to the score (actual score) calculated from the usage information for the conventional document, and the score that the document can take in the future is obtained. Calculates the expected value (expected score) that predicts and ranks the documents based on the calculated document scores, so that the document is hardly affected by the bias of information, and a high-precision ranking is performed even for a document with little information. It has the effect of being able to do so.

[Brief description of the drawings]

【図１】本発明の実施形態を示すシステム構成を示す
図。FIG. 1 is a diagram showing a system configuration according to an embodiment of the present invention.

【図２】実施形態のフローチャート。FIG. 2 is a flowchart of the embodiment.

【図３】実施形態における期待スコアおよびその重要度
算出のフローチャート。FIG. 3 is a flowchart of calculating an expected score and its importance in the embodiment.

[Explanation of symbols]

１…文書検索部２１…利用情報データベース２２…文書スコアデータベース３０…文書ランキングシステム３１…文書スコア算出部３２…実績スコア算出部３３…期待スコア算出部 DESCRIPTION OF SYMBOLS 1 ... Document search part 21 ... Usage information database 22 ... Document score database 30 ... Document ranking system 31 ... Document score calculation part 32 ... Actual score calculation part 33 ... Expected score calculation part

Claims

[Claims]

1. A document popularity ranking method for presenting a search result list in which document popularity is ranked at the time of document search on a computer network, wherein a usage information of a user of each document is used in a usage information database. For a set of documents to be ranked which is stored in the database, an actual result score of each document is calculated from the usage information for each document by an arbitrary method. The average performance score of a subset of other documents having usage information that includes usage information for the expected score is defined as an expected score, and the performance score and the expected score are weighted according to their importance according to the size of the subset. A document score is calculated and stored in a document score database. Popularity ranking of documents, characterized by transmitting using a document score of each document that is in the score database performs ranking to a user.

2. The document score is calculated by using the usage information extracted from the usage information database to calculate a performance score of each document by an arbitrary method, and using the usage information and the performance score to calculate the document score of each document. The method according to claim 1, wherein an expected score and an importance are calculated, and a document score of each document is calculated from the performance score, the expected score, and the importance.

3. The calculation of the expected score and the degree of importance is performed by selecting an arbitrary document for which the expected score has not been calculated, and selecting another document having usage information that the selected document can have in the future. The average score of documents included in the set is calculated as an expected score, and the importance of the expected score is calculated from the size of the set. How documents are ranked by popularity.

4. A document popularity ranking system that presents a search result list in which document popularity is ranked when a document is searched on a computer network, wherein the system stores user usage information of each document. A usage information database to be stored, and for a set of documents to be ranked stored in the database, a performance score of each document is calculated by an arbitrary method from usage information for each document, and for each document, The expected score is the average performance score of a subset of other documents having usage information that includes usage information for, and the combination of the performance score and the expected score weighted by their importance according to the size of the subset. A document ranking system that calculates the document score and stores it in the document score database, and receives search requests from users A document search unit that ranks the received document set by using a document score of each document in the document score database and transmits the document set to a user. system.

5. The document ranking system, wherein a performance score of each document is calculated by an arbitrary method using usage information extracted from the usage information database, and a performance score calculation unit is returned to the document score calculation unit. An expected score calculation unit that calculates an expected score and importance of each document using the usage information and the performance score and returns the calculated score and importance to the document score calculation unit; and a document of each document based on the performance score, the expected score, and the importance. The document popularity ranking system according to claim 4, further comprising: a document score calculation unit that calculates a score and stores the calculated document score in the document score database.

6. The document ranking system, comprising: means for selecting an arbitrary document for which the expected score has not been calculated; and a set of other documents having use information that the selected document can have in the future. Means for calculating the average performance score of the documents included in the set to obtain an expected score, and means for calculating the importance of the expected score from the size of the set. The document ranking system according to claim 4 or 5, wherein

7. A document popularity ranking method, comprising a processing procedure for causing a computer to execute the document popularity ranking method according to any one of claims 1 to 3. program.

8. A program for causing a computer to execute the processing procedure in the document popularity ranking method according to claim 1, wherein the program is recorded on a computer-readable recording medium. A recording medium that records a method of ranking popularity of documents characterized by the following.