JP2013122701A

JP2013122701A - Information retrieval device and information retrieval method

Info

Publication number: JP2013122701A
Application number: JP2011271191A
Authority: JP
Inventors: Kenichi Abe; 健一阿部
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2011-12-12
Filing date: 2011-12-12
Publication date: 2013-06-20

Abstract

PROBLEM TO BE SOLVED: To perform suitable recommendation according to an operation history of a user.SOLUTION: The information retrieval device includes profile generation means for generating profile information to be used for recommendation from document operation history information recorded with an operation history for each operation type of a user performing in relation to a plurality of registered documents, recommendation score calculation means for analyzing the profile information generated by the profile generation means and calculating each recommendation score of the plurality of documents on the basis of the analysis result, and recommendation result display means for displaying a recommendation result document calculated by the recommendation score calculation means on a display device, to solve a problem in which many unnecessary documents appear in the recommendation result and the suitable recommendation cannot be performed.

Description

本発明は情報検索装置及び情報検索方法に関し、特に、ユーザが文書を閲覧する際に、関連する文書を推薦するために用いて好適な技術に関する。 The present invention relates to an information search apparatus and an information search method, and more particularly, to a technique suitable for recommending a related document when a user views the document.

従来の検索システムとしては、ユーザが入力したキーワードに一致するコンテンツを検索するタイプが主流であった。これは、正しいキーワードを入力すれば高速に検索結果が得られる反面、ユーザが検索の度に新たにキーワードを考える必要がある。また、一般の非熟練ユーザにとっては正しいキーワードを構築することが困難であるという課題があった。 As a conventional search system, a type of searching for content that matches a keyword input by a user has been mainstream. This is because if a correct keyword is input, a search result can be obtained at high speed, but the user needs to consider a new keyword every time a search is performed. In addition, there is a problem that it is difficult for a general unskilled user to construct a correct keyword.

このような課題を解決するために、ユーザがコンテンツを利用した履歴からユーザの嗜好を推定し、そのユーザが好むと推定されたコンテンツを提示する情報推薦（レコメンド）を行う技術が提案されている。 In order to solve such a problem, a technique has been proposed in which a user's preference is estimated from a history of using the content, and information recommendation (recommendation) for presenting the content estimated to be preferred by the user is performed. .

特許文献１は、インターネット書籍販売サイトにてユーザが書籍を参照した際に、過去のユーザの購入履歴に基づき前記参照する書籍を購入した人が別途購入した書籍を提示する技術である。特許文献１では、大量の購入履歴から複数のユーザが過去に購入したコンテンツの履歴を統計的に学習し、個人を特定しないままパターン化した類似の嗜好を持つユーザにコンテンツを推薦する。特許文献１を始めとするインターネット上のＥＣ（electronic commerce）サイトでは、ユーザがそのコンテンツを好むという条件を推定するのに「購入」という１つの指標にフォーカスしている。 Patent Document 1 is a technique for presenting a book separately purchased by a person who has purchased the book to be referred to based on a past user purchase history when the user references the book on an Internet book sales site. In Patent Document 1, a history of contents purchased by a plurality of users in the past is statistically learned from a large amount of purchase histories, and contents are recommended to users having similar preferences that are patterned without specifying an individual. In an EC (electronic commerce) site on the Internet, such as Patent Document 1, the focus is on one index “purchase” in order to estimate a condition that a user likes the content.

一方、エンタープライズ環境における文書管理システムでは、ユーザは文書に対して様々な操作が可能であり、またそれぞれの操作は「購入」程強くユーザがコンテンツを好むことを示す指標ではない。したがって、オフィスの文書管理システムにおいては、単一の操作ではなく、複数の操作それぞれからユーザと文書、文書と文書の関連性を推定する必要がある。 On the other hand, in a document management system in an enterprise environment, a user can perform various operations on a document, and each operation is not an index indicating that the user likes content more strongly than “purchase”. Therefore, in an office document management system, it is necessary to estimate the relationship between a user and a document, and between a document and a document, not from a single operation but from each of a plurality of operations.

特許文献２は、このようなオフィス環境の関連文書を提示する技術である。特許文献２は、文書の作成から終了までのひとまとまりの文書集合へのアクセスを案件セッションとして管理し、該案件セッション毎の操作内容に基づいて文書同士の関連リンク付けを行い、ある文書が閲覧された際に関連リンクに基づいて関連文書を表示する。 Patent Document 2 is a technique for presenting a document related to such an office environment. Patent Document 2 manages access to a set of documents from creation to end of a document as a case session, links related documents based on the operation content of each case session, and a document is viewed When it is done, the related document is displayed based on the related link.

米国特許第６２６６６４９号明細書US Pat. No. 6,266,649 特開２０１１−２８４４７号公報JP 2011-28447 A

上記特許文献２にて提案されている関連文書表示システムにおいては、文書の作成開始情報をレコメンドに利用するが、その作成がどのような意図に基づいてなされたものかは考慮していない。また、文書から文書へのコピー＆ペースト操作により文書間の関連リンクを作成するが、単純に文書間の関連を付加するだけである。 The related document display system proposed in Patent Document 2 uses document creation start information as a recommendation, but does not consider what intention the creation was made based on. In addition, a link between documents is created by a copy-and-paste operation from document to document, but a relationship between documents is simply added.

例えば、オフィスの文書を作成する際には、様々な作成の目的の違いがある。このため、特許文献２のように、これら全ての作成操作を同様に扱うと、レコメンド結果に不要な文書が数多く出現してしまい、好適なレコメンドを行うことができない問題点があった。
本発明は前述の問題点に鑑み、ユーザの操作履歴に応じた好適なレコメンドを行うことができるようにすることを目的とする。 For example, when creating an office document, there are various differences in the purpose of creation. For this reason, as in Patent Document 2, if all these creation operations are handled in the same way, many unnecessary documents appear in the recommendation result, and there is a problem that a suitable recommendation cannot be performed.
The present invention has been made in view of the above-described problems, and it is an object of the present invention to make a suitable recommendation according to a user's operation history.

本発明の情報検索装置は、登録された複数の文書に対して行われたユーザの操作種別ごとの操作履歴が記録されている文書操作履歴情報からレコメンドに利用するプロファイル情報を生成するプロファイル生成手段と、前記プロファイル生成手段により生成されたプロファイル情報を分析し、分析結果に基づいて前記複数の文書のレコメンドスコアをそれぞれ計算するレコメンドスコア計算手段と、前記レコメンドスコア計算手段により計算されたレコメンド結果文書を表示装置に表示するレコメンド結果表示手段とを有することを特徴とする。 The information search apparatus according to the present invention is a profile generation unit that generates profile information used for recommendation from document operation history information in which operation history for each operation type of a user performed on a plurality of registered documents is recorded. A recommendation score calculation means for analyzing the profile information generated by the profile generation means and calculating a recommendation score for each of the plurality of documents based on the analysis result, and a recommendation result document calculated by the recommendation score calculation means And a recommendation result display means for displaying on a display device.

本発明によれば、不要なレコメンド文書を取り除くことが可能となり、文書検索及び文書レコメンドの精度を向上させることができる。 According to the present invention, unnecessary recommended documents can be removed, and the accuracy of document search and document recommendation can be improved.

第１の実施形態の情報検索装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the information search device of 1st Embodiment. 文書管理システムの主要な構成部とデータの流れを表すブロック図である。It is a block diagram showing the main components of a document management system and the flow of data. 第１の実施形態の操作ログの例を示す図である。It is a figure which shows the example of the operation log of 1st Embodiment. （機能１）を実現するためのユーザインタフェースの例を示す図である。It is a figure which shows the example of the user interface for implement | achieving (function 1). （機能２）を実現するためのユーザインタフェースの例を示す図である。It is a figure which shows the example of the user interface for implement | achieving (function 2). プロファイル生産部の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of a profile production part. 文書関連度テーブルの構成例を示す図である。It is a figure which shows the structural example of a document relevance degree table. 文書関連度テーブルの構成例を示す図である。It is a figure which shows the structural example of a document relevance degree table. コピー分類テーブルの構成例を示す図である。It is a figure which shows the structural example of a copy classification table. 第１の実施形態のコピー操作分類の例を示す図である。It is a figure which shows the example of the copy operation classification | category of 1st Embodiment. コピーログを詳細に分類する処理の流れを説明するフローチャートである。It is a flowchart explaining the flow of a process which classifies a copy log in detail. 第１の実施形態のコピー操作分類の例を示す図である。It is a figure which shows the example of the copy operation classification | category of 1st Embodiment. 文書関連度によるレコメンド処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the recommendation process by a document relevance degree. 同時利用による関連度計算の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process of relevance calculation by simultaneous use. 第１の実施形態の文書関連度適用戦略の一例を示す図である。It is a figure which shows an example of the document relevance degree application strategy of 1st Embodiment. 関連度スコア補正処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a relevance score correction process. 文書重要度計算部の処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a process of a document importance calculation part. 第１の実施形態の文書重要度計算の適用戦略の例を示す図である。It is a figure which shows the example of the application strategy of the document importance calculation of 1st Embodiment. 文書の重要度スコアを補正する処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the process which correct | amends the importance score of a document.

（第１の実施形態）
以下、図面を参照して本発明の好適な実施形態を詳細に説明する。
図１は、本実施形態の情報検索装置１００の概略構成を示すブロック図である。図１において、ＣＰＵ１０１は、ＲＯＭ１０２に格納されている制御プログラムを実行することにより本装置全体の制御を行う。 (First embodiment)
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the drawings.
FIG. 1 is a block diagram illustrating a schematic configuration of an information search apparatus 100 according to the present embodiment. In FIG. 1, the CPU 101 controls the entire apparatus by executing a control program stored in the ROM 102.

ＲＯＭ１０２は、ＣＰＵ１０１が実行するプログラムや各種パラメータを格納する。プログラムは、ＣＰＵ１０１で実行されることにより、後述するフローチャートに示す各処理を実行するための各種手段として、本実施形態の情報検索装置１００を機能させる。
ＲＡＭ１０３は、処理対象となる文書や各種データを記憶する。また、ＣＰＵ１０１によって実行するプログラムを展開するために用いられる。 The ROM 102 stores programs executed by the CPU 101 and various parameters. The program is executed by the CPU 101 to cause the information search apparatus 100 of the present embodiment to function as various means for executing each process shown in the flowchart described below.
The RAM 103 stores a document to be processed and various data. Further, it is used for developing a program to be executed by the CPU 101.

記憶装置１０４は、本実施形態の情報検索装置１００の処理対象となる文書データや操作ログ情報を格納する装置である。
ディスプレイ１０５は、本実施形態の処理を操作するユーザインタフェースを表示する。ディスプレイ１０５に表示されたユーザインタフェースを操作して文書レコメンド命令が行われると、ＲＡＭ１０３に展開された処理プログラムによって、レコメンド文書の計算処理が施される。処理結果はハードディスクなどの記憶装置１０４に再び記憶され、ディスプレイ１０５に表示されたユーザインタフェース上で確認することができる。
なお、本実施形態では後述するフローチャートの各ステップに対応する処理を、コンピュータ（ＣＰＵ１０１）を用いてソフトウェアで実現されるが、その処理の一部又は全部を電子回路などのハードウェアで実現するようにしてもよい。 The storage device 104 is a device that stores document data and operation log information to be processed by the information search device 100 of the present embodiment.
The display 105 displays a user interface for operating the processing of this embodiment. When a document recommendation command is issued by operating the user interface displayed on the display 105, a recommended document calculation process is performed by the processing program expanded in the RAM 103. The processing result is stored again in the storage device 104 such as a hard disk and can be confirmed on the user interface displayed on the display 105.
In this embodiment, processing corresponding to each step of the flowchart described later is realized by software using a computer (CPU 101), but part or all of the processing is realized by hardware such as an electronic circuit. It may be.

図２は、本実施形態の情報検索装置１００のＣＰＵ１０１がＲＯＭ１０２に格納されているプログラムを実行することにより構成される文書管理システム２００の主要な構成部とデータの流れを表すブロック図である。
文書管理システム２００は、登録された大量の文書に対する操作種別を検出し、操作ログ情報として記憶装置１０４に記録する。検出される操作種別は、例えば文書登録、文書オープン、文書クローズ、閲覧、編集、コピー、移動などがある。 FIG. 2 is a block diagram showing the main components of the document management system 200 configured by the CPU 101 of the information search apparatus 100 according to the present embodiment executing a program stored in the ROM 102 and the flow of data.
The document management system 200 detects an operation type for a large number of registered documents and records it in the storage device 104 as operation log information. The detected operation types include, for example, document registration, document open, document close, browsing, editing, copying, and moving.

２０１は、情報検索装置１００により検出された文書操作履歴情報を格納する操作ログである。操作ログ２０１には、前述した操作種別に加え、操作日時、操作ユーザ、操作対象文書などの情報が格納される。操作ログ２０１の詳細については後述する。 An operation log 201 stores document operation history information detected by the information search apparatus 100. The operation log 201 stores information such as the operation date and time, the operation user, and the operation target document in addition to the operation type described above. Details of the operation log 201 will be described later.

２０２は、操作ログ２０１を分析して、レコメンド（情報推薦）に利用するユーザや文書のプロファイル情報２０３を生成するプロファイル生成部である。プロファイル情報２０３には、各文書の閲覧、印刷回数、各ユーザによる各文書の操作回数等、ログから抽出されたユーザ及び文書の分析情報が格納される。プロファイル情報２０３の詳細については後述する。 A profile generation unit 202 analyzes the operation log 201 and generates profile information 203 of a user or a document used for recommendation (information recommendation). The profile information 203 stores user and document analysis information extracted from the log, such as browsing of each document, the number of times of printing, and the number of times each user operates each document. Details of the profile information 203 will be described later.

２０４は、プロファイル情報２０３の分析結果に基づき各文書のレコメンドスコアを計算するレコメンドスコア計算部であり、文書重要度計算部２０５及び文書関連度計算部２０６を有する。本実施形態のレコメンドスコア計算部２０４で説明するレコメンドスコアには２種類ある。１つは各文書の重要度を示すスコアであり、文書重要度計算部２０５により計算される。もう１つは文書同士の関連度を示すスコアであり、文書関連度計算部２０６により計算される。２０７は、レコメンドスコア計算部２０４により付与されたレコメンドスコアに基づいてレコメンド結果文書を表示するレコメンド結果表示部である。 A recommendation score calculation unit 204 calculates a recommendation score for each document based on the analysis result of the profile information 203, and includes a document importance calculation unit 205 and a document relevance calculation unit 206. There are two types of recommendation scores described in the recommendation score calculation unit 204 of the present embodiment. One is a score indicating the importance of each document, and is calculated by the document importance calculator 205. The other is a score indicating the degree of association between documents, and is calculated by the document association degree calculation unit 206. Reference numeral 207 denotes a recommendation result display unit that displays a recommendation result document based on the recommendation score given by the recommendation score calculation unit 204.

[操作ログ]
図３に、図２の操作ログ２０１の詳細を示す。
図３(ａ)は、操作ログ２０１に格納される情報の構造の例を示す図である。操作ログ２０１には、１つの操作につき少なくとも３０１から３０５の５つの情報が格納される。
第１の情報３０１は、操作が行われた時刻である。ここでは、ＯＳ（オペレーティングシステム）が保持するローカルタイムを利用する。 [Operation log]
FIG. 3 shows details of the operation log 201 of FIG.
FIG. 3A is a diagram illustrating an example of the structure of information stored in the operation log 201. The operation log 201 stores at least five pieces of information from 301 to 305 for each operation.
The first information 301 is the time when the operation was performed. Here, the local time held by the OS (operating system) is used.

第２の情報３０２は、操作ユーザ名または操作ユーザ識別子である。ユーザは文書を操作する前に予め情報検索装置１００へのログイン操作を行い、ユーザ情報を通知しておく。
第３の情報３０３は、操作対象文書名或いは識別子である。
第４の情報３０４は、操作種別である。操作種別には、情報検索装置１００のユーザインタフェース上で検出された文書への操作の種類を格納する。ここでは、「チェックイン」、「チェックアウト」、「ログイン」、「ログアウト」、「文書登録」、「文書オープン」、「文書クローズ」、「文書編集」、「文書コピー」、「文書移動」、「文書印刷」を格納する。
第５の情報３０５は、「文書コピー」及び「文書移動」操作を行った際の、コピー先及び移動先ファイル名（新ファイル名）を格納する。 The second information 302 is an operation user name or an operation user identifier. Before operating the document, the user performs a login operation to the information search apparatus 100 and notifies the user information in advance.
The third information 303 is an operation target document name or identifier.
The fourth information 304 is an operation type. The operation type stores the type of operation on the document detected on the user interface of the information search apparatus 100. Here, “check in”, “check out”, “login”, “logout”, “document registration”, “document open”, “document close”, “document edit”, “document copy”, “document move” , “Document printing” is stored.
The fifth information 305 stores a copy destination and a destination file name (new file name) when the “document copy” and “document move” operations are performed.

図３(ｂ)は、図３(ａ)に格納されているログから復元されるユーザ操作の例を示す図である。
図３(ａ)のログから、User Xが先週の週報「週報＿20110302.doc」をコピーして、今週の日付を含む「週報＿20110309.doc」にリネームする。さらに「週報＿20110309.doc」に対しUser AとUser Bが更新した、という一連の操作が復元される。 FIG. 3B is a diagram illustrating an example of a user operation restored from the log stored in FIG.
From the log in FIG. 3A, User X copies the last week's weekly report “Weekly_20110302.doc” and renames it to “Weekly_20110309.doc” including the date of this week. Furthermore, a series of operations in which User A and User B have updated “Weekly_20110309.doc” are restored.

[レコメンド結果表示]
図４及び図５は、図２の文書管理システム２００のレコメンド結果表示部２０７に表示される具体例を示す図である。本実施形態のレコメンド機能は次の２つを備える。
（イ）ユーザが文書管理システム上で、ある文書を選択したとき、選択された文書に関連する文書がレコメンドされるようにする機能（機能１）。
（ロ）ユーザが文書管理システム上で、あるフォルダを選択したとき、選択したフォルダ及びそのサブフォルダ内に存在する重要文書がレコメンドされるようにする機能（機能２）。 [Recommendation result display]
4 and 5 are diagrams showing specific examples displayed on the recommendation result display unit 207 of the document management system 200 of FIG. The recommendation function of this embodiment has the following two.
(A) A function (function 1) that recommends a document related to the selected document when the user selects a document on the document management system.
(B) A function (function 2) that, when the user selects a folder on the document management system, important documents existing in the selected folder and its subfolders are recommended.

図４が（機能１）の具体例を示し、図５が（機能２）の具体例を示す。
図４の４００は、（機能１）を実現するための文書管理システムのユーザインタフェースである。
図４において４０１は、フォルダビューであり、文書管理システム２００に登録されている全てのフォルダをツリー形式で表示している。
４０２は、ファイルビューである。ファイルビュー４０２には、文書管理システム２００に登録されている文書で、かつフォルダビュー４０１において選択されているフォルダの直下に存在するファイルが表示される。 4 shows a specific example of (Function 1), and FIG. 5 shows a specific example of (Function 2).
Reference numeral 400 in FIG. 4 denotes a user interface of the document management system for realizing (Function 1).
In FIG. 4, reference numeral 401 denotes a folder view, which displays all folders registered in the document management system 200 in a tree format.
Reference numeral 402 denotes a file view. The file view 402 displays a document registered in the document management system 200 and a file that exists immediately under the folder selected in the folder view 401.

４０３及び４０４は、レコメンド結果表示部２０７に表示される具体例を示している。４０３は、ファイルビュー４０２で選択された文書の関連文書がレコメンドされるレコメンド結果文書表示部である。図４の例では、ファイルビュー４０２において文書Ａが選択された直後に、文書Ａの関連文書である文書Ｘ及び文書Ｙがレコメンドされている。 Reference numerals 403 and 404 denote specific examples displayed on the recommendation result display unit 207. A recommendation result document display unit 403 recommends a related document of the document selected in the file view 402. In the example of FIG. 4, immediately after the document A is selected in the file view 402, the document X and the document Y that are related documents of the document A are recommended.

４０４は、レコメンドスコア表示部である。レコメンドスコアは後述する文書間関連度計算処理により定められ、レコメンド結果文書はこのスコアによってソートされて提示される。図４の例では、文書Ａと文書Ｘの関連度が９５、文書Ａと文書Ｙの関連度が６０であり、文書Ａに対する他の文書の関連度は６０未満であることを示している。 Reference numeral 404 denotes a recommendation score display unit. The recommendation score is determined by inter-document relevance calculation processing described later, and the recommendation result document is sorted and presented by this score. In the example of FIG. 4, the degree of association between the document A and the document X is 95, the degree of association between the document A and the document Y is 60, and the degree of association of the other documents with respect to the document A is less than 60.

図５の５００は、（機能２）を実現するための文書管理システムのユーザインタフェースである。
５０１は、フォルダビューであり、文書管理システム２００に登録されている全てのフォルダをツリー形式で表示している。
５０２は、ファイルビューである。５０２には、文書管理システム２００に登録されている文書で、かつフォルダビュー５０１において選択されているフォルダの直下に存在するファイルが表示される。この例では、フォルダＴの直下に文書Ａ、文書Ｂ、文書Ｃ、文書Ｄが存在する。 5 is a user interface of the document management system for realizing (Function 2).
A folder view 501 displays all folders registered in the document management system 200 in a tree format.
Reference numeral 502 denotes a file view. In 502, a file registered in the document management system 200 and a file existing immediately below the folder selected in the folder view 501 is displayed. In this example, a document A, a document B, a document C, and a document D exist immediately below the folder T.

５０３及び５０４は、レコメンド結果表示部２０７に表示される具体例を示す図である。５０３は、フォルダビュー５０１で選択されたフォルダ及びそのサブフォルダ以下に存在する全てのファイルから後述する重要度が高い文書がレコメンドされるレコメンド結果文書表示部である。この例では、フォルダビュー５０１においてフォルダＴが選択された直後にフォルダＴ以下に存在する重要文書である文書Ｋ及び文書Ｌがレコメンドされている。
５０４は、レコメンドスコア表示部である。レコメンドスコアは後述する文書間重要度計算処理により定められ、レコメンド結果文書はこのスコアによってソートされて提示される。図５の例では、文書Ｋの重要度が８８、文書Ｌの重要度が５０であり、フォルダＴ以下に存在する他の文書の重要度は５０未満であることを示す。 Reference numerals 503 and 504 denote specific examples displayed on the recommendation result display unit 207. Reference numeral 503 denotes a recommendation result document display unit that recommends a document having a high importance, which will be described later, from all files existing under the folder selected in the folder view 501 and its subfolders. In this example, immediately after the folder T is selected in the folder view 501, the document K and the document L, which are important documents existing under the folder T, are recommended.
Reference numeral 504 denotes a recommendation score display unit. The recommendation score is determined by an inter-document importance calculation process described later, and the recommendation result document is sorted and presented by this score. In the example of FIG. 5, the importance level of the document K is 88, the importance level of the document L is 50, and the importance levels of other documents existing in the folder T and below are less than 50.

[プロファイル生成]
図６は、図２のプロファイル生成部２０２により行なわれる処理の流れを示すフローチャートである。プロファイル情報２０３は、文書関連度情報を格納する文書関連度テーブル、文書重要度情報を格納する文書重要度テーブル、コピー操作の分類結果によってレコメンドスコアを補正するためのコピー分類テーブルを含む。プロファイル生成部２０２では、これら３つのテーブルに格納される情報を更新する。図６のフローチャートの各ステップに対応する処理は、ＲＯＭ１０２に記録されたプログラムを、ワークメモリとして機能するＲＡＭ１０３に展開してＣＰＵ１０１が実行することで実現する。 [Create Profile]
FIG. 6 is a flowchart showing a flow of processing performed by the profile generation unit 202 of FIG. The profile information 203 includes a document relevance table for storing document relevance information, a document importance table for storing document importance information, and a copy classification table for correcting a recommendation score according to a copy operation classification result. The profile generation unit 202 updates the information stored in these three tables. The processing corresponding to each step in the flowchart of FIG. 6 is realized by the CPU 101 developing the program recorded in the ROM 102 on the RAM 103 functioning as a work memory and executing it.

Ｓ６０１は、操作ログ２０１から先頭のログレコードを取得する。
Ｓ６０２は、Ｓ６０１で取得したログレコードに対応する文書関連度テーブルの情報を更新する。文書関連度テーブルの詳細については後述する。
Ｓ６０３は、Ｓ６０１で取得したログレコードに対応する文書重要度テーブルの情報を更新する。文書重要度テーブルの詳細については後述する。 In step S <b> 601, the first log record is acquired from the operation log 201.
In step S602, the information in the document relevance table corresponding to the log record acquired in step S601 is updated. Details of the document relevance table will be described later.
In step S603, the information in the document importance table corresponding to the log record acquired in step S601 is updated. Details of the document importance table will be described later.

Ｓ６０４は、Ｓ６０１で取得したログレコードに対応するコピー分類テーブルの情報を更新する。コピー分類テーブルの詳細については後述する。
Ｓ６０５は、操作ログ２０１の全てのログレコードを調べたかどうか判定する。まだ、調べていないレコードが存在すれば、Ｓ６０６へ進み、次のログレコードを取得し、その後、Ｓ６０２に進む。全てのログレコードを調べ終わっていれば終了する。操作ログ２０１のレコードは日々のユーザ操作に応じて刻々と増えていく。最新の操作ログ情報をレコメンド結果に反映させるためには、このプロファイル生成処理は数時間に１回、１日に１回など定期的に実行することが望ましい。 In step S604, the copy classification table information corresponding to the log record acquired in step S601 is updated. Details of the copy classification table will be described later.
In step S605, it is determined whether all log records of the operation log 201 have been checked. If there is a record that has not been checked yet, the process proceeds to S606, the next log record is acquired, and then the process proceeds to S602. Exit if all log records have been examined. The records of the operation log 201 increase every moment according to daily user operations. In order to reflect the latest operation log information in the recommendation result, it is desirable to periodically execute this profile generation process once every several hours or once a day.

図７は、図６のＳ６０２で更新する文書関連度テーブルの構成例を示す図である。
図７の７００は、文書関連度テーブルの一例を示す。図７に示すように、文書関連度テーブル７００の行７０１及び列７０２は、共に文書管理システム２００に登録されている全ての文書が格納され、全ての文書間に関連度が付与される。行７０１と列７０２の文書の交点の数字が文書間の関連度を示す。 FIG. 7 is a diagram showing a configuration example of the document relevance table updated in S602 of FIG.
700 in FIG. 7 shows an example of a document relevance table. As shown in FIG. 7, all the documents registered in the document management system 200 are stored in the row 701 and the column 702 of the document relevance degree table 700, and relevance is given to all the documents. The numbers at the intersections of the documents in row 701 and column 702 indicate the degree of association between documents.

本実施形態では、「同時に利用された文書同士」を関連する文書であるとする。同時に利用された文書同士とは、ユーザが同時に閲覧、印刷した文書を指す。例えば、操作ログ２０１から時系列順に、文書１オープン、文書２オープン、文書１クローズ、文書２クローズというログが得られたとすると、文書１と文書２は同時にオープンされていた期間がある。このような場合に文書１と文書２は同時に利用されたと判断し、文書関連度テーブル７００の文書１と文書２の交点の数値に＋１を加える。このようにして、文書関連度テーブル７００の全ての文書間に同時に利用された回数が蓄積される。 In this embodiment, it is assumed that “documents used simultaneously” are related documents. Documents used at the same time refer to documents that a user browses and prints at the same time. For example, if logs of document 1 open, document 2 open, document 1 close, and document 2 close are obtained from the operation log 201 in chronological order, there is a period in which document 1 and document 2 are opened simultaneously. In such a case, it is determined that the documents 1 and 2 are used at the same time, and +1 is added to the numerical value of the intersection of the documents 1 and 2 in the document relevance table 700. In this way, the number of times of simultaneous use between all the documents in the document relevance table 700 is accumulated.

ここで、本実施形態では文書が同時にオープンされた場合を説明したが、同時に利用されたと判定する基準はこれに限らない。同時にチェックイン／チェックアウト、ログイン／ログアウトされた文書や、これらの操作中に印刷された文書なども同時に利用された文書として利用できる。また、文書関連度を付与する基準は同時に利用された文書に限らない。コピー元文書（複製元文書）／コピー先文書（複製先文書）、ハイパーリンクによりリンクされている文書、内容が類似する文書などによって適応的に文書関連度を付与することもできる。 Here, in the present embodiment, the case where documents are opened at the same time has been described. At the same time, documents that are checked in / out, logged in / out, and documents that are printed during these operations can also be used as documents that are used at the same time. Further, the standard for assigning the document relevance is not limited to documents used at the same time. Document relevance can be adaptively given by a copy source document (replication source document) / copy destination document (replication destination document), a document linked by a hyperlink, a document having similar contents, or the like.

図８の８００は、図６のＳ６０３で更新する文書重要度テーブルの構成例を示す図である。本実施形態では、複数のユーザから数多く閲覧（オープン）、印刷された文書を重要度が高い文書であるとする。
文書重要度テーブル８００の行８０１は、文書管理システム２００に登録されている全ての文書であり、列８０２は、オープン操作、印刷操作を示す。行８０１と列８０２の交点の数値はその文書が操作された回数を示す。図８では、文書１は１０回閲覧され、５回印刷されたことを示す。 800 in FIG. 8 is a diagram showing a configuration example of the document importance level table updated in S603 in FIG. In the present embodiment, it is assumed that a document that has been viewed (opened) and printed by a plurality of users is a highly important document.
A row 801 of the document importance level table 800 indicates all documents registered in the document management system 200, and a column 802 indicates an open operation and a print operation. The numerical value at the intersection of row 801 and column 802 indicates the number of times the document has been manipulated. In FIG. 8, document 1 is viewed 10 times and printed 5 times.

図９の９００は、図６のＳ６０４で更新するコピー分類テーブルの構成例を示す図である。
９０１は、過去にコピー元となった文書を操作ログ２０１から抽出し、格納するコピー元文書である。９０２は、コピー元文書９０１に対応するコピー先文書リストである。コピー先文書リスト９０２の構造は、｛コピー先文書、分類番号｝のペアレコードを格納する。１つの文書から複数のコピーが行われたログが検出された場合は、コピー先文書リスト９０２の要素が複数のレコードを持つ。コピー先文書リスト９０２の要素である分類とは、コピー操作のログレコードからユーザがどのような意図により文書作成を行ったかを推定して分類したものである。分類の詳細を図１０に示す。 900 in FIG. 9 is a diagram illustrating a configuration example of the copy classification table updated in S604 in FIG.
Reference numeral 901 denotes a copy source document in which a copy source document in the past is extracted from the operation log 201 and stored. Reference numeral 902 denotes a copy destination document list corresponding to the copy source document 901. The structure of the copy destination document list 902 stores a pair record of {copy destination document, classification number}. When a log in which a plurality of copies are made from one document is detected, the element of the copy destination document list 902 has a plurality of records. The classification that is an element of the copy destination document list 902 is a classification based on an estimation of the purpose of the document creation by the user from the log record of the copy operation. Details of the classification are shown in FIG.

図１０は、図９におけるコピー先文書リスト９０２の分類番号を説明する図である。
本実施形態では、操作ログ２０１から抽出するコピー操作のログからユーザがどのような目的で文書作成を行ったのかを推定し、推定結果に応じて文書を分類する。そして、後述する文書重要度計算処理及び文書関連度計算処理において、該分類に応じた好適なレコメンドスコアを付与する。
分類の内容は、「差分更新」、「バックアップ」、「テンプレート」、「文書分割（１）」、「文書分割（２）」、「部分抽出」の６つのパターンである。 FIG. 10 is a diagram for explaining the classification numbers of the copy destination document list 902 in FIG.
In the present embodiment, the purpose of the user creating the document is estimated from the copy operation log extracted from the operation log 201, and the document is classified according to the estimation result. In a document importance level calculation process and a document relevance level calculation process, which will be described later, a suitable recommendation score corresponding to the classification is given.
The contents of the classification are six patterns of “difference update”, “backup”, “template”, “document division (1)”, “document division (2)”, and “partial extraction”.

「差分更新」は、例えば、過去の日付が入った古いバージョン（古い版）の文書をコピー（複製）してファイル名を現在の日付に直し、現在の日付のファイルを更新しているようなケースである。
「バックアップ」は、例えば、あるファイルをコピーして「＿old」などを含んだファイル名にリネームして保存しておくようなケースである。 “Differential update” means that, for example, an old version (old version) document containing a past date is copied (duplicated), the file name is changed to the current date, and the file on the current date is updated. It is a case.
“Backup” is a case where, for example, a certain file is copied and renamed to a file name including “_old” and stored.

「テンプレート」は、コピー元文書のデザインなどをそのまま用いてコピー先文書を更新するようなケースである。
「文書分割（１）」は、１つの文書をコピーして２つの文書に分割し、２つの文書でコピー元文書の内容を表すケースである。
「文書分割（２）」は、１つの文書から複数のコピーを作成し、コピー元文書は更新せずにコピー先の複数の文書でコピー元文書の内容を表すケースである。
「部分抽出」は、コピー元文書のうち一部を残してコピー先文書を更新するようなケースである。 The “template” is a case where the copy destination document is updated using the design of the copy source document as it is.
“Document division (1)” is a case where one document is copied and divided into two documents, and the contents of the copy source document are represented by two documents.
“Document division (2)” is a case where a plurality of copies are created from one document, and the contents of the copy source document are represented by a plurality of copy destination documents without updating the copy source document.
“Partial extraction” is a case where the copy destination document is updated while leaving a part of the copy source document.

操作ログ２０１から抽出されたコピー操作のログパターンから、これら６つのパターンに分類する。それぞれの分類の基準は、図１０の「ログからの判定法」の列に記載した方法を用いる。本判定法の処理の流れを図１１に示す。なお、本実施形態では６つのパターンに分類したが、これに限定されるものではない。後述するように、コピー分類によるスコア補正の戦略に応じて変更してもよい。 The log patterns of the copy operation extracted from the operation log 201 are classified into these six patterns. As the classification criteria, the method described in the column “Determination method from log” in FIG. 10 is used. The processing flow of this determination method is shown in FIG. In the present embodiment, the patterns are classified into six patterns, but the present invention is not limited to this. As will be described later, it may be changed according to the score correction strategy by copy classification.

図１１は、操作ログ２０１のコピーログを、図１０で示すように詳細に分類する処理の流れを説明するフローチャートである。図１１のフローチャートの各ステップに対応する処理は、ＲＯＭ１０２に記録されたプログラムを、ワークメモリとして機能するＲＡＭ１０３に展開してＣＰＵ１０１が実行することで実現する。
Ｓ１１０１では、操作ログ２０１から「操作種別」が「コピー」のログレコードを抽出し、取得する。以降のフローチャートにおいては、取得したコピー操作が図１０におけるどの分類に所属するのかを決定する関連度決定処理を行う。 FIG. 11 is a flowchart for explaining the flow of processing for classifying the copy log of the operation log 201 in detail as shown in FIG. The processing corresponding to each step in the flowchart of FIG. 11 is realized by the CPU 101 executing the program recorded in the ROM 102 on the RAM 103 functioning as a work memory.
In step S1101, a log record whose “operation type” is “copy” is extracted from the operation log 201 and acquired. In the subsequent flowcharts, relevance determination processing is performed to determine to which classification in FIG. 10 the acquired copy operation belongs.

Ｓ１１０２では、取得したコピーログの近傍のログにおいて、コピー先文書及びコピー元文書が両方とも更新されているかどうかを判定する。ここで、近傍のログとは、コピーログと更新ログの記録時間の差が、ある閾値よりも小さいものを用いる。例えば、３０分以内に両方更新されていた場合を抽出する。両方更新されていれば、Ｓ１１２０において、図１０における「文書分割（１）」のパターンであると判定し、処理を終了する。少なくともどちらかが更新されていなければ、Ｓ１１０３へ進む。 In step S1102, it is determined whether both the copy destination document and the copy source document have been updated in the log in the vicinity of the acquired copy log. Here, as the nearby log, a log whose difference in recording time between the copy log and the update log is smaller than a certain threshold is used. For example, a case where both have been updated within 30 minutes is extracted. If both have been updated, it is determined in S1120 that the pattern is “document division (1)” in FIG. 10, and the process ends. If at least one of them has not been updated, the process proceeds to S1103.

Ｓ１１０３では、コピー先文書が複数存在するか否かを判定する。取得したコピーログの近傍にコピー元文書を同一とするコピーログが存在した場合、コピー先文書が複数あると判定し、Ｓ１１０７へ進み、コピー先文書が全て更新されたか否かを判定する。そして、全て更新されていなかった場合はＳ１１０４に進む。また、全て更新されていた場合には、Ｓ１１２１において、図１０における「文書分割（２）」のパターンであると判定し、処理を終了する。 In S1103, it is determined whether there are a plurality of copy destination documents. If there is a copy log having the same copy source document in the vicinity of the acquired copy log, it is determined that there are a plurality of copy destination documents, and the process advances to step S1107 to determine whether all the copy destination documents have been updated. If not all have been updated, the process proceeds to S1104. If all have been updated, it is determined in S1121 that the pattern is “document division (2)” in FIG. 10, and the process ends.

また、Ｓ１１０３の判定の結果、存在しなかった場合、Ｓ１１０４へ進む。Ｓ１１０４では、コピー先文書が更新されたか否かを判定する。更新されていればＳ１１０５へと進み、更新されていなければＳ１１２２に進んで、図１０の「バックアップ」パターンであると判定し、処理を終了する。 On the other hand, if the result of determination in S1103 is that there is no such event, the process proceeds to S1104. In step S1104, it is determined whether the copy destination document has been updated. If it has been updated, the process proceeds to S1105. If it has not been updated, the process proceeds to S1122, where it is determined that the pattern is the “backup” pattern in FIG. 10, and the process ends.

Ｓ１１０５では、更新ログの操作前後の文書サイズの減少率がある閾値よりも小さいか否か判定する。小さければＳ１１０６へ進み、小さくなければコピー後に大幅に内容を削ったと判定し、図１０の「部分抽出」パターンだと判定する（Ｓ１１２３）。 In step S <b> 1105, it is determined whether the reduction rate of the document size before and after the update log operation is smaller than a certain threshold. If it is smaller, the process proceeds to S1106. If it is not smaller, it is determined that the content has been greatly cut after copying, and it is determined to be the “partial extraction” pattern of FIG. 10 (S1123).

Ｓ１１０６では、コピー元ファイルが多くのコピーログでコピー元となっているか否かを判定する。ここでは、Ｓ１１０１で取得したコピーログの近傍だけではなく、操作ログ２０１の全てのコピーログを走査して判定する。多くのログでコピー元となっていれば図１０の「テンプレート」パターンであると判定する（Ｓ１１２４）。コピー元となっていなければ、「差分更新」パターンであると判定する（Ｓ１１２５）。 In step S1106, it is determined whether the copy source file is a copy source in many copy logs. Here, not only the vicinity of the copy log acquired in S1101, but also all the copy logs of the operation log 201 are scanned and determined. If it is the copy source in many logs, it is determined as the “template” pattern of FIG. 10 (S1124). If it is not the copy source, it is determined that the pattern is the “difference update” pattern (S1125).

図１２は、操作ログ２０１から得られるコピーログの近傍パターンの一例を示す図である。図１２のコピーログがどのように図１０の分類項目に分類されるかを図１１のフローチャートを参照しながら説明する。
図１２(ａ)のログレコードが示す操作を説明のために復元したものが図１２(ｂ)である。
ログ１２００が示すのは、図１２(ｂ)にあるように、「特許明細書.doc」という文書をコピー、「特許明細書＿old.doc」というファイル名にリネームして保存しておく。その後、コピー元ファイルの「特許明細書.doc」を更新したという一連の操作である。 FIG. 12 is a diagram illustrating an example of a neighborhood pattern of a copy log obtained from the operation log 201. How the copy log of FIG. 12 is classified into the classification items of FIG. 10 will be described with reference to the flowchart of FIG.
FIG. 12B is a diagram in which the operation indicated by the log record in FIG.
As shown in FIG. 12B, the log 1200 shows a copy of the document “patent specification.doc”, renames the file “patent specification_old.doc”, and saves it. Thereafter, a series of operations in which “patent specification.doc” of the copy source file is updated.

図１１のＳ１１０２において、ログ１２００はコピー元のみ更新されているため、Ｓ１１０３へ進む。
Ｓ１１０３において、ログ１２００はコピー先文書が１つであるため、Ｓ１１０４へ進む。
Ｓ１１０４において、ログ１２００はコピー先文書が更新されていないため、Ｓ１１２２に進んで図１０の「バックアップ」パターンとして分類される。
このように、コピー操作がどの分類に属するのかを判定し、図９のコピー先文書リスト９０２に、図１０から得られる分類番号を付与する。
以上の処理を行うことで、図２のプロファイル生成部２０２は図２のプロファイル情報２０３を構成する文書関連度テーブル７００、文書重要度テーブル８００、コピー分類テーブル９００に格納する情報を生成し、更新する。 In S1102 of FIG. 11, the log 1200 is updated only for the copy source, and thus the process proceeds to S1103.
In S1103, since the log 1200 has one copy destination document, the process proceeds to S1104.
In S1104, since the copy destination document has not been updated in the log 1200, the process advances to S1122, and is classified as the “backup” pattern in FIG.
In this way, it is determined to which classification the copy operation belongs, and the classification number obtained from FIG. 10 is assigned to the copy destination document list 902 of FIG.
By performing the above processing, the profile generation unit 202 in FIG. 2 generates and stores information to be stored in the document relevance table 700, the document importance table 800, and the copy classification table 900 constituting the profile information 203 in FIG. To do.

[レコメンド]
次に、図２のレコメンドスコア計算部２０４について説明する。
本実施形態では、前述したとおり２つの基準で文書をレコメンドする。１つは文書間の関連性によるレコメンド（機能１）、もう１つは各々の文書重要度によるレコメンド（機能２）である。それぞれの機能を利用するユーザ操作については図４及び図５において説明した。 [recommend]
Next, the recommendation score calculation unit 204 in FIG. 2 will be described.
In this embodiment, the document is recommended based on two criteria as described above. One is a recommendation based on the relationship between documents (function 1), and the other is a recommendation based on the importance of each document (function 2). The user operation using each function has been described with reference to FIGS.

[文書関連度によるレコメンド]
図１３は、文書関連度によるレコメンド（機能１）の処理の流れを示すフローチャートである。図１３のフローチャートの各ステップに対応する処理は、ＲＯＭ１０２に記録されたプログラムを、ワークメモリとして機能するＲＡＭ１０３に展開してＣＰＵ１０１が実行することで実現する。
Ｓ１３０１では、ユーザが図４のユーザインタフェース上で選択した文書を取得する。
Ｓ１３０２では、Ｓ１３０１で取得した選択文書と同時利用された文書程スコアが高くなるように関連度スコアを計算して補正する。本ステップの詳細は後述する。
Ｓ１３０３では、Ｓ１３０２で求められた関連度スコアに対して、図１０で示すコピー分類それぞれに特有な補正項を利用して補正を行う。本ステップの詳細は後述する。
Ｓ１３０４では、Ｓ１３０２及び１３０３でスコア付けされた文書をスコア順にソートして出力する。 [Recommendations based on document relevance]
FIG. 13 is a flowchart showing a flow of processing of a recommendation (function 1) based on the document relevance level. The processing corresponding to each step in the flowchart of FIG. 13 is realized by the CPU 101 executing the program recorded in the ROM 102 on the RAM 103 that functions as a work memory.
In S1301, the document selected by the user on the user interface in FIG. 4 is acquired.
In S1302, the relevance score is calculated and corrected so that the score of the document used simultaneously with the selected document acquired in S1301 is higher. Details of this step will be described later.
In S1303, the relevance score obtained in S1302 is corrected using correction terms specific to each copy classification shown in FIG. Details of this step will be described later.
In S1304, the documents scored in S1302 and 1303 are sorted and output in the order of score.

図１４は、図１３のＳ１３０２で行われる、同時利用による関連度計算の処理の流れを示すフローチャートである。本実施形態では、同時利用による関連度は図７の文書関連度テーブル７００に格納されている値をスコアとして利用する。文書関連度テーブル７００に格納されている値は、文書と文書が同時に使われた頻度を示すため、この頻度が高い程文書間の関連度スコアが高くなる。 FIG. 14 is a flowchart showing a flow of relevance calculation processing by simultaneous use performed in S1302 of FIG. In the present embodiment, the degree of relevance due to simultaneous use uses a value stored in the document relevance degree table 700 of FIG. 7 as a score. Since the value stored in the document relevance level table 700 indicates the frequency with which the document and the document are used simultaneously, the relevance score between documents increases as the frequency increases.

Ｓ１４０１では、図７の文書関連度テーブル７００から、図１３のＳ１３０１で取得した文書に相当するレコードを取得する。Ｓ１３０１で選択された文書が「文書１」であれば、図７の先頭レコードを取得する。 In S1401, a record corresponding to the document acquired in S1301 of FIG. 13 is acquired from the document relevance table 700 of FIG. If the document selected in S1301 is “Document 1”, the first record in FIG. 7 is acquired.

Ｓ１４０２では、Ｓ１４０１で取得したレコードの１つ１つの列をスコア付け対象文書として取得する。「文書１」のレコードであれば、列７０２の「文書２」から順に取得する。
Ｓ１４０３では、Ｓ１４０２で取得した列７０２に対応する値を取得する。図７で、「文書１」と「文書２」の交点の値５を取得する。
Ｓ１４０４では、Ｓ１４０１で取得したレコードの全ての列を調べたか否かを判定する。全てを調べていなければ、Ｓ１４０６へ進み、次の列の値を取得する。全ての列を調べていればＳ１４０５へ進む。
Ｓ１４０５では、文書関連度テーブル７００から取得したそれぞれの文書の関連度を正規化する処理を行う。文書iのスコアNormalized＿score(i)は、以下のようにスコアの最大値によって正規化する。 In S1402, each column of the records acquired in S1401 is acquired as a scoring target document. If the record is “document 1”, the records are sequentially acquired from “document 2” in column 702.
In S1403, a value corresponding to the column 702 acquired in S1402 is acquired. In FIG. 7, the value 5 of the intersection of “Document 1” and “Document 2” is acquired.
In S1404, it is determined whether all the columns of the record acquired in S1401 have been examined. If not all are checked, the process proceeds to S1406, and the value of the next column is acquired. If all columns have been examined, the process proceeds to S1405.
In step S1405, a process for normalizing the relevance level of each document acquired from the document relevance level table 700 is performed. The score Normalized_score (i) of the document i is normalized by the maximum score as follows.

図１５及び図１６によって、図１３のＳ１３０３で行われる、コピー分類による関連度スコア補正についての説明を行う。
本実施形態では、文書コピーのログからその操作がどのような文書作成操作であったかを図１５に示すように分類し、それぞれの分類に応じたレコメンドを行う。 The relevance score correction by copy classification performed in S1303 of FIG. 13 will be described with reference to FIGS.
In this embodiment, the document creation operation is classified from the document copy log as shown in FIG. 15, and a recommendation according to each classification is performed.

図１５は、それぞれの分類に対する文書関連度の適用戦略の一例を示す図である。
コピー操作から文書間の関連度を求めるにあたり、図１５の各列に対応する２つの観点がある。１つは、コピー元文書に対する関連文書をコピー先文書の関連文書とする（関連分書引き継ぎ）か否かである。２つ目は、コピー元からコピー先、或いはコピー先からコピー元への関連度をどのように付与（コピー先文書関連度補正）するかという観点である。 FIG. 15 is a diagram illustrating an example of a document relevance application strategy for each classification.
There are two viewpoints corresponding to each column in FIG. 15 in obtaining the degree of association between documents from the copy operation. One is whether or not the related document for the copy source document is set as the related document for the copy destination document (related document takeover). The second point of view is how to assign the degree of association from the copy source to the copy destination or from the copy destination to the copy source (copy destination document relevance correction).

図１５に示すように、コピー元文書の関連文書については、「差分更新」パターンと「テンプレート」パターンのみ引き継がない。それ以外のパターンは引き継ぐようにしている。これにより、「差分更新」パターンにおいてはコピー元となった古いバージョンの文書やその関連文書が大量にレコメンド結果として出力されることを防ぐことができる。また、「テンプレート」パターンにおいては、テンプレートとなっているコピー元文書はレコメンドされにくくなる。 As shown in FIG. 15, only the “difference update” pattern and the “template” pattern are not inherited for the related document of the copy source document. Other patterns are taken over. As a result, in the “difference update” pattern, it is possible to prevent a large amount of old versions of documents and related documents from being copied from being output as recommendation results. Also, in the “template” pattern, the copy source document that is a template is less likely to be recommended.

コピー元とコピー先の関連度については、「バックアップ」パターンのコピー元文書からコピー先文書の関連度及び「テンプレート」、「文書分割（２）」パターンのコピー先からコピー元文書の関連度を補正する。図１５で、「変更なし」とは、図１３のＳ１３０２における同時利用による文書関連度計算によるレコメンドスコアをそのまま利用するということであり、「下げる」とは、上記のレコメンドスコアを低減する方向に補正するという意味である。図１５の例では、更新文書からバックアップ文書への関連度、テンプレート文書からそのテンプレートをコピーして作成した文書、複数に文書を分割した際の分割文書から元文書へのレコメンドスコアが低減され、レコメンドされにくくなる。 Regarding the relevance between the copy source and the copy destination, the relevance of the copy source document from the copy source document of the “backup” pattern and the relevance of the copy source document from the copy destination of the “template” and “document division (2)” patterns. to correct. In FIG. 15, “no change” means that the recommendation score based on the document relevance calculation by simultaneous use in S1302 of FIG. 13 is used as it is, and “decrease” is in the direction of reducing the recommendation score. It means to correct. In the example of FIG. 15, the degree of relevance from the updated document to the backup document, the document created by copying the template from the template document, and the recommendation score from the divided document to the original document when the document is divided into a plurality of documents are reduced. It becomes difficult to be recommended.

図１６は、図１５の適用戦略を元に、図１３のＳ１３０３で行われ、コピー分類による関連度スコア補正処理の流れを示したフローチャートである。図１６のフローチャートの各ステップに対応する処理は、ＲＯＭ１０２に記録されたプログラムを、ワークメモリとして機能するＲＡＭ１０３に展開してＣＰＵ１０１が実行することで実現する。
Ｓ１６０１では、図１３のＳ１３０１で選択した文書がコピー元或いはコピー先になっているログが操作ログ２０１に存在するか否かを判定する。存在すればＳ１６０２へ進み、存在しなければ図１３のＳ１３０２の文書関連度スコアを最終レコメンドスコアとして終了する。 FIG. 16 is a flowchart showing the flow of relevance score correction processing by copy classification performed in S1303 of FIG. 13 based on the application strategy of FIG. The processing corresponding to each step in the flowchart of FIG. 16 is realized by the CPU 101 executing the program recorded in the ROM 102 on the RAM 103 that functions as a work memory.
In S1601, it is determined whether or not a log in which the document selected in S1301 in FIG. If it exists, the process proceeds to S1602, and if it does not exist, the document relevance score in S1302 of FIG. 13 is ended as the final recommendation score.

Ｓ１６０２では、Ｓ１６０１でコピーログが存在すると判定された選択文書が、操作ログ２０１内でコピー元文書であったかコピー先文書であったかを判定する。コピー元文書であればＳ１６０３へ進み、コピー先文書であればＳ１６０６へ進む。 In step S1602, it is determined whether the selected document determined to have a copy log in step S1601 is a copy source document or a copy destination document in the operation log 201. If it is a copy source document, the process proceeds to S1603, and if it is a copy destination document, the process proceeds to S1606.

Ｓ１６０３では、選択文書がコピー元文書であった場合のコピー先文書を取得する。
Ｓ１６０４では、選択文書のコピーログが図１５の分類においてバックアップパターンに分類されているか否かを判断する。判定には、図９のコピー分類テーブル９００の分類番号を使用する。バックアップパターンであればＳ１６０５へ進み、バックアップパターンでなければ図１３のＳ１３０２の文書関連度スコアを最終レコメンドスコアとして終了する。 In S1603, a copy destination document is acquired when the selected document is a copy source document.
In step S1604, it is determined whether the copy log of the selected document is classified as a backup pattern in the classification of FIG. For the determination, the classification number in the copy classification table 900 of FIG. 9 is used. If it is a backup pattern, the process proceeds to S1605. If it is not a backup pattern, the document relevance score in S1302 of FIG. 13 is ended as the final recommendation score.

Ｓ１６０５では、図１５のバックアップパターンにあるように、コピー先文書のレコメンドスコアを下げる方向に補正する。（式１）で示されるコピー先文書iのスコアに対し、最終スコアFinal＿score(i)を次のように補正する。 In step S1605, correction is made in the direction of decreasing the recommendation score of the copy destination document as in the backup pattern of FIG. The final score Final_score (i) is corrected as follows with respect to the score of the copy destination document i shown in (Formula 1).

Ｓ１６０６では、Ｓ１６０２で選択文書がコピー先文書であると判定された場合に、そのコピーログにおけるコピー元文書を取得する。
Ｓ１６０７では、選択文書のコピーログが図１５の分類において差分更新パターンに分類されているか否かを判断する。差分更新パターンであれば、図１３のＳ１３０２の文書関連度スコアを最終レコメンドスコアとして終了する。差分更新パターンでなければＳ１６０８へ進む。 In S1606, when it is determined in S1602 that the selected document is a copy destination document, a copy source document in the copy log is acquired.
In S1607, it is determined whether or not the copy log of the selected document is classified into the difference update pattern in the classification of FIG. If it is a difference update pattern, the document relevance score in S1302 of FIG. 13 is ended as the final recommendation score. If it is not a difference update pattern, the process proceeds to S1608.

Ｓ１６０８では、コピーログがテンプレートパターンであるか否かを判定する。テンプレートパターンであればＳ１６１１へ進み、テンプレートパターンでなければＳ１６０９へ進む。
Ｓ１６０９では、コピー元文書の関連文書スコアを計算する。計算方法は、図１３のＳ１３０２で示した通りである。図１５に示すように、コピー元文書の関連文書を選択文書の関連文書として引き継ぐ。 In step S1608, it is determined whether the copy log is a template pattern. If it is a template pattern, the process proceeds to S1611. If it is not a template pattern, the process proceeds to S1609.
In step S1609, the related document score of the copy source document is calculated. The calculation method is as shown in S1302 of FIG. As shown in FIG. 15, the related document of the copy source document is taken over as the related document of the selected document.

Ｓ１６１０では、コピーログが文書分割（２）パターンか否かを判定する。文書分割（２）パターンであれば、Ｓ１６１１へ進み、そうでなければ終了する。終了時のレコメンドスコアは、選択文書の同時利用による文書関連度及び選択文書のコピー元文書の同時利用による文書関連度となる。
Ｓ１６１１では、図１５のテンプレートパターン及び文書分割（２）パターンにあるように、コピー元文書のレコメンドスコアを下げる方向に補正する。（式１）で示されるコピー先文書iのスコアに対し、最終スコアFinal＿score(i)を次のように補正する。 In S1610, it is determined whether or not the copy log is a document division (2) pattern. If it is a document division (2) pattern, the process advances to step S1611; otherwise, the process ends. The recommendation score at the end is the document relevance level due to the simultaneous use of the selected document and the document relevance level due to the simultaneous use of the copy source document of the selected document.
In S1611, correction is made in the direction of lowering the recommendation score of the copy source document as in the template pattern and document division (2) pattern of FIG. The final score Final_score (i) is corrected as follows with respect to the score of the copy destination document i shown in (Formula 1).

このようにして、操作ログ２０１から推定したコピー分類パターンに応じてレコメンドスコアの補正を行うことで、関連文書レコメンド結果に不要な文書が現れる可能性を低減させる。
なお、本実施形態では、コピー元文書に対する関連文書の引き継ぎと、コピー元文書とコピー先文書の関連度という２つの観点によってスコア付けをおこなったが、どちらか一方の観点を用いてスコアを計算してもよい。どちらか一方を用いる場合は、コピー分類の範囲を変更してもよい。例えば、関連度補正のみを行う場合、図１５において、コピー先とコピー元双方「変更なし」の場合、コピー元からコピー先のみ関連度を下げる場合、コピー先からコピー元のみ関連度を下げる場合、の３つの分類を使用することができる。 In this way, by correcting the recommendation score according to the copy classification pattern estimated from the operation log 201, the possibility that an unnecessary document appears in the related document recommendation result is reduced.
In this embodiment, scoring is performed from the two viewpoints of taking over the related document to the copy source document and the degree of relevance between the copy source document and the copy destination document. The score is calculated using either one of the viewpoints. May be. When either one is used, the copy classification range may be changed. For example, when only relevance correction is performed, when both the copy destination and the copy source are “no change” in FIG. 15, only the copy destination from the copy source is reduced, or only the copy source is reduced from the copy destination , Three classifications can be used.

[文書重要度によるレコメンド]
次に、図５の文書重要度によるレコメンドを実現する図２の文書重要度計算部２０５について説明する。
図１７は、文書重要度計算部２０５の処理の流れを示すフローチャートである。図１７のフローチャートの各ステップに対応する処理は、ＲＯＭ１０２に記録されたプログラムを、ワークメモリとして機能するＲＡＭ１０３に展開してＣＰＵ１０１が実行することで実現する。
Ｓ１７０１では、図５の５０１において選択されたフォルダを取得する。
Ｓ１７０２では、スコア付け対象文書を取得する。
Ｓ１７０３では、Ｓ１７０２で取得したスコア付け対象文書がＳ１７０１で取得した選択フォルダ及びサブフォルダ内に存在するか否かを判定する。フォルダ内に存在すればＳ１７０４へ進み、存在しなければＳ１７１０へ進んでスコア０を設定してＳ１７０７に進む。
Ｓ１７０４では、重要度スコアを計算する。本実施形態では、複数のユーザが印刷或いは閲覧した頻度が高い文書を重要度が高い文書であると定義する。文書iに対する閲覧・印刷頻度による重要度スコアは以下の式で求める。 [Recommendations based on document importance]
Next, the document importance calculation unit 205 in FIG. 2 that implements the recommendation based on the document importance in FIG. 5 will be described.
FIG. 17 is a flowchart showing a processing flow of the document importance degree calculation unit 205. The processing corresponding to each step in the flowchart of FIG. 17 is realized by the CPU 101 executing the program recorded in the ROM 102 on the RAM 103 that functions as a work memory.
In S1701, the folder selected in 501 in FIG. 5 is acquired.
In S1702, a scoring target document is acquired.
In S1703, it is determined whether the scoring target document acquired in S1702 exists in the selected folder and subfolder acquired in S1701. If it exists in the folder, the process proceeds to S1704. If it does not exist, the process proceeds to S1710 to set score 0, and the process proceeds to S1707.
In S1704, an importance score is calculated. In the present embodiment, a document that is frequently printed or viewed by a plurality of users is defined as a document having a high importance. The importance score based on the browsing / printing frequency for the document i is obtained by the following equation.

ここで、印刷頻度Print＿frequency、及び閲覧頻度Open＿frequencyは、図８の文書重要度テーブル８００の列８０２から得られる値である。τ及びυは印刷頻度と閲覧頻度のどちらを重視するかを決定する重みである。ここでは、印刷の重みを閲覧の重みの２倍（τ＝２、υ＝１）とする。 Here, the printing frequency Print_frequency and the browsing frequency Open_frequency are values obtained from the column 802 of the document importance table 800 of FIG. τ and υ are weights that determine which of the printing frequency and the browsing frequency is important. Here, it is assumed that the printing weight is twice the viewing weight (τ = 2, υ = 1).

Ｓ１７０５では、Ｓ１７０２で取得した文書のコピーログが操作ログ２０１に存在するか否かを判定する。コピーログが存在すれば、Ｓ１７０６へ進み、存在しなければＳ１７０７へ進む。
Ｓ１７０６では、図１０のコピー分類に応じて文書の重要度スコアを補正する。本処理の詳細は後述する。 In step S1705, it is determined whether the copy log of the document acquired in step S1702 exists in the operation log 201. If there is a copy log, the process proceeds to S1706, and if not, the process proceeds to S1707.
In S1706, the importance score of the document is corrected according to the copy classification of FIG. Details of this processing will be described later.

Ｓ１７０７では、全ての文書に対してスコア付けが行われたか否かを判定する。スコア付けされていればＳ１７０８へ進み、されていなければＳ１７０９へ進み、次のスコア付け対象文書を取得してＳ１７０３へ戻る。
Ｓ１７０８では、Ｓ１７０１からＳ１７０７で付与した重要度スコアが昇順になるように文書をソートしてレコメンド結果として出力する。 In step S1707, it is determined whether scoring has been performed for all documents. If it has been scored, the process advances to step S1708. If not, the process advances to step S1709 to acquire the next document to be scored, and the process returns to step S1703.
In step S1708, the documents are sorted so that the importance scores assigned in steps S1701 to S1707 are in ascending order and output as recommendation results.

次に、図１８及び図１９を参照して、図１７のＳ１７０６で行われるコピー分類による重要度スコア補正についての説明を行う。
本実施形態では、文書コピーのログからその操作がどのような文書作成操作であったかを図１０に示したように分類し、それぞれの分類に応じて適切なレコメンドを行う。 Next, with reference to FIGS. 18 and 19, the importance score correction by copy classification performed in S1706 of FIG. 17 will be described.
In this embodiment, the document creation operation is classified from the document copy log as shown in FIG. 10, and an appropriate recommendation is performed according to each classification.

図１８は、それぞれの分類に対する文書重要度計算の適用戦略の一例を示す図である。図１８に示すように、各分類に応じてコピー元或いはコピー先の文書の重要度を補正する。差分更新パターンでは、コピー元となっている前バージョンの文書の重要度を下げる。バックアップパターンでは、コピー先となっているバックアップ文書の重要度を下げる。文書分割（２）では、コピー先の複数の文書でコピー元の文書が再現できるので、コピー元の文書の重要度を下げる。 FIG. 18 is a diagram illustrating an example of an application strategy of document importance calculation for each classification. As shown in FIG. 18, the importance of the copy source or copy destination document is corrected according to each classification. In the differential update pattern, the importance of the previous version document that is the copy source is lowered. In the backup pattern, the importance of the backup document as the copy destination is lowered. In document division (2), since the copy source document can be reproduced with a plurality of copy destination documents, the importance of the copy source document is lowered.

図１９は、図１８の適用戦略を元にした図１７のＳ１７０６で行われる重要度スコア補正処理の流れを示すフローチャートである。
Ｓ１９０１では、図１７のＳ１７０５でコピーログが見つかったスコア付け対象文書が、コピー元文書であるか否かを判定する。コピー元であればＳ１９０２へ進み、コピー先であればＳ１９０４へ進む。 FIG. 19 is a flowchart showing the flow of importance score correction processing performed in S1706 of FIG. 17 based on the application strategy of FIG.
In S1901, it is determined whether or not the scoring target document for which the copy log is found in S1705 in FIG. 17 is a copy source document. If it is a copy source, the process proceeds to S1902, and if it is a copy destination, the process proceeds to S1904.

Ｓ１９０２では、スコア付け対象文書のコピーログが差分更新パターン或いは文書分割（２）パターンであるか否かを判定する。差分更新或いは文書分割（２）パターンであれば、Ｓ１９０３へ進み、図１８の適用戦略により、コピー元文書の重要度を低減させる方向にコピー元文書のスコアを補正する。（式４）の印刷・閲覧による重要度スコアに対して、最終重要度スコアFinal＿scoreを以下の（式５）のように補正する。 In S1902, it is determined whether the copy log of the document to be scored is a difference update pattern or a document division (2) pattern. If it is a difference update or document division (2) pattern, the process advances to step S1903, and the score of the copy source document is corrected in a direction of reducing the importance of the copy source document by the application strategy of FIG. The final importance score Final_score is corrected as shown in the following (Expression 5) with respect to the importance score by printing / viewing of (Expression 4).

Ｓ１９０４では、スコア付け対象文書のコピーログがバックアップパターンであるか否かを判定する。バックアップパターンであればＳ１９０５へ進み、そうでなければ終了する。
Ｓ１９０５では、図１８のバックアップパターンの適用戦略に基づき、コピー先文書の重要度を補正する。補正は、前述した（式５）と同様に計算する。
以上のように、コピーによる文書生成パターンによりコピー元或いはコピー先文書の重要度を適応的に決定することによって、フォルダ選択時に重要でない文書がレコメンドされる可能性が低減される。 In S1904, it is determined whether or not the copy log of the document to be scored is a backup pattern. If it is a backup pattern, the process advances to step S1905; otherwise, the process ends.
In step S1905, the importance level of the copy destination document is corrected based on the backup pattern application strategy shown in FIG. The correction is calculated in the same manner as (Equation 5) described above.
As described above, by appropriately determining the importance of the copy source document or the copy destination document based on the document generation pattern by copying, the possibility that an unimportant document is recommended when selecting a folder is reduced.

なお、本実施形態では文書関連度と文書重要度という２つの指標をレコメンドに用いたが、どちらか１つを用いてもよい。どちらか１つを用いる場合は、コピー分類の範囲を変更してもよい。例えば、文書重要度のみをレコメンド指標として用いた場合、図１８において、コピー元とコピー先文書の重要度が両方とも「変更なし」の場合、コピー元文書の重要度のみ「下げる」場合、コピー先文書の重要度のみ「下げる」場合、の３つの分類を使用する。 In the present embodiment, the two indexes of the document relevance level and the document importance level are used for the recommendation, but either one may be used. When either one is used, the copy classification range may be changed. For example, when only the document importance is used as the recommendation index, in FIG. 18, when both the copy source and copy destination documents have the “no change” importance, the copy source document importance only “decreases”, the copy In the case where only the importance level of the previous document is “reduced”, the following three classifications are used.

（第２の実施形態）
第１の実施形態では、レコメンドスコア計算時において先に印刷・閲覧頻度によるレコメンドスコアを計算して、後処理としてコピー分類補正を行った。しかし、プロファイル生成時に図７の文書関連度テーブル７００及び図８の文書重要度テーブル８００の値を、直接コピー履歴を利用して低減するようにしてもよい。これは、プロファイル生成時に図１１のコピー分類処理を実行した後、以下のようにする。すなわち、図１５及び図１８のスコア付け戦略に基づき、スコアを「下げる」と判定された文書に相当する文書関連度テーブルと文書重要度テーブルの値に対し、ペナルティー項としてある負の整数を加算（正の整数を減算）することで実現できる。 (Second Embodiment)
In the first embodiment, the recommendation score based on the printing / viewing frequency is first calculated when calculating the recommendation score, and the copy classification correction is performed as post-processing. However, the values in the document relevance table 700 in FIG. 7 and the document importance table 800 in FIG. 8 may be reduced by using the direct copy history at the time of profile generation. This is performed as follows after the copy classification process of FIG. 11 is executed at the time of profile generation. That is, based on the scoring strategies of FIGS. 15 and 18, a negative integer as a penalty term is added to the values of the document relevance table and the document importance table corresponding to the document whose score is determined to be “decreased” This can be realized by subtracting a positive integer.

例えば、ペナルティー項を−５とし、文書１と文書２がそれぞれバックアップパターンのコピー元文書、コピー先文書であった場合、図１５の適用戦略により文書１から文書２への関連度を下げる。図７の文書１と文書２の交点の値５に対しペナルティー項を加算して値を０とする。 For example, if the penalty term is -5 and the document 1 and the document 2 are the copy source document and the copy destination document of the backup pattern, respectively, the relevance from the document 1 to the document 2 is lowered by the application strategy of FIG. A penalty term is added to the intersection value 5 of document 1 and document 2 in FIG.

レコメンド時には、上記のようにコピー分類補正された文書関連度テーブル及び文書重要度テーブルの値をそのままレコメンドスコアとして出力する。
このようにして、プロファイル生成時にコピー分類補正を行うことにより、第１の実施形態と同様の効果が得られる。 At the time of recommendation, the values of the document relevance table and the document importance table that have been subjected to copy classification correction as described above are directly output as recommendation scores.
In this way, the same effect as that of the first embodiment can be obtained by performing the copy classification correction at the time of profile generation.

以上、本発明の実施形態で述べたように、ユーザが過去に文書を操作した履歴、特に文書コピーの履歴からどのような意図によって文書を作成しようとしたかを推定し、推定結果に基づいたレコメンドを行うようにした。これにより、ユーザの操作履歴に応じた好適なレコメンドを行うことができる。
また、コピーログからユーザがどのように文書を作成したか推定し、推定結果に応じた文書関連度及び文書重要度により文書をレコメンドすることで、レコメンド結果内の不要なノイズが削減され、レコメンド精度を向上させることができる。
なお、本実施形態ではユーザがファイルまたはフォルダを選択した際の文書のレコメンド機能について説明したが、本発明はそれらの機能に限定されるものではない。ユーザが文書を検索した際の検索結果を文書重要度順にソートする場合や、検索結果文書の関連文書を関連度順にユーザに提示する場合にも適用できる。 As described above in the embodiment of the present invention, the intent of creating a document is estimated from the history of the user's operation of the document in the past, particularly the history of document copy, and based on the estimation result Recommended to do. Thereby, the suitable recommendation according to a user's operation history can be performed.
In addition, by estimating how the user created the document from the copy log and recommending the document according to the document relevance and document importance according to the estimation result, unnecessary noise in the recommendation result is reduced, and the recommendation is recommended. Accuracy can be improved.
Although the document recommendation function when the user selects a file or folder has been described in this embodiment, the present invention is not limited to these functions. The present invention can also be applied to a case where search results when a user searches for a document are sorted in order of document importance, or a case where related documents of a search result document are presented to the user in order of relevance.

（その他の実施形態）
また、本発明は、以下の処理を実行することによっても実現される。即ち、上述した実施形態の機能を実現するソフトウェア（コンピュータプログラム）を、ネットワーク又は各種のコンピュータ読み取り可能な記憶媒体を介してシステム或いは装置に供給する。そして、そのシステム或いは装置のコンピュータ（またはＣＰＵやＭＰＵ等）がプログラムを読み出して実行する処理である。 (Other embodiments)
The present invention can also be realized by executing the following processing. That is, software (computer program) that realizes the functions of the above-described embodiments is supplied to a system or apparatus via a network or various computer-readable storage media. Then, the computer (or CPU, MPU, etc.) of the system or apparatus reads out and executes the program.

１００情報検索装置、２００文書管理システム、２０１操作ログ、２０２プロファイル生成部、２０３プロファイル情報、２０４レコメンドスコア計算部、２０５文書重要度計算部、２０６文書関連度計算部、２０７レコメンド結果表示部 DESCRIPTION OF SYMBOLS 100 Information retrieval apparatus, 200 Document management system, 201 Operation log, 202 Profile production | generation part, 203 Profile information, 204 Recommendation score calculation part, 205 Document importance calculation part, 206 Document relevance calculation part, 207 Recommendation result display part

Claims

Profile generation means for generating profile information to be used for recommendation from document operation history information in which operation history for each operation type of a user performed on a plurality of registered documents is recorded;
Analyzing the profile information generated by the profile generation unit, and calculating a recommendation score for each of the plurality of documents based on the analysis result; and
An information retrieval apparatus comprising: a recommendation result display means for displaying a recommendation result document calculated by the recommendation score calculation means on a display device.

The profile information has a copy classification table for storing a copy source document and a copy destination document,
The profile generation means extracts a document that has become a copy source in the past from the document operation history information and stores it in the copy source document, and creates the document from the history indicating that the document has been copied. 2. The information retrieval apparatus according to claim 1, wherein the information is classified into predetermined classification items and stored in the duplication destination document.

3. The information search apparatus according to claim 2, wherein the classification items are classified by estimating what purpose the user has created a document from a log record of a copy operation.

The profile information has a document relevance table in which all registered documents are stored and relevance between all documents is given,
The profile generation means updates the relevance information stored in the document relevance table every time the documents are used at the same time, and the relevance information is stored in the document relevance table. 4. The information search device according to 3.

4. The profile generation unit according to claim 2, wherein the profile generation unit determines whether to take over a document related to a copy source document as a document related to a copy destination document according to a result of classifying the document. Information retrieval device.

The profile information has a document importance table storing importance information of all registered documents,
3. The profile generation unit assumes that a document viewed or printed by a user is a document having high importance, and updates the importance information of the document every time the user browses or prints. The information search device according to any one of? 4.

The classification items include a pattern for duplicating and updating an old version document, a pattern for duplicating an old version and storing it as a backup, a pattern for duplicating and updating a template, and a pattern for duplicating and dividing a document. The information search apparatus according to any one of claims 2 to 6, wherein the information search apparatus is characterized in that:

The information search apparatus according to claim 2, wherein the recommendation score calculation unit calculates an importance score of the document according to the pattern of the classification item.

The information retrieval apparatus according to claim 2, wherein the recommendation score calculation unit calculates importance scores of a duplication source document and a duplication destination document according to a pattern of the classification item. .

The recommendation score calculating means includes a document importance calculating means for calculating a score indicating the importance of registered documents, and a document relevance calculating means for calculating a score indicating the degree of association between registered documents. The information search device according to claim 1, wherein:

A profile generation step of generating profile information used for recommendation from document operation history information in which operation history for each operation type of a user performed on a plurality of registered documents is recorded;
Analyzing the profile information generated in the profile generation step, and calculating a recommendation score for each of the plurality of documents based on the analysis result; and
A recommendation result display step of displaying a recommendation result document calculated in the recommendation score calculation step on a display device.

The profile information has a copy classification table for storing a copy source document and a copy destination document,
In the profile generation step, a document that has been a copy source in the past is extracted from the document operation history information and stored in the copy source document, and the purpose of creating the document from a history indicating that the document has been copied is 12. The information retrieval method according to claim 11, wherein the information is classified into predetermined classification items and stored in the duplication destination document.

13. The information search method according to claim 12, wherein the classification items are classified by estimating what kind of intention a user has created a document from a log record of a copy operation.

The profile information has a document relevance table in which all registered documents are stored and relevance between all documents is given,
13. The profile generation step includes updating the relevance information stored in the document relevance table every time a document is used at the same time. Or the information search method of 13.

The profile generation step determines whether or not to take over a document related to a copy source document as a document related to a copy destination document according to a result of classifying the document. Information retrieval method.

The profile information has a document importance table storing importance information of all registered documents,
13. The profile generation step assumes that a document viewed or printed by a user is a document having high importance, and updates importance information of the document every time the user browses or prints. The information search method of any one of -14.

The classification items include a pattern for duplicating and updating an old version document, a pattern for duplicating an old version and storing it as a backup, a pattern for duplicating and updating a template, and a pattern for duplicating and dividing a document. The information search method according to any one of claims 12 to 16, wherein the information search method is characterized in that:

The information retrieval method according to claim 12, wherein the recommendation score calculation step calculates an importance score of the document according to the pattern of the classification item.

The information search method according to any one of claims 12 to 18, wherein the recommendation score calculation step calculates importance scores of a duplication source document and a duplication destination document according to a pattern of the classification item. .

The recommendation score calculation step includes a document importance level calculation step for calculating a score indicating the importance level of the registered documents, and a document relevance level calculation step for calculating a score indicating the relevance level between the registered documents. The information search method according to claim 11, wherein:

A profile generation step of generating profile information used for recommendation from document operation history information in which operation history for each operation type of a user performed on a plurality of registered documents is recorded;
Analyzing the profile information generated in the profile generation step, and calculating a recommendation score for each of the plurality of documents based on the analysis result; and
A program for causing a computer to execute a recommendation result display step of displaying a recommendation result document calculated in the recommendation score calculation step on a display device.

A computer-readable storage medium storing the program according to claim 21.