JP2009129185A

JP2009129185A - Document management device, document management method and program

Info

Publication number: JP2009129185A
Application number: JP2007303429A
Authority: JP
Inventors: Takashi Doi; 敬司土井; Yoshitaka Shin; 吉高新; Toshihiro Kujirai; 俊宏鯨井; Naohiko Fukaya; 直彦深谷
Original assignee: Clarion Co Ltd; Hitachi Ltd
Current assignee: Hitachi Ltd; Faurecia Clarion Electronics Co Ltd
Priority date: 2007-11-22
Filing date: 2007-11-22
Publication date: 2009-06-11

Abstract

<P>PROBLEM TO BE SOLVED: To extract a related document wherein an actual situation of document preparation in a business project is considered. <P>SOLUTION: An input processing part 11 sets data according to the business project to a weighting table 146 and a related threshold table 147 based on data input from a terminal device 2. A related document extraction processing part 12 totals an appearance frequency of a keyword included in the weighting table 146 about each document registered in a document DB 14, and performs principal component analysis based on the totaled appearance frequency of the keyword and its weighting value to calculate a characteristic amount of each document. When a distance between coordinate positions expressed by the characteristic amount is not more than a related threshold value set in the related threshold value table 147, the document is decided as the related document, and the related document to a reference document is extracted. An output processing part 13 displays a name of the extracted related document on a terminal device 2. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、ドキュメントデータベースからユーザの業務に関連する、または、ユーザが指定するドキュメントに関連するドキュメントを提示するドキュメント管理装置、ドキュメント管理方法およびプログラムに関する。 The present invention relates to a document management apparatus, a document management method, and a program for presenting a document related to a user's business from a document database or related to a document designated by a user.

コンシューマ向けの工業製品（例えば、テレビ、パソコン、携帯電話機、カーナビ装置、乗用車など）は、毎年、あるいは、それ以上の頻度で、多数の新製品が市場に送り出され、その販売競争が世界中で繰り広げられている。そのような工業製品を設計・開発する現場では、そのたびごとに、膨大なドキュメントが作成され、そのドキュメントの作成に膨大なコストが費やされる。従って、新製品を設計・開発する場合、過去の製品のドキュメントを参考にしたり、その一部を再利用したりすることによって、ドキュメント作成の効率向上を図ることが、そのコスト削減の重要なポイントとなっている。 Industrial products for consumers (for example, TVs, personal computers, mobile phones, car navigation systems, passenger cars, etc.) are introduced to the market every year or more frequently, and their sales competition is worldwide. It is unfolding. In the field where such industrial products are designed and developed, a huge amount of documents are created each time, and a huge cost is spent on the creation of the documents. Therefore, when designing and developing new products, it is important to improve the efficiency of document creation by referring to past product documents or reusing a part of them. It has become.

ところで、コンシューマ向けの工業製品では、その製品の基本的な機能が大きく変わることはほとんどないが、デザイン、使い勝手、付加的機能の面においては、新製品ごとに新たな工夫が加えられる。従って、新製品の設計思想は、その新製品ごとに異なったものとなることが多い。そのような場合、その設計思想などに応じて設計・開発のワークフローにも工夫が加えられ、新製品が他の新製品や過去の製品と同じワークフローに従って開発されるとは限らない。また、製品の設計ツールやシミュレーションツールなどが変わることにより、設計・開発のワークフロー自体が変わることもある。 By the way, in the industrial products for consumers, the basic functions of the products are hardly changed. However, in terms of design, usability, and additional functions, new ideas are added for each new product. Therefore, the design philosophy of a new product is often different for each new product. In such a case, the design / development workflow is devised according to the design philosophy, and the new product is not always developed according to the same workflow as other new products or past products. In addition, the design / development workflow itself may change due to changes in product design tools and simulation tools.

ワークフローに変更が加えられると、当然ながら、その中で作成されるドキュメントの体系も変えられることが多い。その場合には、例えば、ある新製品の開発プロジェクトに属するメンバが、他の新製品または過去の製品の開発プロジェクトのドキュメントを参照しようとしても、自らの業務に関連するドキュメントを適切に抽出することは容易ではない。そのような状況では、他のまたは過去の製品の開発プロジェクトのドキュメントを効率よく利用することはできない。そこで、ユーザの業務に関連するドキュメントを検索したり、関連するドキュメントが登録・変更された場合には、その登録・変更の通知を受けたりするシステムの構築が求められる。 When changes are made to the workflow, of course, the system of documents created in the workflow is often changed. In that case, for example, even if a member who belongs to a development project of a new product tries to refer to a document of a development project of another new product or a past product, a document related to his / her business is appropriately extracted. Is not easy. In such situations, the documentation of other or past product development projects cannot be used efficiently. Therefore, it is required to construct a system that searches for documents related to the user's business and receives notification of registration / change when the related documents are registered / changed.

従来の技術として、ユーザがドキュメントを登録または変更したとき、その登録または変更したドキュメントを関連するユーザに確実かつ簡単に配布するために、配布するドキュメント自体に配布先情報を入れておき、その配布先情報が示す配布先へそのドキュメントを配布するという文書管理システムが知られている（特許文献１など参照）。 As a conventional technique, when a user registers or changes a document, in order to reliably and easily distribute the registered or changed document to related users, the distribution destination information is put in the distributed document itself, and the distribution is performed. A document management system is known in which the document is distributed to a distribution destination indicated by the destination information (see Patent Document 1).

また、他の従来の技術として、端末（ユーザ）間で行われるドキュメントの配布の履歴を、ユーザ間の業務の関連度を示す情報として蓄積し、あるドキュメントが登録されたときには、そのドキュメントの配布の履歴に基づき、ドキュメントを登録したユーザの業務に関連する他のユーザを抽出し、その抽出した他のユーザにドキュメントが登録されたことを通知するシステムが提案されている（特許文献２など参照）。 As another conventional technique, a history of document distribution performed between terminals (users) is stored as information indicating the degree of business relevance between users, and when a document is registered, the document is distributed. A system has been proposed in which other users related to the business of the user who registered the document are extracted based on the history of the document, and the extracted other user is notified that the document has been registered (see, for example, Patent Document 2). ).

また、他の従来の技術として、あるドキュメントに類似（関連）するドキュメントを、ドキュメントデータベースの中から抽出する技術として、それぞれのドキュメントに含まれるキーワードの主成分分析によってドキュメントの特徴量を求め、その特徴量に基づきドキュメント間の類似度を判定する技術が知られている（特許文献３など参照）。
特開２００５−２４６６７１号公報特開２００５−３４６２８１号公報特開２００２−１４９９９号公報 As another conventional technique, a document similar to (relevant to) a document is extracted from the document database, and the feature amount of the document is obtained by principal component analysis of keywords included in each document. A technique for determining a similarity between documents based on a feature amount is known (see Patent Document 3).
JP 2005-246671 A JP 2005-346281 A JP 2002-14999 A

しかしながら、特許文献１に開示されたシステムでは、配布先は、ユーザ自身が配布するドキュメントの中に記載する必要があるので、複数の開発プロジェクトが同時に入り乱れて進行し、また、それらの開発プロジェクトの組織がたびたび変更されるような場合には、ユーザは、ドキュメントの配布先を適切にかつ効率的に定めることができない。 However, in the system disclosed in Patent Document 1, since the distribution destination needs to be described in a document distributed by the user himself, a plurality of development projects are confused at the same time. When the organization is frequently changed, the user cannot appropriately and efficiently determine the distribution destination of the document.

また、特許文献２に開示されたシステムでは、ドキュメントが登録されたことを通知する通知先（他のユーザ）は、ドキュメントの配布の履歴、つまり、ユーザ間の業務の関連度だけによって決定され、そのとき、登録されたドキュメントと通知先のユーザとの直接的な関連度は考慮されない。従って、通知先のユーザは、自分の業務に関連のないドキュメントの登録通知を受ける可能性がある。 Further, in the system disclosed in Patent Document 2, a notification destination (other user) for notifying that a document has been registered is determined only by the distribution history of the document, that is, the degree of business relevance between users, At that time, the degree of direct association between the registered document and the notification destination user is not considered. Therefore, the notification destination user may receive a registration notification of a document that is not related to his / her business.

また、特許文献３に開示されたドキュメントの類似度を判定する技術は、ＷＥＢサイトなどに存在する一般的なドキュメント同士の一般的な類似度を求める方法が開示されたものである。従って、それは、互いに異なるワークフローを有する様々な開発プロジェクトで作成されたドキュメント同士の類似度を、その開発プロジェクトの実態を考慮した上で求める技術としては、必ずしも適切であるとはいえない。 The technique for determining the similarity of documents disclosed in Patent Document 3 discloses a method for obtaining a general similarity between general documents existing on a WEB site or the like. Therefore, it is not necessarily appropriate as a technique for obtaining the similarity between documents created in various development projects having different workflows in consideration of the actual situation of the development project.

以上のような従来技術の問題点に鑑み、本発明は、開発プロジェクトごとのドキュメント作成の実態を考慮してドキュメント間の特徴量を求め、その求めた特徴量に基づき、関連ドキュメントを抽出することができ、さらには、ドキュメントの登録通知を、その登録ドキュメントに関連するドキュメントの閲覧履歴のあるユーザに通知することが可能なドキュメント管理装置、ドキュメント管理方法およびプログラムを提供することを目的とする。 In view of the problems of the prior art as described above, the present invention obtains feature quantities between documents in consideration of the actual state of document creation for each development project, and extracts related documents based on the obtained feature quantities. Furthermore, an object of the present invention is to provide a document management apparatus, a document management method, and a program capable of notifying a user who has a browsing history of a document related to the registered document.

本発明におけるドキュメント管理装置は、端末装置に接続され、処理装置と記憶装置とを含んで構成され、その記憶装置には、複数のドキュメントが登録されたドキュメントデータベースと、ドキュメントの特徴量を算出するためのキーワードおよびその重み付け値を格納した重み付けテーブルと、ドキュメント同士の関連度を判定するための関連閾値を格納した関連閾値テーブルと、が保持され、その処理装置が、（１）端末装置から入力されるデータに基づき、重み付けテーブルおよび関連閾値テーブルのデータを、ドキュメントが作成される業務プロジェクトごとに設定し、（２）端末装置から入力されるデータに基づき、ドキュメントデータベースに登録されているドキュメントから１のドキュメントを基準ドキュメントとして選択し、（３）ドキュメントデータベースに登録されているドキュメントについて、重み付けテーブルに格納されているキーワードの出現回数を集計し、そのキーワードの出現回数と前記重み付けテーブルに格納されている重み付け値とに基づき、主成分分析を行い、それぞれのドキュメントの特徴量を算出し、（４）それぞれのドキュメントの特徴量が表す座標位置と基準ドキュメントの特徴量が表す座標位置との距離が関連閾値テーブルに格納されている関連閾値以下であったとき、そのドキュメントを関連ドキュメントとして抽出し、（５）その抽出した関連ドキュメントの名称を端末装置に表示することを特徴とする。 A document management apparatus according to the present invention is connected to a terminal device and includes a processing device and a storage device. The storage device calculates a document database in which a plurality of documents are registered, and document feature amounts. A weighting table storing keywords and weighting values thereof, and a related threshold table storing related threshold values for determining the degree of association between documents, and (1) a processing device is input from a terminal device Based on the received data, the weighting table and the related threshold table data are set for each business project in which the document is created. (2) Based on the data input from the terminal device, the document is registered in the document database. Select one document as the reference document (3) For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the number of appearances of the keyword and the weighting value stored in the weighting table, Component analysis is performed to calculate the feature amount of each document. (4) The distance between the coordinate position represented by the feature amount of each document and the coordinate position represented by the feature amount of the reference document is stored in the related threshold value table. When it is equal to or less than the related threshold, the document is extracted as a related document, and (5) the name of the extracted related document is displayed on the terminal device.

また、本発明のドキュメント管理装置においては、処理装置は、ドキュメントデータベースを閲覧したユーザの履歴を閲覧履歴テーブルとして記憶装置に蓄積し、新たなドキュメントをそのドキュメントデータベースを登録したときには、その新たなドキュメントを基準ドキュメントとして関連ドキュメントを抽出し、閲覧履歴テーブルを参照して、その関連ドキュメントを閲覧した履歴のあるユーザが使用中の端末装置に、前記新たなドキュメントが登録された旨のメッセージを表示する。 In the document management device of the present invention, the processing device accumulates the history of the user who viewed the document database in the storage device as a browsing history table, and when a new document is registered in the document database, the new document The related document is extracted with reference to the reference document, the browsing history table is referenced, and a message indicating that the new document has been registered is displayed on the terminal device being used by the user who has browsed the related document. .

本発明においては、ドキュメントの特徴量を算出するための重み付けテーブルのデータを、例えば、システム管理者などが、業務プロジェクトの業務方針やそのプロジェクトの製品設計思想などドキュメント作成の実態を考慮して、適宜、設定することができる。従って、その重み付けテーブルを用いて算出したドキュメントの特徴量には、業務プロジェクトごとのドキュメント作成の実態が反映されるので、本発明のドキュメント管理装置により抽出される関連ドキュメントは、ドキュメント作成の実態が考慮されたものとなる。 In the present invention, the weighting table data for calculating the document feature amount, for example, the system administrator, taking into account the actual state of document creation, such as the business policy of the business project and the product design concept of the project, It can be set as appropriate. Therefore, the document feature amount calculated using the weighting table reflects the actual document creation for each business project. Therefore, the related document extracted by the document management apparatus of the present invention has the actual document creation. It will be considered.

また、本発明においては、新たなドキュメントをドキュメントデータベースに登録した旨を通知するメッセージは、その新たなドキュメントに関連するドキュメントを閲覧したユーザが使用中の端末装置に表示される。従って、その通知を受けるユーザは、閲覧した履歴のあるドキュメント、つまり、自分の業務に関連するドキュメントに関連するドキュメントの登録通知を受けることができる。逆にいえば、自分の業務に関連しないドキュメントに関連するドキュメントの登録通知は受けることがない。 In the present invention, a message notifying that a new document has been registered in the document database is displayed on the terminal device being used by the user who has browsed the document related to the new document. Therefore, a user who receives the notification can receive a registration notification of a document having a history of browsing, that is, a document related to a document related to his / her business. In other words, a document registration notification related to a document not related to his / her business is not received.

本発明によれば、開発プロジェクトごとのドキュメント作成の実態を考慮してドキュメント間の特徴量を求め、その求めた特徴量に基づき、関連ドキュメントを抽出することができ、さらには、ドキュメントの登録通知を、その登録したドキュメントに関連するドキュメントの閲覧履歴のあるユーザに通知することが可能になる。 According to the present invention, it is possible to obtain a feature amount between documents in consideration of the actual state of document creation for each development project, extract a related document based on the obtained feature amount, and further notify the registration of a document. Can be notified to a user who has a browsing history of a document related to the registered document.

以下、図面を参照して本発明の実施形態について詳しく説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明の実施形態に係るドキュメント管理装置の機能ブロックの構成の例を示した図である。図１に示すように、ドキュメント管理装置１は、端末装置２に接続され、その機能ブロックとして、入力処理部１１、関連ドキュメント抽出処理部１２、出力処理部１３、ドキュメントＤＢ（ＤＢはデータベースの略）１４などを含んで構成される。 FIG. 1 is a diagram showing an example of a functional block configuration of a document management apparatus according to an embodiment of the present invention. As shown in FIG. 1, a document management apparatus 1 is connected to a terminal apparatus 2 and includes, as its functional blocks, an input processing unit 11, a related document extraction processing unit 12, an output processing unit 13, and a document DB (DB is an abbreviation for a database). ) 14 and the like.

ここで、入力処理部１１は、端末装置２から入力される様々なデータを受け付け、その入力データをドキュメントＤＢ１４に格納する機能を有し、さらに、ドキュメント登録部１１１、ドキュメント検索・閲覧部１１２、ユーザ情報入力部１１３、ユーザ認証部１１４、重み付け入力部１１５、関連閾値設定部１１６などの下位の機能ブロックを含んで構成される。 Here, the input processing unit 11 has a function of receiving various data input from the terminal device 2 and storing the input data in the document DB 14, and further includes a document registration unit 111, a document search / browsing unit 112, It includes lower-level functional blocks such as a user information input unit 113, a user authentication unit 114, a weighting input unit 115, and a related threshold value setting unit 116.

また、関連ドキュメント抽出処理部１２は、入力処理部１１から入力されるデータにより指定された基準ドキュメントについて、その関連ドキュメントを抽出する機能を有し、形態素解析部１２１、キーワード集計部１２２、主成分分析部１２３、関連ドキュメントマッチング部１２４などの下位の機能ブロックを含んで構成される。 The related document extraction processing unit 12 has a function of extracting a related document for a reference document specified by data input from the input processing unit 11, and includes a morphological analysis unit 121, a keyword totaling unit 122, a main component It is configured to include lower functional blocks such as an analysis unit 123 and a related document matching unit 124.

また、出力処理部１３は、関連ドキュメント抽出処理部１２で抽出された関連ドキュメントに基づき、ドキュメントの登録通知、または、関連ドキュメントの名称の一覧を、端末装置２に表示する機能を有し、関連ドキュメント登録通知部１３１、関連ドキュメント表示部１３２などの下位の機能ブロックを含んで構成される。 The output processing unit 13 has a function of displaying a document registration notification or a list of related document names on the terminal device 2 based on the related document extracted by the related document extraction processing unit 12. It includes lower functional blocks such as a document registration notification unit 131 and a related document display unit 132.

また、ドキュメントＤＢ１４は、ドキュメントそのもの（ドキュメントデータ１４１）を蓄積しておくほか、関連ドキュメント抽出処理部１２などで使用する様々なテーブルを記憶する機能を有する。ここで、その様々なテーブルとしては、ユーザ情報テーブル１４２、ドキュメント解析結果テーブル１４３、主成分分析結果テーブル１４４、閲覧履歴テーブル１４５、重み付けテーブル１４６、関連閾値テーブル１４７などがある。 The document DB 14 has a function of storing various tables used in the related document extraction processing unit 12 and the like in addition to storing the documents themselves (document data 141). Here, as the various tables, there are a user information table 142, a document analysis result table 143, a principal component analysis result table 144, a browsing history table 145, a weighting table 146, a related threshold value table 147, and the like.

以上のような構成を有するドキュメント管理装置１は、図示しない処理装置（以下、ＣＰＵ（Central Processing Unit）という）と記憶装置とを備えたいわゆるコンピュータによって構成される。この場合、記憶装置は、高速の半導体メモリであるＲＡＭ（Random Access Memory）や大容量の磁気メモリであるハードディスク装置などによって構成される。 The document management apparatus 1 having the above-described configuration is configured by a so-called computer including a processing device (hereinafter referred to as a CPU (Central Processing Unit)) and a storage device (not shown). In this case, the storage device includes a RAM (Random Access Memory) that is a high-speed semiconductor memory, a hard disk device that is a large-capacity magnetic memory, and the like.

入力処理部１１、関連ドキュメント抽出処理部１２および出力処理部１３のそれぞれを構成する下位の機能ブロックの機能については、図２以下の図を用いて詳しく説明するが、それらの機能ブロックの機能は、前記ＣＰＵが、前記記憶装置に格納されたそれぞれの機能ブロックに対応する所定のプログラムを、実行することによって実現される。また、ドキュメントＤＢ１４は、前記記憶装置上に構成される。 The functions of the lower-level functional blocks constituting each of the input processing unit 11, the related document extraction processing unit 12, and the output processing unit 13 will be described in detail with reference to FIG. The CPU is implemented by executing a predetermined program corresponding to each functional block stored in the storage device. The document DB 14 is configured on the storage device.

なお、図１では、ドキュメント管理装置１は、１つのコンピュータにより実現されるものとしているが、ＬＡＮ（Local Area Network）などのネットワークを介して相互に接続された複数のコンピュータにより実現されても構わない。例えば、ドキュメントＤＢ１４がデータベース管理コンピュータなどによって独立して構成されていてもよい。 In FIG. 1, the document management apparatus 1 is realized by a single computer, but may be realized by a plurality of computers connected to each other via a network such as a LAN (Local Area Network). Absent. For example, the document DB 14 may be configured independently by a database management computer or the like.

また、端末装置２は、キーボード、マウス、液晶ディスプレイなどを含んで構成され、ドキュメント管理装置１の入出力装置として用いられる。このとき、ドキュメント管理装置１に対し、複数の端末装置２が接続されてもよく、また、ドキュメント管理装置１と端末装置２とは、ＬＡＮなどのネットワークを介して接続されていてもよい。また、端末装置２は、パソコンなどのコンピュータによって構成され、ユーザがドキュメントを作成する装置であってもよい。 The terminal device 2 includes a keyboard, a mouse, a liquid crystal display, and the like, and is used as an input / output device of the document management device 1. At this time, a plurality of terminal devices 2 may be connected to the document management device 1, and the document management device 1 and the terminal device 2 may be connected via a network such as a LAN. Further, the terminal device 2 may be configured by a computer such as a personal computer, and a device in which a user creates a document.

図２は、ユーザ情報入力部１１３によって端末装置２に表示されるユーザ情報入力画面の例を示した図である。図２に示すように、ユーザ情報入力画面２１には、ユーザ名、ユーザＩＤ、パスワード、プロジェクト名、所属、電話番号などを入力するための入力ボックスと、送信ボタン２１１とが表示される。 FIG. 2 is a diagram illustrating an example of a user information input screen displayed on the terminal device 2 by the user information input unit 113. As shown in FIG. 2, the user information input screen 21 displays an input box for inputting a user name, user ID, password, project name, affiliation, telephone number, and the like, and a send button 211.

そこで、ドキュメント管理装置１のユーザ（以下、単に、ユーザという）の登録をしようとする者が、ユーザ情報入力画面２１に表示された入力ボックスに必要なデータを入力し、送信ボタン２１１をクリックすると、ドキュメント管理装置１のＣＰＵ（以下、単に、ＣＰＵという）は、入力ボックスから入力されたユーザ名、ユーザＩＤ、パスワード、プロジェクト名、所属、電話番号などの情報をユーザ情報テーブル１４２に登録する。これにより、ドキュメント管理装置１へのユーザ登録が行われたことになる。 Therefore, when a person who wants to register a user of the document management apparatus 1 (hereinafter simply referred to as a user) inputs necessary data in the input box displayed on the user information input screen 21 and clicks the send button 211. The CPU of the document management apparatus 1 (hereinafter simply referred to as “CPU”) registers information such as a user name, user ID, password, project name, affiliation, and telephone number input from the input box in the user information table 142. As a result, user registration in the document management apparatus 1 is performed.

図３は、ユーザ情報テーブル１４２の構成の例を示した図である。ユーザ情報テーブル１４２は、ドキュメント管理装置１に登録されたユーザに係る情報を記憶するテーブルであり、図３に示すように、ユーザ情報入力画面２１を介して入力されるユーザ名、ユーザＩＤ、パスワード、プロジェクト名、所属、電話番号などを格納するフィールドによって構成される。 FIG. 3 is a diagram showing an example of the configuration of the user information table 142. The user information table 142 is a table for storing information related to users registered in the document management apparatus 1. As shown in FIG. 3, the user name, user ID, and password input via the user information input screen 21 are stored in the user information table 142. It consists of fields that store the project name, affiliation, telephone number, and so on.

ここで、ユーザ情報テーブル１４２にはプロジェクト名が含まれているが、そのプロジェクト名は、ユーザ登録時にユーザが所属するプロジェクトの名称であるとする。なお、ユーザ登録後にユーザの所属プロジェクトが変更になった場合には、そのユーザが、ユーザ情報入力画面２１と同様の入力画面（図示せず）を用いて、その所属プロジェクトを変更するものとする。 Here, although the project name is included in the user information table 142, the project name is assumed to be the name of the project to which the user belongs at the time of user registration. When the user's affiliation project is changed after the user registration, the user changes the affiliation project using an input screen (not shown) similar to the user information input screen 21. .

図４は、ユーザ認証部１１４によって端末装置２に表示されるユーザ認証画面の例を示した図である。図４に示すように、ユーザ認証画面２２には、ユーザＩＤおよびパスワードを入力するための入力ボックスと、ログインボタン２２１とが表示される。 FIG. 4 is a diagram illustrating an example of a user authentication screen displayed on the terminal device 2 by the user authentication unit 114. As shown in FIG. 4, the user authentication screen 22 displays an input box for inputting a user ID and a password, and a login button 221.

ユーザがドキュメント管理装置１へログインする場合には、ユーザ認証画面２２から自分のユーザＩＤおよびパスワードを入力し、ログインボタン２２１をクリックする。すると、ＣＰＵは、入力ボックスから入力されたユーザＩＤとパスワードがユーザ情報テーブル１４２に登録されているか否かを判定し、登録されている場合には、そのユーザのログインを許可する。また、登録されていない場合には、ログインを拒否する。 When the user logs in to the document management apparatus 1, his / her user ID and password are input from the user authentication screen 22 and the login button 221 is clicked. Then, the CPU determines whether or not the user ID and password input from the input box are registered in the user information table 142, and if registered, allows the user to log in. If it is not registered, login is refused.

図５は、ドキュメント登録部１１１によって端末装置２に表示されるドキュメント登録画面の例を示した図である。図４に示すように、ドキュメント登録画面２３には、ログイン中のユーザ名（例えば、「山田太郎」）と、ドキュメント名、添付ファイルパスおよびプロジェクト名を入力するための入力ボックスと、登録ボタン２３１と、参照ボタン２３２とが表示される。 FIG. 5 is a diagram illustrating an example of a document registration screen displayed on the terminal device 2 by the document registration unit 111. As shown in FIG. 4, the document registration screen 23 includes a login user name (for example, “Taro Yamada”), an input box for inputting a document name, an attached file path, and a project name, and a registration button 231. And a reference button 232 is displayed.

ユーザが、そのそれぞれの入力ボックスに、ドキュメントデータベース１４に登録しようとしているドキュメントのドキュメント名（例えば、「アーキテクチャ設計書」）、そのドキュメントが存在するファイルパス名（例えば、「C:\Documents\Doc」）、および、そのドキュメントが属するプロジェクト名（例えば、「０７年モデル」）を入力すると、ＣＰＵは、その入力ボックスに入力されたドキュメント名および添付ファイルパス名によって指定されるドキュメント（例えば、「C:\Documents\Doc\アーキテクチャ設計書」）をドキュメントデータ１４１に登録する。 In the respective input boxes, the document name of the document to be registered in the document database 14 (for example, “architecture design document”) and the file path name where the document exists (for example, “C: \ Documents \ Doc ”) And the name of the project to which the document belongs (for example,“ 07 model ”), the CPU inputs the document specified by the document name and attachment path name input in the input box (for example,“ C: \ Documents \ Doc \ architecture design document ") is registered in the document data 141.

なお、ＣＰＵは、以上のドキュメントの登録処理の後に引き続いて、その登録したドキュメントに関連するドキュメントをドキュメントＤＢ１４の中から抽出する処理（関連ドキュメント抽出処理）を実行する。関連ドキュメント抽出処理は、関連ドキュメント抽出処理部１２により実行される処理であるが、その詳細については後記する。 The CPU executes a process of extracting a document related to the registered document from the document DB 14 (related document extraction process) following the above document registration process. The related document extraction process is a process executed by the related document extraction processing unit 12, and details thereof will be described later.

また、ユーザが、ドキュメント登録画面２３で参照ボタン２３２をクリックした場合には、ＣＰＵは、登録しようとしているドキュメントが属するファイルシステムのファイルパス構造を、所定のＧＵＩ（Graphical User Interface）に従って表示する。その場合、ユーザは、そのＧＵＩを利用することにより、容易に添付ファイルパスを探索したり選択したりすることができる。なお、そのファイルシステムは、ドキュメント管理装置１の記憶装置に構成されたファイルシステムに限定されるものではなく、端末装置２がコンピュータで構成され、ドキュメント作成に用いられるような場合には、端末装置２の記憶装置に構成されたファイルシステムであってもよい。 When the user clicks the reference button 232 on the document registration screen 23, the CPU displays the file path structure of the file system to which the document to be registered belongs according to a predetermined GUI (Graphical User Interface). In that case, the user can easily search for or select an attachment file path by using the GUI. Note that the file system is not limited to the file system configured in the storage device of the document management device 1, and in the case where the terminal device 2 is configured by a computer and used for document creation, the terminal device. It may be a file system configured in two storage devices.

図６は、ドキュメントデータ１４１の構成の例を示した図である。ドキュメントデータ１４１は、ドキュメントＤＢ１４に登録されたドキュメントの全体であり、各ドキュメントのデータは、ドキュメント名、プロジェクト名、登録者名およびドキュメント実体データを含んで構成される。ここで、ドキュメント実体データは、実体データそのものではなく、そのドキュメント実体データを記憶したファイルの名称であってもよい。 FIG. 6 is a diagram showing an example of the configuration of the document data 141. The document data 141 is the entire document registered in the document DB 14, and each document data includes a document name, a project name, a registrant name, and document entity data. Here, the document entity data may not be the entity data itself but the name of the file storing the document entity data.

図７は、ドキュメント検索・閲覧部１１２によって端末装置２に表示されるドキュメント検索画面の例を示した図である。図７に示すように、ドキュメント検索画面２４には、まず、ログイン中のユーザ名（例えば、「山田太郎」）と、ドキュメントを検索するためのキーワードを入力するための入力ボックスと、検索ボタン２４１とが表示される。 FIG. 7 is a diagram illustrating an example of a document search screen displayed on the terminal device 2 by the document search / view unit 112. As shown in FIG. 7, on the document search screen 24, first, the name of a logged-in user (for example, “Taro Yamada”), an input box for inputting a keyword for searching for a document, and a search button 241 are displayed. Is displayed.

そこで、ユーザがキーワードの入力ボックスにドキュメントを検索するためのキーワード（図示省略）を入力し、検索ボタン２４１をクリックすると、ＣＰＵは、入力されたキーワードに基づき、ドキュメントＤＢ１４に登録されているドキュメントを検索する。このとき、その検索に用いられる検索方法は、キーワードを用いた全文検索であっても、あるいは、キーワードに基づく他の検索方法であってもよい。 Therefore, when the user inputs a keyword (not shown) for searching for a document in the keyword input box and clicks the search button 241, the CPU selects a document registered in the document DB 14 based on the input keyword. Search for. At this time, the search method used for the search may be a full-text search using a keyword, or another search method based on a keyword.

ＣＰＵは、その検索により、入力されたキーワードに該当するドキュメントを抽出した場合には、検索結果ドキュメント一覧のポップアップ画面２４２を表示し、その中に、抽出したドキュメントのドキュメント名の一覧（検索結果ドキュメント一覧）と、閲覧ボタン２４３と、関連ボタン２４４とを表示する。 When the CPU extracts a document corresponding to the input keyword by the search, the CPU displays a pop-up screen 242 of the search result document list, in which a list of document names of the extracted documents (search result document) is displayed. List), a browse button 243, and a related button 244.

次に、ユーザがそのポップアップ画面２４２に表示されたドキュメント名の一覧から１つのドキュメント名（例えば、「操作マニュアル」）を選択し、閲覧ボタン２４３をクリックすると、ＣＰＵは、その選択されたドキュメント名を有するドキュメントの内容を、新たに開く表示画面上などに表示する。このとき、ＣＰＵは、その閲覧者、つまり、ログイン中のユーザのユーザ名（例えば、「山田太郎」）と、その閲覧されたドキュメントのドキュメント名（例えば、「操作マニュアル」）と、そのときの日時情報とを対応付けて閲覧履歴データとし、閲覧履歴テーブル１４５に蓄積する。 Next, when the user selects one document name (for example, “operation manual”) from the list of document names displayed on the pop-up screen 242 and clicks the browse button 243, the CPU selects the selected document name. The contents of the document having the are displayed on a newly opened display screen or the like. At this time, the CPU, the user name of the user who is logged in (for example, “Taro Yamada”), the document name of the viewed document (for example, “Operation Manual”), and the current time The date and time information is associated with each other as browsing history data and stored in the browsing history table 145.

また、ユーザがポップアップ画面２４２に表示されたドキュメント名の一覧から１つのドキュメント名を選択し、関連ボタン２４４をクリックすると、ＣＰＵは、その選択されたドキュメント名を有するドキュメントに関連するドキュメントを、ドキュメントＤＢ１４から抽出する処理（関連ドキュメント抽出処理）を実行する。 When the user selects one document name from the list of document names displayed on the pop-up screen 242 and clicks the related button 244, the CPU selects a document related to the document having the selected document name as the document. Processing to extract from the DB 14 (related document extraction processing) is executed.

図８は、閲覧履歴テーブルの構成の例を示した図である。閲覧履歴テーブルは、ドキュメントＤＢ１４に登録されたドキュメントが閲覧された履歴を蓄積したテーブルであり、閲覧者名、閲覧ドキュメント名、閲覧日時を格納するフィールドによって構成される。そして、閲覧履歴テーブル１４５への閲覧履歴データの蓄積（記録）は、ログイン中のユーザがドキュメント検索画面２４のポップアップ画面２４２で選択したドキュメントを閲覧したときに行われる。 FIG. 8 is a diagram showing an example of the configuration of the browsing history table. The browsing history table is a table in which a history of browsing documents registered in the document DB 14 is accumulated, and includes a field for storing a viewer name, a browsing document name, and a browsing date. The browsing history data is stored (recorded) in the browsing history table 145 when the logged-in user browses the document selected on the pop-up screen 242 of the document search screen 24.

後記するように、本実施形態では、閲覧履歴テーブル１４５は、ドキュメントとユーザとを関連付けるデータとして用いられる。すなわち、本実施形態では、あるユーザ「Ａ」があるドキュメント「Ｂ」を閲覧したという履歴に基づき、ユーザ「Ａ」は、ドキュメント「Ｂ」に関連した業務を担当していると判断する。さらに、ドキュメント「Ｂ」に関連するドキュメント「Ｃ」は、ユーザ「Ａ」の業務に関連していると判断する。 As will be described later, in the present embodiment, the browsing history table 145 is used as data associating a document with a user. In other words, in the present embodiment, based on the history that a certain user “A” has browsed a document “B”, the user “A” determines that he / she is in charge of work related to the document “B”. Further, it is determined that the document “C” related to the document “B” is related to the job of the user “A”.

続いて、図９以下の図を用いて、関連ドキュメント抽出処理の詳細について説明する。関連ドキュメント抽出処理では、基準となるドキュメントに関連するドキュメントをドキュメントＤＢ１４の中から抽出するが、その処理の入力データは、ドキュメントデータ１４１、基準となるドキュメントの名称、重み付けテーブル１４６、関連閾値テーブル１４７などである。 Next, details of the related document extraction process will be described with reference to FIG. In the related document extraction process, a document related to a reference document is extracted from the document DB 14. Input data of the process includes document data 141, a reference document name, a weighting table 146, and a related threshold value table 147. Etc.

ここで、基準となるドキュメントの名称とは、ドキュメント登録画面２３（図５参照）により入力されたドキュメント名、または、ドキュメント検索画面２４（図７参照）のポップアップ画面２４２で選択されたドキュメントのドキュメント名である。また、重み付けテーブル１４６、関連閾値テーブル１４７を構成するデータは、関連ドキュメント抽出処理を実行する前に、それぞれ、重み付け入力部１１５、関連閾値設定部１１６の処理を介して、ドキュメント管理装置１の管理者（以下、システム管理者という）よって設定される。 Here, the name of the reference document is the document name entered on the document registration screen 23 (see FIG. 5) or the document of the document selected on the pop-up screen 242 of the document search screen 24 (see FIG. 7). Name. Further, the data constituting the weighting table 146 and the related threshold value table 147 are managed by the document management apparatus 1 through the processes of the weighting input unit 115 and the related threshold value setting unit 116, respectively, before executing the related document extraction process. (Hereinafter referred to as a system administrator).

図９は、重み付け入力部１１５によって端末装置２に表示される重み付け入力画面の例を示した図、また、図１０は、重み付けテーブル１４６の構成の例を示した図である。図９（ａ）に示すように、重み付け入力画面２５には、プロジェクト名の一覧２５１と、キーワードの重み付け値の一覧２５２と、新規登録ボタン２５３と、変更ボタン２５４と、削除ボタン２５５とが表示される。 FIG. 9 is a diagram showing an example of a weighting input screen displayed on the terminal device 2 by the weighting input unit 115, and FIG. 10 is a diagram showing an example of the configuration of the weighting table 146. As shown in FIG. 9A, the weighting input screen 25 displays a project name list 251, a keyword weighting value list 252, a new registration button 253, a change button 254, and a delete button 255. Is done.

ここで、キーワードおよびその重み付け値は、ドキュメントの特徴量を抽出するためにシステム管理者により事前に設定されるデータであり、本実施形態では、キーワードの重み付け値は、プロジェクトごとに設定される。従って、重み付けテーブル１４６は、図１０に示すように、プロジェクトごとにそれぞれ独立したテーブルとして作成される。 Here, the keyword and its weight value are data set in advance by the system administrator in order to extract the feature amount of the document. In this embodiment, the keyword weight value is set for each project. Therefore, the weighting table 146 is created as an independent table for each project as shown in FIG.

そこで、システム管理者が、重み付け入力画面２５に表示されたプロジェクト名の一覧２５１で、プロジェクト名の１つ、例えば、「０７年モデル」を選択すると、ＣＰＵは、その選択されたプロジェクト名「０７年モデル」を有するプロジェクトに属するキーワードの重み付け値を、重み付けテーブル１４６から取得し、キーワードの重み付け値の一覧２５２として表示する。 Therefore, when the system administrator selects one of the project names, for example, “07 model” in the project name list 251 displayed on the weighting input screen 25, the CPU selects the selected project name “07”. The weight values of keywords belonging to the project having “year model” are acquired from the weight table 146 and displayed as a list 252 of keyword weight values.

次に、システム管理者が、重み付け入力画面２５で新規登録ボタン２５３をクリックすると、ＣＰＵは、図９（ｂ）に示すような新規重み付け登録画面２５６を表示する。このとき、新規重み付け登録画面２５６には、新たなキーワードおよびその重み付け値を入力するための入力ボックスと、登録ボタン２５７とが表示される。 Next, when the system administrator clicks the new registration button 253 on the weighting input screen 25, the CPU displays a new weighting registration screen 256 as shown in FIG. 9B. At this time, on the new weight registration screen 256, an input box for inputting a new keyword and its weight value and a registration button 257 are displayed.

そこで、システム管理者が、新たなキーワードおよび重み付け値をその入力ボックスに入力し、登録ボタン２５７をクリックすると、ＣＰＵは、入力ボックスに入力されたキーワードおよび重み付け値を、前にプロジェクト名の一覧２５２で選択したプロジェクトに対応する重み付けテーブル１４６に登録する。 Therefore, when the system administrator inputs a new keyword and weighting value in the input box and clicks the registration button 257, the CPU displays the keyword and weighting value input in the input box in the project name list 252 before. Are registered in the weighting table 146 corresponding to the project selected in.

また、システム管理者が、重み付け入力画面２５のキーワードの重み付け値の一覧２５２からキーワードを選択し、その選択したキーワードの重み付け値を変更し、変更ボタン２５４をクリックすると、ＣＰＵは、前記選択されたプロジェクトに対応する重み付けテーブル１４６において、そのキーワードの重み付け値を変更する。 Further, when the system administrator selects a keyword from the keyword weight value list 252 on the weight input screen 25, changes the weight value of the selected keyword, and clicks the change button 254, the CPU selects the selected keyword. In the weighting table 146 corresponding to the project, the weighting value of the keyword is changed.

また、システム管理者が、重み付け入力画面２５のキーワードの重み付け値の一覧２５２からキーワードを選択し、削除ボタン２５５をクリックすると、ＣＰＵは、前記選択されたプロジェクトに対応する重み付けテーブル１４６において、そのキーワードおよびその重み付け値を削除する。 When the system administrator selects a keyword from the keyword weight value list 252 on the weight input screen 25 and clicks the delete button 255, the CPU displays the keyword in the weight table 146 corresponding to the selected project. And its weight value.

図１１は、形態素解析部１２１に対する入力および出力の例を示した図である。形態素解析は、入力されるドキュメントに含まれる日本語文章などのテキストについて、そのテキストを文節または品詞に分解する処理であり、公知の処理である。本実施形態の場合、形態素解析部１２１へは、ドキュメントデータベースに登録されているドキュメント（ドキュメントデータ１４１の実体データ）に含まれるテキスト、例えば、「アーキテクチャ設計書」のテキスト１２１１が入力される。なお、ドキュメントに含まれるテキストは、日本語限定されるものではなく、英語などの外国語であってもよい。 FIG. 11 is a diagram illustrating an example of input and output to the morphological analysis unit 121. The morpheme analysis is a process that decomposes a text such as a Japanese sentence included in an input document into a clause or a part of speech, and is a known process. In the case of the present embodiment, text included in a document (substance data of document data 141) registered in the document database, for example, text 1211 of “architecture design document” is input to the morphological analysis unit 121. The text included in the document is not limited to Japanese and may be a foreign language such as English.

形態素解析部１２１へテキスト１２１１のようなドキュメントのテキストが入力されると、ＣＰＵは、その入力されたテキストについて形態素解析処理を実行し、その中から名詞節を抽出する。さらに、ＣＰＵは、その抽出した名詞節から、重み付けテーブル１４６に格納されているキーワードを抽出し、その抽出したキーワード、例えば、図１１の場合、「設計書」、「開発ツール」、「アーキテクチャ設計書」、「関連設計書」などのキーワードを、形態素解析部１２１の出力データとして出力する。 When the text of a document such as the text 1211 is input to the morpheme analysis unit 121, the CPU executes a morpheme analysis process on the input text and extracts a noun clause from the morpheme analysis process. Further, the CPU extracts the keywords stored in the weighting table 146 from the extracted noun clauses, and in the case of FIG. 11, for example, “design document”, “development tool”, “architecture design”. Keywords ”and“ related design documents ”are output as output data of the morphological analysis unit 121.

ＣＰＵは、形態素解析部１２１の処理に引き続いて、キーワード集計部１２２の処理を実行する。すなわち、ＣＰＵは、それぞれのドキュメントごとに、形態素解析部１２１によって出力されるキーワードの出現回数を集計し、その結果をドキュメント解析結果テーブル１４３に格納する。 The CPU executes the process of the keyword totaling unit 122 following the process of the morphological analysis unit 121. That is, the CPU counts the number of appearances of the keywords output by the morphological analysis unit 121 for each document, and stores the result in the document analysis result table 143.

図１２は、ドキュメント解析結果テーブル１４３の構成の例を示した図である。ドキュメント解析結果テーブル１４３は、ドキュメントごとにキーワードの出現回数を集計したテーブルであり、図１２に示すように、ドキュメント名、キーワード、出現回数の各フィールドを含んで構成される。ドキュメント解析結果テーブル１４３へのデータの格納は、前記の通り、キーワード集計部１２２の処理によって行われる。 FIG. 12 is a diagram showing an example of the configuration of the document analysis result table 143. The document analysis result table 143 is a table in which the number of occurrences of a keyword is tabulated for each document, and includes each field of a document name, a keyword, and the number of appearances as shown in FIG. Data is stored in the document analysis result table 143 by the processing of the keyword totaling unit 122 as described above.

図１３は、主成分分析部１２３に対する入力および出力の例を示した図である。主成分分析処理は、公知の処理であるが、本実施形態の場合、ドキュメントの特徴量を算出する処理として用いられる。そのため、その主成分分析処理の入力データとしては、図１３に示すように、キーワード集計部１２２により集計したドキュメントごとのキーワードの出現回数と、キーワードの重み付けテーブル１４６から得られるキーワードの重み付け値とが用いられる。 FIG. 13 is a diagram illustrating an example of input and output to the principal component analysis unit 123. The principal component analysis process is a known process, but in the case of the present embodiment, the principal component analysis process is used as a process of calculating a document feature amount. Therefore, as input data of the principal component analysis process, as shown in FIG. 13, the number of occurrences of the keyword for each document, which is aggregated by the keyword aggregation unit 122, and the keyword weighting value obtained from the keyword weighting table 146 are included. Used.

すなわち、ＣＰＵは、各ドキュメントそれぞれについて、そのドキュメントにおけるキーワードの出現回数とその重み付け値とを用いて、主成分分析処理を実行し、第１主成分および第２主成分の値を算出する。そして、その第１主成分および第２主成分の値を主成分分析部１２３の出力とする。 That is, for each document, the CPU executes the principal component analysis process using the number of occurrences of the keyword in the document and the weighted value, and calculates the values of the first principal component and the second principal component. Then, the values of the first principal component and the second principal component are used as the output of the principal component analysis unit 123.

なお、ここでいう第１主成分および第２主成分とは、入力データの主成分分析によって得られる固有値から、その値が大きい順に取った２つの固有値である。また、以上の処理によって算出された第１主成分および第２主成分の値は、ドキュメントの特徴量を表すものとして、主成分分析結果テーブル１４４に格納される。 Note that the first principal component and the second principal component here are two eigenvalues taken from the eigenvalues obtained by principal component analysis of the input data in descending order. Further, the values of the first principal component and the second principal component calculated by the above processing are stored in the principal component analysis result table 144 as representing the feature amount of the document.

また、図１０に示したように、キーワードの重み付け値がプロジェクトごとに独立して設定されるので、主成分分析処理は、そのキーワードの重み付け値に応じてプロジェクトごとに独立して行われる。 Also, as shown in FIG. 10, since the keyword weighting value is set independently for each project, the principal component analysis process is performed independently for each project according to the keyword weighting value.

図１４は、主成分分析結果テーブル１４４の構成の例を示した図である。主成分分析結果テーブル１４４は、ドキュメントごとにそのドキュメントの特徴量、つまり、第１主成分および第２主成分を記憶したテーブルであり、図１４に示すように、プロジェクト名、ドキュメント名、第１主成分、第２主成分の各フィールドを含んで構成される。なお、このとき、プロジェクト名は、主成分分析処理の入力データの重み付け値（図１３参照）として、重み付けテーブル１４６（図１０参照）のうち、どのプロジェクトの重み付け値を用いたかを表している。 FIG. 14 is a diagram showing an example of the configuration of the principal component analysis result table 144. The principal component analysis result table 144 is a table storing the document feature amount, that is, the first principal component and the second principal component for each document. As shown in FIG. 14, the project name, document name, first Each field includes the main component and the second main component. At this time, the project name indicates which project weight value is used in the weighting table 146 (see FIG. 10) as the weight value (see FIG. 13) of the input data of the principal component analysis process.

図１５は、関連ドキュメントマッチング部１２４により行われる関連ドキュメントマッチングの概念を説明するための図である。関連ドキュメントマッチングは、基準となるドキュメントに関連する関連ドキュメントを抽出する処理である。ここでは、ドキュメント間の関連度をドキュメントの特徴量、つまり、主成分分析部１２３により求めた第１主成分および第２主成分を利用する。 FIG. 15 is a diagram for explaining the concept of related document matching performed by the related document matching unit 124. Related document matching is a process of extracting related documents related to a reference document. Here, the degree of association between documents is used as the feature amount of the document, that is, the first principal component and the second principal component obtained by the principal component analysis unit 123.

すなわち、関連ドキュメントマッチングでは、図１５に示すように、第１主成分をｘ座標、第２主成分をｙ座標として表し、そのｘ−ｙ座標系（ｘ−ｙ平面）で、各ドキュメントの特徴量（第１主成分，第２主成分）が表す点をプロットする。そして、そのドキュメントの特徴量が表す点間の距離が小さいものほど、ドキュメントの関連度が大きいと判定する。そこで、関連ドキュメントマッチングの処理では、基準となるドキュメントとその特徴量が表す点間の距離があらかじめ設定された閾値以内のドキュメントを、関連ドキュメントとして抽出する。なお、本実施形態では、この閾値を関連閾値と呼ぶ。 That is, in the related document matching, as shown in FIG. 15, the first principal component is expressed as x-coordinate and the second principal component is expressed as y-coordinate, and the feature of each document is expressed in the xy coordinate system (xy plane). The points represented by the quantities (first principal component, second principal component) are plotted. Then, it is determined that the smaller the distance between the points represented by the feature amount of the document, the greater the degree of relevance of the document. Therefore, in the related document matching process, a document whose distance between the reference document and the point represented by the feature amount is within a preset threshold is extracted as a related document. In the present embodiment, this threshold value is referred to as a related threshold value.

例えば、図１５において、基準となるドキュメントが「アーキテクチャ設計書」であった場合、「アーキテクチャ設計書」の特徴量が表す点と、「外部仕様書」の特徴量が表す点とが、関連閾値以下であったときには、「外部仕様書」は、「アーキテクチャ設計書」に関連するドキュメントとして抽出される。一方、「アーキテクチャ設計書」の特徴量が表す点と、その他のドキュメント（図の例では、「計画書」および「操作マニュアル」）の特徴量が表す点とが、いずれも、関連閾値より大きかったときには、それらのドキュメントは、「アーキテクチャ設計書」に関連するドキュメントとして抽出されない。 For example, in FIG. 15, when the reference document is “architecture design document”, the point represented by the feature quantity of “architecture design document” and the point represented by the feature quantity of “external specification” are related threshold values. When it is the following, the “external specification” is extracted as a document related to the “architecture design”. On the other hand, the points represented by the feature values of the “architecture design document” and the points represented by the feature values of the other documents (“plan” and “operation manual” in the example in the figure) are both greater than the relevant threshold value. The documents are not extracted as documents related to the “architecture design document”.

なお、図１０に示したように、キーワードの重み付け値がプロジェクトごとに独立して設定され、また、図１４に示すように、主成分分析結果テーブル１４４も、実質的にプロジェクトごとに独立して作成される。従って、関連ドキュメントマッチング処理も、そのキーワードの重み付け値に応じてプロジェクトごとに独立して行われる。 As shown in FIG. 10, the keyword weighting value is set independently for each project. As shown in FIG. 14, the principal component analysis result table 144 is also substantially independent for each project. Created. Therefore, the related document matching process is also performed independently for each project according to the weight value of the keyword.

その結果、図１５に示した特徴量を表すｘ−ｙ座標系は、プロジェクトごとに独立に生成されることになり、関連ドキュメントマッチング処理では、プロジェクトごとに異なった関連ドキュメントが抽出され得ることになる。しかしながら、プロジェクトごとに異なった関連ドキュメントが抽出されることこそ、各プロジェクトにおけるドキュメント作成の実態を反映した結果といえる。 As a result, the xy coordinate system representing the feature amount shown in FIG. 15 is generated independently for each project, and different related documents can be extracted for each project in the related document matching process. Become. However, it can be said that the fact that different related documents are extracted for each project reflects the actual state of document creation in each project.

なお、以上の説明では、特徴量は、第１主成分および第２主成分による２次元のデータであるとしているが、第１主成分だけを用いた１次次元のデータ、または、第１主成分および第２主成分に第３主成分などを加えた３次元以上のデータであってもよい。 In the above description, the feature amount is two-dimensional data using the first principal component and the second principal component. However, the first-dimensional data using only the first principal component or the first principal component is used. It may be three-dimensional or more data obtained by adding a third principal component to the component and the second principal component.

図１６は、関連閾値設定部１１６によって端末装置２に表示される関連閾値設定画面の例を示した図、また、図１７は、関連閾値テーブル１４７の構成の例を示した図である。図１６に示すように、関連閾値設定画面２６には、プロジェクト名および関連閾値を入力する入力ボックスと、登録ボタン２６１とが表示される。 FIG. 16 is a diagram illustrating an example of a related threshold setting screen displayed on the terminal device 2 by the related threshold setting unit 116, and FIG. 17 is a diagram illustrating an example of a configuration of the related threshold value table 147. As shown in FIG. 16, an input box for inputting a project name and a related threshold value and a registration button 261 are displayed on the related threshold value setting screen 26.

そこで、システム管理者が、関連閾値設定画面２６でプロジェクト名および関連閾値を入力ボックスに入力し、登録ボタン２６１を登録すると、その入力ボックスに入力されたプロジェクト名および関連閾値が、関連閾値テーブル１４７に登録される。このとき、関連閾値テーブル１４７は、プロジェクト名と関連閾値とを含んだフィールドによって構成される。 Therefore, when the system administrator inputs the project name and the related threshold value in the input box on the related threshold value setting screen 26 and registers the registration button 261, the project name and the related threshold value input in the input box are stored in the related threshold value table 147. Registered in At this time, the related threshold value table 147 includes fields including the project name and the related threshold value.

なお、関連閾値テーブル１４７を用いて関連閾値を関連閾値テーブル１４７に登録する処理は、関連ドキュメント抽出処理が実行される前に、あらかじめ実行されておくものとする。このとき、関連閾値の設定値は、システム管理者によって経験的に定められる。また、関連閾値は、ドキュメントごとに設定されるのではなく、プロジェクトごとに一括して設定される。 Note that the process of registering the related threshold value in the related threshold value table 147 using the related threshold value table 147 is executed in advance before the related document extraction process is executed. At this time, the setting value of the related threshold is empirically determined by the system administrator. In addition, the related threshold is not set for each document, but is set for each project in a lump.

図１８は、関連ドキュメント登録通知部１３１によって端末装置２に表示される関連ドキュメント登録通知画面の例を示した図である。図１８に示した関連ドキュメント登録通知画面２７は、図５のドキュメント登録画面２３でログイン中の「山田太郎」が「アーキテクチャ設計書」を登録したとき、関連ドキュメント抽出処理部１２によって「アーキテクチャ設計書」に関連する関連ドキュメントとして「外部仕様書」が抽出されたときの結果を表示したものである。なお、このとき抽出される関連ドキュメントは、１つの限定されることはなく、複数抽出されてもよい。 FIG. 18 is a diagram illustrating an example of a related document registration notification screen displayed on the terminal device 2 by the related document registration notification unit 131. The related document registration notification screen 27 shown in FIG. 18 is displayed by the related document extraction processing unit 12 when the “Taro Yamada” who is logged in on the document registration screen 23 of FIG. "External specification" is extracted as a related document related to "", and the result is displayed. In addition, the related document extracted at this time is not limited to one, and a plurality of related documents may be extracted.

ＣＰＵは、関連ドキュメント抽出処理部１２から関連ドキュメント名「外部仕様書」を受け取ると、関連ドキュメント登録通知部１３１の処理として、閲覧履歴テーブル１４５（図８参照）から関連ドキュメント「外部仕様書」の閲覧者「中村花子」を抽出する。なお、このとき抽出する閲覧者は、「中村花子」１人に限定されることはなく、複数人であってもよい。ちなみに、図８の閲覧履歴テーブル１４５の例では、「山田太郎」も「外部仕様書」の閲覧者である。 When the CPU receives the related document name “external specification” from the related document extraction processing unit 12, as a process of the related document registration notification unit 131, the CPU stores the related document “external specification” from the browsing history table 145 (see FIG. 8). The viewer “Hanako Nakamura” is extracted. Note that the number of viewers extracted at this time is not limited to one “Hanako Nakamura”, and may be a plurality of people. Incidentally, in the example of the browsing history table 145 in FIG. 8, “Taro Yamada” is also a viewer of “external specifications”.

続いて、ＣＰＵは、閲覧者（例えば、「中村花子」）がログインした端末装置２に関連ドキュメント登録通知画面２７を表示する。その関連ドキュメント登録通知画面２７には、閲覧者「中村花子」の業務に関連すると判断される関連ドキュメント「アーキテクチャ設計書」が登録されたことを通知するメッセージが表示される。また、関連ドキュメント登録通知画面２７には、参照ボタン２７１が併せて表示され、閲覧者「中村花子」が参照ボタン２７１をクリックすると、ＣＰＵは、その関連ドキュメント「アーキテクチャ設計書」の内容を端末装置２に表示する。 Subsequently, the CPU displays a related document registration notification screen 27 on the terminal device 2 to which a viewer (for example, “Hanako Nakamura”) has logged in. The related document registration notification screen 27 displays a message notifying that the related document “architecture design document” determined to be related to the work of the viewer “Hanako Nakamura” has been registered. Further, a reference button 271 is also displayed on the related document registration notification screen 27. When the viewer “Hanako Nakamura” clicks the reference button 271, the CPU displays the contents of the related document “architecture design document” as a terminal device. 2 is displayed.

図１９は、関連ドキュメント表示部１３２によって端末装置２に表示される関連ドキュメント表示画面の例を示した図である。図１９に示した関連ドキュメント表示画面２８は、図７のドキュメント検索画面２４でログイン中のユーザ「山田太郎」が、そのポップアップ画面２４２の検索結果ドキュメント一覧から、「操作マニュアル」を選択し、関連ボタン２４４をクリックすることにより、関連ドキュメント抽出処理部１２によって「操作マニュアル」に関連する関連ドキュメントとして「外部仕様書」、「操作マニュアル」などが抽出されたときの結果を表示したものである。 FIG. 19 is a diagram illustrating an example of a related document display screen displayed on the terminal device 2 by the related document display unit 132. In the related document display screen 28 shown in FIG. 19, the user “Taro Yamada” who is logged in on the document search screen 24 of FIG. 7 selects “Operation Manual” from the search result document list on the pop-up screen 242, When the button 244 is clicked, the result when the “external specification”, “operation manual” and the like are extracted as related documents related to the “operation manual” by the related document extraction processing unit 12 is displayed.

すなわち、ＣＰＵは、関連ドキュメント抽出処理部１２から関連ドキュメント名「外部仕様書」、「操作マニュアル」などを受け取ると、関連ドキュメント表示部１３２の処理として、関連ドキュメント表示画面２８を表示し、その中には、ログイン中のユーザ「山田太郎」の業務に関連すると判断される関連ドキュメント「外部仕様書」、「操作マニュアル」などが抽出されたことを通知するメッセージが表示される。このとき、複数の関連ドキュメントがある場合には、基準となるドキュメントと関連度が大きい（図１５に示したｘ−ｙ空間でドキュメントの特徴量が表す点間の距離が近い）ドキュメントを優先して（例えば、上に）表示する。 That is, when the CPU receives the related document name “external specification”, “operation manual”, etc. from the related document extraction processing unit 12, it displays the related document display screen 28 as a process of the related document display unit 132. Displays a message notifying that related documents “external specification”, “operation manual”, and the like that are determined to be related to the work of the logged-in user “Taro Yamada” are extracted. At this time, when there are a plurality of related documents, priority is given to a document having a high degree of relevance to the reference document (the distance between points represented by the document feature amount in the xy space shown in FIG. 15 is short). (For example, above).

また、関連ドキュメント表示画面２８には、参照ボタン２８１が併せて表示されるので、ユーザ「山田太郎」が、その関連ドキュメントの１つ、例えば、「外部仕様書」を選択して、参照ボタン２８１をクリックすると、ＣＰＵは、その選択された関連ドキュメント「外部仕様書」の内容を端末装置２に表示する。 Further, since the reference button 281 is also displayed on the related document display screen 28, the user “Taro Yamada” selects one of the related documents, for example, “external specification”, and the reference button 281 is displayed. When the button is clicked, the CPU displays the contents of the selected related document “external specification” on the terminal device 2.

図２０は、ドキュメント管理装置１に新たなドキュメントが登録されたとき、そのドキュメントの登録通知を、そのドキュメントに関連するユーザに通知する処理の流れの例を示した図である。なお、この処理は、ドキュメント登録部１１１の処理によって開始される処理である。 FIG. 20 is a diagram illustrating an example of a flow of processing for notifying a user related to a document of a registration notification of the document when a new document is registered in the document management apparatus 1. This process is started by the process of the document registration unit 111.

ドキュメント管理装置１のＣＰＵは、まず、ドキュメント登録部１１１の処理として、ドキュメント登録画面２３（図５参照）を介して入力されるドキュメント名を有するドキュメントをドキュメントＤＢ１４に登録する（ステップＳ１）。 First, the CPU of the document management apparatus 1 registers a document having a document name input via the document registration screen 23 (see FIG. 5) in the document DB 14 as processing of the document registration unit 111 (step S1).

次に、ＣＰＵは、形態素解析部１２１の処理として、ステップＳ１で登録したドキュメントも含め、ドキュメントＤＢ１４に登録されている各ドキュメントについて形態素解析を実施し、キーワード（あらかじめ定めた名詞）を抽出し（ステップＳ２）。そして、ＣＰＵは、キーワード集計部１２２の処理として、それぞれのドキュメントについてステップＳ２で抽出したキーワードの出現回数を集計し（ステップＳ３）、ドキュメント解析結果テーブル１４３を作成する。 Next, as a process of the morphological analysis unit 121, the CPU performs morphological analysis on each document registered in the document DB 14 including the document registered in step S1, and extracts keywords (predetermined nouns) ( Step S2). Then, as a process of the keyword totaling unit 122, the CPU totals the number of appearances of the keywords extracted in step S2 for each document (step S3), and creates a document analysis result table 143.

次に、ＣＰＵは、主成分分析部１２３の処理として、各ドキュメントについて、キーワードの出現回数とその重み付け値とに基づき、主成分分析を行い、各ドキュメントの特徴量として、第１主成分および第２主成分を抽出し（ステップＳ４）、主成分分析結果テーブル１４４を作成する。このとき、主成分分析結果テーブル１４４は、プロジェクトごとに作成される。 Next, as a process of the principal component analysis unit 123, the CPU performs a principal component analysis on each document based on the number of occurrences of the keyword and its weight value, and the first principal component and the first principal component are used as the feature amount of each document. Two principal components are extracted (step S4), and a principal component analysis result table 144 is created. At this time, the principal component analysis result table 144 is created for each project.

次に、ＣＰＵは、関連ドキュメントマッチング部１２４の処理として、各ドキュメントの特徴量の第１主成分をｘ座標、第２主成分をｙ座標で表し、そのｘ−ｙ平面上で各ドキュメントの特徴量が表す点間の距離を算出する（ステップＳ５）。そして、ステップＳ１で登録したドキュメントと、その特徴量が表す点間の距離があらかじめ設定された関連閾値以内に含まれるドキュメントを関連ドキュメントして抽出する（ステップＳ６）。なお、ステップＳ５およびステップＳ６は、ステップＳ４で作成される主成分分析結果テーブル１４４に応じて、プロジェクトごとに実行される。 Next, as processing of the related document matching unit 124, the CPU represents the first principal component of the feature amount of each document by the x coordinate and the second principal component by the y coordinate, and features of each document on the xy plane. The distance between the points represented by the quantities is calculated (step S5). Then, the document registered in step S1 and the document whose distance between the points represented by the feature amount is included within the preset related threshold are extracted as related documents (step S6). Steps S5 and S6 are executed for each project according to the principal component analysis result table 144 created in step S4.

次に、ＣＰＵは、関連ドキュメント登録通知部１３１の処理として、閲覧履歴テーブル１４５から、ステップＳ６で抽出した関連ドキュメントを閲覧した履歴のあるユーザを抽出し、その抽出したユーザに、新たなドキュメントが登録されたことを通知する（ステップＳ７）。
また、ドキュメント管理装置１には、ドキュメント検索・閲覧部１１２によって開始される処理もある。その処理の流れは、図２０とほとんど同じなので、図示を省略するが、次の点において相違する。 Next, as a process of the related document registration notification unit 131, the CPU extracts a user who has a history of browsing the related document extracted in step S6 from the browsing history table 145, and a new document is added to the extracted user. The registration is notified (step S7).
Further, the document management apparatus 1 also has a process started by the document search / browsing unit 112. Since the processing flow is almost the same as that in FIG. 20, illustration is omitted, but the following points are different.

まず、ステップ１のドキュメントを登録する処理に代わり、ドキュメント検索・閲覧部１１２による関連ドキュメントの抽出を求める処理になる。この場合、関連ドキュメントを抽出して通知する相手が、その関連ドキュメントの抽出を求めたユーザに限定されるので、ステップＳ４では、ＣＰＵは、そのユーザが属するプロジェクトについてのみ主成分分析を行えばよく、従って、そのユーザが属さないプロジェクトの主成分分析結果テーブル１４４の作成を省略することができる。また、ＣＰＵは、ステップＳ５およびステップＳ６においても、そのユーザが属さないプロジェクトについての処理を省略することができる。さらに、ステップＳ７の処理に代えて、関連ドキュメント表示部１３２により、関連ドキュメントを単に表示するだけの処理で済ませることができる。 First, in place of the process of registering the document in step 1, the process for requesting extraction of related documents by the document search / browsing unit 112 is performed. In this case, since the partner who extracts and notifies the related document is limited to the user who requested the extraction of the related document, the CPU only needs to perform the principal component analysis for the project to which the user belongs in step S4. Therefore, the creation of the principal component analysis result table 144 of the project to which the user does not belong can be omitted. Further, the CPU can omit the process for the project to which the user does not belong in step S5 and step S6. Furthermore, instead of the process of step S7, the related document display unit 132 can simply perform the process of displaying the related document.

なお、以上に説明した処理の流れでは、形態素解析によりキーワードを抽出（ステップＳ２）しているが、本実施形態の場合、キーワードおよびその重み付け値からなる重み付けテーブル１４６を事前に設定するので、必ずしも、形態素解析を行う必要はない。その場合、形態素解析に代えて、重み付けテーブル１４６に記憶されているキーワードのそれぞれについてドキュメントを全文検索する処理などを行えばよい。 In the processing flow described above, keywords are extracted by morphological analysis (step S2). However, in the case of this embodiment, the weighting table 146 including keywords and their weight values is set in advance. There is no need to perform morphological analysis. In that case, instead of the morphological analysis, a process of searching the document for the full text for each of the keywords stored in the weighting table 146 may be performed.

以上のように、本実施形態によれば、システム管理者がプロジェクトの実態に合わせてプロジェクトごとに独立にキーワードの重み付け値を定めることができるので、ドキュメント管理装置１は、関連ドキュメント抽出処理部１２における主成分分析処理により、プロジェクトごとのドキュメント作成の実態を考慮した関連ドキュメントを抽出することができるようになる。 As described above, according to the present embodiment, since the system administrator can independently determine the weighting value of the keyword for each project in accordance with the actual state of the project, the document management apparatus 1 includes the related document extraction processing unit 12. With the principal component analysis processing in, relevant documents can be extracted in consideration of the actual document creation for each project.

本発明の実施形態に係るドキュメント管理装置の機能ブロックの構成の例を示した図。The figure which showed the example of the structure of the functional block of the document management apparatus which concerns on embodiment of this invention. ユーザ情報入力部によって端末装置に表示されるユーザ情報入力画面の例を示した図。The figure which showed the example of the user information input screen displayed on a terminal device by a user information input part. ユーザ情報テーブルの構成の例を示した図。The figure which showed the example of the structure of the user information table. ユーザ認証部によって端末装置に表示されるユーザ認証画面の例を示した図。The figure which showed the example of the user authentication screen displayed on a terminal device by a user authentication part. ドキュメント登録部によって端末装置に表示されるドキュメント登録画面の例を示した図。The figure which showed the example of the document registration screen displayed on a terminal device by a document registration part. ドキュメントデータの構成の例を示した図。The figure which showed the example of the structure of document data. ドキュメント検索・閲覧部によって端末装置に表示されるドキュメント検索画面の例を示した図。The figure which showed the example of the document search screen displayed on a terminal device by a document search and browsing part. 閲覧履歴テーブルの構成の例を示した図。The figure which showed the example of the structure of the browsing history table. 重み付け入力部によって端末装置に表示される重み付け入力画面の例を示した図。The figure which showed the example of the weighting input screen displayed on a terminal device by the weighting input part. 重み付けテーブルの構成の例を示した図。The figure which showed the example of the structure of the weighting table. 形態素解析部に対する入力および出力の例を示した図。The figure which showed the example of the input with respect to a morphological analysis part, and an output. ドキュメント解析結果テーブルの構成の例を示した図。The figure which showed the example of the structure of the document analysis result table. 主成分分析部に対する入力および出力の例を示した図。The figure which showed the example of the input and output with respect to a principal component analysis part. 主成分分析結果テーブルの構成の例を示した図。The figure which showed the example of the structure of the principal component analysis result table. 関連ドキュメントマッチング部により行われる関連ドキュメントマッチングの概念を説明するための図。The figure for demonstrating the concept of the related document matching performed by the related document matching part. 関連閾値設定部によって端末装置に表示される関連閾値設定画面の例を示した図。The figure which showed the example of the related threshold value setting screen displayed on a terminal device by the related threshold value setting part. 関連閾値テーブルの構成の例を示した図。The figure which showed the example of the structure of the related threshold value table. 関連ドキュメント登録通知部によって端末装置に表示される関連ドキュメント登録通知画面の例を示した図。The figure which showed the example of the related document registration notification screen displayed on a terminal device by the related document registration notification part. 関連ドキュメント表示部によって端末装置に表示される関連ドキュメント表示画面の例を示した図。The figure which showed the example of the related document display screen displayed on a terminal device by a related document display part. ドキュメントの登録通知を、そのドキュメントに関連するユーザに通知する処理の流れの例を示した図。The figure which showed the example of the flow of a process which notifies the registration relevant notification to the user relevant to the document.

Explanation of symbols

１ドキュメント管理装置
２端末装置
１１入力処理部
１２関連ドキュメント抽出処理部
１３出力処理部
１４ドキュメントデータベース（ドキュメントＤＢ）
２１ユーザ情報入力画面
２２ユーザ認証画面
２３ドキュメント登録画面
２４ドキュメント検索画面
２５入力画面
２６関連閾値設定画面
２７関連ドキュメント登録通知画面
２８関連ドキュメント表示画面
１１１ドキュメント登録部
１１２ドキュメント検索・閲覧部
１１３ユーザ情報入力部
１１４ユーザ認証部
１１５重み付け入力部
１１６関連閾値設定部
１２１形態素解析部
１２２キーワード集計部
１２３主成分分析部
１２４関連ドキュメントマッチング部
１３１関連ドキュメント登録通知部
１３２関連ドキュメント表示部
１４１ドキュメントデータ
１４２ユーザ情報テーブル
１４３ドキュメント解析結果テーブル
１４４主成分分析結果テーブル
１４５閲覧履歴テーブル
１４６重み付けテーブル
１４７関連閾値テーブル 1 Document Management Device 2 Terminal Device 11 Input Processing Unit 12 Related Document Extraction Processing Unit 13 Output Processing Unit 14 Document Database (Document DB)
21 User Information Input Screen 22 User Authentication Screen 23 Document Registration Screen 24 Document Search Screen 25 Input Screen 26 Related Threshold Setting Screen 27 Related Document Registration Notification Screen 28 Related Document Display Screen 111 Document Registration Unit 112 Document Search / Browsing Unit 113 User Information Input Unit 114 user authentication unit 115 weighting input unit 116 related threshold setting unit 121 morpheme analysis unit 122 keyword counting unit 123 principal component analysis unit 124 related document matching unit 131 related document registration notification unit 132 related document display unit 141 document data 142 user information table 143 Document Analysis Result Table 144 Principal Component Analysis Result Table 145 Browsing History Table 146 Weighting Table 147 Related The value table

Claims

Connected to the terminal device and configured to include a processing device and a storage device. The storage device stores a document database in which a plurality of documents are registered, a keyword for calculating document features, and a weighted value thereof. A weight management table and a relation threshold table storing a relation threshold for determining the degree of relation between the documents,
The processing device is
Based on the data input from the terminal device, the data of the weighting table and the related threshold table is set for each business project in which the document is created,
Based on data input from the terminal device, one document is selected as a reference document from documents registered in the document database,
For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the total number of occurrences of the keyword and the weighting value stored in the weighting table, Perform component analysis, calculate feature values for each document,
When the distance between the coordinate position represented by the feature amount of the document and the coordinate position represented by the feature amount of the reference document is equal to or less than the related threshold value stored in the related threshold value table, the document is extracted as a related document;
The document management apparatus, wherein the name of the extracted related document is displayed on the terminal device.

Connected to the terminal device and configured to include a processing device and a storage device. The storage device stores a document database in which a plurality of documents are registered, a keyword for calculating document features, and a weighted value thereof. A weight management table and a related threshold value table storing a related threshold value for determining the degree of relevance between the documents,
The processing device is
Based on the data input from the terminal device, the data of the weighting table and the related threshold table is set for each business project in which the document is created,
The history of a user who has browsed a document included in the document database is stored in the storage device as a browsing history table,
When an input for instructing registration of a new document is made from the terminal device to the document database, the new document instructed for registration is selected as a reference document,
For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the total number of occurrences of the keyword and the weighting value stored in the weighting table, Perform component analysis, calculate feature values for each document,
When the distance between the coordinate position represented by the feature amount of the document and the coordinate position represented by the feature amount of the reference document is equal to or less than the related threshold value stored in the related threshold value table, the document is extracted as a related document;
With reference to the browsing history table, extract a user who has a history of browsing the extracted related document,
A document management apparatus, wherein a message indicating that the new document has been registered is displayed on the terminal device being used by the extracted user.

Connected to the terminal device and configured to include a processing device and a storage device. The storage device stores a document database in which a plurality of documents are registered, a keyword for calculating document features, and a weighted value thereof. A document management method in a document management apparatus that holds a weighting table and a related threshold value table that stores a related threshold value for determining a degree of relevance between the documents,
The processing device is
Based on the data input from the terminal device, the data of the weighting table and the related threshold table is set for each business project in which the document is created,
Based on data input from the terminal device, one document is selected as a reference document from documents registered in the document database,
For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the total number of occurrences of the keyword and the weighting value stored in the weighting table, Perform component analysis, calculate feature values for each document,
When the distance between the coordinate position represented by the feature amount of the document and the coordinate position represented by the feature amount of the reference document is equal to or less than the related threshold value stored in the related threshold value table, the document is extracted as a related document;
A document management method, wherein the name of the extracted related document is displayed on the terminal device.

Connected to the terminal device and configured to include a processing device and a storage device. The storage device stores a document database in which a plurality of documents are registered, a keyword for calculating document features, and a weighted value thereof. A document management method in a document management apparatus that holds a weighting table and a related threshold value table that stores a related threshold value for determining a degree of relevance between the documents,
The processing device is
Based on the data input from the terminal device, the data of the weighting table and the related threshold table is set for each business project in which the document is created,
The history of a user who has browsed a document included in the document database is stored in the storage device as a browsing history table,
When an input for instructing registration of a new document is made from the terminal device to the document database, the new document instructed for registration is selected as a reference document,
For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the total number of occurrences of the keyword and the weighting value stored in the weighting table, Perform component analysis, calculate feature values for each document,
When the distance between the coordinate position represented by the feature amount of the document and the coordinate position represented by the feature amount of the reference document is equal to or less than the related threshold value stored in the related threshold value table, the document is extracted as a related document;
With reference to the browsing history table, extract a user who has a history of browsing the extracted related document,
A document management method comprising: displaying a message indicating that the new document has been registered on the extracted terminal device being used by the user.

Connected to the terminal device and configured to include a processing device and a storage device. The storage device stores a document database in which a plurality of documents are registered, a keyword for calculating document features, and a weighted value thereof. A program in a document management apparatus that holds a weighting table and a related threshold table that stores a related threshold value for determining a degree of association between the documents,
In the processing device,
Based on data input from the terminal device, processing for setting the data of the weighting table and the related threshold table for each business project in which the document is created,
A process of selecting one document as a reference document from documents registered in the document database based on data input from the terminal device;
For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the total number of occurrences of the keyword and the weighting value stored in the weighting table, A process of performing component analysis and calculating the feature value of each document;
When the distance between the coordinate position represented by the feature amount of the document and the coordinate position represented by the feature amount of the reference document is equal to or less than the associated threshold stored in the associated threshold table, the document is extracted as the associated document When,
Processing for displaying the name of the extracted related document on the terminal device;
A program for running

Connected to the terminal device and configured to include a processing device and a storage device. The storage device stores a document database in which a plurality of documents are registered, a keyword for calculating document features, and a weighted value thereof. A program in a document management apparatus that holds a weighting table and a related threshold table that stores a related threshold value for determining a degree of association between the documents,
In the processing device,
Based on data input from the terminal device, processing for setting the data of the weighting table and the related threshold table for each business project in which the document is created,
A process of accumulating a history of a user who has viewed a document included in the document database in the storage device as a browsing history table;
When an instruction to register a new document in the document database is input from the terminal device, a process of selecting a new document instructed to register as a reference document by the above,
For documents registered in the document database, the number of occurrences of the keyword stored in the weighting table is totaled, and based on the total number of occurrences of the keyword and the weighting value stored in the weighting table, A process of performing component analysis and calculating the feature value of each document;
When the distance between the coordinate position represented by the feature amount of the document and the coordinate position represented by the feature amount of the reference document is equal to or less than the associated threshold stored in the associated threshold table, the document is extracted as the associated document When,
A process of referring to the browsing history table and extracting a user having a history of browsing the extracted related document;
A process of displaying a message indicating that the new document has been registered on the terminal device being used by the extracted user;
A program for running