JP6479232B1

JP6479232B1 - Document management apparatus and document management method

Info

Publication number: JP6479232B1
Application number: JP2018034979A
Authority: JP
Inventors: 弘美南場
Original assignee: Mitsubishi Electric Engineering Co Ltd
Current assignee: Mitsubishi Electric Engineering Co Ltd
Priority date: 2018-02-28
Filing date: 2018-02-28
Publication date: 2019-03-06
Anticipated expiration: 2038-02-28
Also published as: JP2019149117A

Abstract

【課題】属性情報の登録によるユーザ負担の軽減を可能とし、且つ、属性情報のデータ量の増大を抑制可能とする。【解決手段】検索条件が指定された場合に、データ記録部１２から当該検索条件と一致するキーワードが含まれる属性情報を抽出する文書検索部２２と、文書検索部２２で用いられた検索条件を含む検索履歴情報をデータ記録部１２に記録する履歴記録部２３と、指定日時に、データ記録部１２から抽出条件を満たす検索条件を抽出する検索条件抽出部２５と、ファイルシステム１１から、検索条件抽出部２５により抽出された検索条件と一致するキーワードが含まれるファイルを抽出し、当該ファイルに紐づく文書を抽出する全文検索部２６と、データ記録部１２に対し、全文検索部２６により抽出された文書の属性情報に、検索条件抽出部２５により抽出された検索条件であるキーワードを記録するキーワード登録部２７とを備えた。【選択図】図１It is possible to reduce a burden on a user by registering attribute information and to suppress an increase in the data amount of attribute information. When a search condition is specified, a document search unit for extracting attribute information including a keyword that matches the search condition from a data recording unit, and a search condition used by the document search unit. The search history information including the search history information included in the data recording unit 12, the search condition extraction unit 25 that extracts the search conditions satisfying the extraction condition from the data recording unit 12 at the specified date and time, and the file system 11 A full-text search unit 26 that extracts a file including a keyword that matches the search condition extracted by the extraction unit 25 and extracts a document associated with the file, and a full-text search unit 26 for the data recording unit 12 And a keyword registration unit 27 for recording a keyword which is a search condition extracted by the search condition extraction unit 25 in the attribute information of the document. [Selection] Figure 1

Description

この発明は、文書を管理する文書管理装置及び文書管理方法に関する。 The present invention relates to a document management apparatus and a document management method for managing documents.

従来から、文字を示す電子データ（ファイル）を管理することで、文書を管理する文書管理装置が知られている。なお、文書は、１つ又は複数のファイルから構成される。
この文書管理装置では、キーワードを用いた文書検索を可能とするため、ユーザが、文書登録の際に、文書の属性情報を登録する必要がある。この際、より検索性を高めるためには、ユーザが十分な属性情報を登録する必要があり、ユーザ負担が増大する。 2. Description of the Related Art Conventionally, document management apparatuses that manage documents by managing electronic data (files) indicating characters are known. A document is composed of one or a plurality of files.
In this document management apparatus, it is necessary for a user to register document attribute information when registering a document in order to enable document search using a keyword. At this time, in order to further improve the searchability, it is necessary for the user to register sufficient attribute information, which increases the burden on the user.

これに対し、属性情報の登録の手間を軽減するため、文書管理装置が、文書登録の際に、その文書の内容からキーワードを自動で抽出し、その抽出したキーワードを属性情報として登録するものが知られている（例えば特許文献１参照）。 On the other hand, in order to reduce the trouble of registering attribute information, a document management apparatus automatically extracts a keyword from the content of the document and registers the extracted keyword as attribute information when registering the document. It is known (for example, see Patent Document 1).

特開平９−１９８３９５号公報JP-A-9-198395

しかしながら、特許文献１に開示されるような従来の文書管理装置では、使用頻度の少ない不要なキーワードであっても属性情報として登録される可能性が高く、属性情報のデータ量が増大するという課題がある。 However, in the conventional document management apparatus as disclosed in Patent Document 1, there is a high possibility that even unnecessary keywords that are less frequently used are registered as attribute information, and the amount of attribute information data increases. There is.

この発明は、上記のような課題を解決するためになされたもので、属性情報の登録によるユーザ負担の軽減が可能であり、且つ、属性情報のデータ量の増大を抑制可能な文書管理装置を提供することを目的としている。 The present invention has been made to solve the above-described problems, and provides a document management apparatus that can reduce the burden on the user by registering attribute information and can suppress an increase in the data amount of attribute information. It is intended to provide.

この発明に係る文書管理装置は、１つ以上のキーワードである検索条件が指定された場合に、ファイルシステムに記録されているファイルに紐づく文書の属性情報を記録しているデータ記録部から、当該検索条件と一致するキーワードが含まれる属性情報を抽出する文書検索部と、文書検索部で用いられた検索条件を含む検索履歴情報を、データ記録部に記録する履歴記録部と、指定された日時に、データ記録部から、抽出条件を満たす検索条件を抽出する検索条件抽出部と、ファイルシステムから、検索条件抽出部により抽出された検索条件と一致するキーワードが含まれるファイルを抽出し、当該ファイルに紐づく文書を抽出する全文検索部と、データ記録部に対し、全文検索部により抽出された文書の属性情報に、検索条件抽出部により抽出された検索条件であるキーワードを拡張キーワードとして記録するキーワード登録部とを備えたことを特徴とする。 When a search condition that is one or more keywords is specified, the document management apparatus according to the present invention includes a data recording unit that records attribute information of a document associated with a file recorded in the file system, A document search unit for extracting attribute information including a keyword matching the search condition, a history recording unit for recording search history information including the search condition used in the document search unit in the data recording unit, and a designated At the time of day, a search condition extraction unit that extracts a search condition that satisfies the extraction condition from the data recording unit, and a file that includes a keyword that matches the search condition extracted by the search condition extraction unit is extracted from the file system. The full-text search unit that extracts the document associated with the file and the attribute information of the document extracted by the full-text search unit are added to the data recording unit by the search condition extraction unit. Which is the extracted search condition Keyword is characterized in that a keyword registration unit that records the extended keyword.

この発明によれば、上記のように構成したので、属性情報の登録によるユーザ負担の軽減が可能であり、且つ、属性情報のデータ量の増大を抑制可能である。 According to this invention, since it comprised as mentioned above, the user burden by registration of attribute information can be reduced, and the increase in the data amount of attribute information can be suppressed.

この発明の実施の形態１に係る文書管理装置の構成例を示す図である。It is a figure which shows the structural example of the document management apparatus which concerns on Embodiment 1 of this invention. この発明の実施の形態１におけるデータ記録部に記録される属性情報の項目の一例を示す図である。It is a figure which shows an example of the item of the attribute information recorded on the data recording part in Embodiment 1 of this invention. この発明の実施の形態１におけるデータ記録部に記録される検索履歴テーブルの一例を示す図である。It is a figure which shows an example of the search history table recorded on the data recording part in Embodiment 1 of this invention. この発明の実施の形態１におけるデータ記録部に記録される抽出条件情報の項目の一例を示す図である。It is a figure which shows an example of the item of the extraction condition information recorded on the data recording part in Embodiment 1 of this invention. この発明の実施の形態１に係る文書管理装置による文書登録の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the document registration by the document management apparatus concerning Embodiment 1 of this invention. この発明の実施の形態１に係る文書管理装置によるファイル変更の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the file change by the document management apparatus concerning Embodiment 1 of this invention. この発明の実施の形態１に係る文書管理装置によるキーワード検索の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the keyword search by the document management apparatus concerning Embodiment 1 of this invention. この発明の実施の形態１に係る文書管理装置によるキーワード登録の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the keyword registration by the document management apparatus concerning Embodiment 1 of this invention. 図９Ａ、図９Ｂは、この発明の実施の形態１における処理部のハードウェア構成例を示す図である。9A and 9B are diagrams showing a hardware configuration example of the processing unit according to Embodiment 1 of the present invention.

以下、この発明の実施の形態について図面を参照しながら詳細に説明する。
実施の形態１．
図１はこの発明の実施の形態１に係る文書管理装置の構成例を示す図である。
文書管理装置は、文字を示す電子データ（ファイル）を管理することで、文書を管理する。なお、文書は１つ以上のファイルから成る。また以下では、文書管理装置が有する機能のうち、キーワードを用いた文書検索に関する機能について示す。この文書管理装置は、図１に示すように、管理部１及び処理部２を備えている。管理部１は、ファイルシステム１１、データ記録部１２及びインデックス部１３を有している。処理部２は、文書登録部２１、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７を有している。なお、抽出条件設定部２４、検索条件抽出部２５及びキーワード登録部２７は、管理処理部２８を構成する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
Embodiment 1 FIG.
FIG. 1 is a diagram showing a configuration example of a document management apparatus according to Embodiment 1 of the present invention.
The document management apparatus manages documents by managing electronic data (files) indicating characters. A document consists of one or more files. Hereinafter, among the functions of the document management apparatus, functions related to document search using keywords will be described. As shown in FIG. 1, the document management apparatus includes a management unit 1 and a processing unit 2. The management unit 1 includes a file system 11, a data recording unit 12, and an index unit 13. The processing unit 2 includes a document registration unit 21, a document search unit 22, a history recording unit 23, an extraction condition setting unit 24, a search condition extraction unit 25, a full text search unit 26, and a keyword registration unit 27. The extraction condition setting unit 24, the search condition extraction unit 25, and the keyword registration unit 27 constitute a management processing unit 28.

ファイルシステム１１は、ファイルを記録する。
データ記録部１２は、属性情報を含む文書情報、検索履歴情報を有する検索履歴テーブル、及び、抽出条件情報を記録する。属性情報は、ファイルシステム１１に記録されているファイルに紐づく文書の属性を示す情報である。検索履歴情報は、文書管理装置でキーワードを用いた文書検索が行われた際の履歴を示す情報である。抽出条件情報は、文書管理装置で属性情報としてキーワードの自動登録を行う際に候補とするキーワードの抽出条件、及び、キーワードの自動登録の方法を示す情報である。 The file system 11 records a file.
The data recording unit 12 records document information including attribute information, a search history table having search history information, and extraction condition information. The attribute information is information indicating the attribute of the document associated with the file recorded in the file system 11. The search history information is information indicating a history when a document search using a keyword is performed in the document management apparatus. The extraction condition information is information indicating a keyword extraction condition and a keyword automatic registration method that are candidates for automatic registration of keywords as attribute information in the document management apparatus.

属性情報には、例えば図２に示すように、文書名称、文書番号、登録日、更新日、登録者、更新者、格納先、キーワード、ファイル名称、及び、拡張キーワード等、を示す情報が含まれている。
文書名称は、文書の名称である。文書番号は、文書を識別する番号（副版）である。登録日は、文書が文書管理装置に登録された日である。更新日は、文書の最終更新日である。登録者は、文書を登録したユーザの名称である。更新者は、文書の最終更新をしたユーザの名称である。記録先は、文書を構成するファイルの記録先を示す階層である。キーワードは、文書登録の際に、ユーザにより登録された文書検索のためのキーワードである。ファイル名称は、文書を構成するファイル毎の名称である。ファイル備考は、ファイル毎の説明文である。拡張キーワードは、文書管理装置がファイル毎に自動で抽出する文書検索のためのキーワードである。 For example, as shown in FIG. 2, the attribute information includes information indicating a document name, a document number, a registration date, an update date, a registrant, an updater, a storage location, a keyword, a file name, an extended keyword, and the like. It is.
The document name is the name of the document. The document number is a number (subversion) for identifying a document. The registration date is the date when the document is registered in the document management apparatus. The update date is the last update date of the document. The registrant is the name of the user who registered the document. The updater is the name of the user who last updated the document. The recording destination is a hierarchy indicating the recording destination of the files constituting the document. The keyword is a keyword for document search registered by the user at the time of document registration. The file name is a name for each file constituting the document. A file remark is an explanatory text for each file. The extended keyword is a keyword for document search that is automatically extracted for each file by the document management apparatus.

検索履歴情報には、例えば図３に示すように、検索条件、最終更新日時、実施回数、ヒット件数、及び、前回対象、を示す情報が含まれる。
検索条件は、文書検索で用いられた１つ以上のキーワードである。最終更新日時は、検索条件が用いられた最終日時である。実施回数は、検索条件を用いて文書検索を行った回数である。ヒット件数は、検索条件と一致するキーワードが含まれる属性情報の件数である。前回対象は、前回実施したキーワードの自動登録において、検索条件抽出部２５が対象件数の条件で検索条件の抽出を行った結果、対象となったか否かを示し、図３の例では、前回のキーワードの自動登録で対象となった場合を１とし、それ以外を０としている。 For example, as shown in FIG. 3, the search history information includes information indicating the search condition, the last update date, the number of executions, the number of hits, and the previous target.
The search condition is one or more keywords used in document search. The last update date and time is the last date and time when the search condition is used. The number of executions is the number of times a document search is performed using a search condition. The number of hits is the number of attribute information including a keyword that matches the search condition. The last target indicates whether or not the search condition extraction unit 25 extracted the search condition under the condition of the number of objects in the automatic registration of the keyword performed last time, and in the example of FIG. The case where it becomes a target by automatic registration of keywords is set to 1, and the other cases are set to 0.

抽出条件情報には、例えば図４に示すように、対象件数、対象ヒット件数、対象更新日時、対象実施回数、及び、実行条件、を示す情報が含まれる。
対象件数は、検索履歴テーブルに含まれる検索条件のうち、検索条件抽出部２５が優先順位の高い順に抽出する検索条件の件数である。なお、検索条件の優先順位は、例えば、最終更新日が新しく且つ実施回数が多い順等のように、適宜設定される。対象ヒット件数は、検索履歴テーブルに含まれる検索条件のうち、検索条件抽出部２５が抽出する検索条件のヒット件数の範囲である。対象更新日時は、検索履歴テーブルに含まれる検索条件のうち、検索条件抽出部２５が抽出する検索条件の最終更新日時の範囲である。対象実施回数は、検索履歴テーブルに含まれる検索条件のうち、検索条件抽出部２５が抽出する検索条件の実施回数の範囲である。実行条件は、文書管理装置でキーワードの自動登録を行う際に、属性情報の拡張キーワードにキーワードを追加するか上書きするかを指定する項目である。 For example, as illustrated in FIG. 4, the extraction condition information includes information indicating the number of target cases, the number of target hits, the target update date and time, the number of target executions, and the execution conditions.
The target number is the number of search conditions that the search condition extraction unit 25 extracts from the search conditions included in the search history table in descending order of priority. Note that the priority order of the search conditions is set as appropriate, for example, in the order of the latest update date and the highest number of implementations. The target hit count is a range of hit counts of search conditions extracted by the search condition extraction unit 25 among the search conditions included in the search history table. The target update date / time is a range of the last update date / time of the search condition extracted by the search condition extraction unit 25 among the search conditions included in the search history table. The target execution count is a range of the number of executions of the search condition extracted by the search condition extraction unit 25 among the search conditions included in the search history table. The execution condition is an item for designating whether to add or overwrite a keyword to the extended keyword of the attribute information when automatically registering the keyword in the document management apparatus.

インデックス部１３は、ファイルシステム１１に記録されたファイルからインデックスを抽出し、そのインデックスを記録する。このインデックス部１３によるインデックスの生成及び記録は定期的に実施される。 The index unit 13 extracts an index from the file recorded in the file system 11 and records the index. Index generation and recording by the index unit 13 are periodically performed.

ファイルシステム１１、データ記録部１２及びインデックス部１３としては、例えば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、フラッシュメモリ、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲＯＭ）、ＥＥＰＲＯＭ（ＥｌｅｃｔｒｉｃａｌｌｙＥＰＲＯＭ）等の不揮発性又は揮発性の半導体メモリ、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、又は、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）等が用いられる。 As the file system 11, the data recording unit 12, and the index unit 13, for example, a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an EPROM (Erasable Programmable ROM), an EEPROM (Electrically EPROM), etc. Alternatively, a volatile semiconductor memory, a magnetic disk, a flexible disk, an optical disk, a compact disk, a mini disk, a DVD (Digital Versatile Disc), or the like is used.

なお図１では、文書管理装置の内部に管理部１が設けられた場合を示している。しかしながら、これに限らず、文書管理装置の外部に管理部１が設けられてもよい。 FIG. 1 shows a case where the management unit 1 is provided inside the document management apparatus. However, the present invention is not limited to this, and the management unit 1 may be provided outside the document management apparatus.

文書登録部２１は、外部からファイルが転送されて文書の登録が要求された場合に、ファイルシステム１１に当該ファイルを記録させ、また、当該ファイルの情報を含めユーザにより設定された文書の属性情報を、データ記録部１２に記録する。
また、文書登録部２１は、登録済みの文書に対して外部からファイルの変更が要求された場合に、ファイルシステム１１に記録されている当該ファイルを変更する。また、文書登録部２１は、データ記録部１２に記録されている上記ファイルの情報を含めユーザにより変更された文書の属性情報を変更する。この際、文書登録部２１は、上記属性情報のうち、上記ファイルに紐づく拡張キーワードについては削除する。 The document registration unit 21 causes the file system 11 to record the file when the file is transferred from the outside and the registration of the document is requested, and the attribute information of the document set by the user including the information of the file Is recorded in the data recording unit 12.
The document registration unit 21 changes the file recorded in the file system 11 when a file change is requested from the outside for a registered document. In addition, the document registration unit 21 changes the attribute information of the document changed by the user, including the information on the file recorded in the data recording unit 12. At this time, the document registration unit 21 deletes the extended keyword associated with the file from the attribute information.

文書検索部２２は、外部から検索条件が指定されて文書検索が要求された場合に、データ記録部１２から当該検索条件と一致するキーワードが含まれる文書の属性情報を抽出する。この文書検索部２２により抽出された属性情報は外部に通知される。 When a search condition is designated from the outside and a document search is requested, the document search unit 22 extracts attribute information of a document including a keyword that matches the search condition from the data recording unit 12. The attribute information extracted by the document search unit 22 is notified to the outside.

履歴記録部２３は、文書検索部２２で用いられた検索条件を含む検索履歴情報を、データ記録部１２に記録する。 The history recording unit 23 records search history information including the search conditions used in the document search unit 22 in the data recording unit 12.

抽出条件設定部２４は、外部からの要求に応じ、データ記録部１２で記録される抽出条件情報の設定及び確認を行う。 The extraction condition setting unit 24 sets and confirms the extraction condition information recorded by the data recording unit 12 in response to an external request.

検索条件抽出部２５は、外部又は内部スケジューラにより指定された日時に、データ記録部１２から、抽出条件情報が示す抽出条件を満たす検索条件を抽出する。この検索条件抽出部２５により抽出された検索条件を示す情報は全文検索部２６に通知される。また、検索条件抽出部２５は、抽出条件情報に含まれる実行条件が上書きを示している場合には、上記日時に、データ記録部１２に記録されている属性情報から、検索履歴テーブルの前回対象が０である検索条件に対応する拡張キーワードを全て削除する。 The search condition extraction unit 25 extracts a search condition that satisfies the extraction condition indicated by the extraction condition information from the data recording unit 12 at the date and time designated by the external or internal scheduler. Information indicating the search conditions extracted by the search condition extraction unit 25 is notified to the full-text search unit 26. In addition, when the execution condition included in the extraction condition information indicates overwriting, the search condition extraction unit 25 determines the previous target of the search history table from the attribute information recorded in the data recording unit 12 at the above date and time. All the extended keywords corresponding to the search condition with 0 is deleted.

全文検索部２６は、ファイルシステム１１から、検索条件抽出部２５により抽出された検索条件と一致するキーワードが含まれるファイルを抽出し、当該ファイルに紐づく文書を抽出する。図１の例では、全文検索部２６は、インデックス部１３に記録されているインデックスからキーワードが一致するファイルを抽出し、当該ファイルに紐づく文書を抽出する。この全文検索部２６により抽出されたファイル及び文書を示す情報はキーワード登録部２７に通知される。 The full-text search unit 26 extracts from the file system 11 a file that includes a keyword that matches the search condition extracted by the search condition extraction unit 25, and extracts a document associated with the file. In the example of FIG. 1, the full-text search unit 26 extracts a file with a matching keyword from the index recorded in the index unit 13 and extracts a document associated with the file. Information indicating the files and documents extracted by the full-text search unit 26 is notified to the keyword registration unit 27.

キーワード登録部２７は、データ記録部１２に対し、全文検索部２６により抽出された文書の属性情報に、検索条件抽出部２５により抽出された検索条件であるキーワードを拡張キーワードとして記録する。 The keyword registration unit 27 records the keyword that is the search condition extracted by the search condition extraction unit 25 as an extended keyword in the attribute information of the document extracted by the full-text search unit 26 in the data recording unit 12.

次に、文書管理装置の動作例について、図５〜８を参照しながら説明する。
まず、文書管理装置による文書登録の動作例について、図５を参照しながら説明する。
文書管理装置による文書登録では、外部からファイルが転送されて文書の登録が要求されると、図５に示すように、まず、文書登録部２１は、ファイルシステム１１に当該ファイルを記録する（ステップＳＴ５０１）。なお、文書登録部２１は、１つの文書に複数の異なるファイルを登録することもできる。
また、文書登録部２１は、上記ファイルの情報を含めユーザにより設定された文書の属性情報を、データ記録部１２に記録する（ステップＳＴ５０２）。 Next, an operation example of the document management apparatus will be described with reference to FIGS.
First, an operation example of document registration by the document management apparatus will be described with reference to FIG.
In document registration by the document management apparatus, when a file is transferred from the outside and registration of the document is requested, the document registration unit 21 first records the file in the file system 11 as shown in FIG. ST501). Note that the document registration unit 21 can also register a plurality of different files in one document.
Further, the document registration unit 21 records the attribute information of the document set by the user including the file information in the data recording unit 12 (step ST502).

次に、文書管理装置によるファイル変更の動作例について、図６を参照しながら説明する。
文書管理装置によるファイル変更では、登録済みの文書に対して外部からファイルの変更が要求されると、図６に示すように、まず、文書登録部２１は、ファイルシステム１１に記録されている当該ファイルを変更する（ステップＳＴ６０１）。
また、文書登録部２１は、データ記録部１２に記録されている属性情報をユーザにより設定された内容に変更する（ステップＳＴ６０２）。
また、文書登録部２１は、上記属性情報のうち、文書管理装置が自動で登録する属性情報である拡張キーワードについては変更したファイルのものは削除する（ステップＳＴ６０３）。 Next, an example of file change operation by the document management apparatus will be described with reference to FIG.
In the file change by the document management apparatus, when a file change is requested from the outside with respect to a registered document, first, as shown in FIG. The file is changed (step ST601).
Further, the document registration unit 21 changes the attribute information recorded in the data recording unit 12 to the contents set by the user (step ST602).
Also, the document registration unit 21 deletes the changed keyword of the extended keyword that is the attribute information automatically registered by the document management apparatus from the attribute information (step ST603).

次に、文書管理装置によるキーワードを用いた文書検索の動作例について、図７を参照しながら説明する。
文書管理装置によるキーワードを用いた文書検索では、外部から検索条件が指定されて文書検索が要求されると、図７に示すように、まず、文書検索部２２は、データ記録部１２から当該検索条件と一致するキーワードが含まれる文書の属性情報を抽出する（ステップＳＴ７０１）。この文書検索部２２により抽出された属性情報は外部に通知される。 Next, an example of document search operation using keywords by the document management apparatus will be described with reference to FIG.
In the document search using keywords by the document management apparatus, when a search condition is specified from the outside and a document search is requested, the document search unit 22 first searches the data recording unit 12 for the search as shown in FIG. The attribute information of the document including the keyword that matches the condition is extracted (step ST701). The attribute information extracted by the document search unit 22 is notified to the outside.

次いで、履歴記録部２３は、文書検索部２２で用いられた検索条件を含む検索履歴情報を、データ記録部１２に記録する（ステップＳＴ７０２）。履歴記録部２３は、例えば、データ記録部１２に対し、文書検索部２２で用いられた検索条件及び当該検索条件でのヒット件数を記録し、また、当該検索条件を用いた最終更新日及び実施回数の更新を行う。 Next, the history recording unit 23 records search history information including the search conditions used in the document search unit 22 in the data recording unit 12 (step ST702). The history recording unit 23 records, for example, the search conditions used in the document search unit 22 and the number of hits under the search conditions, and the last update date and the execution using the search conditions. Update the number of times.

次に、文書管理装置によるキーワードの自動登録の動作例について、図８を参照しながら説明する。なお以下では、抽出条件情報に、対象件数、対象ヒット件数、対象更新日時、対象実施回数、及び、実行条件、を示す情報が含まれているものとする。
文書管理装置によるキーワードの自動登録では、図８に示すように、検索条件抽出部２５は、外部又は内部スケジューラにより指定された日時になったかを判定する（ステップＳＴ８０１）。
このステップＳＴ８０１において、検索条件抽出部２５が上記日時にはなっていないと判定した場合には、シーケンスはステップＳＴ８０１に戻る。
なお、指定日時による実行は、シーケンスによる判定に限らず、タイマ機能による割込み処理又は外部スケジュールによるイベント起動等でもよく、その方法は問わない。 Next, an operation example of automatic keyword registration by the document management apparatus will be described with reference to FIG. In the following, it is assumed that the extraction condition information includes information indicating the number of targets, the number of target hits, the target update date, the number of target executions, and the execution conditions.
In the automatic keyword registration by the document management apparatus, as shown in FIG. 8, the search condition extraction unit 25 determines whether the date and time designated by the external or internal scheduler has come (step ST801).
In step ST801, if the search condition extraction unit 25 determines that the date and time have not come, the sequence returns to step ST801.
Note that the execution based on the designated date and time is not limited to the determination based on the sequence, but may be interrupt processing by the timer function or event activation by an external schedule, and the method thereof is not limited.

一方、ステップＳＴ８０１において、検索条件抽出部２５は、上記日時になったと判定した場合には、抽出条件情報に含まれる実行条件が上書きを示しているかを判定する（ステップＳＴ８０２）。
このステップＳＴ８０２において、検索条件抽出部２５は、実行条件が上書きを示していると判定した場合に、データ記録部１２に記録されている属性情報から、検索履歴テーブルの前回対象が０である検索条件に対応する拡張キーワードを全て削除する（ステップＳＴ８０３）。 On the other hand, if the search condition extraction unit 25 determines in step ST801 that the date and time have come, it determines whether or not the execution condition included in the extraction condition information indicates overwriting (step ST802).
In this step ST802, if the search condition extraction unit 25 determines that the execution condition indicates overwriting, the search target in the search history table is 0 from the attribute information recorded in the data recording unit 12. All the extended keywords corresponding to the conditions are deleted (step ST803).

一方、ステップＳＴ８０２において、検索条件抽出部２５は、実行条件が上書きを示していないと判定した場合、すなわち実行条件が追加を示している場合には、ステップＳＴ８０３はスキップされ、シーケンスはステップＳＴ８０４へ移行する。 On the other hand, in step ST802, if the search condition extraction unit 25 determines that the execution condition does not indicate overwriting, that is, if the execution condition indicates addition, step ST803 is skipped, and the sequence proceeds to step ST804. Transition.

次いで、検索条件抽出部２５は、データ記録部１２に記録されている検索履歴テーブルに対し、抽出条件に含まれる対象件数に該当する検索条件の前回対象を１とし、それ以外の検索条件の前回対象を０とする（ステップＳＴ８０４）。 Next, the search condition extraction unit 25 sets the previous target of the search condition corresponding to the number of targets included in the extraction condition to 1 for the search history table recorded in the data recording unit 12, and sets the previous search condition of the other search conditions. The target is set to 0 (step ST804).

次いで、検索条件抽出部２５は、データ記録部１２に記録されている検索履歴テーブルのうち、抽出条件情報に含まれる対象件数に該当する検索条件が有るかを判定する（ステップＳＴ８０５）。 Next, the search condition extraction unit 25 determines whether there is a search condition corresponding to the target number included in the extraction condition information in the search history table recorded in the data recording unit 12 (step ST805).

このステップＳＴ８０５において、検索条件抽出部２５は、対象件数に該当する検索条件が有ると判定した場合に、当該検索条件のうち、抽出条件情報に含まれる対象ヒット件数、対象更新日時及び対象実施回数に該当する検索条件が有るかを判定する（ステップＳＴ８０６）。
このステップＳＴ８０６において、検索条件抽出部２５は、上記各条件に該当する検索条件が有ると判定した場合には、シーケンスはステップＳＴ８０７へ移行する。
一方、検索条件抽出部２５は、ステップＳＴ８０５において対象件数に該当する検索条件が無いと判定した場合、又は、ステップＳＴ８０６において上記各条件に該当する検索条件が無いと判定した場合には、その後の処理はスキップされ、シーケンスは終了する。 In step ST805, when the search condition extraction unit 25 determines that there is a search condition corresponding to the number of cases, the number of target hits included in the extraction condition information, the target update date and time, and the number of target executions among the search conditions. It is determined whether there is a search condition corresponding to (step ST806).
In step ST806, if the search condition extraction unit 25 determines that there is a search condition corresponding to each of the above conditions, the sequence proceeds to step ST807.
On the other hand, if it is determined in step ST805 that there is no search condition corresponding to the target number in step ST805, or if it is determined in step ST806 that there is no search condition corresponding to each of the above conditions, Processing is skipped and the sequence ends.

ステップＳＴ８０１〜ＳＴ８０６の処理により、検索条件抽出部２５は、データ記録部１２から、抽出条件情報が示す抽出条件を満たす検索条件を抽出することができる。
なお、キーワードの自動登録を行うことで、前回も今回も対象件数の条件には該当するが、ヒット件数の条件が今回は範囲外となる検索条件が発生する可能性がある。そこで、検索履歴情報に前回対象を示す情報を含めることで、上記のようなケースで検索条件が対象となったりならなかったりすることを繰り返さないようにする。 Through the processing of steps ST801 to ST806, the search condition extraction unit 25 can extract search conditions that satisfy the extraction condition indicated by the extraction condition information from the data recording unit 12.
It should be noted that by performing automatic keyword registration, there may be a search condition in which the condition for the number of hits applies to the previous time and the current time, but the condition for the number of hits is outside the range this time. Therefore, by including information indicating the previous target in the search history information, it is possible not to repeat that the search condition is not the target in the above case.

次いで、全文検索部２６は、ファイルシステム１１から、検索条件抽出部２５により抽出された検索条件と一致するキーワードが含まれるファイルを抽出し、当該ファイルに紐づく文書を抽出する（ステップＳＴ８０７）。図１の例では、全文検索部２６は、インデックス部１３に記録されているインデックスからキーワードが一致するファイルを抽出し、当該ファイルに紐づく文書を抽出する。 Next, the full-text search unit 26 extracts from the file system 11 a file containing a keyword that matches the search condition extracted by the search condition extraction unit 25, and extracts a document associated with the file (step ST807). In the example of FIG. 1, the full-text search unit 26 extracts a file with a matching keyword from the index recorded in the index unit 13 and extracts a document associated with the file.

次いで、キーワード登録部２７は、データ記録部１２に対し、全文検索部２６により抽出された文書の属性情報に、検索条件抽出部２５により抽出された検索条件であるキーワードを拡張キーワードとして記録する（ステップＳＴ８０８）。その後、シーケンスは終了する。 Next, the keyword registration unit 27 records the keyword that is the search condition extracted by the search condition extraction unit 25 as an extended keyword in the attribute information of the document extracted by the full-text search unit 26 in the data recording unit 12 ( Step ST808). Thereafter, the sequence ends.

このように、実施の形態１に係る文書管理装置では、キーワードを用いた文書検索が実施される度にその履歴を記録している。そして、文書管理装置は、キーワードの自動登録の際に、上記履歴のうちの抽出条件に合致する検索条件のみを抽出して全文検索を行い、全文検索により得られた検索条件を対応するファイルに紐づく文書の属性情報として拡張キーワードを自動登録している。これにより、属性情報の登録によるユーザ負担の軽減が可能であり、また、有効な拡張キーワードのみを登録可能であるため、属性情報のデータ量の増大を抑制可能である。 As described above, the document management apparatus according to the first embodiment records the history every time a document search using a keyword is performed. Then, when automatically registering the keyword, the document management device extracts only the search condition that matches the extraction condition in the history and performs a full-text search, and the search condition obtained by the full-text search is stored in a corresponding file. An extended keyword is automatically registered as attribute information of the associated document. As a result, it is possible to reduce the burden on the user by registering the attribute information, and it is possible to register only valid expansion keywords, and therefore it is possible to suppress an increase in the data amount of the attribute information.

また、文書管理装置は、キーワードの自動登録方法として、キーワードの上書きを行うことで、自動登録した拡張キーワードのデータ量の増大を抑制できる。
また、文書管理装置は、文書登録の際にユーザが指定したキーワードと、文書管理装置が自動登録した拡張キーワードとを分けて管理することで、実行条件の変更、又は、ファイルの変更又は削除といった操作と連動して拡張キーワードを自動的に削除可能となる。
また、ヒット件数を抽出条件とすることで検索結果の期待度が高い半面、結果の少ないキーワードに絞って、自動的に検出することができる。すなわち、ヒット件数の条件は、ある検索条件でヒットした属性情報の件数が多い場合（例えば１０００件）に、文書管理装置が自動で当該検索条件であるキーワードを拡張キーワードとして登録しないようにするためのものである。このヒット件数は、よく検索されるのにあまりヒットしない検索条件を拡張キーワードとして登録することでユーザが探したいものを自動で検索可能とし、既に多くの検索結果がでているものについては除外することを目的としている。 Further, the document management apparatus can suppress an increase in the data amount of the automatically registered extended keyword by overwriting the keyword as an automatic keyword registration method.
Further, the document management apparatus separately manages the keyword specified by the user at the time of document registration and the extended keyword automatically registered by the document management apparatus, thereby changing the execution condition or changing or deleting the file. Extended keywords can be automatically deleted in conjunction with the operation.
In addition, by using the number of hits as an extraction condition, search results can be automatically detected by narrowing down to a keyword with a low result, while the degree of expectation of a search result is high. That is, the condition for the number of hits is to prevent the document management apparatus from automatically registering the keyword that is the search condition as an extended keyword when the number of attribute information hit under a certain search condition is large (for example, 1000). belongs to. The number of hits can be searched automatically by registering search conditions that are frequently searched but do not hit very much as extended keywords so that users can search automatically and exclude those that already have many search results. The purpose is that.

なお、検索履歴テーブルにおける検索条件の登録可能数を制限してもよい。これによっても、属性情報のデータ量の増大を抑制できる。 Note that the number of search conditions that can be registered in the search history table may be limited. This also can suppress an increase in the data amount of the attribute information.

以上のように、この実施の形態１によれば、１つ以上のキーワードである検索条件が指定された場合に、ファイルシステム１１に記録されているファイルと紐づく文書の属性情報を記録しているデータ記録部１２から、当該検索条件と一致するキーワードが含まれる属性情報を抽出する文書検索部２２と、文書検索部２２で用いられた検索条件を含む検索履歴情報を、データ記録部１２に記録する履歴記録部２３と、指定された日時に、データ記録部１２から、抽出条件を満たす検索条件を抽出する検索条件抽出部２５と、ファイルシステム１１から、検索条件抽出部２５により抽出された検索条件と一致するキーワードが含まれるファイルを抽出し、当該ファイルに紐づく文書を抽出する全文検索部２６と、データ記録部１２に対し、全文検索部２６により抽出された文書の属性情報に、検索条件抽出部２５により抽出された検索条件であるキーワードを拡張キーワードとして記録するキーワード登録部２７とを備えたので、属性情報の登録によるユーザ負担の軽減が可能であり、且つ、属性情報のデータ量の増大を抑制可能である。 As described above, according to the first embodiment, when the search condition that is one or more keywords is designated, the attribute information of the document associated with the file recorded in the file system 11 is recorded. The document search unit 22 for extracting attribute information including a keyword that matches the search condition from the data recording unit 12 and the search history information including the search condition used in the document search unit 22 are stored in the data recording unit 12. Extracted by the search condition extraction unit 25 from the history recording unit 23 for recording, the search condition extraction unit 25 for extracting the search condition satisfying the extraction condition from the data recording unit 12 at the designated date and time, and the file system 11. A full-text search unit 26 that extracts a file including a keyword that matches the search condition and extracts a document associated with the file and the data recording unit 12 Since the attribute information of the document extracted by the search unit 26 is provided with a keyword registration unit 27 that records a keyword that is a search condition extracted by the search condition extraction unit 25 as an extended keyword, the user burden due to registration of attribute information Can be reduced, and an increase in the amount of attribute information can be suppressed.

最後に、図９を参照して、実施の形態１における処理部２のハードウェア構成例を説明する。
処理部２における文書登録部２１、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７の各機能は、処理回路５１により実現される。処理回路５１は、図９Ａに示すように、専用のハードウェアであってもよいし、図９Ｂに示すように、メモリ５３に記録されるプログラムを実行するＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ、中央処理装置、処理装置、演算装置、マイクロプロセッサ、マイクロコンピュータ、プロセッサ、又はＤＳＰ（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ）ともいう）５２であってもよい。 Finally, an example of the hardware configuration of the processing unit 2 in the first embodiment will be described with reference to FIG.
The functions of the document registration unit 21, document search unit 22, history recording unit 23, extraction condition setting unit 24, search condition extraction unit 25, full-text search unit 26, and keyword registration unit 27 in the processing unit 2 are realized by the processing circuit 51. Is done. The processing circuit 51 may be dedicated hardware as shown in FIG. 9A, or as shown in FIG. 9B, a CPU (Central Processing Unit) that executes a program recorded in the memory 53, a central processing unit, It may be a processing device, an arithmetic device, a microprocessor, a microcomputer, a processor, or a DSP (Digital Signal Processor)) 52.

処理回路５１が専用のハードウェアである場合、処理回路５１は、例えば、単一回路、複合回路、プログラム化したプロセッサ、並列プログラム化したプロセッサ、ＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）、ＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）、又はこれらを組み合わせたものが該当する。文書登録部２１、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７の各部の機能それぞれを処理回路５１で実現してもよいし、各部の機能をまとめて処理回路５１で実現してもよい。 When the processing circuit 51 is dedicated hardware, the processing circuit 51 may be, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, an ASIC (Application Specific Integrated Circuit), or an FPGA (Field Programmable Gate). Array) or a combination thereof. The processing circuit 51 may realize the functions of the document registration unit 21, document search unit 22, history recording unit 23, extraction condition setting unit 24, search condition extraction unit 25, full-text search unit 26, and keyword registration unit 27. Alternatively, the functions of the respective units may be collectively realized by the processing circuit 51.

処理回路５１がＣＰＵ５２の場合、文書登録部２１、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７の機能は、ソフトウェア、ファームウェア、又はソフトウェアとファームウェアとの組み合わせにより実現される。ソフトウェア及びファームウェアはプログラムとして記述され、メモリ５３に記録される。処理回路５１は、メモリ５３に記録されたプログラムを読み出して実行することにより、各部の機能を実現する。すなわち、処理部２は、処理回路５１により実行されるときに、例えば図５〜８に示した各ステップが結果的に実行されることになるプログラムを記録するためのメモリ５３を備える。また、これらのプログラムは、文書登録部２１、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７の手順及び方法をコンピュータに実行させるものであるともいえる。ここで、メモリ５３としては、例えば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、フラッシュメモリ、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲＯＭ）、ＥＥＰＲＯＭ（ＥｌｅｃｔｒｉｃａｌｌｙＥＰＲＯＭ）等の不揮発性又は揮発性の半導体メモリ、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、又はＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）等が該当する。 When the processing circuit 51 is the CPU 52, the functions of the document registration unit 21, document search unit 22, history recording unit 23, extraction condition setting unit 24, search condition extraction unit 25, full-text search unit 26, and keyword registration unit 27 are software, This is realized by firmware or a combination of software and firmware. Software and firmware are described as programs and recorded in the memory 53. The processing circuit 51 reads out and executes the program recorded in the memory 53, thereby realizing the function of each unit. That is, the processing unit 2 includes a memory 53 for recording a program that, when executed by the processing circuit 51, for example, causes each step shown in FIGS. 5 to 8 to be executed as a result. Also, these programs store the procedures and methods of the document registration unit 21, document search unit 22, history recording unit 23, extraction condition setting unit 24, search condition extraction unit 25, full text search unit 26, and keyword registration unit 27 on a computer. It can be said that it is what is executed. Here, as the memory 53, for example, a nonvolatile or volatile semiconductor memory such as a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an EPROM (Erasable Programmable ROM), an EEPROM (Electrically EPROM), or the like. A magnetic disk, a flexible disk, an optical disk, a compact disk, a mini disk, a DVD (Digital Versatile Disc), or the like is applicable.

なお、文書登録部２１、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７の各機能について、一部を専用のハードウェアで実現し、一部をソフトウェア又はファームウェアで実現するようにしてもよい。例えば、文書登録部２１については専用のハードウェアとしての処理回路５１でその機能を実現し、文書検索部２２、履歴記録部２３、抽出条件設定部２４、検索条件抽出部２５、全文検索部２６及びキーワード登録部２７については処理回路５１がメモリ５３に記録されたプログラムを読み出して実行することによってその機能を実現することが可能である。 Note that some of the functions of the document registration unit 21, document search unit 22, history recording unit 23, extraction condition setting unit 24, search condition extraction unit 25, full-text search unit 26, and keyword registration unit 27 are partially dedicated hardware. It may be realized by a part, and a part may be realized by software or firmware. For example, the function of the document registration unit 21 is realized by a processing circuit 51 as dedicated hardware. The document search unit 22, the history recording unit 23, the extraction condition setting unit 24, the search condition extraction unit 25, and the full-text search unit 26. The keyword registration unit 27 can realize its function by the processing circuit 51 reading and executing the program recorded in the memory 53.

このように、処理回路５１は、ハードウェア、ソフトウェア、ファームウェア、又はこれらの組み合わせによって、上述の各機能を実現することができる。 As described above, the processing circuit 51 can realize the above-described functions by hardware, software, firmware, or a combination thereof.

なお、本願発明はその発明の範囲内において、実施の形態の任意の構成要素の変形、もしくは実施の形態の任意の構成要素の省略が可能である。 In the present invention, any constituent element of the embodiment can be modified or any constituent element of the embodiment can be omitted within the scope of the invention.

１管理部、２処理部、１１ファイルシステム、１２データ記録部、１３インデックス部、２１文書登録部、２２文書検索部、２３履歴記録部、２４抽出条件設定部、２５検索条件抽出部、２６全文検索部、２７キーワード登録部、２８管理処理部、５１処理回路、５２ＣＰＵ、５３メモリ。 DESCRIPTION OF SYMBOLS 1 Management part, 2 Processing part, 11 File system, 12 Data recording part, 13 Index part, 21 Document registration part, 22 Document search part, 23 History recording part, 24 Extraction condition setting part, 25 Search condition extraction part, 26 Full text Search unit, 27 Keyword registration unit, 28 Management processing unit, 51 Processing circuit, 52 CPU, 53 Memory.

Claims

When a search condition that is one or more keywords is specified, a keyword that matches the search condition is included from the data recording unit that records the attribute information of the document associated with the file recorded in the file system. A document search unit for extracting attribute information
A history recording unit that records search history information including search conditions used in the document search unit in the data recording unit;
A search condition extraction unit that extracts a search condition that satisfies the extraction condition from the data recording unit at a designated date and time;
A full-text search unit that extracts a file including a keyword that matches the search condition extracted by the search condition extraction unit from the file system, and extracts a document associated with the file;
A document management unit comprising: a keyword registration unit that records, as an extended keyword, a keyword that is a search condition extracted by the search condition extraction unit in the attribute information of the document extracted by the full-text search unit. apparatus.

In response to a request from the outside, the search condition extraction unit was previously executed from the attribute information recorded in the data recording unit at the date and time when the execution condition included in the extraction condition indicates overwriting. 2. The document management apparatus according to claim 1, wherein, in automatic keyword registration, all extended keywords corresponding to search conditions that do not correspond to the number of search conditions extracted in descending order of priority are deleted. .

The document management apparatus according to claim 1, wherein the extraction condition is set by a user.

When a search condition that is one or more keywords is specified, the document search unit receives the search condition from the data recording unit that records the attribute information of the document associated with the file recorded in the file system. Extract attribute information that contains matching keywords,
The history recording unit records search history information including the search conditions used in the document search unit in the data recording unit,
The search condition extraction unit extracts a search condition satisfying the extraction condition from the data recording unit at a specified date and time,
The full-text search unit extracts, from the file system, a file containing a keyword that matches the search condition extracted by the search condition extraction unit, extracts a document associated with the file,
The keyword registration unit records the keyword that is the search condition extracted by the search condition extraction unit as an extended keyword in the attribute information of the document extracted by the full-text search unit to the data recording unit. Document management method.