JP5592747B2

JP5592747B2 - File search device and file search program

Info

Publication number: JP5592747B2
Application number: JP2010229198A
Authority: JP
Inventors: 正和松浦; 達也衛藤; 誠也鶴田
Original assignee: Hitachi Solutions Ltd
Current assignee: Hitachi Solutions Ltd
Priority date: 2010-10-12
Filing date: 2010-10-12
Publication date: 2014-09-17
Anticipated expiration: 2030-10-12
Also published as: JP2012083929A

Description

本発明は、検索対象のファイルについて予めインデックスファイルを作成し、インデックスファイルを参照してファイルを検索するファイル検索装置およびファイル検索プログラムに関する。 The present invention relates to a file search apparatus and a file search program for creating an index file in advance for a search target file and searching for the file by referring to the index file.

近年、情報のデジタル化が急速に進展している。かつては、銀行の口座や市町村の住民票等、企業・官公庁等のコンピュータシステムが基幹業務を実行するときに参照するファイルを、データベース（Data Base）等に記憶することがデジタル化の中心であった。 In recent years, digitization of information has been progressing rapidly. In the past, the main point of digitization was to store in the database (Data Base) files that are referred to when a computer system such as a bank account or municipal resident's card executes a core business. It was.

一方、現在では、企業等の組織において日常業務で作成される各種の書類が、各従業員のクライアントＰＣ（Personal Computer）に文書ファイルとして記憶され、電子メールの添付文書として他のクライアントＰＣに送信され、あるいは、組織全体の共有情報としてファイルサーバに記憶される。そして、ファイルサーバに記憶された文書ファイルは、さまざまな従業員により参照され、時にはクライアントＰＣに複製され、更新される。
このようにして、多量の文書ファイルがさまざまなコンピュータに分散記憶されることにより、組織内に、重複あるいは類似した多数の文書ファイルが存在することになる。また、各コンピュータの記憶容量も増大し続けている。 On the other hand, various documents created in daily work in organizations such as companies are now stored as document files on each employee's client PC (Personal Computer) and sent to other client PCs as e-mail attachments. Alternatively, it is stored in the file server as shared information for the entire organization. The document file stored in the file server is referred to by various employees, and is sometimes copied and updated on the client PC.
In this way, a large number of document files are distributed and stored in various computers, so that a large number of duplicate or similar document files exist in the organization. In addition, the storage capacity of each computer continues to increase.

一方、組織内において、例えば、既に退職した従業員がかつて作成した文書ファイルが必要になる等、記憶場所が不明な文書ファイルを参照したい場合が多々生ずる。このような場合、文書ファイル内に含まれているはずのキーワードによる全文検索等によって文書ファイルを検索するという方法が広く行われている。
ただし、組織内の全てのコンピュータから文書ファイルを検索しようとすると、各従業員のクライアントＰＣにも全ての従業員からのアクセスを許可する必要があり、セキュリティ上好ましくない。このため、組織全体で共有すべき文書ファイルについては所定のファイルサーバに記憶しておくということが一般に行われている。 On the other hand, there are many cases in the organization where it is desired to refer to a document file whose storage location is unknown, for example, a document file created once by an already retired employee is required. In such a case, a method of searching a document file by a full-text search using a keyword that should be included in the document file is widely used.
However, if a document file is to be searched from all computers in the organization, it is necessary to permit access from all employees to the client PC of each employee, which is not preferable in terms of security. For this reason, a document file to be shared by the entire organization is generally stored in a predetermined file server.

しかし、文書ファイルを所定のいくつかのファイルサーバに記憶させたとしても、文書ファイルが必要になるたびにファイルサーバに記憶されている全ての文書ファイルを検索しようとすると、ファイルＩＯ（Input Output）等を行うために膨大な時間を必要とする。
そこで、特許文献１に開示されているファイル検索装置は、ファイルサーバに記憶されているファイルについて、記憶場所やキーワード等、検索に使用するための比較的小容量の情報（インデックス情報）をインデックスファイルとして記憶する。すなわち、このファイル検索装置では、インデックスファイルを作成することで、検索時に、複数ファイルについてのインデックス情報を１回のファイルＩＯで取得することが可能となり、各ファイルへのファイルＩＯを不要とすることができる。この結果、ファイル検索時のレスポンスタイムを短くすることができるとともに、ファイルサーバの負荷を軽減することができる。 However, even if document files are stored in some predetermined file servers, every time a document file is needed, if all document files stored in the file server are searched, a file IO (Input Output) It takes a huge amount of time to do so.
Therefore, the file search device disclosed in Patent Document 1 uses a relatively small amount of information (index information) to be used for a search, such as a storage location or a keyword, for an index file. Remember as. That is, in this file search apparatus, by creating an index file, it is possible to acquire index information for a plurality of files with a single file IO at the time of search, and file IO for each file is unnecessary. Can do. As a result, the response time when searching for a file can be shortened, and the load on the file server can be reduced.

特開２００３−１６２５４５号公報JP 2003-162545 A

しかしながら、特許文献１に開示されているファイル検索装置によってファイル検索を行うためには、検索対象としたい全てのファイルについて、インデックスファイルを作成する必要がある。
インデックス情報の記憶容量は、ファイル自体と比べれば比較的小さい。しかし、キーワードによる全文検索を行うことができるようにするためには、ファイルに含まれるキーワードを解析してインデックス情報に記憶しておく必要がある。このため、ファイルに含まれるキーワードの数が多くなれば、それだけ大きな容量を必要とする。従って、検索対象としたいファイルの数が増えるほど、インデックスファイルの記憶容量は大きくなっていく。 However, in order to perform a file search by the file search apparatus disclosed in Patent Document 1, it is necessary to create an index file for all files that are to be searched.
The storage capacity of the index information is relatively small compared to the file itself. However, in order to be able to perform a full text search using keywords, it is necessary to analyze the keywords contained in the file and store them in the index information. For this reason, as the number of keywords included in the file increases, a larger capacity is required. Accordingly, as the number of files to be searched increases, the storage capacity of the index file increases.

ここで、例えば組織内におけるファイル検索を検討すると、ファイルを共有のファイルサーバに記憶する場合であっても、部署ごとにアクセス可能なフォルダ等が限定されており、当該限定されたフォルダ等の配下にファイルを記憶し、検索する場合が多い。このような場合、フォルダに、例えば“作業報告書用フォルダ”といったような名称を付けておき、当該フォルダ内には当該名称に合致したファイルを記憶しておくという方法、すなわちフォルダによってファイルを分類するという方法をとる場合が多い。そして、このようにした場合、フォルダのツリー構造を手繰っていくことで必要なファイルが検索できるので、キーワードによる全文検索を必要とすることはまれである。 Here, for example, when considering a file search in an organization, even if the file is stored in a shared file server, the accessible folder for each department is limited, and the subordinate folder such as the limited folder is under control. In many cases, files are stored and searched. In such a case, a name such as “work report folder” is given to the folder, and a file matching the name is stored in the folder, that is, the file is classified by the folder. In many cases, the method of doing is taken. In such a case, since a necessary file can be searched by manipulating the tree structure of the folder, it is rare that a full-text search using a keyword is required.

一方、例えば、特許文献、技術論文等、組織外部から入手したファイルや、ＷＥＢサーバなど、組織外部のサーバに存在しているファイルについては、入手した時点では意図しなかった用途で参照することも多く、キーワードによる全文検索が必要な場合も多い。
このように組織におけるファイル検索を検討すると、全文検索を可能にすべきファイルと、全文検索が不要なファイルが存在している。これは組織に限ったことではなく、例えば個人的に行うファイル検索についても同じである。 On the other hand, files obtained from outside the organization, such as patent documents and technical papers, and files existing on a server outside the organization, such as a WEB server, may be referred to for purposes that were not intended at the time of acquisition. In many cases, full text search using keywords is required.
Considering file search in an organization in this way, there are files that should be capable of full text search and files that do not require full text search. This is not limited to an organization, and the same applies to, for example, a personal file search.

本発明の目的は、全文検索を可能にすべきファイルと全文検索が不要なファイルとが存在しているファイル検索環境において、効率的にファイルを検索することができるファイル検索装置およびファイル検索プログラムを提供することである。 An object of the present invention is to provide a file search apparatus and a file search program capable of efficiently searching for a file in a file search environment in which there are files that should be capable of full-text search and files that do not require full-text search. Is to provide.

上記目的を達成するために、本発明のファイル検索装置は、
記憶装置から、複数の検索対象ファイルからなる第１の検索対象ファイル群と、複数の検索対象ファイルからなる第２の検索対象ファイル群を取得するファイル取得手段と、
前記ファイル取得手段によって取得された第１の検索対象ファイル群に含まれる各検索対象ファイルについて、当該各検索対象ファイルの名前であるファイル名と、当該各検索対象ファイルの記憶されている記憶位置を示すファイルパスと、当該各検索対象ファイルの内容から抽出された複数のキーワードとを有するインデックス１レコードを所定のタイミングで作成し、当該各インデックス１レコードを含むインデックス１ファイルを記憶するインデックス１ファイル記憶手段と、
前記ファイル取得手段によって取得された第２の検索対象ファイル群に含まれる各検索対象ファイルについて、当該各検索対象ファイルの名前であるファイル名と当該各検索対象ファイルの記憶されている記憶位置を示すファイルパスと、当該各検索対象ファイルの内容を代表する１以上のメタデータとを有するインデックス２レコードを所定のタイミングで作成し、当該各インデックス２レコードを含むインデックス２ファイルを記憶するインデックス２ファイル記憶手段と、
メタデータに関する条件と全文検索の条件のいずれか一方または両方を含む検索条件を受け付ける検索条件受付手段と、
前記検索条件受付手段によって受け付けられた検索条件がメタデータに関する条件を含む場合に、前記インデックス２ファイルに含まれる各インデックス２レコードを当該メタデータに関する条件に基づいて検索し、当該メタデータに関する条件を満たすインデックス２レコードを取得するメタデータ検索手段と、
前記検索条件受付手段によって受け付けられた検索条件がメタデータに関する条件および全文検索の条件を含む場合には、前記インデックス１ファイルに含まれる各インデックス１レコードのうち、前記メタデータ検索手段によって取得されたインデックス２レコードに含まれるファイルパスと同一のファイルパスを含む各インデックス１レコードを前記全文検索の条件に基づいて検索し、前記検索条件受付手段によって受け付けられた検索条件がメタデータに関する条件を含まず、全文検索の条件を含む場合には、前記インデックス１ファイルに含まれる全てのインデックス１レコードを前記全文検索の条件に基づいて検索し、前記全文検索の条件を満たすインデックス１レコードを取得するキーワード検索手段と、
を備える。 In order to achieve the above object, the file search apparatus of the present invention provides:
A file acquisition means for acquiring a first search target file group consisting of a plurality of search target files and a second search target file group consisting of a plurality of search target files from the storage device;
For each search target file included in the first search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored Index 1 file storage for creating an index 1 record having a specified file path and a plurality of keywords extracted from the contents of each search target file at a predetermined timing, and storing the index 1 file including each index 1 record Means,
For each search target file included in the second search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored are shown. Index 2 file storage for creating an index 2 record having a file path and one or more metadata representing the contents of each search target file at a predetermined timing and storing an index 2 file including each index 2 record Means,
Search condition receiving means for receiving a search condition including one or both of a condition relating to metadata and a condition of full-text search;
When the search condition received by the search condition receiving means includes a condition related to metadata, each index 2 record included in the index 2 file is searched based on the condition related to the metadata, and the condition related to the metadata is set. A metadata search means for acquiring index 2 records satisfying;
When the search condition received by the search condition receiving means includes a metadata-related condition and a full-text search condition, it is acquired by the metadata search means out of each index 1 record included in the index 1 file . Each index 1 record including the same file path as the file path included in the index 2 record is searched based on the full-text search condition, and the search condition received by the search condition receiving means does not include a condition related to metadata. When the full text search condition is included, a keyword search that searches all index 1 records included in the index 1 file based on the full text search condition and obtains an index 1 record that satisfies the full text search condition. Means,
Is provided.

好ましくは、本発明のファイル検索装置は、
仮想分類の名称を示す仮想分類表示名と、前記インデックス２ファイルに含まれるいずれかのインデックス２レコードが当該仮想分類に分類される可能性の有無を示すインデックス２レコード有無情報と、前記インデックス２ファイルに含まれる各インデックス２レコードに対応する検索対象ファイルを当該仮想分類に分類するための分類条件と、上位階層の仮想分類の有無を示す上位階層有無情報と、上位階層の仮想分類を識別するための上位階層識別情報とを有する複数の仮想分類定義レコードを含む仮想分類定義ファイルを記憶する仮想分類定義ファイル記憶手段と、
前記仮想分類定義ファイル記憶手段によって記憶された前記仮想分類定義ファイルに含まれる各仮想分類定義レコードのインデックス２レコード有無情報と分類条件と上位階層有無情報と上位階層識別情報とに基づいて、各仮想分類と各インデックス２レコードに対応する検索対象ファイルとによって構成されるツリー状の階層構造を示す仮想分類定義情報を作成する仮想分類定義手段と、
前記仮想分類定義手段によって作成された仮想分類定義情報と、当該仮想分類定義情報によって示されるツリー状の階層構造を構成する各仮想分類の仮想分類表示名および各検索対象ファイルのファイル名とを出力する仮想分類出力手段と、
を備える。 Preferably, the file search device of the present invention is
A virtual classification display name indicating the name of the virtual classification, an index 2 record presence / absence information indicating whether any index 2 record included in the index 2 file may be classified into the virtual classification, and the index 2 file In order to identify the classification condition for classifying the search target file corresponding to each index 2 record included in the virtual classification into the virtual classification, the upper hierarchy presence / absence information indicating the presence / absence of the upper hierarchy virtual classification, and the upper hierarchy virtual classification Virtual classification definition file storage means for storing a virtual classification definition file including a plurality of virtual classification definition records having higher layer identification information;
Based on the index 2 record presence / absence information, the classification condition, the upper hierarchy presence / absence information, and the upper hierarchy identification information of each virtual classification definition record included in the virtual classification definition file stored by the virtual classification definition file storage unit, Virtual classification definition means for creating virtual classification definition information indicating a tree-like hierarchical structure constituted by a classification and a search target file corresponding to each index 2 record;
Outputs the virtual classification definition information created by the virtual classification definition means, the virtual classification display name of each virtual classification constituting the tree-like hierarchical structure indicated by the virtual classification definition information, and the file name of each search target file Virtual classification output means,
Is provided.

好ましくは、本発明のファイル検索装置は、
キーワードと、当該キーワードを識別するためのキーワード識別情報とを有する複数のキーワード辞書レコードを含むキーワード辞書ファイルを記憶するキーワード辞書ファイル記憶手段と、
前記キーワード辞書ファイルに含まれるキーワード辞書レコードのキーワード識別情報と、当該キーワード識別情報に関連する１以上の前記キーワード辞書ファイルに含まれるキーワード辞書レコードのキーワード識別情報とを有する複数のキーワード関連定義レコードを含むキーワード関連定義ファイルを記憶するキーワード関連定義ファイル記憶手段と、
前記メタデータ検索手段によって検索を行う際に、当該検索のメタデータに関する条件に含まれるキーワードと、前記キーワード辞書ファイルに含まれる各キーワード辞書レコードのキーワードおよびキーワード識別情報と、前記キーワード関連定義ファイルに含まれる各キーワード関連定義レコードのキーワード識別情報および当該キーワード識別情報に関連する１以上のキーワード識別情報とに基づいて、前記仮想分類定義レコードを生成する仮想分類動的生成手段と、
を備える。 Preferably, the file search device of the present invention is
Keyword dictionary file storage means for storing a keyword dictionary file including a plurality of keyword dictionary records having a keyword and keyword identification information for identifying the keyword;
A plurality of keyword related definition records having keyword identification information of a keyword dictionary record included in the keyword dictionary file and keyword identification information of a keyword dictionary record included in one or more of the keyword dictionary files related to the keyword identification information. Keyword-related definition file storage means for storing a keyword-related definition file including:
When a search is performed by the metadata search means, the keywords included in the conditions relating to the metadata of the search, the keywords and keyword identification information of each keyword dictionary record included in the keyword dictionary file, and the keyword related definition file Virtual classification dynamic generation means for generating the virtual classification definition record based on the keyword identification information of each keyword-related definition record included and one or more keyword identification information related to the keyword identification information;
Is provided.

好ましくは、本発明のファイル検索装置は、
前記インデックス２レコードが、当該インデックス２レコードに対応する検索対象ファイルを識別するためのファイル識別情報を含み、
関連定義レコードを識別するための関連定義レコード識別情報と、関連定義の名称を示す表示名と、インデックス２レコードを検索する検索条件を取得するための検索条件取得条件とを有する１以上の関連定義レコードを含む関連定義ファイルを記憶する関連定義ファイル記憶手段と、
前記メタデータ検索手段によって取得されたインデックス２レコードのファイル名およびファイルパスと、前記関連定義ファイル記憶手段に記憶されている関連定義ファイルの各関連定義レコードに含まれる表示名とを出力する関連定義情報出力手段と、
前記関連定義情報出力手段によって出力された各表示名の中から選択された表示名に対応する関連定義レコード識別情報と、選択された当該表示名とともに前記関連定義情報出力手段によって出力されたファイル名およびファイルパスに対応するインデックス２レコードのファイル識別情報とを受け付ける受付手段と、
前記受付手段によって受け付けられたファイル識別情報を含むインデックス２レコードと、前記受付手段によって受け付けられた関連定義レコード識別情報を含む前記関連定義ファイル記憶手段に記憶されている関連定義レコードの検索条件取得条件とに基づいて検索条件を取得する検索条件取得手段と、
前記インデックス２ファイルに含まれる各インデックス２レコードを検索して、前記検索条件取得手段によって取得された検索条件を満たすインデックス２レコードを取得し、取得した当該インデックス２レコードに含まれるファイル名とファイルパスを出力する関連検索手段と、
を備える。 Preferably, the file search device of the present invention is
The index 2 record includes file identification information for identifying a search target file corresponding to the index 2 record;
One or more relation definitions having relation definition record identification information for identifying the relation definition record, a display name indicating the name of the relation definition, and a search condition acquisition condition for acquiring the search condition for searching the index 2 record An associated definition file storage means for storing an associated definition file including a record;
Outputting said metadata search means acquires index 2 record by file name and file path, and a display name that is part of each relevant definition record associated definition files stored in the associated definition file storage unit Related definition information output means,
Related definition record identification information corresponding to the display name selected from each display name output by the related definition information output means, and the file name output by the related definition information output means together with the selected display name Receiving means for receiving the file identification information of the index 2 record corresponding to the file path ;
Search condition acquisition condition for related definition record stored in the related definition file storage means including the index 2 record containing the file identification information accepted by the accepting means and the related definition record identification information accepted by the accepting means a search condition acquisition unit that acquires a search condition based on bets,
Each index 2 record included in the index 2 file is searched, an index 2 record satisfying the search condition acquired by the search condition acquisition unit is acquired, and the file name and file path included in the acquired index 2 record Related search means for outputting
Is provided.

また、本発明のファイル検索プログラムは、
コンピュータを、
記憶装置から、複数の検索対象ファイルからなる第１の検索対象ファイル群と、複数の検索対象ファイルからなる第２の検索対象ファイル群を取得するファイル取得手段と、
前記ファイル取得手段によって取得された第１の検索対象ファイル群に含まれる各検索対象ファイルについて、当該各検索対象ファイルの名前であるファイル名と、当該各検索対象ファイルの記憶されている記憶位置を示すファイルパスと、当該各検索対象ファイルの内容から抽出された複数のキーワードとを有するインデックス１レコードを所定のタイミングで作成し、当該各インデックス１レコードを含むインデックス１ファイルを記憶するインデックス１ファイル記憶手段と、
前記ファイル取得手段によって取得された第２の検索対象ファイル群に含まれる各検索対象ファイルについて、当該各検索対象ファイルの名前であるファイル名と当該各検索対象ファイルの記憶されている記憶位置を示すファイルパスと、当該各検索対象ファイルの内容を代表する１以上のメタデータとを有するインデックス２レコードを所定のタイミングで作成し、当該各インデックス２レコードを含むインデックス２ファイルを記憶するインデックス２ファイル記憶手段と、
メタデータに関する条件と全文検索の条件のいずれか一方または両方を含む検索条件を受け付ける検索条件受付手段と、
前記検索条件受付手段によって受け付けられた検索条件がメタデータに関する条件を含む場合に、前記インデックス２ファイルに含まれる各インデックス２レコードを当該メタデータに関する条件に基づいて検索し、当該メタデータに関する条件を満たすインデックス２レコードを取得するメタデータ検索手段と、
前記検索条件受付手段によって受け付けられた検索条件がメタデータに関する条件および全文検索の条件を含む場合には、前記インデックス１ファイルに含まれる各インデックス１レコードのうち、前記メタデータ検索手段によって取得されたインデックス２レコードに含まれるファイルパスと同一のファイルパスを含む各インデックス１レコードを前記全文検索の条件に基づいて検索し、前記検索条件受付手段によって受け付けられた検索条件がメタデータに関する条件を含まず、全文検索の条件を含む場合には、前記インデックス１ファイルに含まれる全てのインデックス１レコードを前記全文検索の条件に基づいて検索し、前記全文検索の条件を満たすインデックス１レコードを取得するキーワード検索手段と、
して機能させる。 The file search program of the present invention is
Computer
A file acquisition means for acquiring a first search target file group consisting of a plurality of search target files and a second search target file group consisting of a plurality of search target files from the storage device;
For each search target file included in the first search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored Index 1 file storage for creating an index 1 record having a specified file path and a plurality of keywords extracted from the contents of each search target file at a predetermined timing, and storing the index 1 file including each index 1 record Means,
For each search target file included in the second search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored are shown. Index 2 file storage for creating an index 2 record having a file path and one or more metadata representing the contents of each search target file at a predetermined timing and storing an index 2 file including each index 2 record Means,
Search condition receiving means for receiving a search condition including one or both of a condition relating to metadata and a condition of full-text search;
When the search condition received by the search condition receiving means includes a condition related to metadata, each index 2 record included in the index 2 file is searched based on the condition related to the metadata, and the condition related to the metadata is set. A metadata search means for acquiring index 2 records satisfying;
When the search condition received by the search condition receiving means includes a metadata-related condition and a full-text search condition, it is acquired by the metadata search means out of each index 1 record included in the index 1 file . Each index 1 record including the same file path as the file path included in the index 2 record is searched based on the full-text search condition, and the search condition received by the search condition receiving means does not include a condition related to metadata. When the full text search condition is included, a keyword search that searches all index 1 records included in the index 1 file based on the full text search condition and obtains an index 1 record that satisfies the full text search condition. Means,
And make it work.

本発明によれば、全文検索を可能にすべきファイルと全文検索が不要なファイルとが存在しているファイル検索環境において、効率的にファイルを検索することができる。 ADVANTAGE OF THE INVENTION According to this invention, a file can be searched efficiently in the file search environment where the file which should enable full text search, and the file which does not require full text search exist.

第１の実施形態に係るファイル検索システムの構成の一例を示す図である。It is a figure which shows an example of a structure of the file search system which concerns on 1st Embodiment. インデックス１ファイルのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of an index 1 file. インデックス２ファイルのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of an index 2 file. インデックス２ファイルに含まれるシステムメタデータのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of the system metadata contained in an index 2 file. インデックス２ファイルに含まれる標準メタデータのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of the standard metadata contained in an index 2 file. 仮想分類定義ファイルのデータ構成の一例を示す図である。It is a figure which shows an example of the data structure of a virtual classification definition file. 関連定義ファイルのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of a related definition file. インデックス１作成部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of an index 1 preparation part. インデックス２作成部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of the index 2 preparation part. 検索要求部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of a search request | requirement part. 検索部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of a search part. インデックス１検索部の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of an index 1 search part. ログイン画面を例示した図である。It is the figure which illustrated the login screen. 検索要求画面を例示した図である。It is the figure which illustrated the search request screen. 検索要求画面を例示した図である。It is the figure which illustrated the search request screen. 複合検索における検索部等の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of the search part etc. in a compound search. 検索要求画面での検索結果の表示内容を例示した図である。It is the figure which illustrated the display contents of the search result on a search request screen. 検索要求画面での検索結果の表示内容を例示した図である。It is the figure which illustrated the display contents of the search result on a search request screen. 第２の実施形態に係るファイル検索システムの構成の一例を示す図である。It is a figure which shows an example of a structure of the file search system which concerns on 2nd Embodiment. キーワード辞書ファイルのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of a keyword dictionary file. キーワード関連定義ファイルのデータ構成の一例を示す図である。It is a figure which shows an example of a data structure of a keyword related definition file. 仮想分類動的生成における検索部等の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of the search part etc. in virtual classification dynamic generation. 検索要求画面を例示した図である。It is the figure which illustrated the search request screen.

以下、本発明の第１の実施形態に係るファイル検索システムについて図面を参照しながら説明する。
なお、本明細書と特許請求の範囲において、特に説明を付さない限り、「ファイル」とは、文書ファイル、画像ファイル等のファイルのみならず、データベース全体やデータベースの各レコード等も含み、閲覧、視聴、メール送受信あるいは外部記憶媒体へのコピー等の対象となりうる電子データを意味する。 Hereinafter, a file search system according to a first embodiment of the present invention will be described with reference to the drawings.
In the present specification and claims, unless otherwise specified, the “file” includes not only a file such as a document file and an image file, but also the entire database and each record of the database. This means electronic data that can be subject to viewing, mail transmission / reception, copying to an external storage medium, and the like.

＜ファイル検索システム全体の構成・機能＞
図１は、本発明の第１の実施形態に係るファイル検索システムの構成の一例を示す。
第１の実施形態に係るファイル検索システムは、ファイル検索サーバ１と、ファイルサーバ４と、クライアント３とを備える。ファイル検索サーバ１と、ファイルサーバ４と、クライアント３は、ＬＡＮ（Local Area Network）等の有線又は無線の通信回線９により通信可能に接続されている。
ここで、ファイル検索サーバ１、ファイルサーバ４及びクライアント３はそれぞれ１台ずつ図示しているが、それぞれ２台以上存在していてもよい。また、ファイル検索サーバ１と、ファイルサーバ４と、クライアント３とはそれぞれ異なる装置である必要はなく、例えば、ファイル検索サーバ１とファイルサーバ４とクライアント３の機能を１台の装置によって実現することも可能である。
通信回線９はＬＡＮに限定されるものではなく、例えばＷＡＮ（Wide Area Network）やインターネットとすることもできるし、これらの組み合わせとすることもできる。 <Configuration and function of the entire file search system>
FIG. 1 shows an example of the configuration of a file search system according to the first embodiment of the present invention.
The file search system according to the first embodiment includes a file search server 1, a file server 4, and a client 3. The file search server 1, the file server 4, and the client 3 are communicably connected via a wired or wireless communication line 9 such as a LAN (Local Area Network).
Here, one file search server 1, one file server 4, and one client 3 are illustrated, but two or more each may exist. The file search server 1, the file server 4, and the client 3 do not have to be different devices. For example, the functions of the file search server 1, the file server 4, and the client 3 are realized by a single device. Is also possible.
The communication line 9 is not limited to a LAN, and can be, for example, a WAN (Wide Area Network), the Internet, or a combination thereof.

後に詳述するように、ファイル検索サーバ１はファイルサーバ４に記憶されているファイルの名称等を収集して、インデックスファイルを作成して記憶しておく。そして、クライアント３がファイル検索サーバ１にファイル検索要求（以下「検索要求」と略記することがある。）を送信すると、ファイル検索サーバ１はそのインデックスファイルを参照してファイル検索を行い、クライアント３にファイル検索結果（以下「検索結果」と略記することがある。）を送信する。 As will be described in detail later, the file search server 1 collects file names and the like stored in the file server 4 to create and store an index file. When the client 3 transmits a file search request (hereinafter sometimes abbreviated as “search request”) to the file search server 1, the file search server 1 performs a file search with reference to the index file, and the client 3 The file search result (hereinafter may be abbreviated as “search result”) is sent to.

＜クライアント３の構成・機能＞
クライアント３はコンピュータであり、入力装置３２、表示装置３３、図示しない主記憶装置および記憶装置を備える。
入力装置３２はキーボード、マウス等を含む。クライアント３の操作者は、入力装置３２を操作することで、クライアント３が実行するべき処理を指示することができる。すなわち入力装置３２はクライアント３の入力手段として機能する。
表示装置３３は液晶ディスプレイ、プリンタ等を含む。表示装置３３は、クライアント３が実行した処理の結果等を表示あるいは印刷する。すなわち表示装置３３はクライアント３の表示手段および出力手段として機能する。
主記憶装置はＲＡＭ（Random
Access Memory）等を含む。記憶装置は磁気ディスク等を含み、クライアント３に内蔵されるか、または外部接続される。主記憶装置と記憶装置は、クライアント３の記憶手段として機能する。 <Configuration and function of client 3>
The client 3 is a computer and includes an input device 32, a display device 33, a main storage device and a storage device (not shown).
The input device 32 includes a keyboard, a mouse, and the like. The operator of the client 3 can instruct a process to be executed by the client 3 by operating the input device 32. That is, the input device 32 functions as input means for the client 3.
The display device 33 includes a liquid crystal display, a printer, and the like. The display device 33 displays or prints the result of processing executed by the client 3. That is, the display device 33 functions as a display unit and an output unit of the client 3.
The main memory is RAM (Random
Access Memory). The storage device includes a magnetic disk or the like, and is built in the client 3 or externally connected. The main storage device and the storage device function as storage means of the client 3.

クライアント３は、図示していないがＣＰＵ（Central Processing Unit）を備える。主記憶装置には検索要求プログラム等のプログラムが記憶されている。ＣＰＵがその命令コードを実行することにより検索要求部３１等の機能が実現される。
また、ＣＰＵは、検索要求プログラム等の応用プログラムの命令コードを実行する際、ＯＳ（Operating System）等のプログラムの命令コードも実行する場合がある。以上のようなプログラム実行にかかわる技術は周知であるので、以降の説明および図面においては、検索要求部３１というように、あたかも各種プログラムにより実現される処理を実行するハードウェアが存在するかのように記載する。なお、実際に各部（例えば検索要求部３１）をハードウェアで構成することも可能である。 Although not shown, the client 3 includes a CPU (Central Processing Unit). A program such as a search request program is stored in the main storage device. The function of the search request unit 31 and the like is realized by the CPU executing the instruction code.
Further, when executing the instruction code of an application program such as a search request program, the CPU may also execute the instruction code of a program such as an OS (Operating System). Since the technique related to program execution as described above is well known, in the following description and drawings, it is as if there is hardware that executes processing realized by various programs, such as the search request unit 31. It describes. It should be noted that each unit (for example, the search request unit 31) can actually be configured by hardware.

検索要求部３１は、クライアント３の操作者が入力装置３２によって入力したファイル検索条件およびファイル検索指示（例えば、ファイル名が「作業報告」であるファイルを検索するように、という指示）を受けて、検索条件式を作成し、ファイル検索サーバ１に検索条件式を含む検索要求を送信する。
また、ファイル検索サーバ１がクライアント３に送信した検索結果を受信し、表示装置３３に表示する。検索要求部３１を実現するための検索要求プログラムは、第１の実施形態に係わる独自のプログラムであってもよいし、例えばＷＥＢブラウザのプログラムであってもよい。ＷＥＢブラウザを検索要求部３１として使用する場合、後述するファイル検索サーバ１の検索部１３は、例えばＷＥＢアプリケーションとすればよい。 The search request unit 31 receives a file search condition and a file search instruction (for example, an instruction to search for a file whose file name is “work report”) input by the operator of the client 3 using the input device 32. The search condition formula is created, and a search request including the search condition formula is transmitted to the file search server 1.
Further, the search result transmitted from the file search server 1 to the client 3 is received and displayed on the display device 33. The search request program for realizing the search request unit 31 may be a unique program according to the first embodiment, or may be a WEB browser program, for example. When a WEB browser is used as the search request unit 31, the search unit 13 of the file search server 1 to be described later may be a WEB application, for example.

＜ファイルサーバ４の構成・機能＞
ファイルサーバ４はコンピュータであり、記憶装置４２を備える。
記憶装置４２は磁気ディスク等を含む。記憶装置４２はファイルサーバ４に内蔵され、または外部接続される。図１では、１台の記憶装置４２がファイルサーバ４に接続されているかのように例示しているが、実際には、むしろ２台以上の記憶装置４２がファイルサーバ４に接続されることが多く、また、２台以上の記憶装置４２が２台以上のファイルサーバ４に、切り替え可能に接続されることも多い。
記憶装置４２にはさまざまなファイルが記憶されているが、図１では、ファイルのうち後述するファイル検索サーバ１がインデックス作成対象とするものを、「検索対象ファイル４３」と記述している。すなわち、検索対象ファイル４３という特定のファイルが存在するのではなく、検索対象ファイル４３は、例えば記憶装置４２に記憶されている全てのファイルであってもよいし、あるいは、特定のフォルダ内の全てのファイル等、記憶装置４２に記憶されている一部のファイルであってもよい。
検索対象ファイル４３には、オーナー、グループ、その他利用者に対する、参照・更新・実行権限を表すアクセス権限を設定することができる。 <Configuration / Function of File Server 4>
The file server 4 is a computer and includes a storage device 42.
The storage device 42 includes a magnetic disk or the like. The storage device 42 is built in the file server 4 or connected externally. In FIG. 1, an example is shown as if one storage device 42 is connected to the file server 4, but actually, two or more storage devices 42 may rather be connected to the file server 4. In many cases, two or more storage devices 42 are connected to two or more file servers 4 in a switchable manner.
Various files are stored in the storage device 42. In FIG. 1, a file to be indexed by the file search server 1 described later is described as a “search target file 43”. That is, a specific file called the search target file 43 does not exist, and the search target file 43 may be, for example, all the files stored in the storage device 42 or all the files in the specific folder Some of the files stored in the storage device 42 may be used.
In the search target file 43, access authority representing reference / update / execution authority for the owner, group, and other users can be set.

ファイルサーバ４は、図示していないがＣＰＵと、ＲＡＭ等で構成された主記憶装置とを備える。主記憶装置にはファイル管理プログラム等のプログラムが記憶されている。ＣＰＵがその命令コードを実行することによりファイル管理部４１等の機能が実現される。ファイル管理部４１は、検索対象ファイル４３を含め、記憶装置４２に記憶されているファイルの記憶位置等を管理する。また、ファイル管理部４１は、ファイル検索サーバ１からファイルの格納場所等を受信し、当該格納場所に記憶された検索対象ファイル４３等を読み込み、当該ファイルの記憶内容をファイル検索サーバ１に送信する機能も有している。 Although not shown, the file server 4 includes a CPU and a main storage device including a RAM and the like. A program such as a file management program is stored in the main storage device. Functions of the file management unit 41 and the like are realized by the CPU executing the instruction code. The file management unit 41 manages the storage location and the like of files stored in the storage device 42 including the search target file 43. In addition, the file management unit 41 receives the storage location of the file from the file search server 1, reads the search target file 43 stored in the storage location, and transmits the storage content of the file to the file search server 1. It also has a function.

＜ファイル検索サーバ１の構成・機能＞
ファイル検索サーバ１はコンピュータであり、記憶装置２と、図示しない主記憶装置とを備える。記憶装置２は磁気ディスク等を含む。記憶装置２はファイルサーバ１に内蔵されるか、または外部接続される。図１では、１台の記憶装置２がファイル検索サーバ１に接続されているかのように例示しているが、２台以上であってもよい。主記憶装置はＲＡＭ等で構成される。記憶装置２と主記憶装置は、ファイルサーバ１の記憶手段として機能する。
記憶装置２には、インデックス１ファイル２１、インデックス２ファイル２２、仮想分類定義ファイル２３、関連定義ファイル２４、キーワード辞書ファイル２５とキーワード関連定義ファイル２６が記憶されている。これらのファイルの記憶内容等については、以降のファイル検索サーバ１の機能説明と合わせて説明する。 <Configuration / Function of File Search Server 1>
The file search server 1 is a computer and includes a storage device 2 and a main storage device (not shown). The storage device 2 includes a magnetic disk or the like. The storage device 2 is built in the file server 1 or connected externally. In FIG. 1, an example is shown as if one storage device 2 is connected to the file search server 1, but there may be two or more. The main storage device is composed of a RAM or the like. The storage device 2 and the main storage device function as storage means of the file server 1.
The storage device 2 stores an index 1 file 21, an index 2 file 22, a virtual classification definition file 23, a related definition file 24, a keyword dictionary file 25, and a keyword related definition file 26. The contents stored in these files will be described together with the functional description of the file search server 1 hereinafter.

ファイル検索サーバ１は、図示していないがＣＰＵを備える。ファイル検索サーバ１の主記憶装置には、ファイル検索プログラム等のプログラムが記憶されている。ＣＰＵがその命令コードを実行することにより、インデックス１作成部１１、インデックス２作成部１２、検索部１３、インデックス１検索部１４の各機能が実現される。
インデックス１作成部１１は、例えば毎日同じ時刻等の所定のタイミングで、検索対象ファイル４３を参照し、インデックス１ファイル２１を作成する。インデックス１ファイル２１には、後述するように、ファイル名、ファイル内容から抽出したキーワード等が記憶される。
どのファイルを検索対象ファイル４３とするかは、例えば、図示していないが、記憶装置２内の所定のファイルに予め記憶装置４２のファイルパス（“/etc/usr1/”等）を１以上記憶しておき、当該ファイルパス配下のファイルを検索対象ファイル４３とすることができる。なお、記憶装置４２が２台以上存在する場合等には、当該ファイルパスがどの記憶装置上に存在するか等も含めて記憶しておけばよい。以上のようにインデックス１ファイル２１を作成することで、インデックス１ファイル２１を参照してファイル検索することが可能になり、検索の都度、検索対象ファイル４３を参照する必要がなくなるので、検索に要する処理時間を短縮することができる。 The file search server 1 includes a CPU (not shown). A program such as a file search program is stored in the main storage device of the file search server 1. When the CPU executes the instruction code, each function of the index 1 creation unit 11, the index 2 creation unit 12, the search unit 13, and the index 1 search unit 14 is realized.
The index 1 creation unit 11 creates the index 1 file 21 with reference to the search target file 43 at a predetermined timing such as the same time every day. As will be described later, the index 1 file 21 stores a file name, keywords extracted from the file contents, and the like.
Which file is the search target file 43 is not shown, for example, but one or more file paths (such as “/ etc / usr1 /”) of the storage device 42 are stored in advance in a predetermined file in the storage device 2. In addition, the file under the file path can be set as the search target file 43. When there are two or more storage devices 42, etc., the storage device including the storage device on which the file path exists may be stored. By creating the index 1 file 21 as described above, it becomes possible to search for a file by referring to the index 1 file 21, and it is not necessary to refer to the search target file 43 every time it is searched. Processing time can be shortened.

なお、以上のようにして作成されたインデックス１ファイル２１を参照して検索するとき、インデックス１ファイル２１作成後に検索対象ファイル４３が削除され、あるいは変更されている場合がある。このような場合に、インデックス１ファイル２１を検索すると、検索対象ファイル４３を検索した場合とは異なる検索結果が得られることになる。例えばファイル名が「作業報告」であるファイルについてインデックス１ファイル２１が作成された後、当該ファイルが削除された場合、インデックス１ファイル２１を参照するとファイル名が「作業報告」のファイルが存在しているように見えるが、検索対象ファイル４３には存在していない。
そこで、前述したように、インデックス１作成部１１は、例えば毎日同じ時刻等の所定のタイミングで処理を行うようにすることができる。このようにすることで、インデックス１ファイル２１が定期的に更新され、検索対象ファイル４３と大きく乖離することを防ぐことができる。 When searching with reference to the index 1 file 21 created as described above, the search target file 43 may be deleted or changed after the index 1 file 21 is created. In such a case, when the index 1 file 21 is searched, a search result different from that when the search target file 43 is searched is obtained. For example, if an index 1 file 21 is created for a file whose file name is “work report” and then the file is deleted, when the index 1 file 21 is referenced, there is a file whose file name is “work report”. The search target file 43 does not exist.
Therefore, as described above, the index 1 creation unit 11 can perform processing at a predetermined timing such as the same time every day. By doing so, it is possible to prevent the index 1 file 21 from being periodically updated and greatly deviating from the search target file 43.

そして、インデックス１作成部１１が処理を行うタイミングの間隔を短くすることで（例えば、１時間おきに処理を行うようにすることで）、インデックス１ファイル２１と検索対象ファイル４３の乖離を、より小さくすることができる。ただし、インデックス１作成部１１が処理を行うタイミングの間隔を短くするということは、検索対象ファイル４３の全ファイルに対するＩＯが発生する間隔も短くなるということなので、ファイルサーバ４の性能等も考慮した上で、インデックス１作成部１１が処理を行うタイミングの間隔を決める必要がある。
例えば、ファイルサーバ４において、ＣＰＵ使用率、一定時間におけるＩＯ頻度等を常時監視するプログラムを走行させておき、ＣＰＵ使用率、一定時間におけるＩＯ頻度等が所定値以下になった場合に、その旨をインデックス１作成部１１に送信し、インデックス１作成部１１が処理開始する等も、有効な方法である。あるいは、ファイルサーバ４において、検索対象ファイル４３におけるＩＯを常時監視するプログラムを走行させておき、検索対象ファイル４３が更新等されるたびに、インデックス１作成部１１にその旨を通知し、インデックス１ファイル２１内の当該ファイルについてのインデックス情報が更新されるようにしてもよい。 Then, by shortening the interval of the timing at which the index 1 creation unit 11 performs processing (for example, by performing processing every hour), the divergence between the index 1 file 21 and the search target file 43 is further increased. Can be small. However, shortening the interval at which the index 1 creation unit 11 performs processing means that the interval at which IO is generated for all files of the search target file 43 is also shortened, so the performance of the file server 4 is also taken into consideration. In the above, it is necessary to determine the timing interval at which the index 1 creation unit 11 performs processing.
For example, when the file server 4 runs a program that constantly monitors the CPU usage rate, the IO frequency at a certain time, etc., and the CPU usage rate, the IO frequency at a certain time, etc. become below a predetermined value, that fact Is also transmitted to the index 1 creation unit 11 and the index 1 creation unit 11 starts processing. Alternatively, in the file server 4, a program that constantly monitors the IO in the search target file 43 is run, and whenever the search target file 43 is updated, the index 1 creation unit 11 is notified of this, and the index 1 The index information for the file in the file 21 may be updated.

インデックス２作成部１２も、インデックス１作成部１１と同様に、例えば毎日同じ時刻等の所定のタイミングで、検索対象ファイル４３を参照し、インデックス２ファイル２２を作成する。第１の実施形態においては、インデックス２ファイル２２には、後述するように、たとえば、ファイルの内容から判断したファイルに記憶されている文書のタイトル等が記憶される。すなわち、インデックス１ファイル２１の記憶内容と、インデックス２ファイル２２の記憶内容は一部が重複するとしても、完全には同一でない。 Similarly to the index 1 creation unit 11, the index 2 creation unit 12 also creates the index 2 file 22 by referring to the search target file 43 at a predetermined timing such as the same time every day. In the first embodiment, as will be described later, for example, the index 2 file 22 stores the title of the document stored in the file determined from the contents of the file. That is, the stored contents of the index 1 file 21 and the stored contents of the index 2 file 22 are not completely the same, even if some of them overlap.

どのファイルを検索対象ファイル４３とするかは、インデックス１作成部１１と同様に、記憶装置２内の所定のファイルに予め記憶装置４２のファイルパス（“/etc/usr1/”等）を１以上記憶しておき、当該ファイルパス配下のファイルを検索対象ファイル４３とすることができる。インデックス１作成部１１の検索対象ファイル４３とインデックス２作成部１２の検索対象ファイル４３は同一であってもよいし、一部が重複してもよいし、全く異なっていても構わない。 As with the index 1 creation unit 11, which file is the search target file 43 is set to one or more file paths (such as “/ etc / usr1 /”) of the storage device 42 in advance in a predetermined file in the storage device 2. The file under the file path can be stored as the search target file 43. The search target file 43 of the index 1 creation unit 11 and the search target file 43 of the index 2 creation unit 12 may be the same, a part of them may overlap, or they may be completely different.

なお、以上のようにして作成されたインデックス２ファイル２２を参照して検索するとき、インデックス１ファイル２１を参照して検索するときと同様に、インデックス２ファイル２２の記憶内容と検索対象ファイル４３の内容が乖離することがあるという問題が発生する。そして、この問題は、インデックス１作成部１１と同様に、インデックス２作成部１２が処理を行うタイミングの間隔を短くする等によって解決することができる。 When searching with reference to the index 2 file 22 created as described above, the stored contents of the index 2 file 22 and the search target file 43 are searched in the same manner as when searching with reference to the index 1 file 21. There arises a problem that the contents may deviate. This problem can be solved by shortening the interval of the timing at which the index 2 creation unit 12 performs processing, as with the index 1 creation unit 11.

さらに、インデックス１ファイル２１とインデックス２ファイル２２の作成タイミングが異なると、インデックス１ファイル２１とインデックス２ファイル２２の内容が乖離することがあるという問題も発生する。この問題の解決方法のひとつは、インデックス１作成部１１とインデックス２作成部１２のインデックスファイル作成処理開始タイミングを一致させることである。具体的には、例えば、インデックス２作成部１２がインデックスファイル作成処理を開始する直前に、インデックス１作成部１１にインデックス作成処理開始要求を送信し、インデックス１作成部１１は当該要求を受信するとインデックス作成処理を開始するようにすればよい。 Furthermore, when the creation timings of the index 1 file 21 and the index 2 file 22 are different, the contents of the index 1 file 21 and the index 2 file 22 may be different from each other. One solution to this problem is to match the index file creation processing start timings of the index 1 creation unit 11 and the index 2 creation unit 12. Specifically, for example, immediately before the index 2 creation unit 12 starts the index file creation process, an index creation process start request is transmitted to the index 1 creation unit 11, and when the index 1 creation unit 11 receives the request, the index 1 creation unit 11 The creation process may be started.

もっとも、インデックスファイル作成処理開始タイミングが一致しても、各検索対象ファイル４３を参照するタイミングが一致するとは限らないが、インデックス１ファイル２１とインデックス２ファイル２２の内容が多少乖離していたとしても、検索目的での使用に大きな支障があるわけではない。従って、インデックスファイル作成処理開始タイミングを一致させれば、ほとんどの場合、実用的には問題はない。
ただし、インデックス１ファイル２１とインデックス２ファイル２２の内容の乖離を完全に防止する必要がある場合がある。このような場合には、例えば、ファイルサーバ４において、検索対象ファイル４３におけるＩＯを常時監視するプログラムを走行させておき、検索対象ファイル４３が更新等されるたびに、インデックス１ファイル２１及びインデックス２ファイル２２内の当該ファイルについての情報を更新するようにしてもよい。 Of course, even if the index file creation processing start timings coincide, the timings of referring to the search target files 43 do not necessarily coincide, but even if the contents of the index 1 file 21 and the index 2 file 22 are slightly different from each other This does not mean that there will be a major hindrance to use for search purposes. Therefore, if the index file creation processing start timings are matched, there is no practical problem in most cases.
However, there may be a case where it is necessary to completely prevent the difference between the contents of the index 1 file 21 and the index 2 file 22. In such a case, for example, in the file server 4, a program for constantly monitoring the IO in the search target file 43 is run, and each time the search target file 43 is updated, the index 1 file 21 and the index 2. Information about the file in the file 22 may be updated.

検索部１３は、ファイル検索サーバ１がクライアント３からの検索要求を受信した場合に起動され、ファイル検索を実行する。具体的には、インデックス１ファイル２１とインデックス２ファイル２２の一方又は両方を参照して、検索要求の検索条件式に合致するファイルが存在するかどうか判定し、存在する場合には、合致したファイルのファイル名等からなる検索結果をクライアント３に送信する。 The search unit 13 is activated when the file search server 1 receives a search request from the client 3 and executes a file search. More specifically, one or both of the index 1 file 21 and the index 2 file 22 are referred to determine whether there is a file that matches the search condition expression of the search request. The search result including the file name and the like is transmitted to the client 3.

ここで、検索部１３は、インデックス１ファイル２１を参照する必要がある場合、インデックス１検索部１４に検索条件式を渡し、インデックス１検索部１４は、インデックス１ファイル２１を参照して、受け取った検索条件式に合致するファイルが存在するかどうか判定する。インデックス１ファイル２１には、ファイル名、ファイル内容から抽出したキーワードが記憶されているため、インデックス１検索部１４による検索は、検索部１３の検索機能の一部を補完することができる。
後述するように、第１の実施形態においては、クライアント３の操作者がいわゆる全文検索を要求した場合に、インデックス１検索部１４による検索を行う。検索部１３自体の機能から全文検索機能を分離することにより、インデックス１検索部１４を実現するためのプログラムとして、既に存在するさまざまな全文検索プログラムを使用することが可能になり、例えば、検索要求から社会科学分野の文献の検索が求められているのか、自然科学分野の文献の検索が求められているのか等の検索目的を判定して、検索目的に合致したインデックス１検索部１４によって全文検索を行うといったことも可能になる。 Here, when it is necessary to refer to the index 1 file 21, the search unit 13 passes the search condition expression to the index 1 search unit 14, and the index 1 search unit 14 refers to the index 1 file 21 and receives it. It is determined whether there is a file that matches the search condition expression. Since the index 1 file 21 stores keywords extracted from the file name and file contents, the search by the index 1 search unit 14 can complement a part of the search function of the search unit 13.
As will be described later, in the first embodiment, when the operator of the client 3 requests a so-called full-text search, the index 1 search unit 14 performs a search. By separating the full-text search function from the function of the search unit 13 itself, various existing full-text search programs can be used as programs for realizing the index 1 search unit 14, for example, search requests A full text search is performed by the index 1 search unit 14 that matches a search purpose, such as whether a search for a document in the social science field is requested or whether a search for a document in the natural science field is required. It is also possible to do.

検索部１３は、検索対象ファイル４３をインデックス２ファイル２２に記憶された文書のタイトル等のメタデータ検索キーワードにより分類して、ツリー状の階層を作成する機能（以下「仮想分類機能」という。）、言い換えれば、インデックス２レコード２２０（図３を参照して後で説明する。）を分類する機能を有しており、仮想分類を行う際には、分類条件等を定義した仮想分類定義ファイル２３を参照する。 The search unit 13 classifies the search target file 43 according to metadata search keywords such as document titles stored in the index 2 file 22 and creates a tree-like hierarchy (hereinafter referred to as “virtual classification function”). In other words, it has a function of classifying the index 2 record 220 (to be described later with reference to FIG. 3), and when performing virtual classification, a virtual classification definition file 23 that defines classification conditions and the like. Refer to

検索部１３は、メタデータ検索のキーワードに関連するキーワードにより仮想分類定義を動的に生成する機能（以下「仮想分類動的生成機能」という。）を有している。仮想分類動的生成機能で生成した仮想分類定義を参照し、仮想分類機能を使用することでメタデータ検索キーワードに関連したキーワードによる分類を行うことができる。
仮想分類動的生成を行う際には、キーワードを登録するキーワード辞書ファイル２５とキーワード辞書ファイル２５に登録されたキーワードの関連を定義したキーワード関連定義ファイル２６を参照する。当然ながら、キーワード辞書ファイル２５とキーワード関連定義ファイル２６を作成・更新等するプログラムも存在するが、本発明とは直接関係がないので、以降の説明ではキーワード辞書ファイル２５とキーワード関連定義ファイル２６が作成済みであるという前提で説明する。 The search unit 13 has a function (hereinafter referred to as “virtual classification dynamic generation function”) that dynamically generates a virtual classification definition using a keyword related to a keyword for metadata search. By using the virtual classification function with reference to the virtual classification definition generated by the virtual classification dynamic generation function, classification using keywords related to the metadata search keyword can be performed.
When the virtual classification dynamic generation is performed, the keyword dictionary file 25 for registering keywords and the keyword association definition file 26 that defines the association of keywords registered in the keyword dictionary file 25 are referred to. Of course, there are programs for creating / updating the keyword dictionary file 25 and the keyword related definition file 26. However, since they are not directly related to the present invention, the keyword dictionary file 25 and the keyword related definition file 26 will be described in the following description. It will be explained on the assumption that it has been created.

また、検索部１３は、インデックス２ファイル２２に記憶された、検索対象ファイル４３の記憶装置４２上の記憶位置を参照して、ツリー状の階層を作成する機能（以下「物理階層作成機能」という。）を有している。
さらに、検索部１３は、検索結果と関連するファイルを検索する機能（以下「関連検索機能」という。）を有しており、関連検索を行う際には、関連検索条件等を定義した関連定義ファイル２４を参照する。当然ながら、関連定義ファイル２４を作成・更新等するプログラムも存在するが、本発明とは直接関係がないので、以降の説明では関連定義ファイル２４が作成済みであるという前提で説明する。 The search unit 13 refers to the storage position of the search target file 43 stored in the index 2 file 22 on the storage device 42 and creates a tree-like hierarchy (hereinafter referred to as “physical hierarchy creation function”). .)have.
Further, the search unit 13 has a function of searching for a file related to the search result (hereinafter referred to as “related search function”), and when performing a related search, a related definition that defines a related search condition or the like. Refer to the file 24. Of course, there is a program for creating / updating the related definition file 24, but since it is not directly related to the present invention, the following description will be made on the assumption that the related definition file 24 has been created.

＜各ファイルの構成・機能＞
図２は、インデックス１ファイル２１のデータ構成の一例を示す。
インデックス１ファイル２１は、各検索対象ファイル４３と対応するインデックス１レコード２１０から構成される。すなわち各インデックス１レコード２１０は、インデックス１レコード２１０作成時点の検索対象ファイル４３と１対１に対応している。
インデックス１レコード２１０は、ファイル名２１１、ファイルパス２１２、アクセス権限２１３およびキーワード２１４の各データ項目から構成される。 <Configuration and function of each file>
FIG. 2 shows an example of the data structure of the index 1 file 21.
The index 1 file 21 includes index 1 records 210 corresponding to the search target files 43. That is, each index 1 record 210 has a one-to-one correspondence with the search target file 43 when the index 1 record 210 is created.
The index 1 record 210 includes data items of a file name 211, a file path 212, an access authority 213, and a keyword 214.

ファイル名２１１には、対応する検索対象ファイル４３のファイル名、例えば“作業報告１.doc”が設定される。
ファイルパス２１２には、対応する検索対象ファイル４３の絶対パス、例えば“//etc/usr1/作業報告１.doc” （ルート直下の”etc“フォルダ内の”usr1“フォルダ内の”作業報告１.doc“）が設定される。なお、ファイルパス２１２によって検索対象ファイル４３の記憶装置４２上の記憶位置を識別することができるが、ファイルサーバ４に複数の記憶装置４２が接続されている場合、記憶装置４２を特定する識別や、論理ボリューム名等も、ファイルパス２１２の一部として、又はファイルパス２１２とは別のデータ項目として設定すればよい。また、ファイルの絶対パス以外の情報、例えば予め定めた所定のファイルを基準とする相対パス、あるいはファイルが記憶されている論理ブロック番号によっても、検索対象ファイル４３の記憶位置を識別することは可能である。インデックス１レコード２１０には、ファイルパス２１２に変えて、又はファイルパス２１２に加えて、このようなデータ項目を設けてもよい。 In the file name 211, the file name of the corresponding search target file 43, for example, “work report 1.doc” is set.
The file path 212 includes an absolute path of the corresponding search target file 43, for example, “// etc / usr1 / work report 1.doc” (work report 1 in the “usr1” folder in the “etc” folder immediately below the root). .doc ") is set. Note that the storage location of the search target file 43 on the storage device 42 can be identified by the file path 212, but when a plurality of storage devices 42 are connected to the file server 4, The logical volume name and the like may be set as a part of the file path 212 or as a data item different from the file path 212. It is also possible to identify the storage location of the search target file 43 by information other than the absolute path of the file, for example, a relative path based on a predetermined file or a logical block number in which the file is stored. It is. Such data items may be provided in the index 1 record 210 in place of or in addition to the file path 212.

アクセス権限２１３には、対応する検索対象ファイル４３に対して設定されているアクセス権限が設定される。具体的には、例えばファイルサーバ４のファイル管理部４１等によって付与され、ファイルの属性情報として記憶されているアクセス権限（例えば、ＵＮＩＸ（登録商標）等で使われている、オーナー、グループ、その他利用者に対する、参照・更新・実行権限を表す３桁の数値（７７７等））が設定される。
キーワード２１４には、対応する検索対象ファイル４３の内容から抽出したキーワードが設定される。例えば、検索対象ファイル４３の内容に“現場”という文字列が複数個含まれている場合“現場”を抽出する等、さまざまな構文解析手法等によって、１以上のキーワードを抽出し、キーワード２１４に設定することができる。一般に、キーワード２１４には多数の語句等が記憶され、インデックス１レコード２１０のサイズの大部分がキーワード２１４のために使用される。
なお、以上のほか、ファイル作成者等、他のファイル属性情報についても、インデックス１レコード２１０のデータ項目とすることができる。
以上のようにして作成されたインデックス１レコード２１０（インデックス情報）は、インデックス１検索部１４により参照される。 In the access authority 213, the access authority set for the corresponding search target file 43 is set. More specifically, for example, the access authority (for example, UNIX (registered trademark), etc., which is given by the file management unit 41 of the file server 4 and stored as file attribute information, etc., owner, group, etc. A three-digit numerical value (such as 777) representing the reference / update / execution authority for the user is set.
In the keyword 214, a keyword extracted from the content of the corresponding search target file 43 is set. For example, when the search target file 43 includes a plurality of character strings “site”, one or more keywords are extracted by various syntax analysis methods such as extracting “site”, and the keyword 214 is extracted. Can be set. In general, a large number of phrases and the like are stored in the keyword 214, and most of the size of the index 1 record 210 is used for the keyword 214.
In addition to the above, other file attribute information such as a file creator can also be a data item of the index 1 record 210.
The index 1 record 210 (index information) created as described above is referred to by the index 1 search unit 14.

図３は、インデックス２ファイル２２のデータ構成の一例を示す。
インデックス２ファイル２２は、各検索対象ファイル４３と対応するインデックス２レコード２２０から構成される。すなわち各インデックス２レコード２２０は、インデックス２レコード２２０作成時点の検索対象ファイル４３と１対１に対応している。
インデックス２レコード２２０は、システムメタデータ２２１、標準メタデータ２２２及びユーザ定義メタデータ２２３から構成される。
システムメタデータ２２１はインデックス２作成部１２によって設定され、利用者がその設定内容を直接変更することはできない。
一方、標準メタデータ２２２にはインデックス２作成部１２によってファイルの内容を代表するメタデータが設定される。図１に示していないが、利用者が、ファイル検索サーバ１のＣＰＵにメタデータ変更プログラムを実行させて、標準メタデータ２２２の設定内容を直接変更することができる。
また、ユーザ定義メタデータ２２３は利用者がデータ構造を定義し、ファイルの内容を代表するメタデータを設定・変更するデータ項目である。インデックス２作成部１２は、ユーザ定義メタデータ２２３を設定しない。 FIG. 3 shows an example of the data structure of the index 2 file 22.
The index 2 file 22 includes index 2 records 220 corresponding to the search target files 43. That is, each index 2 record 220 has a one-to-one correspondence with the search target file 43 when the index 2 record 220 is created.
The index 2 record 220 includes system metadata 221, standard metadata 222, and user-defined metadata 223.
The system metadata 221 is set by the index 2 creation unit 12, and the user cannot change the setting contents directly.
On the other hand, metadata representing the contents of the file is set in the standard metadata 222 by the index 2 creation unit 12. Although not shown in FIG. 1, the user can directly change the setting contents of the standard metadata 222 by causing the CPU of the file search server 1 to execute a metadata change program.
User-defined metadata 223 is a data item in which a user defines a data structure and sets / changes metadata representing the contents of a file. The index 2 creation unit 12 does not set the user-defined metadata 223.

図４は、システムメタデータ２２１のデータ構成の一例を示す図である。
システムメタデータ２２１は、ファイルＩＤ２２１ａ、ファイル名２２１ｂ及びファイルパス２２１ｃから構成される。
ファイルＩＤ２２１ａには、対応する検索対象ファイル４３を一意に識別可能なＩＤ（Identifier）が設定される。具体的には、例えば、新たに検索対象ファイル４３のインデックス２レコード２２０を作成するたびに、１から始まる通番を設定すればよい。
ファイル名２２１ｂには、インデックス１レコード２１０のファイル名２１１と同様、対応する検索対象ファイル４３のファイル名、例えば“作業報告１.doc”が設定される。
ファイルパス２２１ｃには、インデックス１レコード２１０のファイルパス２１２と同様に、例えば対応する検索対象ファイル４３の絶対パス（“//etc/usr1/作業報告１.doc”等）が設定される。
なお、以上のほか、ファイル作成者、アクセス権限等、他のファイル属性情報についても、システムメタデータ２２１のデータ項目とすることができる。 FIG. 4 is a diagram illustrating an example of the data configuration of the system metadata 221.
The system metadata 221 includes a file ID 221a, a file name 221b, and a file path 221c.
In the file ID 221a, an ID (Identifier) capable of uniquely identifying the corresponding search target file 43 is set. Specifically, for example, each time a new index 2 record 220 of the search target file 43 is created, a serial number starting from 1 may be set.
Similarly to the file name 211 of the index 1 record 210, the file name of the corresponding search target file 43, for example, “work report 1.doc” is set in the file name 221b.
Similar to the file path 212 of the index 1 record 210, for example, the absolute path (“// etc / usr1 / work report 1.doc” or the like) of the corresponding search target file 43 is set in the file path 221c.
In addition to the above, other file attribute information such as file creator and access authority can also be used as data items of the system metadata 221.

図５は、標準メタデータ２２２のデータ構成の一例を示す図である。
標準メタデータ２２２は、タイトル２２２ａ、文書記載日２２２ｂ及びセキュリティランク２２２ｃから構成される。
タイトル２２２ａには、対応する検索対象ファイル４３に記憶されている文書等の題名が“作業報告書”のように設定される。具体的には、インデックス２作成部１２は、例えば、当該ファイルを印刷する場合の表示イメージを構築し、先頭ページ上段に、他の文字より大きな文字サイズで印刷される文字列を文書等の題名とみなし、タイトル２２２ａに設定する。
文書記載日２２２ｂには、対応する検索対象ファイル４３に記憶されている文書等が記載された日付が“２００９年８月５日”のように設定される。具体的には、インデックス２作成部１２は、例えば、当該ファイルを印刷したときに先頭ページ上段に印刷される文字列のうち、“年”、“月”、“日”、“作成”を含む等、作成日付らしい文字列を文書等の作成日付とみなし、文書記載日２２２ｂに設定する。 FIG. 5 is a diagram illustrating an example of a data configuration of the standard metadata 222.
The standard metadata 222 includes a title 222a, a document description date 222b, and a security rank 222c.
In the title 222a, the title of the document or the like stored in the corresponding search target file 43 is set as “work report”. Specifically, for example, the index 2 creation unit 12 constructs a display image when the file is printed, and sets a character string to be printed with a character size larger than other characters on the top of the first page as a title of a document or the like. Is set as the title 222a.
In the document description date 222b, the date when the document stored in the corresponding search target file 43 is described is set as “August 5, 2009”. Specifically, the index 2 creation unit 12 includes, for example, “year”, “month”, “day”, and “creation” among the character strings printed on the top of the first page when the file is printed. The character string that seems to be the creation date is regarded as the creation date of the document, and is set as the document description date 222b.

セキュリティランク２２２ｃには、対応する検索対象ファイル４３に記憶されている文書等の機密度が“極秘”、“秘密”のように設定される。具体的には、インデックス２作成部１２は、例えば、当該ファイルに印刷される文字列のうち、“取扱注意”、“複製禁止”等、機密にする必要があることを示していると思われる文字列を抽出し、抽出した文字列の内容、数等によって機密度を判定し、セキュリティランク２２２ｃに設定する。
なお、以上のほか、文書の保管期間等、検索対象ファイル４３を印刷した場合の表示イメージ等から判断可能な他の情報についても、標準メタデータ２２２のデータ項目とすることができる。
以上のようにして作成されたインデックス２レコード２２０（インデックス情報）は、前述したように、検索部１３により参照される。 In the security rank 222c, the confidentiality of the document or the like stored in the corresponding search target file 43 is set as “top secret” or “secret”. Specifically, it is considered that the index 2 creation unit 12 indicates that it is necessary to keep confidential information such as “handling precautions” and “duplication prohibited” in the character string printed in the file. The character string is extracted, the confidentiality is determined based on the content, number, etc. of the extracted character string, and set to the security rank 222c.
In addition to the above, other information that can be determined from a display image when the search target file 43 is printed, such as a document storage period, can also be used as data items of the standard metadata 222.
The index 2 record 220 (index information) created as described above is referred to by the search unit 13 as described above.

図６は、仮想分類定義ファイル２３のデータ構成の一例を示す。
仮想分類定義ファイル２３は、１以上の仮想分類定義レコード２３０から構成される。仮想分類定義レコード２３０は、仮想分類ＩＤ２３１、表示名２３２、条件２３３及び上位仮想分類ＩＤ２３４の各データ項目から構成される。 FIG. 6 shows an example of the data configuration of the virtual classification definition file 23.
The virtual classification definition file 23 is composed of one or more virtual classification definition records 230. The virtual classification definition record 230 includes data items of a virtual classification ID 231, a display name 232, a condition 233, and a higher virtual classification ID 234.

仮想分類ＩＤ２３１には、当該仮想分類定義レコード２３０を一意に識別可能な値が“1”、“2”、“3”のように設定される。
表示名２３２には、当該仮想分類の名称が“タイトル”、“作業報告”のように設定される。
条件２３３には、当該仮想分類による分類条件が“条件なし”、“タイトル２２２ａに「作業報告書」（という文字列）が含まれる”のように設定される。分類条件に“条件なし”が設定されている場合、当該仮想分類に分類されるインデックス２レコード２２０が存在しないことを意味し、分類条件に何らかの条件が設定されている場合、インデックス２レコード２２０のうち、当該条件を満足するレコードが当該仮想分類に分類されることを意味する。従って、１つのインデックス２レコード２２０が２以上の仮想分類に分類される場合もあるし、どの仮想分類にも分類されない場合もある。
上位仮想分類ＩＤ２３４には、当該仮想分類定義レコード２３０の上位の仮想分類定義レコード２３０を一意に識別可能な値が“0（上位なし）”、“1”、“2”、“3”のように設定される。 In the virtual classification ID 231, values that can uniquely identify the virtual classification definition record 230 are set as “1”, “2”, and “3”.
In the display name 232, the name of the virtual classification is set as “title”, “work report”, and the like.
The condition 233 is set such that the classification condition based on the virtual classification is “no condition” and “the title 222a includes“ work report ”(character string).” The classification condition is “no condition”. If it is set, it means that there is no index 2 record 220 classified in the virtual classification, and if any condition is set in the classification condition, a record that satisfies the condition in the index 2 record 220 Therefore, one index 2 record 220 may be classified into two or more virtual classifications, or may not be classified into any virtual classification.
In the upper virtual classification ID 234, values that can uniquely identify the upper virtual classification definition record 230 of the virtual classification definition record 230 are “0 (no upper)”, “1”, “2”, “3”, and the like. Set to

図１４と図１５を参照しながら後で説明するように、仮想分類定義レコード２３０は検索部１３によって参照され、表示名２３２や、当該表示名２３２に係わる条件２３３を満足するインデックス２レコード２２０のファイル名２２１ｂが、表示装置３３にツリー状に階層表示される。従って、１つのインデックス２レコード２２０がツリーの２箇所以上で表示される場合もあるし、ツリーのどこにも表示されない場合もある。 As will be described later with reference to FIGS. 14 and 15, the virtual classification definition record 230 is referred to by the search unit 13, and the display name 232 and the index 2 record 220 satisfying the condition 233 related to the display name 232 are stored. The file name 221b is hierarchically displayed on the display device 33 in a tree shape. Therefore, one index 2 record 220 may be displayed at two or more places in the tree, or may not be displayed anywhere in the tree.

図７は、関連定義ファイル２４のデータ構成の一例を示す。
関連定義ファイル２４は、１以上の関連定義レコード２４０から構成される。関連定義レコード２４０は、関連定義ＩＤ２４１、表示名２４２及び条件２４３の各データ項目から構成される。
関連定義ＩＤ２４１には、当該関連定義レコード２４０を一意に識別可能な値が“1”、“2”、“3”のように設定される。
表示名２４２には、当該関連定義の名称が“タイトル”のように設定される。
条件２４３には、当該仮想分類による分類条件が“タイトル２２２ａが当該検索結果と等しい”のように設定される。なお、条件２４３には、例えば、“当該検索ファイルのファイル名の前に「コピー〜」が追加されている”、“当該検索ファイルのファイル名の後ろに数字が追加されている”等、さまざまな条件を設定可能としてもよい。
関連定義ファイル２４を用いた関連検索については、図１７と図１８を参照しながら後で詳細に説明する。 FIG. 7 shows an example of the data structure of the related definition file 24.
The related definition file 24 is composed of one or more related definition records 240. The related definition record 240 includes data items of a related definition ID 241, a display name 242 and a condition 243.
In the related definition ID 241, values that can uniquely identify the related definition record 240 are set as “1”, “2”, and “3”.
In the display name 242, the name of the relevant definition is set as “title”.
In the condition 243, the classification condition based on the virtual classification is set such that “title 222a is equal to the search result”. The condition 243 includes various conditions such as “addition of“ copy ”before the file name of the search file”, “addition of a number after the file name of the search file”, etc. Various conditions may be settable.
The related search using the related definition file 24 will be described in detail later with reference to FIGS. 17 and 18.

図２０と図２１は、それぞれキーワード辞書ファイル２５とキーワード関連定義ファイル２６のデータ構成の一例を示す。
上述した仮想分類定義レコード２３０は、利用者が予め作成しておいてもよいが、メタデータ検索を行う際にキーワード辞書ファイル２５とキーワード関連定義ファイル２６を用いて動的に作成することが可能である。
キーワード辞書ファイル２５は、１以上のキーワード辞書レコード２５０から構成される。キーワード辞書レコード２５０は、キーワードＩＤ２５１、言語コード２５２及びキーワード２５３の各データ項目から構成される。
キーワードＩＤ２５１には、キーワードを一意に識別可能な値が“1”、“2”、“3”のように設定される。
言語コード２５２には、キーワード２５３の言語種別を表す３文字の言語コードが“jpn”や“eng”のように設定される。なお、言語コード“jpn”と“eng”は、それぞれ日本語と英語に対応する。
キーワード２５３には、当該キーワードが“仕様書”のように設定される。
キーワード辞書レコード２５０は、キーワードＩＤ２５１と言語コード２５２で一意に識別できる。また、キーワード辞書ファイル２５には、キーワード２５３が同一のキーワード辞書レコード２５０は設定できない。
例えば、キーワードＩＤ２５１、言語コード２５２及びキーワード２５３に、それぞれ“1”、“jpn”、“仕様書”が設定されたキーワード辞書レコード２５０と、それぞれ“1”、“eng”、“specifications”が設定されたキーワード辞書レコード２５０が２レコード存在する場合、この２つのレコードのキーワードは、言語は異なるが同一のキーワードということになる。 20 and 21 show examples of data structures of the keyword dictionary file 25 and the keyword related definition file 26, respectively.
The virtual classification definition record 230 described above may be created in advance by the user, but can be dynamically created using the keyword dictionary file 25 and the keyword related definition file 26 when performing a metadata search. It is.
The keyword dictionary file 25 is composed of one or more keyword dictionary records 250. The keyword dictionary record 250 includes data items of a keyword ID 251, a language code 252, and a keyword 253.
In the keyword ID 251, values that can uniquely identify the keyword are set as “1”, “2”, “3”.
In the language code 252, a three-letter language code representing the language type of the keyword 253 is set as “jpn” or “eng”. The language codes “jpn” and “eng” correspond to Japanese and English, respectively.
In the keyword 253, the keyword is set as “specification”.
The keyword dictionary record 250 can be uniquely identified by the keyword ID 251 and the language code 252. Further, a keyword dictionary record 250 having the same keyword 253 cannot be set in the keyword dictionary file 25.
For example, keyword dictionary record 250 in which “1”, “jpn”, and “specification” are set in keyword ID 251, language code 252, and keyword 253, and “1”, “eng”, and “specifications” are set, respectively. If there are two recorded keyword dictionary records 250, the keywords of these two records are the same keyword although the languages are different.

キーワード関連定義ファイル２６は、１以上のキーワード関連定義レコード２６０から構成される。キーワード関連定義レコード２６０は、キーワードＩＤ２６１、関連キーワードＩＤ２６２、最大キーワード長２６３及び言語条件２６４から構成される。
キーワードＩＤ２６１には、キーワード辞書ファイル２５に登録されているキーワード辞書レコード２５０のキーワードＩＤ２５１の値が設定される。
関連キーワードＩＤ２６２には、キーワードＩＤ２６１と同様にキーワード辞書ファイル２５に登録されているキーワード辞書レコード２５０のキーワードＩＤ２５１の値が１つ以上設定される。キーワード関連定義レコード２６０は、キーワードＩＤ２６１と値が同一であるキーワードＩＤ２５１のキーワード辞書レコード２５０に含まれるキーワード２５３に設定されているキーワードと、関連キーワードＩＤ２６２と値が同一であるキーワードＩＤ２５１のキーワード辞書レコード２５０に含まれるキーワード２５３に設定されているキーワードとが関連することを指定する。
なお、関連キーワードＩＤ２６２に２つ以上のキーワードＩＤを設定する場合、“1, 2”のように「, （カンマ）」で区切って設定する。 The keyword related definition file 26 includes one or more keyword related definition records 260. The keyword related definition record 260 includes a keyword ID 261, a related keyword ID 262, a maximum keyword length 263, and a language condition 264.
In the keyword ID 261, the value of the keyword ID 251 of the keyword dictionary record 250 registered in the keyword dictionary file 25 is set.
In the related keyword ID 262, one or more values of the keyword ID 251 of the keyword dictionary record 250 registered in the keyword dictionary file 25 are set as in the keyword ID 261. The keyword related definition record 260 is a keyword dictionary record having a keyword ID 251 whose value is the same as that of the keyword set in the keyword 253 included in the keyword dictionary record 250 of the keyword ID 251 having the same value as the keyword ID 261. It is designated that the keyword set in the keyword 253 included in 250 is related.
When two or more keyword IDs are set in the related keyword ID 262, they are set by being separated by “, (comma)” such as “1, 2”.

最大キーワード長２６３には、キーワード辞書ファイル２５から関連キーワードＩＤ２６２のキーワードを取得するときのキーワードの最大文字数の制限が“0（制限しない）”、“10”のように設定される。
言語条件２６４には、キーワード辞書ファイル２５から関連するキーワードを取得するときのキーワードの言語種別を検索キーワードの言語種別と同一に限定するかしないかの値が“0（同一言語のみ）”、“1（限定しない）”のように設定される。 In the maximum keyword length 263, the limit on the maximum number of characters of the keyword when the keyword of the related keyword ID 262 is acquired from the keyword dictionary file 25 is set to “0 (no limit)” or “10”.
In the language condition 264, the value of whether or not to limit the language type of the keyword when acquiring the related keyword from the keyword dictionary file 25 to the same as the language type of the search keyword is “0 (only the same language)”, “ 1 (not limited) ”is set.

例えば、キーワード辞書ファイル２５のキーワードＩＤ２５１、言語コード２５２及びキーワード２５３に、それぞれ以下のように設定された３つのキーワード辞書レコード２５０が存在するとする。
（１）“1”、“jpn”、“仕様書”
（２）“1”、“eng”、“specifications”
（３）“2”、“jpn”、“設計書”
そして、キーワード関連定義ファイル２６０のキーワードＩＤ２６１、関連キーワードＩＤ２６２、最大キーワード長２６３及び言語条件２６４に、それぞれ“2”、“1”、“3”、“0（同一言語のみ）”が設定されたキーワード関連定義レコード２６０が存在するとする。この場合、キーワード“設計書”にキーワード“仕様書”が関連していることになる。しかし、キーワード“specifications”は、キーワード関連定義レコード２６０の言語条件２６４が“0（同一言語のみ）”となっているため、関連するキーワードとみなさない。
なお、キーワード辞書ファイル２５とキーワード関連定義ファイル２６を用いた仮想分類定義レコード２３０の動的作成（仮想分類動的生成機能）については、図２２と図２３を参照しつつ後で説明する。
以上で第１の実施形態のファイル検索システムの構成・機能の説明を終了し、以下、各部の処理のフローチャートを参照しつつ、第１の実施形態のファイル検索システムの動作を説明する。 For example, it is assumed that the keyword ID 251, the language code 252, and the keyword 253 of the keyword dictionary file 25 have three keyword dictionary records 250 set as follows.
(1) “1”, “jpn”, “specifications”
(2) “1”, “eng”, “specifications”
(3) “2”, “jpn”, “design document”
Then, “2”, “1”, “3”, “0 (same language only)” are set in the keyword ID 261, the related keyword ID 262, the maximum keyword length 263, and the language condition 264 of the keyword related definition file 260, respectively. It is assumed that a keyword related definition record 260 exists. In this case, the keyword “specification” is related to the keyword “design document”. However, the keyword “specifications” is not regarded as a related keyword because the language condition 264 of the keyword related definition record 260 is “0 (only the same language)”.
The dynamic creation (virtual classification dynamic generation function) of the virtual classification definition record 230 using the keyword dictionary file 25 and the keyword-related definition file 26 will be described later with reference to FIGS. 22 and 23.
The description of the configuration and functions of the file search system according to the first embodiment has been completed, and the operation of the file search system according to the first embodiment will be described below with reference to the flowcharts of the processing of each unit.

＜各プログラムの動作＞
図８は、インデックス１作成部１１の動作の一例を示すフローチャートである。
インデックス１作成部１１は、処理を開始すると、検索対象ファイル４３のインデックス１レコード２１０を作成等する（Ｓ８０１）。
具体的には、例えば、ファイル管理部４１に、記憶装置２内の所定のファイルに予め記憶されているファイルパス（例えば“/etc/usr1/”）に含まれているファイルを読込、送信するように要求する。そして、ファイル管理部４１からファイルを受信すると、取得したファイルのファイルパス（例えば“/etc/usr1/作業報告１.doc”）がファイルパス２１２に設定されているインデックス１レコード２１０が存在するか判定する。そして、存在しない場合は、取得したファイルからキーワードを抽出し、当該ファイルのファイル名、ファイルパス、アクセス権限及び抽出したキーワードを、それぞれファイル名２１１、ファイルパス２１２、アクセス権限２１３及びキーワード２１４に設定して、インデックス１レコード２１０を追加する。一方、存在する場合は、当該インデックス１レコード２１０（以下「更新対象レコード」という。）のアクセス権限２１３及びキーワード２１４を更新する。 <Operation of each program>
FIG. 8 is a flowchart showing an example of the operation of the index 1 creation unit 11.
When the process is started, the index 1 creation unit 11 creates the index 1 record 210 of the search target file 43 (S801).
Specifically, for example, the file management unit 41 reads and transmits a file included in a file path (for example, “/ etc / usr1 /”) stored in advance in a predetermined file in the storage device 2. To request. When a file is received from the file management unit 41, is there an index 1 record 210 in which the file path of the acquired file (for example, “/ etc / usr1 / work report 1.doc”) is set in the file path 212? judge. If not, the keyword is extracted from the acquired file, and the file name, file path, access authority, and extracted keyword of the file are set in the file name 211, file path 212, access authority 213, and keyword 214, respectively. Then, an index 1 record 210 is added. On the other hand, if it exists, the access authority 213 and the keyword 214 of the index 1 record 210 (hereinafter referred to as “update target record”) are updated.

以上の処理を、予め設定されているファイルパス配下のファイル全てについて実行した後、新たに作成したインデックス１レコード２１０以外のレコードで、更新対象レコードにならなかったインデックス１レコード２１０が存在する場合、当該インデックス１レコード２１０を削除する。
なお、インデックス１レコード２１０を作成等する方法は、上述の方法に限られない。例えば、処理開始後、インデックス１ファイル２１を削除し、ファイル管理部４１に、予め設定されているファイルパス（例えば“/etc/usr1/”）に含まれているファイルを読込、送信するように要求し、受信したファイルについて、インデックス１レコード２１０を作成してもよい。 When the above processing is executed for all files under the preset file path, and there is an index 1 record 210 that has not become an update target record in records other than the newly created index 1 record 210, The index 1 record 210 is deleted.
The method for creating the index 1 record 210 is not limited to the method described above. For example, after the processing starts, the index 1 file 21 is deleted, and the file management unit 41 reads and transmits the file included in the preset file path (for example, “/ etc / usr1 /”). An index 1 record 210 may be created for the requested and received file.

以上で説明したように、第１の実施形態においては、インデックス１作成部１１が処理を行うたびに、処理時点の検索対象ファイル４３と１対１に対応するインデックス１レコード２１０が作成される。 As described above, in the first embodiment, each time the index 1 creation unit 11 performs processing, an index 1 record 210 corresponding to the search target file 43 at the time of processing is created.

図９は、インデックス２作成部１２の動作の一例を示すフローチャートである。
インデックス２作成部１２は、処理を開始すると、検索対象ファイル４３のインデックス２レコード２２０を作成等する（Ｓ９０１）。具体的な処理内容は、インデックス１作成部１１と同様なので、以下、異なる点のみを説明する。
まず、前述したように、インデックス２作成部１２の検索対象ファイル４３は、インデックス１作成部１１の検索対象ファイル４３と同一である必要はない。例えば、記憶装置４２に記憶されている全てのファイルを、インデックス１作成部１１の検索対象ファイル４３とし、記憶装置４２に記憶されているファイルの一部（例えばクライアント３の操作者が通常参照するファイルのみ）をインデックス２作成部１２の検索対象ファイル４３とすることができる。このようにすることで、後述する検索処理において、通常はインデックス２ファイル２２のみを検索することで検索結果として表示されるファイルの数を抑え、一方、必要に応じてインデックス１ファイル２１を検索することで、普段は参照しないファイルも検索結果として表示させることができる。 FIG. 9 is a flowchart illustrating an example of the operation of the index 2 creation unit 12.
When the process is started, the index 2 creation unit 12 creates the index 2 record 220 of the search target file 43 (S901). Since the specific processing content is the same as that of the index 1 creation unit 11, only different points will be described below.
First, as described above, the search target file 43 of the index 2 creation unit 12 is not necessarily the same as the search target file 43 of the index 1 creation unit 11. For example, all the files stored in the storage device 42 are set as the search target files 43 of the index 1 creation unit 11, and a part of the files stored in the storage device 42 (for example, an operator of the client 3 normally refers to it). File only) can be the search target file 43 of the index 2 creation unit 12. In this way, in the search processing described later, normally, only the index 2 file 22 is searched to reduce the number of files displayed as search results, while the index 1 file 21 is searched as necessary. As a result, files that are not normally referenced can be displayed as search results.

逆に、記憶装置４２に記憶されているファイルの一部（例えば論文、判決文等、語句が比較的厳密に使用されている文書ファイル）をインデックス１作成部１１の検索対象ファイル４３とし、記憶装置４２に記憶されている全てのファイルを、インデックス２作成部１２の検索対象ファイル４３とすることができる。
このようにすることで、定義をあいまいにしたまま使用されている語句等が、インデックス１レコード２１０のキーワード２１４に抽出される可能性が減少する。当該語句等が検索キーワードに一致して当該ファイルが検索結果として表示されても、実際には求めるファイルではない可能性が比較的高い。このため、インデックス１ファイル２１の容量を比較的小さく抑えることができるとともに、後述する検索処理において、キーワードによる全文検索を行った場合に、検索結果として求めるファイルが含まれる可能性を高めることができる。 Conversely, a part of the file stored in the storage device 42 (for example, a document file in which a phrase is used relatively strictly, such as a paper or a judgment sentence) is used as the search target file 43 of the index 1 creation unit 11 and stored. All the files stored in the device 42 can be set as the search target files 43 of the index 2 creation unit 12.
By doing so, the possibility that a phrase or the like that is used while the definition is ambiguous is extracted as the keyword 214 of the index 1 record 210 is reduced. Even if the word or the like matches the search keyword and the file is displayed as a search result, there is a relatively high possibility that the file is not actually the desired file. For this reason, the capacity of the index 1 file 21 can be kept relatively small, and when a full-text search using a keyword is performed in the search processing described later, the possibility of including a file to be obtained as a search result can be increased. .

また、以下のようにすることで、インデックス２レコード２２０の不要な更新を避けることもできる。
例えば、インデックス２レコード２２０のデータ項目として更新日時を設け、インデックス２レコード２２０を作成・更新する際には当該処理を行った年月日時分秒を設定する。そして、インデックス２作成部１２がインデックス２レコード２２０を更新しようとする際に、インデックス２レコード２２０の更新日時と検索対象ファイル４３の更新日時（一般にファイル管理部４１が、ファイルの属性情報の一つとして設定する）を比較して、検索対象ファイル４３の更新日時の方が新しければ、インデックス２レコード２２０を作成した後での当該検索対象ファイル４３の内容が変わっている可能性があるので、更新対象とする。
また、インデックス２レコード２２０のデータ項目としてアクセス権限を設けた場合には、インデックス２作成部１２がインデックス２レコード２２０を更新しようとする際に、インデックス２レコード２２０のアクセス権限と検索対象ファイル４３のアクセス権限を比較し、異なっていれば更新対象とすればよい。 In addition, unnecessary updating of the index 2 record 220 can be avoided by performing the following.
For example, an update date is provided as a data item of the index 2 record 220, and when the index 2 record 220 is created / updated, the year / month / day / hour / minute / second in which the process is performed is set. When the index 2 creation unit 12 tries to update the index 2 record 220, the update date and time of the index 2 record 220 and the update date and time of the search target file 43 (generally, the file management unit 41 is one of the attribute information of the file). If the update date / time of the search target file 43 is newer, the contents of the search target file 43 after the creation of the index 2 record 220 may have changed. Update target.
Further, when the access authority is provided as the data item of the index 2 record 220, when the index 2 creation unit 12 attempts to update the index 2 record 220, the access authority of the index 2 record 220 and the search target file 43 are updated. The access authority is compared, and if they are different, they can be updated.

さらに、インデックス２作成部１２は、インデックス２レコード２２０を更新する際、標準メタデータ２２２の設定内容が、前述したメタデータ変更プログラムを使用して直接変更されているか判定し、直接変更されている場合は、標準メタデータ２２２は更新しない。このようにするためには、例えば、標準メタデータ２２２のデータ項目として「直接変更有無」を設け、インデックス２レコード２２０を作成する際にはデータ項目「直接変更有無」に“直接変更なし”を設定し、メタデータ変更プログラムを使用して直接変更した場合に、データ項目「直接変更有無」に“直接変更有り”を設定するようにすればよい。なお、インデックス２作成部１２は、ユーザ定義メタデータ２２３を更新しない。 Furthermore, when updating the index 2 record 220, the index 2 creation unit 12 determines whether the setting content of the standard metadata 222 has been directly changed using the metadata change program described above, and has been changed directly. In this case, the standard metadata 222 is not updated. In order to do this, for example, “direct change presence / absence” is provided as the data item of the standard metadata 222, and when the index 2 record 220 is created, the data item “direct change presence / absence” is set to “no direct change”. When it is set and changed directly using the metadata change program, “directly changed” may be set in the data item “directly changed”. The index 2 creation unit 12 does not update the user-defined metadata 223.

標準メタデータ２２２のデータ項目として、さらに「直接変更後ファイル更新有無」を設けてもよい。インデックス２作成部１２がインデックス２レコード２２０を作成するとき、データ項目「直接変更後ファイル更新有無」に“更新なし”を設定する。そして、インデックス２作成部１２がインデックス２レコード２２０を更新するとき、当該インデックス２レコード２２０がメタデータ変更プログラムを使用して直接変更されているかどうかを判定し、直接変更されていて、かつ対応する検索対象ファイル４３の内容が変わっている場合、データ項目「直接変更後ファイル更新有無」に“更新有り”を設定する。
このようにすることで、クライアント３の操作者がメタデータ変更プログラムを使用して当該インデックス２レコード２２０を参照する際や、後述するようにファイル検索結果を表示する際に、標準メタデータ２２２の直接変更を行った後で、表示装置３３に表示されるファイルの内容が更新されていることを知らせることができ、クライアント３の操作者は、再度標準メタデータ２２２の直接変更を行う必要があるかどうか判断することができる。
以上で説明したように、第１の実施形態においては、インデックス２作成部１２が処理を行うたびに、処理時点の検索対象ファイル４３と１対１に対応するインデックス２レコード２２０が作成される。 As a data item of the standard metadata 222, “file update presence / absence after direct change” may be further provided. When the index 2 creation unit 12 creates the index 2 record 220, “no update” is set in the data item “whether or not the file is updated after direct change”. Then, when the index 2 creation unit 12 updates the index 2 record 220, it is determined whether or not the index 2 record 220 is directly changed using the metadata change program, and the index 2 record 220 is directly changed and corresponds. If the contents of the search target file 43 have changed, “updated” is set in the data item “presence / absence of file update after direct change”.
In this way, when the operator of the client 3 refers to the index 2 record 220 using the metadata change program or displays the file search result as described later, the standard metadata 222 is stored. After the direct change, it is possible to notify that the content of the file displayed on the display device 33 has been updated, and the operator of the client 3 needs to directly change the standard metadata 222 again. Can be determined.
As described above, in the first embodiment, each time the index 2 creation unit 12 performs processing, an index 2 record 220 corresponding to the search target file 43 at the time of processing is created.

図１０は、検索要求部３１の動作の一例を示すフローチャートである。
検索要求部３１は、クライアント３の操作者が入力装置３２を使用して起動する。
検索要求部３１は、起動されると、ログイン処理を行う（Ｓ１００１）。具体的には、表示装置３３に図１３のようなログイン画面を表示し、クライアント３の操作者が入力装置３２を使用して利用者ＩＤおよびパスワードを入力し、「実行」ボタンを押すと、入力された利用者ＩＤとパスワードが適切か否か、および検索対象ファイル４３に対する利用者ＩＤのアクセス権限を判定する。なお、上記のようなログイン処理自体は周知技術であるので、これ以上の説明は省略する。 FIG. 10 is a flowchart illustrating an example of the operation of the search request unit 31.
The search request unit 31 is activated by the operator of the client 3 using the input device 32.
When activated, the search request unit 31 performs a login process (S1001). Specifically, when the login screen as shown in FIG. 13 is displayed on the display device 33, the operator of the client 3 inputs the user ID and password using the input device 32, and presses the “execute” button. It is determined whether or not the input user ID and password are appropriate, and the user ID access authority for the search target file 43 is determined. In addition, since the above login process itself is a well-known technique, further description is abbreviate | omitted.

検索要求部３１は、入力された利用者ＩＤとパスワードが適切であると判定した場合、表示装置３３に検索要求画面を表示する（Ｓ１００２）。図１４に検索要求画面（Ｄ１４０１）の表示例を示す。図１４に示すように、検索要求画面は、仮想分類表示部（Ｄ１４０２）、物理フォルダ表示部（Ｄ１４０３）、検索条件部（Ｄ１４０４）、検索結果部（Ｄ１４０５）および「検索」、「メタデータ編集」、「終了」の各ボタンから構成されている。 When it is determined that the input user ID and password are appropriate, the search request unit 31 displays a search request screen on the display device 33 (S1002). FIG. 14 shows a display example of the search request screen (D1401). As shown in FIG. 14, the search request screen includes a virtual classification display unit (D1402), a physical folder display unit (D1403), a search condition unit (D1404), a search result unit (D1405), “search”, “metadata edit”. ”And“ End ”buttons.

検索要求部３１は、検索要求画面（Ｄ１４０１）を表示する際に、検索部１３の仮想分類機能を利用して、仮想分類表示部（Ｄ１４０２）を表示する。具体的には、検索要求部３１は検索部１３に、仮想分類の初期表示内容を送信するように要求し、検索部１３は、仮想分類定義レコード２３０のうち、上位仮想分類ＩＤ２３４に“0（上位なし）”が設定されているレコード（すなわち最上位の仮想分類定義レコード２３０）の表示名２３２を、検索要求部３１に送信し、検索要求部３１は、受信した表示名を仮想分類表示部（Ｄ１４０２）に表示する。また、検索要求部３１は、各表示名２３２の前に、「＋」を四角く囲んだ図形を表示する。後述するように、クライアント３の操作者は、この図形をマウスでクリックする等の操作を行うことで、当該仮想分類に含まれている他の仮想分類やファイルを表示することができる。 When displaying the search request screen (D1401), the search request unit 31 displays the virtual classification display unit (D1402) using the virtual classification function of the search unit 13. Specifically, the search request unit 31 requests the search unit 13 to transmit the initial display content of the virtual classification, and the search unit 13 sets “0 ( The display name 232 of the record in which “no higher order” is set (that is, the highest virtual classification definition record 230) is transmitted to the search request unit 31, and the search request unit 31 uses the received display name as the virtual classification display unit. (D1402). In addition, the search request unit 31 displays a figure in which “+” is squared before each display name 232. As will be described later, the operator of the client 3 can display other virtual classifications and files included in the virtual classification by performing an operation such as clicking the figure with a mouse.

例えば、仮想分類ＩＤ２３１、表示名２３２、条件２３３及び上位仮想分類ＩＤ２３４に、それぞれ“1”、“タイトル”、“条件なし”、“0（上位なし）”が設定された仮想分類定義レコード２３０と、それぞれ“2”、“作業報告”、“タイトル２２２ａに「作業報告書」が含まれる”、“1”が設定された仮想分類定義レコード２３０が存在するとする。この場合、仮想分類表示部（Ｄ１４０２）のタイトルの前に表示された「＋」を四角く囲んだ図形をクリックすると、図１５のＤ１４０２に示すように、あたかも“タイトル”という名前のフォルダ内に“作業報告”という名前のフォルダが含まれているかのようなツリー表示がなされる。
更に、タイトル２２２ａに”作業報告書“が含まれているインデックス２レコード２２０が４レコード存在し、それぞれのファイル名２２１ｂが”作業報告１.doc“、”作業報告２.doc“、”作業報告３.doc“及び”報告書.doc“だった場合、作業報告前に表示された「＋」を四角く囲んだ図形をクリックすると、図１５の仮想分類表示部（Ｄ１４０２）に示すように、“作業報告”という名前のフォルダ内に、”作業報告１.doc“、”作業報告２.doc“、”作業報告３.doc“及び”報告書.doc“の４つのファイルが含まれているかのようなツリー表示がなされる。 For example, a virtual classification definition record 230 in which “1”, “title”, “no condition”, and “0 (no upper)” are set in the virtual classification ID 231, the display name 232, the condition 233, and the upper virtual classification ID 234, respectively. , Suppose that there is a virtual classification definition record 230 in which “2”, “work report”, “title 222a contains“ work report ””, and “1” are set. When a figure surrounded by a square “+” displayed in front of the title of D1402) is clicked, a folder named “Work Report” appears in the folder named “Title” as shown in D1402 of FIG. A tree is displayed as if it were included.
Furthermore, there are four index 2 records 220 in which “work report” is included in the title 222a, and each file name 221b is “work report 1.doc”, “work report 2.doc”, “work report”. 3. In the case of “doc” and “report.doc”, when a figure surrounded by a square “+” displayed before the work report is clicked, as shown in the virtual classification display section (D1402) of FIG. Whether the folder named “Work Report” contains four files: “Work Report 1.doc”, “Work Report 2.doc”, “Work Report 3.doc”, and “Report.doc”. A tree display like this is made.

また、検索要求部３１は、検索要求画面（Ｄ１４０１）を表示する際に、検索部１３の物理階層作成機能を利用して、物理フォルダ表示部（Ｄ１４０３）を表示する。具体的には、検索要求部３１は検索部１３に、物理フォルダの初期表示内容を送信するように要求する。検索部１３は、インデックス２レコード２２０のシステムメタデータ２２１内のファイルパス２２１ｃを参照してフォルダのツリー状階層を作成して、そのツリー状階層を検索要求部３１に送信する。検索要求部３１は、受信したツリー状階層に含まれるフォルダの名称を物理フォルダ表示部（Ｄ１４０３）に表示する。
また、検索要求部３１は、各フォルダの名称の前に、「＋」を四角く囲んだ図形を表示する。クライアント３の操作者は、この図形をマウスでクリックする等の操作を行うことで、当該フォルダに含まれている他のフォルダやファイルを表示することができる。 Further, when displaying the search request screen (D1401), the search request unit 31 displays the physical folder display unit (D1403) using the physical layer creation function of the search unit 13. Specifically, the search request unit 31 requests the search unit 13 to transmit the initial display contents of the physical folder. The search unit 13 refers to the file path 221 c in the system metadata 221 of the index 2 record 220, creates a tree-like hierarchy of folders, and transmits the tree-like hierarchy to the search request unit 31. The search request unit 31 displays the name of the folder included in the received tree-like hierarchy on the physical folder display unit (D1403).
Further, the search request unit 31 displays a figure in which “+” is enclosed in a square before the name of each folder. The operator of the client 3 can display other folders and files included in the folder by performing an operation such as clicking the figure with a mouse.

なお、検索条件部（Ｄ１４０４）及び検索結果部（Ｄ１４０５）の表示内容は図１４に示すとおりであり、検索結果部（Ｄ１４０５）には検索結果は表示されていない。
クライアント３の操作者は入力装置３２を使用して検索条件部（Ｄ１４０４）の各項目を入力する。検索条件部（Ｄ１４０４）に入力した項目は検索条件となり、例えば、全文に“現場”、タイトルに“作業報告”を入力した場合、インデックス１レコード２１０のキーワード２１４に“現場”が設定されており、かつ、インデックス２レコード２２０のタイトル２２２ａに“作業報告”が設定されているファイルが検索され、検索結果部（Ｄ１４０５）に、検索結果がスクロール表示される。 The display contents of the search condition part (D1404) and the search result part (D1405) are as shown in FIG. 14, and no search result is displayed in the search result part (D1405).
The operator of the client 3 uses the input device 32 to input each item in the search condition part (D1404). The item entered in the search condition part (D1404) is a search condition. For example, when “site” is entered in the full text and “work report” is entered in the title, “site” is set in the keyword 214 of the index 1 record 210. In addition, a file in which “work report” is set in the title 222a of the index 2 record 220 is searched, and the search result is scroll-displayed in the search result section (D1405).

検索条件部（Ｄ１４０４）の各項目については、さまざまな入力方法を可能にすることで検索の利便性を向上させることができる。例えば、全文に“not現場”、“現場 and 作業”等の論理式入力を可能にすればよい。なお、検索条件部（Ｄ１４０４）の入力項目は、図示した項目に限る必要はなく、例えばアクセス権限、セキュリティランク等、インデックス１ファイル２１及びインデックス２ファイル２２のデータ項目に合せて決めればよい。
検索要求部３１は、検索要求画面表示（Ｓ１００２）後、検索ボタン、メタデータ編集ボタンまたは終了ボタンが押下されるのを待つ（Ｓ１００３、Ｓ１００４）。検索ボタンが押された場合、すなわち検索要求を検知した場合（Ｓ１００３：ＹＥＳ）、検索処理（Ｓ１００５、Ｓ１００６、Ｓ１００７）を行う。また、終了ボタンが押された場合、すなわち終了要求を検知した場合（Ｓ１００４：ＹＥＳ）、処理を終了する。 For each item of the search condition part (D1404), the convenience of search can be improved by enabling various input methods. For example, it may be possible to input logical expressions such as “not on-site” and “on-site and work” in the whole text. Note that the input items of the search condition part (D1404) are not limited to the illustrated items, and may be determined according to the data items of the index 1 file 21 and the index 2 file 22, such as access authority and security rank.
After the search request screen is displayed (S1002), the search request unit 31 waits for the search button, metadata edit button, or end button to be pressed (S1003, S1004). When the search button is pressed, that is, when a search request is detected (S1003: YES), search processing (S1005, S1006, S1007) is performed. When the end button is pressed, that is, when an end request is detected (S1004: YES), the process ends.

なお、図１０には示していないが、検索要求部３１は、メタデータ編集ボタンが押された場合、メタデータ編集処理を行う。具体的には、検索要求部３１は、クライアント３の操作者に編集対象のファイルを指定するように要求し、指定されたファイルの標準メタデータ２２２及びユーザ定義メタデータ２２３の、その時点での設定内容を表示する。そして、検索要求部３１は、クライアント３の操作者が入力した内容で、標準メタデータ２２２及びユーザ定義メタデータ２２３の設定内容を変更する。このとき、前述したように、標準メタデータ２２２の直接変更後ファイル更新有無に“更新有り”が設定されている場合、表示装置３３にその旨のメッセージを表示してもよい。 Although not shown in FIG. 10, the search request unit 31 performs metadata editing processing when the metadata editing button is pressed. Specifically, the search request unit 31 requests the operator of the client 3 to specify a file to be edited, and the standard metadata 222 and user-defined metadata 223 of the specified file at that time Display the settings. Then, the search request unit 31 changes the setting contents of the standard metadata 222 and the user-defined metadata 223 with the contents input by the operator of the client 3. At this time, as described above, when “updated” is set in the file update presence / absence after direct modification of the standard metadata 222, a message to that effect may be displayed on the display device 33.

検索要求部３１は、検索要求を検知した場合、検索条件部（Ｄ１４０４）の入力内容（検索条件）を、検索部１３に送信する（Ｓ１００５）。例えば、全文に“現場”、タイトルに“作業報告”が入力された場合、“全文=現場タイトル=作業報告”といった検索条件式を作成し、ログイン画面で入力された利用者ＩＤとともに、検索部１３に送信する。ここで、検索条件式は検索部１３によって解釈される式であり、検索部１３が解釈可能な構文規則等に従って作成すればよい。
検索要求部３１は、検索条件式を検索部１３に送信した後、検索部１３から検索結果を受信するまで待つ（Ｓ１００６）。検索結果を受信する（Ｓ１００６：ＹＥＳ）と、検索結果を検索要求画面の検索結果部（Ｄ１４０５）に表示し（Ｓ１００７）、再び、検索ボタン等が押下されるのを待つ（Ｓ１００３、Ｓ１００４）。 When the search request unit 31 detects the search request, the search request unit 31 transmits the input content (search condition) of the search condition unit (D1404) to the search unit 13 (S1005). For example, if “on-site” is entered for the full text and “work report” is entered for the title, a search condition expression such as “full text = site title = work report” is created, and the user ID entered on the login screen and the search section 13 to send. Here, the search condition expression is an expression interpreted by the search unit 13 and may be created according to a syntax rule or the like that can be interpreted by the search unit 13.
After transmitting the search condition formula to the search unit 13, the search request unit 31 waits until a search result is received from the search unit 13 (S1006). When the search result is received (S1006: YES), the search result is displayed on the search result part (D1405) of the search request screen (S1007), and it waits for the search button or the like to be pressed again (S1003, S1004).

図１１は、検索部１３の動作の一例を示すフローチャートである。
ファイル検索サーバ１がクライアント３からの検索要求を受信すると、検索部１３が起動される。
検索部１３は、まず、検索要求に含まれる検索条件式を解析し、メタデータ検索を行うか、すなわち、インデックス２ファイル２２のシステムメタデータ２２１等を参照して検索する必要があるか判定する（Ｓ１１０１）。例えば、検索条件式が“全文=現場タイトル=作業報告”である場合、標準メタデータ２２２のタイトル２２２ａを参照して検索する必要があると判定する。また、例えば、検索条件式が“全文=現場”である場合、メタデータ検索を行なわず、全文検索のみ行うと判定する。
検索部１３は、メタデータ検索を行うと判定した場合（Ｓ１１０１：ＹＥＳ）、インデックス２ファイル２２による検索を行う（Ｓ１１０２）。具体的には、検索条件式から、システムメタデータ２２１等に関する条件を抽出し、条件に合致するインデックス２レコード２２０を選択する（以下「メタデータ合致レコード」という。）。例えば、検索条件式が“全文=現場タイトル=作業報告”である場合、標準メタデータ２２２のタイトル２２２ａに“作業報告”が設定されているインデックス２レコード２２０を選択する。 FIG. 11 is a flowchart illustrating an example of the operation of the search unit 13.
When the file search server 1 receives a search request from the client 3, the search unit 13 is activated.
The search unit 13 first analyzes the search condition expression included in the search request and determines whether or not to perform metadata search, that is, whether or not it is necessary to search by referring to the system metadata 221 or the like of the index 2 file 22. (S1101). For example, if the search condition expression is “full text = site title = work report”, it is determined that the search needs to be performed with reference to the title 222 a of the standard metadata 222. For example, when the search condition formula is “full text = on-site”, it is determined that only the full text search is performed without performing the metadata search.
When it is determined that the metadata search is to be performed (S1101: YES), the search unit 13 performs a search using the index 2 file 22 (S1102). Specifically, a condition related to the system metadata 221 and the like is extracted from the search condition formula, and an index 2 record 220 that matches the condition is selected (hereinafter referred to as “metadata matching record”). For example, when the search condition formula is “full text = site title = work report”, the index 2 record 220 in which “work report” is set in the title 222 a of the standard metadata 222 is selected.

検索部１３は、メタデータ検索（Ｓ１１０２）を行った後、又はメタデータ検索を行わないと判定した場合（Ｓ１１０１：ＮＯ）、全文検索を行うか、すなわち、インデックス１ファイル２１のキーワード２１４を参照して検索する必要があるか判定する（Ｓ１１０３）。例えば、検索条件式が“全文=現場タイトル=作業報告”または“全文=現場”である場合、キーワード２１４を参照して検索する必要があると判定する。
検索部１３は、全文検索を行うと判定した場合（Ｓ１１０３：ＹＥＳ）、インデックス１ファイル２１による全文検索を行う（Ｓ１１０４）。具体的には、検索部１３は、検索条件式から、全文検索の条件を抽出し、メタデータ検索（Ｓ１１０２）を行った場合には、メタデータ合致レコードのファイルパス２２１ｃ、及び検索要求部３１から受信した利用者ＩＤとともに、インデックス１検索部１４に渡す。また、検索部１３は、メタデータ検索を行なわなかった場合には、インデックス２ファイル２２の全てのインデックス２レコード２２０をメタデータ合致レコードとして選択し、当該レコードのファイルパス２２１ｃを、全文検索の条件及び検索要求部３１から受信した利用者ＩＤとともに、インデックス１検索部１４に渡す。
インデックス１検索部１４は、後述するように、受け取った全文検索の条件等を参照して検索を行い、最終的に検索結果とすべきインデックス２レコード２２０（以下「キーワード合致レコード」という。）のファイルパス２２１ｃを検索部１３に渡す。 After performing the metadata search (S1102) or when determining that the metadata search is not performed (S1101: NO), the search unit 13 performs a full-text search, that is, refers to the keyword 214 of the index 1 file 21. It is then determined whether it is necessary to search (S1103). For example, if the search condition formula is “full text = site title = work report” or “full text = site”, it is determined that the search is necessary with reference to the keyword 214.
When it is determined that the full text search is performed (S1103: YES), the search unit 13 performs a full text search using the index 1 file 21 (S1104). Specifically, when the search unit 13 extracts a full-text search condition from the search condition formula and performs a metadata search (S1102), the file path 221c of the metadata matching record and the search request unit 31 are searched. Together with the user ID received from the index 1 search unit 14. When the metadata search is not performed, the search unit 13 selects all the index 2 records 220 of the index 2 file 22 as metadata matching records, and sets the file path 221c of the record as a full-text search condition. And the user ID received from the search request unit 31 is passed to the index 1 search unit 14.
As will be described later, the index 1 search unit 14 performs a search by referring to the received full-text search conditions and the like, and finally searches for an index 2 record 220 (hereinafter referred to as “keyword match record”) that should be the search result. The file path 221c is passed to the search unit 13.

検索部１３は、全文検索を行わないと判定した場合（Ｓ１１０３：ＮＯ）は、メタデータ合致レコードを検索要求部３１への送信対象とし、一方、全文検索を実行（Ｓ１１０４）した場合は、キーワード合致レコードを検索要求部３１への送信対象とする。検索部１３は、送信対象としたインデックス２レコード２２０の各データ項目を検索要求部３１に送信する（Ｓ１１０５）。
検索部１３は、送信後、処理を終了する。 When it is determined that the full text search is not performed (S1103: NO), the search unit 13 sets the metadata matching record as a transmission target to the search request unit 31. On the other hand, when the full text search is executed (S1104), the keyword The matching record is a transmission target to the search request unit 31. The search unit 13 transmits each data item of the index 2 record 220 to be transmitted to the search request unit 31 (S1105).
The search part 13 complete | finishes a process after transmission.

図１２は、インデックス１検索部１４の動作の一例を示すフローチャートである。
インデックス１検索部１４は、例えば、各メタデータ合致レコードに対応するインデックス１レコード２１０を検索する（Ｓ１２０１）。具体的には、インデックス１検索部１４は、検索部１３から受け取ったメタデータ合致レコードのファイルパス２２１ｃ全てにつき、ファイルパス２１２に同一値が設定されているインデックス１レコード２１０を参照し、当該レコードのアクセス権限２１３により、検索部１３から受信した利用者ＩＤがアクセス権限を有するか判定する。そして、アクセス権限を有すると判定した場合、当該レコードのキーワード２１４が、検索部１３から受信した全文検索の条件を満足するか判定する。
インデックス１検索部１４は、全文検索の条件を満足すると判定されたファイルパス２２１ｃを検索部１３に渡し（Ｓ１２０２）、処理を終了する。 FIG. 12 is a flowchart showing an example of the operation of the index 1 search unit 14.
For example, the index 1 search unit 14 searches the index 1 record 210 corresponding to each metadata matching record (S1201). Specifically, the index 1 search unit 14 refers to the index 1 record 210 in which the same value is set in the file path 212 for all the file paths 221c of the metadata matching records received from the search unit 13, and the record The access authority 213 determines whether the user ID received from the search unit 13 has access authority. If it is determined that the user has access authority, it is determined whether the keyword 214 of the record satisfies the full-text search condition received from the search unit 13.
The index 1 search unit 14 passes the file path 221c determined to satisfy the full-text search condition to the search unit 13 (S1202), and ends the process.

ところで、インデックス１検索部１４を実現するプログラムとして、既に存在するさまざまな全文検索プログラムを使用するような場合、インデックス１作成部１１も当該インデックス１検索部１４に対応したプログラムを使用することになる。その場合、一般的には、インデックス１ファイル２１に係わる検索対象ファイル４３とインデックス２ファイル２２に係わる検索対象ファイル４３が異なり、インデックス２ファイル２２とは関係なく全文検索の条件によりインデックス１ファイル２１が検索されることになる。そうすると、例えば、インデックス２ファイル２２に係わる検索対象ファイル４３として、クライアント３の操作者が普段よく使用するファイルを設定した場合も、クライアント３の操作者が全文検索のみを要求すると、インデックス１ファイル２１が検索されてしまうので、普段よく使用しないファイルも検索結果として表示されることになる。 By the way, when using various existing full-text search programs as a program for realizing the index 1 search unit 14, the index 1 creation unit 11 also uses a program corresponding to the index 1 search unit 14. . In this case, generally, the search target file 43 related to the index 1 file 21 and the search target file 43 related to the index 2 file 22 are different, and the index 1 file 21 depends on the full-text search conditions regardless of the index 2 file 22. Will be searched. Then, for example, even if the operator of the client 3 sets a file that is frequently used as the search target file 43 related to the index 2 file 22, if the operator of the client 3 requests only a full-text search, the index 1 file 21 Will be searched, so files that are not normally used will be displayed as search results.

このように検索されるのが好ましい場合もあるが、好ましくない場合もある。そこで、全文検索のみを行う場合に、インデックス２ファイル２２が作成された検索対象ファイル４３のみを検索対象とするか（すなわち、メタデータ作成済みファイルのみを検索対象とするか）、それとも、インデックス２ファイル２２に係わらずインデックス１ファイル２１の検索対象ファイル４３全てを検索対象とするか（すなわち、メタデータ未作成のファイルも検索対象とするか）を、検索要求画面で指定できるようにしてもよい。 It may be preferable to search in this way, but it may not be preferable. Therefore, when only full-text search is performed, only the search target file 43 in which the index 2 file 22 is created is set as the search target (that is, only the file for which metadata has been created is set as the search target), or the index 2 Whether to search all the search target files 43 of the index 1 file 21 regardless of the file 22 (that is, whether to search for files for which metadata is not created) may be specified on the search request screen. .

このようにした場合、メタデータ未作成のファイルも検索対象とするように指定された場合は、インデックス１検索部１４はインデックス２ファイル２２とは関係なく全文検索の条件によりインデックス１ファイル２１を検索する。一方、メタデータ作成済みファイルのみを検索対象とするように指定された場合は、検索部１３は、メタデータ検索が要求されていない場合（図１１のＳ１１０１でＮＯの場合）も、インデックス２ファイル２２の全てのインデックス２レコード２２０をメタデータ合致レコードとして選択し、当該レコードのファイルパス２２１ｃを、全文検索の条件及び検索要求部３１から受信した利用者ＩＤとともに、インデックス１検索部１４に渡す。 In this case, when it is specified that a file for which metadata has not been created is also set as a search target, the index 1 search unit 14 searches the index 1 file 21 according to the full-text search condition regardless of the index 2 file 22. To do. On the other hand, when it is specified that only the metadata created file is to be searched, the search unit 13 also searches for the index 2 file even when the metadata search is not requested (NO in S1101 of FIG. 11). All 22 index 2 records 220 are selected as metadata matching records, and the file path 221c of the record is passed to the index 1 search unit 14 together with the full text search condition and the user ID received from the search request unit 31.

＜仮想分類動的生成機能についての補足説明＞
図２２は、仮想分類動的生成における検索部１３等の動作の一例を示すフローチャートである。
検索部１３は、まず、検索要求に含まれる検索条件式を解析し、メタデータ検索かどうかを判定する（Ｓ１３０１）。
検索部１３は、メタデータ検索を行うと判定した場合（Ｓ１３０１：ＹＥＳ）、キーワード辞書ファイル２５の検索を行う（Ｓ１３０２）。具体的には、メタデータ検索のキーワードがキーワード辞書ファイル２５のキーワード２５３に設定されているキーワード辞書レコード２５０を選択する（以下「キーワード辞書合致レコード」という。）。例えば、検索条件式が“タイトル＝設計書”である場合、キーワード辞書ファイル２５のキーワード２５３に“設計書”が設定されているキーワード辞書レコード２５０を選択する。
検索部１３は、メタデータ検索キーワードがキーワード辞書ファイル２５に存在する場合（Ｓ１３０３：ＹＥＳ）、キーワード辞書合致レコードのキーワードＩＤ２５１で、キーワード関連定義ファイル２６の検索を行う（Ｓ１３０４）。具体的には、キーワード辞書合致レコードのキーワードＩＤ２５１が“1”の場合、キーワード関連定義ファイル２６のキーワードＩＤ２６１に“1”が設定されているキーワード関連定義レコード２６０を選択する（以下「キーワード関連合致レコード」という。）。 <Supplementary explanation about the virtual classification dynamic generation function>
FIG. 22 is a flowchart illustrating an example of the operation of the search unit 13 and the like in the virtual classification dynamic generation.
First, the search unit 13 analyzes a search condition expression included in the search request and determines whether or not the metadata search is performed (S1301).
When it is determined that the metadata search is performed (S1301: YES), the search unit 13 searches the keyword dictionary file 25 (S1302). Specifically, the keyword dictionary record 250 in which the keyword for metadata search is set in the keyword 253 of the keyword dictionary file 25 is selected (hereinafter referred to as “keyword dictionary matching record”). For example, when the search condition expression is “title = design document”, the keyword dictionary record 250 in which “design document” is set in the keyword 253 of the keyword dictionary file 25 is selected.
When the metadata search keyword exists in the keyword dictionary file 25 (S1303: YES), the search unit 13 searches the keyword related definition file 26 with the keyword ID 251 of the keyword dictionary matching record (S1304). Specifically, when the keyword ID 251 of the keyword dictionary match record is “1”, the keyword related definition record 260 in which “1” is set in the keyword ID 261 of the keyword related definition file 26 is selected (hereinafter referred to as “keyword related match”). Record ").

検索部１３は、キーワード辞書合致レコードのキーワードＩＤ２５１がキーワード関連定義ファイル２６に存在する場合（Ｓ１３０５：ＹＥＳ）、キーワード関連合致レコードの関連キーワードＩＤ２６２に設定されているキーワードＩＤの数分、関連キーワードＩＤ２６２に設定されているキーワードＩＤでキーワード辞書ファイル２５を検索する（Ｓ１３０６）。具体的には、キーワード関連合致レコードの関連キーワードＩＤ２６２に“2, 3, 4”が設定されている場合、キーワード辞書ファイル２５のキーワードＩＤ２５１に“2”または“3”または“4”が設定されているキーワード辞書レコード２５０を選択する（以下「関連キーワード辞書合致レコード」という。）。
さらに、図示していないが、キーワード関連合致レコードの最大キーワード長２６３に設定されている値が“1”以上の場合、関連キーワード辞書合致レコードを関連キーワード辞書合致レコードのキーワードの長さが最大キーワード長２６３の値以下のレコードのみに絞り込む。
さらに、図示していないが、キーワード関連合致レコードの言語条件２６４が“0（同一言語のみ）”に設定されている場合、関連キーワード辞書合致レコードを関連キーワード辞書合致レコードの言語コード２５２とキーワード辞書合致レコードの言語コード２５２が同一のレコードのみに絞り込む。 When the keyword ID 251 of the keyword dictionary matching record exists in the keyword related definition file 26 (S1305: YES), the search unit 13 sets the related keyword ID 262 by the number of keyword IDs set in the related keyword ID 262 of the keyword related matching record. The keyword dictionary file 25 is searched with the keyword ID set to (S1306). Specifically, when “2, 3, 4” is set in the related keyword ID 262 of the keyword related matching record, “2”, “3”, or “4” is set in the keyword ID 251 of the keyword dictionary file 25. The selected keyword dictionary record 250 is selected (hereinafter referred to as “related keyword dictionary matching record”).
Further, although not shown, when the value set in the maximum keyword length 263 of the keyword related match record is “1” or more, the related keyword dictionary match record is the keyword whose maximum keyword length is in the related keyword dictionary match record. Narrow down to records that are less than or equal to the length 263 value.
Further, although not shown, when the language condition 264 of the keyword related match record is set to “0 (only the same language)”, the related keyword dictionary match record is set to the language code 252 of the related keyword dictionary match record and the keyword dictionary. Only records with the same language code 252 of matching records are narrowed down.

検索部１３は、メタデータ検索キーワードと、上述した方法により選択されて絞り込まれたキーワード辞書レコード２５０から仮想分類定義レコードを作成する（Ｓ１３０７）。
具体的には、検索部１３は、メタデータ検索キーワードを仮想分類定義レコード２３０の表示名２３２に設定して、最上位の仮想分類定義レコード２３０を作成する。例えば、メタデータ検索条件が“タイトル＝設計書”である場合、仮想分類ＩＤ２３１、表示名２３２、条件２３３及び上位仮想分類ＩＤ２３４に、それぞれ“1”、“設計書”、“条件なし”、“0（上位なし）”を設定し仮想分類定義レコード２３０を作成する（以下「動的生成ルート仮想分類レコード」という。）。
検索部１３は、次に、関連キーワード辞書合致レコードから動的生成ルート仮想分類レコードの下位の仮想分類定義レコード２３０を作成する。例えば、メタデータ検索条件が“タイトル＝設計書”で、メタデータ検索キーワード“設計書”に関連するキーワードが“仕様書”と“定義書”である場合、仮想分類ＩＤ２３１、表示名２３２、条件２３３及び上位仮想分類ＩＤ２３４に、それぞれ“2”、“仕様書”、“タイトル＝仕様書”、“1”を設定した仮想分類定義レコード２３０と、それぞれ“3”、“定義書”、“タイトル＝定義書”、“1”を設定した仮想分類定義レコード２３０を作成する（以下「動的生成仮想分類レコード」という。）。このとき、条件２３３は、「メタデータ検索条件の項目（“タイトル”）＝関連するキーワード（“仕様書”もしくは“定義書”）」という条件を作成し設定する。また、上位仮想分類ＩＤ２３４には、動的生成ルート仮想分類レコードの仮想分類ＩＤ２３１の値“1”を設定する。 The search unit 13 creates a virtual classification definition record from the metadata search keyword and the keyword dictionary record 250 selected and narrowed down by the above-described method (S1307).
Specifically, the search unit 13 sets the metadata search keyword to the display name 232 of the virtual classification definition record 230 and creates the highest virtual classification definition record 230. For example, when the metadata search condition is “title = design document”, the virtual classification ID 231, the display name 232, the condition 233, and the upper virtual classification ID 234 are “1”, “design document”, “no condition”, “ 0 (no higher rank) ”is set to create a virtual classification definition record 230 (hereinafter referred to as“ dynamically generated route virtual classification record ”).
Next, the search unit 13 creates a virtual classification definition record 230 subordinate to the dynamically generated root virtual classification record from the related keyword dictionary match record. For example, when the metadata search condition is “title = design document” and the keywords related to the metadata search keyword “design document” are “specification document” and “definition document”, the virtual classification ID 231, display name 232, condition 233 and upper virtual classification ID 234 are set to “2”, “specification”, “title = specification”, “1”, respectively, and “3”, “definition”, “title” respectively. = Definition document ”,“ 1 ”is set, and a virtual classification definition record 230 is created (hereinafter referred to as“ dynamically generated virtual classification record ”). At this time, the condition 233 creates and sets a condition “metadata search condition item (“ title ”) = related keyword (“ specification ”or“ definition ”)”. Further, the value “1” of the virtual classification ID 231 of the dynamically generated root virtual classification record is set as the upper virtual classification ID 234.

検索部１３は、動的生成ルート仮想分類レコードと動的生成仮想分類レコードとに基づいて仮想分類定義を作成する。仮想分類定義の作成が終了すると、検索部１３は、作成した仮想分類定義を検索要求部３１に送信する（Ｓ１３０８）。なお、検索部１３は、動的生成ルート仮想分類レコードと動的生成仮想分類レコードを含む仮想分類定義ファイルを記憶装置２に記憶させず、ファイル検索サーバ１のメモリに記憶するのみとしてもよい。
検索要求部３１は、受信した仮想分類定義を、仮想分類機能と同様に図２３の仮想分類表示部（Ｄ１４０２）にツリー表示する。 The search unit 13 creates a virtual classification definition based on the dynamic generation root virtual classification record and the dynamic generation virtual classification record. When the creation of the virtual classification definition is completed, the search unit 13 transmits the created virtual classification definition to the search request unit 31 (S1308). Note that the search unit 13 may store only the virtual classification definition file including the dynamic generation root virtual classification record and the dynamic generation virtual classification record in the memory of the file search server 1 without storing the virtual classification definition file in the storage device 2.
The search request unit 31 displays the received virtual classification definition as a tree on the virtual classification display unit (D1402) of FIG. 23, as with the virtual classification function.

＜メタデータ検索と全文検索の両方を行う場合の処理についての補足説明＞
第１の実施形態のファイル検索システムでは、上述したとおり、クライアント３の操作者が全文検索を要求した場合にのみ、インデックス１ファイル２１を利用して検索を行う。ところで、一般に全文検索を行わない場合に比べて、全文検索に要する処理時間は長くなるので、クライアント３の操作者が検索要求してから検索結果が表示されるまでの待ち時間が長くなる。そこで、クライアント３の操作者が思いがけずに長時間待たされることを防止することが望ましい。
以下、メタデータ検索と全文検索の両方を行う場合（以下「複合検索」という。）において、上記の対策を行う場合の検索部１３等の動作について説明する。 <Supplementary explanation about processing when performing both metadata search and full-text search>
In the file search system of the first embodiment, as described above, the search is performed using the index 1 file 21 only when the operator of the client 3 requests a full text search. By the way, since the processing time required for full-text search is generally longer than when full-text search is not performed, the waiting time until the search result is displayed after the operator of the client 3 makes a search request is increased. Therefore, it is desirable to prevent the operator of the client 3 from waiting for a long time unexpectedly.
Hereinafter, when both the metadata search and the full-text search are performed (hereinafter referred to as “composite search”), the operation of the search unit 13 and the like when performing the above countermeasure will be described.

図１６は、複合検索における検索部１３等の動作の一例を示すフローチャートである。
図１６のＳ１６５０〜Ｓ１６５５は、複合検索時に、図１１のＳ１１０４〜Ｓ１１０５において検索部１３が行う処理の詳細を示しており、Ｓ１６０１〜Ｓ１６０７は、当該処理に対応して検索要求部３１が図１０のＳ１００６〜Ｓ１００７において行う処理の詳細を示している。 FIG. 16 is a flowchart illustrating an example of the operation of the search unit 13 and the like in the complex search.
S1650 to S1655 in FIG. 16 show details of processing performed by the search unit 13 in S1104 to S1105 in FIG. 11 during complex search. In S1601 to S1607, the search request unit 31 corresponds to the processing shown in FIG. The details of the processing performed in S1006 to S1007 are shown.

検索部１３は、検索結果件数、すなわちメタデータ検索で検索されたメタデータ合致レコードのレコード数を、予め定めた所定件数（以下「最大検索件数」という。）と比較する（Ｓ１６５０）。
そして、メタデータ合致レコードのレコード数、すなわち全文検索対象数が最大検索件数より大きい場合（Ｓ１６５０：ＹＥＳ）には、検索要求部３１に、処理続行有無を確認するメッセージを送信し（Ｓ１６５１）、検索要求部３１から検索処理を続行するか否かの確認結果を受信するまで待つ（Ｓ１６５２）。 The search unit 13 compares the number of search results, that is, the number of records of metadata matching records searched in the metadata search, with a predetermined number (hereinafter referred to as “maximum search number”) (S1650).
When the number of metadata matching records, that is, the number of full-text search targets is larger than the maximum number of searches (S1650: YES), a message for confirming whether or not processing is continued is transmitted to the search request unit 31 (S1651). It waits until receiving a confirmation result as to whether or not to continue the search process from the search request unit 31 (S1652).

検索要求部３１は、検索部１３から検索処理を続行するか否かを確認するメッセージを受信すると、当該メッセージを表示装置３３に表示し、クライアント３の操作者に検索処理を続行するか否かの応答を要求する（Ｓ１６０１）。具体的には、例えば、ポップアップ画面に、確認メッセージと、「検索続行」及び「中止」のボタンを表示し、いずれかのボタンがクリックされるまで待てばよい。
検索要求部３１は、クライアント３の操作者が「中止」のボタンをクリックする等により検索中止を指示した場合（Ｓ１６０２：ＮＯ）、検索部１３に「検索中止」指示を送信して、処理を終了する（Ｓ１６０３）。従って、検索要求部３１は、検索結果を表示することなく、再び、検索要求画面で検索ボタン等が押下されるのを待つ（図１０のＳ１００３、Ｓ１００４）。
検索要求部３１は、クライアント３の操作者が「検索続行」のボタンをクリックする等により検索続行を指示した場合（Ｓ１６０２でＹＥＳの場合）、検索部１３に「検索続行」指示を送信し、図には明示していないが、検索部１３から検索結果を受信するまで待つ。 When receiving a message confirming whether or not to continue the search process from the search unit 13, the search request unit 31 displays the message on the display device 33 and determines whether or not to continue the search process to the operator of the client 3. Is requested (S1601). Specifically, for example, a confirmation message and buttons “Continue Search” and “Cancel” may be displayed on a pop-up screen, and wait until any button is clicked.
When the operator of the client 3 gives an instruction to cancel the search by clicking the “Cancel” button or the like (S1602: NO), the search request unit 31 transmits an instruction “Cancel Search” to the search unit 13 to perform the process. The process ends (S1603). Accordingly, the search request unit 31 waits for the search button or the like to be pressed again on the search request screen without displaying the search result (S1003 and S1004 in FIG. 10).
When the operator of the client 3 instructs to continue the search by clicking the “Continue Search” button or the like (YES in S1602), the search request unit 31 transmits a “Continue Search” instruction to the search unit 13, Although not clearly shown in the figure, it waits until a search result is received from the search unit 13.

検索部１３は、検索要求部３１から「検索中止」指示又は「検索続行」指示を受信すると、受信した指示によって処理を変える（Ｓ１６５２）。すなわち、「検索中止」指示を受信した場合は（Ｓ１６５２：ＮＯ）処理を終了する。一方、「検索続行」指示を受信した場合は（Ｓ１６５２：ＹＥＳ）、メタデータ合致レコードのうち最大検索件数分のレコードのファイルパス２２１ｃ、全文検索の条件及び検索部３１から受信した利用者ＩＤをインデックス１検索部１４に渡して、全文検索を実行させる（Ｓ１６５３）。
なお、全文検索対象数が最大検索件数以下の場合（Ｓ１６５０：ＮＯ）には、検索要求部３１に、処理続行有無を確認するメッセージを送信することなく、メタデータ合致レコードのファイルパス２２１ｃ、全文検索の条件及び検索要求部３１から受信した利用者ＩＤをインデックス１検索部１４に送信して、全文検索を実行させる（Ｓ１６５３）。 When the search unit 13 receives a “search stop” instruction or a “continue search” instruction from the search request unit 31, the search unit 13 changes processing according to the received instruction (S1652). That is, when the “search stop” instruction is received (S1652: NO), the process is terminated. On the other hand, when the “Continue Search” instruction is received (S1652: YES), the file path 221c of the record corresponding to the maximum number of searches among the metadata matching records, the full-text search condition, and the user ID received from the search unit 31 are displayed. The full text search is executed by passing it to the index 1 search unit 14 (S1653).
If the number of full-text search targets is equal to or less than the maximum number of searches (S1650: NO), the file path 221c of the metadata matching record, the full-text is sent without sending a message to confirm whether or not processing is continued to the search request unit 31. The search condition and the user ID received from the search request unit 31 are transmitted to the index 1 search unit 14 to execute a full text search (S1653).

全文検索が終了すると、検索部１３は、キーワード合致レコードの各データ項目を検索要求部３１に送信する（Ｓ１６５４）。このとき、全てのメタデータ合致レコードについて全文検索を実行したか、それとも全文検索未実行のメタデータ合致レコードが残っているかの識別も送信する。
検索部１３は、次に、検索結果部（Ｄ１４０５）に追加表示が可能かどうかを判定する（Ｓ１６５５）。具体的には、メタデータ合致レコードに全文検索未実行のレコードが残っており、かつ検索要求部３１に送信した検索結果の累計値が、検索要求画面（Ｄ１４０１）の検索結果部（Ｄ１４０５）に表示可能な件数として予め定めた件数（以下「最大表示件数」という。）より小さい場合（Ｓ１６５５：ＹＥＳ）には、再び、検索要求部３１から検索処理を続行するか否かの確認結果を受信するまで待つ（Ｓ１６５２）。一方、検索結果部（Ｄ１４０５）に追加表示できない場合（Ｓ１６５５：ＮＯ）には、検索部１３は処理を終了する。 When the full-text search is completed, the search unit 13 transmits each data item of the keyword match record to the search request unit 31 (S1654). At this time, the identification of whether or not the full-text search has been executed for all the metadata matching records or the metadata matching records for which the full-text search has not been executed remains.
Next, the search unit 13 determines whether additional display is possible in the search result unit (D1405) (S1655). More specifically, a record that has not been subjected to full-text search remains in the metadata matching record, and the cumulative value of the search results transmitted to the search request unit 31 is displayed in the search result portion (D1405) of the search request screen (D1401). When the number of cases that can be displayed is smaller than a predetermined number (hereinafter referred to as “maximum number of cases to be displayed”) (S1655: YES), a confirmation result as to whether or not to continue the search process is received from the search request unit 31 again. (S1652). On the other hand, when additional display is not possible in the search result part (D1405) (S1655: NO), the search part 13 ends the process.

検索要求部３１は、検索部１３から検索結果を受信すると、検索結果部（Ｄ１４０５）に検索結果を表示する（Ｓ１６０５）。なお、前述したように、検索結果部（Ｄ１４０５）に追加表示が可能な限り、メタデータ合致レコードを対象とした全文検索が繰り返し実行される。従って、検索結果は検索結果部（Ｄ１４０５）に追加表示する。例えば、１回目の全文検索の結果が３件で、２回目の全文検索の結果が４件の場合、検索結果部（Ｄ１４０５）には７件の検索結果が表示される。 When receiving the search result from the search unit 13, the search request unit 31 displays the search result in the search result unit (D1405) (S1605). As described above, as long as additional display is possible in the search result part (D1405), full-text search for metadata matching records is repeatedly executed. Therefore, the search result is additionally displayed in the search result part (D1405). For example, when the result of the first full-text search is 3 and the result of the second full-text search is 4, the search result portion (D1405) displays 7 search results.

検索要求部３１は、次に、全文検索未実行のメタデータ合致レコードが残っており（前述したように、検索部１３から、全てのメタデータ合致レコードについて全文検索を実行したか、それとも全文検索未実行のメタデータ合致レコードが残っているかの識別が送信される）、かつ検索結果部（Ｄ１４０５）に追加表示が可能かどうか判定する（Ｓ１６０６）。検索要求部３１は、全文検索未実行のメタデータ合致レコードが残っており、かつ検索結果部（Ｄ１４０５）に追加表示が可能な場合（Ｓ１６０６：ＹＥＳ）には、処理続行有無を確認するメッセージを表示装置３３に表示し（Ｓ１６０７）、再び、クライアント３の操作者に検索処理を続行するか否かの応答を要求する（Ｓ１６０２）。 Next, the search request unit 31 still has a metadata matching record that has not been executed yet. (As described above, the search unit 13 has executed a full text search for all metadata matching records, or a full text search. It is determined whether or not additional metadata can be additionally displayed in the search result part (D1405). If there is a metadata matching record that has not been executed in full text search and additional display is possible in the search result part (D1405) (S1606: YES), the search request part 31 sends a message for confirming whether or not to continue processing. The information is displayed on the display device 33 (S1607), and the operator of the client 3 is requested again for a response as to whether or not to continue the search process (S1602).

一方、検索要求部３１は、全てのメタデータ合致レコードについて全文検索を実行したか、又は検索結果部（Ｄ１４０５）に追加表示ができない場合（Ｓ１６０６：ＮＯ）には、処理を終了する。従って、検索要求部３１は、検索結果部（Ｄ１４０５）にそれまでの検索結果を表示して、再び、検索ボタン等が押下されるのを待つ（図１０のＳ１００３、Ｓ１００４）。 On the other hand, if the search request unit 31 has performed full text search for all metadata matching records or cannot additionally display the search result unit (D1405) (S1606: NO), the process ends. Accordingly, the search request unit 31 displays the search results so far in the search result unit (D1405) and waits for the search button or the like to be pressed again (S1003 and S1004 in FIG. 10).

以上のように、全文検索対象数が最大検索件数より大きい場合に、クライアント３の操作者に検索処理を続行するか否かを問い合わせ、「検索続行」が指示された場合に全文検索する。従って、クライアント３の操作者は、検索時間が長くなりそうな場合には、いったん検索処理を中止し、例えば、メタデータの検索条件をさらに絞り込んで検索を行うことができる。
また、全文検索は最大検索件数単位で繰り返し行われ、全文検索を行うたびに、検索結果が追加表示されるので、クライアント３の操作者は、比較的短時間のうちに、検索結果を次々と確認することができる。 As described above, when the full-text search target number is larger than the maximum search number, the operator of the client 3 is inquired as to whether or not to continue the search process, and when “continue search” is instructed, the full-text search is performed. Therefore, if the search time is likely to be long, the operator of the client 3 can stop the search process once and perform a search by further narrowing down the metadata search conditions, for example.
Further, the full-text search is repeatedly performed in units of the maximum number of searches, and each time a full-text search is performed, the search results are additionally displayed. Therefore, the operator of the client 3 can retrieve the search results one after another in a relatively short time. Can be confirmed.

＜関連検索についての補足説明＞
図１７は、検索要求画面（Ｄ１４０１）に含まれる検索結果部（Ｄ１４０５）の表示内容を例示する。
図１７においては、ファイル名とファイルパスが表示されているが、この他に、タイトル、文書記載日等、インデックス２レコード２２０等のデータ項目を表示してもよいし、例えば、インデックス２レコード２２０に検索対象ファイル４３の内容の一部を設定しておき、これを表示してもよい。
また、図１７において点線で囲った関連検索指示部（Ｄ１７０１）には、関連定義レコード２４０の表示名２４２に設定された名称が表示される。図１７の例においては、表示名２４２にそれぞれ“タイトル”と“記載日付”が設定された２つの関連定義レコード２４０が存在しており、この表示名２４２が表示されている。 <Supplementary explanation about related search>
FIG. 17 illustrates the display contents of the search result part (D1405) included in the search request screen (D1401).
In FIG. 17, the file name and file path are displayed. In addition to this, data items such as the index 2 record 220, such as the title and the document description date, may be displayed, for example, the index 2 record 220. Alternatively, a part of the contents of the search target file 43 may be set and displayed.
In addition, the name set in the display name 242 of the related definition record 240 is displayed in the related search instruction section (D1701) surrounded by a dotted line in FIG. In the example of FIG. 17, there are two related definition records 240 in which “title” and “description date” are set in the display name 242, respectively, and this display name 242 is displayed.

この状態で、クライアント３の操作者が、例えば“タイトル”と表示された部分をクリックすると、検索要求部３１は、検索部１３に、“タイトル”に係わる関連検索を行うように要求する。具体的には、表示装置３３には表示されていないが、検索結果とともに検索部１３から受信した、当該検索結果ファイルに係わるファイルＩＤ２２１ａ及び“タイトル”に係わる関連定義ＩＤ２４１を、関連検索要求とともに検索部１３に送信する。
検索部１３は関連検索要求を受信すると、受信した関連定義ＩＤ２４１が設定された関連定義レコード２４０の条件２４３を参照し、条件２４３に設定されている条件に従って、インデックス２レコード２２０を検索し、検索結果を検索要求部３１に送信する。 In this state, when the operator of the client 3 clicks, for example, a part displayed as “title”, the search request unit 31 requests the search unit 13 to perform a related search related to “title”. Specifically, although not displayed on the display device 33, the file ID 221a related to the search result file and the related definition ID 241 related to the “title” received from the search unit 13 together with the search result are searched together with the related search request. To the unit 13.
Upon receiving the related search request, the search unit 13 refers to the condition 243 of the related definition record 240 in which the received related definition ID 241 is set, searches the index 2 record 220 according to the condition set in the condition 243, and searches The result is transmitted to the search request unit 31.

例えば、関連定義ＩＤ２４１、表示名２４２及び条件２４３に、それぞれ“1”、“タイトル”、“タイトル２２２ａが当該検索結果と等しい”が設定された関連定義レコード２４０が存在し、タイトル２２２ａに”作業報告書“が含まれているインデックス２レコード２２０が４レコード存在し、それぞれのファイル名２２１ｂが”作業報告１.doc“、”作業報告２.doc“、”作業報告３.doc“及び”報告書.doc“であり、表示装置３３に検索結果として”作業報告１.doc“が表示されているとする。この場合、関連検索指示部（Ｄ１７０１）には、”タイトル“の文字が表示される。クライアント３の操作者が、”タイトル“の文字をクリックすると、検索要求部３１は、検索部１３に、”作業報告１.doc“のファイルＩＤ及び関連定義ＩＤ（“1”）を送信する。
すると、検索部１３は、関連定義ＩＤ２４１が“1”である関連定義レコード２４０の条件２４３を参照し、“タイトル２２２ａが当該検索結果と等しい”が設定されているので、受信したファイルＩＤによって、”作業報告１.doc“のタイトル２２２ａを取得し、”作業報告１.doc“と同じく、タイトル２２２ａに”作業報告書“が含まれている”作業報告２.doc“、”作業報告３.doc“及び”報告書.doc“の３つのファイルを検索し、検索結果（例えば、ファイル名とファイルパス）を検索要求部３１に送信する。そして、検索要求部３１は、図１８に示すように、検索結果部（Ｄ１４０５）に関連検索結果を表示する。
なお、関連検索結果の表示件数には特に制限を設けないようにしてもよいし、例えば、最大５件だけ表示し、５件を超える場合は、別画面に表示するようにしてもよい。 For example, there is a related definition record 240 in which “1”, “title”, and “title 222a are equal to the search result” are set in the related definition ID 241, display name 242, and condition 243, respectively. There are four index 2 records 220 including the report “, and each file name 221b is“ work report 1.doc ”,“ work report 2.doc ”,“ work report 3.doc ”and“ report ”. Suppose that “work report 1.doc” is displayed on the display device 33 as a search result. In this case, the text “title” is displayed in the related search instruction section (D1701). When the operator of the client 3 clicks on the “title” character, the search request unit 31 transmits the file ID and the related definition ID (“1”) of “work report 1.doc” to the search unit 13.
Then, the search unit 13 refers to the condition 243 of the related definition record 240 whose related definition ID 241 is “1”, and “Title 222a is equal to the search result” is set. The title 222a of the “work report 1.doc” is acquired, and the work report 2.doc “,” “work report 3. Three files “doc” and “report.doc” are searched, and a search result (for example, a file name and a file path) is transmitted to the search request unit 31. Then, the search request unit 31 displays the related search result in the search result unit (D1405) as shown in FIG.
Note that the number of related search results displayed may not be particularly limited. For example, only a maximum of 5 items may be displayed, and when the number of related search results exceeds 5, the number may be displayed on a separate screen.

＜ファイル検索システムのその他の実施形態＞
本発明に係わるファイル検索システムは、第１の実施形態に限られず、さまざまな実施形態とすることができる。その１例を以下に説明する。 <Other Embodiments of File Search System>
The file search system according to the present invention is not limited to the first embodiment, and can be various embodiments. One example will be described below.

図１９は、本発明の第２の実施形態に係るファイル検索システムの構成の一例を示す。
第２の実施形態においては、第１の実施形態と異なり、ファイル検索サーバ１は、インデックス１作成部１１及びインデックス１検索部１４を備えていない。
そのかわり、ファイル検索サーバ５が、通信回線９によって、クライアント３、ファイルサーバ４、ＷＥＢサーバ７及びファイル検索サーバ１と通信可能に接続されている。ファイル検索サーバ５は、コンピュータであり、図示しないＣＰＵおよび主記憶装置、記憶装置６を備えている。ファイル検索サーバ５の主記憶装置には、所定のプログラムが記憶されている。ＣＰＵがその命令コードを実行することにより、インデックス１作成部１１、インデックス１検索部１４の各機能が実現される。また、インデックス１ファイル２１はファイル検索サーバ１の記憶装置２には記憶されず、ファイル検索サーバ５の記憶装置６に記憶される。
第２の実施形態と第１の実施形態の構成上の大きな相違点は以上のとおりである。 FIG. 19 shows an example of the configuration of a file search system according to the second embodiment of the present invention.
In the second embodiment, unlike the first embodiment, the file search server 1 does not include the index 1 creation unit 11 and the index 1 search unit 14.
Instead, the file search server 5 is communicably connected to the client 3, the file server 4, the WEB server 7, and the file search server 1 through the communication line 9. The file search server 5 is a computer and includes a CPU, a main storage device, and a storage device 6 (not shown). A predetermined program is stored in the main storage device of the file search server 5. When the CPU executes the instruction code, the functions of the index 1 creation unit 11 and the index 1 search unit 14 are realized. The index 1 file 21 is not stored in the storage device 2 of the file search server 1 but is stored in the storage device 6 of the file search server 5.
The major differences in configuration between the second embodiment and the first embodiment are as described above.

第２の実施形態のファイル検索システムにおいては、クライアント３、ファイルサーバ４、ＷＥＢサーバ７、ファイル検索サーバ１及びファイル検索サーバ５が、通信回線９によって相互に通信可能に接続されている。このような構成により、例えば、ある組織がデータセンタに設置されたファイルサーバ４にファイルを記憶させているとき、ファイルサーバ４に記憶された検索対象ファイル４３について、インデックス２ファイル２２、仮想分類定義ファイル２３、関連定義ファイル２４、キーワード辞書ファイル２５及びキーワード関連定義ファイル２６を作成することで、メタデータ検索、仮想分類表示、物理フォルダ表示及び関連検索を可能にしつつ、当該組織が管理していないＷＥＢサーバ７の記憶装置７１に記憶された検索対象ファイル７２については、ファイル検索サーバ５のインデックス１作成部１１が、いわゆるＷＥＢクローリングにより、例えばＡｐａｃｈｅ（Ａｐａｃｈｅは登録商標）等のＷＥＢサーバプログラムを介して、インデックス１ファイル２１を作成して、全文検索を可能にすることができる。 In the file search system of the second embodiment, a client 3, a file server 4, a WEB server 7, a file search server 1, and a file search server 5 are connected to each other via a communication line 9 so that they can communicate with each other. With such a configuration, for example, when a certain organization stores a file in the file server 4 installed in the data center, the index 2 file 22, the virtual classification definition for the search target file 43 stored in the file server 4 is stored. By creating the file 23, the related definition file 24, the keyword dictionary file 25, and the keyword related definition file 26, metadata search, virtual classification display, physical folder display, and related search are possible, but the organization does not manage them. For the search target file 72 stored in the storage device 71 of the WEB server 7, the index 1 creation unit 11 of the file search server 5 performs, for example, a so-called WEB crawling via a WEB server program such as Apache (Apache is a registered trademark). Index Create a scan 1 file 21, it is possible to enable a full-text search.

また、第２の実施形態においては、ファイル検索サーバ５は、ファイル検索を行おうとする組織が設置する必要はなく、世の中に存在しているファイル検索サーバの機能を使用して全文検索等を行うことができるので、柔軟性・拡張性の高い検索システムを構築することができる。 In the second embodiment, the file search server 5 does not need to be set up by an organization that performs file search, and performs a full-text search or the like by using a file search server function that exists in the world. Therefore, it is possible to construct a search system with high flexibility and expandability.

以上説明したように、本発明によれば、全文検索を可能にすべきファイルと全文検索が不要なファイルとが存在しているファイル検索環境において、効率的にファイルを検索することができる。
例えば、組織内において作成された文書等のファイルについては、インデックス２ファイル２２を作成しておき、メタデータ検索を行えばよい。メタデータ検索キーワードを用いてインデックス２ファイル２２を検索することで検索結果として表示されるファイルの数を抑えることができる。たとえ、ファイルを間違ったフォルダに記憶したような場合でも、ファイルの使用目的、タイトル等が記憶された小容量のインデックス２ファイル２２があれば、当該ファイルを検索できる場合が多い。
一方、例えば、特許文献、技術論文等、組織外部から入手したファイルや、ＷＥＢサーバなど、組織外部のサーバに存在しているファイルについては、入手した時点では意図しなかった用途で参照することも多く、キーワードによる全文検索が必要な場合も多い。このようなファイルについてはインデックス１ファイル２１を作成しておき、全文検索を行えばよい。 As described above, according to the present invention, it is possible to efficiently search for a file in a file search environment in which there are files that should be capable of full text search and files that do not require full text search.
For example, for a file such as a document created in an organization, an index 2 file 22 may be created and a metadata search may be performed. By searching the index 2 file 22 using the metadata search keyword, the number of files displayed as a search result can be suppressed. Even if the file is stored in the wrong folder, the file can often be searched if there is a small-capacity index 2 file 22 storing the purpose of use of the file, the title, and the like.
On the other hand, files obtained from outside the organization, such as patent documents and technical papers, and files existing on a server outside the organization, such as a WEB server, may be referred to for purposes that were not intended at the time of acquisition. In many cases, full text search using keywords is required. For such a file, an index 1 file 21 may be created and a full text search may be performed.

１…ファイル検索サーバ、１１…インデックス１作成部、１２…インデックス２作成部、１３…検索部、１４…インデックス１検索部、２…記憶装置、２１…インデックス１ファイル、２２…インデックス２ファイル、２３…仮想分類定義ファイル、２４…関連定義ファイル、２５…キーワード辞書ファイル、２６…キーワード関連定義ファイル、３…クライアント、３１…検索要求部、３２…入力装置、３３…表示装置、４…ファイルサーバ、４１…ファイル管理部、４２…記憶装置、４３…検索対象ファイル、５…ファイル検索サーバ、６…記憶装置、７…ＷＥＢサーバ、７１…記憶装置、７２…検索対象ファイル、９…通信回線 DESCRIPTION OF SYMBOLS 1 ... File search server, 11 ... Index 1 creation part, 12 ... Index 2 creation part, 13 ... Search part, 14 ... Index 1 search part, 2 ... Storage device, 21 ... Index 1 file, 22 ... Index 2 file, 23 ... virtual classification definition file, 24 ... related definition file, 25 ... keyword dictionary file, 26 ... keyword related definition file, 3 ... client, 31 ... search request section, 32 ... input device, 33 ... display device, 4 ... file server, DESCRIPTION OF SYMBOLS 41 ... File management part, 42 ... Storage device, 43 ... Search object file, 5 ... File search server, 6 ... Storage device, 7 ... WEB server, 71 ... Storage device, 72 ... Search object file, 9 ... Communication line

Claims

A file acquisition means for acquiring a first search target file group consisting of a plurality of search target files and a second search target file group consisting of a plurality of search target files from the storage device;
For each search target file included in the first search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored Index 1 file storage for creating an index 1 record having a specified file path and a plurality of keywords extracted from the contents of each search target file at a predetermined timing, and storing the index 1 file including each index 1 record Means,
For each search target file included in the second search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored are shown. Index 2 file storage for creating an index 2 record having a file path and one or more metadata representing the contents of each search target file at a predetermined timing and storing an index 2 file including each index 2 record Means,
Search condition receiving means for receiving a search condition including one or both of a condition relating to metadata and a condition of full-text search;
When the search condition received by the search condition receiving means includes a condition related to metadata, each index 2 record included in the index 2 file is searched based on the condition related to the metadata, and the condition related to the metadata is set. A metadata search means for acquiring index 2 records satisfying;
When the search condition received by the search condition receiving means includes a metadata-related condition and a full-text search condition, it is acquired by the metadata search means out of each index 1 record included in the index 1 file . Each index 1 record including the same file path as the file path included in the index 2 record is searched based on the full-text search condition, and the search condition received by the search condition receiving means does not include a condition related to metadata. When the full text search condition is included, a keyword search that searches all index 1 records included in the index 1 file based on the full text search condition and obtains an index 1 record that satisfies the full text search condition. Means,
A file search apparatus comprising:

A virtual classification display name indicating the name of the virtual classification, an index 2 record presence / absence information indicating whether any index 2 record included in the index 2 file may be classified into the virtual classification, and the index 2 file In order to identify the classification condition for classifying the search target file corresponding to each index 2 record included in the virtual classification into the virtual classification, the upper hierarchy presence / absence information indicating the presence / absence of the upper hierarchy virtual classification, and the upper hierarchy virtual classification Virtual classification definition file storage means for storing a virtual classification definition file including a plurality of virtual classification definition records having higher layer identification information;
Based on the index 2 record presence / absence information, the classification condition, the upper hierarchy presence / absence information, and the upper hierarchy identification information of each virtual classification definition record included in the virtual classification definition file stored by the virtual classification definition file storage unit, Virtual classification definition means for creating virtual classification definition information indicating a tree-like hierarchical structure constituted by a classification and a search target file corresponding to each index 2 record;
Outputs the virtual classification definition information created by the virtual classification definition means, the virtual classification display name of each virtual classification constituting the tree-like hierarchical structure indicated by the virtual classification definition information, and the file name of each search target file Virtual classification output means,
The file search apparatus according to claim 1, further comprising:

Keyword dictionary file storage means for storing a keyword dictionary file including a plurality of keyword dictionary records having a keyword and keyword identification information for identifying the keyword;
A plurality of keyword related definition records having keyword identification information of a keyword dictionary record included in the keyword dictionary file and keyword identification information of a keyword dictionary record included in one or more of the keyword dictionary files related to the keyword identification information. Keyword-related definition file storage means for storing a keyword-related definition file including:
When a search is performed by the metadata search means, the keywords included in the conditions relating to the metadata of the search, the keywords and keyword identification information of each keyword dictionary record included in the keyword dictionary file, and the keyword related definition file Virtual classification dynamic generation means for generating the virtual classification definition record based on the keyword identification information of each keyword-related definition record included and one or more keyword identification information related to the keyword identification information;
The file search apparatus according to claim 2 , further comprising:

The index 2 record includes file identification information for identifying a search target file corresponding to the index 2 record;
One or more relation definitions having relation definition record identification information for identifying the relation definition record, a display name indicating the name of the relation definition, and a search condition acquisition condition for acquiring the search condition for searching the index 2 record An associated definition file storage means for storing an associated definition file including a record;
Outputting said metadata search means acquires index 2 record by file name and file path, and a display name that is part of each relevant definition record associated definition files stored in the associated definition file storage unit Related definition information output means,
Related definition record identification information corresponding to the display name selected from each display name output by the related definition information output means, and the file name output by the related definition information output means together with the selected display name Receiving means for receiving the file identification information of the index 2 record corresponding to the file path ;
Search condition acquisition condition for related definition record stored in the related definition file storage means including the index 2 record containing the file identification information accepted by the accepting means and the related definition record identification information accepted by the accepting means a search condition acquisition unit that acquires a search condition based on bets,
Each index 2 record included in the index 2 file is searched, an index 2 record satisfying the search condition acquired by the search condition acquisition unit is acquired, and the file name and file path included in the acquired index 2 record Related search means for outputting
The file search apparatus according to any one of claims 1 to 3, further comprising:

Computer
A file acquisition means for acquiring a first search target file group consisting of a plurality of search target files and a second search target file group consisting of a plurality of search target files from the storage device;
For each search target file included in the first search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored Index 1 file storage for creating an index 1 record having a specified file path and a plurality of keywords extracted from the contents of each search target file at a predetermined timing, and storing the index 1 file including each index 1 record Means,
For each search target file included in the second search target file group acquired by the file acquisition means, the file name that is the name of each search target file and the storage location in which each search target file is stored are shown. Index 2 file storage for creating an index 2 record having a file path and one or more metadata representing the contents of each search target file at a predetermined timing and storing an index 2 file including each index 2 record Means,
Search condition receiving means for receiving a search condition including one or both of a condition relating to metadata and a condition of full-text search;
When the search condition received by the search condition receiving means includes a condition related to metadata, each index 2 record included in the index 2 file is searched based on the condition related to the metadata, and the condition related to the metadata is set. A metadata search means for acquiring index 2 records satisfying;
When the search condition received by the search condition receiving means includes a metadata-related condition and a full-text search condition, it is acquired by the metadata search means out of each index 1 record included in the index 1 file . Each index 1 record including the same file path as the file path included in the index 2 record is searched based on the full-text search condition, and the search condition received by the search condition receiving means does not include a condition related to metadata. When the full text search condition is included, a keyword search that searches all index 1 records included in the index 1 file based on the full text search condition and obtains an index 1 record that satisfies the full text search condition. Means,
File search program to make it function.