JPH1166092A

JPH1166092A - Device and method for managing and retrieving document

Info

Publication number: JPH1166092A
Application number: JP9223990A
Authority: JP
Inventors: Satoru Yashiro; 哲八代
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1997-08-20
Filing date: 1997-08-20
Publication date: 1999-03-09

Abstract

PROBLEM TO BE SOLVED: To provide a device and method for managing and retrieving document at low cost appropriate for various environments of different retrieval frequency and document update frequency. SOLUTION: Concerning this method, one of plural index preparing parts (S7-S10 and S11-S12) for preparing plural indexes in different forms is selected based on a predetermined value (selection process) (S6). Further, the index is prepared while using the said index preparing part selected in the said selection process (index preparation process) (S70-S10 or S11-S12). Then, while using all the generated indexes in different forms, a document is retrieved (retrieval process) (S15-S19).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】文書を検索するための索引を
作成して文書を管理する文書管理検索装置および方法に
関する。[0001] 1. Field of the Invention [0002] The present invention relates to a document management search apparatus and method for creating an index for searching a document and managing the document.

【０００２】[0002]

【従来の技術】従来の文書管理検索装置において、文書
の全文を対象に検索を実行して所望の文書を検索する検
索方法があるが、大量の文書が存在する場合にはそれら
の文書から所望の文書を検索するには時間がかかりすぎ
ていた。そこで、文書に含まれる文字列と、その文字列
が含まれる文書とを対応づける索引を作ることで検索効
率を向上していた。また、文書に応じて索引の作成を行
って検索効率を優先させる場合には、索引が作成されて
いる文書については検索時にそれを用い、索引が作成さ
れていない文書については低速な全文検索を行ってい
た。2. Description of the Related Art In a conventional document management and retrieval apparatus, there is a retrieval method for retrieving a desired document by executing a retrieval on the entire text of the document. However, when a large number of documents exist, a desired document is retrieved from those documents. It took too long to search for documents. Therefore, search efficiency has been improved by creating an index that associates a character string included in a document with a document that includes the character string. Also, when creating an index according to the document and giving priority to search efficiency, use the indexed document at the time of search, and use a low-speed full-text search for an unindexed document. I was going.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、索引の
作成には比較的長い処理時間がかかるため、文書検索を
行う頻度に比べて文書の更新頻度が高い環境では索引作
成のための処理時間が必要以上にかかり、システムのパ
フォーマンスが落ちるという問題があった。また、索引
の有無によって検索方法を変えるシステムにおいては、
索引はデータベース管理システムを用いて管理し、索引
を作成していない文書はファイルシステムを用いて管理
するために、システムクラッシュが起きた場合に備えて
両システム間の整合性を確保するための複雑なロジック
を必要とするため、開発コストがかかっていた。However, since the creation of an index takes a relatively long processing time, the processing time for creating an index is required in an environment where the frequency of updating documents is higher than the frequency of performing a document search. As a result, there was a problem that the performance of the system was reduced. In a system that changes the search method depending on the presence or absence of an index,
Indexes are managed using a database management system, and documents that have not been indexed are managed using a file system, which is a complex system that ensures consistency between the two systems in the event of a system crash. It required development logic, which required development costs.

【０００４】本発明は、上記問題に鑑みてなされたもの
で、検索頻度と文書更新頻度の異なる様々な環境に適合
する文書管理検索装置及び方法を低コストで提供するこ
とを目的とする。The present invention has been made in view of the above problems, and has as its object to provide, at low cost, a document management and retrieval apparatus and method adapted to various environments having different retrieval frequencies and document update frequencies.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、本発明の文書管理検索装置は、複数の特性の異なる
索引を作成する複数の索引作成手段と、前記複数の索引
作成手段の内、１つの索引作成手段をあらかじめ決めら
れた値に基づいて選択する選択手段と、前記選択手段に
より選択された前記索引作成手段を用いて索引を作成す
るように制御する制御手段と、前記複数の索引作成手段
で作成された索引を用いて、文書を検索する検索手段と
を備える。In order to achieve the above object, a document management and retrieval apparatus according to the present invention comprises: a plurality of index creation means for creating a plurality of indexes having different characteristics; Selection means for selecting one index creation means based on a predetermined value; control means for controlling creation of an index using the index creation means selected by the selection means; and the plurality of indexes A search unit that searches for a document using the index created by the creation unit.

【０００６】また、複数の特性の異なる索引を作成する
複数の索引作成手段を用いて文書を検索するための索引
を作成し文書を管理する本発明の文書管理検索方法は、
上記複数の索引作成手段の内、１つの索引作成手段をあ
らかじめ決められた値に基づいて選択する選択工程と、
前記選択工程で選択した索引作成手段を用いて索引を作
成する索引作成工程とを有する。A document management search method according to the present invention for creating an index for searching a document by using a plurality of index creating means for creating an index having a plurality of different characteristics and managing the document,
A selection step of selecting one of the plurality of index creation means based on a predetermined value;
An index creation step of creating an index using the index creation means selected in the selection step.

【０００７】好ましくは、上記特性の異なる索引を用い
て、文書を検索する検索工程を更に有する。[0007] Preferably, the method further includes a search step of searching for a document using the indexes having different characteristics.

【０００８】更に、本発明の文書管理検索のプログラム
コードが格納されたコンピュータ可読メモリは、複数の
特性の異なる索引を作成する複数の索引作成プログラム
コード手段と、前記複数の索引作成プログラムコード手
段の内、１つの索引作成プログラムコード手段をあらか
じめ決められた値に基づいて選択する選択プログラムコ
ード手段と、前記選択プログラムコード手段により選択
された前記索引作成プログラムコード手段を用いて索引
を作成するように制御するプログラムコード手段と、前
記複数の索引作成プログラムコード手段で作成された索
引を用いて、文書を検索する検索プログラムコード手段
とを備える。Further, the computer readable memory storing the document management / search program code of the present invention comprises a plurality of index creation program code means for creating a plurality of indexes having different characteristics, and a plurality of index creation program code means. In this case, an index is created by using a selection program code means for selecting one index creation program code means based on a predetermined value, and the index creation program code means selected by the selection program code means. And a search program code unit for searching for a document using the index created by the plurality of index creation program code units.

【０００９】[0009]

【発明の実施の形態】以下、本発明の文書管理検索装置
を添付図面に基づいて詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a document management and retrieval apparatus according to the present invention will be described in detail with reference to the accompanying drawings.

【００１０】図１は、本発明の一実施例に係る文書管理
検索装置の概略構成を表すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a document management and retrieval apparatus according to one embodiment of the present invention.

【００１１】同図において、１は本実施例の文書管理検
索装置全体の制御を司るＣＰＵである。ＣＰＵ１は、バ
ス８を介して、ディスプレイ２、コマンド入力部３、Ｒ
ＯＭ４、ＲＡＭ５、二次記憶装置６、記憶媒体読取装置
７と、相互にデータの送受が可能であるように接続され
ている。In FIG. 1, reference numeral 1 denotes a CPU which controls the entire document management and retrieval apparatus of the present embodiment. The CPU 1 controls the display 2, the command input unit 3, and the R via the bus 8.
The OM 4, the RAM 5, the secondary storage device 6, and the storage medium reading device 7 are connected so that data can be transmitted and received between them.

【００１２】ディスプレイ２は、ＣＲＴディスプレイま
たは液晶ディスプレイなどにより構成され、文書情報や
選択肢などを表示する。コマンド入力部３は、キーボー
ドやマウスなどにより構成され、各種処理の命令、注目
点の移動、選択操作、文字、記号等の入力を行うために
使用される。The display 2 is constituted by a CRT display or a liquid crystal display, and displays document information and options. The command input unit 3 includes a keyboard, a mouse, and the like, and is used for inputting various processing instructions, movement of a point of interest, selection operation, characters, symbols, and the like.

【００１３】ＲＯＭ４は、前記ＣＰＵ１が実行する各種
の処理プログラムなどを記憶する読み出し専用メモリで
あり、その一部の領域である文書登録プログラム部４ａ
には、文書を登録したり、単語抽出をしたり、ディスプ
レイ２に表示出力するためのプログラムやデータが記憶
されている。ＲＡＭ５は、各種演算結果や入力された情
報を一時的に格納する読み出し書き込み可能メモリであ
る。２次記憶部６は、ハードディスクやＭＯ等により構
成され、索引や文書内容などが記憶される。The ROM 4 is a read-only memory for storing various processing programs to be executed by the CPU 1 and the like.
Stores programs and data for registering a document, extracting words, and outputting and displaying on the display 2. The RAM 5 is a readable and writable memory that temporarily stores various calculation results and input information. The secondary storage unit 6 is configured by a hard disk, an MO, or the like, and stores an index, document contents, and the like.

【００１４】記憶媒体読取装置７は、ＣＤ−ＲＯＭドラ
イブやＦＤドライブなどから構成され、プログラムコー
ドを格納した記憶媒体であるＣＤ−ＲＯＭ７１やフロッ
ピディスク７２などからプログラムを読み出すために使
用される。The storage medium reading device 7 is composed of a CD-ROM drive, an FD drive and the like, and is used to read a program from a storage medium storing a program code, such as a CD-ROM 71 or a floppy disk 72.

【００１５】以上のように構成された文書管理検索装置
のＣＰＵ１が実行する制御処理を、図２のフローチャー
トに沿って説明する。A control process executed by the CPU 1 of the document management / search apparatus configured as described above will be described with reference to the flowchart of FIG.

【００１６】図２は、図１のＲＯＭ４の文書登録プログ
ラム部４ａに記憶されたプログラムによる文書管理制御
手順を示すフローチャートである。FIG. 2 is a flowchart showing a document management control procedure by a program stored in the document registration program section 4a of the ROM 4 of FIG.

【００１７】まず、システムの電源を投入するとステッ
プＳ１に進み、必要な初期化処理を行い、ステップＳ２
にてコマンド入力部３からのコマンドの入力を待つ。こ
こでは、環境設定コマンド、文書登録コマンド、及び検
索コマンドのいずれかが入力される。何らかのコマンド
が入力されたらステップＳ３に進み、そのコマンドの評
価を行う。First, when the power of the system is turned on, the process proceeds to step S1, where necessary initialization processing is performed.
Waits for a command input from the command input unit 3. Here, one of an environment setting command, a document registration command, and a search command is input. If any command is input, the process proceeds to step S3, and the command is evaluated.

【００１８】コマンド評価の結果、環境設定コマンドで
あると判断した場合、ステップＳ１３に進み、文書登録
の速度を優先して索引を作成する文書登録優先モード
と、文書検索時の検索速度を優先して索引を作成する検
索優先モードとの割合を示すレートを入力する。文書登
録優先モードでは、高速に文書の索引を作成することが
できるが、この索引を用いて検索を行うと時間がかか
る。また、検索優先モードでは、このモードで作成され
た索引を用いると高速に検索を行うことができるが、索
引の作成に時間がかかるという欠点がある。このため、
レートを変更することにより、文書管理検索装置の性能
やユーザーの好み等に応じてこの２つのモードを用いて
行う処理の割合を変更することができる。なお、文書登
録優先モード及び検索優先モードによりそれぞれ作成さ
れた索引については、後に詳述する。As a result of the command evaluation, if it is determined that the command is an environment setting command, the flow advances to step S13 to prioritize a document registration priority mode for creating an index by giving priority to the speed of document registration and a search speed at the time of document search. Enter the rate indicating the ratio to the search priority mode for creating an index. In the document registration priority mode, a document index can be created at high speed, but it takes time to perform a search using this index. In addition, in the search priority mode, a high-speed search can be performed by using an index created in this mode, but there is a disadvantage that it takes time to create the index. For this reason,
By changing the rate, the rate of processing performed using these two modes can be changed in accordance with the performance of the document management and search device, user preferences, and the like. The indexes created in the document registration priority mode and the search priority mode will be described later in detail.

【００１９】図６はディスプレイ２に表示されるレート
入力画面の一例である。コマンド入力部３のマウスを用
いてつまみ６１を移動し、入力終了ボックスをクリック
することで、つまみ６１の位置に応じて０から１００の
間のレート値が設定される。FIG. 6 shows an example of a rate input screen displayed on the display 2. By moving the knob 61 using the mouse of the command input unit 3 and clicking on the input end box, a rate value between 0 and 100 is set according to the position of the knob 61.

【００２０】本実施例においては、つまみ６１を移動し
てレート値を例えば４０に設定した場合、登録する複数
の文書の内、その約４０％が文書登録優先モードで処理
され、残りの文書約６０％が検索優先モードで処理され
て索引が作成される。In this embodiment, when the rate value is set to, for example, 40 by moving the knob 61, about 40% of the plurality of documents to be registered are processed in the document registration priority mode, and the remaining documents are processed. 60% are processed in search priority mode to create an index.

【００２１】入力が終了したら、設定されたレート値を
二次記憶装置６に記憶する。When the input is completed, the set rate value is stored in the secondary storage device 6.

【００２２】また、ステップＳ３においてコマンドを評
価した結果、文書登録コマンドであると判断された場合
にはステップＳ４に進み、登録する文書を文字成分に分
解する。If it is determined in step S3 that the command is a document registration command, the flow advances to step S4 to decompose the document to be registered into character components.

【００２３】図３は、入力した文書内容と、それを文字
成分に分解した例を示す図である。FIG. 3 is a diagram showing an input document content and an example in which it is decomposed into character components.

【００２４】次にステップＳ５に進み、０から１００ま
での間の乱数を発生させる。次のステップＳ６では、設
定したレート値とステップＳ５で発生した乱数を比較
し、乱数が設定したレートより上かどうか判断する。設
定したレートよりも上の場合はステップＳ７に進み、検
索優先モードによる索引の作成処理を開始する。また、
設定したレート値以下の場合はステップＳ１１に進み、
文書登録優先モードによる索引の作成処理を開始する。
このように、乱数とステップＳ１３で設定したレート値
とを比較することにより、設定されたレート値に近い割
合で、登録する文書の処理を文書登録優先モードまたは
検索優先モードにアトランダムに振り分けて行うことが
できる。Then, the process proceeds to a step S5, wherein a random number between 0 and 100 is generated. In the next step S6, the set rate value is compared with the random number generated in step S5, and it is determined whether the random number is higher than the set rate. If the rate is higher than the set rate, the process proceeds to step S7, and the process of creating an index in the search priority mode is started. Also,
When the rate value is equal to or less than the set rate value, the process proceeds to step S11,
The index creation process in the document registration priority mode is started.
In this way, by comparing the random number with the rate value set in step S13, the processing of the document to be registered is randomly assigned to the document registration priority mode or the search priority mode at a rate close to the set rate value. It can be carried out.

【００２５】ステップＳ７では、登録する文書にユニー
クなＩＤを付与し、分解した文字成分とＩＤとを関連づ
けて、新たなレコードデータを作成する。次に、ステッ
プＳ８に進み、分解した文字成分をキーとして既存の索
引からレコードデータを読み込む。次にステップＳ９に
進み、登録する文書のレコードデータと読み込まれたレ
コードデータとをマージする。In step S7, a unique ID is assigned to the document to be registered, and the decomposed character components are associated with the ID to create new record data. Next, the process proceeds to step S8, where record data is read from the existing index using the decomposed character components as keys. Next, the process proceeds to step S9, where the record data of the document to be registered and the read record data are merged.

【００２６】ここで得られるレコードデータの例を図４
に示す。このレコードデータの特長は、登録する文書の
文字成分及びそれに関連付けられた文書ＩＤと、既に存
在する同一キーのレコードデータとをマージすること
で、検索時に処理する必要があるレコード件数を減らす
ことができ、処理コストを減らすことができることであ
る。一方、既に存在するレコードを読み込んだり、マー
ジ処理や出力するデータが多くなるため、索引作成のた
めの処理時間が比較的長くなる。FIG. 4 shows an example of the record data obtained here.
Shown in The feature of this record data is to reduce the number of records that need to be processed at the time of retrieval by merging the character component of the document to be registered and the document ID associated therewith with the existing record data of the same key. And the processing cost can be reduced. On the other hand, since the existing records are read, merged, and output data increases, the processing time for index creation becomes relatively long.

【００２７】そして、ステップＳ９でのマージ処理を終
了するとステップＳ１０に進み、得られたレコードデー
タを二次記憶装置６内のインデックスデータベースに出
力する。When the merging process in step S9 is completed, the flow advances to step S10 to output the obtained record data to the index database in the secondary storage device 6.

【００２８】また、ステップＳ６において、ステップＳ
５で発生した乱数が設定したレート値以下であると判断
した場合、ステップＳ１１に進み、文書登録優先モード
によるインデックスの作成処理を開始する。In step S6, step S
When it is determined that the random number generated in step 5 is equal to or less than the set rate value, the process proceeds to step S11, and the process of creating an index in the document registration priority mode is started.

【００２９】ステップＳ１１で作成されるレコードデー
タの例を図５に示す。このレコードデータの特長は、先
頭が同じ文字のレコードデータを１つにまとめること
で、文書登録時に処理する必要があるレコード件数を減
らすことができ、また、既に存在するレコードデータの
サーチやマージ処理を行わないため、索引作成にかかる
処理時間を短くすることができることである。一方、検
索時には同一キーのレコードデータが多くなったり、先
頭が同じ文字のレコードから必要な選択処理を行う必要
があるため、検索時間が比較的長くなる。FIG. 5 shows an example of the record data created in step S11. The advantage of this record data is that the number of records that need to be processed at the time of document registration can be reduced by combining record data with the same character at the beginning into one, and search and merge processing of existing record data can be performed. Therefore, the processing time required for creating the index can be shortened. On the other hand, at the time of retrieval, the record time of the same key is increased, or necessary selection processing must be performed from records having the same character at the beginning, so that the retrieval time is relatively long.

【００３０】次にステップＳ１２に進み、得られたレコ
ードデータをインデックスデータベースに出力する。Then, the process proceeds to a step S12, wherein the obtained record data is output to the index database.

【００３１】また、ステップＳ３のコマンド評価の結
果、検索コマンドであると判断した場合、ステップＳ１
５に進んで検索条件を入力する。If it is determined in step S3 that the command is a search command as a result of the command evaluation, step S1
Proceed to step 5 to enter search conditions.

【００３２】図７はディスプレイ２に表示される検索条
件の入力画面の一例である。FIG. 7 is an example of a search condition input screen displayed on the display 2.

【００３３】検索条件の入力が終了したら、ステップＳ
１６に進み、ステップＳ４と同様に文字成分に分解す
る。例えば、図７に示すように検索する言葉として「大
田区」が入力されている場合、「大田」および「田区」
に分解する。After inputting the search conditions, step S
Proceeding to 16, the image is decomposed into character components as in step S4. For example, as shown in FIG. 7, when "Ota-ku" is input as a search word, "Ota" and "Ta-ku"
Decompose into

【００３４】次にステップＳ１７に進み、ステップＳ４
で分解した文字成分、「大田」および「田区」をキーと
してステップＳ１０及びステップＳ１２で登録されたイ
ンデックスデータベースをサーチし、該当するレコード
データを読み込む。読み込まれたレコードデータの例を
図８に示す。文字成分「大田」を含む文書のＩＤとして
１２および１４、文字成分「田区」を含む文書のＩＤと
して１、２、３、１４が検索結果として得られている。Next, the process proceeds to step S17, and step S4
The index database registered in step S10 and step S12 is searched using the character components “Daejeon” and “Taku” decomposed in the above as keys, and the corresponding record data is read. FIG. 8 shows an example of the read record data. As search results, 12 and 14 are obtained as IDs of documents containing the character component "Daejeon", and 1, 2, 3, and 14 are obtained as IDs of documents containing the character component "Taku".

【００３５】次にステップＳ１８に進み、すべての文字
成分キーが含まれる文書ＩＤの積集合を求める。図７及
び８に示す例では、｛１２，１４｝∩｛１，２，３，１
４｝＝｛１４｝である。そしてステップＳ１９に進み、
得られた結果を出力する。Then, the process proceeds to a step S18, wherein an intersection of the document IDs including all the character component keys is obtained. In the example shown in FIGS. 7 and 8, {12,14} 1,2,3,1
4} = {14}. Then, the process proceeds to step S19,
Output the obtained result.

【００３６】なお、上記の発明の実施の形態において
は、２種類の索引を作成したが本発明はこれに限られる
ものではなく、３種類以上の索引を任意の割合で作成す
ることも可能である。In the above embodiment, two types of indexes are created, but the present invention is not limited to this, and three or more types of indexes can be created at an arbitrary ratio. is there.

【００３７】また、上記発明の実施の形態においてはイ
ンデックスデータベースは二次記憶装置６内に記憶され
るが、本発明はこれに限られるものではなく、他の記憶
媒体に記憶することも可能である。In the embodiment of the present invention, the index database is stored in the secondary storage device 6. However, the present invention is not limited to this, and the index database can be stored in another storage medium. is there.

【００３８】更に、上記発明の実施の形態においては、
乱数を発生し、この乱数とあらかじめ設定されたレート
値とを比較することにより文書登録優先モードにより索
引を作成するか検索優先モードにより索引を作成するか
を決定したが、レート値が示す望みの割合でいずれかの
モードが選択されるのであれば、どのような決定方法を
用いてもよい。Further, in the embodiment of the present invention,
A random number is generated, and the random number is compared with a preset rate value to determine whether to create an index in the document registration priority mode or to create an index in the search priority mode. Any determination method may be used as long as one of the modes is selected in proportion.

【００３９】また、本発明は、複数の機器から構成され
るシステムに適用しても、一つの機器からなる装置に適
用してもよい。Further, the present invention may be applied to a system constituted by a plurality of devices or to an apparatus constituted by a single device.

【００４０】また、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記憶媒体を、システムあるいは装置に供給し、そ
のシステムあるいは装置のコンピュータ（またはＣＰＵ
やＭＰＵ）が記憶媒体であるＣＤ−ＲＯＭ７１またはフ
ロッピディスク７２などに格納されたプログラムコード
を記憶媒体読取装置７により読出し、直接またはＲＡＭ
５に記憶後に実行することによっても、達成されること
は言うまでもない。Another object of the present invention is to provide a storage medium storing a program code of software for realizing the functions of the above-described embodiments to a system or an apparatus, and to provide a computer (or CPU) of the system or the apparatus.
Or MPU) reads a program code stored in a storage medium such as a CD-ROM 71 or a floppy disk 72 by a storage medium reading device 7 and directly or in a RAM.
Needless to say, this can also be achieved by executing after storing in No. 5.

【００４１】この場合、記憶媒体から読出されたプログ
ラムコード自体が前述した実施形態の機能を実現するこ
とになり、そのプログラムコードを記憶した記憶媒体は
本発明を構成することになる。In this case, the program code itself read from the storage medium realizes the functions of the above-described embodiment, and the storage medium storing the program code constitutes the present invention.

【００４２】プログラムコードを供給するための記憶媒
体としては、例えば、フロッピディスク，ハードディス
ク，光ディスク，光磁気ディスク，ＣＤ−ＲＯＭ，ＣＤ
−Ｒ，磁気テープ，不揮発性のメモリカード，ＲＯＭな
どを用いることができる。As a storage medium for supplying the program code, for example, a floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM, CD
-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【００４３】また、コンピュータが読出したプログラム
コードを実行することにより、前述した実施形態の機能
が実現されるだけでなく、そのプログラムコードの指示
に基づき、コンピュータ上で稼働しているＯＳ（オペレ
ーティングシステム）などが実際の処理の一部または全
部を行い、その処理によって前述した実施形態の機能が
実現される場合も含まれることは言うまでもない。When the computer executes the readout program code, not only the functions of the above-described embodiment are realized, but also the OS (Operating System) running on the computer based on the instruction of the program code. ) May perform some or all of the actual processing, and the processing may realize the functions of the above-described embodiments.

【００４４】さらに、記憶媒体から読出されたプログラ
ムコードが、コンピュータに挿入された機能拡張ボード
やコンピュータに接続された機能拡張ユニットに備わる
メモリに書込まれた後、そのプログラムコードの指示に
基づき、その機能拡張ボードや機能拡張ユニットに備わ
るＣＰＵなどが実際の処理の一部または全部を行い、そ
の処理によって前述した実施形態の機能が実現される場
合も含まれることは言うまでもない。Further, after the program code read from the storage medium is written into a memory provided in a function expansion board inserted into the computer or a function expansion unit connected to the computer, the program code is read based on the instruction of the program code. It goes without saying that the CPU included in the function expansion board or the function expansion unit performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００４５】本発明を上記記憶媒体に適用する場合、そ
の記憶媒体には、先に説明した図２のフローチャートに
対応するプログラムコードを格納することになるが、簡
単に説明すると、図９のメモリマップ例に示す各モジュ
ールを記憶媒体に格納することになる。When the present invention is applied to the storage medium, the storage medium stores program codes corresponding to the flowchart of FIG. 2 described above. Each module shown in the map example is stored in the storage medium.

【００４６】すなわち、少なくとも複数の「索引作成モ
ジュール」、「索引作成モジュール選択モジュール」お
よび「検索モジュール」の各モジュールのプログラムコ
ードを記憶媒体に格納すればよい。That is, the program codes of at least a plurality of "index creation modules", "index creation module selection modules" and "search modules" may be stored in a storage medium.

【００４７】[0047]

【発明の効果】以上説明したように、本発明によれば、
検索頻度と文書更新頻度の異なる様々な環境に適合する
文書管理検索装置および方法を低コストで提供すること
ができる。As described above, according to the present invention,
A document management search apparatus and method suitable for various environments having different search frequencies and document update frequencies can be provided at low cost.

【００４８】[0048]

[Brief description of the drawings]

【図１】本発明の実施の形態に係る文書管理検索装置の
概略構成を表すブロック図である。FIG. 1 is a block diagram illustrating a schematic configuration of a document management and retrieval device according to an embodiment of the present invention.

【図２】図１の文書登録プログラム部４ａに記憶された
プログラムによる文書管理制御手順を示すフローチャー
トである。FIG. 2 is a flowchart showing a document management control procedure by a program stored in a document registration program unit 4a of FIG.

【図３】文字成分が分解された例を示した図である。FIG. 3 is a diagram showing an example in which a character component is decomposed.

【図４】検索優先モードによる処理で得られたレコード
データの例を示した図である。FIG. 4 is a diagram showing an example of record data obtained by processing in a search priority mode.

【図５】文書登録優先モードによる処理で得られたレコ
ードデータの様子を示した図である。FIG. 5 is a diagram showing a state of record data obtained by processing in a document registration priority mode.

【図６】環境設定の入力画面の一例を示した図である。FIG. 6 is a diagram showing an example of an environment setting input screen.

【図７】検索条件の入力画面の一例を示した図である。FIG. 7 is a diagram showing an example of a search condition input screen.

【図８】読み込まれたレコードデータの例を示す図であ
る。FIG. 8 is a diagram showing an example of read record data.

【図９】本発明の文書管理検索に用いられるプログラム
コードのメモリマップ図である。FIG. 9 is a memory map diagram of a program code used for document management search of the present invention.

[Explanation of symbols]

１ＣＰＵ２ディスプレイ３コマンド入力部４ＲＯＭ４ａ文書登録プログラム部５ＲＡＭ６二次記憶装置７記憶媒体読取装置７８バス７１ＣＤ−ＲＯＭ７２フロッピディスク DESCRIPTION OF SYMBOLS 1 CPU 2 Display 3 Command input part 4 ROM 4a Document registration program part 5 RAM 6 Secondary storage device 7 Storage medium reading device 7 8 Bus 71 CD-ROM 72 Floppy disk

Claims

[Claims]

1. A plurality of index creating means for creating a plurality of indexes having different characteristics, and a selecting means for selecting one of the plurality of index creating means based on a predetermined value; Control means for controlling the creation of an index using the index creation means selected by the selection means, and a search means for searching for a document using the indexes created by the plurality of index creation means A document management and retrieval device characterized by the above-mentioned.

2. The index having different characteristics includes a first index and a second index, and the first index can perform a search faster than using the second index. Index is first
2. The document management and retrieval apparatus according to claim 1, wherein an index can be created at a higher speed than the index of (1).

3. The document management and retrieval apparatus according to claim 1, further comprising setting means for setting a value used by said selection means.

4. The document management and retrieval apparatus according to claim 1, wherein the predetermined value is a ratio indicating a ratio of using the plurality of index creation units.

5. A random number generating means for generating a random number, wherein said selecting means selects an index creating means to be used based on the random number generated by said random number generating means and a predetermined value. The document management and retrieval device according to any one of claims 1 to 4, wherein:

6. The document management and retrieval apparatus according to claim 1, further comprising a storage unit that stores the indexes created by the plurality of index creation units.

7. The document management and retrieval apparatus according to claim 1, wherein said selection means selects said index creation means on a document-by-document basis.

8. A document management and retrieval method for creating a document search index using a plurality of index creation means for creating a plurality of indexes having different characteristics and managing the documents, wherein the plurality of index creations are performed. Among the means, a selection step of selecting one index creation means based on a predetermined value; and an index creation step of creating an index using the index creation means selected in the selection step. Document management search method to do.

9. The apparatus according to claim 8, further comprising a retrieval step of retrieving a document using the indexes having different characteristics.
Document management search method described in.

10. The index having the different characteristics includes a first index and a second index, and the first index can perform a search faster than using the second index. 10. The document management search method according to claim 8, wherein an index can be created faster than the first index.

11. The document management search method according to claim 8, further comprising a setting step of setting a value used in said selecting step.

12. The document management search method according to claim 8, wherein the predetermined value is a ratio indicating a ratio of using the plurality of index creation units.

13. A random number generating step of generating a random number, wherein the selecting step selects an index creating means to be used based on the random number generated in the random number generating step and a predetermined value. 8. The method according to claim 8, wherein
2. The document management search method according to any one of 2.

14. The document management and retrieval method according to claim 8, wherein in said selecting step, said index creation means is selected for each document.

15. A computer-readable memory storing a document management search program code, wherein: a plurality of index creation program code means for creating an index having a plurality of different characteristics; A selection program code means for selecting one index creation program code means based on a predetermined value; and controlling to create an index using the index creation program code means selected by the selection program code means. A computer readable memory comprising: a program code unit configured to execute a search; and a search program code unit configured to search for a document using an index created by the plurality of index creation program code units.