JPH0557624B2

Info

Publication number: JPH0557624B2
Application number: JP58127805A
Authority: JP
Inventors: Masamitsu Sato
Original assignee: Tokyo Shibaura Electric Co Ltd
Current assignee: Toshiba Corp
Priority date: 1983-07-15
Filing date: 1983-07-15
Publication date: 1993-08-24
Also published as: JPS6020241A

Description

DETAILED DESCRIPTION OF THE INVENTION

[Technical Field of the Invention]

The present invention relates to a document data retrieval device for document data files built on the relational model.

[Technical background of the invention and its problems]

In information retrieval systems, the relational model is increasingly used to build the data files to be searched, because it makes the data easy to handle.

Taking a document data file as an example, the relational model is expressed as a table like the one in FIG. 1. Here, "classification", "creation date", "keyword", and so on are called attributes, and "budget", "830220", "computer", and so on are called their values (attribute values).

One horizontal row of this table is called a tuple (or record), and the table itself is called a relation. In this example it is referred to as the main relation.

This table shows that the document with document number 1 has the classification "budget", the creation date 830220, the keyword "computer", and so on. (The same applies to the other documents.)

Now consider the attribute "keyword" in the example of FIG. 1. A document generally carries not just one keyword but several.

In the relational model, however, only one attribute value can be set per attribute for a given tuple. This is inadequate for a document data file.

Two methods are conceivable for solving this problem with conventional technology.

a. Define as many attributes as there are keywords to be set. The relation then takes the form shown in FIG. 2.

b. Create a keyword-only relation separate from the main relation.

With method a, anyone using the data file must decide in advance which keyword value goes into which keyword column. This is a heavy burden on the user.

Alternatively, the data could be presented to the user in the form of FIG. 3 while being handled in the form of FIG. 2 inside the data file management device.

In that case, however, a search that retrieves the document numbers of the documents whose keyword is A (written as the expression keyword = A) must run the searches keyword 1 = A, keyword 2 = A, ..., keyword n = A against "keyword 1", "keyword 2", ..., "keyword n", and finally take the OR of all these results.

It is usual to create an index table for each attribute, keep these tables on a magnetic disk, and access them there. In this case, therefore, n disk accesses occur.

Thus, this method has the following drawbacks:

A. n index tables must be created.

B. At least n disk accesses occur, so the search takes time.
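The cost of method a can be illustrated with a minimal sketch. It assumes each keyword column has its own index, modelled here as a dictionary from value to document numbers; the sample data and the name `search_method_a` are hypothetical and do not come from the patent.

```python
# Hypothetical per-column indexes: value -> document numbers.
keyword_indexes = [
    {"computer": [1], "robot": [2]},   # index table for "keyword 1"
    {"A": [1], "sensor": [3]},         # index table for "keyword 2"
    {"A": [2]},                        # index table for "keyword 3"
]


def search_method_a(indexes, value):
    """Query every per-column index for `value` and OR the results."""
    hits = set()
    lookups = 0
    for index in indexes:              # one index table access per column
        lookups += 1
        hits.update(index.get(value, []))
    return sorted(hits), lookups


print(search_method_a(keyword_indexes, "A"))  # ([1, 2], 3)
```

The number of lookups equals the number of keyword columns, which is the n disk accesses named in drawback B.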

With method b, a second relation comes into existence alongside the main relation. Two relations must then be handled for a single item of data (a document), which is inconvenient.

[Purpose of the invention]

The present invention provides a document data retrieval device that eliminates the drawbacks of the prior art and allows a plurality of values (attribute values) to be set for a single attribute in a data file that uses the relational model.

[Summary of the Invention]

The present invention is a document data retrieval device that retrieves desired document data from a plurality of document data items, each assigned a document number, by receiving an attribute value of the desired document data as input and outputting the document numbers having that attribute value. The device comprises: a single-value index table that, for the plurality of attributes of the document data corresponding to a document number, stores one attribute value for each such attribute together with the document number; a multi-value index table that stores a plurality of attribute values for an attribute together with the document number; attribute information storage means that stores, for each attribute of the document data, the type (single-valued or multi-valued) and the number of the attribute values that the attribute has; and index means that refers to the type and number of attribute values held in the attribute information storage means and retrieves, from the single-value index table or the multi-value index table, the document numbers having an attribute value that matches the attribute value given for the search. The multi-value index table is configured so that, for one document data item, each of the different attribute values of one attribute is stored in association with the document number of that document data item, and the document numbers having an attribute value that matches the search attribute value are retrieved by referring to this multi-value index table.

[Embodiments of the invention]

FIG. 4 is a block diagram showing the configuration of one embodiment of the present invention.

An information retrieval device generally also needs a device that stores the original data (the documents themselves, for example) and a device for entering search expressions, but everything other than the devices that are the main subject of the present invention is omitted here.

In this embodiment, the number of keywords is four.

The single-value index table 1 is the index table for attributes that have only one value per attribute for a given tuple (called single-valued attributes), such as the attributes other than "keyword" in FIG. 3. Its configuration is shown in FIG. 5. It is organized in order of value and then of document number. This table has the same configuration as in the prior art.

The multi-value index table 2 is the index table for attributes that have a plurality of values per attribute for a given tuple (called multi-valued attributes), such as "keyword" in FIG. 3. Its configuration is shown in FIG. 6. It is ordered by value, and the same document number never appears twice for the same value.
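One way to picture the two index tables is as lists of (value, document number) pairs kept in sorted order. The sketch below assumes that layout. The sample entries and the function name `lookup` are illustrative only, and FIG. 5 and FIG. 6 may hold further fields not modelled here.

```python
import bisect

# Entries are (value, document number), sorted by value and then by number.
single_value_index = [("budget", 1), ("budget", 4), ("plan", 2)]
multi_value_index = [("A", 1), ("A", 3), ("computer", 1), ("robot", 3)]


def lookup(index, value):
    """Return the document numbers stored for `value`, in index order."""
    # (value,) sorts before every (value, number) pair, so this finds
    # the first entry for the value if one exists.
    pos = bisect.bisect_left(index, (value,))
    docs = []
    while pos < len(index) and index[pos][0] == value:
        docs.append(index[pos][1])
        pos += 1
    return docs


print(lookup(single_value_index, "budget"))  # [1, 4]
print(lookup(multi_value_index, "A"))        # [1, 3]
```

Both tables share the same layout in this sketch. The difference is that a document number may appear under several values in the multi-value table, while it appears under only one value per attribute in the single-value table.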

The index table R/W circuit 3 receives a search request signal, an attribute, and an attribute value from the control unit 9, retrieves the matching document numbers from the single-value index table 1 or the multi-value index table 2, and writes them into the result memory 6. It also receives a single-value or multi-value index table write request signal from the control unit 9 together with an attribute, an attribute value, and a document number, and writes them into the single-value index table 1 or the multi-value index table 2.

The attribute information memory R/W circuit 4 receives attribute information memory read request and write request signals from the control unit 9, and either reads the information about the specified attribute from the attribute information memory 5 or writes information into the attribute information memory 5.

The attribute information memory 5 stores information about the attributes. Its configuration is shown in FIG. 7.

In FIG. 7, the type indicates whether the attribute is single-valued or multi-valued and is set to 0 or 1, respectively. The value size is the maximum length of the value in bytes. The number of values is the number of values that can be assigned to one attribute of one tuple. For a single-valued attribute it is always 1.
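The three fields of the attribute information memory can be sketched as a small record type. The class and field names below are hypothetical. The figures 50, 128, and 4 are the ones used in the embodiment's registration walkthrough, and treating the size of a multi-valued attribute as the total for all its values follows that walkthrough.

```python
from dataclasses import dataclass

SINGLE, MULTI = 0, 1   # type codes as described for FIG. 7


@dataclass(frozen=True)
class AttributeInfo:
    kind: int     # type: 0 = single-valued, 1 = multi-valued
    size: int     # size in bytes (total for a multi-valued attribute)
    count: int    # number of values allowed per tuple

    def __post_init__(self):
        if self.kind == SINGLE and self.count != 1:
            raise ValueError("a single-valued attribute holds exactly one value")

    def size_per_value(self):
        """Bytes available to one value of this attribute."""
        return self.size // self.count


attribute_info = {
    "classification": AttributeInfo(SINGLE, 50, 1),
    "keyword": AttributeInfo(MULTI, 128, 4),
}

print(attribute_info["keyword"].size_per_value())  # 32
```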

The result memory 6 accumulates the document numbers that the index table R/W circuit 3 reads from the single-value index table 1 or the multi-value index table 2.

The management data storage unit 7 stores the data used to manage the original data (the documents themselves, for example). It uses the format of FIG. 8, which is a slightly modified version of the format of FIG. 3.

Compared with FIG. 3, FIG. 8 places a number indicating the start of the unused area in front of the keyword values (initial value 1). For document number 0001, for example, it shows that no values have been set from the third area onward. The areas are numbered from 1 to n (4 in this case), where n is the same as the "number of values" held in the attribute information memory of FIG. 7.
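The unused-area number can be sketched as a 1-based pointer stored ahead of n keyword areas. The class `KeywordField` and its method names are hypothetical. Raising an error when all n areas are full is an assumption, since the text does not say what happens in that case.

```python
class KeywordField:
    """Keyword areas of one tuple, preceded by an unused-area number."""

    def __init__(self, n):
        self.next_unused = 1         # initial value 1: every area is unused
        self.areas = [None] * n      # areas are numbered 1 to n

    def add(self, value):
        if self.next_unused > len(self.areas):
            # Assumed behaviour: refuse to write past area n.
            raise OverflowError("no unused keyword area left")
        self.areas[self.next_unused - 1] = value
        self.next_unused += 1


field = KeywordField(4)
field.add("computer")
field.add("A")
print(field.next_unused)  # 3
```

With two keywords stored, the pointer reads 3, which matches the description of document number 0001.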

The management data R/W circuit 8 writes to or reads from the management data storage unit 7 in response to signals from the control unit 9.

The control unit 9 receives search request signals and registration request signals from outside, issues requests to circuits 1 to 8, and receives their results. It then sends result data and response signals to the outside.

The operation of the present invention is described below using the embodiment.

In this embodiment, the relational model is used for the management data storage unit 7.

First, registration of data is described.

A registration request is sent from outside to the control unit 9, together with a data registration request signal, in the following form.

Document number 0001
"classification" is budget
"creation date" is 830220
"keyword" is computer, A
...

The control unit 9 sends "classification" to the attribute information memory R/W circuit 4 together with an attribute information memory read request signal.

On receiving this, the attribute information memory R/W circuit 4 reads the data for "classification" from the attribute information memory 5 and returns it to the control unit 9 together with a read end signal.

The control unit 9 thereby learns that "classification" is a single-valued attribute with a size of 50 bytes. It sends this information, the value "budget", and the document number 0001 to the management data R/W circuit 8 together with a management data write request signal.

On receiving these, the management data R/W circuit 8 adjusts the value to 50 bytes and writes the value "budget" into the "classification" area of document number 0001 in the management data storage unit 7.

It then returns a write end signal to the control unit 9.

The control unit 9 then sends the attribute name ("classification"), the value ("budget"), the document number 0001, and the value size (50) to the index table R/W circuit 3 together with a single-value data write request signal.

On receiving this, the index table R/W circuit 3 writes the information into the single-value index table 1 as shown in FIG. 5 and returns a write end signal to the control unit 9. This processing is the same as in the prior art. Since "creation date" is also a single-valued attribute, it is processed in the same way.

Next, the control unit 9 obtains the information about "keyword" from the attribute information memory 5 through the attribute information memory R/W circuit 4. It learns that "keyword" is a multi-valued attribute, that four values can be set, and that the total size of "keyword" is 128 bytes.

It then calculates the size of a single keyword (32 bytes in this case) and sends this information, the values "computer" and "A", and the document number 0001 to the management data R/W circuit 8 together with a management data write request signal.

The management data R/W circuit 8 writes "computer" and "A" into the "keyword" area of document number 0001 in the management data storage unit 7, starting at the area indicated by the unused-area number and sized to a single keyword value (32 bytes).

It then sets the unused-area number to 3 and returns a write end signal to the control unit 9. The control unit 9 next sends the attribute name ("keyword"), the values ("computer" and "A"), the document number 0001, and the size of one value (32) to the index table R/W circuit 3 together with a multi-value data write request signal.

On receiving this, the index table R/W circuit 3 writes the information into the multi-value index table 2 in value order, as shown in FIG. 6.

At this point:

A. If the same value is already present with the same document number, nothing is actually written.

B. If the same value is already present with a different document number, the entry is written so that the entries for that value remain in document number order.
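Rules A and B can be sketched as an ordered insert into a list of (value, document number) pairs. This is a minimal illustration that assumes the index is held as a sorted Python list; the function name `insert_entry` is hypothetical.

```python
import bisect


def insert_entry(index, value, doc_no):
    """Insert (value, doc_no) in order; return False if already present."""
    entry = (value, doc_no)
    pos = bisect.bisect_left(index, entry)
    if pos < len(index) and index[pos] == entry:
        return False                 # rule A: same value, same document number
    index.insert(pos, entry)         # rule B: document number order is kept
    return True


index = []
insert_entry(index, "A", 2)
insert_entry(index, "A", 1)
print(insert_entry(index, "A", 1))   # False
print(index)                         # [('A', 1), ('A', 2)]
```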

When the writing is finished, it returns a write end signal to the control unit 9.

When the above processing has been completed for all the attributes given in the registration request, the control unit 9 returns a registration end signal to the outside.

This completes registration.

In this way, data is written into the management data storage unit 7, the single-value index table 1, and the multi-value index table 2.
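The registration flow as a whole can be sketched as one function that consults the attribute information, updates the management record, and then updates the matching index table. The sketch assumes in-memory dictionaries and lists in place of the memories and circuits, omits the byte-size handling, and uses hypothetical names throughout (`register`, `new_store`, `ATTRIBUTE_INFO`).

```python
import bisect

SINGLE, MULTI = 0, 1
# (type, number of values) per attribute, standing in for the attribute
# information memory.
ATTRIBUTE_INFO = {
    "classification": (SINGLE, 1),
    "creation date": (SINGLE, 1),
    "keyword": (MULTI, 4),
}


def new_store():
    return {"records": {}, "single": {}, "multi": {}}


def register(store, doc_no, attribute, values):
    kind, max_values = ATTRIBUTE_INFO[attribute]
    # Management data: append the values to the tuple of this document.
    record = store["records"].setdefault(doc_no, {})
    held = record.setdefault(attribute, [])
    if len(held) + len(values) > max_values:
        raise ValueError(f"{attribute!r} accepts at most {max_values} value(s)")
    held.extend(values)
    # Index table: chosen by attribute type, kept sorted, no duplicates.
    table = store["single" if kind == SINGLE else "multi"]
    index = table.setdefault(attribute, [])
    for value in values:
        entry = (value, doc_no)
        pos = bisect.bisect_left(index, entry)
        if pos == len(index) or index[pos] != entry:
            index.insert(pos, entry)


store = new_store()
register(store, 1, "classification", ["budget"])
register(store, 1, "creation date", ["830220"])
register(store, 1, "keyword", ["computer", "A"])
print(store["multi"]["keyword"])  # [('A', 1), ('computer', 1)]
```

Calling `register` again with one more keyword for the same document follows the same path, which mirrors how the text treats adding a keyword as the same processing as registration.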

Next, the case of adding one keyword to a document that already has two keywords is described.

Assume that document number 0002 has the keywords "precision machinery" and "robot", and that "sensor" is to be added. An attribute value addition request signal, the attribute name ("keyword"), and the value ("sensor") are sent from outside to the control unit 9.

The control unit 9 performs the same processing as for registration. "Sensor" is written into the area indicated by the "keyword" unused-area number in the management data storage unit 7, and 4 is written as the new unused-area number.

"Sensor" and the document number 0002 are also written into the multi-value index table 2.

The keyword is thus added.

Next, searching is described.

First, the case of a single-valued attribute is described.

Assume that the search request classification = budget (retrieve the document numbers of the documents whose "classification" is budget and store them in the result memory 6) arrives at the control unit 9 from outside.

The control unit 9 obtains the information about "classification" through the attribute information memory R/W circuit 4 in the same way as for registration. Having learned that "classification" is a single-valued attribute, it sends the attribute name ("classification"), the value ("budget"), and the value size (50) to the index table R/W circuit 3 together with a single-value data search request signal.

The index table R/W circuit 3 retrieves the matching document numbers 0001, ... from the single-value index table 1 and stores them in the result memory 6.

It then returns a search end signal to the control unit 9, and the control unit 9 returns a search end signal to the outside.

Next, the case of a multi-valued attribute is described.

A search request of the form keyword = A arrives at the control unit 9 from outside.

The control unit 9 obtains the information about "keyword" through the attribute information memory R/W circuit 4 in the same way as for registration. Having learned that "keyword" is a multi-valued attribute, it works out the size of a single keyword value (32 bytes in this case). It then sends the attribute name ("keyword"), the value ("A"), and the size of one value (32) to the index table R/W circuit 3 together with a multi-value data search request signal.

The index table R/W circuit 3 retrieves the matching document numbers 0001, ... from the multi-value index table 2 and stores them in the result memory 6.

The rest is the same as for a single-valued attribute.
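The search path for both attribute types reduces to one decision followed by one index access. The sketch below assumes dictionary-based tables and hypothetical names (`attribute_type`, `search`); the sample contents follow the walkthrough.

```python
SINGLE, MULTI = 0, 1

# Stand-ins for the attribute information memory and the two index tables.
attribute_type = {"classification": SINGLE, "keyword": MULTI}
single_value_index = {"classification": {"budget": [1]}}
multi_value_index = {"keyword": {"A": [1], "computer": [1], "robot": [2]}}


def search(attribute, value):
    """Pick the index table from the attribute type, then look up once."""
    if attribute_type[attribute] == SINGLE:
        table = single_value_index
    else:
        table = multi_value_index
    return list(table[attribute].get(value, []))


print(search("classification", "budget"))  # [1]
print(search("keyword", "A"))              # [1]
```

Whichever branch is taken, only one index table is consulted per search expression.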

[Effect of the invention]

As described above, the present invention makes it possible to handle multi-valued attributes in a file based on the relational model.

Suppose that the prior art is used to do the equivalent of what was described in the operation of the present invention.

For the purposes of comparison, the data is assumed to appear to the user in the form of FIG. 3, as it does with the present invention.

It is also assumed that there is one relation and that the number of keywords is four.

Under these conditions, the prior art has to restructure FIG. 3 inside the retrieval device into the form of FIG. 2.

That is, the management data storage unit 7 takes the form of FIG. 2 (with n = 4). The multi-value index table 2 therefore disappears, and the keyword index tables shown in FIG. 9 are incorporated into the table of FIG. 5 (in the directory of FIG. 5, keyword 1, keyword 2, ... follow "author"). The attribute information memory 5 takes the configuration of FIG. 10 instead of FIG. 7. The type field of FIG. 7 disappears, and every attribute becomes a single-valued attribute.

In addition, the control unit 9 must remember that "keyword" consists of keyword 1 through keyword 4.

Now suppose that the search request keyword = A is given to the control unit 9 from outside.

The control unit 9 rewrites this request as

(keyword 1 = A) + (keyword 2 = A) + (keyword 3 = A) + (keyword 4 = A)

where + denotes logical OR.

The search inside each pair of parentheses is then executed as a single-valued attribute search, as described in the operation of the present invention.

Finally, the OR of the results is taken.

Consequently, four search requests are issued to the index table R/W circuit 3, and the index tables must be consulted four times.

Since each table is normally built as a separate file on a magnetic disk, at least four disk input/output operations actually occur. With the present invention the index table is consulted only once, so obtaining the same result with the conventional method takes at least four times the disk access time.

In general, if n values are allowed for one attribute of one tuple, achieving the same effect as the present invention with the conventional method takes at least n times as long.
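The comparison can be written out as a rough estimate. The symbol $t$, the time for one index table access on disk, is introduced here for illustration and does not appear in the patent. The estimate assumes every index access costs about the same and ignores the time needed to combine results.

```latex
T_{\text{conventional}} \ge n\,t, \qquad
T_{\text{invention}} \approx t, \qquad
\frac{T_{\text{conventional}}}{T_{\text{invention}}} \gtrsim n
\quad (n = 4 \text{ in the embodiment}).
```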

As described above, this method handles multi-valued attributes in far less time than the prior art.

Being able to handle multi-valued attributes helps not only with searching but also makes the data easier for users to work with, and brings out more of the flexibility of the relational model.

[Other embodiments of the invention]

The operation of the present invention was described for a search request of the form keyword = A, but the effect of the invention is the same if = is replaced by >, <, ≥, or ≤.
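Because the multi-value index table is ordered by value, inequality searches can be served from the same single table. The sketch below assumes a sorted list of (value, document number) pairs and plain string comparison of values. The function names are hypothetical, and removing duplicate document numbers is an added step, needed because one document may match through several of its values.

```python
import bisect

# Multi-value index: (value, document number), sorted by value then number.
multi_value_index = [("A", 1), ("computer", 1), ("robot", 2), ("sensor", 2)]


def search_at_least(index, value):
    """Document numbers having some value >= `value`."""
    pos = bisect.bisect_left(index, (value,))
    return sorted({doc for _, doc in index[pos:]})


def search_less_than(index, value):
    """Document numbers having some value < `value`."""
    pos = bisect.bisect_left(index, (value,))
    return sorted({doc for _, doc in index[:pos]})


print(search_at_least(multi_value_index, "robot"))    # [2]
print(search_less_than(multi_value_index, "robot"))   # [1]
```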

In the description of the operation, the index table R/W circuit 3 is shared by the single-value index table 1 and the multi-value index table 2, but it may be divided into a single-value index table R/W circuit and a multi-value index table R/W circuit, one for each table.

In that case, the control unit 9 issues read and write request signals to the single-value index table R/W circuit if the attribute is single-valued, and to the multi-value index table R/W circuit if it is multi-valued.

Naturally, any such modification is included in the present invention as long as it does not depart from the gist of the invention.

[Brief explanation of drawings]

FIG. 1 shows a relation. FIG. 2 shows the prior-art relation when there are n keywords. FIG. 3 shows the relation when there are n keywords. FIG. 4 is a block diagram showing the configuration of one embodiment of the present invention. FIG. 5 shows the configuration of the single-value index table 1. FIG. 6 shows the configuration of the multi-value index table 2. FIG. 7 shows the configuration of the attribute information memory 5. FIG. 8 shows the configuration of the management data storage unit 7. FIG. 9 shows the configuration of the index table for keywords when the prior art is used. FIG. 10 shows the configuration of the attribute information memory 5 for keywords when the prior art is used.

1: single-value index table; 2: multi-value index table; 3: index table R/W circuit; 4: attribute information memory R/W circuit; 5: attribute information memory; 6: result memory; 7: management data storage unit; 8: management data R/W circuit; 9: control unit.

Claims

[Scope of Claims] 1. A document data retrieval device that retrieves desired document data from a plurality of document data items, each assigned a document number, by receiving an attribute value of the desired document data as input and outputting the document numbers having that attribute value, the device comprising: a single-value index table that, for the plurality of attributes of the document data corresponding to a document number, stores one attribute value for each such attribute together with the document number; a multi-value index table that stores a plurality of attribute values for an attribute together with the document number; attribute information storage means that stores, for each attribute of the document data, the type (single-valued or multi-valued) and the number of the attribute values that the attribute has; and index means that refers to the type and number of attribute values held in the attribute information storage means and retrieves, from the single-value index table or the multi-value index table, the document numbers having an attribute value that matches the attribute value given for the search; wherein the multi-value index table is configured so that, for one document data item, each of the different attribute values of one attribute is stored in association with the document number of that document data item, and the document numbers having an attribute value that matches the search attribute value are retrieved by referring to this multi-value index table.